AI Ops – Senior Architect

Phoenix, AZ, US • Posted 3 hours ago • Updated 3 hours ago
Contract W2
6 Months
No Travel Required
On-site
$70 - $75/hr
Company Branding Image
Fitment

Dice Job Match Score™

🛠️ Calibrating flux capacitors...

Job Details

Skills

  • Google Cloud Platform
  • Data Science
  • Databricks
  • DevOps
  • Docker
  • Continuous Integration
  • Cyber Security
  • Data Engineering
  • Dynatrace
  • Cloud Computing
  • Collaboration
  • Continuous Delivery
  • Evaluation
  • Auditing
  • Budget
  • Capacity Management
  • Cloud Architecture
  • Amazon Web Services
  • AppDynamics
  • Artificial Intelligence
  • Natural Language Processing
  • Operational Efficiency
  • Optimization
  • Meta-data Management
  • Microsoft Azure
  • New Relic
  • Orchestration
  • Machine Learning Operations (ML Ops)
  • Mentorship
  • Microservices
  • Java
  • Kubernetes
  • Leadership
  • Machine Learning (ML)
  • IT Operations
  • Predictive Analytics
  • Python
  • Regulatory Compliance
  • Finance
  • Good Clinical Practice
  • Grafana
  • Health Care
  • Stacks Blockchain
  • Technical Direction
  • Telecommunications
  • Amazon SageMaker
  • Roadmaps
  • Scripting
  • Splunk
  • Time Series
  • Training
  • Vertex
  • Workflow
  • Risk Analysis

Summary

Description

We are seeking a highly skilled AI Ops – Senior Architect to lead the design, implementation, and optimization of AI-driven operational platforms across large-scale, mission-critical environments. The ideal candidate will possess deep expertise in machine learning–enabled operations, observability, automation frameworks, cloud engineering, and enterprise SRE/DevOps practices. This role will drive the transformation of traditional IT operations into intelligent, autonomous, self-healing systems.

The Senior Architect will collaborate with cross-functional engineering, cloud, platform, and data science teams to deliver predictive, proactive, and automated operational outcomes.

Key Responsibilities
AI-Driven Operations Architecture
Lead the architecture and implementation of AI-powered operational frameworks, including predictive analytics, anomaly detection, NLP-driven automation, and auto-remediation systems.
Define and evolve the overall AI Ops strategy, roadmap, standards, and governance.Implement intelligent monitoring and decision models that enhance reliability and operational efficiency.
Architect solutions that integrate machine learning models into production operations workflows.

Observability, Monitoring & Automation
Design end-to-end observability ecosystems (metrics, logs, traces, topology, events) integrated with AI/ML platforms.
Build anomaly detection models using ML and time-series analysis to identify issues before failures occur.
Drive automated incident detection, impact assessment, and classification using AI-based models.Implement proactive auto-healing and automated resolution workflows.

Cloud & Platform Engineering
Architect scalable AI Ops platforms using AWS, Azure, or Google Cloud Platform cloud-native services.
Design infrastructure and pipelines for AI-driven monitoring and operational insights.
Integrate AI Ops capabilities with Kubernetes, service mesh, cloud-native microservices, and distributed systems.
Optimize cost, performance, and reliability using intelligent orchestration and scaling.

Data Engineering & ML Ops Integration
Partner with data engineering teams to build robust data pipelines for operational data ingestion.
Work with ML Ops teams to operationalize ML models, including training, evaluation, deployment, and monitoring.
Ensure continuous retraining and drift detection for AI Ops models.
Define data taxonomies, quality standards, and metadata management for operational datasets.
SRE, DevOps & Automation Frameworks
Align AI Ops with SRE principles, SLIs, SLOs, and error budgets.
Integrate AI-driven insights into CI/CD pipelines and operational workflows.
Develop event-driven, automated runbooks using ML and rule-based systems.
Implement intelligent capacity planning, scaling, and resource optimization.

Security, Compliance & Governance
Ensure AI Ops solutions meet enterprise security, compliance, and audit requirements.
Define governance frameworks for AI model usage, transparency, and monitoring.
Collaborate with cybersecurity teams on intelligent threat detection and risk analysis.

Leadership & Collaboration
Provide architectural leadership and technical direction to engineering and operations teams.
Mentor teams on AI Ops concepts, automation, and intelligent operations.
Present architecture proposals and operational improvements to leadership stakeholders.
Influence enterprise-wide transformation toward autonomous operations.

Required Skills & Experience
12+ years of IT experience with 5+ years in SRE/DevOps/AI Ops architecture.

Strong expertise in:
AI Ops platforms (Moogsoft, Dynatrace Davis AI, BigPanda, New Relic AI, Datadog AIOps)
Observability stacks (Prometheus, Grafana, ELK, Splunk, AppDynamics)
ML pipelines and ML Ops tooling (SageMaker, Vertex AI, MLflow, Databricks)
Cloud architectures on AWS / Azure / Google Cloud Platform
Event-driven systems and automation tools
Strong programming/scripting in Python, Go, or Java for automation and ML integration.
Experience with Kubernetes, Docker, microservices, and distributed systems.
Deep understanding of time-series analysis, anomaly detection, NLP, and predictive analytics.
Experience operationalizing ML models and integrating them into production systems.

Preferred Qualifications
Certifications in cloud architecture or ML engineering.
Background in enterprise-scale SRE, observability, or operations automation.
Experience with LLM-based automation and AI agents for IT operations.
Experience in highly regulated industries (Finance, Healthcare, Telecom).

 

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 91165686
  • Position Id: 8977960
  • Posted 3 hours ago

Company Info

About Value Spectrum Technologies LLC

Step into a future defined by empowerment at Value Spectrum Technologies. With leading-edge software solutions and strategic consulting, were dedicated to shaping and elevating your digital tomorrow. Experience the synergy of innovation and collaboration as we unlock unparalleled opportunities for growth in the dynamic landscape of technology. Welcome to empowerment.

Join us in navigating the ever-evolving digital landscape with confidence, as we work together to unlock unprecedented opportunities and build a tomorrow that is truly empowered by the limitless possibilities of technology. Your digital future starts here.

About_Company_OneAbout_Company_Two
Contact the job poster
PC

Prazna Chingurupati

Recruiter @ Value Spectrum Technologies LLC
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Phoenix, Arizona

Today

Easy Apply

Third Party, Contract

60 - 65

Phoenix, Arizona

21d ago

Easy Apply

Contract

Depends on Experience

Phoenix, Arizona

2d ago

Easy Apply

Contract, Third Party

80 - 85

Sunnyvale, California

3d ago

Easy Apply

Contract

65 - 70

Search all similar jobs