AIOps Engineer

  • Reston, Virginia
  • Posted 9 hours ago | Updated moments ago

Overview

On Site
DOE
Contract - W2

Skills

Optimization
Reliability Engineering
Scalability
Machine Learning (ML)
Elasticsearch
SQL
NoSQL
Database
JIRA
Confluence
Provisioning
Terraform
Ansible
Backend Development
Management
Mentorship
Knowledge Sharing
Training
Documentation
Machine Learning Operations (ML Ops)
Artificial Intelligence
Python
Scripting
Shell
Windows PowerShell
LangChain
Autogen
Amazon Web Services
Continuous Integration
Continuous Delivery
Version Control
Git
Docker
Kubernetes
DevOps
Jenkins
Selenium
Dashboard
Splunk
Communication
Collaboration
Regulatory Compliance
System On A Chip
Microsoft Azure
Google Cloud Platform
Google Cloud
Cloud Computing
Open Source

Job Details

Job Summary

We are seeking a highly skilled AIOps Engineer to lead the development, integration, and optimization of AI-driven operations platforms. The ideal candidate will have deep expertise in AI/ML operations, infrastructure automation, and cloud-native technologies, and will play a key role in enhancing system reliability, scalability, and performance through intelligent automation.

Key Responsibilities

  • Design, develop, and support AI/ML operations platforms and autonomous agent frameworks.
  • Integrate AI agents with diverse data sources including Elasticsearch, SQL/NoSQL databases, Jira, Confluence, and Git.
  • Implement and maintain CI/CD pipelines and automate infrastructure provisioning using IaC tools (Terraform, Ansible).
  • Support AWS operational environments and ensure alignment with core cloud services.
  • Develop scripts and automation using Python for backend development and system tasks.
  • Configure and manage containerized environments using Docker and Kubernetes.
  • Troubleshoot and resolve complex DevOps and AI-related issues.
  • Collaborate with cross-functional teams to align AI systems with business goals.
  • Mentor junior engineers and contribute to code reviews and knowledge sharing.
  • Create and maintain SOPs, training plans, and documentation for DevOps and AI operations.
  • Measure and improve process efficiency and effectiveness through automation.
  • Ensure compliance with security policies and regulatory standards.

Required Qualifications

  • Minimum 8 years of experience in AI development, MLOps, or related domains.
  • Minimum 4 years of experience architecting AI solutions and environments.
  • Strong proficiency in Python and scripting languages (Shell, PowerShell).
  • Hands-on experience with agentic frameworks (e.g., LangChain, AutoGen).
  • Experience with AWS cloud services and infrastructure automation.
  • Proficiency in CI/CD tools and version control systems (Git).
  • Familiarity with containerization technologies (Docker, Kubernetes).
  • Strong understanding of DevOps tools and practices (Jenkins, SonarQube, Selenium, etc.).
  • Experience with monitoring and dashboard tools (ELK, Splunk).
  • Excellent communication and collaboration skills.

Preferred Qualifications

  • Experience leading global teams or projects.
  • Familiarity with compliance frameworks (e.g., NIST, SOC1, SOC2).
  • Knowledge of Azure, Google Cloud, and hybrid cloud environments.
  • Experience with open-source contributions and plugin integrations.

Education

  • Bachelor's Degree