Momento USA is a global technology consulting, talent acquisition, and creative development firm that addresses clients' most pressing needs and challenges. We are currently looking for a Lead AI Ops/SRE Engineer.
Role: Lead AI Ops/SRE Engineer
Location: Fremont, CA (Hybrid 3-4 days)
Duration: 12+ Months
Role Summary
We are looking for a strong hands-on Lead AI-Assisted SRE / AIOps Engineer to help operationalize and scale an SRE agent-driven operations model. This role will lead the onboarding of existing scripts, SOPs, and operational workflows into the SRE agent while also supporting production releases, validation, incident response, and operational governance.
This is not a pure support role. The ideal candidate must be technically strong, practical, and capable of using independent judgment rather than relying blindly on AI outputs.
Experience
- Total 14+ years of experience required and around 5+ years of hands-on experience in IT operations, cloud operations, SRE, platform support, or production engineering
- Proven experience in production support, incident handling, automation, and operational troubleshooting
- Experience working with monitoring, observability, scripting, and release validation
- Must have AIOps, AI-assisted operations, or automation-led support models Experience.
Key Responsibilities
- Lead the adoption and operationalization of the SRE agent across support and reliability workflows
- Translate existing scripts, runbooks, SOPs, and operational knowledge into agent-compatible workflows
- Work with teams to identify which use cases should be automated, semi-automated, or remain human-driven
- Validate agent outputs, recommendations, and remediation steps before operational use
- Support production releases, release validation, smoke testing, and post-release health checks
- Drive troubleshooting during incidents and ensure proper root cause analysis and follow-through
- Improve alert handling, event correlation, and operational response patterns
- Coordinate with engineering, operations, and platform teams on onboarding and process changes
- Mentor junior engineers and guide them on workflow design, validation, and operational execution
- Maintain high-quality documentation, runbooks, and operational standards
Required Technical Skills
- Strong hands-on scripting experience in PowerShell, Python, Shell/Bash
- Experience with monitoring, alerting, logs, dashboards, and incident workflows
- Good understanding of production support processes, release support, and validation practices
- Experience with cloud platforms, preferably Azure
- Familiarity with ITSM/ticketing tools such as ServiceNow, Jira, or similar
- Ability to understand existing operational scripts and modernize them into scalable workflows
- Experience with APIs, integrations, or automation pipelines is preferred
Exposure to Kubernetes / AKS/AI tools - ChatGPT, copilot is a plus
Thanks,
Samuel Brown
Momento USA | Exceeding Customer Expectations…
440 Benigno Blvd, Unit#A 2nd Floor. Bellmawr, NJ 08031
Interstate Business Park
Tel : Direct: / Extn 1020
Email: Web:
Note: Momento USA is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, gender identity, national origin, age, protected veteran status, or disability status.