SRE with Agentic AI Experience || REMOTE

Remote • Posted 1 hour ago • Updated 21 minutes ago
Contract W2
Remote
Fitment

Dice Job Match Score™

👾 Reticulating splines...

Job Details

Skills

  • SRE
  • AGENTIC AI

Summary

Role: SRE with Agentic AI Experience
Location: 100% Remote - Must be from EST/CST
Duration: 24+ Months
Must Have:
Minimum 12 years of experience required.
SRE, Agentic AI, experience in fixing Java production bugs, SQL and Kubernetes

Technology Stack & Environment

This role directly builds on the existing Lynx SRE technology stack:

  • Cloud: Microsoft Azure
  • Compute: Kubernetes, Docker
  • Applications: Java based services
  • CI/CD: GitHub Actions
  • Observability: Dynatrace (preferred and strategic)
  • Automation & Scripting: Python, Bash, Ansible
  • Agentic AI & Automation:
    • Microsoft Agent Framework
    • Azure hosted AI agents
    • Multi agent orchestration patterns (triage, comms, PIR agents)
    • Human in the loop safety and approval models

Required Qualifications

  • 7+ years of Site Reliability Engineering or Production Engineering experience
  • Strong hands on experience with:
    • Azure cloud infrastructure
    • Kubernetes and Docker
    • Java production systems
    • CI/CD pipelines (GitHub Actions)
    • Observability platforms (Dynatrace strongly preferred)
  • Demonstrated experience automating infrastructure and operational workflows
  • Deep understanding of SRE principles (SLIs, SLOs, error budgets)

Preferred / Differentiating Qualifications

  • Experience designing automation that replaces or materially reduces on call toil
  • Experience building or orchestrating AI agents applied to operational workflows
  • Familiarity with multi agent architectures or distributed automation systems
  • Strong judgment around risk management, safety boundaries, and human in the loop design
  • Experience working in healthcare or regulated environments

What Success Looks Like

  • Reduced manual operational toil and alert fatigue
  • Faster, more consistent incident triage and resolution
  • High quality, standardized incident communications
  • Fewer customer visible incidents through proactive automation
  • A measurable shift from human driven operations to AI assisted reliability

Summary

This role is ideal for an SRE who:

  • Can run production systems today
  • Thinks in systems and platforms, not tickets
  • Is motivated to automate themselves out of repetitive operational work
  • Understands that AI augments-not replaces-engineering accountability
Thanks & Regards,
Namrata Ahuja | Lead Talent Acquisition - US Staffing

T |

#LI-NA1

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 91010771
  • Position Id: 2026-5686
  • Posted 1 hour ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Remote

Today

Easy Apply

Third Party, Contract

Depends on Experience

Remote

Today

Easy Apply

Contract

Depends on Experience

Remote

Today

Easy Apply

Contract

Depends on Experience

Remote

Today

Easy Apply

Contract

$70+

Search all similar jobs