Job Details
Responsibilities:
Design, develop, and maintain automation frameworks for performance testing and
monitoring of QuickBooks infrastructure.
Ensure the scalability and reliability of services supporting 10M+ active users.
Build and optimize tooling using Python to automate deployment, monitoring, and
operational tasks.
Work with AWS cloud services to architect resilient and efficient infrastructure.
Partner with developers, QA, and operations teams to embed SRE best practices
into the product lifecycle.
Monitor, troubleshoot, and improve system performance, and participate in on-call
rotations.
Qualifications:
5+ years of experience in Site Reliability Engineering, DevOps, or related roles.
Strong hands-on expertise with AWS (EC2, S3, RDS, Lambda, etc.).
Proficiency in Python for scripting and automation.
Experience with CI/CD pipelines, containerization (Docker, Kubernetes), and
observability tools (Prometheus, Grafana, Datadog, etc.).
Proven ability to troubleshoot complex distributed systems.
Excellent collaboration and communication skills.
Why Join Us:
Be part of a team directly responsible for keeping QuickBooks running for more
than 10 million users worldwide.
Opportunity to apply cutting-edge SRE practices at scale.
Collaborate with talented engineers in a customer-first culture at Intuit.