Site Reliability Engineer (Kafka, AWS, Terraform)
Irvine, CA (Hybrid)
Long-Term Contract
Client is seeking an experienced Site Reliability Engineer (Kafka, AWS,
Terraform) for a full-time, hybrid position based in Irvine, CA.
The candidate will deliver API Platform Engineering by leading solution initiatives.
Recommend and re-engineer solutions to address complex incidents and issues. Perform
advanced troubleshooting and root-cause analysis, recommend solutions, and fix bugs. Identify
toil and recommend/develop automation solutions. Understand business requirements
and provide solutions through the self-service platform. Participate in piloting new
products.
Qualifications:
Strong experience with Confluent Kafka and AWS cloud, including experience in
building and operating solutions for high-scale distributed systems.
Prior experience enabling observability using tools for distributed tracing,
event logging, APM, and synthetic monitoring.
Understanding of SRE practices.
Experience in automation.
Experience in building self-service platforms.
Prior experience with web services and messaging protocols.
Prior experience with Infrastructure as Code using Ansible and Terraform, and with
OpenShift/Kubernetes.
Prior experience with public cloud providers (AWS).
Strong communication skills, high energy, a sense of ownership, and a willingness
to engage with customers.
Ability to collaborate with teams and influence decisions at the interpersonal level.
Nice to have:
Understanding of the SDLC, SAFe terminology, and Agile development methodology.
Experience with CI/CD tools such as Jenkins and Git.