Overview
Skills
Job Details
Site Reliability Engineer (SRE) with 10+ Years - onsite- Atlanta/Dallas/Seattle/Kansas-Day 1 onsite
Site Reliability Engineer (SRE) Location: Atlanta/Dallas/Seattle/Kansas -Day 1 onsite
Job Description:
Seeking an experienced, Site Reliability Engineer who can operate independently with limited guidance and oversight. This individual will be passionate about end-user experience and will be part of a tight-knit, distributed engineering team developing and delivering a comprehensive dat a operations management solution. SRE is a critical role in the entire SDLC from coding, scaling, and ensuring production stability that includes responding to on-call incidents.
Tech Stack: AWS/Azure/Google Cloud Platform, CI/CD Pipeline, Jenkins, Java Microservices, Setting up dev pipeline, Big Query, Python, Terraform - Dataflow in Google Cloud Platform/AWS,API Programming, python, CICD
Tools & Frameworks: Nx build management, Monorepo architecture, Jenkins CI/CD, Fortify, Sonar, GitHub
Cloud & Data: Google Cloud Platform (GKE, Composer + Airflow, Dataflow + Apache Beam, BigQuery, BigTable, Firestore, GCS, PubSub, Vertex AI), Terraform, Helm Charts, GitOps
Other Technologies: Websockets, SSE, event-driven architecture
General Responsibilities
Contribute to Development Activities: SRE is expected to participate in SDLC activities that include design, develop, test, deploy, and operate, covering both frontend and backend
Cross-Functional Work: Collaborate with global teams to integrate with existing internal systems and Google Cloud Platform cloud Issue Resolution: Triage and resolve product or system issues, ensuring quality and performance Documentation: Write technical documentation, support guides, and run books
Agile Practices: Participate in sprint planning, retrospectives, and other agile activities
Compliance: Ensure software meets secure development guidelines and engineering standards
SRE Accountability General: Use coding, automation, and software engineering principles to ensure scalability, performance, and reliability efficiently and toil-free
IAC: Build infrastructure as code (IAC) patterns that meet security and engineering standards using one or more technologies (Terraform, scripting with cloud CLI, and programming with cloud SDK)
CI/CD: Build CI/CD pipelines for build, test and deployment of application and cloud architecture patterns, using platform (Jenkins) and cloud- native toolchains