Senior Site Reliability Engineer (Deployment)
Location: Austin, TX (Onsite 3 days/week)
Employment Type: W2 Only
Openings: 3
We are hiring Senior Site Reliability Engineers to support enterprise platform deployments for a fast-growing, AI-focused technology company. This is a hands-on role working with customer infrastructure teams to deploy, secure, automate, and optimize Kubernetes-based platforms.
Key Responsibilities:
Deploy and manage applications in Kubernetes environments (AWS, Azure, Google Cloud Platform, or on-prem)
Automate infrastructure using Terraform and GitOps practices
Integrate identity management, networking, security, and data services
Implement observability solutions including monitoring, logging, and alerting
Support production deployments, performance validation, and operational readiness
Create deployment documentation, runbooks, and operational procedures
Required Skills:
Strong Kubernetes administration and troubleshooting experience
Hands-on experience with Terraform, Helm, Kustomize, and GitOps tools
Cloud platforms (AWS, Azure, or Google Cloud Platform)
Networking fundamentals including DNS, load balancing, TLS, and VPN/private connectivity
Identity and security integration (OIDC, OAuth, SSO, RBAC, secrets management)
Monitoring, logging, distributed tracing, and incident response
Experience with PostgreSQL or similar databases
Scripting and automation skills, and the ability to read and debug Go or Python code
Strong customer-facing communication skills
Preferred:
Experience in regulated environments (Healthcare, Financial Services, etc.)
Service Mesh, Disaster Recovery, or Chaos Engineering experience
Exposure to AI/LLM-based platforms
Candidates should be able to work onsite in Austin at least 3 days per week. Local candidates are preferred, but candidates willing to relocate or commute from nearby cities will be considered.