Site Reliability Engineer

Hybrid in Santa Clara, CA, US • Posted 23 hours ago • Updated 23 hours ago
Contract W2
Contract Independent
Contract Corp To Corp
12 Months
No Travel Required
Hybrid
Depends on Experience
Fitment

Dice Job Match Score™

⭐ Evaluating experience...

Job Details

Skills

  • Linux
  • multi-tenant environments
  • Virtual Machines
  • Kubernetes administration
  • Orchestration
  • agentic workflows
  • CLI''''s
  • MCP''''s
  • telemetry
  • automated runbooks
  • anomaly detection
  • LLM
  • workflows
  • Jenkins
  • Gitlab
  • CI/CD
  • Argo
  • Flux
  • Prometheus
  • Grafana
  • Victoria Metrics
  • Datadog
  • Splunk
  • Kibana

Summary

Site Reliability Engineer

Candidate local to Santa Clara, CA

Hybrid model

it is must for the candidate to come down for the client F2F interview. 

 

As an SRE, you''''ll also be working in conjunction with various teams such as software engineering to deploy these new products and manage our infrastructure, associated processes, and systems. Keen attention to detail, problem-solving abilities, and a solid knowledge base are essential.

 

What you''''ll be doing:

 

  • Design and operate a multi-cluster Kubernetes platform that provisions machines, workloads, and cloud instances on demand, including the controllers, CRDs, and ingress/DNS/TLS automation behind them.
  • Build and harden the platform''''s microservices —CI/CD, SSO, RBAC, secret encryption, and real-time monitoring workflows.
  • Integrate AI tooling into workloads, work on building agents and tools to support SRE teams to efficiently scale and operate
  • Own the production release path: Helm-driven deployments, multi-arch container builds, staged rollouts, and clean rollback playbooks.
  • Instrument the platform with audit logging, usage analytics, and automation that lets the SRE team support a large user base.

 

What we need to see:

 

  • 6+ years of DevOps/SRE experience operating production Kubernetes either on-premises or in the cloud, with depth in CRDs, operators, ingress, and cluster networking.
  • Experience in integrating AI tools with workflows.
  • Strong Python or go and understanding of TypeScript/React — comfortable moving across backend services, frontend UX, and infrastructure-as-code.
  • Production experience with cloud provisioning (AWS or equivalent), identity federation (OIDC/SAML), and secret management.
  • Solid grounding in relational databases, caching layers, and async networking patterns (SSH tunnels, WebSocket''''s, message queues).
  • BS/MS in CS or equivalent, with a track record of shipping internal developer platforms and CI/CD pipelines.

 

Ways to stand out from the crowd:

 

  • Prior work on Linux, multi-tenant environments, Virtual Machines, Kubernetes administration, and Orchestration.
  • Deep experience in agentic workflows, skills, tooling like CLI''''s and MCP''''s.
  • Comfort building AI-assisted tooling on top of platform telemetry — automated runbooks, anomaly detection, or LLM-driven ops workflows.
  • Knowledge and prior usage of CI tools like Jenkins/Gitlab CI, CD tools like Argo or Flux, Monitoring tools like PrometheGrafana or Victoria Metrics, Datadog, Splunk or Kibana.
  • Strong proponent of documentation and root causing issues — you leave behind runbooks and docs that let the next engineer ship on day one

 

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10126196
  • Position Id: 8972746
  • Posted 23 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

San Jose, California

Today

Easy Apply

Contract

60 - 65

Santa Clara, California

Today

Easy Apply

Contract

60 - 70

San Jose, California

2d ago

Easy Apply

Contract

Depends on Experience

San Jose, California

3d ago

Easy Apply

Contract

102 - 107

Search all similar jobs