Site Reliability Engineering Lead

Contract Corp-To-Corp, Contract W2, 6 to 12+ mos
Work from home not available Travel not required

Job Description

Job title: Site Reliability Engineering Lead

Location: Sunnyvale, CA 94085

Duration: 6-12+ months

Rate: Market DOE

Remarks: Local candidates preferred. C2C OK but H1B/EAD candidates must have at least 6 months before their VISA expires

We are looking for a hands-on, energetic and seasoned site reliability engineering lead to serve as a primary person responsible for the overall health, availability, reliability, scalability, and capacity planning of our critical services. While the engineering team will be primarily focused on development of new features, the SRE Lead will need to be in lock-step with them in gaining deep application knowledge to ensure durability and operability of the services. The SRE Lead will be responsible in automating mundane and repetitive procedures. The SRE Lead will work closely with system engineers, network engineers, database administrators, information security team to achieve service level objectives. The SRE Lead will be responsible for establishing and monitoring of service level indicators to maintain the overall health of the services. The SRE Lead will work in troubleshooting, triaging production issues, and further escalating the issues to the engineering team for a permanent fix. He/She will participate in the change management processes to ensure the durability and operability of the service.

Key Qualifications

5+ years of hands-own experience in deploying and troubleshooting of apps or services in a large scale Linux/Unix environment.

Establish change management discipline in roll-out and deployment of new product features.

Experienced in developing automated solution for deploying, monitoring, and logging of our critical services.

Proficient in solving problems wide range of issues across multiple technologies.

Experience in automation through scripts or through other tools for reducing

of manual processes.

Infrastructure knowledge of Network, Load Balancers, VM, Firewalls, Security Certificates, etc.

Working knowledge of Oracle.

Experience in troubleshooting and issue triaging.

Working knowledge of source control software (SVN or Git).

Ability to multi-task and manage tasks with varying priorities.

Ability to work independently with minimal supervision.

Excellent written and oral communication skills.

Preferred Qualifications:

Working knowledge of APM solutions such as AppDynamics or DynaTrace

Experience with containerized applications using Docker and Kubernetes

Understanding of Configuration Management systems like Chef

If interested, please send your resume to

Posted By

Joseph Huang

Dice Id : sis
Position Id : 42165
Have a Job? Post it

Similar Positions

Sr. Site Reliability Engineer
  • Bahwan CyberTek Inc.
  • Fremont, California
6073 - Sr. Site Reliability Engineer
  • Staff Tech
  • Fremont, CA
Site Reliability Engineer - 2 Positions
  • BayOne Solutions
  • San Jose, CA
Site Reliability Engineer (SRE)
  • Ideaon
  • Fremont, CA
Site Reliability Engineer
  • Radiansys, Inc.
  • Redwood City, California
Site Reliability Engineer
  • CyberCoders
  • San Mateo, CA
Site Reliability Engineer ^
  • Sunnyvale, CA
Senior Site Reliability Engineer
  • 3P&T Security Recruiting
  • Sunnyvale, California
Senior Site Reliability Engineer
  • Bayside Solutions
  • Campbell, CA
Senior DevOps / Site Reliability Engineer
  • Jefferson Frank
  • Santa Clara, CA
Sr. Site Reliability Engineer
  • Red Oak Technologies, Inc.
  • Sunnyvale, California
System Engineer II (Wearables HW Reliability Engineer)
  • WinMax Systems Corporation
  • Cupertino, CA
Site Reliability Engineer
  • Enquero
  • Mountain View, California