Site Reliability Engineer (SRE) - Remote

SRE/DevOps, Java, Python, CI/CD techniques, Application Support, Kubernetes, Docker, Prometheus, Distributed clusters, Cassandra
Contract W2, 6 Months
Depends on Experience
Travel not required

Job Description

Role: Site Reliability Engineer (SRE)

Location: Cupertino, CA

Duration: 6+ months

Type of hire: W2


You will maintain and improve the client's next-generation telemetry system, which a wide range of teams depend on to maintain the reliability and health of their services. The candidate is expected to be highly self-motivated, with a passion for excellence, quality, and detail. They will support development and operations with a focus on improving the stability and scalability of the overall system (SRE).
Key qualifications:

  • Certified Kubernetes Application Developer (CKAD) - Good to have
  • Deploying and triaging large-scale distributed applications on k8s and other cloud platforms - Must have
  • Experience with technologies like Cassandra, Zookeeper, Kafka, Spark - Minimum knowledge
  • Ability to troubleshoot issues across the entire software stack - Must have
  • Experience with Helm, Docker, Terraform, and general containerization of microservices - Must have
  • Strong coding skills in Python or Scala
  • Experience with Prometheus, Grafana, and Telegraf - Good to have
  • Knowledge of the Linux operating system and its variants - Good to have (not looking for a sysadmin)
  • Excellent communication skills - Must have
Responsibilities:

  • On-call for applications running on k8s and other platforms
  • P2 Customer PD Incidents
  • Keeping dev, integration, and production clusters online and meeting SLOs
  • Building tooling and automation for various operational tasks
  • Improving overall application health and customer experience
Dice Id : minds
Position Id : 7130355
Originally Posted : 3 months ago