Sr Devops Engineer

Overview

On Site
Depends on Experience
Full Time

Skills

Devops
AWS
EKS

Job Details

Job Description :

As a Senior DevOps Engineer, you will actively interface with software developers, product managers, test engineers, and administrators on projects to design and develop the build, release, and deploy toolchain for DevOps while providing on-call support. You should be able to identify, troubleshoot and resolve issues quickly and effectively, sometimes under pressure. Responsibilities include capacity planning, high availability engineering, performance tuning, and automation/tools development. We have a focus on continuous improvement and INNOVATION.

You should have good leadership skills, experience managing infrastructure through multiple product releases, and have a passion for reliability and security. Work with management to set priorities, track operational metrics. Excellent communication skills and teamwork are a must!

Responsibilities:

  • Lead the design and management of highly available, scalable Kubernetes infrastructure using Amazon EKS.
  • Design, manage and optimize large scale EKS clusters with thousands of nodes.
  • Implement and manage infrastructure using Infrastructure as Code (IaC) with Terraform and Terragrunt.
  • Champion a Git-first approach to infrastructure and CI/CD automation.
  • Build and maintain robust CI/CD pipelines using Harness to streamline application delivery across environments.
  • Drive infrastructure automation and environment consistency across development, staging, and production.
  • Design and implement monitoring and observability solutions using New Relic, ensuring performance visibility and uptime.
  • Securely manage secrets using AWS Secrets Manager
  • Support production systems through incident management, root cause analysis, and proactive reliability improvements.
  • Collaborate with engineering and security teams to define best practices for deployment, security, and scalability.
  • Mentor and guide junior DevOps team members and developers in adopting modern infrastructure practices.
  • Support the larger team as an active contributor offering innovative ideas to solve problems, improve performance and drive what comes next.

Qualifications:

Required Skills/Experience:

Bachelor s degree in Computer Science, Engineering, or equivalent experience.

  • 5+ years in a , SRE, or Platform Engineering role supporting cloud-native applications.
  • Extensive experience with Kubernetes (EKS), including Helm, networking, and security.
  • Serverless computing as it related to CI/CD, Automation and IaC.
  • Proven expertise building pipelines with Harness, Git, and associated CI/CD tooling.
  • Proficient with Linux systems, including performance tuning and troubleshooting.
  • Strong scripting and automation skills using Python or Golang.
  • Solid understanding of AWS services: EKS, EC2, S3, IAM, RDS, VPC, Route53, CloudWatch, etc.
  • Proficient with Linux systems, including performance tuning and troubleshooting.
  • Experience with OS Internals (Building, Deploying and Software Operations).
  • Deep knowledge of networking fundamentals: DNS, TLS, HTTP/S, Load Balancing.
  • Strong understanding and experience with Storage as it relates to IO Operations with Development, Testing, Deployment and Operations.
  • Experience with Ansible for configuration and provisioning.
  • Working knowledge of MySQL and PostgreSQL.
  • Strong experience in monitoring, logging, and alerting, particularly using New Relic.
  • Experience with Kafka, RabbitMQ, or other pub/sub systems is a plus.

Demonstrated ability to support and troubleshoot distributed production

  • Outside-the-box thinking to continuously assess and improve efficiency throughout our DevOps/Cloud/SRE environments... Innovation is key!
  • This position is an onsite/hybrid role. Currently, the team is working Mon/Tue/Wed onsite in Redwood City, CA.
  • This position does have an on-call rotation. One week on-call every 3 weeks, as this responsibility is shared across the team.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.