Sr. DevOps Engineer (Storage Platform)

Hybrid in San Ramon, CA, US • Posted 1 day ago • Updated 1 day ago
Contract W2
Hybrid
Depends on Experience
Fitment

Dice Job Match Score™

📊 Calculating match score...

Job Details

Skills

  • Computer Networking
  • Change Management
  • Cloud Computing
  • Collaboration
  • Communication
  • Automated Testing
  • Backup
  • IT Service Management
  • ITIL
  • Incident Management
  • Enterprise Storage
  • Fiber Channel
  • Git
  • Golang
  • Grafana
  • Hypervisor
  • Continuous Delivery
  • Continuous Integration
  • Data Storage
  • Documentation
  • IP
  • Ansible
  • OpenStack
  • Operating Systems
  • Provisioning
  • Python
  • Recovery
  • Red Hat Enterprise Linux
  • Kubernetes
  • Legacy Systems
  • Management
  • Migration
  • NetApp
  • Infrastructure Lifecycle Management
  • Intellectual Property
  • SUSE Linux
  • Scalability
  • Scripting
  • Kernel-based Virtual Machine
  • Red Hat Linux
  • Root Cause Analysis
  • SDS
  • Storage
  • Bash
  • CentOS
  • Ceph
  • Technical Writing
  • Terraform
  • Testing
  • Ubuntu
  • Cisco MDS
  • Computer Science
  • DevOps
  • Linux
  • Storage Engineering
  • Telco
  • Virtual Machines
  • Wiki
  • Workflow
  • iSCSI

Summary

Job Summary
We are seeking a highly experienced Sr DevOps Engineer Storage Platforms to build, automate and operate large-scale Software Defined Storage (SDS) and Kubernetes platforms in a private cloud environment. This role focuses on Storage Engineering with Infrastructure-as-Code and GitOps practices while ensuring scalability, resilience, and performance.

This is a deeply technical role requiring expert-level understanding of Software Defined Storage, Kubernetes and extensive working knowledge on Linux Operating systems. You will also collaborate with platform and SRE teams to maintain secure, performant, and multitenant-isolated services that serve high-throughput, mission-critical applications.
Key Responsibilities

  • Deploy, automate, and operate large-scale Software Defined Storage architectures across private and public cloud regions within ITIL methodology.
  • Deploy and support enterprise storage platforms (Pure Storage, HPE, NetApp) and SDS solutions (Ceph, Longhorn).
  • Integrate self-service storage workflows for Kubernetes CSI and OpenStack consumers (VM and Baremetal).
  • Implement and manage backup solutions (preferably Rubik).
  • Build and maintain Infrastructure-as-Code for storage platforms using Ansible, Terraform, Helm and Git, with Python/Bash automation.
  • Implement CI/CD pipelines for infrastructure updates, patching, upgrades, testing, and rollback.
  • Implement and improve monitoring, alerting, and observability for storage systems (capacity, latency, IOPS, recovery health) using GitOps and tools such as Prometheus, Loki, and Grafana.
  • Perform deep troubleshooting across storage, Kubernetes, hypervisors, networking, and Linux systems.
  • Develop and maintain technical documentation, architecture diagrams, operational procedures, and runbooks
  • Participate in on-call rotations, incident response, and root cause analysis.
  • Collaborate globally on change management, documentation, and operational best practices.

Must Have

  • 6+ years of experience managing enterprise storage and Kubernetes platforms on Linux.
  • Strong hands-on experience with SDS solutions (Ceph, Longhorn) and storage migrations from legacy systems.
  • Expertise with block, file, and object storage, including Fibre Channel (Cisco MDS) and IP-based protocols (NVMe-oF or iSCSI farbics).
  • Expert knowledge of Kubernetes and Linux systems (Ubuntu, RHEL/CentOS).
  • Proficiency with Infrastructure-as-Code (IaC) (Ansible, Terraform) for provisioning storage and backup schedules
  • Expertise in backup technologies (preferably Rubik)
  • Strong scripting skills in Python and Bash (Golang (GO) a plus).
  • Experience operating 24x7 mission-critical production environments.
  • Hands-on experience with KVM hypervisors (Suse Harvester, OpenStack).
  • Strong written and verbal communication skills.
  • Proficiency with Git, CI/CD pipelines, and automated testing frameworks
  • Ability to write technical documentation and contribute to community wikis or knowledge bases.
  • Bachelor s degree in computer science or equivalent professional experience.

Nice to Have

  • OpenStack Cinder multi-backend administration.
  • Backup platforms (Rubrik).
  • Understanding of CIS/NIST security and infrastructure lifecycle management.
  • ITIL Foundation/advanced certifications in support of ITSM standard methodology.
  • Background in telco, edge cloud, or large enterprise environments.
  • CNCF Certified Kubernetes Administrator (CKA), Certified Kubernetes Security Specialist (CKS) or Red Hat specialist in Ceph Storage Administrator (EX125) certifications.
  • Master s degree in computer science, IT, Engineering, or a related field preferred; equivalent experience and relevant industry certifications will also be considered

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: prutx001
  • Position Id: 8954119
  • Posted 1 day ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Hybrid in San Ramon, California

Today

Easy Apply

Contract

Depends on Experience

Hybrid in Walnut Creek, California

7d ago

Easy Apply

Third Party, Contract

Depends on Experience

San Leandro, California

Today

Easy Apply

Contract

55 - 60

Concord, California

Today

Easy Apply

Third Party, Contract

Depends on Experience

Search all similar jobs