SRE Engineer (OpenStack /
Experience: SRE / OpenStack Platform / Private Cloud Operations
Timezone: EST & NA
Location: Remote
Must-Have (Non-Negotiable)
1) Strong Linux, Networking & Storage fundamentals
a) Linux internals, kernel tuning (RHCE-adjacent), filesystems, partitions
b) Storage: LVM, SCSI multipath, Ceph basics, IO troubleshooting
c) Networking: DHCP, DNS, VLANs, bonding, basic routing, tcpdump/traceroute
2) OpenStack Services Operations & Troubleshooting
a) Hands-on experience managing and triaging OpenStack services (Nova, Neutron, Cinder, Keystone)
b)Production issue troubleshooting, RCA, customer-facing debugging
Good to Have / Learnable
1) Kubernetes - Basic understanding of pods, services, and cluster concepts (deep expertise not mandatory)
2) Monitoring & Observability - Fundamentals of metrics, alerts, and logs (Grafana / Prometheus ecosystem preferred)
3) Automation - Basic scripting in Python or Go
Role Expectations
- Troubleshoot complex platform issues in customer OpenStack/Linux environments
- Participate in incident management, RCAs, and on-call rotation
- Proactive system monitoring and performance tuning
- Collaborate with engineering on fixes, PRs, and platform improvements
- Clear communication with customers over calls and written channels
Note: Linux / Openstack Certifications are desirable / not mandatory