Overview
On Site
Full Time
Skills
SQL
WEB SERVICES
REST
SOAP
Agile
HTTP
JIRA
JSON
dns
JMS
TCP
SPLUNK
nagios
udp
Kafka
APM
Confluence
DOCKER
Git
Microservice
Amazon Web Services
KUBERNETES
Incident Management
OPERATIONS
GCP
Grafana
Site Reliability Engineer
CONTINUOUS INTEGRATION/DELIVERY
Distributed Systems
Job Details
Title: Site Reliability Engineer (SRE)
Location: Austin, TX (Onsite)
Duration: Fulltime
Experience: 9+ Years
Technical Skills:
- 9+ years of professional engineering experience developing, managing, or supporting distributed systems
- 4+ SRE experience managing multi-cloud platforms
- Strong trouble shooting skills in debugging multiarchitecture systems and experience with microservices architecture patterns is must.
- Strong Experience in Issues Resolution and Incident management, RCA Creation, and follow-up.
- Enterprise Cloud infrastructure experience e.g., Google Cloud Platform, AWS
- Strong working knowledge of modern development technologies and tools e.g., Agile, CI/CD, Git, Jira, and Confluence.
- Experience in developing and managing operations leveraging key event streaming, messaging, and DB services e.g., MQ/JMS/Kafka, Cloud SQL, etc.
- Strong experience in using industry standard monitoring tools e.g., AppDynamics, Dynatrace, Splunk, Grafana, Nagios, Datadog, New Relic, Tempo, Loki, etc.
- Experience working with containers e.g., Docker, Kubernetes, Cloud Foundry, etc.
- Deep knowledge of Internet protocols and web services technologies e.g., HTTP, DNS, TCP/UDP, SOAP, JSON, and REST
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.