Site Reliability Engineer (SRE)

Overview

On Site
Depends on Experience
Full Time

Skills

DevOps
SRE
Configuration Management Database
Jenkins
Database

Job Details

Position Title: SRE Engineer

Location: NYC/NJ, Alpharetta (Atlanta) and Arlington/DC

The ideal candidate knows how to navigate complex, large systems of many hundreds of servers and many thousands of services. Understand how to effectively use a CMDB as 70-80% of the work is in pre production. Proactive monitoring of capacity of databases, services, market data, quotas. Ideal candidate knows about query optimization, knows how to read a query plan and suggest improvements, and in general knows how to optimize code, database and configuration to get the most efficient and effective use of a database.

Skill Set Needed

Architectural Experience : High Availability Disaster Recovery (HADR Queue Replication (QREP), Elasticity, Distributed DB, CMDB (configuration management database)

OS Platform: AIX/UNIX and AWS/LINUX

Database Platforms: PostgreSQL, Oracle, DB2, MySQL

Cloud & DevOps: AWS/Google Cloud Platform/Azure, Jenkins, Git, CI/CD pipelines

Containerization: Kubernetes, Docker

Scripting & Programming: Python, Shell, SQL, Java

Monitoring & Alerting: Splunk/ELK, Prometheus, Grafana, Dynatrace/AppDynamics/New Relic

Other Tools: ServiceNow, Jira, BigPanda, PagerDuty

Troubleshooting Skills: OS/Memory/Misc Issues

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.