Title- Site Reliability Engineer - SQL Server
remote
About the job
Our Client, a multi-national telecommunications technology is seeking an experienced Site Reliability Engineer (SRE) with hands-on SQL Server to join their Data Center Engineering team. This role requires a technically strong and operationally mature engineer who will help design, scale, and maintain the reliability of our physical and virtual data center infrastructure. You will be a technical leader responsible for ensuring system uptime, optimizing capacity and performance, and contributing to long-term infrastructure resiliency.
Key Responsibilities:
- Architect, deploy, and support SQL Server Always On Availability Groups across on-prem and Azure environments.
- Install, configure, and maintain SQL Server Failover Cluster Instances (FCI).
- Administer Windows Server Failover Clustering (WSFC), quorum, and node health.
- Plan and execute failover testing, DR drills, and recovery validation.
- Perform SQL Server installation, upgrades, patching, and migrations.
- Manage backup, restore, and disaster recovery strategies.
- Monitor and tune database performance (indexing, query optimization, waits).
- Troubleshoot replication, latency, blocking, and cluster-related issues.
- Configure and manage AG listeners, networking, DNS, and storage.
- Ensure database security, auditing, and compliance.
- Automate DBA tasks using T-SQL, PowerShell, and SQL Agent.
- Support application deployments and schema changes.
- Provide 24-7 production support for mission-critical systems.
- Maintain runbooks, standards, and operational documentation.
If you would like to learn more, please reach out to Elite technical
Required Skills
Education and Experience
- Bachelor-s degree in Computer Engineering, Electrical Engineering, Information Technology, or a related technical field.
- 4-7 years of experience in database administration and operations.
- Experience participating in or leading incident response and postmortem analysis processes.
- Previous exposure to hybrid environments integrating on-premise data centers with public or private cloud platforms is desirable.
- 4+ years of experience with SQL Server Database.
- 4+ Years of experience with any other RDBMS Database knowledge (oracle, PostgreSQL).
Required Skills:
- Hands-on experience with backup, recovery, and HA solutions.
- Strong proficiency in Linux and Debian environments.
- Proficiency in scripting for database automation.
- Excellent analytical, problem-solving, and troubleshooting skills.
- Strong communication skills for cross-team collaboration.
- Understanding of Oracle and MySQL databases is a plus, but not mandatory.