Site Reliability Engineer (45000)

Overview

On Site
$75
Contract - W2
Contract - Independent

Job Details

Site Reliability Engineer | 45000
DETAILS
Location: Irving, TX 75039 (hybrid, onsite 2 days per week)
Position Type: 6M C2H
Hourly / Salary: up to $135K

JOB SUMMARY
Vaco Technology is currently seeking an SRE for a 6M C2H opportunity located in Irving, TX 75039 (hybrid, onsite 2 days per week). The SRE will join a forward-thinking, technology-driven environment that is redefining how technology supports customers, partners, and business operations. The SRE team leads, directs, and provides accountability for building and running large-scale software systems. The SRE will identify and deliver automation solutions designed to ensure HA and resiliency, using expertise in software development, complexity analysis, and scalable system design. The SRE will work closely with engineering teams to ensure services and systems are highly stable and performant.
  • Collaboration / Architecture / Development – Partnering with Architecture / Development Teams, Ensuring Applications Are Highly Available / Reliable / Performant at Global Scale
  • Reliability Guidance – Collaborating with Architecture Team, Ensuring Reliability Factors are Accounted for in Business Features / Enablers
  • SLI / SLO Implementation – Guiding Development Teams in Understanding Established Service Level Objectives / Consequences | Implementing Appropriate SLIs to Support Objectives
  • Troubleshooting / Problem Resolution – Collaborating with Development Team Members to Swarm / Troubleshoot / Resolve Problems
  • Root Cause Analysis / Solution Planning – Guiding Ad-Hoc Teams to Brainstorm Solutions | Build Implementation Plans Based on Root Cause Analysis of Production Issues
  • Automation / Optimization – Designing / Building Automated Solutions to Optimize Application / Service / Platform Uptime with Minimal Human Intervention
  • On-Call Support – Availability for On-Call Rotation to Participate in Troubleshooting / Communication Efforts Outside Normal Business Hours
  • Standards / Mentorship – Implementing / Helping Create Standards / Best Practices | Mentoring Team Members to Drive Adoption Across Development Teams

About the Project: This initiative is a global digital modernization effort aimed at transforming financial services platforms to be highly available, scalable, and automated. The project focuses on migrating and optimizing applications for cloud-native architectures (Azure), implementing containerization (AKS / Kubernetes / Docker), and embedding reliability and observability into software systems through SRE practices. Key objectives include establishing SLOs / SLIs, building automated CI/CD pipelines, enhancing database performance (SQL Server / Oracle / NoSQL), and enabling enterprise-wide monitoring and incident management. The project spans multiple regions (LATAM / Europe / China / USA / Canada) and emphasizes collaboration between development, architecture, and operations teams to deliver resilient, performant, and data-driven financial services at global scale.

JOB REQUIREMENTS
  • Site Reliability Engineer – Identifying / Delivering Automation Solutions, Ensuring HA / Resiliency
  • SLO / SLI Management – Defining / Implementing / Evaluating SLOs/SLIs | Associated Consequences
  • Database Design / Optimization – Oracle / MS SQL Server / NoSQL (CosmosDB) | Designing / Evolving Database Schemas | Performing Query Performance Analysis | Indexing to Deliver Scalable / Performant Services
  • Pipeline Automation – Azure DevOps (YAML / ARM) / Terraform / Jenkins / Chef / Octopus Deploy | Designing / Building / Optimizing Automated Pipelines with Automated Testing / Automated Security Controls
  • DevOps / Containerization – AKS (Azure Kubernetes Service) / Kubernetes (Open Source) / Docker
  • Code Scanning – SonarQube / Checkmarx | Configurations / CI/CD Integrations / Running Scans / Triaging, etc.
  • Test Automation – Xamarin UITest / SpecFlow / DevTest / Selenium / Test Data Manager / Postman / Maven / TestNG / JMeter
  • High-Level Programming / Scripting – Java / C# (.NET MVC / .NET Core) / Go | PowerShell / Bash
  • Root Cause Analysis / Problem Management – Performing Root Cause Analysis / Managing Problems
  • SCRUM / Agile Leadership – Working in SCRUM / Agile Teams | Demonstrated Success Leading Improvements

Determining compensation for this role (and others) at Vaco/Highspring depends upon a wide array of factors including but not limited to the individual’s skill sets, experience and training, licensure and certifications, office location and other geographic considerations, as well as other business and organizational needs. With that said, as required by local law in geographies that require salary range disclosure, Vaco/Highspring notes that the salary range for the role is stated in this job posting. The individual may also be eligible for discretionary bonuses, and can participate in medical, dental, and vision benefits as well as the company’s 401(k) retirement plan.