Overview
On Site
61.69/hr - 69.51/hr
Contract - W2
Skills
System Administration
Financial Services
Finance
Production Support
Java
Collaboration
Budget
Incident Management
Programming Languages
Python
Bash
Root Cause Analysis
Reliability Engineering
FOCUS
Scalability
Cloud Computing
Microsoft Azure
Google Cloud
Google Cloud Platform
Orchestration
Kubernetes
Grafana
Continuous Integration
Continuous Delivery
Workflow
Version Control
Terraform
Ansible
Conflict Resolution
Problem Solving
Job Details
Outstanding long-term contract opportunity! A well-known financial services company is looking for a Site Reliability Engineer/Production Support Engineer in Charlotte, NC; Dallas, TX; or Iselin, NJ (Hybrid).
Work with the brightest minds at one of the largest financial institutions in the world. This is a long-term contract opportunity that includes a competitive benefits package! Our client has been around for over 150 years and is continuously innovating in today's digital age. If you want to work for a company that is not only a household name but also truly cares about satisfying customers' financial needs and helping people succeed financially, apply today.
Contract Duration: W2, 12 months with extensions and contract-to-hire
Required Skills & Experience
- 5+ years of experience in SRE, platform engineering, or production support roles.
- 2 years of experience programming in one or more languages such as Python, Java, or Go.
- 1+ years of experience with cloud technologies.
- Lead complex, high-impact initiatives including systems consultation and SRE strategy implementation.
- Drive observability improvements by identifying gaps in monitoring, logging, and tracing across platforms.
- Collaborate with engineering teams to define SLIs, SLOs, and error budgets.
- Automate operational tasks and incident response workflows using modern programming languages (e.g., Python, Go, Bash).
- Design and implement scalable, resilient systems using infrastructure-as-code and CI/CD pipelines.
- Conduct root cause analyses and postmortems to improve system reliability.
- Consult on technical changes and enhancements with a focus on performance, scalability, and fault tolerance.
- Partner with architects and engineers to align with enterprise strategies and ensure secure, maintainable solutions.
- Strong understanding of distributed systems, cloud platforms (OpenShift, Azure, Google Cloud Platform), and container orchestration (Kubernetes).
- Experience with observability tools such as Prometheus, Grafana, AppDynamics (AppD), or Splunk.
- Familiarity with CI/CD workflows, version control systems, and infrastructure-as-code tools (e.g., Terraform, Ansible).
- Proven ability to identify and remediate gaps in system observability and performance.
- Excellent problem-solving skills and ability to lead cross-functional teams.