Sr. Site Reliability Engineer

company banner
Judge Group, Inc.
Engineer, Engineering, Systems, Programming, Java, Python, Oracle, Networking
Full Time

Job Description

Location: Phoenix, AZ
Enterprise Financial Services partner of The Judge Group, seeking a Senior Site Reliability Engineer to augment their Phoenix-based team with this DIRECT-HIRE, permanent addition.

** This client is not able to provide sponsorship upon conversion at this time - This is a FULL TIME and DIRECT HIRE opportunity. Applicants will need to be eligible for outright hire without sponsorship support presently, or at any time down the line.

IMPORTANT: Applicants will NOT be presented to the hiring decision maker(s) without first coordinating a preliminary qualification discussion with a Judge Delivery representative. CONTACT: Sky Donovan - to coordinate a time to connect and learn more.

Position Summary:
Serve as a core member of an SRE team of engineers and architects to build observability, alerting, tracing, automation, and self-healing capabilities for mission critical high availability platform for credit and fraud risk decisioning.

Day to Day:

  • Building tools and frameworks for deployments, observability and alerts for distributed systems and data layer
  • maintaining the highest levels of platform availability, reengineer systems and code for continuous improvement by providing 24*7 support to critical platforms and journeys.
  • Implement software development practices to build observability, alerting, tracing, automation, and self-healing capabilities to maintain the highest levels of platform availability.
  • Building, maintaining, and improving the software deployment platform, automated analysis, and hardware health Instrumenting for observability, defining KPIs & sharing operational insights
  • Be part of high-performance team of SRE engineers and architects.
  • Drive accountability for quality aspects in release, system performance, platform availability, operational efficiency, risk management, information security and data management.
  • Champion and drive cross-organizational development efforts for the optimization of application monitoring and resiliency capabilities.
  • Cultivate an environment of innovation and continuous improvement, leading changes that drive efficiencies into existing engineering and delivery processes. Implement tools, automation, and processes to ensure software is released in a streamlined and reliable manner from development to production.
  • Consistently question assumptions, challenge the status quo, and strive for improvement. Integrity, passion, high-energy, personal accountability, and a desire to lead others to excellence.
  • Consistent track record of influential leadership and building high-performance self-managing, self-motivated teams.
  • Collaborate with delivery teams to resolve technical direction and approach to system design and implementation.
  • Act as forward-thinking engineer and remain current on modern technology-leading trends.

Minimum Qualifications

  • Blend of business and technical capability and has a strong desire to learn new things as well as research customer insights, trends, and business challenges.
  • 7+ years of proven experience building and shipping large scale technical products.
  • Position requires a Bachelor's or Master's degree in CS, Engineering, Information Systems, or a related STEM field
  • Programming: Java, Python, Node, understanding of databases (Couchbase/Cassandra), Kafka, Hadoop, Redis, Oracle etc. Architecting, designing, developing, and troubleshooting systems CICD pipelines - Jenkins or other tools, GH actions
  • Observability: Open Telemetry - tracing, metrics, logging, Splunk, Kibana, Prometheus, Grafana; Build custom tools Cloud - container based technologies, Kubernetes, Docker, Istio, service discovery, OpenShift, AWS, Google Cloud Platform etc.
  • Understanding of AI/ML concepts, frameworks Networking principles, Operating Systems, understanding of load balancing at L4 and L7 - F5 LTM, GTM, DNS, infrastructure knowledge, cloud operations
  • Capacity management, DR, and HA strategies Large scale software product engineering experience with contemporary tools and delivery methods in a complex environment (e.g., DevOps, CD/CI, Agile, etc.)
  • Extensive experience as a leader within a Technology organization and/or in a product development company with experience developing observability and self-healing capabilities
  • Strong written and oral communication skills.
  • Must be comfortable delivering presentations to small and large groups including executive level management internally and externally.
  • Experience with matrix organizations consisting of multi-functional teams and experience in driving large-scale change efforts.

IMPORTANT: Applicants will NOT be presented to the hiring decision maker(s) without first coordinating a preliminary qualification discussion with a Judge Delivery representative. CONTACT: Sky Donovan - to coordinate a time to connect and learn more.


This job and many more are available through The Judge Group. Find us on the web at

Company Information

The Judge Group, celebrating its 50th anniversary, is a leading professional services firm specializing in talent, technology, and learning solutions. We consult, staff, train, and solve. Through our work we make people and organizations better. Our services are successfully delivered through a network of more than 30 offices in the United States, Canada, and India. The Judge Group serves more than 50 of the Fortune 100 and is responsible for over 9,000 professionals on assignment annually across a wide range of industries.

Dice Id : cxjudgpa
Position Id : 769831
Originally Posted : 2 months ago

Similar Positions at Judge Group, Inc.

Software Engineer
  • Phoenix, AZ
  • 16 hours ago
Senior Engineer Operations
  • Phoenix, AZ
  • 16 hours ago
Sr, Enterprise Batch Services Scheduler
  • Phoenix, AZ
  • 16 hours ago
Salesforce Engineer
  • Phoenix, AZ
  • 16 hours ago
Site Reliability Engineering Manager
  • San Diego, CA
  • 16 hours ago
Big Data Java Engineer
  • Phoenix, AZ
  • 16 hours ago
  • Kalamazoo (charter Township), MI
  • 16 hours ago
Software Reliability Engineer (74088-1)
  • Chicago, IL
  • 16 hours ago
Senior Staff Software Engineer
  • Carlsbad, CA
  • 16 hours ago
Sr Software Engineer
  • Los Angeles, CA
  • 16 hours ago