SRE Lead || Phoenix, AZ || LOCALS ONLY || Hybrid

Hybrid in Phoenix, AZ, US • Posted 1 day ago • Updated 1 day ago

Contract W2

12 Months

No Travel Required

Hybrid

Depends on Experience

Value Spectrum Technologies LLC

Job Details

Skills

Apache Kafka
Amazon Web Services
Grafana
SRE
NodeJS
Java
AIOPS
Lead

Summary

SRE Lead & Monitoring Consultant

Key Responsibilities

SRE Practice Development

•⁠ ⁠Assess operational maturity and build SRE transformation roadmap
•⁠ ⁠Establish SLOs, SLIs, and error budgets for critical services
•⁠ ⁠Design incident management processes and on-call strategies
•⁠ ⁠Implement chaos engineering and resilience testing
•⁠ ⁠Mentor teams on SRE principles and best practices

Monitoring & Observability

•⁠ ⁠Deploy and configure Datadog, Splunk, Grafana, and Prometheus
•⁠ ⁠Implement metrics collection, log aggregation, and APM
•⁠ ⁠Build custom dashboards and alerting configurations
•⁠ ⁠Set up anomaly detection and intelligent alerting
•⁠ ⁠Configure automated health checks and remediation
•⁠ ⁠Establish golden signals monitoring (latency, traffic, errors, saturation)

Reliability & Compliance

•⁠ ⁠Conduct reliability reviews and performance optimization
•⁠ ⁠Design disaster recovery and failover procedures
•⁠ ⁠Implement security monitoring and audit logging
•⁠ ⁠Configure fraud detection and transaction monitoring
•⁠ ⁠Create runbooks and operational documentation

Required Qualifications

Experience:

•⁠ ⁠7+ years in Site Reliability Engineering, DevOps, or infrastructure engineering
•⁠ ⁠3+ years in SRE leadership roles.
• Strong expertise in Java, Node.js, Kafka, AWS, and modern AIOps/observability practices.
• Experience implementing proactive monitoring and predictive alerting using AIOps platforms and machine-learning-driven insights.
• 3+ years of hands-on experience with Datadog, Splunk, Grafana, and Prometheus.
• Strong hands-on experience with Java and Node.js application architectures.
•⁠ ⁠Previous experience in fintech or regulated industries.
•⁠ ⁠Proven track record building SRE practices from scratch.

Technical Skills

•⁠ ⁠Deep understanding of SRE principles, error budgets, and SLO/SLI frameworks.
•⁠ ⁠Expertise with cloud platforms (AWS, Azure, or Google Cloud Platform).
•⁠ ⁠Proficiency with Kubernetes, Docker, and infrastructure as code (Terraform, Ansible).
•⁠ ⁠Strong programming/scripting skills (Python, Go, Bash).
•⁠ ⁠Experience with incident management and post-mortem culture.
•⁠ ⁠Knowledge of compliance requirements (SOC 2, PCI-DSS, ISO 27001).

Soft Skills

•⁠ ⁠Exceptional leadership and mentoring abilities.
•⁠ ⁠Strong communication and stakeholder management.
•⁠ ⁠Data-driven decision-making approach.
•⁠ ⁠Collaborative mindset with ability to drive cultural change.

Preferred Qualifications

•⁠ ⁠Cloud certifications (AWS, Google Cloud Platform, Azure) or Kubernetes certifications (CKA/CKAD).
•⁠ ⁠Experience with ELK stack.
•⁠ ⁠Background in cloud cost optimization.
•⁠ ⁠Multi-cloud or hybrid cloud experience.

Deliverables

•⁠ ⁠SRE maturity assessment and transformation roadmap
•⁠ ⁠Fully configured monitoring stack with Datadog, Splunk, Grafana, and Prometheus
•⁠ ⁠SLO/SLI definitions and error budgets
•⁠ ⁠Custom dashboards, alerting, and automated remediation
•⁠ ⁠Incident management framework and runbooks
•⁠ ⁠Chaos engineering test suite

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 91165686
Position Id: 8996149
Company Info

About Value Spectrum Technologies LLC

Step into a future defined by empowerment at Value Spectrum Technologies. With leading-edge software solutions and strategic consulting, we're dedicated to shaping and elevating your digital tomorrow. Experience the synergy of innovation and collaboration as we unlock unparalleled opportunities for growth in the dynamic landscape of technology. Welcome to empowerment.

Join us in navigating the ever-evolving digital landscape with confidence, as we work together to unlock unprecedented opportunities and build a tomorrow that is truly empowered by the limitless possibilities of technology. Your digital future starts here.

Contact the job poster

Mahesh Goud Uppala

Recruiter @ Value Spectrum Technologies LLC
