Site Reliability Engineer - Charlotte NC, Onsite

Charlotte, NC, US • Posted 1 day ago • Updated 1 day ago
Contract W2
Travel Required
On-site

Job Details

Skills

  • Kubernetes
  • Dashboard
  • Continuous Integration
  • Continuous Delivery
  • Splunk

Summary

Hi,
We have a position that suits your skill set. Please review the job description below and let me know if you are interested.
 
Looking for a candidate on W2.
 
Title: Site Reliability Engineer
Location: Charlotte NC, Onsite
Relevant Experience: 6+ years
Detailed Job Description:
• Proven experience in high-availability, high-transaction environments (preferably payments or financial services).
• Strong background in production resiliency and recovery (recovery execution, runbooks/playbooks, RCA mindset).
• Incident pattern analysis + MTTR baselines (P2 Major/Minor) and recurring failure taxonomy (by rail/service).
• Senior-level observability expertise: dashboards, monitors, and alerts (Datadog preferred; similar tools considered).
• Splunk, Datadog, SQL, JQL (Jira Query Language), GitLab.
• Experience with CI/CD metrics and generating executive reports on code quality, changes, and test automation from GitLab.
• Understanding of story quality, metrics, and monitoring; help gather data to showcase deficiencies.
• Senior CI/CD experience: pipeline design/operation, release safety patterns, and rollback readiness.
• Experience using metrics and monitoring data to identify and communicate deficiencies.
• Automation skills: Python and/or PowerShell (or equivalent) for building repeatable recovery workflows and operational tooling.
• Kubernetes/container platform production troubleshooting (deployments, pods, config drift, safe restarts, and "why did this change break prod" investigations).
• Experience with identity/credentials/certificate & secret-rotation resilience (preventing outages during password rotations, certificate upgrades, and secret propagation; implementing guardrails and monitoring for these events).
• Batch/scheduler/job-execution reliability (detecting/preventing silent job failures, validating multi-DC scenarios, and building controls to ensure scheduled processing does not impact customers).
• Distributed integration failure-handling (timeouts, retries, backpressure, idempotency, duplicate prevention, and reconciliation—especially across vendor/downstream dependencies).
 
Thanks & Regards

Venkatesh Kundurthi

Team Lead || ASCII Group, LLC

Office:
Ext. 104; Direct:
38345 W. 10 Mile Rd, Ste.#365; Farmington, MI  48335

  • Dice Id: 10117479
  • Position Id: 8935175