Site Reliability Engineer - Charlotte NC, Onsite

Charlotte, NC, US • Posted 1 day ago • Updated 1 day ago

Contract W2

Travel Required

On-site

50+

Job Details

Skills

Kubernetes
Dashboard
Continuous Integration
Continuous Delivery
Splunk

Summary

Hi,

We have a position that matches your skill set. Please review the job description below and let me know if you are interested.

We are looking for a candidate on W2.

Title: Site Reliability Engineer

Location: Charlotte, NC (Onsite)

Relevant Experience: 6+ years

Detailed Job Description:

• Proven experience in high-availability, high-transaction environments (preferably payments or financial services).

• Strong background in production resiliency and recovery (recovery execution, runbooks/playbooks, RCA mindset).

• Incident pattern analysis and MTTR baselines (P2 Major/Minor), plus a taxonomy of recurring failures (by rail/service).

• Senior-level observability expertise: dashboards, monitors, and alerts (Datadog preferred; similar tools considered).

• Tooling: Splunk, Datadog, SQL, JQL (Jira Query Language), GitLab.

• Experience with CI/CD metrics and with generating executive reports on code quality, changes, and test automation from GitLab.

• Understanding of story quality, metrics, and monitoring; able to gather data that highlights deficiencies.

• Senior CI/CD experience: pipeline design/operation, release safety patterns, and rollback readiness.

• Experience using metrics and monitoring data to identify and communicate deficiencies.

• Automation skills: Python and/or PowerShell (or equivalent) for building repeatable recovery workflows and operational tooling.

• Kubernetes/container platform production troubleshooting (deployments, pods, config drift, safe restarts, and "why did this change break prod" investigations).

• Experience with identity/credentials/certificate & secret-rotation resilience (preventing outages during password rotations, certificate upgrades, and secret propagation; implementing guardrails and monitoring for these events).

• Batch/scheduler/job-execution reliability (detecting/preventing silent job failures, validating multi-DC scenarios, and building controls to ensure scheduled processing does not impact customers).

• Distributed integration failure-handling (timeouts, retries, backpressure, idempotency, duplicate prevention, and reconciliation—especially across vendor/downstream dependencies).

Thanks & Regards

Venkatesh Kundurthi

Team Lead || ASCII Group, LLC

Office:

Ext. 104; Direct:

38345 W. 10 Mile Rd, Ste.#365; Farmington, MI 48335

Email:

Website:

Dice Id: 10117479
Position Id: 8935175
