SRE / Principal Engineer

Remote • Posted 1 day ago • Updated 1 day ago
Contract Corp To Corp
Contract W2
Contract Independent
12 Months
Remote
Depends on Experience

Job Details

Skills

  • Java
  • Financial Services
  • Mortgage
  • PCI DSS
  • SLA

Summary

Role Overview

The SRE / Principal Engineer is the highest technical escalation tier within the NOC. You will be engaged exclusively for architecture-level and code-level issues that cannot be resolved within SLA. You act as the bridge to Engineering for product defects, own complex RCAs and post-incident reviews, and drive platform reliability improvements across all tenants in the shared pool.

Key Responsibilities

Provide architecture-level and code-level diagnosis and remediation for critical incidents.

Serve as the primary liaison to Engineering for confirmed product defects.

Own and deliver complex Root Cause Analysis (RCA) and post-incident review documents.

Drive post-incident improvement actions, including permanent code or configuration fixes.

Review and approve changes to the platform architecture arising from incident learnings.

Set technical standards for runbooks, diagnostic tooling, and monitoring instrumentation.

Advise the Engineering Manager on capacity, resilience, and observability improvements.

Provide cross-tenant knowledge to identify systemic risks that affect multiple clients.

Engage on Sev1 bridge calls as the technical authority when escalation is required.

Required Skills & Qualifications

8+ years of experience in senior platform engineering, SRE, or technical operations roles.

Deep expertise in distributed systems architecture and microservices-based platforms.

Proficiency in at least one JVM language (Java/Kotlin) and Python or Go.

Expert-level debugging: heap dumps, thread dumps, memory profiling, distributed tracing.

Strong understanding of mortgage industry workflows and lending platform architecture.

Experience contributing to or reviewing code in production environments.

Demonstrated ability to produce board-ready RCA and post-incident reports.

Familiarity with security and compliance considerations in financial services (SOC 2, PCI DSS, etc.).

Preferred Skills

Prior hands-on experience with MACER or similar enterprise mortgage orchestration platforms.

Track record of working directly with product engineering teams to resolve systemic defects.

Experience leading SRE or reliability engineering functions.

AWS / Azure certified solutions architect or equivalent credentials.

Key Performance Indicators (KPIs)

Quality and depth of RCA documents assessed by Engineering Manager and CSM.

Time-to-permanent-fix for platform defects bridged to Engineering.

Reduction in repeat Sev1/Sev2 incidents attributable to systemic improvements.

Number of cross-tenant learnings shared and operationalized per quarter.

Stakeholder satisfaction score from CSM post-incident reviews.

  • Dice Id: 90932951
  • Position Id: 8994644
Contact the job poster
Pradip Divekar

Recruiter @ Nitor Infotech