Site Reliability Engineer

New York, NY, US • Posted 8 hours ago • Updated 8 hours ago
Contract Independent
Contract W2
Contract Corp To Corp
No Travel Required
On-site
Depends on Experience

Job Details

Skills

  • REST APIs
  • SQL
  • AWS
  • Python
  • FIX protocol
  • DNS
  • networking (TCP/IP, routing)
  • FIX logs
  • co-location (colo)

Summary

Our client is looking for a Site Reliability Engineer for a project in NYC, NY (Hybrid). The detailed requirements are below.

Job Title: Site Reliability Engineer

Location: NYC, NY (Hybrid)

Duration: Contract

Candidates should have strong experience in FIX and the Google SRE model.

Job Description:

Required Qualifications:

  • Bachelor's or Master's degree in Computer Science or a related field, with a minimum of 12 years of overall IT experience
  • 5+ years in a senior SRE role or a similar position, demonstrating deep knowledge and expertise in site reliability engineering and operations
  • Knowledge of the FIX protocol and its messages, including the ability to read FIX logs
  • Familiarity with REST APIs and a strong understanding of API integration
  • Proficient in Python and scripting for automation and system management, with a proven track record of developing and implementing automation solutions
  • Expertise in SQL and transactional databases, including querying and troubleshooting
  • Strong analytical and troubleshooting skills with a proven ability to identify and resolve technical issues through root cause analysis
  • In-depth knowledge of core networking concepts, including TCP/IP, routing, and DNS
  • Familiarity with maintaining and troubleshooting systems in both cloud (AWS) and co-location (colo) environments
  • Availability for flexible work hours and willingness to cover US market trading sessions, including L2 on-call coverage
  • Knowledge of change management processes and risk management

What You’ll Do:

  • Support the SRE team in developing and implementing enhancements to support workflows, focusing on automation and efficiency improvements
  • Handle technical escalations, troubleshoot complex FIX and API connectivity issues, and actively participate in on-call rotations during non-traditional hours to ensure rapid response and resolution
  • Adhere to and administer incident and change management policies
  • Coordinate incident resolution efforts and implement change management protocols to maintain and enhance system reliability
  • Work closely with the Lithuania office to ensure smooth operation and alignment of SRE practices across time zones
  • Coordinate incident postmortems and root cause analysis (RCA)
  • Design, implement, and maintain comprehensive monitoring, logging, and tracing solutions (observability stack) to provide deep insights into system performance and user experience
  • Partner with product and engineering teams to define clear Service Level Indicators (SLIs) and Service Level Objectives (SLOs), managing error budgets to ensure service reliability meets business needs

 

 

 

  • Dice Id: 10120137
  • Position Id: 68112-10367-