Apply Now

Job for Site reliability engineer ,GA and New York (Onsite )

• Posted 6 days ago • Updated 15 minutes ago

Contract Corp To Corp

Fitment

Dice Job Match Score™

🔢 Crunching numbers...

Job Details

Skills

Production Management
Impact Analysis
Wealth Management
Investment Management
Optimization
Collaboration
SAP WM
SQL
React.js
Cloud Computing
IT Service Management
Java
AngularJS
Spring Framework
IBM DB2
Unix
Scripting
TWS
CA Workload Automation AE
Data Link Layer
Network Layer
Production Support
Debugging
Agile
Communication
Negotiations
Splunk
Continuous Integration
Continuous Delivery
DevOps
Incident Management
Service Level
Capacity Management
Software Engineering
Management
Disaster Recovery
Shell Scripting
Linux
Grafana
Kibana
Problem Solving
Conflict Resolution
Brokerage
BMC Control-M
cron
Computer Science
Information Systems

Summary

Position Overview:

The Wealth Management Production Management Site Reliability Engineer position is a highly visible/critical role, which will be a team member of technical SMEs managing the stability and optimization of the Wealth Management systems. Scope includes but not limited to, the day-to-day support of the organization's technology related outages, collaboration on technology projectsfocused on stability, optimization, business impact analysis, and associated risk-related methodologies. This role will be responsible for overall stability of the Wealth Management Investment Management application platforms, participation on key optimization initiatives, and collaboration with multiple technical teams within . Additionally, partner with WM business units, various levels of management and staff to collect, analyze and make recommendations on optimizing the platform. This position will mainly perform DevOps/SRE role in Java, Unix & SQL technologies technology.

Primary Skills / Must have

Site Reliability Engineer (SRE) in which 80% will be support [React/Protect], 10% will be in Dev Ops[Enable] space.
Proven track record supporting large scale multi-tiered cloud-based applications.
Analyze ITSM activities of the platform and provide feedback loop to development teams on operational gaps or resiliency concerns
Hands on experience with Java, Angular, Spring, DB2, Unix scripting and experienced in scheduler tools such as TWS, autosys
L2-L3 Production Support, Debugging skills, problem solving
Experience working in an Agile Development environment
Proven ability to understand and troubleshoot complex problems under pressure
Excellent communication skills (both written and oral), listening skills, influencing and negotiation skills
Experience with performance troubleshooting and remediation
Experience with observability tools such as Splunk, Kibana, Grafana, Prometheus
Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead in DevOps automation and best practices.

Responsibilities include:

Incident Management -Create and manage necessary process involving incidents
Partner with Ops Control to ensure IT and/or End User communications are handled appropriately
Engage with the development team throughout the life cycle to support Application build for Reliability
Develop software to automate manual operational work
Run, maintain and improve the service against established Service Level Objectives by applying software engineering principles
Responsible for the availability, performance, change (CP) management, monitoring, and capacity management of their services
Troubleshoot priority incidents, conduct blameless post-mortems and ensure permanent closure of the incidents
Analyze patterns of production incidents, develop permanent remediation plans, and implement automation to prevent future incidents from occurring through software engineering
Manage process related functions around large-scale events such as disaster recovery. Communicate closely with impacted groups to ensure all events are properly managed.

Secondary Skills / Desired skills

Having good expertise on Linux and shell scripting. Need to be very comfortable with Linux
Grafana/Kibana dashboarding experience
Good problem-solving skills
Good communicator
Good understanding of brokerage business
Jobs (controlM/CBSS/CRON) experience
Bachelor's/Master's Degree in Computer Science, Information Systems or related field

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 10457702
Position Id: 2026-38986
Posted 6 days ago

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Lead Site Reliability Engineer

Wilmington, Delaware

•

Today

Job Description Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability. As a Lead Site Reliability Engineer at JPMorgan Chase within the Enterprise technology, Corporate technology team , you hold a leadership role in your team, demonstrate strong knowledge across multiple technical domains, and advise others on the technical and business issues facing them. Take lead and con

Full-time

Site Reliability Engineer, Enterprise Technology Services

Sunnyvale, California

•

Today

Imagine what we could do together. At Apple, new ideas have a way of becoming excellent products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish! The people here at Apple don't just build products - they craft the kind of wonder that's revolutionized entire industries. It's the diversity of those people and their ideas that encourages the innovation that runs through everything we do, from amazing technol

Full-time

Site Reliability Engineer - Platform Infrastructure Engineering

Mountain View, California

•

Today

Company Overview ID.me is the next-generation digital identity wallet that simplifies how individuals securely prove their identity online. Consumers can verify their identity with ID.me once and seamlessly login across websites without having to create a new login and verify their identity again. Over 152 million users experience streamlined login and identity verification with ID.me at 20 federal agencies, 45 state government agencies, and 70+ healthcare organizations. More than 600+ consumer

Full-time

USD 168,926.00 - 192,500.00 per year

Principal Site Reliability Engineer

No location provided

•

Today

Job Description As a Principal Site Reliability Engineer, you will play a pivotal role in building and operating the Oracle HealthPatient Portal. In this role, you will design, build, and operate highly reliable, scalable infrastructure that supports Commercial and Federal customers. You will also contribute to the next evolution of cloud operations by advancing automation, observability, and AI-assisted reliability practices. You will work within a globally distributed team to deliver robust

Full-time

USD 99,600.00 - 234,600.00 per year

Search all similar jobs