Job for Site Reliability Engineer, GA and New York (Onsite)

• Posted 6 days ago • Updated 15 minutes ago
Contract Corp To Corp
Fitment

Job Details

Skills

  • Production Management
  • Impact Analysis
  • Wealth Management
  • Investment Management
  • Optimization
  • Collaboration
  • SAP WM
  • SQL
  • React.js
  • Cloud Computing
  • IT Service Management
  • Java
  • AngularJS
  • Spring Framework
  • IBM DB2
  • Unix
  • Scripting
  • TWS
  • CA Workload Automation AE
  • Data Link Layer
  • Network Layer
  • Production Support
  • Debugging
  • Agile
  • Communication
  • Negotiations
  • Splunk
  • Continuous Integration
  • Continuous Delivery
  • DevOps
  • Incident Management
  • Service Level
  • Capacity Management
  • Software Engineering
  • Management
  • Disaster Recovery
  • Shell Scripting
  • Linux
  • Grafana
  • Kibana
  • Problem Solving
  • Conflict Resolution
  • Brokerage
  • BMC Control-M
  • cron
  • Computer Science
  • Information Systems

Summary

Position Overview:

The Wealth Management Production Management Site Reliability Engineer position is a highly visible, critical role on a team of technical SMEs who manage the stability and optimization of the Wealth Management systems. Scope includes, but is not limited to, day-to-day support of the organization's technology-related outages and collaboration on technology projects focused on stability, optimization, business impact analysis, and associated risk-related methodologies. This role is responsible for the overall stability of the Wealth Management Investment Management application platforms, participation in key optimization initiatives, and collaboration with multiple technical teams within the organization. The engineer will also partner with WM business units and various levels of management and staff to collect and analyze information and make recommendations on optimizing the platform. This position will mainly perform a DevOps/SRE role using Java, Unix, and SQL technologies.

Primary Skills / Must have

  • Site Reliability Engineer (SRE) role in which 80% will be support [React/Protect] and 10% will be in the DevOps [Enable] space.
  • Proven track record supporting large scale multi-tiered cloud-based applications.
  • Analyze ITSM activities of the platform and provide a feedback loop to development teams on operational gaps or resiliency concerns
  • Hands-on experience with Java, Angular, Spring, DB2, and Unix scripting, plus experience with scheduler tools such as TWS and AutoSys
  • L2-L3 production support, debugging, and problem-solving skills
  • Experience working in an Agile Development environment
  • Proven ability to understand and troubleshoot complex problems under pressure
  • Excellent communication skills (both written and oral), listening skills, influencing and negotiation skills
  • Experience with performance troubleshooting and remediation
  • Experience with observability tools such as Splunk, Kibana, Grafana, Prometheus
  • Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead in DevOps automation and best practices.

Responsibilities include:

  • Incident Management: create and manage the necessary processes involving incidents
  • Partner with Ops Control to ensure IT and/or End User communications are handled appropriately
  • Engage with the development team throughout the life cycle to support building applications for reliability
  • Develop software to automate manual operational work
  • Run, maintain and improve the service against established Service Level Objectives by applying software engineering principles
  • Own the availability, performance, change (CP) management, monitoring, and capacity management of the supported services
  • Troubleshoot priority incidents, conduct blameless post-mortems and ensure permanent closure of the incidents
  • Analyze patterns of production incidents, develop permanent remediation plans, and implement automation to prevent future incidents from occurring through software engineering
  • Manage process-related functions around large-scale events such as disaster recovery. Communicate closely with impacted groups to ensure all events are properly managed.

Secondary Skills / Desired skills

  • Strong expertise in Linux and shell scripting; must be very comfortable with Linux
  • Grafana/Kibana dashboarding experience
  • Good problem-solving skills
  • Good communication skills
  • Good understanding of the brokerage business
  • Job scheduling (Control-M/CBSS/cron) experience
  • Bachelor's/Master's Degree in Computer Science, Information Systems or related field
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10457702
  • Position Id: 2026-38986