SRE Lead

Jersey City, NJ, US • Posted 1 day ago • Updated 1 day ago
Full Time
On-site
$100,000 - $140,000/yr
Fitment

Dice Job Match Score™

📋 Comparing job requirements...

Job Details

Skills

  • AppDynamic
  • Splunk

Summary

Job Description

Role: Manager SRE

Automation Lead Leading Automation SRE, Responsible to perform end to end Self-Healing automation solution to reduce manual effort/TOIL.

Primary Skill Observability, Telemetry and event co-relation

Secondary Skill Shell Script, Linux, Monitoring tools - Big Panda Splunk, AppD etc.

Automation Engineer:

  1. 15+ years of experience in leading Automation SRE teams.
  2. Advanced working experience with two or more of the following: Unix/Linux, Windows Server, Oracle, MSSQL, MongoDB.
  3. Experience with Python, Java, Curl scripting or any other types of scripting.
  4. Experience with two or more of the following observability tools: AppDynamics, Big Panda, Elastic Search (ELK), Google Cloud Logging, Grafana, Prometheus, Splunk, Thousand Eyes.
  5. Experience with logging, monitoring, and event detection on Cloud or Distributed platforms.
  6. Experience working with one or more of the following: AutoSys, CRON, Windows Scheduler or other logical batch schedulers.
  7. Provides technical direction regarding monitoring and logging to less experienced staff or develops highly complex original solutions. Acts as an Expert technical resource for modeling, simulation and analysis efforts.
  8. Experience creating and modifying technical documentation such as environment flow, functional requirements, nonfunctional requirements.
  9. Outstanding problem solving and analytical skills with ability to turn findings into strategic imperatives.
  10. Technical operations application support experience.
  11. Minimum 4-6 years of hands-on experience into SRE implementation of monitoring system development for application reliability using Splunk, Grafana, App Dynamics, Big panda.
  12. Completely On-Prim environment, so we would require strong candidates on the above skills.
  13. Overall, we are looking for an Automation Engineer, who could reduce the toil issues and enhance the system towards reliability and scalability.

Nature of the Job :

1. Collaborate with Production support team, identify the existing manual activities, and automate.

2. Identify toil area where it can be automated to avoid manual intervention

3. Build Monitoring system and observability platform for more Stack traces and s and Dashboards.

4. Ability to define SLA, SLO and SLI and implement the same for better monitoring

5. Scalability, reliability, and observability are the primary goals for reduction of MTTD and MTTR

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 501494924
  • Position Id: 8927292
  • Posted 1 day ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

New York, New York

2d ago

Full-time

USD 140,000.00 - 225,000.00 per year

Jersey City, New Jersey

Today

Full-time

USD 152,000.00 - 215,000.00 per year

New York, New York

Today

Full-time

USD 160,000.00 - 185,000.00 per year

Jersey City, New Jersey

Today

Full-time

USD 152,000.00 - 215,000.00 per year

Search all similar jobs