Lead SRE Engineer

• Posted 1 day ago • Updated 7 hours ago
Full Time
USD $150,000.00 - 224,000.00 per year
Fitment

Dice Job Match Score™

⭐ Evaluating experience...

Job Details

Skills

  • Motivation
  • FOCUS
  • Cloud Computing
  • Systems Design
  • Training
  • Service Delivery
  • Product Management
  • Management
  • Workflow
  • IaaS
  • IT Management
  • Coaching
  • Optimization
  • Software Development
  • Operational Efficiency
  • Onboarding
  • Reliability Engineering
  • SQL
  • NoSQL
  • Database
  • Orchestration
  • Microsoft Azure
  • Windows PowerShell
  • Software Architecture
  • Design Patterns
  • Kubernetes
  • System Monitoring
  • JIRA
  • New Relic
  • Jenkins
  • Tableau
  • Root Cause Analysis
  • Agile
  • Continuous Integration
  • Continuous Delivery
  • English
  • Analytical Skill
  • Attention To Detail
  • Cross-functional Team
  • Soft Skills
  • Decision-making
  • Communication
  • Teamwork
  • Collaboration
  • DevOps
  • Recruiting
  • Innovation
  • Leadership
  • Conflict Resolution
  • Problem Solving
  • Process Improvement
  • Project Management
  • Quality Assurance
  • Risk Management

Summary

Posting Type

Hybrid

Job Overview
Lead Site Reliability Engineer (SRE) is responsible for driving customer confidence by assuring the quality of Relativity's current and future software products and to contribute to a motivating environment that empowers teams and individuals to engineer performant and reliable software through SRE best practices and principles. At it's core the Lead Site Reliability Engineer is responsible for the system availability and the reliability of key platform services and applications, with central focus on cross-service reliability of RelativityOne.

Job Description and Requirements

Role Responsibilities:

  • Proactively monitoring and actioning found items related to baseline performance and reliability of RelativityOne's core cloud product.

  • Providing feedback to Engineering teams regarding areas of the software that require increased reliability monitoring and alerting when gaps are identified.

  • Plan and Coordinate mid-size software projects that provide SRE enhanced tools and systems to improve SRE operational challenges.

  • Develop software and tools with mastery of Agile software development best practices.

  • Exhibit technical leadership through system design and implementation, in order to minimize assessed risk.

  • Lead and organize team members through onboarding and coaching activities.

  • Work with SRE leadership in order to identify skills and training opportunities for other SRE team members

  • Represent SRE team goals and progress on key results, through engagement with key stakeholders within engineering, service delivery, and product verticals.

  • Coordinates with product management, management, and team members on projects

  • Lead customer consultation engagement processes related to large and complex workspace challenges and workflows that help drive the way we scale and monitor within RelativityOne's core products.

  • Support Client Services by offering expertise to customers on performance related incidents and requests, including internal SRE escalations for our most complex customer challenges.

  • Support Problem and Incident Managers by providing information regarding trends of recurring issues within the application and cloud infrastructure.

  • Planning, developing, and delivering tools and systems to improve SRE operational challenges through Agile software development best practices.

  • Takes Technical leadership and coaching of SRE Team members.

  • Lead the team in application of a blameless postmortem culture; responsible for any post-incident actions that involve the development or optimization of any part of the software development lifecycle or the incident lifecycle.

  • Proactively Identify areas for improvement, lead post-incident reviews, and driving initiatives to enhance system reliability, performance, and operational efficiency.

  • Documenting the team knowledge, continuously improving processes and infrastructure to enhance platform reliability and performance.

  • Active participation in the recruitment and onboarding process of new team members.

  • Demonstrating consistent commitment to company core values.

  • Performing additional duties as assigned.

Preferred qualifications:

  • 7+ years of experience in Site Reliability Engineering or DevOps.

  • Experience with other tools such as SQL and NoSQL databases and orchestration services.

  • Experience in dealing with MS Azure (AS-900, AS-104, PowerShell).

  • Good knowledge of a software architecture and design patterns (Kubernetes).

  • Experience in a system monitoring and alerting.

  • Experience in using JIRA, New Relic, Jenkins, Tableau.

  • Experience with Relativity Server or RelativityOne software product (RCA Preferred)

  • Good knowledge of agile methodologies and a rapid development cycle.

  • Experience with DevOps practices, including CI/CD.

  • Very good knowledge of English (spoken & written).

  • Excellent problem-solving and communication skills.

  • Excellent analytical skills.

  • Meticulous attention to detail.

  • An eagerness to learn, explore, and introduce new technologies.

  • Ability to work independently and efficiently under pressure, drive projects to completion and meet deadlines.

  • Ability to work in a fast-paced and dynamic environment.

  • Ability to work as a team player in a cross-functional team to develop practical solutions and ensure positive user experiences.

Soft skills:

  • Strong problem-solving and decision-making skills.

  • Willingness to take ownership and drive topics end-to-end.

  • Personal initiative, commitment, perseverance, and resilience.

  • Well-developed communication and teamwork skills.

  • An innovative approach and a well-founded way of working.

  • Aspiration for DevOps principles and SRE engineering excellence

  • Drive to empower your colleagues.

Relativity is committed to competitive, fair, and equitable compensation practices.

This position is eligible for total compensation which includes a competitive base salary, an annual performance bonus, and long-term incentives.

The expected salary range for this role is between following values:
$150,000 and $224,000

The final offered salary will be based on several factors, including but not limited to the candidate's depth of experience, skill set, qualifications, and internal pay equity. Hiring at the top end of the range would not be typical, to allow for future meaningful salary growth in this position.

Required Skills:
Documentations, Innovation, Leadership, Problem Solving, Process Improvements, Project Management, Quality Assurance (QA), Risk Management, Technical Knowledge, Troubleshooting
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10286858
  • Position Id: 1180df03dc1cf88ddde35e99b667823e
  • Posted 1 day ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Hampden, Massachusetts

Today

Full-time

USD 160,000.00 - 185,000.00 per year

Remote

Today

Full-time

St. Petersburg, Florida

Today

Full-time

Columbus, Ohio

Today

Full-time

Search all similar jobs