Site Reliability Engineer (SRE) @ Canada

Remote • Posted 2 hours ago • Updated 2 hours ago
Contract W2
Remote
$60 - $65/hr
Fitment

Dice Job Match Score™

👾 Reticulating splines...

Job Details

Skills

  • Dynatrace
  • Splunk
  • and Grafana
  • Wireshark
  • hybrid cloud infrastructure (VMware
  • Linux
  • Windows
  • Azure) and container orchestration (Kubernetes
  • Docker).

Summary

Site Reliability Engineer (SRE) Shared SRE Services group

About Us:

Our Shared SRE Services offerings for a leading finance company are pivotal in offering secure, reliable, and high-performance trading solutions to our clients worldwide. In this context to support their SRE strategy, We are looking for a seasoned Site Reliability Engineering (SRE) Engineer to join our team and contribute to the continuous improvement and reliability of our enclave product/service lines.

Position Overview:

We are seeking a Site Reliability Engineer (SRE) to join our dynamic team, focusing on the reliability and performance of different product/service lines. The ideal candidate will bring a deep understanding of SRE principles, including Incident Management, combined with expertise in DevOps practices and software development. This role demands strong technical skills in monitoring and observability tools such as Dynatrace, Splunk, and Grafana, coupled with exceptional Root Cause Analysis and troubleshooting abilities. Specialization in networking, including Cisco, Arista, AVI, and proficiency with network debugging tools like Wireshark, is crucial for success in this position.

Key Responsibilities:

  • Incident Management and Reliability: Lead the incident management process, ensuring high availability and performance of the applications. Develop and implement SRE practices to improve system reliability and resilience.
  • Monitoring and Observability: Utilize Dynatrace, Splunk, and Grafana to monitor system health, detect anomalies, and provide actionable insights for performance optimization.
  • Root Cause Analysis: Conduct thorough root cause analysis of incidents and outages, developing long-term solutions to prevent recurrence.
  • DevOps Practices: Collaborate with development and operations teams to streamline CI/CD pipelines, automate workflows, and implement infrastructure as code (IaC) for efficient service deployment and management.
  • Networking Expertise: Provide expertise in networking technologies (Cisco, Arista, AVI, etc.), ensuring robust network infrastructure design, implementation, and troubleshooting. Utilize tools like Wireshark for in-depth network analysis and debugging.
  • Collaboration and Leadership: Work closely with cross-functional teams to share knowledge, mentor junior engineers, and lead by example in adopting best practices in SRE, DevOps, and networking.
  • Innovation and Continuous Improvement: Stay abreast of industry trends and new technologies, advocating for and implementing innovative solutions to enhance system reliability and performance.

Qualifications:

  • Bachelor s or Master s degree in Computer Science, Information Technology, or related field.
  • 10+ years of experience in an SRE/DevOps role, with a proven track record in managing high-availability systems.
  • Strong expertise in monitoring and observability tools (Dynatrace, Splunk, Grafana).
  • Proficient in network debugging and analysis tools, including Wireshark.
  • Solid understanding of on-prem and hybrid cloud infrastructure (VMware, Linux, Windows, Azure) and container orchestration (Kubernetes, Docker).
  • Certifications in relevant technologies (Dynatrace, Splunk) are a plus.
  • Excellent communication and leadership skills, capable of leading incident response initiatives and collaborating effectively across teams.
  • Excellent problem-solving skills, with the ability to conduct comprehensive root cause analysis and troubleshooting.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 10124157
  • Position Id: 8905174
  • Posted 2 hours ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Remote

Yesterday

Easy Apply

Contract

Depends on Experience

Remote

14d ago

Easy Apply

Contract

70 - 85

Remote

Today

Easy Apply

Full-time

$120000 - $145000

Remote

3d ago

Easy Apply

Contract, Third Party

Depends on Experience

Search all similar jobs