Site Reliability Engineer (SRE) with ServiceNow & Application Infrastructure - Canada

Overview

On Site
Accepts corp to corp applications
Contract - W2
Contract - Independent
Contract - 19 day((s))

Skills

Automation
Python
ServiceNow
Site Reliability Engineering
Application Infrastructure
observability and monitoring dashboards
capacity management
incident response
Troubleshooting ServiceNow issues
task optimization
technical debt

Job Details

Role: Site Reliability Engineer (SRE), ServiceNow, Application Infrastructure

Location: Montreal, QC, Canada (Onsite from Day 1)

Description:

  • Looking for an intermediate between 2 to 5 years' experience.
  • SRE practices include task optimization and automation, prioritizing technical debt, observability and monitoring dashboards, capacity management, incident response, and problem elimination.
  • Prior experience in the financial services industry is not required, and we welcome candidates from all industries and backgrounds to apply.

Responsibilities include:

  • Delivery of improvements that will maximize the availability and performance of supported systems through optimized and automated operational tasks, collaborating on the development of operational tools, ongoing problem management, and architecture reviews with colleagues.
  • Troubleshooting ServiceNow issues, and also some on-premise capabilities in a Linux environment from time to time, collaborating with others to get to the bottom of issues, and agreeing on lasting improvements that can be made.
  • Exploring and delivering observability including metrics, logging, tracing and alerting that can define and measure the target reliability of a product.
  • Being dependable and responsive during agreed hours, like when part of the on-call rotation with the rest of the global team (with a time-off in lieu system).
  • A commitment to understanding the Firm's ServiceNow instances and related dependencies, contributing to their documentation.
  • Identification and prioritization of technical debt that can impact client satisfaction or operational efficiency.
  • Give feedback on policy and procedures related to the delivery of SRE and operational practices with a view to continually making the Firm safer and more efficient.

Skills required:

  • The ideal candidate would have at least one of: Software development skills in one or more programming language, e.g. Python, ServiceNow administration or development experience.
  • 7+ years of experience.
  • Ability to respond appropriately during occasional technical emergencies, like outages.

Skills desired:
ServiceNow administration or development experience, although this can be acquired by the successful candidate via on the job and via training.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.