Overview
Skills
Job Details
Job Title: Business Systems Analyst
Location: Scarborough, ON (Hybrid)
Duration/Term: Long-Term Contract
Job Summary:
We re seeking a Business Systems Analyst to support our Site Reliability Engineering (SRE) team, focused on translating operational needs into actionable solutions that enhance system reliability and observability. This role bridges the gap between business and technical teams, ensuring alignment on incident management, process automation, compliance, and tool integration across cloud environments such as AWS, Google Cloud Platform, or Azure.
Key Responsibilities:
- Collaborate with SREs, DevOps, infrastructure teams, and product owners to gather and document performance and reliability requirements.
- Translate business goals into technical requirements for observability, incident response, and SLA/SLO/SLI definitions.
- Identify repetitive operational tasks and support automation efforts using tools like Ansible, Jenkins, or custom scripts.
- Track and report on reliability metrics (e.g., uptime, MTTR, MTBF, error budgets) through dashboards using tools like Grafana, Datadog, or Splunk.
- Participate in post-incident reviews, documenting root causes and coordinating corrective actions.
- Support integration of monitoring, alerting, and ITSM tools (e.g., PagerDuty, Prometheus, ServiceNow).
- Maintain compliance with internal risk and security standards, supporting audits and governance reviews.
- Act as a liaison between the SRE team and business units, ensuring clarity around reliability objectives.
- Contribute to change advisory board (CAB) discussions, providing impact assessments for production changes.
Required Skills & Experience:
- Proven experience as a Business Systems Analyst in an SRE, DevOps, or cloud-native environment.
- Working knowledge of SLIs, SLOs, error budgets, and site reliability principles.
- Familiarity with monitoring/observability tools, ITSM platforms, and incident management workflows.
- Experience documenting requirements, workflows, reports, and reliability dashboards.
- Understanding of infrastructure and operations in AWS, Azure, or Google Cloud Platform.
- Strong collaboration, documentation, and communication skills across technical and non-technical teams.
- Experience with Agile/Scrum methodologies is highly desirable.
Key Skills: SRE Concepts (SLI/SLO/Error Budgets), Observability Tools (Grafana, Splunk, Datadog), Cloud (AWS/Google Cloud Platform/Azure), Process Automation (Ansible, Jenkins), Incident Management, Change Management, CI/CD Tool Integration, Compliance & Governance
VDart Group, a global leader in technology, product, and talent management, empowers businesses with comprehensive solutions through our four distinct, industry-leading business units With a diverse team of over 4,000 professionals across 13 countries, we deliver strong results across various industries, including Fortune 500 companies
Committed to "People, Purpose, Planet," we prioritize social responsibility and sustainability, as evidenced by our EcoVadis Bronze Medal Certification and participation in the UN Global Compact
Our dedication to delivering strong results has earned us recognition as a trusted advisor for businesses seeking to drive innovation and growth, including many Fortune 500 companies Join our network! Partner with VDart Group to leverage our global network, industry expertise, and proven track record with a diverse clientele