Junior Site Reliability Engineer (Onsite - Seattle, WA)

  • Seattle, WA
  • Posted 6 hours ago | Updated 6 hours ago

Overview

On Site
USD 88,500.00 - 162,500.00 per year
Full Time

Skills

Preventive Maintenance
Project Management
Performance Management
Real-time
Dashboard
Recovery
Service Level
Optimization
Continuous Integration
Continuous Delivery
Workflow
Root Cause Analysis
Software Engineering
Scalability
Computer Science
Reliability Engineering
Incident Management
New Relic
Grafana
Splunk
Elasticsearch
Scripting
Python
Bash
Java
Cloud Computing
Amazon Web Services
Google Cloud Platform
Microsoft Azure
Orchestration
Docker
Kubernetes
Analytical Skill
Communication
Reporting
Documentation
Collaboration
Attention To Detail
Value Engineering

Job Details

Job Description

Nordstrom Technology is committed to delivering reliable and scalable systems that power critical services for our customers. We are seeking a motivated and detail-oriented Junior Site Reliability Engineer 1 (SRE) to join our team with a strong focus on proactive monitoring, incident response, and root cause analysis. This role is ideal for someone who is passionate about system stability and performance and who enjoys digging into technical detail to understand and resolve incidents when they occur.

As a Junior SRE, you will play a key role in maintaining "eyes on glass" monitoring to detect and respond to system anomalies, ensuring the health and reliability of our services. You will also collaborate with teams to address root causes of incidents and continuously improve observability and reliability processes.

This role is considered onsite. Candidates must be willing to work in office 5 days/week at Nordstrom's headquarters in Seattle, WA. The scheduled shift for this role will be 3pm-11pm PST.

Day in the life...
  • Monitor critical systems: Maintain real-time "eyes on glass" monitoring dashboards to proactively identify and respond to anomalies in system performance and availability.
  • Incident response: Participate in on-call rotations to respond to incidents, troubleshoot issues, mitigate outages, and restore service as quickly as possible.
  • Root cause analysis: Dive deep into technical investigations to identify the underlying causes of incidents, documenting findings and working with teams to prevent recurrence.
  • Observability enhancement: Collaborate with teams to refine monitoring, logging, and alerting systems to provide actionable insights and reduce time to detection and time to resolution.
  • Automation: Write and maintain scripts to automate routine operational tasks, incident remediation, and reporting.
  • SLOs and SLIs: Support the definition and tracking of Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to measure and improve system reliability.
  • System optimization: Assist in improving CI/CD pipelines and workflows to ensure seamless deployments and minimize downtime.
  • Documentation: Create and maintain detailed documentation for monitoring configurations, incident handling procedures, and root cause analysis findings.
  • Collaboration: Work closely with software engineering and infrastructure teams to improve fault tolerance, scalability, and operational readiness.

You own this if you have...
  • Bachelor's degree in computer science, engineering, or a related field, or equivalent practical experience.
  • Foundational understanding of site reliability engineering principles, with a strong focus on monitoring, alerting, and incident management.
  • Exposure to observability tools such as Prometheus, Datadog, New Relic, or Grafana, and logging platforms like Splunk or Elasticsearch.
  • Basic proficiency in one or more programming or scripting languages (e.g., Python, Go, Bash, or Java) to assist with automation and troubleshooting.
  • Familiarity with cloud platforms (AWS, Google Cloud Platform, or Azure) and their services.
  • Understanding of containerization and orchestration technologies like Docker and Kubernetes.
  • Strong analytical skills with the ability to dive deep into technical issues to identify and resolve root causes.
  • Excellent communication skills for incident reporting, documentation, and collaboration with cross-functional teams.
  • A proactive mindset and attention to detail, with a willingness to learn and grow in a fast-paced, collaborative environment.

We've got you covered...

Our employees are our most important asset and that's reflected in our benefits. Nordstrom is proud to offer a variety of benefits to support employees and their families, including:
  • Medical/Vision, Dental, Retirement and Paid Time Away
  • Life Insurance and Disability
  • Merchandise Discount and EAP Resources

A few more important points...

The job posting highlights the most critical responsibilities and requirements of the job. It's not all-inclusive. There may be additional duties, responsibilities and qualifications for this job.

Nordstrom will consider qualified applicants with criminal histories in a manner consistent with all legal requirements.

Applicants with disabilities who require assistance or accommodation should contact the nearest Nordstrom location, which can be identified at

© 2022 Nordstrom, Inc.

Current Nordstrom employees: To apply, log into Workday, click the Careers button and then click Find Jobs.

Pay Range Details

The pay range(s) below are provided in compliance with state specific laws. Pay ranges may be different in other locations.

Washington: $88,500 - $162,500 Annually

This position may be eligible for performance-based incentives/bonuses. Benefits include 401k, medical/vision/dental/life/disability insurance options, PTO accruals, Holidays, and more. Eligibility requirements may apply based on location, job level, classification, and length of employment. Learn more in the Nordstrom Benefits Overview by copying and pasting the following URL into your browser: _Overview_15_Full_Time_ES-US.pdf