Site Reliability Engineer - Senior (CPE)

  • San Diego, CA
  • Posted 8 days ago | Updated 7 hours ago

Overview

On Site
Full Time

Skills

Software Management
Amazon Web Services
Cloud Computing
Root Cause Analysis
Collaboration
Performance Analysis
Capacity Management
Process Automation
Scripting
Design Architecture
Scrum

Job Details

Duration: 0-12 month(s)

Description/Comment: Hands-on application management and support for AWS cloud environments, including full-stack diagnosis, fault resolution and root cause analysis.
Proactively monitor production systems and identify issues before they impact service.
Drive and implement monitoring tools, metrics, and reports for tracking application and service performance.
Collaborate with engineering and system teams to drive changes and ensure optimal application performance and resiliency.
Lead service and system performance analysis, service capacity planning, and service continuity validation for multiple applications.
Identify areas for process automation, and develop automated scripts and tools for regular operational activities.
Review and influence design, architecture, standards, and methods for deploying, monitoring and operating services and applications.
Actively participate in and/or commit to the execution of tasks required to meet milestones and deliverables set by the Scrum team throughout the release cycle.
Provide rotational on-call support.

Additional Job Details: Max Bill rate ***/hr. Hybrid schedule; will go onsite as needed. The ideal candidate will be local to San Diego or the surrounding area. Please complete the NCD Pre-Screening form for each candidate and attach it to the resume at time of submittal.