Site Reliability Engineer // Devops

Overview

On Site
Depends on Experience
Contract - W2
Contract - 1 Year(s)

Skills

DevOps
Amazon S3
Amazon RDS
Amazon EC2
Ansible
Apache JMeter
Apache Maven
Apache Subversion
Automated Testing
Bash
Cloud Computing
Cloudant
Python

Job Details

Site Reliability Engineer // Devops
Hope all is well. Below are 10+ Sr SRE Openings we have in Reston, VA for 12+ month extendable contract

Required Skills:
8-10 years overall experience
Hands-On in at least one language - Java (must), Python (3-4 yrs)
Hands-On experience with automated testing tools (JMeter, Junit, Mockito, Postman)
Hands-On experience with a source code management system like GIT or SVN including pull, push, branch, commit and merge functions
Hands-On experience creating, configuring and maintaining cloud-based applications and infrastructure for the rapid development and monitoring of applications and services:

AWS, EC2, Fargate, CloudFormation, RDS, ElasticCache, S3
Experience with Cloud Migrations with reliability and availability as core focus
Experience in implementing the SRE at the team/enterprise level with hands-on implementation of SRE practices and improving the metrics
Hands-On experience with monitoring tools (Splunk, Dynatrace) and dashboard development including development and customization of dashboards
Hands-On experience with the build, deploy, and packaging process and best practices. Familiar using DevOps automation tools (UCD, Jenkins, Maven, SonarQube, Chef, Ansible, Puppet)
Scripting skills for automation (Linux bash and Windows)

General Required Skills:
Ability to diagnose and optimize software code for reliability and resiliency Knowledge of the incident management process and reporting tools (ServiceNow, Jira Service Desk)
Good communication and documentation skills. An SRE must document their work, collect and document tribal knowledge (the good stuff in people s head), and make it accessible to others.
Experience triaging incidents and conducting RCAs (Root Cause Analysis)
Nice to have skills:
Ability to diagnose technical problems, isolate and debug issues, formulate creative solutions, analyze alternative approaches, and implement a timely solution.
Experience providing alternatives and estimates for implementing a fix or automation to improve reliability.
Ability to juggle several different tasks at a time, and able to frequently adjust for new tasks or higher priority tasks.
Experience with a modern RDBMS or NoSQL, like Postgres, MySQL, DB2, Oracle, MongoDB, and Cloudant

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.