Site Reliability Engineer (SRE) - Cloud

  • SPARTA, Inc. dba Cobham Analytic Solutions
  • Alexandria, VA
  • Posted 60+ days ago | Updated 4 hours ago

Overview

On Site
Compensation information provided in the description
Full Time

Skills

Value engineering
Technical Support
Satellite
Open source
Customer engineering
Leadership
Network engineering
Design engineering
Migration
Operations
Training
Risk management
Back office
Front office
Change control
Policies
Regulatory Compliance
Specification
Acquisition
Servers
Virtual machines
FOCUS
System security
STIG
Capacity management
Metrics
IT management
IMPACT
IT service management
Provisioning
Tier 3
Product support
Embedded systems
Documentation
Automation
IT operations
Process improvement
Workflow
Integration testing
Computer science
Information Technology
Operating systems
Computer networking
Identity management
Design
Access control
UPS
HVAC
Cabling
Data
Layout
Writing
Staff management
Software engineering
Systems architecture
Root cause analysis
Teamwork
Network
Infrastructure management
Progress Chef
Puppet
Docker
Orchestration
Kubernetes
Linux
Bash
Scripting
IaaS
Microsoft Azure
Google Cloud
Google Cloud Platform
Git
Software development
Web development
Management
Security management
GitLab
Ansible
Terraform
OpenStack
Red Hat Enterprise Linux
Virtualization
Continuous integration
Continuous delivery
Change management
Configuration Management
Content management
Software deployment
C++
Python
Amazon Web Services
Security+
Cloud computing
DoD
Security clearance
Art
Accessibility
AIM
Quest

Job Details

In a world of possibilities, pursue one with endless opportunities. Imagine Next!

When it comes to what you want in your career, if you can imagine it, you can do it at Parsons. Imagine a career working with intelligent, diverse people sharing a common quest. Imagine a workplace where you can be yourself. Where you can thrive. Where you can find your next, right now. We've got what you're looking for.

Job Description:

Parsons/Space Ground System Solutions (SGSS) has an immediate full-time opening for a Site Reliability Engineer (SRE) on our IT Support team located in Alexandria, VA. In this role, you will help continue expansion of satellite ground system software to hybrid and private cloud infrastructure.

You will help manage, support, and facilitate infrastructure operation for developers who are building systems containing government-off-the-shelf, commercial, and open-source software. Your ability to bring a software engineering approach to IT operations, systematically designing, implementing, managing, and automating application and infrastructure security tools, is critical. You will serve as a critical link between the software development team and NRL's sponsors and customers, engineering and delivering operational solutions through automation that ensure customers' and operators' reliability and maintainability needs are met. You will serve as a system stability advocate for customers, bringing back essential insights and proposed solutions to the software team. SRE is a unique role that requires either a background as a software developer with additional operations experience, or as a system administrator or IT ops lead with significant software development skills. SREs draw heavily and broadly on other domains; network engineering, for example, also forms an important part of the daily work.

The successful candidate will:
  • Passionately support the design, engineering, and coordination of legacy environment migrations.
  • Provide a holistic IT Service Delivery view of dynamic provisioning, capacity planning, scheduled maintenance, system/platform performance metrics, change management, and quality of service, promoting first-class automation and the selection of reasonably priced technical options.
  • Develop and configuration-manage automation for deploying, operating, monitoring, and remediating failures and performance issues in systems deployed in on-premises, private cloud, commercial cloud, and hybrid environments for government customers in all phases of integration and operations.
  • Identify opportunities for RCA and develop processes to address gaps and promote quality system usage and management
  • Select, implement, support, and migrate among providers of infrastructure; manage layered infrastructure at the physical and logical levels; provide expertise, automation, documentation, training and support to consumers of infrastructure as integrated with the application stack.
  • Serve as a subject matter resource in automation of cloud cyber risk mitigation in AWS, AWS GovCloud, and classified offerings.
  • Document and automate processes discovered through your back-office engagement with software engineers and front office engagement with users, operators, security engineers, and other customers of SGSS and NRL software.
  • Engage with and selectively manage internal and customer change control boards, policy mandates, and compliance frameworks.

What You'll Be Doing:
  • With a software engineering approach to IT operations, design, implement, manage, and automate Application and Infrastructure security tools (containerized Scanning and VM tools) along with integrations to CI/CD pipelines, automated workflows, script-based integrations, etc.
  • Identify appropriate cloud-based (e.g., AWS GovCloud) infrastructure to meet mission requirements, including the specification, acquisition, configuration, dynamic provisioning, and maintenance of servers
  • Bring structured engineering judgement and software engineering expertise to the tension between standardization and specialization - reliably deploying SGSS and NRL standardized software and infrastructure in specialized mission environments, monitoring and supporting its performance, and aiding the team in improving the overall quality of the delivered capability through monitoring, automation, and process improvement
  • Specify and configure physical and virtual machines with Red Hat Enterprise Linux, with a heavy focus on stable and supported operating systems. This includes proactive and consistent maintenance to ensure systems/platforms (e.g., OpenStack, Kubernetes) are up and available 98% of the time
  • Perform current state analysis of an organization's system security controls and measures against DISA STIG standards, and provide recommendations for enhancement
  • As the system stability advocate, implement configuration management automation (e.g., Ansible) to maintain configuration
  • As the entrusted reliability liaison, assist the development team with requirements verification. The SRE's holistic view should include, but is not limited to, capacity planning, system/platform performance metrics, change management, quality of service, and the promotion of first-class automation and reasonable-cost implementation options
  • As a technical change agent, assist the organization and technical lead(s) in identifying technical problems, perform root cause analysis and corrective actions follow-up, develop managerial summaries and technical steps for implementing software updates, 'fixes' and/or replacements
  • Conduct post-incident reviews. Identify what's working and what's not. Develop new or revised response plans that improve the software development lifecycle, revise documentation, and implement engineering processes that positively impact IT service delivery and build customer confidence after system maintenance and provisioning
  • Fix Support Escalation Issues; serve on the tier 3 support team for integrated product support and proactive response to complex support challenges
  • Develop and maintain Infrastructure-as-Code (IaC) with security embedded using such technologies as Terraform
  • Document, train, and operate a software assurance capability at multiple security levels
  • Document tribal knowledge and integrate it into practical use: documentation, automation, monitoring, and remediation.
  • Support feedback from practical experience to software development, support, IT operations and on-call process improvement
  • Develop workflow in Python or similar scripting languages if or as needed
  • Build Software for Support Team; build and implement services to improve the quality of support team delivery; improve monitoring and alerting internally and at customer sites in integration, test, and ops

What Required Skills You'll Bring:
  • Must have a minimum active DoD Secret security clearance with ability to obtain a TS/SCI
  • Bachelor's Degree in a relevant field (e.g., Computer Science, Software Engineering, Information Technology)
  • Minimum of fifteen (15) years' experience in a relevant professional field
  • 15+ years of experience with core infrastructure capabilities: operating systems, networking, identity management and access control
  • 12-15 years of experience with all design aspects of the data center support systems, to include AC/DC power, UPS, HVAC, carrier infrastructures, internal/external cable plant, and overall data center layout
  • 12+ years of experience demonstrating the ability to communicate clearly, verbally and in writing, to supported staff, management, and government customers
  • Understanding of all layers of software engineering and system architecture
  • Strong understanding of RCA Methodologies
  • Demonstrated history of teamwork and service skills
  • Proficiency in securing systems on the application, network, and infrastructure layers
  • 12+ years of experience with designing solutions in cloud-optimized, private cloud and hybrid environments
  • 12+ years of experience supporting secure, scalable, and elastic applications on distributed architectures
  • Expert understanding of infrastructure management processes and tools like Terraform and AWS CloudFormation
  • Experience with server configuration processes and tools such as Chef, Ansible, or Puppet
  • Expert in creating, deploying, maintaining, and troubleshooting Docker or Podman images and orchestration with Kubernetes
  • Proficiency with Linux, especially RHEL families, and Bash scripting
  • Proficiency in implementing greenfield cloud infrastructure on AWS/Azure/Google Cloud Platform
  • Understanding of CI/CD and related concepts
  • Expert ability to execute advanced git actions like rebasing and squashing
  • Ability to assist other engineers with source code management in git
  • Basic understanding of software development and web application development concepts
  • Ability to discuss technical tasks and team process topics with team members
  • Ability to operate and manage work, strategically reason, and build relationships and influence others

What Desired Skills You Might Bring:
  • Experience serving in security management in a classified IT development program
  • Familiarity with specific tools: GitLab, Ansible, Terraform, OpenStack, AWS GovCloud, Red Hat Enterprise Virtualization
  • Familiarity with CI/CD and tooling for CM, build, deployment, and code quality around C++ and Python
  • AWS Certifications
  • DoD 8570 certification at IAT Level II (CompTIA Security+, Cloud+, CASP+); the candidate will be required to attain and maintain this certification as part of the job

Space Ground Systems Solutions (SGSS), a wholly owned subsidiary under the Parsons Corporation, is passionate about making our nation the undisputed leader in Space because we understand that ensuring our security for future generations depends on it. We have emerged as a leader in the development of cutting-edge solutions for the Department of Defense and Intelligence Community. Our tremendous success can be attributed to our people and our priorities. Do you want to be part of a team that is helping the government solve major national security challenges in the space domain? We need your help.

SGSS believes in taking care of their employees by offering:
  • ALL benefits fully funded, for your entire family (This includes Medical/Dental/Vision/Group Life/STD/LTD - no employee premiums)
  • SGSS funded HSA (Health Savings Account) provided, with SGSS funding the maximum amount allowed by the IRS
  • Retirement Savings Plan (RSP/401k) with a 20% annual company contribution - no employee contribution required

Apply now to work for a company that truly believes YOU are the key to OUR success . . . Come BE the difference.

Minimum Clearance Required to Start:
Secret

This position is part of our Federal Solutions team.

Our Federal Solutions segment delivers resources to our US government customers that ensure the success of missions around the globe. Our diverse, intelligent employees drive the state of the art as they provide services and solutions in the areas of defense, security, intelligence, infrastructure, and environmental. We promote a culture of excellence and close-knit teams that take pride in delivering, protecting, and sustaining our nation's most critical assets, from Earth to cyberspace. Throughout the company, our people are anticipating what's next to deliver the solutions our customers need now.

Salary Range:
$104,200.00 - $182,400.00

Parsons is an equal opportunity employer committed to diversity, equity, inclusion, and accessibility in the workplace. Diversity is ingrained in who we are, how we do business, and is one of our company's core values. Parsons equally employs representation at all job levels for minority, female, disabled, protected veteran and LGBTQ+.

We truly invest in and care about our employees' wellbeing and provide endless growth opportunities. The sky is the limit, so aim for the stars! Imagine next and join the Parsons quest. APPLY TODAY!