DevOps Engineer T3
Austin, TX or Herndon, VA. Hybrid role: onsite attendance is required 3 days a week and may increase depending on project requirements and customer needs.
6-month right-to-hire
Our Cloud Infrastructure Platform Engineering team plays a pivotal role in designing and deploying robust infrastructure solutions to support SAP products and services. We are an automation-first organization, prioritizing the deployment of all our cloud resources via automation to enhance efficiency and consistency. Our engineers collaborate closely with internal product teams and customer-facing units to deliver adaptable and scalable code that meets the requirements of our diverse clientele. Candidates will have the opportunity to work across various cloud providers using a wide range of Infrastructure as Code (IaC) tooling and practices, developing provider-agnostic solutions that ensure seamless functionality across different platforms. Our infrastructure supports a broad client base, including local, state, and federal government agencies, as well as private sector organizations engaged in government-related missions.
RESPONSIBILITIES:
Write, modify, and run Terraform to deploy and maintain infrastructure across multiple cloud service providers. Debug errors that occur during Terraform deployments.
Run Ansible playbooks to perform configuration and maintenance tasks. Develop new Ansible roles and troubleshoot errors as they occur.
Use GitLab across multiple repositories; create and improve CI pipelines that use runners to maintain infrastructure.
Configure, build, and deploy containerized services using Docker and/or Kubernetes.
Develop standard operating procedures, knowledge base articles, technical bulletins, and other documents in support of the infrastructure.
Operate in a security-first mindset, performing all other responsibilities with security in mind.
Implement monitoring, configuration management, and logging capabilities to manage a multi-tenant cloud infrastructure across multiple cloud service providers.
Use generative AI tools, where practical and within security boundary guidelines, to increase efficiency and speed, improve accuracy and consistency, enhance security, and better manage resources.
KNOWLEDGE AND SKILLS:
Knowledge of AWS foundational technologies (EC2, S3, IAM, Route53, VPC).
Knowledge of Azure foundational technologies (Virtual Networks, Application Gateway, Storage Account, Virtual Machines, Load Balancer, Resource Groups).
Knowledge of Google Cloud Platform foundational technologies (Projects, Compute Engine, GKE, Cloud Storage, VPC).
Proficient in using Terraform to build modules for deploying infrastructure in AWS, Azure, and/or Google Cloud Platform.
Proficient in CI/CD technologies including pipeline design and implementation using tools such as GitLab runners and ConcourseCI.
Working knowledge of HashiCorp Vault for secret management, including secure storage and retrieval of credentials, API keys, and certificates.
Experience with Ansible for automation, configuration management, and playbook development.
Proficient in a scripting or programming language for automation (e.g., Bash, Python).
Experience with RBAC design and Single Sign-On (SSO) for identity management.
Investigative skills for reviewing infrastructure and configurations in detail.
Experience with logging and log processing in a cloud environment.
Working knowledge of networking: IP routing, subnetting, network troubleshooting.
Strong Linux OS skills (deployment, configuration, and troubleshooting).
Knowledge of ITIL-compliant ticket management using ServiceNow, including incident resolution, change requests, and sprint planning.
MINIMUM QUALIFICATIONS:
Bachelor's degree in Computer Science or equivalent practical experience.
5-8+ years of experience with the following technologies: Terraform, Ansible, CI/CD, Secrets Management, Identity Management (SSO).
Experience with Linux operating system internals and administration (e.g., filesystems, inodes, system calls, hardening) and networking (e.g., TCP/IP, routing, DNS, network topologies, SDN).
PREFERRED QUALIFICATIONS:
Expertise in designing, analyzing and troubleshooting large-scale distributed systems.
Ability to debug and optimize code and automate routine tasks.
Systematic problem-solving approach coupled with strong communication skills and a sense of ownership and drive.
Hands-on experience developing infrastructure as code (IaC).