Systems Engineer
Primary Location: Westlake, TX
Secondary location: Merrimack, NH
Shift: M-F; 8:30 – 5:30 (most of work done during this time frame)
On-call - 1 week every 6-7 weeks
Skills:
- Enterprise production support experience for application and infrastructure
in hybrid environments (on-prem/cloud)
- Observability tools: Splunk, Datadog, etc. used for incident triage and RCA
- Cloud development (Azure or AWS) using Python
- Hands-on experience with container orchestration, preferably with
Kubernetes
- Incident, Change and Problem Management
- ITSM tools such as ServiceNow
Team
The team comes from diverse technical backgrounds, and the responsibilities provide the opportunity for a variety of challenges. Ideal candidates will have a background in either systems engineering or software engineering with a desire to learn and understand the other or previous experience as a systems engineer or SRE. We are looking for a Systems Thinking Leader who has helped teams scale through production insights, operational automation, developer guidance, real-time metrics, and automation
The Role
Our Application Systems Engineering Support Group within Enterprise Infrastructure combines Operations Perfection with the Development Experience to deliver services at high scale, high availability with resilience by using automation and Infrastructure Code. We build reliability into our ecosystem by applying standard processes in Resiliency Engineering, Automation and Observability.
Team
The team comes from diverse technical backgrounds, and the responsibilities provide the opportunity for a variety of challenges. Ideal candidates will have a background in either systems engineering or software engineering with a desire to learn and understand the other or previous experience as a systems engineer or SRE. We are looking for a Systems Thinking Leader who has helped teams scale through production insights, operational automation, developer mentorship, real-time metrics, and automation.
The Expertise You Have
- Bachelor’s degree or higher in a technology related field (e.g. Engineering, Computer Science, etc.) required, Master’s degree a plus
- Proven experience with monitoring and management tools (Splunk, Datadog, Catchpoint, Grafana, AWX/Ansible, etc.) and building automation
- Experience using CI/CD Tools (Jenkins, uDeploy) and the backend code implemented by it
- Experience with building and operating highly resilient platforms in Azure/AWS Cloud environments
- Experience in Cloud development (Azure or AWS) using Python, shell scripting, PowerShell etc. and cloud migration skills a plus
- Hands-on experience with container orchestration, preferably with Kubernetes
- Hands-on experience with Linux and windows operating systems.
- Hands-on experience with container orchestration is a plus.
- Hands-on experience with ITSM tools such as ServiceNow and ITSM process such as Incident, Change and Problem Management.
The Skills You Bring
- Experience managing systems using infrastructure as code tools (AWX/Ansible, Terraform, Rundeck …)
- Proven understanding of Cloud Computing and DevOps concepts including CI/CD pipelines
- Ability to automate in a programming language like Python, Shell Script or PowerShell
- Experience with Hands-on Kubernetes skills and knowledge a plus.
- Hands on experience with one or more observability tools (Datadog, Grafana, Prometheus etc…)
- Proven experience in maintaining scalability and resiliency of complex environment.
- Proficient communication skills with an ability to reach both technical and non-technical audiences
- Quick learner with ability to pick up new tools
- Ability to work with a variety of individuals and groups, both in person and virtually, in a constructive and collaborative manner and build and maintain effective relationships