System Engineer

Overview

Hybrid
Depends on Experience
Contract - Independent
Contract - W2
Contract - 6 Month(s)
No Travel Required

Skills

ELK Stack
Elasticsearch
Logstash (ingest pipeline)
Kiban
Shell
Python
Selenium
VuGen scripts
SSL certs
encryption
Linux
DB2
SQL
Data dog

Job Details

Vega Consulting Solutions is hiring! Lead Systems Engineer (Datadog, AWS & ServiceNow Integration). This position is hybrid. Candidates will be required to the client site in Wash DC at least one a month.

Job Summary

  • 5-8 years strong IT experience and good working knowledge of a variety of technology platforms in a distributed environment including: Microsoft systems (e.g. Windows Server, Active Directory, Exchange, SharePoint), Linux/Unix, VMWare, SQL Server, database architectures, TCP/IP, VPNs, Mainframe, LAN/WAN technologies and architectures
  • A minimum of 3 years hands-on experience installing, integrating, managing and maintaining monitoring tools like Data Dog administration and support. Or similar Log Management experience with ELK Stack Elasticsearch (search and analytics engine), Logstash (ingest pipeline), and Kibana (visualization and creating dashboards)
    Experience in writing Shell, Python, Selenium, VuGen scripts
    Experience with SSL certs, encryption methods on Linux
    Experience in developing and implementing systems monitoring and alerting strategies in diverse, large-scale environments
    Experience developing and documenting processes, procedures, and policies for tool usage and integration
    Author tool maintenance and training documentation as well as support requests for training on tool usage
    Knowledge and experience with configuring alerts, dashboards and ad-hoc reports
    Strong understanding of service level management (SLAs, SLRs, etc.)
    Determine and document tool backup and recovery procedures
    Experience with data management tools and databases (e.g., DB2, SQL -familiarity desired)
    Experience in systems and Java applications troubleshooting using monitoring tools like DataDog
    Understanding and experience with both waterfall and agile Software Development Life Cycles (SDLC)
    Bachelor of Science in Computer Science or related field (i.e., Engineering, Applied Science, Math, etc.) or equivalent experience.
  • Experience with SAFe agile methodologies

Responsibilities

  • Lead the architecture, design, and implementation of end-to-end monitoring solutions using Datadog, ensuring high availability and performance of cloud-based services.
  • Oversee the deployment and management of AWS resources (EC2, RDS, Lambda, ECS/EKS, S3, etc.), ensuring adherence to best practices for scalability, security, and cost optimization.
  • Define monitoring strategies and best practices, including Datadog dashboards, monitors, alerts, and custom metrics for comprehensive observability.
  • Architect and manage the integration of Datadog with ServiceNow to automate incident management workflows, event correlation, and CMDB synchronization.
  • Provide technical leadership and mentorship to junior engineers on best practices for monitoring, logging, and observability.
  • Collaborate with cross-functional teams to integrate monitoring and logging into CI/CD pipelines and cloud infrastructure.
  • Drive continuous improvement in system reliability, including SLO/SLI definitions, synthetic monitoring, and anomaly detection.
  • Contribute to and enforce Infrastructure as Code (IaC) standards using Terraform, CloudFormation, or similar tools.
  • Participate in high-severity incident management, root cause analysis, and the implementation of corrective actions to prevent future occurrences.

Requirements

  • Bachelor s degree in Computer Science, Information Technology, or a related field (or equivalent experience).
  • 5+ years of experience with AWS cloud services, including deployment, management, and optimization of cloud infrastructure.
  • 3+ years of hands-on experience with Datadog, including complex dashboards, integrations, and custom metrics.
  • 2+ years of experience integrating Datadog with ServiceNow, including incident management workflows, event management, and CMDB integration.
  • Demonstrated experience leading teams or projects in a cloud operations or DevOps environment.
  • Strong proficiency in scripting and automation (Python, Bash, or similar).
  • Solid understanding of networking, security best practices, distributed systems, and troubleshooting complex cloud architectures. Preferred Skills:
  • Experience with Infrastructure as Code (Terraform, CloudFormation).
    AWS certifications (e.g., AWS Certified Solutions Architect, DevOps Engineer).
  • Experience with Kubernetes monitoring and log aggregation solutions (Fluentd, ELK stack).
    Familiarity with other observability tools like Prometheus or Grafana.
  • ServiceNow certifications or experience with ServiceNow ITOM modules (Discovery, Event Management, CMDB).

If you have the required Data Dog Engineering skills and live within a commutable distance to Wash DC, Select Apply Now and a Vega Staffing Specialist will reach out to you

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.