Senior Systems Engineer

Overview

Remote
$120,000 - $130,000
Full Time
No Travel Required

Skills

Continuous Delivery
Continuous Improvement
Amazon S3
Attention To Detail
Database
Microsoft Azure
System Monitoring

Job Details

We are currently seeking a Senior Systems Engineer to join our VA team and provide high-level, day-to-day operational support of complex cloud application systems. The engineer develops solutions to routine technical problems of limited scope and follows standard practices and procedures in analyzing situations or data from which answers can be readily obtained. This is a 100% remote role, Monday - Friday, 8 AM to 5 PM EST.

Essential Functions:

  • Detect, isolate, document, rapidly report, and resolve system outages or problems encountered during operation of the scientific workstations, including collecting diagnostic data, restoring system operation, developing workarounds, and performing other activities necessary for system recovery.
  • Accurately document problems in logging and discrepancy reporting tools.
  • Work directly with the customer in most aspects of the day-to-day activities.
  • Respond to user calls regarding hardware and software problems, correcting them or ensuring that they are escalated when required. Communicate the status of key problems to users and senior management.
  • Perform hands-on repair of equipment and maintenance/installation of computing infrastructure.
  • Maintain and troubleshoot all hardware associated with end-user computing, including printers, workstations, routers, switches, etc.
  • Implement continuous improvement methodology through the use of IT systems or procedures.
  • Maintain inventory of system assets.
  • Ensure compliance with VA standards and security policies.
  • Provide documentation, training, and additional duties as assigned.

Education:

  • Bachelor's degree and five (5) or more years of experience; Master's degree and three (3) or more years of experience; or PhD and 0 years of related experience.
  • Degree in Computer Science, Information Technology, Systems Engineering, or a related field.
  • Advanced certifications in relevant areas (e.g., Red Hat Certified Engineer, Microsoft Certified Systems Engineer, AWS Certified Solutions Architect) are preferred.

Required Experience:

  • Experience with AWS and/or Azure cloud platforms, including S3, Step Functions, Batch Jobs, and CloudWatch.
  • Minimum of 3 years of SQL query and monitoring experience.
  • Minimum of 3-5 years of experience in systems engineering or a related field, particularly in production environments.
  • Proven track record of managing and maintaining large-scale production systems.
  • Strong proficiency in Linux/Unix and Windows operating systems.
  • Experience with system administration tasks including user management, permissions, and system monitoring.
  • Proficiency in scripting languages such as Python, Bash, or PowerShell for automation and configuration management.
  • Experience with automation tools like Ansible, Puppet, Chef, or SaltStack.
  • Knowledge of cloud-native technologies and infrastructure-as-code (IaC) tools such as CloudFormation.
  • Experience with monitoring tools like Dynatrace, ScienceLogic, and CloudWatch.
  • Ability to troubleshoot and optimize system performance and reliability.
  • Understanding of network protocols, firewall configurations, and VPN setup.
  • Experience with network monitoring and diagnostic tools.
  • Knowledge of security best practices for production systems.
  • Experience implementing security measures, conducting audits, and ensuring compliance with industry standards.
  • Working knowledge of database systems such as AWS RDS SQL Server and Oracle RDBMS/RDS.
  • Experience with database performance tuning and backup/recovery processes.
  • Experience with continuous integration/continuous deployment (CI/CD) pipelines.
  • Familiarity with version control systems like Git and CI/CD tools like GitHub Actions, AWS CodeBuild and CodeDeploy.
  • Strong analytical and problem-solving skills with the ability to troubleshoot complex system issues.
  • Excellent communication and interpersonal skills, with the ability to work effectively in cross-functional teams.
  • Ability to prioritize tasks, manage time effectively, and meet deadlines in a high-pressure environment.
  • Strong attention to detail to ensure system stability and data integrity.
  • Ability to quickly adapt to new technologies and processes.
  • Willingness to be on-call for production system support as required.
  • Strong documentation skills for maintaining system configurations, processes, and procedures.
  • Ability to manage and contribute to multiple projects, ensuring timely completion and quality results.
  • Ability to work collaboratively with development, operations, and security teams to ensure seamless production system operations.

Clearance Requirement: Candidates are required to obtain and maintain a public trust clearance.

DPCG is an Equal Opportunity Employer committed to hiring and developing the most qualified individuals based on merit, experience, and business needs, without regard to any protected status under applicable law.
