Senior Systems Engineer

Overview

On Site
Full Time

Skills

Application development
Customer experience
Reliability engineering
Systems architecture
Application Support
Effective communication
Risk assessment
SQL
Performance monitoring
Capacity management
New Relic
Document engineering
Generative Artificial Intelligence (AI)
Data Analysis
Servers
Legal
Leadership
Collaboration
FOCUS
Network
Scripting
Software development
Artificial intelligence
Machine Learning (ML)
Recovery
Linux
Unix
Articulate
Onboarding
Data
Management
Training
Python
Bash
Ansible
YAML
Regular expression
Mentorship
Automation
Design
Apache Tomcat
MySQL
Red Hat Enterprise Linux
RabbitMQ
Elasticsearch
Kibana
Nginx
HAProxy
Continuous integration
Continuous delivery
Software deployment
Git
Jenkins
High availability
Replication
Hardening
Regulatory Compliance
Metrics
Splunk
HP
Software performance management
SNMP
Zabbix
ServiceNow
Documentation
Confluence
JIRA
JSON
XML
Testing
POSTMAN
IMPACT
Configuration management database
Reporting
Grafana
Dashboard

Job Details

Location: Temple Terrace, FL
Salary: Negotiable
Description: Our client is currently seeking a Senior Systems Engineer
[ Additional Description ]

Title: Senior Systems & Software Engineer (W2)

Location: Temple Terrace, FL

Position Type: Long Term Contract

DESCRIPTION:

"MUST LEGALLY BE ABLE TO SUPPORT SERVERS THAT HOST GOVERNMENT AND LEGAL ENTITIES.

Our Global Technology Solutions group is looking for a Senior Systems & Software Engineer to lead and collaborate with portfolio teams across all LOB's to support a framework that combines engineering and application development to drive operational stability. The candidate will leverage some of the latest AIOPS technology to develop a holistic approach to enhance systems and application reliability with a focus on superior customer experience.

The ideal candidate will collaborate with the core teams combining software practices and engineering to strengthen the application/system reliability along with operational support. Advanced knowledge of system architecture, network, Centralized Logging (ELK), and operational stability will help transform the way the teams operate today. The candidate will possess advanced scripting and coding capabilities to develop artifacts for alert & event correlation ingested from diverse monitoring sources and leverage AI/ML to automate recovery actions.

Five or more years of experience as a Full stack Linux Systems & Application Support Engineer
Strong Knowledge of Unix/Linux based systems, and experience troubleshooting applications running on these systems
Ability to apply a systematic approach to solve problems with a sense of ownership and focus
Effective communication skills with the ability to articulate technical details to different audience
Strong Experience with application onboarding - capturing requirements, understanding data sources, application relationships, manage meetings, training, etc
Hands on Scripting & Programming in Python, bash, Ansible, YAML, etc.
Understanding of data parsing and regex syntax etc.
Develop new processes to prevent problem recurrence and automated recoveries & Mentor staff to replace manual processes with automation
Contribute to team design discussions with detailed technical information.
Identify strategic/tactical solutions and provides risk assessments and recommendations.
Good knowledge with Tomcat, MySQL/Percona support and SQL queries, RHEL, RabbitMQ, Elasticsearch, Logstash, Kibana, nginx, haproxy
CI/CD - Deployment pipeline experience (GIT, Jenkins, Ansible or equivalent technologies)
Understanding of HA design, cross-site replication, local and global load balancers, etc
Experience with Security Hardening & Vulnerability/Compliance, OS patching
Strong knowledge of performance monitoring, metrics, capacity planning, and management
Familiarity with Splunk, HP OMi/Infrastructure agents, APM/New Relic, Elastic Agent, Catchpoint, syslog events, SNMP events, Zabbix, ServiceNow, etc
Strong skills in creating documentation - engineering runbooks, support procedures, user onboarding and support documentation
Familiarity with Confluence and JIRA, LLM, Gen AI etc.
Data ingestion & enrichment from various sources, webhooks, and REST APIs with JSON/XML payloads & testing POSTMAN etc.
Hands on experience Elastic to monitor and manage critical applications and infrastructure
Influencing other teams and engineering groups in adopting AIOPS best practices.
Troubleshooting problems, involving the appropriate resources and driving resolution of issues with a focus on minimizing impact to our customers.
Understanding of CMDB and asset relationships, topology maps, and alert enrichment

Develop new processes to prevent problem recurrence and automated recoveries

Strong data analytics and centralized reporting (ex. Grafana dashboard integration)

Contact:

This job and many more are available through The Judge Group. Please apply with us today!

About Judge Group, Inc.