Devops Linux Systems

Overview

On Site
Depends on Experience
Contract - W2
Contract - Independent
Contract - 30 Month(s)

Skills

Unix/Linux
Python
bash
Ansible
YAML
Tomcat
MySQL/Percona
SQL queries
RHEL
RabbitMQ
Elasticsearch
Logstash
Kibana
nginx
haproxy
GIT
Jenkins
metrics
capacity planning
and management
Confluence
JIRA
LLM
Gen AI
webhooks
and REST APIs with JSON/XML payloads & testing POSTMAN

Job Details

Job Title: Devops Linux Systems

Work Location: Temple Terrace, FL (ONSITE)

Duration: Long Term Contract

 

Job Description:

The ideal candidate will collaborate with the core teams combining software practices and engineering to strengthen the application/system reliability along with operational support. Advanced knowledge of system architecture, network, Centralized Logging (ELK), and operational stability will help transform the way the teams operate today. The candidate will possess advanced scripting and coding capabilities to develop artifacts for alert & event correlation ingested from diverse monitoring sources and leverage AI/ML to automate recovery actions.

Five or more years of experience as a Full stack Linux Systems & Application Support Engineer
• Strong Knowledge of Unix/Linux based systems, and experience troubleshooting applications running on these systems
• Ability to apply a systematic approach to solve problems with a sense of ownership and focus
• Effective communication skills with the ability to articulate technical details to different audience
• Strong Experience with application onboarding - capturing requirements, understanding data sources, application relationships, manage meetings, training, etc
• Hands on Scripting & Programming in Python, bash, Ansible, YAML, etc.
• Understanding of data parsing and regex syntax etc.
• Develop new processes to prevent problem recurrence and automated recoveries & Mentor staff to replace manual processes with automation
• Contribute to team design discussions with detailed technical information.
• Identify strategic/tactical solutions and provides risk assessments and recommendations.
• Good knowledge with Tomcat, MySQL/Percona support and SQL queries, RHEL, RabbitMQ, Elasticsearch, Logstash, Kibana, nginx, haproxy
• CI/CD - Deployment pipeline experience (GIT, Jenkins, Ansible or equivalent technologies)
• Understanding of HA design, cross-site replication, local and global load balancers, etc
• Experience with Security Hardening & Vulnerability/Compliance, OS patching
• Strong knowledge of performance monitoring, metrics, capacity planning, and management
• Familiarity with Splunk, HP OMi/Infrastructure agents, APM/New Relic, Elastic Agent, Catchpoint, syslog events, SNMP events, Zabbix, ServiceNow, etc
• Strong skills in creating documentation - engineering runbooks, support procedures, user onboarding and support documentation
• Familiarity with Confluence and JIRA, LLM, Gen AI etc.
• Data ingestion & enrichment from various sources, webhooks, and REST APIs with JSON/XML payloads & testing POSTMAN etc.
• Hands on experience Elastic to monitor and manage critical applications and infrastructure
• Influencing other teams and engineering groups in adopting AIOPS best practices.
• Troubleshooting problems, involving the appropriate resources and driving resolution of issues with a focus on minimizing impact to our customers.
• Understanding of CMDB and asset relationships, topology maps, and alert enrichment
• Develop new processes to prevent problem recurrence and automated recoveries
• Strong data analytics and centralized reporting (ex. Grafana dashboard integration)

 

 

--
Thanks,
Rajkumar,
Ph: x 105