Senior Linux Server Specialist

Overview

On Site
45-60/hour
Contract - W2
Contract - 1+ year
No Travel Required

Skills

Unix Administration
Linux Admin
High Availability
Disaster Recovery
HA DR
HA/DR
Ansible Playbooks
Terraform
Cloud Formation
DevOps
Dell EMC PowerMAX
LVM
Bash
Python

Job Details

We have been retained by our client in the NW Houston area to deliver a Senior Linux Server Specialist on a long-term contract or contract to hire basis. This is onsite work with onsite servers, and onsite team collaboration in northwest Houston. This is an excellent company and culture.



We are currently seeking an experienced Senior Linux Server Specialist to join the Linux support team. The ideal candidate will be responsible for troubleshooting, deployment, and operational support of Linux based systems as well as on-going maintenance. As part of the Linux team, the candidate will work closely with business and other IT groups to provide support and technical expertise as needed.



Responsibilities include:




  • Support the operation and maintenance of Linux servers, ensuring operational availability & performance, conducting health checks, managing software upgrades, patching (including testing and implementation), system optimization and administration. HA/DR.

  • Monitor server health and performance to identify issues, bugs, or potential improvements.

  • Strict adherence to change management processes to ensure changes are properly planned, documented, and deployed.

  • Develop, review, and update existing operational documentation (SOPs, application checklists, playbooks, etc) (Ansible Playbooks: for a really simple configuration management and multi-machine deployment system).

  • Collaborate with the Security Operations Center (SOC) team for process optimization, tool tuning & integration, information sharing, playbook development and incident response.

  • Implement automated near real-time monitoring of all tools to ensure proper operation and collection of pertinent data.

  • Incident and Problem Management; including both during and post-incident, along with Root Cause Analysis.

  • Application support, issue management and escalation.

  • Perform incident investigation, diagnosis, and resolution.

  • Perform system monitoring using Grafana and Prometheus.

  • Perform remediation to resolve a security breach, threat, or vulnerability that can potentially harm a computer system, network, or an application. Address the problem or vulnerability by modifying a configuration or by patching or updating the operating system or application.




Qualifications:



The successful candidate will meet the following qualifications:




  • 5-7+ years of experience installing, administering, and maintaining Linux servers (Red Hat Linux based server experience is highly preferred. Experience with Linux servers running Oracle are a plus.)




  • 3-5+ years of experience designing and implementing redundant systems including data backups/recoveries, high availability, load balancing, and disaster recovery. HA/DR.

  • 3-5+ years of experience designing, analyzing, and repairing large-scale distributed systems

  • Experience with deploying and maintaining on-premises Linux servers (AWS hybrid or AWS Private Cloud is preferred.)

  • Experience in application deployment automation, modern DevOps practices, and infrastructure as code (Terraform, or Amazon Cloud Formation are a plus.)

  • Experience with IT automation tools such as Ansible Automation Platform, Ansible Playbooks, (any of these are helpful, are all are not being used Chef, Puppet, Terraform, or Amazon Cloud Formation.)

  • Knowledgeable of core IT infrastructure technologies including virtualization, VMs, virtual machines, networking, and storage management.




  • Technical documentation skills.

  • Comfortable interacting with technical and management teams at various levels in a professional manner.

  • Takes ownership of areas of responsibility and makes recommendations and decisions on the improvement and operation of those areas.

  • High level of organizational skills.

  • Knowledge of and experience with Security Design and Implementation.

  • Ability to participate in a rotating shift that will sometimes includes after-hours technical support.

  • Knowledge of backup and recovery methods and verification.

  • Knowledge of Dell EMC PowerMax storage (i.e. PowerStore 9200T) and Dell EMC Isilon storage, including snapshots (i.e. EMC Isilon SnapshotIQ is a licensed software module that lets you create new snapshots and manage snapshot schedules. Isilon scale-out network-attached storage (NAS) platform combines modular hardware with unified software to harness unstructured data. PowerPath, ) are a nice plus!

  • Excellent written and verbal communications

  • Ability to work in a fast paced, schedule-driven, and customer-oriented environment

  • Experience with Bash, Perl, and Python scripting for automation are a plus!

  • Experience with Logical Volume Management (LVM) including expansion of file systems





Preferred Qualifications:




  • Experience supporting container-based platforms




  • SUSE Manager for patching Linux servers

  • Red Hat Satellite for patching Linux servers

  • Prometheus and Grafana for system performance monitoring





One year renewable contract. Contract to hire is an option being considered.





Employment Type: Contract or Contract to Hire







Hourly Rate: $48.00 56.75 per hour (w-2 employment)

(higher pay possible, if candidate fits well enough)





Location: Houston, Texas - NW Houston





Immigration: s and those authorized to work in the US are encouraged to apply. We are unable to sponsor H1b candidates at this time.



No third parties. No consulting firms. Principals only. No C2C. Candidates only.







Please apply with resume



NW Houston onsite contract opp t y: Senior Linux Server Specialist









Any candidate is encouraged to apply, call or to send a text to: