Infrastructure Lead

  • Southlake, TX
  • Posted 14 days ago | Updated 11 days ago

Overview

On Site
$100,000 - $115,000
Full Time

Skills

Linux
Windows
Storage
Active Directory

Job Details

Responsibilities

  • 8+ years of work experience in Infra & environment support.
  • Effectively handle the infrastructure outages & Performance Issues with quick analysis and resolution
  • Extensive hands-on experience in troubleshooting Infrastructure failures (OS, N/W, Storage, File Systems, LDAP, Active Directory and SAML etc.) that can impact the application availability and performance.
  • Manage incidents and effectively communicate with users, application owners and senior stakeholders across all areas.
  • Expertise in implementation of common infrastructure activities (i.e., Patch Mgt., Migrations, Upgrades, Assessments, Certification/password renewals etc.).
  • Coordinate application resiliency exercises within required recovery time objective, perform functional and non-functional validations during and post DR exercises.
  • Actively participate in Change management process with view to manage risk in production environment.
  • Minimize manual involvement by driving solutions, automation and implementing continuous improvements that creates an operating environment, including development & configuration for dynamic monitoring, ing & recovery.
  • Identify and/or analyze patterns of incidents/problem, conduct flawless post-mortems, develop permanent remediation plans, implement automation to prevent future incidents from re-occurring again.
  • Build and improve the SOPs for all the maintenance activities.
  • Challenge existing infrastructure setup, processing and suggest different ways to solve problems or improve stability.

Technical Skills

  • Expertise Linux Server Administration -Primary / Windows Server Administration-Secondary
  • Extensive hands-on experience in troubleshooting Infrastructure failures (OS, N/W, Storage, File Systems, LDAP, Active Directory and SAML etc.) that can impact the application availability and performance.
  • Should have deep expertise in implementation of common infrastructure activities (i.e., Patch Mgt., Migrations, Upgrades, Assessments, Certification/password renewals etc.).
  • Programming: Shell/Python.
  • Troubleshooting of Web Application/Service and Database failures from infra prospective
  • Observability stack AppDynamics, Splunk, 1000Eyes, ITRS or similar Tools.
  • Experience with common networking protocols and services including TCP, UDP, DNS, DHCP, HTTP, SSH, FTP, SNMP, and LDAP
  • Working knowledge with Remedy, JIRA etc.
  • Good To Have:
  • Fair Understanding of CI/CD and DevOps Tools
  • Expertise in Google Cloud Platform.
  • Configuration Management: Salt Stack, Ansible, Puppet, Terraform etc.

Thanks