Infrastructure Operations Lead
Onsite in Oak Ridge, Tennessee
U.S. Citizenship Required
We are hiring a highly skilled Infrastructure Operations Lead to join our team! Working onsite in Oak Ridge, Tennessee and U.S. Citizenship are required.
You will be responsible for leading the IT Infrastructure Operations contractor team that provides operations and maintenance for the network and systems. The Infrastructure Operations Lead is responsible for maintaining and operating the hybrid infrastructure, which combines on-premises and cloud-based infrastructure. Infrastructure operations shall support and maintain the Local Area Network (LAN), Wide Area Network (WAN), Wireless Local Area Network (WLAN), and data center infrastructure to ensure an effective and efficient IT environment for its customers.
Job Responsibilities:
- Maintaining and operating applications, systems, and networks.
- Support the configuration management and change management process.
- Implementing approved changes and reporting progress and status.
- Providing patch management and updates to remove vulnerabilities from infrastructure and end points.
- Provide Network Administration, including the LAN, WAN, and WLAN.
- Provide System Administration for systems and applications.
- Perform Operations and Maintenance for the data centers.
- Provide technical expertise to operate, back up, monitor, and maintain the data centers, both on-premises and cloud-based, including physical and virtual platforms, database servers, storage area networks, print servers, and other data center capabilities.
- Monitor Infrastructure availability and performance.
- Provide Non-Core Hours Operations Support for operations and cybersecurity activities that require a response to critical cybersecurity incidents, network outages, major service disruptions, and other unplanned emergency events.
- Provides designs and estimates to clients to support procurements.
- Provides timely and relevant communication to leadership, staff, other teams, and customers regarding information issues, problems and concerns.
- Lead special projects as assigned and report progress to leadership.
- Support and lead process and technology improvement initiatives.
- Manages staff, including schedules, priorities, performance, and expectations.
- Accountable for meeting Service Level Agreements and Key Performance Indicators.
- Reviews reports and monitors dashboards for performance and support needs.
- Report status to leadership, staff, and teams.
- Develop and review documentation in support of changes and procedures.
- Responsible for the day-to-day operation of the team, providing overall guidance and supervision.
- Escalation point for customer problems and questions.
- Coordinate with other customer entities as well as internal/external service providers as necessary to prevent, correct, or detect network problems and/or outages and determine causes and solutions.
- Supports the daily scrum with the team and client meetings, and serves as the technical working lead for the team of Network and System Administrators.
- Server Management: Overseeing Cisco UCS and HPE ProLiant servers, handling deployment, configuration, and maintenance. Managing infrastructure resources for internal teams and external customers and partners.
- Windows Server Administration: Managing Windows Server 2016 and 2019 environments. Configuring Active Directory Domain Services (AD DS), Group Policy Management, and ensuring secure user authentication.
- Virtualization Infrastructure: Architecting, implementing, and managing VMware vSphere and ESXi infrastructure.
- Azure Tenant Management: Managing Azure resources (virtual machines, storage, networking). Implementing access controls and optimizing resource usage.
- Backup and Disaster Recovery: Designing backup strategies and retention schedules. Managing Cohesity backup appliances.
- Network Monitoring and Secret Management: Utilizing SolarWinds Network Performance Monitor (NPM) for network monitoring. Administering Delinea Secret Server for secure storage and access control.
- Network Management: DNS/DHCP, F5s, Cisco Routers and Switches, Palo Alto products, and VDI.
- Certificate Management: Handling Entrust Certificate Services and Venafi SSL Certificate Manager. Issuing, renewing, and managing SSL/TLS certificates.
Minimum Qualifications:
- U.S. Citizenship Required
- Bachelor's Degree in IT, Computer Science or a related field, or equivalent relevant experience; Master's Degree preferred.
- 7-10 years of experience in Information Technology with 5+ years of experience managing IT staff.
Certifications/Training Requirements:
- AZ-104 Microsoft Azure Administrator
- AZ-140 Configuring and Operating Microsoft Azure Virtual Desktop (AVD)
- Cisco Certified Network Associate (CCNA)
- CompTIA Security+
Other Job Specific Skills:
- Attention to detail and a commitment to quality work.
- Ability to elicit, analyze, document, and validate the requirements that support the required system changes to meet business needs.
- Ability to document processes and procedures for standardization.
- Ability to follow customer policies and processes in support of production systems and the changes to production systems.
- Must have the ability to work under aggressive deadlines, manage multiple tasks, and prioritize as necessary.
- Excellent analytical, comprehension, communication, writing, and interpersonal skills.
- Strong skills in IT fundamentals including, but not limited to: IT security tools, Microsoft security tools, server administration, networking, database support/administration, infrastructure support, and IT security design.
- Proven ability to investigate, troubleshoot and resolve complicated technical issues.
- Demonstrated capability to creatively meet complex requirements.
- Ability to plan out tasks for the team and support day-to-day cyber operations activities.
- Proven ability to lead a team.