Overview
Hybrid
Depends on Experience
Contract - Independent
Contract - W2
Contract - 12 Month(s)
Skills
LogScale
Splunk
Dynatrace
Git
Jenkins
Artifactory
Red Hat Linux
OpenShift
Windows
Java
Oracle
Job Details
Job Title - Site Reliability Architect/Lead
Shift - Saturday and Sunday, 7 pm to 7 am, plus 2 weekdays at 8 hours each
Hybrid Locations - Pittsburgh, Cleveland, Phoenix, Dallas, Birmingham
Contract Duration - 12+ months
Position:
- Role Overview: As an SRC Lead, you'll be at the forefront of ensuring the reliability, availability, and performance of critical enterprise technology and security applications. Your leadership will drive operational excellence, foster collaboration, and elevate the overall reliability of our systems within the Site Reliability Center (SRC). You'll work closely with cross-functional teams, mentor engineers, and contribute to the success of the organization.
Top Technologies:
- Monitoring and Debugging Tools (LogScale, Splunk, Dynatrace)
- DevOps pipeline (Git, Jenkins, Artifactory)
- Infrastructure (Red Hat Linux, OpenShift, Windows)
- Networking (DNS, Load-balancing, Network tracing, Firewall)
- Database (Oracle, SQL)
- API and web services technologies (SOAP, JSON, REST)
- Directories (LDAP, Active Directory)
- Java
Secondary:
- Python/Java scripting, Ansible, and PowerShell for automation
- Modern development technologies and tools (Agile, CI/CD, Git, Jenkins)
- Kafka Event Streaming
- ETL/Informatica
Nice to Have:
- Database (Mongo, Cassandra, other databases)
- Evolve
Key responsibilities:
- Create and maintain documentation to ensure knowledge accessibility.
- Liaise with other application support teams and internal/external business and technical partners.
- Provide ad hoc and on-demand reports.
- Perform timely escalation of critical issues and proactively identify patterns of recurring issues to improve production.
- Lead problem resolution, conduct root cause analysis, and establish processes that help prevent incidents.
- Participate in the Incident and Problem Management processes as a resolver accountable for root cause analysis, resolution, and reporting.
- Provide guidance to all staff and vendors involved to drive a coordinated approach to results.
- Reduce escalations to Level 3 based on incremental learning about applications.