Site Reliability Engineer
Location: Onsite 3-5 days per week in Timonium, Maryland 21093
Duration: 12+ month contract with potential for extension
Interview Mode: Mandatory in-person interview in Timonium, Maryland 21093 (local candidates only)
Job Description: Site Reliability Engineer
We are looking for a talented Site Reliability Engineer (SRE) with a strong background in Google Cloud Platform (GCP), Red Hat OpenShift, and Linux administration. The ideal candidate will be responsible for ensuring the reliability, performance, and scalability of our on-premises and cloud-based systems.
Responsibilities:
System Reliability: Ensure the reliability and uptime of critical services and infrastructure.
Google Cloud Expertise: Design, implement, and manage cloud infrastructure using Google Cloud services.
Linux Administration: Perform standard Linux system administration tasks, including monitoring, debugging, and optimizing system performance.
Automation: Develop and maintain automation scripts and tools to improve system efficiency and reduce manual intervention.
Monitoring and Incident Response: Implement monitoring solutions and respond to incidents to minimize downtime and ensure quick recovery.
Collaboration: Work closely with development and operations teams to improve system reliability and performance.
Capacity Planning: Conduct capacity planning and performance tuning to ensure systems can handle future growth.
Documentation: Create and maintain comprehensive documentation for system configurations, processes, and procedures.
Qualifications:
Education: Bachelor's degree in Computer Science, Engineering, or a related field.
Experience: 5+ years of experience in site reliability engineering or a similar role.
Mandatory Skills:
Proficiency in Google Cloud services (Compute Engine, Kubernetes Engine, Cloud Storage, etc.).
Strong Linux administration skills, including experience with system monitoring, debugging, and performance tuning.
Experience with automation tools (Terraform, Ansible, Puppet).
Familiarity with CI/CD pipelines and tools (Azure Pipelines, Jenkins, GitLab CI, etc.).
Strong scripting skills (Python, Bash, etc.).
Knowledge of networking concepts and protocols.
Experience with monitoring tools (Prometheus, Grafana, etc.).
Certifications (any one of the following):
Google Cloud Professional DevOps Engineer
Google Cloud Professional Cloud Architect
Red Hat Certified Engineer (RHCE) or similar Linux certification