Job Details
Key skills for this SRE role: basic system monitoring, Ansible scripting, Azure, cloud network operations, Python, and a basic understanding of the cloud.
Job Description: We are seeking a dedicated Site Reliability Engineer II to join our team. In this role, you will be responsible for ensuring the reliability, availability, and performance of our systems and services. You will work closely with cross-functional teams to implement best practices and drive improvements in our infrastructure.
Responsibilities:
- Monitor and maintain the health of production systems and services.
- Develop and implement automation tools to improve system reliability and efficiency.
- Collaborate with development teams to design scalable and robust systems.
- Troubleshoot and resolve incidents, ensuring minimal downtime.
- Participate in on-call rotations to provide 24/7 support for critical systems.
- Continuously improve system performance and reliability through proactive measures.
Education Qualification: Bachelor's degree in Computer Science, Information Technology, or a related field.
Required Skills:
- Proven experience as a Site Reliability Engineer or similar role.
- Strong understanding of cloud infrastructure and services.
- Proficiency in scripting languages such as Python, Bash, or similar.
- Experience with monitoring and logging tools.
- Knowledge of containerization and orchestration technologies.
- Excellent problem-solving and communication skills.
Nice to Have Skills:
- Experience with CI/CD pipelines.
- Familiarity with configuration management tools.
- Understanding of network protocols and security best practices.