Overview
Skills
Job Details
Role: Site Observability Engineer
Duration: 12+ Months
Location: Newtown Square, PA (Remote options are ok)
**Job Summary**:
We are seeking a skilled and experienced Senior Observability Engineer to join the Observability team. The ideal candidate will be responsible for improving our monitoring and alerting posture for Cloud Infrastructure. The role requires a strong understanding of observability tools and practices, with a focus on Prometheus, Grafana, Gardener Kubernetes, and Splunk. Experience with Dynatrace is a plus.
- Implement, manage, and improve monitoring solutions that use Prometheus, ensuring high availability and accurate alerting for our systems.
- Contribute to the development of observability strategies to improve our Cloud monitoring posture.
- Collaborate with development teams to integrate observability into the CI/CD pipeline and throughout the application lifecycle.
- Respond to and investigate incidents, providing thorough post-mortem analyses and implementing preventive measures.
- Stay current with the latest trends and best practices in site reliability and observability.
- Work with cross-functional teams to ensure system reliability, scalability, and performanc