Overview
Hybrid
$140,000 - $180,000
Full Time
Skills
Prometheus
Grafana
Job Details
About The Role
We're looking for a skilled Monitoring Engineer to strengthen our infrastructure operations team. You'll be the driving force behind our monitoring ecosystem, ensuring system health and operational visibility across all environments. This role combines technical expertise with collaborative problem-solving to create monitoring solutions that empower our teams.
What You'll Do
- Design and implement comprehensive monitoring strategies using Prometheus Alertmanager
- Develop and optimize PromQL queries that precisely identify system issues
- Create intuitive Grafana dashboards providing actionable insights for various stakeholders
- Build robust log collection pipelines with Fluent-bit and Elasticsearch
- Craft Kibana dashboards and alerts that transform raw logs into operational intelligence
- Partner with the CA Spectrum team on SNMP event monitoring and infrastructure visibility
- Maintain consistency across monitoring configurations while adapting to team-specific needs
- Collaborate across engineering teams to understand monitoring requirements and implement effective solutions
What You Bring
- 2-4 years experience in infrastructure monitoring or DevOps roles
- Strong hands-on expertise with Prometheus, Grafana, and Alertmanager ecosystems
- Proficiency in PromQL for effective metric-based monitoring
- Experience implementing log collection pipelines using Fluent-bit and Elasticsearch
- Knowledge of Kibana dashboards and alerting capabilities
- Familiarity with SNMP-based monitoring tools like CA Spectrum
- Scripting abilities using Bash, Python, or similar technologies
- Clear communication skills and a collaborative, solution-oriented mindset
Bonus Points For
- Experience with major cloud platforms (AWS, Azure, Google Cloud Platform)
- Knowledge of Infrastructure as Code and CI/CD methodologies
- Kubernetes monitoring experience
- Configuration management using tools like Terraform or Ansible
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.