Overview
On Site
Contract - W2
Contract - 18 Month(s)
Skills
AWS
Kubernetes
Lambda
EC2
S3
Prometheus /Grafana
Job Details
Job Title : Cloud Vizualization Engineer (PromethGrafana)
location : Westlake, TX / Merrimack, NH (hybrid)
Duration : 18 Month
location : Westlake, TX / Merrimack, NH (hybrid)
Duration : 18 Month
Must Have Skills:
Prometheus /Grafana
AWS, EC2, S3, Lambda
Kubernetes
Prometheus /Grafana
AWS, EC2, S3, Lambda
Kubernetes
Preferred Skills:
Datadog
Datadog
Notes:
. This one we're going to be looking for someone who is really strong Prometheus and Grafana experience to help out as they migrate to OTEL Observability platform.
Ideally, they'd like someone to have come from a Software Engineering background earlier in their career and they got into the Cloud Space, They will not be doing much programming and development like the other roles, but will do some scripting working in automation with Python.
They'd be expected to provide L3 support as needed.
. This one we're going to be looking for someone who is really strong Prometheus and Grafana experience to help out as they migrate to OTEL Observability platform.
Ideally, they'd like someone to have come from a Software Engineering background earlier in their career and they got into the Cloud Space, They will not be doing much programming and development like the other roles, but will do some scripting working in automation with Python.
They'd be expected to provide L3 support as needed.
Description
Environment:
We use Prometheus, which monitors cloud-native systems, such as Kubernetes. The data is graphically processed with the help of Grafana and made available in a dashboard and alerts to alerting.
Looking for an experienced SME/ Sr Engineer with a deep understanding of Grafana and Prometheus to join our team. In this role, you will be responsible for optimizing and advancing our monitoring and observability systems. Your expertise will be critical in ensuring the reliability, performance, and scalability of our infrastructure.
Additionally, we are looking for engineer to be doing automation and building tools for Observability domain using opensource technology (OTEL, Open Search, Grafana, Open Tofu) and Cloud Technologies (EKS, EC2, S3, Cloud Networking). Expertise in any one or more programing language (Python, Go lang, Java)
We use Prometheus, which monitors cloud-native systems, such as Kubernetes. The data is graphically processed with the help of Grafana and made available in a dashboard and alerts to alerting.
Looking for an experienced SME/ Sr Engineer with a deep understanding of Grafana and Prometheus to join our team. In this role, you will be responsible for optimizing and advancing our monitoring and observability systems. Your expertise will be critical in ensuring the reliability, performance, and scalability of our infrastructure.
Additionally, we are looking for engineer to be doing automation and building tools for Observability domain using opensource technology (OTEL, Open Search, Grafana, Open Tofu) and Cloud Technologies (EKS, EC2, S3, Cloud Networking). Expertise in any one or more programing language (Python, Go lang, Java)
Key responsibilities:
1. Monitoring and Alerting:
Design and manage alerting rules for proactive issue identification and resolution.
Continuously improve and expand monitoring coverage to meet evolving needs.
Collaborate with teams to define alert thresholds and escalation procedures.
2. Data Analysis and Visualization:
Analyze metrics data to identify performance bottlenecks and areas for improvement.
Create meaningful visualizations and reports to provide insights for stakeholders.
Contribute to the enhancement of data retention and archiving strategies.
3. Scaling and Optimization:
Collaborate with the infrastructure team to ensure seamless integration and scalability of Grafana and Prometheus.
Fine-tune configurations to achieve optimal resource utilization and performance.
Proven experience as an L3 Engineer specializing in Grafana and Prometheus administration.
Proficiency in creating custom Grafana dashboards and queries.
Strong understanding of monitoring best practices, alerting, and data analysis.
Knowledge of time-series databases and storage strategies.
4. Automation and Development
Scripting and automation skills for efficient system management.
Building OTEL based component for Observability Stack
Automation building Observability query language conversions
The Team
This is an opportunity for a highly motivated Senior Cloud Engineer to join the Cloud Observability Platform team in Cloud and Platform Engineering (CAPE) who are responsible for enabling the next generation cloud applications and platforms across Fidelity. You will work in a diverse, open and transparent culture using innovative solutions with the latest cloud native technologies and leading-edge engineering practices.
1. Monitoring and Alerting:
Design and manage alerting rules for proactive issue identification and resolution.
Continuously improve and expand monitoring coverage to meet evolving needs.
Collaborate with teams to define alert thresholds and escalation procedures.
2. Data Analysis and Visualization:
Analyze metrics data to identify performance bottlenecks and areas for improvement.
Create meaningful visualizations and reports to provide insights for stakeholders.
Contribute to the enhancement of data retention and archiving strategies.
3. Scaling and Optimization:
Collaborate with the infrastructure team to ensure seamless integration and scalability of Grafana and Prometheus.
Fine-tune configurations to achieve optimal resource utilization and performance.
Proven experience as an L3 Engineer specializing in Grafana and Prometheus administration.
Proficiency in creating custom Grafana dashboards and queries.
Strong understanding of monitoring best practices, alerting, and data analysis.
Knowledge of time-series databases and storage strategies.
4. Automation and Development
Scripting and automation skills for efficient system management.
Building OTEL based component for Observability Stack
Automation building Observability query language conversions
The Team
This is an opportunity for a highly motivated Senior Cloud Engineer to join the Cloud Observability Platform team in Cloud and Platform Engineering (CAPE) who are responsible for enabling the next generation cloud applications and platforms across Fidelity. You will work in a diverse, open and transparent culture using innovative solutions with the latest cloud native technologies and leading-edge engineering practices.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.