Monitoring and Observability Architect

Raritan, NJ, US • Posted 2 days ago • Updated 2 days ago
Full Time
On-site
$120,000 - $140,000/yr

Job Details

Skills

  • OpenTelemetry
  • Jaeger
  • Zipkin
  • Prometheus
  • Grafana
  • Datadog
  • Splunk
  • ELK
  • New Relic
  • Dynatrace
  • AppDynamics
  • APM
  • RUM

Summary

Job Title: Tools Architect 
Location: 1003 US-202, Raritan, NJ 08869
Job Title: Monitoring and Observability Architect

Role Overview

We are seeking an experienced Monitoring and Observability Architect to design, implement, and optimize enterprise-wide observability solutions across cloud, on-premises, and hybrid environments. This role is responsible for defining monitoring strategies, improving system reliability, and enabling proactive incident detection through metrics, logs, and traces.

The ideal candidate combines deep technical expertise with architectural vision to build scalable, secure, and resilient observability platforms that support modern DevOps and SRE practices.

Key Responsibilities

Architecture & Strategy

  • Define enterprise observability architecture aligned with business and IT objectives.
  • Design monitoring frameworks for applications, infrastructure, networks, and cloud-native platforms.
  • Establish standards, governance, and best practices for monitoring and alerting.

Implementation & Engineering

  • Architect and deploy tools such as Prometheus, Grafana, Datadog, Splunk, ELK, New Relic, Dynatrace, and AppDynamics.
  • Implement distributed tracing (OpenTelemetry, Jaeger, Zipkin).
  • Design centralized logging and log aggregation solutions.
  • Enable APM, RUM, synthetic monitoring, and infrastructure monitoring.

Cloud & DevOps Integration

  • Integrate observability into CI/CD pipelines.
  • Support Kubernetes and container observability.
  • Automate monitoring setup through Infrastructure as Code (Terraform, ARM, CloudFormation).
  • Collaborate with SRE and DevOps teams to enhance reliability and performance.

Reliability & Incident Management

  • Define SLIs, SLOs, SLAs, and error budgets.
  • Develop intelligent alerting strategies to reduce noise.
  • Enable root cause analysis and performance optimization.
  • Support major incident investigations.

Security & Compliance

  • Ensure monitoring solutions meet security and compliance requirements.
  • Implement role-based access control (RBAC) and secure data handling.

Stakeholder Collaboration

  • Partner with customer, engineering, operations, security, and business teams.
  • Provide technical leadership and mentorship.
  • Present architecture designs to leadership and governance boards.

Disclaimer

HCL is an equal opportunity employer, committed to providing equal employment opportunities to all applicants and employees regardless of race, religion, sex, color, age, national origin, pregnancy, sexual orientation, physical disability or genetic information, military or veteran status, or any other protected classification, in accordance with federal, state, and/or local law. Should any applicant have concerns about discrimination in the hiring process, they should provide a detailed report of those concerns to for investigation.

Compensation and Benefits

A candidate's pay within the range will depend on their work location, skills, experience, education, and other factors permitted by law. This role may also be eligible for performance-based bonuses subject to company policies. In addition, this role is eligible for the following benefits subject to company policies: medical, dental, vision, pharmacy, life, accidental death & dismemberment, and disability insurance; employee assistance program; 401(k) retirement plan; 10 days of paid time off per year (some positions are eligible for need-based leave with no designated number of leave days per year); and 10 paid holidays per year.

  • Dice Id: hcl001
  • Position Id: 12345

Company Info

About HCLTech

HCLTech is a global technology company, home to more than 223,000 people across 60 countries, delivering industry-leading capabilities centered around digital, engineering and cloud, powered by a broad portfolio of technology services and products. 

We work with clients across all major verticals, providing industry solutions for Financial Services, Manufacturing, Life Sciences and Healthcare, Technology and Services, Telecom and Media, Retail and CPG, and Public Services. Consolidated revenues for the 12 months ending March 2025 totaled $13.8 billion.

To learn how we can supercharge progress for you, visit hcltech.com.
