Job Title: Tools Architect
Location (Complete Work Address with Zip code): 1003 US-202, Raritan, NJ 08869
Job Title: Monitoring and Observability Architect
Role Overview
We are seeking an experienced Monitoring and Observability Architect to design, implement, and optimize enterprise-wide observability solutions across cloud, on-premises, and hybrid environments. This role is responsible for defining monitoring strategies, improving system reliability, and enabling proactive incident detection through metrics, logs, and traces.
The ideal candidate combines deep technical expertise with architectural vision to build scalable, secure, and resilient observability platforms that support modern DevOps and SRE practices.
Key Responsibilities
Architecture & Strategy
- Define enterprise observability architecture aligned with business and IT objectives.
- Design monitoring frameworks for applications, infrastructure, networks, and cloud-native platforms.
- Establish standards, governance, and best practices for monitoring and alerting.
Implementation & Engineering
- Architect and deploy tools such as Prometheus, Grafana, Datadog, Splunk, ELK, New Relic, Dynatrace, AppDynamics, etc.
- Implement distributed tracing (OpenTelemetry, Jaeger, Zipkin).
- Design centralized logging and log aggregation solutions.
- Enable APM, RUM, synthetic monitoring, and infrastructure monitoring.
Cloud & DevOps Integration
- Integrate observability into CI/CD pipelines.
- Support Kubernetes and container observability.
- Enable Infrastructure-as-Code monitoring automation (Terraform, ARM, CloudFormation).
- Collaborate with SRE and DevOps teams to enhance reliability and performance.
Reliability & Incident Management
- Define SLI/SLO/SLAs and error budgets.
- Develop intelligent alerting strategies to reduce noise.
- Enable root cause analysis and performance optimization.
- Support major incident investigations.
Security & Compliance
- Ensure monitoring solutions meet security and compliance requirements.
- Implement role-based access control (RBAC) and secure data handling.
Stakeholder Collaboration
- Partner with customer, engineering, operations, security, and business teams.
- Provide technical leadership and mentorship.
- Present architecture designs to leadership and governance boards.
Disclaimer
HCL is an equal opportunity employer, committed to providing equal employment opportunities to all applicants and employees regardless of race, religion, sex, color, age, national origin, pregnancy, sexual orientation, physical disability or genetic information, military or veteran status, or any other protected classification, in accordance with federal, state, and/or local law. Should any applicant have concerns about discrimination in the hiring process, they should provide a detailed report of those concerns to for investigation.
Compensation and Benefits
A candidate s pay within the range will depend on their work location, skills, experience, education, and other factors permitted by law. This role may also be eligible for performance-based bonuses subject to company policies. In addition, this role is eligible for the following benefits subject to company policies: medical, dental, vision, pharmacy, life, accidental death & dismemberment, and disability insurance; employee assistance program; 401(k) retirement plan; 10 days of paid time off per year (some positions are eligible for need-based leave with no designated number of leave days per year); and 10 paid holidays per year.