Senior Tools Architect - Enterprise Observability & AIOps
San Jose, CA, US • Posted 10 hours ago • Updated 53 minutes ago

VDart, Inc.
Dice Job Match Score™
🎯 Assessing qualifications...
Job Details
Skills
- Python
- AWS
- ServiceNow
- grafana
- Moogsoft
Summary
Role :- Senior Tools Architect Enterprise Observability & AIOps (Hands-On)
We are seeking a senior-level Tools Architect who is a recognized expert in building and operating modern, enterprise-grade observability and AIOps platforms at global scale. This is a deeply hands-on role: you will personally design, deploy, and tune full-stack observability solutions, build executive and operations dashboards that become the daily heartbeat of the organization, and drive AIOps-led incident management with Moogsoft. You will own deep integration with ServiceNow and serve as the primary technical interface to senior leadership (CTO, CISO, Head of Platform Engineering) for all observability, alerting, and tooling strategy in person, in the San Jose, every day.
Key Responsibilities
- Architect and hands-on implement a unified observability platform covering on-prem, multi-cloud (Azure + AWS), containers, and SaaS applications.
- Own the full Moogsoft AIOps deployment: ingestion pipelines, situation clustering, noise reduction, automated remediation workflows, and integration with incident/response tools.
- Design, build, and maintain enterprise single-pane-of-glass dashboards (executive, NOC/SOC, service-owner, and engineering views) in tools such as Grafana, Datadog, Dynatrace, New Relic, or Lightstep.
- Lead deep bi-directional integration between observability tools and ServiceNow (Event Management, CMDB, Incident, Change, ITSM workflows, Service Mapping, Discovery).
- Drive event correlation, alerting rationalization, and elimination of alert fatigue using Moogsoft and supporting tools.
- Hands-on build and maintain data ingestion pipelines (metrics, events, logs, traces) using Prometheus, OpenTelemetry, Fluent Bit/Fluentd, Elastic, Splunk, Datadog agents, etc.
- Create and present observability maturity roadmaps, AIOps business cases, SLA/SLO reporting, and tool rationalization plans to C-level executives and the board in-person in San Jose.
- Own licensing strategy, cost governance (FinOps for observability), and vendor relationships across the entire stack.
- Mentor observability engineers and act as the final escalation owner for major incidents and platform issues.
Required Experience & Skills
- 10+ years in enterprise IT operations with 6+ years owning large-scale observability and AIOps platforms (5,000+ servers, 50,000+ containers, multi-region).
- Deep, hands-on expertise with Moogsoft AIOps (recent versions) you have built or rebuilt Moogsoft environments from scratch, tuned clustering algorithms, and delivered >80% noise reduction.
- Proven track record building and operating enterprise dashboards that are used daily by executives, NOC/SOC, and engineering teams.
- Expert-level ServiceNow integration experience:
- Event Management (event rules, alert grouping, MID servers)
- Bi-directional incident sync with Moogsoft or other tools
- CMDB population via Discovery and Service Mapping
- Custom ServiceNow dashboards and Performance Analytics
- Broad modern observability stack experience (at least four of the following required):
- Metrics & Dashboards: Grafana (advanced), Datadog, Dynatrace, New Relic, Lightstep
- Logs & Tracing: Elastic Stack, Splunk, Loki, OpenTelemetry
- Cloud-native: Azure Monitor, CloudWatch, Google Operations
- AIOps: Moogsoft (mandatory), BigPanda, PagerDuty SignalFlow
- Strong scripting/automation: Python (mandatory), Go, PowerShell, Ansible.
- Comfortable and effective presenting in-person to senior leadership and war-room teams in San Jose HQ on a daily basis.
Certifications (at least two required)
- Moogsoft Certified Engineer or Architect
- ServiceNow Certified Implementation Specialist Event Management
- Datadog Certified Architect, Dynatrace Associate/Pro, Grafana TCO, etc.
- ITIL v4 Foundation or higher
- Dice Id: 10330808
- Position Id: 2026-93850/499991
- Posted 10 hours ago
Company Info
VDart, headquartered in Atlanta, GA, is a global leader in digital talent solutions and IT staffing, delivering top technology professionals to businesses worldwide. With a strong presence across North America, Europe and Asia, we specialize in helping organizations navigate complex technology landscapes with the right expertise.
Through a strategic, client-focused approach, we have placed over 20,000 professionals across key industries and advanced technology solutions. Whether placing top talent in cutting-edge roles or providing strategic digital workforce solutions, our network of 4,000 specialists across 13 countries is committed to excellence, agility and impact.
Backed by 18 years of industry experience, we go beyond staffing to build long-term partnerships that accelerate digital transformation and drive sustained growth. Whether you need a technology partner to fuel innovation or specialized workforce solutions to maintain a competitive edge, VDart delivers the right people, skills and mindset to create a lasting impact in a digital-first world.
Similar Jobs
It looks like there aren't any Similar Jobs for this job yet.
Search all similar jobs