Senior Tools Architect Enterprise Observability & AIOps
San Jose, CA, US • Posted 3 hours ago • Updated 3 hours ago

K-Tek Resourcing LLC
Dice Job Match Score™
⏳ Almost there, hang tight...
Job Details
Skills
- Algorithms
- Amazon Web Services
- Configuration Management Database
- Dashboard
- Dynatrace
- Ansible
- Business Cases
- Grafana
- IT Operations
- IT Service Management
- Business Intelligence
- Cloud Computing
- Clustering
- Event Management
- Fluency
- Incident Management
- Leadership
- Licensing
- Mapping
- Mentorship
- Microsoft Azure
- NOC
- New Relic
- Presentations
- Python
- Reporting
- Roadmaps
- SAP BASIS
- SLA
- SaaS
- Scratch
- Scripting
- Servers
- ServiceNow
- Splunk
- System On A Chip
- Vendor Relationships
- WAR
- Windows PowerShell
- Workflow
- ITIL
- Moogsoft Certified
- ITIL v4
- ITILv4
- v4
- CTO
- CISO
- Elastic
- Datadog
- AIOps deployment
- AIOps
- deployment
- Change
- ITSM workflows
- Service Mapping
- Discovery
Summary
Role: Senior Tools Architect Enterprise Observability & AIOps (Hands-On)
Location: San Jose, California 100% On-site 5 days a week required
Mode: Contract
Work Authorization: USC-EAD/-EAD/L2-EAD/TN/E3
Job Description:
About the Role
We are seeking a senior-level Tools Architect who is a recognized expert in building and operating modern, enterprise-grade observability and AIOps platforms at global scale. This is a deeply hands-on role: you will personally design, deploy, and tune full-stack observability solutions, build executive and operations dashboards that become the daily heartbeat of the organization, and drive AIOps-led incident management with Moogsoft. You will own deep integration with ServiceNow and serve as the primary technical interface to senior leadership (CTO, CISO, Head of Platform Engineering) for all observability, alerting, and tooling strategy in person, in the San Jose, every day.
Key Responsibilities
- Architect and hands-on implement a unified observability platform covering on-prem, multi-cloud (Azure + AWS), containers, and SaaS applications.
- Own the full Moogsoft AIOps deployment: ingestion pipelines, situation clustering, noise reduction, automated remediation workflows, and integration with incident/response tools.
- Design, build, and maintain enterprise single-pane-of-glass dashboards (executive, NOC/SOC, service-owner, and engineering views) in tools such as Grafana, Datadog, Dynatrace, New Relic, or Lightstep.
- Lead deep bi-directional integration between observability tools and ServiceNow (Event Management, CMDB, Incident, Change, ITSM workflows, Service Mapping, Discovery).
- Drive event correlation, alerting rationalization, and elimination of alert fatigue using Moogsoft and supporting tools.
- Hands-on build and maintain data ingestion pipelines (metrics, events, logs, traces) using Prometheus, OpenTelemetry, Fluent Bit/Fluentd, Elastic, Splunk, Datadog agents, etc.
- Create and present observability maturity roadmaps, AIOps business cases, SLA/SLO reporting, and tool rationalization plans to C-level executives and the board in-person in San Jose.
- Own licensing strategy, cost governance (FinOps for observability), and vendor relationships across the entire stack.
- Mentor observability engineers and act as the final escalation owner for major incidents and platform issues.
Required Experience & Skills
- 10+ years in enterprise IT operations with 6+ years owning large-scale observability and AIOps platforms (5,000+ servers, 50,000+ containers, multi-region).
- Deep, hands-on expertise with Moogsoft AIOps (recent versions) you have built or rebuilt Moogsoft environments from scratch, tuned clustering algorithms, and delivered >80% noise reduction.
- Proven track record building and operating enterprise dashboards that are used daily by executives, NOC/SOC, and engineering teams.
- Expert-level ServiceNow integration experience:
- Event Management (event rules, alert grouping, MID servers)
- Bi-directional incident sync with Moogsoft or other tools
- CMDB population via Discovery and Service Mapping
- Custom ServiceNow dashboards and Performance Analytics
- Broad modern observability stack experience (at least four of the following required):
- Metrics & Dashboards: Grafana (advanced), Datadog, Dynatrace, New Relic, Lightstep
- Logs & Tracing: Elastic Stack, Splunk, Loki, OpenTelemetry
- Cloud-native: Azure Monitor, CloudWatch, Google Operations
- AIOps: Moogsoft (mandatory), BigPanda, PagerDuty SignalFlow
- Strong scripting/automation: Python (mandatory), Go, PowerShell, and Ansible.
- Comfortable and effective at presenting in person to senior leadership and war-room teams in San Jose HQ on a daily basis.
Certifications (at least two required)
- Moogsoft Certified Engineer or Architect
- ServiceNow Certified Implementation Specialist Event Management
- Datadog Certified Architect, Dynatrace Associate/Pro, Grafana TCO, etc.
- ITIL v4 Foundation or higher
- Dice Id: 10411276
- Position Id: 8860130
- Posted 3 hours ago
Company Info
Vision
To be a trusted partner and advisor to our customers
Mission
At K-Tek we believe in understanding the specific needs of the customer and tailor-creating innovative solutions to meet these needs. We invest in our employees and customers. We build a relation of trust with our customers through empathy, solutions and being the first time right.
Who We Are & What We Do
K-Tek Resourcing is a consulting organization with offices in Houston TX and St. Paul, MN. It is supported by 2 global delivery centers, located in India. With its global employee strength of over 250, K-Tek has been supporting its clients for over 9 years. We have been consistently achieving a growth of 30% Year on Year. We have an extensive experience of working in domains including BFSI, Retail, Healthcare and Pharma, Oil & Gas, Travel & Hospitality and Insurance. The technologies we service are IT Infrastructure, Mobile Technologies, Cloud & Big Data Solutions. We understand the needs of our customers and provide them with customized solutions and resources with the tenet of being the "First Time Right".
Values
-Commitment to our customers success through Integrity
-Excellence through Quality
-Growth through customer value creation
Similar Jobs
It looks like there aren't any Similar Jobs for this job yet.
Search all similar jobs
