Observability Architect Engineer

Overview

On Site
Compensation information provided in the description
Full Time

Skills

Optimization
Visualization
Dashboard
Real-time
SNMP
Scripting
Workflow
Reliability Engineering
Root Cause Analysis
Design Review
Leadership
SaaS
Documentation
High-level Design
Regulatory Compliance
Enterprise Networks
Network
Performance Monitoring
Analytics
Python
Ansible
Terraform
API
Mentorship
Architectural Design
Grafana
Performance Management
Performance Tuning
Cisco
Amazon Web Services
Microsoft Azure
Cloud Computing
Computer Networking
Dragon NaturallySpeaking
DNS
DHCP
IP Address Management
NAC
Microsoft Windows
Migration
Incident Management
Taxes
Life Insurance
LOS
Collaboration
Partnership
Business Transformation
Law

Job Details

Description

Summary We are seeking a senior engineer/architect to lead the design, implementation, and optimization of full stack observability solutions across large-scale enterprise environments. This role will drive architecture and automation for network, application, and infrastructure monitoring, leveraging platforms such as Catalyst Center, ThousandEyes, Grafana, and custom automation frameworks. The ideal candidate demonstrates deep, hands-on expertise in observability, automation, and performance analytics, with a proven ability to deliver scalable solutions and mentor engineering teams. Key Responsibilities Own end-to-end observability architecture: Design and implement integrated monitoring solutions for network, application, and infrastructure domains, ensuring visibility, reliability, and actionable insights. Lead Catalyst Center-driven automation: Develop templates, workflows, and closed-loop operations for network assurance, leveraging Catalyst Center APIs and automation tools. ThousandEyes deployment and analytics: Architect and operationalize ThousandEyes for synthetic and real-user monitoring, path visualization, and outage detection across distributed environments. Grafana dashboarding and analytics: Build and maintain Grafana dashboards for real-time and historical performance analytics, integrating diverse data sources (SNMP, API, logs, metrics). Automation and integration: Develop and maintain automation scripts and frameworks (Python, Ansible, Terraform) for observability, alerting, and remediation workflows. Performance and reliability engineering: Define SLOs/SLIs, implement proactive monitoring, and drive root-cause analysis for critical incidents. Mentor and uplift engineering teams: Conduct design reviews, develop standards and runbooks, and deliver enablement sessions for operations and field engineers. Stakeholder leadership: Collaborate with security, cloud, application, and operations teams to translate business outcomes into technical architectures and measurable milestones. Documentation & governance: Produce HLD/LLD, as-builts, standards, compliance artifacts, and reusable templates for observability and automation. Required Qualifications (Must-Have) 10+ years experience in enterprise networking, systems, or cloud engineering, including 3-5+ years leading observability and automation initiatives at scale. Proven, exceptional hands-on skills with Catalyst Center, ThousandEyes, and Grafana for monitoring, analytics, and automation. Deep expertise in network and application performance monitoring, synthetic and real-user analytics, and incident response. Strong experience with automation frameworks (Python, Ansible, Terraform) and API integrations. Demonstrated success leading complex, multi-phase deployments and mentoring senior engineers.

Skills

Cloud, Architectural design, grafana, automation, thousand eyes, catalyst center, Monitoring tools, Architecture, performance management, performance optimization

Top Skills Details

Cloud,Architectural design,grafana,automation,thousand eyes,catalyst center,Monitoring tools,Architecture,performance management,performance optimization

Additional Skills & Qualifications

Preferred Qualifications Certifications in observability, automation, or cloud platforms (e.g., Cisco Certified Specialist - Observability, AWS/Azure monitoring). Experience with cloud networking, hybrid connectivity, and integration of DNS/DHCP/IPAM data sources. Familiarity with Zero Trust, NAC posture, and security monitoring. Experience with data center and campus interconnect monitoring (ACI concepts beneficial but not required). Work Style & Travel Must be able to work onsite at client locations as required. Off-hours change windows may be needed for critical migrations and incident response.

Experience Level

Expert Level

Job Type & Location
This is a Contract position based out of Los Angeles, CA.
Pay and Benefits
The pay range for this position is $85.00 - $110.00/hr.
Eligibility requirements apply to some benefits and may depend on your job
classification and length of employment. Benefits are subject to change and may be
subject to specific elections, plan, or program terms. If eligible, the benefits
available for this temporary role may include the following:
Medical, dental & vision
Critical Illness, Accident, and Hospital
401(k) Retirement Plan - Pre-tax and Roth post-tax contributions available
Life Insurance (Voluntary Life & AD&D for the employee and dependents)
Short and long-term disability
Health Spending Account (HSA)
Transportation benefits
Employee Assistance Program
Time Off/Leave (PTO, Vacation or Sick Leave)
Workplace Type
This is a fully onsite position in Los Angeles,CA.
Application Deadline
This position is anticipated to close on Dec 6, 2025.
>About TEKsystems:
We're partners in transformation. We help clients activate ideas and solutions to take advantage of a new world of opportunity. We are a team of 80,000 strong, working with over 6,000 clients, including 80% of the Fortune 500, across North America, Europe and Asia. As an industry leader in Full-Stack Technology Services, Talent Services, and real-world application, we work with progressive leaders to drive change. That's the power of true partnership. TEKsystems is an Allegis Group company.

The company is an equal opportunity employer and will consider all applications without regards to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law.

About TEKsystems and TEKsystems Global Services

We're a leading provider of business and technology services. We accelerate business transformation for our customers. Our expertise in strategy, design, execution and operations unlocks business value through a range of solutions. We're a team of 80,000 strong, working with over 6,000 customers, including 80% of the Fortune 500 across North America, Europe and Asia, who partner with us for our scale, full-stack capabilities and speed. We're strategic thinkers, hands-on collaborators, helping customers capitalize on change and master the momentum of technology. We're building tomorrow by delivering business outcomes and making positive impacts in our global communities. TEKsystems and TEKsystems Global Services are Allegis Group companies. Learn more at TEKsystems.com.

The company is an equal opportunity employer and will consider all applications without regard to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About TEKsystems c/o Allegis Group