Sr. Network Observability Engineer SME

Overview

Contract - W2
Contract - Independent
Contract - of Contract

Skills

Network Engineering
Routing
Network Monitoring
Cloud infrastructure operations
Azure Monitor / Network Watcher / Log Analytics
GCP Cloud Monitoring / Logging / Network Intelligence Center
OCI Monitoring / Logging / Network Visualizer
Scripting skills (Python, Bash, or PowerShell)
Cloud traffic flow

Job Details

Job Title: Sr. Network Observability Engineer SME
Location: Plano, TX & Pleasanton, CA (Hybrid)
Duration: Long-Term Contract

We are seeking a Senior Network Observability Engineer SME with deep expertise in cloud-based network visibility and performance monitoring across Azure, Google Cloud Platform, and OCI. The ideal candidate will have hands-on experience with Grafana and with the native observability and monitoring tools provided by Microsoft, Google Cloud, and Oracle Cloud. This role will focus on building and scaling proactive monitoring, alerting, and telemetry strategies for enterprise cloud networks.

What You'll Do:

  • Design and implement end-to-end network observability and monitoring solutions across Azure, Google Cloud Platform, and OCI environments.
  • Leverage and integrate tools such as Grafana, Azure Monitor, Google Cloud Operations Suite (formerly Stackdriver), and OCI Monitoring and Logging.
  • Develop and maintain custom dashboards, alerts, and metrics pipelines for real-time visibility into network health, latency, packet loss, throughput, and service availability.
  • Collaborate with network, cloud, and SRE teams to identify observability gaps and implement improvements.
  • Define SLIs, SLOs, and KPIs for network performance and availability monitoring.
  • Provide expert-level support in incident analysis, troubleshooting, and root cause determination.
  • Automate data collection, log ingestion, and telemetry exports using APIs and scripting (e.g., Python, Bash).
  • Document observability standards, procedures, and best practices.

What You Know:

Required:

  • 7+ years in network engineering, network monitoring, or cloud infrastructure operations.
  • Proven experience with Grafana for real-time visualization and dashboard creation.
  • Strong understanding of network performance metrics and cloud-native monitoring tools:
    • Azure Monitor / Network Watcher / Log Analytics
    • Google Cloud Monitoring / Logging / Network Intelligence Center
    • OCI Monitoring / Logging / Network Visualizer
  • Solid grasp of network protocols, routing, latency analysis, and cloud traffic flow.
  • Scripting skills (Python, Bash, or PowerShell) for automation of observability tasks.
  • Familiarity with telemetry collection, time-series databases, and log aggregation platforms.

Preferred:

  • Certifications such as Azure Administrator, Google Cloud Platform Network Engineer, OCI Architect Associate, or equivalent.
  • Experience integrating observability tools with incident management platforms (e.g., PagerDuty, ServiceNow).
  • Knowledge of Prometheus, OpenTelemetry, or other open-source monitoring frameworks.
  • Exposure to SRE practices and DevOps monitoring pipelines.