Integration Observability Architect

Remote • Posted 1 hour ago • Updated 1 hour ago
Contract W2
12 Months
No Travel Required
Remote
Depends on Experience
Fitment

Dice Job Match Score™

👾 Reticulating splines...

Job Details

Skills

  • Extract, Transform, Load
  • ELT
  • Dynatrace
  • Databricks
  • Microsoft Azure
  • Splunk
  • SQL
  • Apache Hadoop
  • Database
  • Data Quality
  • Data Flow

Summary

Data / Batch / Integration Observability Architect

Experience & Purpose:

Minimum 15+ years of experience in the data domain, with strong expertise in defining and implementing monitoring and observability frameworks for enterprise-scale data ecosystems. Responsible for establishing a scalable data observability strategy across pipelines, batch workloads, databases, and integration layers to ensure end-to-end visibility, reliability, operational resilience, and business-impact awareness.

Key Responsibilities

  • Assess observability across:
    • Batch jobs, schedulers, ETL/ELT pipelines, and data platforms
    • Database monitoring, performance, and query behavior
    • Integration and middleware workflows across systems
  • Evaluation:
    • Pipeline visibility (latency, failures, throughput, dependencies, data SLAs)
    • Effectiveness of schedulers/orchestration platforms (e.g., ActiveBatch, Airflow, Control-M)
    • Database observability and performance monitoring practices
  • Identify:
    • Blind spots in data flow, lineage, and cross-system dependencies
    • Failure detection gaps beyond job-level (data quality, freshness, volume anomalies)
    • Inefficiencies in retry mechanisms, alerting, and operational workflows
  • Define:
    • Standard observability patterns and frameworks for data workloads
    • Dependency-aware monitoring models across upstream and downstream systems
    • Actionable dashboards, alerts, and SLAs aligned to business impact
    • Repeatable onboarding patterns for new pipelines and data services
  • Enable intelligent observability:
    • Reduce alert noise and improve signal quality and actionability
    • Correlate events across pipelines, databases, and integrations
    • Link technical failures to business outcomes and downstream impact
  • Incorporate AI capabilities:
    • Anomaly detection in pipeline behavior, data patterns, and performance trends
    • Failure prediction and early warning signals for batch/data workflows
    • Intelligent alerting and correlation across data ecosystems leveraging AIOps platforms such SNOW ITOM, Moog soft or Big Panda
  • Contribute to:
    • Target-state data observability architecture and engineering blueprint
    • Retrofit and modernization guidance for existing pipelines and platforms
    • Integration with ITSM, incident management, and operational workflows

Technical Skills

  • Experience with (any of the following):
    • Schedulers / Orchestration: ActiveBatch, Airflow, Control-M, Autosys
    • Data Platforms: Azure Data Factory, Databricks, Snowflake, Hadoop ecosystem
    • Observability Tools: Azure Monitor, Log Analytics (KQL), Splunk, ELK, Dynatrace, Prometheus
  • Hands-on experience with:
    • ActiveBatch (job scheduling and monitoring)
    • SQL Sentry or similar tools (database observability)
    • Azure Log Analytics (KQL for data monitoring)
    • Azure Monitor (data-related metrics/logs)
    • Understanding of Data pipelines and integration patterns
  • Working knowledge of:
    • Data pipelines (ETL/ELT), batch processing, and integration patterns
    • Database systems and performance monitoring tools (e.g., SQL Sentry or equivalent)
    • Logs, metrics, and event correlation across distributed systems

Expectations / Success Criteria

  • Identify and eliminate critical data pipeline blind spots and failure gaps
  • Establish standard, reusable observability patterns for data workloads
  • Enable end-to-end visibility across upstream and downstream dependencies
  • Improve alert quality, reduce noise, and accelerate issue detection and resolution (MTTR)
  • Deliver a practical, implementable data observability blueprint
  • Drive adoption of proactive and AI-assisted monitoring practices across data ecosystems
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: prutx001
  • Position Id: 8973759
  • Posted 1 hour ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

Remote

Today

Easy Apply

Contract

50 - 65

Remote

Today

Easy Apply

Contract

80

Remote

Today

Easy Apply

Contract, Third Party

Depends on Experience

Remote

2d ago

Easy Apply

Contract, Third Party

70+

Search all similar jobs