Observability Engineer (3448) - Tampa/Dallas

  • Coppell, TX
  • Posted 1 day ago | Updated 3 hours ago

Overview

On Site
Contract - W2


Job Details



Observability & AIOps Engineer


Location: Dallas or Tampa | Hybrid: 3 days onsite
Contract: 6-month contract-to-hire


We are seeking a senior-level Observability & AIOps Engineer with hands-on Java and Python experience to improve enterprise IT observability, resilience, and reliability. The role combines engineering work with architectural guidance to optimize monitoring and performance across IT systems.


Key Responsibilities



  • Design, prototype, test, and document observability and reliability solutions.

  • Publish technology strategies, observability standards, and best practices.

  • Translate business goals into technical solutions that meet non-functional requirements.

  • Create Observability Driven Development procedures and promote adoption of open-standard frameworks (OTel, MELTS).

  • Implement AI-augmented testing strategies for federated execution and enterprise governance.

  • Collaborate with SREs and production support teams to improve distributed tracing, trade processing reliability, and chaos testing.

  • Design and implement full-stack applications for operational predictability and prescriptive disruption response.

  • Establish monitoring and alerting standards for performance, scalability, availability, and reliability.


Experience & Qualifications



  • Distributed Applications: 10+ years designing and implementing distributed systems.

  • Networking & Infrastructure: 5+ years in networking, middleware, infrastructure, and database architecture.

  • Highly Available Architecture: 5+ years implementing highly available solutions.

  • Disaster Recovery: 5+ years with disaster recovery methodologies and patterns.

  • Hands-On Development: Senior-level expertise in Java and Python for observability and reliability engineering.


Knowledge & Skills



  • Strong problem-solving and independent work capabilities.

  • Familiarity with public cloud environments (AWS, Azure) is a plus.

  • Performance analysis, tuning, and engineering experience is desirable.

  • Knowledge of monitoring/observability tools: Dynatrace, Splunk, Grafana, Prometheus, OpenTelemetry, CloudWatch, CloudTrail.

  • Ability to design solutions that improve resilience, reliability, and operational efficiency.




Dexian is a leading provider of staffing, IT, and workforce solutions with over 12,000 employees and 70 locations worldwide. As one of the largest IT staffing companies and the 2nd largest minority-owned staffing company in the U.S., Dexian was formed in 2023 through the merger of DISYS and Signature Consultants. Combining the best elements of its core companies, Dexian's platform connects talent, technology, and organizations to produce game-changing results that help everyone achieve their ambitions and goals.


Dexian's brands include Dexian DISYS, Dexian Signature Consultants, Dexian Government Solutions, Dexian Talent Development, and Dexian IT Solutions.


Dexian is an Equal Opportunity Employer that recruits and hires qualified candidates without regard to race, religion, sex, sexual orientation, gender identity, age, national origin, ancestry, citizenship, disability, or veteran status.


