Observability & AIOps Engineer
Location: Dallas or Tampa | Hybrid: 3 days onsite
Contract: 6-month contract-to-hire
We are seeking a senior-level Observability & AIOps Engineer with hands-on Java and Python experience to improve enterprise IT observability, resilience, and reliability. The role combines engineering work with architectural guidance to optimize monitoring and performance across IT systems.
Key Responsibilities
- Design, prototype, test, and document observability and reliability solutions.
- Publish technology strategies, observability standards, and best practices.
- Translate business goals into technical solutions that meet non-functional requirements.
- Create Observability-Driven Development procedures and promote adoption of open-standard frameworks (OTel, MELTS).
- Implement AI-augmented testing strategies for federated execution and enterprise governance.
- Collaborate with SREs and production support teams to improve distributed tracing, trade processing reliability, and chaos testing.
- Design and implement full-stack applications for operational predictability and prescriptive disruption response.
- Establish monitoring and alerting standards for performance, scalability, availability, and reliability.
Experience & Qualifications
- Distributed Applications: 10+ years designing and implementing distributed systems.
- Networking & Infrastructure: 5+ years in networking, middleware, infrastructure, and database architecture.
- Highly Available Architecture: 5+ years implementing highly available solutions.
- Disaster Recovery: 5+ years with disaster recovery methodologies and patterns.
- Hands-On Development: Senior-level expertise in Java and Python for observability and reliability engineering.
Knowledge & Skills
- Strong problem-solving and independent work capabilities.
- Familiarity with public cloud environments (AWS, Azure) is a plus.
- Performance analysis, tuning, and engineering experience is desirable.
- Knowledge of monitoring/observability tools: Dynatrace, Splunk, Grafana, Prometheus, OpenTelemetry, CloudWatch, CloudTrail.
- Ability to design solutions that improve resilience, reliability, and operational efficiency.
Dexian is a leading provider of staffing, IT, and workforce solutions with over 12,000 employees and 70 locations worldwide. As one of the largest IT staffing companies and the second-largest minority-owned staffing company in the U.S., Dexian was formed in 2023 through the merger of DISYS and Signature Consultants. Combining the best elements of its core companies, Dexian's platform connects talent, technology, and organizations to produce game-changing results that help everyone achieve their ambitions and goals.
Dexian's brands include Dexian DISYS, Dexian Signature Consultants, Dexian Government Solutions, Dexian Talent Development, and Dexian IT Solutions. Visit to learn more.
Dexian is an Equal Opportunity Employer that recruits and hires qualified candidates without regard to race, religion, sex, sexual orientation, gender identity, age, national origin, ancestry, citizenship, disability, or veteran status.