Site Reliability & Observability Architect

Overview

On Site
Hybrid
BASED ON EXPERIENCE
Contract - Independent
Contract - W2
Contract - 1+ mo(s)

Skills

Architectural Design
System Integration
Management
Design Patterns
Specification Gathering
Continuous Improvement
Emerging Technologies
IT Strategy
Systems Architecture
Software Development
Cyber Security
Stacks Blockchain
Mainframe
High Availability
Leadership
Computer Science
Information Systems
Programming Languages
Cloud Architecture
DevOps
Dynatrace
Grafana
New Relic
Splunk
AppDynamics
Microsoft Azure
Amazon Web Services
IBM SmartCloud
Google Cloud
Google Cloud Platform
OCI
Orchestration
Kubernetes
Docker
Scripting
Python
Java
Bash
Problem Solving
Conflict Resolution
Analytical Skill
Documentation
Technical Writing
Security Awareness
Regulatory Compliance
Reliability Engineering
Infrastructure Architecture
Cloud Computing
Virtualization
Computer Networking
Firewall
Routing
Virtual Private Network
Collaboration
Communication
FOCUS
Professional Services
Genetics
Law
Privacy
Artificial Intelligence

Job Details

Title: Site Reliability & Observability Architect
Location: DFW, Tx - Hybrid
Rate: $35-40/hour
Work Requirements: , Holders or Authorized to Work in the U.S.

What You ll Do
This list reflects the current responsibilities; additional essential or non-essential tasks may be assigned as needed.

  • Architectural Design: Develop and maintain the overall architecture of IT solutions, ensuring they are scalable, secure, and efficient.

  • Stakeholder Collaboration: Work with business leaders, project managers, and IT teams to gather requirements and translate them into technical solutions.

  • System Integration: Oversee integration of new technologies and systems into the existing IT environment, ensuring compatibility and performance.

  • Documentation: Create detailed architectural documentation, including diagrams, design patterns, and technical specifications.

  • Compliance and Security: Ensure solutions comply with industry standards and regulations; implement robust security measures to protect data and systems.

  • Continuous Improvement: Stay updated with emerging technologies and industry trends and incorporate relevant advancements into IT strategy.

Minimum Qualifications

  • Bachelor s degree in Computer Science, Information Systems, Technology, or related technical discipline, or equivalent experience.

  • 5+ years of experience in system architecture, software development, cloud computing (e.g., AWS, Azure), and cybersecurity.

  • 3+ years of experience designing, deploying, and configuring telemetry solutions across diverse tech stacks (mainframe, midrange, cloud).

  • 5+ years of experience in observability, site reliability engineering, DevOps, or a related technical field.

  • Prior experience architecting observability solutions for high-availability, large-scale, or mission-critical environments.

  • Demonstrated leadership in driving observability initiatives and influencing engineering culture.

Preferred Qualifications

  • Master s degree in Computer Science, Engineering, Information Systems, or related field.

  • Familiarity with multiple programming languages and frameworks.

  • Certifications in cloud architecture, DevOps, or observability tools.

  • Experience with automation frameworks, Infrastructure as Code (IaC), and integrating observability with deployment pipelines.

Skills, Licenses & Certifications

  • Technical Expertise: In-depth knowledge of observability pillars (metrics, logs, traces), telemetry, and distributed tracing frameworks (e.g., OpenTelemetry, Jaeger, Zipkin).

  • Tool Proficiency: Hands-on experience with observability platforms (Dynatrace, Prometheus, Grafana, Datadog, New Relic, ELK/EFK Stack, Splunk, AppDynamics).

  • Cloud & Containerization: Experience with cloud platforms (Azure, AWS, IBM Cloud, Google Cloud Platform, OCI) and container orchestration (Kubernetes, Docker).

  • Programming & Scripting: Proficiency in at least one language for automation and tool integration (Python, Go, Java, Bash).

  • Problem-Solving: Strong analytical and troubleshooting skills with complex, distributed systems.

  • Documentation: Ability to produce clear, concise technical documentation and architectural diagrams.

  • Security Awareness: Understanding of security implications within observability pipelines and compliance requirements.

  • Advanced IT Knowledge: Expertise in monitoring, logging, AIOps, and notification systems; hands-on experience with AIOps solutions (e.g., BigPanda).

  • Resiliency Practices: Mastery of modern resiliency practices and observability s role in system reliability.

  • Infrastructure Design: Moderate knowledge in designing enterprise-grade infrastructure solutions (on-prem, hybrid, cloud, virtualization).

  • Networking: Knowledge of firewalls, routing, VPNs, and load balancers.

  • Collaboration: Excellent communication, interpersonal skills, and proven ability to work effectively with cross-functional teams.

    Our benefits package includes:

  • Comprehensive medical benefits
  • Competitive pay, 401(k)
  • Retirement plan
  • and much more!

  • About INSPYR Solutions
    Technology is our focus and quality is our commitment. As a national expert in delivering flexible technology and talent solutions, we strategically align industry and technical expertise with our clients business objectives and cultural needs. Our solutions are tailored to each client and include a wide variety of professional services, project, and talent solutions. By always striving for excellence and focusing on the human aspect of our business, we work seamlessly with our talent and clients to match the right solutions to the right opportunities. Learn more about us at inspyrsolutions.com.

    INSPYR Solutions provides Equal Employment Opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability, or genetics. In addition to federal law requirements, INSPYR Solutions complies with applicable state and local laws governing nondiscrimination in employment in every location in which the company has facilities

Information collected and processed through your application with INSPYR Solutions (including any job applications you choose to submit) is subject to INSPYR Solutions Privacy Policy and INSPYR Solutions AI and Automated Employment Decision Tool Policy: . By submitting an application, you are consenting to being contacted by INSPYR Solutions through phone, email, or text.


Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About INSPYR Solutions