Grafana Jobs in Dallas, TX

1 - 20 of 193 Jobs

Senior SRE/Observability Engineer

VDart, Inc.

Irving, Texas, USA

Contract, Third Party

Role: Senior SRE/Observability Engineer Location: Irving, TX (Onsite) Type: Contract Responsibilities: Design, develop and manage observability solutions, including metric identification/validation, centralizing in GEM/Prometheus & visualizing in Grafana dashboards. Write and manage complex queries and alert definitions. Bridge the gap between Operations Support teams and SRE operations. Configure and manage monitoring, alerts, and observability using a range of tools including GEM, Splunk, Ne

Sr. Kubernetes Engineer on W2

Divit Technologies, Inc.

Dallas, Texas, USA

Full-time

Preferred Qualifications - Education & Prior Job Experience: Azure, AWS, or Kubernetes Technical Certifications. 10+ years working with Microsoft Azure Kubernetes Service (AKS) or Amazon Elastic Kubernetes Service (EKS), specifically administering and maintaining cluster lifecycle. Skills, Licenses & Certifications: Experience keeping production environments operating at peak performance on the cloud and in containers. Experience managing production Kubernetes infrastructure. Hands-on experience with infra

Chaos Engineer

Zenox Global, LLC

Arlington, Texas, USA

Full-time

Title- Chaos Engineer Location- Arlington, Texas- USA (Day 1 Onsite) Full-Time/Direct Hire Key Responsibilities: Chaos Testing and Experimentation: Design and execute chaos engineering experiments to identify weaknesses in systems and improve resilience. System Analysis: Analyze system behavior under stress conditions and develop strategies to mitigate potential failures. Performance Monitoring: Continuously monitor system performance, identify vulnerabilities, and implement improvements to enh

Chaos Engineer - Fulltime only

EdHike, LLC

Arlington, Texas, USA

Full-time

Hi Team, we are looking for a full-time Chaos Engineer. Location- Arlington, Texas- USA (Day 1 Onsite) Type- FTE only USA Residents Key Responsibilities: Chaos Testing and Experimentation: Design and execute chaos engineering experiments to identify weaknesses in systems and improve resilience. System Analysis: Analyze system behavior under stress conditions and develop strategies to mitigate potential failures. Performance Monitoring: Continuously monitor system performance, identify vulnerab

CDN Observability Engineer

Exafluence

Plano, Texas, USA

Full-time

Design and deliver observability platforms that empower application owners to monitor CDN performance and behavior. This role emphasizes solution design and development, with deep technical knowledge in CDN and networking. Senior level resources with 10-15 years of experience. Key Responsibilities: Lead solution design to build observability frameworks for CDN platforms. Develop tools and dashboards that provide transparency into CDN usage and performance. Collaborate with application owners to

ITSM Change Manager

Kforce Technology Staffing

Westlake, Texas, USA

Contract

RESPONSIBILITIES: Kforce has a client that is seeking an ITSM Change Manager in Westlake, TX. Duties include: * Demonstrates an understanding of policies, procedures, standards, environment, and underlying importance/need of change management processes * Reviews and assesses the content of change requests to ensure required details are included * Participates in change review and Change Advisory Board meetings * Facilitates and provides oversight for systems installation, code deployment and infr

Site Reliability and Operations Engineer (SRE)

Genesis10

Irving, Texas, USA

Full-time

Genesis10 is currently seeking a Site Reliability and Operations Engineer (SRE) with our client in the financial industry located in Irving, TX. This is a 12+ month contract position. Responsibilities: Design, develop, and optimize distributed caching and compute grid solutions on Kubernetes/OpenShift. Understanding of microservices and containerized workloads using Kubernetes, Docker, and Helm. Implement high-throughput compute grid solutions using IBM Spectrum Symphony, Tibco Grid Server or simi

DevOps/SRE Engineer

Judge Group, Inc.

Irving, Texas, USA

Full-time

Location: Irving, TX Salary: $67.00 USD Hourly - $72.00 USD Hourly Description: Job Description: Site Reliability and Operations Engineer (SRE) Location: Irving, TX About the Role: As a Site Reliability and Operations Engineer (SRE) at our company, you will consult on complex initiatives with broad impact and large-scale planning for Systems Operations Engineering. You will review and analyze multifaceted, larger-scale, or longer-term Systems Operations Engineering challenges that require

Senior Full Stack Developer

Pepsico

Plano, Texas, USA

Full-time

Overview We are seeking an experienced Senior Developer (10+ years) to lead the design and development of a scalable, enterprise-grade observability platform. This role requires deep technical expertise in building real-time monitoring and data visualization solutions across complex, distributed systems. As a Senior Developer, you will play a key role in shaping the architecture, driving performance optimizations, and ensuring seamless integration of observability tools with our platform infras

Systems Engineer

Kforce Technology Staffing

Westlake, Texas, USA

Contract

RESPONSIBILITIES: Kforce has a client that is seeking a Systems Engineer in Westlake, TX. REQUIREMENTS: * Bachelor's degree or higher in a technology related field (e.g. Engineering, Computer Science, etc.) required, Master's degree a plus * Proven experience with monitoring and management tools (Splunk, Datadog, Catchpoint, Grafana, AWX/Ansible, etc.) and building automation * Experience using CI/CD Tools (Jenkins, uDeploy) and the backend code implemented by it * Experience with building and

CTO Associate Manager (L09)

Pepsico

Plano, Texas, USA

Full-time

Overview We are looking for a seasoned SaaS Platform Lead & Operations Expert to lead the operational oversight, stakeholder engagement, and business alignment of our enterprise data and observability platform. This role is pivotal in bridging the gap between DataOps execution and business expectations by enabling transparent communication, proactive issue identification, and operational excellence across complex data ecosystems. The ideal candidate will not only bring deep technical expertise

Sr API Engineer

Southern Glazer's Wine & Spirits

Dallas, Texas, USA

Full-time

What You Need To Know Open the door to a groundbreaking tech career with an industry leader. Southern Glazer's Wine & Spirits is North America's preeminent wine and spirits distributor, as well as a family-owned, privately held company with a 50+ year legacy of success. To create a new era in alcohol beverage sales and service, we're heavily invested in the most transformative new technologies - and the most brilliant tech professionals. Southern Glazer's was named by Newsweek as a Most Loved W

LDAP Directory Engineer

Judge Group, Inc.

Dallas, Texas, USA

Full-time

Location: Dallas, TX Description: Our client is currently seeking an LDAP Directory Engineer. LDAP Directory Engineering: Design, deploy, and maintain LDAP directory infrastructure (e.g., OpenLDAP, PingDirectory). Configure directory schemas, manage directory trees, and enforce robust access control policies. Monitor directory performance, troubleshoot issues, and apply necessary upgrades or patches. Implement replication, synchronization, and high-availability solutions to ensure directory servic

Lead SQL/Python Architect- Only W2 Resources (hybrid)

Nasscomm, Inc.

Remote

Contract

Creating reusable Python libraries and frameworks to be used by developers firm-wide. Working with DevOps engineers to implement CI/CD pipelines for promotion of Python code. Create documentation including best practices for stability, monitoring and observability of Python code using tools like Splunk, Dynatrace, Grafana, etc. Assist with packaging and distributing Python software and tools. Proactively seek process and technical improvements and take the initiative to communicate and implement s

UI Automation Engineer (JavaScript)

Xoriant Corporation

Remote

Contract

Extensive expertise with test management, collaboration, and reporting tools such as Jira, Confluence, Spira, Grafana, or equivalent platforms. Solid experience with both cloud-native and traditional test automation tools for web applications, including Mabl and TestComplete. Strong programming background in JavaScript and VBScript. Comprehensive hands-on experience in developing test automation frameworks and automating test cases for unit, functional, performance, and API testing. Proven experience in

Site Reliability Engineer

Raas Infotek LLC

Remote

Contract

Position: Site Reliability Engineer Type: W2 Contract We're looking for a Site Reliability Engineer (SRE) to ensure the performance, scalability, and reliability of our production systems. This role blends software engineering with infrastructure expertise to drive automation, monitoring, and continuous improvement across cloud platforms. Responsibilities: Build and maintain infrastructure-as-code and CI/CD pipelines. Develop monitoring, alerting, and incident response tools. Ensure high availability

MuleSoft API Principal Engineer

Western Alliance Bank

Dallas, Texas, USA

Full-time

Job Title: MuleSoft API Principal Engineer Location: TX - Dallas/Irving What you'll do: We are seeking a highly experienced MuleSoft Principal Engineer II with 12+ years of expertise in API architecture, design, development, and full lifecycle management to lead the Payments API ecosystem. This role requires deep technical knowledge of MuleSoft, along with advanced API Management and API Security for Banking-as-a-Service (BaaS). The ideal candidate will be responsible for optimizing vCore uti

Data Engineer/Sr Data Engineer, IT Analytics

American Airlines Inc

Dallas, Texas, USA

Full-time

Location: DFW Headquarters Building 8 (DFW-SV08) Cities: Requisition ID: 78905 Job Description Intro Are you ready to explore a world of possibilities, both at work and during your time off? Join our American Airlines family, and you'll travel the world, grow your expertise and become the best version of you. As you embark on a new journey, you'll tackle challenges with flexibility and grace, learning new skills and advancing your career while having the time of your life. Feel free to enrich

Site Reliability Engineer

Ztek Consulting

Remote

Full-time

Note: This is a full-time and remote opportunity. Role: Site Reliability Engineer Work location: Remote Job Description Site Reliability Engineer Must Have Technical/Functional Skills Experience in Cloud platforms (AWS, Azure, Google Cloud) and hybrid environments. Proficiency in container technologies (Docker, Container, Podman). Strong knowledge of Linux administration and networking concepts. Experience with Infrastructure as Code (IaC) tools like Terraform, Ansible, Helm, or Pulumi. Monitoring

DevOps Engineer

Kforce Technology Staffing

Remote or Atlanta, Georgia, USA

Contract

RESPONSIBILITIES: Kforce has a client that is seeking a Fully Remote DevOps Engineer to join their team. This candidate will be specifically focused on more of the legacy tandem system and will mainly be working on the pure operations side (change validations, incident response, monitoring (Grafana, Prometheus, etc.), troubleshooting, etc.). Google Cloud Platform experience is a plus, but a lot of the work will be in Linux along with monitoring. Overall, we are looking for someone who has been