only W2 :: 100% remote :: QA/SDET Lead :: Azure Devops and AI/ML Strong

Overview

Remote

45 - 50

Contract - Independent

Contract - W2

Contract - 12 Month(s)

No Travel Required

Unable to Provide Sponsorship

Skills

API

Artificial Intelligence

Automated Testing

Behavior-driven Development

Cloud Computing

Continuous Delivery

Continuous Integration

Continuous Integration and Development

Cucumber

Data Validation

Database QA

Databricks

DevOps

Evaluation

Integration Testing

Log Analysis

Machine Learning (ML)

Machine Learning Operations (ML Ops)

Microsoft Azure

Performance Metrics

Python

Reporting

Streaming

Testing

Workflow

YAML

Job Details

Experience level: 10 + years

Must have skills: SDET skills, Azure devops, python, pytest test automation and Framework, Al ML practices, RAGAS, LLMS

Work Location: Anywhere in US, Supporting during PST hours.

Onsite/Remote: Remote

Brief JD:

Experience in designing LLM/RAG test automation solutions

Experience in testing for bias, drift, and fairness.

Familiarity with performance metrics (precision, recall, F1, ROC-AUC).

Knowledge of MLflow MLOps framework

Knowledge of tools for synthetic data generation and boundary testing

Knowledge of Azure DevOps for CI/CD pipeline development

Experience in RAG/pipeline evaluation frameworks -

Knowledge of Azure DevOps for CI/CD pipeline development

Experience in RAG/pipeline evaluation frameworks -

pytest: Test automation framework

DeepEval or TruLens: LLM test assertions

RAGAS: RAG-specific metrics

0 Eleuther: LLM evaluation harness

Ο Garak or Promptfoo: LLM red-teaming

Evidently: Drift/performance monitoring

Exposure to explainability frameworks (SHAP, LIME, Captum)

Handson experience in API and Database testing.

Practical knowledge of Databricks, Azure Cloud services land distributed data validation.

Proficiency with Azure DevOps pipelines YAML templates, agent pools, CICD workflows.

Experience in implementing AIML practices in testing e.g., test generation, anomaly detection, log analysis to improve test efficiency and coverage.

Familiarity with Cucumber (BDD) and test reporting frameworks (e.g., Allure).

Strong understanding of integration testing across Databricks streaming jobs, applications

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Job Details

Share