only W2 :: 100% remote :: QA/SDET Lead :: Azure Devops and AI/ML Strong

  • Posted 2 hours ago | Updated 2 hours ago

Overview

Remote
45 - 50
Contract - Independent
Contract - W2
Contract - 12 Month(s)
No Travel Required
Unable to Provide Sponsorship

Skills

API
Artificial Intelligence
Automated Testing
Behavior-driven Development
Cloud Computing
Continuous Delivery
Continuous Integration
Continuous Integration and Development
Cucumber
Data Validation
Database QA
Databricks
DevOps
Evaluation
Integration Testing
JD
Log Analysis
Machine Learning (ML)
Machine Learning Operations (ML Ops)
Microsoft Azure
Performance Metrics
Python
Reporting
Streaming
Testing
Workflow
YAML

Job Details

Experience level: 10 + years

Must have skills: SDET skills, Azure devops, python, pytest test automation and Framework, Al ML practices, RAGAS, LLMS

Work Location: Anywhere in US, Supporting during PST hours.

Onsite/Remote: Remote

Brief JD:

Experience in designing LLM/RAG test automation solutions

Experience in testing for bias, drift, and fairness.

Familiarity with performance metrics (precision, recall, F1, ROC-AUC).

Knowledge of MLflow MLOps framework

Knowledge of tools for synthetic data generation and boundary testing

Knowledge of Azure DevOps for CI/CD pipeline development

Experience in RAG/pipeline evaluation frameworks -

Knowledge of Azure DevOps for CI/CD pipeline development

Experience in RAG/pipeline evaluation frameworks -

pytest: Test automation framework

DeepEval or TruLens: LLM test assertions

RAGAS: RAG-specific metrics

0 Eleuther: LLM evaluation harness

Ο Garak or Promptfoo: LLM red-teaming

Evidently: Drift/performance monitoring

Exposure to explainability frameworks (SHAP, LIME, Captum)

Handson experience in API and Database testing.

Practical knowledge of Databricks, Azure Cloud services land distributed data validation.

Proficiency with Azure DevOps pipelines YAML templates, agent pools, CICD workflows.

Experience in implementing AIML practices in testing e.g., test generation, anomaly detection, log analysis to improve test efficiency and coverage.

Familiarity with Cucumber (BDD) and test reporting frameworks (e.g., Allure).

Strong understanding of integration testing across Databricks streaming jobs, applications

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.