Job Details
This role supports data engineering, machine learning, and analytics initiatives within an organization that relies on large-scale data processing.
Duties include:
- Designing and developing scalable data pipelines
- Implementing ETL/ELT workflows
- Optimizing Spark jobs
- Integrating with Azure Data Factory
- Automating deployments
- Collaborating with cross-functional teams
- Ensuring data quality, governance, and security
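To illustrate the kind of data-validation work the duties above describe, here is a minimal, hypothetical sketch of a row-level quality check. The function name, field names, and record shape are illustrative assumptions, not part of the role's actual codebase; in practice this logic would typically run inside a Spark or Databricks pipeline.

```python
# Hypothetical sketch of a data validation / quality check step.
# Records are assumed to be simple dicts; real pipelines would use
# Spark DataFrames, but the splitting logic is the same idea.

def validate_records(records, required_fields):
    """Split records into (valid, rejected) based on required fields."""
    valid, rejected = [], []
    for rec in records:
        # A record fails if any required field is missing or empty.
        missing = [f for f in required_fields if rec.get(f) in (None, "")]
        (rejected if missing else valid).append(rec)
    return valid, rejected

rows = [
    {"id": 1, "amount": 9.5},
    {"id": 2, "amount": None},  # fails the check: amount is missing
]
good, bad = validate_records(rows, ["id", "amount"])
```

Rejected rows would normally be routed to a quarantine table for review rather than silently dropped, preserving data lineage for governance purposes.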
CANDIDATE SKILLS AND QUALIFICATIONS
Minimum Requirements:

| Years | Required/Preferred | Experience |
|---|---|---|
| 4 | Required | Implement ETL/ELT workflows for both structured and unstructured data |
| 4 | Required | Automate deployments using CI/CD tools |
| 4 | Required | Collaborate with cross-functional teams including data scientists, analysts, and stakeholders |
| 4 | Required | Design and maintain data models, schemas, and database structures to support analytical and operational use cases |
| 4 | Required | Evaluate and implement appropriate data storage solutions, including data lakes (Azure Data Lake Storage) and data warehouses |
| 4 | Required | Implement data validation and quality checks to ensure accuracy and consistency |
| 4 | Required | Contribute to data governance initiatives, including metadata management, data lineage, and data cataloging |
| 4 | Required | Implement data security measures, including encryption, access controls, and auditing; ensure compliance with regulations and best practices |
| 4 | Required | Proficiency in Python and R programming languages |
| 4 | Required | Strong SQL querying and data manipulation skills |
| 4 | Required | Experience with Azure cloud platform |
| 4 | Required | Experience with DevOps, CI/CD pipelines, and version control systems |
| 4 | Required | Working in agile, multicultural environments |
| 4 | Required | Strong troubleshooting and debugging capabilities |
| 3 | Required | Design and develop scalable data pipelines using Apache Spark on Databricks |
| 3 | Required | Optimize Spark jobs for performance and cost-efficiency |
| 3 | Required | Integrate Databricks solutions with cloud services (Azure Data Factory) |
| 3 | Required | Ensure data quality, governance, and security using Unity Catalog or Delta Lake |
| 3 | Required | Deep understanding of Apache Spark architecture, RDDs, DataFrames, and Spark SQL |
| 3 | Required | Hands-on experience with Databricks notebooks, clusters, jobs, and Delta Lake |
| 1 | Preferred | Knowledge of ML libraries (MLflow, Scikit-learn, TensorFlow) |
| 1 | Preferred | Databricks Certified Associate Developer for Apache Spark |
| 1 | Preferred | Azure Data Engineer Associate |