Overview
Hybrid
$120,000 - $130,000
Full Time
Skills
python
pyspark
DataFrames
data systems
ETL / ELT pipelines
Job Details
Title: Data Engineer (Python, Pyspark)
Location: Dallas, TX
Job Summary:
We are seeking a highly skilled Data Engineer with strong hands-on experience in Python and PySpark to design, build, and maintain large-scale data pipelines and data processing systems. The ideal candidate will have a strong foundation in distributed data processing, data modeling, and ETL development within cloud or big data ecosystems.
Key Responsibilities:
- Design, develop, and maintain data ingestion and transformation pipelines using PySpark and Python.
- Work with large, complex data sets to extract insights and enable data-driven decision-making.
- Implement and optimize ETL workflows for batch and real-time data processing.
- Collaborate with data scientists, analysts, and platform teams to ensure reliable and scalable data delivery.
- Apply data quality checks, validation rules, and error handling to maintain data integrity.
- Optimize PySpark jobs for performance, scalability, and cost efficiency.
- Integrate data from various sources such as APIs, relational databases, and cloud storage.
- Support data governance and security best practices across all pipelines.
Required Skills & Qualifications:
- 3 7 years of experience as a Data Engineer or in a similar data-focused role.
- Strong programming skills in Python and PySpark (DataFrames, RDDs, Spark SQL).
- Experience with distributed data systems (e.g., Hadoop, Spark, Databricks, or EMR).
- Proficiency in SQL for data manipulation and query optimization.
- Experience building ETL / ELT pipelines using modern data orchestration tools (e.g., Airflow, Azure Data Factory, Glue).
- Familiarity with cloud platforms (AWS, Azure, or Google Cloud Platform) and their data services.
- Strong understanding of data modeling, data partitioning, and performance tuning.
- Hands-on experience with version control (Git) and CI/CD for data workflows.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.