Overview
Hybrid
Depends on Experience
Contract - W2
Contract - Independent
Contract - 12 Month(s)
Skills
STAR
Dimensional
Spark
Python
Pyspark
Databricks
Collibra
Starburst
Airflow
Lakehouse
ETL
RedShift
Snowflake
Modeling
CDP
Customer Data Platform
Customer Data
Data Engineer
S3
Lakehouse architectures
real-time databases
Big Data
on-prem
Cloud
CFS2
EDS
AWS
data pipelines
data stores
Job Details
Data Engineer III
Duration: - 12 Months
Location: - San Francisco, CA / Hybrid
Qualifications:
Role Overview: As a Data Engineer, this CW will be responsible for collecting, parsing, managing, analyzing, and visualizing large sets of data to turn information into actionable insights. They will work across multiple platforms to ensure that data pipelines are scalable, repeatable, and secure, capable of serving multiple users.
Qualifications:
- Bachelor s degree in Computer Science, Information Systems, or a related field, or equivalent experience.
- 2+ years experience with tools such as Databricks, Collibra, and Starburst.
- 3+ years experience with Python and PySpark.
- Experience using Jupyter notebooks, including coding and unit testing.
- Recent accomplishments working with relational and NoSQL data stores, methods, and approaches (STAR, Dimensional Modeling).
- 2+ years of experience with a modern data stack (Object stores like S3, Spark, Airflow, Lakehouse architectures, real-time databases) and cloud data warehouses such as RedShift, Snowflake.
- Overall data engineering experience across traditional ETL & Big Data, either on-prem or Cloud.
- Data engineering experience in AWS (any CFS2/EDS) highlighting the services/tools used.
- Experience building end-to-end data pipelines to ingest and process unstructured and semi-structured data using Spark architecture.
Responsibilities:
Key Responsibilities:
- Design, develop, and maintain robust and efficient data pipelines to ingest, transform, catalog, and deliver curated, trusted, and quality data from disparate sources into our Common Data Platform.
- Actively participate in Agile rituals and follow Scaled Agile processes as set forth by the CDP Program team.
- Deliver high-quality data products and services following Safe Agile Practices.
- Proactively identify and resolve issues with data pipelines and analytical data stores.
- Deploy monitoring and alerting for data pipelines and data stores, implementing auto-remediation where possible to ensure system availability and reliability.
- Employ a security-first, testing, and automation strategy, adhering to data engineering best practices.
- Collaborate with cross-functional teams, including product management, data scientists, analysts, and business stakeholders, to understand their data requirements and provide them with the necessary infrastructure and tools.
- Keep up with the latest trends and technologies, evaluating and recommending new tools, frameworks, and technologies to improve data engineering processes and efficiencies.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.