AWS Data Warehouse Engineer

Overview

Hybrid

Depends on Experience

Contract - W2

Contract - Independent

Contract - 12 Month(s)

Skills

STAR

Dimensional

Spark

Python

Pyspark

Databricks

Collibra

Starburst

Airflow

Lakehouse

ETL

RedShift

Snowflake

Modeling

CDP

Customer Data Platform

Customer Data

Data Engineer

Lakehouse architectures

real-time databases

Big Data

on-prem

Cloud

CFS2

EDS

AWS

data pipelines

data stores

Job Details

Data Engineer III

Duration: - 12 Months

Location: - San Francisco, CA / Hybrid

Qualifications:

Role Overview: As a Data Engineer, this CW will be responsible for collecting, parsing, managing, analyzing, and visualizing large sets of data to turn information into actionable insights. They will work across multiple platforms to ensure that data pipelines are scalable, repeatable, and secure, capable of serving multiple users.

Qualifications:

Bachelor s degree in Computer Science, Information Systems, or a related field, or equivalent experience.
2+ years experience with tools such as Databricks, Collibra, and Starburst.
3+ years experience with Python and PySpark.
Experience using Jupyter notebooks, including coding and unit testing.
Recent accomplishments working with relational and NoSQL data stores, methods, and approaches (STAR, Dimensional Modeling).
2+ years of experience with a modern data stack (Object stores like S3, Spark, Airflow, Lakehouse architectures, real-time databases) and cloud data warehouses such as RedShift, Snowflake.
Overall data engineering experience across traditional ETL & Big Data, either on-prem or Cloud.
Data engineering experience in AWS (any CFS2/EDS) highlighting the services/tools used.
Experience building end-to-end data pipelines to ingest and process unstructured and semi-structured data using Spark architecture.

Responsibilities:

Key Responsibilities:

Design, develop, and maintain robust and efficient data pipelines to ingest, transform, catalog, and deliver curated, trusted, and quality data from disparate sources into our Common Data Platform.
Actively participate in Agile rituals and follow Scaled Agile processes as set forth by the CDP Program team.
Deliver high-quality data products and services following Safe Agile Practices.
Proactively identify and resolve issues with data pipelines and analytical data stores.
Deploy monitoring and alerting for data pipelines and data stores, implementing auto-remediation where possible to ensure system availability and reliability.
Employ a security-first, testing, and automation strategy, adhering to data engineering best practices.
Collaborate with cross-functional teams, including product management, data scientists, analysts, and business stakeholders, to understand their data requirements and provide them with the necessary infrastructure and tools.
Keep up with the latest trends and technologies, evaluating and recommending new tools, frameworks, and technologies to improve data engineering processes and efficiencies.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Job Details

Share