Data Engineer

Overview

On Site
$60 - $70
Accepts corp to corp applications
Contract - Independent
Contract - W2
Contract - 12 month(s)
No Travel Required

Skills

ETL
spark
hadoop
CI/CD
Apache
AI/ML
Cloud SQL

Job Details

JD
We are seeking a highly skilled and experienced Data Engineer with a strong background in AI/ML to design, build, and optimize robust data pipelines and infrastructure on the Google Cloud Platform (Google Cloud Platform). The ideal candidate will be passionate about leveraging data to power machine learning initiatives, ensuring data quality, accessibility, and scalability for advanced analytics and AI applications.

MINIMUM 10 YEARS OF EXPERIENCE

Responsibilities:
Data Pipeline Development: Design, build, and maintain scalable, efficient, and reliable ETL/ELT data pipelines for batch and real-time processing using Google Cloud Platform services (e.g., Dataflow, Dataproc, Cloud Composer, Pub/Sub).
AI/ML Data Preparation: Collaborate closely with Data Scientists and Machine Learning Engineers to understand data requirements for model training, evaluation, and serving. Prepare, transform, and curate large, diverse datasets (structured, unstructured, streaming) to optimize them for AI/ML workloads.
Google Cloud Platform Ecosystem Expertise: Leverage a wide range of Google Cloud Platform data and AI/ML services, including:
Data Warehousing & Storage: BigQuery (for analytics and BigQuery ML), Cloud Storage, Cloud SQL, Cloud Bigtable.
Data Processing: Dataflow, Dataproc (Spark, Hadoop), Cloud Composer (Apache Airflow), Data Fusion.
AI/ML Services: Vertex AI (for model training, deployment, MLOps, Pipelines, Workbench, AutoML), AI Platform, TensorFlow Enterprise, Keras, PyTorch.
Data Governance & Quality: Implement and enforce data quality, security, and governance standards throughout the data lifecycle, ensuring data accuracy, consistency, and compliance with regulations.
Performance Optimization: Monitor, troubleshoot, and optimize the performance and cost-effectiveness of data pipelines and AI/ML infrastructure.
Automation & MLOps: Automate data processes, develop CI/CD pipelines for data and ML models, and contribute to MLOps best practices for seamless deployment and monitoring of AI/ML solutions.
Collaboration & Communication: Work effectively with cross-functional teams, including Data Scientists, Analysts, Software Engineers, and Product Managers, to understand data needs and deliver impactful solutions.
Innovation & Research: Stay up-to-date with the latest advancements in data engineering, AI/ML, and Google Cloud Platform technologies, continuously exploring and recommending new tools and approaches.

Qualifications:
Bachelor's or Master's degree in Computer Science, Data Engineering, or a related quantitative field.
Proven experience as a Data Engineer, with a strong focus on building data solutions for AI/ML applications.
In-depth knowledge and hands-on experience with Google Cloud Platform (Google Cloud Platform) data services (BigQuery, Dataflow, Dataproc, Cloud Storage, Cloud Composer, etc.).
Strong proficiency in programming languages such as Python (essential), and experience with Scala or Java is a plus.
Expertise in SQL and experience with various database technologies (relational, NoSQL, data warehouses).
Familiarity with machine learning concepts, algorithms, and workflows (e.g., feature engineering, model training, evaluation, deployment).
Experience with machine learning frameworks like TensorFlow or PyTorch.
Understanding of distributed systems, big data technologies, and real-time data processing.
Experience with version control systems (e.g., Git) and CI/CD practices.
Excellent problem-solving, analytical, and communication skills.
Google Cloud Professional Data Engineer certification is a strong plus
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.