Remote- Sr.AWSDataEngineerwithAI/ML-15+YearsCandidate

Overview

Remote
$60 - $65
Accepts corp to corp applications
Contract - W2
Contract - 12 Month(s)

Skills

AWS
Data Engineer
AI/ML

Job Details

Only 15+ Years Candidate

Role: Sr. AWS Data Engineer with AI/ML

Location: Remote

Duration: 12+ Months

Project Overview:

It s one of the workstreams of Project Acuity. PASD Data Platform

includes centralized web application for internal PASD users across the

Recruitment Business to support marketing and operational use cases.

Building a database at the patient level will provide significant

benefit to PASD s future reporting capabilities and engagement of

external stakeholders.

Role Scope / Deliverables:

As a Sr Data Engineer on the AWS Cloud team, you will be responsible for

providing design and development of data ingestion pipelines from

disparate data sources into the cloud. You will lead the delivery of the

data products, leveraging Cloud Native strategies and best practices,

drawing from 15+ years of IT experience.

Must have:

15 years of experience in design and delivery of

Distributed Systems capable of handling petabytes of data in a

distributed environment.

10 years of experience in the development of Data Lakes

with Data Ingestion from disparate data sources, including relational

databases, flat files, APIs, and streaming data.

Experience in providing Design and development of Data

Platforms and data ingestion from disparate data sources into the cloud.

Expertise in core AWS Services including AWS IAM, VPC,

EC2, EKS/ECS, S3, RDS, DMS, Lambda, CloudWatch, CloudFormation,

CloudTrail, CloudWatch.

Proficiency in programming languages like Python and

PySpark to ensure efficient data processing. preferably Python.

Architect and implement robust ETL pipelines using AWS

Glue, defining data extraction methods, transformation logic, and data

loading procedures across different data sources

15 years of Experience in using IaC tools like Terraform

etc.

10 years of experience in development of CI/CD pipelines

(GitHub Actions, Jenkins).

Experience in the development of Event-Driven Distributed

Systems in the Cloud using Serverless Architecture.

Ability to work with Infrastructure team for AWS service

provisioning for databases, services, network design, IAM roles and AWS

cluster.

2-3 years of experience working with Document DB.

Ability to design, orchestrate and schedule jobs using

Airflow.

Knowledge of AWS AI Services like AWS Entity Resolution,

AWS Comprehend.

Ability to run custom LLMs using Amazon SageMaker.

Ability to use Large Language Models (LLMs) for Data

Classification and Identification of PII data entities

Nice to have Skills:

10 years of experience in the development of Data Audit,

Compliance and Retention standards for Data Governance, and automation

of the governance processes.

Experience in data modelling with NoSQL Databases like

Document DB.

Experience in using column-oriented data file format like

Apache Parquet, and Apache Iceberg as the table format for analytical

datasets.

Expertise in development of Retrieval-Augmented

Generation (RAG) and Agentic Workflows for providing context to LLMs

based on proprietary enterprise data.

Ability to develop re-ranking strategies using results

from Index and Vector stores for LLMs to improve the quality of the

output.

Thanks

Yashasvi Hasija

Technical Recruiter | Empower Professionals

......................................................................................................................................

|

LinkedIn: linkedin.com/in/yashasvi-hasija-6a745625b

100 Franklin Square Drive Suite 104 | Somerset, NJ 08873

Certified NJ and NY Minority Business Enterprise (NMSDC)

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.