AWS Data Lake House Engineer

Overview

Remote
$DOE
Contract - W2
Contract - Long Term

Skills

AWS
R
Data Lake

Job Details

Role: AWS Data Lake House Engineer

Location: Remote

Duration: 12 months

Job Description:

Client seeks to implement a Data Lake House on the AWS Commercial cloud. This scope of work will deploy previously created templates from a prior work effort and implement additional features as defined by the State. This SOW will implement a data lake that is scalable for use across State agencies and will support the secure storage and analytics of sensitive data such as HIPAA, MARS-E, PHI, PII, and other confidential data. ADS will work with the Contractor's Amazon Web Service (AWS) professional services team such as their data engineer, platform engineer, and solution architect, to design and implement the Lakehouse environment.

REQUIREMENTS:

Implement a Data Lake House in the AWS Commercial environment as a template solution that will serve as the model for the next generation of Vermont's data platform, building on

existing work available as templates and deployed in AWS Gov for the Department of Public Safety. The following features are implemented other than adjustments to take advantage of features available in Commercial that are unavailable in Gov.

Required Skills:

Design and implement a data lake in AWS that is compliant with requirements and best practices protecting sensitive data, including health data and other sensitive data, in collaboration with the Vermont Technical Lead, using the latest Lake House standards and best practices.

The lake house will be implemented in a way that the State can maintain, enhance, and expand it on its own.

The lake house will be implemented using templates that the State can use to refine and implement its data strategy in other domains.

The lake house will be implemented using a medallion architecture.

All layers will be scalable as needed to ensure capabilities and cost can be tuned.

The State prefers to use Python-based solutions as much as possible, while also providing support for R and other data management languages.

Outputs of the development work are not intended be used for mission critical operational decision support and control system automation at this time. Data will be ingested in a read-only manner and will be aggregated for reporting and analytical purposes only.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.