Overview
On Site
USD 90,501.00 per year
Contract - W2
Skills
Scalability
IaaS
Specification Gathering
Business Intelligence
Analytics
Collaboration
Documentation
Data Integration
Workflow
Regulatory Compliance
Data Governance
Meta-data Management
Data Quality
Data Engineering
Data Processing
Apache Spark
Amazon Web Services
Electronic Health Record (EHR)
Performance Tuning
Extract
Transform
Load
Data Lake
Amazon Redshift
Python
PySpark
SQL
Informatica
Data Management
Cloud Computing
Job Details
** The quickest way to be considered for this role is to CALL US directly! Click "Apply On Web" or "Apply Now" to access our Recruiter s contact details and give us a call today! **
===
** We will NOT accept 3rd Party (C2C) Contractors **
===
Position:AWS Data Engineer (IDMC)
Job Ref #:44042 - JIH5JP00003721
Duration:12+ Months (On-going Contract)
Location:ONSITE - Torrance, CA 90501
Pay Rate:$74.00 per hour (W2 ONLY)
HIRING REQUIREMENTS:
** NOT accepting any 3rd Party C2C candidates for any reason**
** NOT accepting candidates who are on work visa's, such asor H1B**
** NOT accepting candidates who live more than 45 miles from Torrance
** NOT accepting candidates willing to relocate (LOCAL CANDIDATES Only)**
MON-FRI - This role is 100% ONSITE, no remote work offered.
DAILY TASKS:
Develop and Maintain Data Integration Solutions:
. Design and implement data integration workflows using AWS Glue/EMR, AWS MWAA(Airflow), Lambda, Redshift
. Demonstrate proficiency in PySpark, Apache Spark and Python for data processing large datasets
. Ensure data is accurately and efficiently extracted, transformed, and loaded into target systems.
Ensure Data Quality and Integrity:
. Validate and cleanse data to maintain high data quality.
. Ensure data quality and integrity by implementing monitoring, validation, and error handling mechanisms within data pipelines
Optimize Data Integration Processes:
. Enhance the performance, optimization of data workflows to meet SLAs, scalability of data integration processes and cost-efficiency on AWS cloud infrastructure.
. Identify and resolve performance bottlenecks, fine-tuning queries, and optimizing data processing to enhance Redshift's performance
. Regularly review and refine integration processes to improve efficiency.
Support Business Intelligence and Analytics:
. Translate business requirements to technical specifications and coded data pipelines
. Ensure timely availability of integrated data for business intelligence and analytics.
. Collaborate with data analysts and business stakeholders to meet their data requirements.
Maintain Documentation and Compliance:
. Document all data integration processes, workflows, and technical & system specifications.
. Ensure compliance with data governance policies, industry standards, and regulatory requirements.
. Scan, Profile Metadata Data Catalog and Data Observability
. Define Data Quality Rules in Informatica Cloud.
REQUIRED EXPERIENCE:
. This person will require AWS 80% and Informatica 20% skills.
. 5+ years of experience in Data Engineering
. Strong experience with distributed data processing (Spark, AWS Glue, EMR, or equivalent).
. Hands-on expertise with ETL pipelines, and performance optimization.
. Strong hands-on expertise in building and optimizing ETL pipelines into Data Lake and Amazon Redshift
. Proficiency in Python, PySpark and SQL; familiarity with Iceberg tables preferred
. 2+ years Informatica Cloud experience (IDMC- Informatica Intelligent Data Management Cloud )
==
==
Calance Consultant Benefits Offerings:
- EPO/PPO Medical Plans
- HMO/PPO Dental programs
- Vision - VSP (Vision Plan Summary)
- 401K Retirement vesting program (VOYA)
- Paid Bi-Weekly/Direct Deposit
- Flex Spending Plan
- Voluntary Life, AD&D, STD or LTD plans
===
** We will NOT accept 3rd Party (C2C) Contractors **
===
Position:AWS Data Engineer (IDMC)
Job Ref #:44042 - JIH5JP00003721
Duration:12+ Months (On-going Contract)
Location:ONSITE - Torrance, CA 90501
Pay Rate:$74.00 per hour (W2 ONLY)
HIRING REQUIREMENTS:
** NOT accepting any 3rd Party C2C candidates for any reason**
** NOT accepting candidates who are on work visa's, such asor H1B**
** NOT accepting candidates who live more than 45 miles from Torrance
** NOT accepting candidates willing to relocate (LOCAL CANDIDATES Only)**
MON-FRI - This role is 100% ONSITE, no remote work offered.
DAILY TASKS:
Develop and Maintain Data Integration Solutions:
. Design and implement data integration workflows using AWS Glue/EMR, AWS MWAA(Airflow), Lambda, Redshift
. Demonstrate proficiency in PySpark, Apache Spark and Python for data processing large datasets
. Ensure data is accurately and efficiently extracted, transformed, and loaded into target systems.
Ensure Data Quality and Integrity:
. Validate and cleanse data to maintain high data quality.
. Ensure data quality and integrity by implementing monitoring, validation, and error handling mechanisms within data pipelines
Optimize Data Integration Processes:
. Enhance the performance, optimization of data workflows to meet SLAs, scalability of data integration processes and cost-efficiency on AWS cloud infrastructure.
. Identify and resolve performance bottlenecks, fine-tuning queries, and optimizing data processing to enhance Redshift's performance
. Regularly review and refine integration processes to improve efficiency.
Support Business Intelligence and Analytics:
. Translate business requirements to technical specifications and coded data pipelines
. Ensure timely availability of integrated data for business intelligence and analytics.
. Collaborate with data analysts and business stakeholders to meet their data requirements.
Maintain Documentation and Compliance:
. Document all data integration processes, workflows, and technical & system specifications.
. Ensure compliance with data governance policies, industry standards, and regulatory requirements.
. Scan, Profile Metadata Data Catalog and Data Observability
. Define Data Quality Rules in Informatica Cloud.
REQUIRED EXPERIENCE:
. This person will require AWS 80% and Informatica 20% skills.
. 5+ years of experience in Data Engineering
. Strong experience with distributed data processing (Spark, AWS Glue, EMR, or equivalent).
. Hands-on expertise with ETL pipelines, and performance optimization.
. Strong hands-on expertise in building and optimizing ETL pipelines into Data Lake and Amazon Redshift
. Proficiency in Python, PySpark and SQL; familiarity with Iceberg tables preferred
. 2+ years Informatica Cloud experience (IDMC- Informatica Intelligent Data Management Cloud )
==
==
Calance Consultant Benefits Offerings:
- EPO/PPO Medical Plans
- HMO/PPO Dental programs
- Vision - VSP (Vision Plan Summary)
- 401K Retirement vesting program (VOYA)
- Paid Bi-Weekly/Direct Deposit
- Flex Spending Plan
- Voluntary Life, AD&D, STD or LTD plans
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.