Data Engineer (Python)

Overview

Remote
Depends on Experience
Accepts corp to corp applications
Contract - W2
Contract - 12 Month(s)

Skills

Python
Data pipeline building
Data Engineer
SQL & large-scale data processing
Data cleaning
transformation
validation
Automation & scripting

Job Details

Job Title: Data Engineer (Python)

Location: Remote

Duration: 12 Months

Job Description: Summary:

  • The CWs will support efforts to perform data mitigations on large scale datasets (image, video, text) leveraged by FAIR research teams. The goal is to proactively mitigate potential risks associated with these datasets.

Job Responsibilities:

  • Preprocessing: converting original datasets into a format that can be consumed by mitigation pipelines.
  • Filtering: running filtering using Integrity's pipeline.
  • Post-processing: consuming filtering results to filter in the original datasets, repackaging, and re-ingestion.
  • Optimization: identify optimization opportunities and improve the process.

Skills:

  • Software engineering skills include writing scripts to automate file processing and data transferring and creating tools to improve productivity and streamline workflows.
  • Data Management: Data pipeline building. Data processing and cleaning, transformation and formatting, data quality control and validation
  • Communication - effective communication skills to collaborate with stakeholders and team members

Must-Have Skills

  • 4+ years experience in Python.
  • Some data management experience, e.g. SQL, process large data.
  • Able to be flexible and work well in different environments with varying tasks and responsibilities
  • Background in AI building tools
  • 4+ years experience.

Education/Experience:

  • Strongly prefer CWs with prior experience at client (full 2 years preferred), as this would enable them to leverage their existing knowledge of client internal tools and processes, facilitating a faster onboarding process and more effective contribution to the FAIR mitigation efforts.

Key Skills: Python, Data pipeline building, Data Engineer, SQL & large-scale data processing, Data cleaning, transformation, validation, Automation & scripting

About VDart Group

VDart Group is a global leader in technology, product, and talent solutions, serving Fortune 500 clients in 13 countries. With over 4,000 professionals worldwide, we deliver innovation, operational excellence, and measurable outcomes across industries. Guided by our commitment to People, Purpose, and Planet, VDart is recognized with an EcoVadis Bronze Medal and as a UN Global Compact member, reflecting our dedication to sustainable practices.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About VDart, Inc.