SME Data Engineer

  • Posted 12 days ago | Updated 12 days ago

Overview

Remote
Depends on Experience
Full Time

Skills

develop
and maintain scalable data pipelines and architectures to support data extraction
transformation
and loading (ETL/ELT) processes. Utilize strong SQL skills to perform complex data transformations and optimize database queries
ensuring high performance and efficiency. Building comprehensive datasets by aggregating data sourced from various relational databases
facilitating data analysts and data scientists in creating machine learning models
reports
and dashboards. Collaborate with cross-functional teams (data analysts
data scientists
and business stakeholders) to understand business requirements and translate them into technical solutions. Assist with the implementation of data migration/pipelines from on-prem to cloud/non-relational storage platforms. Leverage distributed computing frameworks like Apache Spark to process large volumes of data efficiently. Utilizing data analysis
problem-solving
investigative
and creative thinking skills to handle extremely large datasets
transforming them into various formats for diverse analytical products. Respond to data queries/analysis requests from various groups within an organization. Create and publish regularly scheduled and/or ad hoc reports as needed. Troubleshoot data-related issues
identify root causes
and implement solutions to ensure data integrity and accuracy. implement best practices for data governance
security
and quality supporting the core business applications. Responsible for data engineering source code control using GitLab

Job Details

Each day U.S. Customs and Border Protection (CBP) oversees the massive flow of people, capital, and products that enter and depart the United States via air, land, sea, and cyberspace. The volume and complexity of both physical and virtual border crossings require the application of solutions to promote efficient trade and travel. Further, effective solutions help CBP ensure the movement of people, capital, and products is legal, safe, and secure. CBP seeks capable, qualified, and versatile SME Data Engineers to help develop complex data analytical solutions for law enforcement personnel to assess risk of potential threats entering the country.


Requirements

Responsibilities include, but are not limited to:

  • Design, develop, and maintain scalable data pipelines and architectures to support data extraction, transformation, and loading (ETL/ELT) processes. Utilize strong SQL skills to perform complex data transformations and optimize database queries, ensuring high performance and efficiency.
  • Building comprehensive datasets by aggregating data sourced from various relational databases, facilitating data analysts and data scientists in creating machine learning models, reports, and dashboards.
  • Collaborate with cross-functional teams (data analysts, data scientists, and business stakeholders) to understand business requirements and translate them into technical solutions.
  • Assist with the implementation of data migration/pipelines from on-prem to cloud/non-relational storage platforms.
  • Leverage distributed computing frameworks like Apache Spark to process large volumes of data efficiently.
  • Utilizing data analysis, problem-solving, investigative, and creative thinking skills to handle extremely large datasets, transforming them into various formats for diverse analytical products.
  • Respond to data queries/analysis requests from various groups within an organization. Create and publish regularly scheduled and/or ad hoc reports as needed.
  • Troubleshoot data-related issues, identify root causes, and implement solutions to ensure data integrity and accuracy.
  • implement best practices for data governance, security, and quality supporting the core business applications.

Responsible for data engineering source code control using GitLab

About CMCI