W2 Contract Opportunity
No H1B, OPT, CPT
We currently have an opening for a Data Engineer with our client in Alpharetta, GA (on-site). I appreciate your time and look forward to hearing from you.
Data Engineer
Alpharetta, GA (On-Site Only)
Our client is looking for a Statistical Consultant/Data Engineer to join its world-class Global Identity and Fraud Analytics team. In this exciting role, you will have the opportunity to work on a variety of challenging projects across multiple industries, including Financial Services, Telecommunications, eCommerce, Healthcare, Insurance, and Government.
What You'll Do:
- Work with the data science team to migrate analytical data and projects to the Google Cloud Platform environment and ensure a smooth project transition.
- Prepare and build data and analytical automation pipelines for self-serve machine learning projects: gather data from multiple sources and systems; integrate, consolidate, and cleanse the data; and structure it for use by our clients in our client-facing projects.
- Design and code analysis scripts that run on Google Cloud Platform using BigQuery/Python/Scala and leverage multiple core data sources.
Qualifications: 3-5 years of professional data engineering or data wrangling experience in
- working with a Hadoop-based or cloud-based big data management environment
- Bash scripting or similar experience for data movement and ETL
- Big data queries in Hive/Impala/Pig/BigQuery (proficiency with the BigQuery API libraries for data-prep automation is a plus)
- Advanced Python programming, including PySpark (Scala is a plus), with strong coding experience; proficiency in Data Studio and Bigtable, and working experience with GitHub (Cloud Composer and Dataflow are a plus)
- Basic Google Cloud Platform certification is a plus
- Knowledge of Kubernetes (or other Google Cloud Platform native container-orchestration tools for automating application deployment, scaling, and management) is a plus
- Basic knowledge of machine learning (ensemble machine learning models, unsupervised machine learning models) with experience using TensorFlow and PyTorch is a plus
- Basic knowledge of graph mining and graph data models is a plus
- Understanding of best practices for data management, maintenance, and reporting, and the ability to use that knowledge to implement improvements in our solutions
What You'll Do
- Build automated ML/AI modules, jobs, and data preparation pipelines by gathering data from multiple sources and systems; integrating, consolidating, and cleansing data; and structuring data and analytical procedures for use by our clients in our solutions.
- Perform design, creation, and interpretation of large and highly complex datasets.
- Consult with internal and external clients to understand the business requirements in order to successfully build datasets and implement complex big data solutions (under senior lead's supervision).
- Work with Technology and D&A teams to review, understand, and interpret the business requirements, and design and build missing functionality to support identity and fraud analytics needs (under senior lead's supervision).
- Work on the end-to-end interpretation, design, creation, and build of large and highly complex analytics-related capabilities (under senior lead's supervision).
- Strong oral and written communication skills, and the ability to collaborate with cross-functional partners.
Things that would stand out on a resume:
1- Master's Degree in Computer Science & Data Science
2- Previous Company: any bank or eCommerce company