Lead Data Engineer / Architect

Overview

On Site
Hybrid
Depends on Experience
Contract - W2

Skills

Hadoop
Python

Job Details

Job Role: Lead Data Scientist/ Engineer / Architect (8 + yrs)

location: Pittsburgh, PA/ Philadelphia, PA/ Cleveland, OH/ Columbus, OH/ New Jersey, NJ DC/ Atlanta, GA/ Raleigh NC/ Dallas, TX

Type: Hybrid ( 1-2 days a week onsite)

Candidate Technical and skills profile:

  • 6+ years of financial solutions architecture, software development, data engineering, data science or business intelligence engineering experience with minimum 3 Years recent hands-on experience in PySpark
  • 3+ year of experience with Machine Learning code development
  • Deep knowledge of Hadoop ecosystem and Big Data technologies such as Spark, Hive, Hbase, Oozie, Kafka, YARN, SLURM
  • Spark query tuning and performance optimization
  • Experience and good understanding of Apache Spark Data sources API
  • Advanced experience in Python and common python libraries/ Scala/ Java
  • Strong analytical experience with database in writing complex queries, query optimization, debugging, user-defined functions, views, indexes, etc.
  • Strong working experience with source control systems such as Git, Bitbucket, and Jenkins build and continuous integration tools.
  • Experience working with Microservices, Rest API and Oauth
  • Experience working with one or more Agile development methods
  • proven consulting and delivery leadership in data transformation, data modeling, data analytics, data visualization and/or data science

Must have technical skills/experience (ask for alternative/tool/version):

  • Hadoop
  • PySpark/ Python
  • Graph database (Neo4J)
  • Databricks (1 Azure, 2 AWS, 3 Google)

Flex Skills:

  • 2+ years of experience with a public cloud (AWS, Microsoft Azure)
  • 4+ years of experience with NoSQL implementation (Mongo, Cassandra)
  • 1+ year of experience with process orchestration including AirFlow, KubeFlow
  • Data lake and Delta lake experience
  • Familiarity with Metadata Management, Data Quality frameworks and Data as a Service concepts a big plus
  • Banking or financial services experience is a big plus

Soft skills that would make a candidate successful in this role:

  • Demonstrated leadership abilities with track record of driving results through cross-functional organization
  • Ability to manage multiple projects with tight timelines
  • Highly motivated, self-directed
  • Strong analytical and problem-solving skills, written and verbal communication skills, and project management skills

Degrees or certifications for the candidate to be successful:

  • Bachelor's degree in Computer Science, Engineering, Statistics or another quantitative subject

Role Differentiator:

  • First vendor independent modelling strategy for all financial crime modelling and platform, technologies looking to be explored are leading in the industry