Lead Data Engineer/Architect, Python

python, big data, data engineering, python data engineering, DESIRED: OOD, OOP, Kubernetes, Azure DevOps, microservices, MapReduce, MPP, Hadoop, Hive, machine learning, artificial intelligence
Full Time, full-time employment
140000-170000/yr
Travel not required

Job Description

We have been retained by our client in Houston, Texas to deliver a Lead Data Engineer/Architect on a direct-hire basis. This offer is regular full-time, salaried position with a big data engineering team with a massive amount of career opportunity, and large bonuses and stock offered. This package is amazing! Low turnover at this company.

Design distributed software architecture and implement systems for data - capturing, loading, processing, big data, or data for machine learning, artificial intelligence systems using distributed file systems, using python, and modern data platform technologies, massively parallel processing (MPP), MapReduce (MapR), Hadoop. This is software architecture and engineering of distributed MPP systems with Python. Data Engineering with Python. Leadership of design. Mentoring data engineers.

Work with algorithms. Create algorithms in python. Extend python algorithms or Python automation tools and features. Work with big data, modern tools in data science, side by side with engineering and business decision makers to help decide how to best impact the bottom line and profit. This is a Leadership position mentoring junior engineers from a design perspective. Think like and work as an architect, reviewing code, keeping things flexible, or extendable, or scalable, easy to maintain, driving technical thought and leadership, but also doing implementation. Take existing Python framework extend it, keep it flexible, and 8 data engineers on this team, all Python, 8 data engineers, 7 data scientists. The team uses: Kubernetes, Distributed File Systems, and Massively Parallel Processing (MPP), Azure DevOps, but this role almost all python and data engineering / architecture.


Responsibilities:

  • Work directly with engineering Subject Matter Experts and business operations stakeholders and data scientists to understand the objectives of data analytics sought, distributed data systems needed, and recommend and develop solutions, and deliver accurate, clean, and well-prepared data sets in a timely manner.
  • Collect, explore, and prepare data for advanced analytics and machine learning.
  • Perform exploratory data analysis and present findings for further analysis.
  • Architect and implement high quality analytical applications and data products.
  • Automate manual data flows for repeated use and scalability with python.
  • Develop data-intensive applications with APIs and streaming data pipelines with python.
  • Operationalize statistical and machine learning models.
  • Assists data analysts and data scientists with data extraction, feature engineering, query optimization, and data processing with python
  • Implement data quality checks to ensure data accuracy, consistency, and reliability
  • Identify opportunities for data improvements and presents recommendations to management


Requirements

We seek a candidate with:

  • 5+ years Python for data engineering, or creation of data models or creation of or enhancements of algorithms all with Python.
  • 3+ years design of distributed software architecture to implement systems for data - capturing, loading, processing, big data, or data for machine learning, artificial intelligence systems using distributed file systems, using python, and modern data platform technologies, massively parallel processing data sets. Skills such as the following are helpful (all are not required): MapReduce (MapR), Hadoop, NoSQL databases, Columnar Databases, Hive, Cassandra, Neo4J, DynamoDB, DocumentAPI, Couchbase, Azure Cosmos DB, software architecture and engineering of distributed MPP systems with Python. Apache, Kubernetes, massively parallel processing (MPP), distributed data systems, Kubernetes, docker, microservices, Azure DevOps.
    Data engineering with Python is required.
  • 5+ years of SQL
  • JSON calls to a NoSQL database
  • Data modeling
  • exploratory data analysis using common statistical methods
  • time series analysis
  • IIoT, IoT, industrial internet of things, internet of things collecting data
  • An understanding of OOP OOD Design Principles and Patterns, SOLID principles, Testing, CI/CD, version control, and basically how Kubernetes works, Kubernetes is a plus, but not required.
  • Experience in presenting and explaining complex concepts to engineering or business stakeholders
  • Big data, machine learning, artificial intelligence

Employment Type: Direct Hire

Pay Rate: $140,000 to 170,000 per year salary

Benefits: life, health, dental, vision, 401k matching, paid vacation, paid holidays, paid sick days, performance bonus. This is an excellent benefits package. Big bonuses. Stock.

Location: Houston, Texas

Immigration: US citizens and those authorized to work in the US are encouraged to apply. We are unable to sponsor H1b candidates at this time.

No third parties. No consulting firms. Principals only.


Please apply with resume ( MS Word format preferred ).

Houston career oppty: Lead Data Engineer/Architect, Python job posting details:

http://www.computerstaff.com/?jobIdDescription=559

Any candidate is encouraged to call or to send a text to:
817-424-1411

Dice Id : 10117243
Position Id : 559
Originally Posted : 1 month ago
Have a Job? Post it

Similar Positions

Data Engineer, Python
  • Computer Staff, Inc.
  • Houston, TX, USA
Data Architect
  • Odyssey Information Services
  • Houston, TX, USA
Lead Data Engineer
  • Infinity Consulting Solutions
  • Houston, TX, USA
Lead Machine Learning Engineer
  • Genuent Global, LLC
  • Houston, TX, USA
Technology Leader, Data Engineer
  • Infinity Consulting Solutions
  • Houston, TX, USA
ETL Data Engineer
  • PREDICTif Solutions
  • Houston, TX, USA
Cloud Data Engineer
  • Decide Consulting
  • Houston, TX, USA
Hadoop Engineer
  • Brothers Consulting
  • Austin, TX, USA
Machine Learning engineer (100% Remote)
  • Galaxy i Technologies, Inc.
  • Austin, TX, USA
Machine learning Engineer
  • Galaxy i Technologies, Inc.
  • Austin, TX, USA