Engineer, Big Data - Databricks/Python/SQL/Power BI - Remote

Overview

Remote
On Site
Full Time

Skills

Databricks
SQL
Microsoft Power BI
FOCUS
Real-time
Extraction
API
Scalability
Data Analysis
Data Modeling
Analytics
Apache Hadoop
HDFS
Apache ZooKeeper
Cloudera
Kerberos
Apache Kafka
Apache Storm
Streaming
Big Data
Apache Spark
Apache Hive
Cloudera Impala
Apache Kylin
Extract
Transform
Load
Talend
NoSQL
Apache HBase
Database
Data Warehouse
Data Validation
Data Quality
Meta-data Management
Data Governance
Java
Scala
Python
Web Applications
Web Services
SOAP
Intranet
Health Care

Job Details

Job Description

Job Description
Job Summary
Responsible for collecting, storing, processing, and analyzing large sets of data. The primary focus will be on choosing optimal solutions to use for these purposes, then maintaining, implementing, and monitoring them. He/she will also be responsible to follow architecture and best practices used across the enterprise.
Knowledge/Skills/Abilities
Define ideal Architecture, Evaluating tools and Frameworks, Standards & Best Practices for implementing scalable business solutions
Implement Batch and Real-time data ingestion/extraction processes through ETL, Streaming, API, etc., between diverse source and target systems with structured and unstructured datasets
Design and build data solutions with an emphasis on performance, scalability, and high-reliability
Code, test, and document new or modified data systems to create robust and scalable applications for data analytics
Build data model for analytics and application layers
Working closely with multiple teams and Business partners, for collecting requirement and providing optimal solution
Proven experience on Hadoop cluster components and services (like HDFS, YARN, ZOOKEEPER, AMBARI/CLOUDERA MANAGER, SENTRY/RANGER, KERBEROS, etc.)
Ability to participate in troubleshooting technical issues while engaged with infrastructure and vendor support teams.
Job Qualifications

Required Education
Bachelor's Degree
Required Experience
Experience in building stream-processing systems, using solutions such as Kafka, Storm or Spark-Streaming
Proven experience on Big Data tools such as, Spark, Hive, Impala, Polybase, Phoenix, Presto, Kylin, etc.
Experience with integration of data from multiple data sources (using ETL tool such, Talend, etc.)
Experience building solutions with NoSQL databases, such as HBase, Memsql
Strong experience on Database technologies, Data Warehouse, Data Validation & Certification, Data Quality, Metadata Management and Data Governance
Experience with programming language such as, Java/Scala/Python, etc.
Experience implementing Web application and Web Services APIs (REST/SOAP)
Preferred Education
Master's Degree
Preferred Experience
Experience in the healthcare industry is preferred

To all current Molina employees: If you are interested in applying for this position, please apply through the intranet job listing.

Molina Healthcare offers a competitive benefits and compensation package. Molina Healthcare is an Equal Opportunity Employer (EOE) M/F/D/V.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.