Role Summary:
The ideal candidate will have industry-leading programming skills and established experience designing, implementing, deploying, and maintaining big data analytics platforms in a cloud environment. The solution architect will use knowledge of healthcare data to influence the implementation and governance of our data architecture.
Your designs will account for data movement, storage, compute and BI consumption, and will comply with security and data governance standards. You will work closely with a variety of partners within the Data and Analytics organization.
Qualification & Experience:
Bachelor's degree with a preferred area of study in information technology, computer science, computer engineering or related fields.
10+ years of experience working with datasets, with a strong understanding of how data should be optimized within transactional, reporting, and big data architectures.
7+ years of experience in the healthcare industry working with large volumes of healthcare data (both claims and clinical) and major databases
8+ years of overall experience in big data, database and enterprise data architecture and delivery
8+ years of programming proficiency in one or more of Python, Java, and Scala
5+ years of hands-on experience building solutions on distributed processing frameworks such as Spark, Hadoop or Databricks
5+ years of experience architecting, developing, releasing, and maintaining enterprise data lake platforms
3+ years of experience implementing cloud-based systems; AWS and/or Databricks preferred
Technical Skills:
Strong SQL skills to create and maintain database objects and to query and load data, along with experience using data governance tools (e.g., business glossary, data dictionary, data catalog, data quality, master data management) and visualization tools to build data literacy across the organization.
Practical experience with workload management, monitoring, and performance tuning of Apache Spark jobs
Broad knowledge of data technologies, tools, and disciplines including data modeling, dimensional models, third-normal-form structures, ETL/ELT, change data capture and slowly changing dimensions
Experience working with large volumes of healthcare data (both claims and clinical) and major databases.
Experience with machine learning and MLOps is a big plus