Do you want to do something meaningful with Big Data? At Optum, we're leading the fastest-moving industry in the world. Health care is transforming, and we're innovating to help make the health care system work better for everyone. Our technology teams are working on projects with national and international visibility and impact. As a Senior Data Engineer, you will join a company and a team that is finding solutions to transform terabytes of health care information into actionable data. Join us. Let innovation and performance fuel your life's best work.(sm)
Optum Analytics solutions create a longitudinal view of both individual patients and patient populations. We gather, normalize, and analyze data from disparate sources that, uniquely, span the continuum of care, including EHRs, Practice Management Systems, and claims. Our EHR data alone accounts for over 80M patient lives across the U.S.
We are looking for a Senior Data Engineer who is eager to tackle the challenges of processing vast amounts of EHR data originating from multiple sources. You will be the driving force and thought leader behind our effort to build services based on NLP products.
Your primary responsibilities will be to design and maintain data pipelines and services using best practices for data management and governance, and to deploy machine learning and NLP applications in production. You will work with EHR data and collaborate across teams with ETL and NLP engineers, data scientists, researchers, and clinicians to provide data services with high data quality control standards. You will need to develop a deep understanding of the data and drive efforts to maintain and improve data quality and usability. You should understand the importance and value of writing maintainable, documented, and well-tested code throughout the entire product lifecycle. Above all, you should be curious about what is possible in health care with the right tools and infrastructure.
Degree in computer science or related field
5+ years of experience building and maintaining data pipelines and data assets
3+ years of experience with distributed data processing frameworks such as Spark, Hive, and MapReduce
5+ years of programming experience, preferably in Python, following best practices
5+ years of demonstrated knowledge of data management best practices
3+ years of experience with ETL
5+ years of experience leading data engineering projects and mentoring colleagues
Experience with machine learning techniques and open source libraries, such as scikit-learn, TensorFlow, Keras, NLTK, or spaCy
Experience running machine learning or NLP applications at scale
Experience with data pipeline frameworks such as Airflow, Luigi or Oozie
Experience with search engines (Elasticsearch, Solr)
Experience with cloud-based computing (AWS, Azure)
Experience with Scala, in particular with the Spark Scala API
Familiarity with EHR data and standards (HL7, FHIR)
Experience with HBase or other non-relational databases
Experience explaining engineering concepts and processes to non-engineers through teaching, presentations, or training
Strong prioritization skills; ability to manage ad-hoc requests in parallel with ongoing projects
Attention to detail, intellectual curiosity, collaborative attitude and strong communication skills
Careers with Optum. Here's the idea. We built an entire organization around one giant objective: make health care work better for everyone. So when it comes to how we use the world's large accumulation of health-related information, or guide health and lifestyle choices, or manage pharmacy benefits for millions, our first goal is to leap beyond the status quo and uncover new ways to serve. Optum, part of the UnitedHealth Group family of businesses, brings together some of the greatest minds and most advanced ideas on where health care has to go in order to reach its fullest potential. For you, that means working on high-performance teams against sophisticated challenges that matter. Optum, incredible ideas in one incredible company and a singular opportunity to do your life's best work.(sm)
Diversity creates a healthier atmosphere: UnitedHealth Group is an Equal Employment Opportunity / Affirmative Action employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, age, national origin, protected veteran status, disability status, sexual orientation, gender identity or expression, marital status, genetic information, or any other characteristic protected by law.
UnitedHealth Group is a drug-free workplace. Candidates are required to pass a drug test before beginning employment.
Job Keywords: Software engineering, information technology, medical informatics, AWS, Amazon Web Services, information extraction, information retrieval, data mining, data science, data scientist, natural language processing engineer, data pipelines, statistics, Hadoop, API, Hive, Python, Scala, HBase, Boston, MA, Massachusetts, Austin, TX, Texas, NLP, Natural Language Processing, Spark, MapReduce, ETL, Data Sets
Job Title: Senior Data Engineer/Scientist (NLP) - Boston, MA, Austin, TX or Telecommute
Shift: Day Job
Business: OI Business Operations
Telecommuter Position: Yes
Job Level: Individual Contributor
Overtime Status: Exempt
Posted Date: 5/22/2019
Country: United States
Department: Optum Enterprise Analytics