Local San Francisco Bay area candidates only!
Direct W2 employees only! No 3rd party candidates!
**This position will start remote and will be on campus on an on/off schedule
Our team is in the process of establishing a modern data platform that will enable the real/near-real time insights of the Clinical trial management landscape to support Roche drug discovery, research and development initiatives. This requires an enablement of world class Cloud infrastructure and Data Engineering technologies and practices.
The candidate will join our existing Data Engineering team to build and deploy the Data solutions on the Clinical Trial Operations and Predictive capabilities. These platforms are built on AWS Cloud with S3, Glue, Talend, Redshift, Alation along with other AI capabilities
Bachelor/Master degree in related fields (Computer Science, Computer Engineering, Mathematical Engineering, Information Systems) or related fields (or equivalent work-related experience)
3 years+ Experience with Data Engineering in Cloud Data Solutions (AWS preferred)
5 years+ Experience building Data Platforms, Data lakes, Modern Data warehouse architectures and Self-service Business Intelligence solutions
Expertise in designing efficient Data Models, optimizing existing Data Marts, developing and deploying Data structures based on those Data Models
Expertise in designing and implementing Data security to ensure the compliance of all the data assets and analytical applications
3 years+ Experience in SQL, Relational databases
5 years+ Extensive experience with data processing and ETL/ELT techniques
2 years+ Experience developing and supporting scalable data pipelines using technologies such as Kafka, Spark, Airflow to support Batch and streaming data efficiently
3 years+ Python programming experience.
Experience with high performance distributed data computing.
Experience with good software development, automation practices, including collaborative development using DevOps pipelines.
Build processes supporting data transformation, data structures, metadata, dependency and workload management.
Excellent communication, advanced English reading, writing, listening and speaking skills.
1 year+ Experience in Data Visualization tools such as Tableau, Power BI etc.
Knowledge in Graph and NoSQL databases
Previous experience with Informatica, Talend tools
Exposure to Data Science, ML/AI Technologies and Capabilities