Role: Sr. Data Engineer Location: Remote Client: Octave health
Job Summary:
We re looking for a Sr. Data Engineer with strong data platform experience to help evolve our modern data stack and contribute to the foundation of our emerging AI and ML platform. This role sits at the intersection of data engineering, platform architecture and machine learning enablement and will bring high-quality, scalable, and ethical AI into real-world use. You will partner closely with data scientists, analysts, and product managers to ensure our platform supports reliable data pipelines, scalable analytics, and production ready machine learning systems in addition to defining new architecture, best practices, and patterns for fellow engineers to inherit. The ideal candidate is both a systems thinker and a hands-on builder who thrives in evolving environments and is passionate about creating reliable data infrastructure that enables peers and partner teams to move faster with data.
Required Skills:
Proficiency in SQL and Python with strong familiarity towards modern data engineering frameworks, infrastructure, and tooling.
Proficiency with data ops best practices, monitoring, pipeline automation, and CI/CD.
Knowledge of modern compute and ML frameworks/libraries (i.e., Spark, TensorFlow, PyTorch, scikit-learn).
Ability to build production APIs and services, inclusive of MCP servers that expose internal data/services to LLMs.
Comfort using AI tools in day-to-day workflows, with a willingness to continuously rethink and improve how work gets done.
Curiosity and openness to experimenting with new tools and approaches; prior experience with AI tools is a plus.
Education & Experience:
Bachelor s degree (or equivalent) in Computer Science, Data Science, Statistics, Engineering or a related field.
5+ years of experience in data engineering, platform engineering, or ML engineering.
Experience working with major cloud data platforms and tools:
Preferred experience:
o Healthcare, behavioral health, EHR systems, and/or regulated industries.
o Specific expertise with: AWS/Google Cloud Platform, dbt, Airflow Airbyte, Redshift/BigQuery.