Overview
On Site
140k - 180k
Full Time
Skills
Training
Optimization
NATURAL
Collaboration
Artificial Intelligence
Data Compression
Python
TypeScript
Java
Terraform
Amazon SageMaker
Redis
PostgreSQL
WebSocket
HTTP
System On A Chip
Kubernetes
Streaming
Management
LangChain
Microsoft Certified Professional
Evaluation
Apache Kafka
Snow Flake Schema
Amazon S3
Machine Learning (ML)
Continuous Integration and Development
Mentorship
Machine Learning Operations (ML Ops)
Speech Processing
Prompt Engineering
Research
Regulatory Compliance
HIPAA
Insurance
SAP BASIS
Job Details
We're looking for an entrepreneurial Senior Machine Learning Engineer with experience taking voice-centric AI systems (TTS, STT, LLM-driven dialog) from prototype to large-scale production. You'll own the full ML lifecycle-research, data pipelines, training, evaluation, deployment, and ongoing optimization-powering sub-second, natural voice conversations at scale.
This role is ideal for someone passionate about pushing the limits of conversational AI: creating highly optimized, domain-specific models that are faster, leaner, and more cost-efficient than general-purpose solutions. You'll collaborate closely with product, infrastructure, and compliance teams, while setting the technical bar for model excellence and ML best practices.
Required Skills & Experience
#LI-EM1
This role is ideal for someone passionate about pushing the limits of conversational AI: creating highly optimized, domain-specific models that are faster, leaner, and more cost-efficient than general-purpose solutions. You'll collaborate closely with product, infrastructure, and compliance teams, while setting the technical bar for model excellence and ML best practices.
Required Skills & Experience
- Experience: 7+ years building production ML systems, including 2+ years in speech or conversational AI. Proven track record deploying large-scale voice AI or LLM products.
- Fine-tuning & compression (LoRA, QLoRA, quantization, pruning, distillation).
- Speech (ASR: Whisper, NeMo, Kaldi; TTS: Tacotron, FastSpeech, VITS).
- LLMs & dialogue (GPT-class, RAG, LangGraph, LangChain, MCP).
- Strong in Python; bonus for TypeScript/Node/Java.
- Infra & Ops (Kubernetes, Helm, Terraform, MLflow/SageMaker).
- Data systems (Kafka, Redis, Postgres, Snowflake).
- Streaming protocols (gRPC, WebSockets, HTTP/2, WebRTC).
- Security & compliance (HIPAA, SOC2, HITRUST).
- Product-oriented, entrepreneurial, strong problem solver, effective communicator, and technical leader.
- Optimize & Fine-Tune Models: Apply LoRA, QLoRA, RLHF, and other parameter-efficient techniques. Use quantization, pruning, and distillation to shrink models while preserving quality.
- Build End-to-End Pipelines: Design STT, TTS, and LLM systems achieving
- Scale Inference: Optimize serving on Kubernetes/EKS with dynamic batching, speculative decoding, and streaming protocols.
- Advance Dialogue Management: Extend LangGraph/LangChain flows and MCP schemas for complex multi-turn conversations.
- Data & Evaluation: Develop pipelines for conversational logs (Kafka ? Snowflake/S3) and create frameworks to measure ASR accuracy, TTS quality, and task completion.
- Lead & Mentor: Define ML best practices, champion model CICD and monitoring, and mentor teammates on ML Ops, speech processing, and prompt engineering.
- Innovate & Research: Run POCs with cutting-edge models (e.g., Whisper-v3, Bark) and stay ahead of the latest in speech + LLM research.
- Ensure Reliability & Compliance: Implement HIPAA-grade security, PHI safeguards, and robust fallback strategies.
- Bonus eligible
- Medical, Dental, and Vision Insurance
- Vacation Time
- Stock Options
#LI-EM1
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.