Software Engineer, Voice Interaction

Redwood City, CA, US • Posted 5 days ago • Updated 9 hours ago
Full Time
On-site
Fitment

Dice Job Match Score™

⭐ Evaluating experience...

Job Details

Skills

  • Innovation
  • Expect
  • Natural Language
  • Echo Cancellation
  • NATURAL
  • Embedded Systems
  • Auditing
  • Network
  • Real-time
  • Decision-making
  • Reasoning
  • Computer Hardware
  • Consumer Electronics
  • Debugging
  • C++
  • Streaming
  • Cloud Computing
  • API
  • Roadmaps
  • Blueprint
  • Shipping
  • Video Games
  • Machine Learning (ML)
  • Artificial Intelligence
  • Publications
  • Robotics
  • Genetics

Summary

Join Us in Building the Future of Home Robotics

At Sunday, we're developing personal robots to reclaim the hours lost to repetitive tasks. We're focused on an ambitious goal to make generalized robots broadly accessible, enabling households to take back quality time.

We have spent the last 18 months building a talented team, securing capital, and validating our technology. We are now seeking passionate individuals to join us in the next phase of our growth. If you are ready to apply your skills to the forefront of robotics innovation, we'd love to hear from you.

What To Expect

As a Software Engineer, Voice Interaction, you will own the full voice pipeline that connects our users with Memo's core robotic and AI systems. Integrating machine learning models across local and cloud compute, you will transform raw audio signals into actionable instructions in a domestic environment. As part of the broader team, you will also contribute to the behavior stack that drives Memo's high-level decision making and task execution.

What You'll Do
  • Develop and maintain the full voice pipeline from microphone array input through wake word detection, speech-to-text, natural language understanding, and text-to-speech output
  • Configure and integrate microphone array for domestic use, tuning onboard audio processing (beamforming, noise suppression, echo cancellation) and supplementing with additional processing where needed
  • Integrate the voice subsystem with high level robot behaviors, enabling the robot to receive, interpret, and act intelligently on voice commands
  • Design and optimize TTS output to deliver natural, responsive spoken interactions in real time on embedded hardware
  • Define and enforce guardrails around voice input and output, including content filtering, prompt boundary enforcement, output length limits, and auditing to ensure the system operates within intended use
  • Evaluate and integrate STT/TTS engines and models, making informed tradeoffs between accuracy, latency, and resource consumption
  • Build reliable, well-tested software that runs on our robot, Memo, under real-world conditions including ambient noise, partial utterances, and unreliable network connectivity
  • Deliver a successful voice interaction experience to our Beta users

What You'll Bring
  • 2+ years experience developing voice-driven systems including speech-to-text, text-to-speech, and real-time audio processing, with at least one end-to-end pipeline shipped to users or deployed on hardware
  • Strong understanding of both classical decision-making approaches (state machines, behavior trees, planning) and modern ML-driven reasoning (LLMs, VLMs)
  • Experience working on compute-constrained platforms where software meets hardware (robotics, edge devices, consumer electronics, or similar) including debugging problems that cut across both
  • Proficiency in C++, with experience in asynchronous programming, streaming/buffering patterns, and integration with cloud API services

Nice To Have
  • Experience as a founding or early hire; able to define a release roadmap where no blueprint exists
  • Experience shipping responsive AI systems in robotics, video games, or embodied AI
  • Strong understanding of ML-driven control for embodied AI (end-to-end learning, reinforcement learning, VLAs)
  • Practical experience (or high curiosity) in interfacing with multimodal models
  • Publications in multimodal models, audio interpretation, or robotics

At Sunday Robotics, we're building technology shaped by real people - curious, creative, and diverse. We're proud to be an equal opportunity employer and consider all qualified applicants regardless of race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.

Even if you don't meet every single requirement, we encourage you to apply. Studies show that women and underrepresented groups often hold back unless they meet 100% of the criteria - we don't want that to be the reason we miss out on great talent.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.
  • Dice Id: 80183762
  • Position Id: e6b0d1fde40824109d138cc27a613b2
  • Posted 5 days ago
Create job alert
Set job alertNever miss an opportunity! Create an alert based on the job you applied for.

Similar Jobs

San Mateo, California

Today

Full-time

USD 193,300.00 - 289,900.00 per year

Cupertino, California

Today

Full-time

San Francisco, California

Today

Full-time

USD 220,000.00 - 405,000.00 per year

Cupertino, California

Today

Full-time

Search all similar jobs