Software Engineer, Voice Interaction

Redwood City, CA, US • Posted 30+ days ago • Updated 11 hours ago

Full Time

On-site

Job Details

Skills

Innovation
Natural Language
Echo Cancellation
Embedded Systems
Auditing
Network
Real-time
Decision-making
Reasoning
Computer Hardware
Consumer Electronics
Debugging
C++
Streaming
Cloud Computing
API
Roadmaps
Blueprint
Shipping
Video Games
Machine Learning (ML)
Artificial Intelligence
Publications
Robotics
Genetics

Summary

Join Us in Building the Future of Home Robotics

At Sunday, we're developing personal robots to reclaim the hours lost to repetitive tasks. We're focused on an ambitious goal to make generalized robots broadly accessible, enabling households to take back quality time.

We have spent the last 18 months building a talented team, securing capital, and validating our technology. We are now seeking passionate individuals to join us in the next phase of our growth. If you are ready to apply your skills to the forefront of robotics innovation, we'd love to hear from you.

What To Expect

As a Software Engineer, Voice Interaction, you will own the full voice pipeline that connects our users with the core robotic and AI systems of our robot, Memo. Integrating machine learning models across local and cloud compute, you will transform raw audio signals into actionable instructions in a domestic environment. As part of the broader team, you will also contribute to the behavior stack that drives Memo's high-level decision-making and task execution.

What You'll Do

Develop and maintain the full voice pipeline from microphone array input through wake word detection, speech-to-text, natural language understanding, and text-to-speech output
Configure and integrate the microphone array for domestic use, tuning onboard audio processing (beamforming, noise suppression, echo cancellation) and supplementing with additional processing where needed
Integrate the voice subsystem with high-level robot behaviors, enabling the robot to receive, interpret, and act intelligently on voice commands
Design and optimize TTS output to deliver natural, responsive spoken interactions in real time on embedded hardware
Define and enforce guardrails around voice input and output, including content filtering, prompt boundary enforcement, output length limits, and auditing to ensure the system operates within intended use
Evaluate and integrate STT/TTS engines and models, making informed tradeoffs between accuracy, latency, and resource consumption
Build reliable, well-tested software that runs on our robot, Memo, under real-world conditions including ambient noise, partial utterances, and unreliable network connectivity
Deliver a successful voice interaction experience to our Beta users

What You'll Bring

2+ years of experience developing voice-driven systems including speech-to-text, text-to-speech, and real-time audio processing, with at least one end-to-end pipeline shipped to users or deployed on hardware
Strong understanding of both classical decision-making approaches (state machines, behavior trees, planning) and modern ML-driven reasoning (LLMs, VLMs)
Experience working on compute-constrained platforms where software meets hardware (robotics, edge devices, consumer electronics, or similar), including debugging problems that cut across both
Proficiency in C++, with experience in asynchronous programming, streaming/buffering patterns, and integration with cloud API services

Nice To Have

Experience as a founding or early hire; able to define a release roadmap where no blueprint exists
Experience shipping responsive AI systems in robotics, video games, or embodied AI
Strong understanding of ML-driven control for embodied AI (end-to-end learning, reinforcement learning, VLAs)
Practical experience (or high curiosity) in interfacing with multimodal models
Publications in multimodal models, audio interpretation, or robotics

At Sunday Robotics, we're building technology shaped by real people - curious, creative, and diverse. We're proud to be an equal opportunity employer and consider all qualified applicants regardless of race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.

Even if you don't meet every single requirement, we encourage you to apply. Studies show that women and underrepresented groups often hold back unless they meet 100% of the criteria - we don't want that to be the reason we miss out on great talent.

Dice Id: 80183762
Position Id: e6b0d1fde40824109d138cc27a613b2
