Hi All,
We are hiring for a Voice Recognition Engineer (Browser-Based Speech Interfaces) role. This is a long-term contract and a remote job opportunity.
Please review the job description below and share your updated resume along with your details as soon as possible.
Job Title: Voice Recognition Engineer, Browser-Based Speech Interfaces
Location: Remote (client is in St. Louis, MO; Central Time)
Duration: Long-Term Contract
Job Description:
We're seeking a Voice Recognition Engineer to design and implement speech-driven interfaces that work seamlessly across major browsers. This role focuses on tailoring and extending Web Speech APIs and related technologies to deliver accurate, responsive, and user-friendly voice recognition experiences. You'll help us unlock natural voice interaction for our products, ensuring accessibility, speed, and reliability across platforms.
Summary of Requirements:
Voice recognition (VR) implemented across Chrome, Edge, Safari, Firefox, and Brave (required)
Customize Web Speech API and other speech frameworks for product needs
Other speech frameworks include the following (hands-on experience with more than one speech recognition tool is required):
- ElevenLabs (Scribe)
- AssemblyAI (Speech-to-Text API)
- Deepgram (Speech-to-Text API)
- OpenAI Whisper API
- Microsoft Azure Text-to-Speech / Speech-to-Text
- Amazon Transcribe / Polly
Extend capabilities with custom logic, error handling, and multilingual support
Improve speed, accuracy, and resilience in noisy environments
VR on multiple devices (laptop, desktop, mobile, tablet)
Key Responsibilities:
| Area | What You'll Do |
| Cross-Browser Voice Recognition | - Implement and optimize voice recognition across Chrome, Edge, Safari, and Firefox, and ensure good performance on other popular newer browsers (Brave, etc.) - Ensure consistent performance and compatibility across environments |
| API Integration & Customization | - Tailor Web Speech API and other speech frameworks for product needs - Extend capabilities with custom logic, error handling, and multilingual support |
| Performance & Accuracy | - Improve recognition speed, accuracy, and resilience in noisy environments - Benchmark and refine models for real-world use cases |
| User Experience Alignment | - Collaborate with product and design teams to ensure intuitive voice interactions - Support accessibility and inclusive design through speech-driven features - Support configurable durations of allowed speech before concluding that the user has finished speaking |
| Collaboration & Pilots | - Partner with technical leads and product managers to align voice features with roadmap - Support pilots and demos with clients and partners |
Ideal Candidate:
| Trait | Description |
| Browser-Savvy Engineer | Experienced in applying voice recognition across multiple browsers on multiple devices (laptop, desktop, mobile, tablet) |
| API Tailor | Skilled at customizing the Web Speech API and similar frameworks |
| Accuracy-Focused | Dedicated to improving recognition speed and reliability across languages |
| Collaborative Partner | Works well across product, design, and technical teams |
| Innovative Builder | Excited to push the boundaries of speech-driven interfaces |
THANKS & REGARDS
PRABHASH
SYSTEM EDGE USA LLC