Overview
Skills
Job Details
Web Browser Voice Recognition Years | Azure Speech-to-Text Amazon Transcribe / Polly OpenAI Whisper API AssemblyAI Deepgram ElevenLabs Years |
|
|
Summary Requirement
Voice Recognition(VR) implemented across Chrome, Edge, Safari, Firefox, Brave. (required)
Customize Web Speech API and other speech frameworks for product needs
Other Speech Frameworks include: (required to have more than one tool to have used for speech recognition.
Extend capabilities with custom logic, error handling, and multilingual support
Improve speed, accuracy, and resilience in noisy environments
VR on multiple devices (laptop, desktop, mobile, tablet)
We re seeking a Voice Recognition Engineer to design and implement speech-driven interfaces that work seamlessly across major browsers. This role focuses on tailoring and extending Web Speech APIs and related technologies to deliver accurate, responsive, and user-friendly voice recognition experiences. You ll help us unlock natural voice interaction for our products, ensuring accessibility, speed, and reliability across platforms.
Key Responsibilities
Area | What You ll Do |
Cross-Browser Voice Recognition | - Implement and optimize voice recognition across Chrome, Edge, Safari, and Firefox and ensure good performance across most popular new browsers (Brave, etc) - Ensure consistent performance and compatibility across environments |
API Integration & Customization | - Tailor Web Speech API and other speech frameworks for product needs - Extend capabilities with custom logic, error handling, and multilingual support |
Performance & Accuracy | - Improve recognition speed, accuracy, and resilience in noisy environments - Benchmark and refine models for real-world use cases |
User Experience Alignment | - Collaborate with product and design teams to ensure intuitive voice interactions - Support accessibility and inclusive design through speech-driven features. Support configurable durations of allowed speech before concluding user speech complete. |
Collaboration & Pilots | - Partner with technical leads and product managers to align voice features with roadmap - Support pilots and demos with clients and partners |
Ideal Candidate
Trait | Description |
Browser-Savvy Engineer | Experienced in applying voice recognition across multiple browsers on multiple devices (laptop, desktop, mobile, tablet) |
API Tailor | Skilled at customizing Web Speech API and related/similar frameworks |
Accuracy-Focused | Dedicated to improving recognition speed and reliability across languages |
Collaborative Partner | Works well across product, design, and technical teams |
Innovative Builder | Excited to push the boundaries of speech-driven interfaces |