Voice Recognition Engineer

Overview

Remote
$90 - $95
Contract - W2
Contract - 6 Month(s)

Skills

Voice Recognition

Job Details

Strong browser based Voice Recognition.
Fill below matrix

Web Browser

Voice

Recognition

Years

Azure Speech-to-Text

Amazon Transcribe / Polly

OpenAI Whisper API

AssemblyAI

Deepgram

ElevenLabs

Years

Summary Requirement

Voice Recognition(VR) implemented across Chrome, Edge, Safari, Firefox, Brave. (required)

Customize Web Speech API and other speech frameworks for product needs


Other Speech Frameworks include: (required to have more than one tool to have used for speech recognition.

Extend capabilities with custom logic, error handling, and multilingual support

Improve speed, accuracy, and resilience in noisy environments

VR on multiple devices (laptop, desktop, mobile, tablet)

We re seeking a Voice Recognition Engineer to design and implement speech-driven interfaces that work seamlessly across major browsers. This role focuses on tailoring and extending Web Speech APIs and related technologies to deliver accurate, responsive, and user-friendly voice recognition experiences. You ll help us unlock natural voice interaction for our products, ensuring accessibility, speed, and reliability across platforms.

Key Responsibilities

Area

What You ll Do

Cross-Browser Voice Recognition

- Implement and optimize voice recognition across Chrome, Edge, Safari, and Firefox and ensure good performance across most popular new browsers (Brave, etc) - Ensure consistent performance and compatibility across environments

API Integration & Customization

- Tailor Web Speech API and other speech frameworks for product needs - Extend capabilities with custom logic, error handling, and multilingual support

Performance & Accuracy

- Improve recognition speed, accuracy, and resilience in noisy environments - Benchmark and refine models for real-world use cases

User Experience Alignment

- Collaborate with product and design teams to ensure intuitive voice interactions - Support accessibility and inclusive design through speech-driven features. Support configurable durations of allowed speech before concluding user speech complete.

Collaboration & Pilots

- Partner with technical leads and product managers to align voice features with roadmap - Support pilots and demos with clients and partners

Ideal Candidate

Trait

Description

Browser-Savvy Engineer

Experienced in applying voice recognition across multiple browsers on multiple devices (laptop, desktop, mobile, tablet)

API Tailor

Skilled at customizing Web Speech API and related/similar frameworks

Accuracy-Focused

Dedicated to improving recognition speed and reliability across languages

Collaborative Partner

Works well across product, design, and technical teams

Innovative Builder

Excited to push the boundaries of speech-driven interfaces

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.