Overview
On Site
$60 - $65
Contract - W2
Able to Provide Sponsorship
Skills
Amazon Web Services
Android
Apache Kafka
Artificial Intelligence
Authentication
Back Office
Benchmarking
Bridging
Budget
Build Vs Buy
C
C++
Cloud Computing
Communication
Continuous Delivery
Continuous Integration
DLP
Dashboard
Data Collection
Data Flow
Data Retention
Data-flow Diagrams
DevOps
Digital Signal Processing
Docker
Echo Cancellation
Encryption
FAR
GitHub
Grafana
Hierarchical Storage Management
HTTP
IOS Development
Kotlin
Java
Kubernetes
Leadership
Job Details
Mobile Solution Architect Voice AI & Mobile
Experience: 10+ years overall (3+ years in mobile or voice AI)
Competency Track: Digital React Native
Engagement: Full-time; Onsite/Hybrid as per client need
Role Overview
Lead architecture and delivery for an enterprise-grade speech/voice platform spanning mobile (iOS/Android), edge containers, and cloud services. Drive PoCs, define the target state, and guide engineering squads on performance, security, privacy, cost, and operability.
Key Responsibilities
- Target Architecture: Define end-to-end architecture across mobile clients, edge containers, and cloud back end (e.g., Azure Cognitive Services); cover iOS Native STT, Android Native STT, authentication, observability/monitoring, and runtime topology.
- PoCs & Benchmarking: Plan and execute PoCs comparing STT/TTS engines for accuracy, latency, and cost (e.g., Native iOS/Android, Azure, Google, AWS, Whisper, Vosk). Produce WER/latency dashboards and recommendation papers.
- Wake Word & Audio Pipeline: Design wake-word detection and audio pipeline (noise suppression, end-pointing, streaming). Evaluate on-device DSP, beamforming, and VAD strategies.
- Domain Modeling: Implement custom vocabulary (e.g., product brands like Stihl , SKU/UPC patterns). Define data collection loops and training/retuning cycles.
- APIs & Streaming: Design low-latency streaming APIs (WebSocket/HTTP/2) for realtime transcription and TTS with backpressure, retries, and QoS controls.
- Security & Compliance: Map data flows to CCPA/GDPR; author threat models; set encryption strategy (data in transit/at rest), token lifecycles, data-retention and redaction policies.
- Platform & DevEx: Establish CI/CD for mobile and speech services using GitHub Actions/Azure DevOps; containerize services; define infra as code; codify SLAs/SLOs and error budgets.
- Operations: Produce run-books, on-call playbooks, and capacity/cost forecasts; hand off to Dev and SRE/Operations teams.
- Stakeholder Leadership: Partner with Product, Security, Legal, and Store Ops; present architecture decisions and trade-offs to senior leadership.
Essential Qualifications
- 8+ years in software/solution architecture with 3+ years in mobile or voice AI.
- Deep knowledge of at least one cloud speech stack: Native iOS/Android STT, Azure Cognitive Services, Google Speech, AWS Transcribe/Polly, Whisper.
- Hands-on React Native with native bridges (Swift/ObjC; Kotlin/Java) and performance tuning on device.
- Familiarity with on-device ASR/TTS frameworks (Whisper.cpp, Vosk, Mimic, TFLite) and model optimization (quantization, pruning).
- Proven design of lowlatency streaming services (WebSocket/HTTP2), including telemetry and tracing.
- Cloud scaling and cost optimization with containers/Kubernetes and IaC.
- Security & compliance expertise: OAuth2/OIDC, JWT, TLS, data retention, CCPA/GDPR.
- Audio DSP/noise reduction experience; wake-word/VAD/endpointing.
- Experience with enterprise mobility (e.g., Zebra Android devices) and MDM policies.
- Ability to design and interpret WER experiments; familiarity with dataset curation and metric design.
Desirable Qualifications
- Experience deploying speech workloads to edge locations (store/backoffice) with intermittent connectivity.
- Knowledge of beamforming, echo cancellation, and farfield mic arrays.
- Experience with cost modeling of STT/TTS at scale and token/compute budgeting for ondevice vs. cloud tradeoffs.
- Familiarity with observability stacks (OpenTelemetry, PrometheGrafana) and synthetic QoE testing.
- Prior retail domain experience (store operations, scan/lookup, SKU/UPC workflows).
Tooling & Technology Stack (Representative)
- Mobile: React Native, Swift/ObjC, Kotlin/Java, RN bridge modules, Zebra devices.
- Speech/ML: Azure Cognitive Services, Google Speech, AWS Transcribe/Polly, Whisper/Whisper.cpp, Vosk, TFLite, ONNX Runtime, KServe.
- APIs/Runtime: Node.js/Java/Kotlin services, gRPC/WebSocket/HTTP2, NGINX/Envoy, Redis/Kafka.
- Infra: Docker, Kubernetes (AKS/GKE/EKS), Helm, Terraform, GitHub Actions, Azure DevOps.
- Security/Privacy: OAuth2/OIDC, JWT, mTLS, KMS/HSM, Vault, DLP/redaction.
- Observability: OpenTelemetry, Prometheus, Grafana, ELK, Azure Monitor.
Deliverables
- Target Architecture & Integration Diagrams (current future state)
- PoC benchmark reports (WER/latency/cost) with recommendations
- Threat model, data-flow diagrams, and encryption/keymanagement plan
- CI/CD pipelines, infra-as-code baselines, and environment promotion strategy
- Runbooks, SLAs/SLOs, capacity and cost forecasts; handover package to Dev/SRE
Soft Skills
- Excellent stakeholder communication; ability to influence without authority
- Pragmatic approach to build vs. buy; datadriven decision making
- Mentorship of developers and mobile engineers; champion of engineering best practices
| Mahesh Recruitment Manager VSB Tech Consulting Services D:
Looking forward to work with you... As we Believe, |
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.