Overview
Skills
Job Details
Mastech Digital provides digital and mainstream technology staff as well as Digital Transformation Services for all American Corporations. We are currently seeking an AI Engineer for our client in the Business Services domain. We value our professionals, providing comprehensive benefits and the opportunity for growth. This is a Contract position, and the client is looking for someone to start immediately.
Duration: 6 Months Contract (Possibility of extension)
Location: Remote - US, Canada, India
Salary: $70.00-$76.00/Hourly
Role: AI Engineer
Primary Skills: Python
Role Description: The AI Engineer must have 3+ years of experience. Fir this role, you must be a strong AI Engineer with experience in assisting with the AI Voice project.
We are seeking a skilled and innovative AI Engineer with hands-on experience in building and optimizing voice models. In this role, you will work on developing, training, and refining AI models for voice synthesis, voice cloning, speech recognition, and/or voice transformation. Your work will contribute to cutting-edge applications in conversational AI, voice assistants, and generative audio.
An ideal candidate would be someone who has:
- Developed and optimized text-to-speech models that achieved human-like voice synthesis, maintaining the unique style of voice actors across multiple languages.
- Implemented real-time processing solutions that reduced inference time to under 1 second, enhancing user interaction and experience.
- Managed large-scale datasets for voice cloning projects, ensuring high performance and reliability while supporting multilingual transcriptions.
Key Responsibilities
- Design, develop, and fine-tune deep learning models for voice synthesis (e.g., TTS, voice cloning).
- Implement and optimize neural network architectures such as Tacotron, FastSpeech, WaveNet, or similar.
- Collect, preprocess, and augment speech datasets.
- Collaborate with product and engineering teams to integrate voice models into production systems.
- Perform evaluation and quality assurance of voice model outputs.
- Research and stay current on advancements in speech processing, audio generation, and machine learning.
Required Qualifications:
- Bachelor?s or Master?s degree in Computer Science, Electrical Engineering, or related field.
- Strong experience with Python and machine learning libraries (e.g., PyTorch, TensorFlow).
- Hands-on experience with speech/audio processing and relevant toolkits (e.g., Librosa, ESPnet, Kaldi).
- Familiarity with voice model architectures (TTS, ASR, vocoders).
- Understanding of deep learning concepts and model training processes.
Preferred Qualifications:
- Experience with deploying models to real-time applications or mobile devices.
- Knowledge of data labeling, voice dataset creation, and noise handling techniques.
- Experience with cloud-based AI/ML infrastructure (e.g., AWS, Google Cloud Platform).
- Contributions to open-source projects or published papers in speech/voice-related domains.
Education: Bachelor?s degree
Experience: Minimum 3+ years of experience
Relocation: This position will not cover relocation expenses
Travel: No
Local Preferred: Yes
Note: Must be able to work on a W2 basis (No C2C)
Recruiter Name: Neha Naithani
Recruiter Phone:
Benefits:
We have various coverages and additional benefits to choose from:
- Medical, Dental (Including Ortho) & Vision Insurance (Option to Enroll).
- Paid Leaves (Wherever applicable).
- Life & Disability Coverage (Upon eligibility).
- 401K Option, Education Assistance Program and more.
Mastech Digital is an Equal Opportunity Employer - All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status.
Minimum Education Required: Bachelor
Years of Experience Required: 3-5 Years
Expected Travel Time: None