Linguist

Remote • Posted 12 hours ago • Updated 12 hours ago
Contract W2
No Travel Required
Remote
$45 - $55/hr

Job Details

Skills

  • Python
  • Regex
  • Regular Expression
  • Transcription
  • SAMPA
  • IPA
  • TOBI
  • Marathi
  • Tamil
  • Arabic
  • TTS
  • Text-to-Speech
  • phonetics
  • phonology
  • sociolinguistics
  • dialectology
  • waveforms
  • spectrograms
  • NLP
  • i18n
  • speech

Summary

Required Years of Experience: 3-4 years.

Location: 100% Remote (must be based in the United States)

Education: Bachelor's degree in linguistics, language technologies, computational linguistics, speech science, or a related field.

Language requirements:

1. Tamil

2. Marathi

3. Egyptian Arabic and Modern Standard Arabic (familiarity with Modern Standard Arabic is nice to have)

Candidates need to be expert linguists, not just native speakers.

Summary:

The main function of a TTS Linguist is to determine speech data needs and make data-based model and product improvements.

Top 3 must-have HARD skills:

Native-level fluency in the languages listed above.
Familiarity with the command line, scripting, and version control systems.
Phonetics/phonology, including experience with transcription in IPA or SAMPA, or experience with regexes.

Good to have skills:

Python
Data analysis

Story Behind the Need (Business Group & Key Projects):

Core responsibilities of TTS:

  • Build and improve TTS models (speech generation) to sound more natural, expressive, and robust, including things like prosody and non-verbal cues (e.g., laughter/breath)
  • Multilingual + i18n support: expand language coverage and handle tricky cases like code-switching, accents, and language ID
  • Deploy/integrate models into products (sometimes including on-device inference constraints for wearables)
  • Evaluation + quality measurement: develop pipelines/guidelines to measure naturalness and expressivity
  • Native speaker expertise for new TTS locales

Compelling Story & Candidate Value Proposition:

Varied tasks and many opportunities to apply native-speaker insights to linguistic applications.
This role offers a lot of freedom to shape voice quality for users in their native language, so that they have the best possible user experience.

Typical Day in the Role:

  • Provide linguistic expertise in the areas of phonetics, phonology, lexicography, dialectology, and NLP.
  • Provide native-speaker input and feedback on product quality.
  • Develop and/or evaluate large-scale labeled datasets for varying NLP applications such as language ID, text normalization, G2P, and audio alignments
  • Create and perfect text normalization processes.
  • Manage lexical and phrasal transcriptions and related metadata.
  • Analyze system metrics such as user opinion, lexicon transcription coverage, and POS tagger performance, and remedy pain points.

How will performance be measured:

Task completion

Job Responsibilities:

  • Provide linguistic expertise in the areas of phonetics, phonology, lexicography, dialectology, and NLP.
  • Design and conduct experiments for evaluating transcription quality.
  • Develop manual and automated processes for multiple concurrent projects, including ensuring high-quality label alignments, prosodic classification, POS identification and disambiguation, targeted modeling data, and user feedback.
  • Create and perfect text normalization and inverse text normalization processes.
  • Manage lexical and phrasal transcriptions and related metadata.
  • Analyze system metrics such as user opinion, lexicon transcription coverage, and POS tagger performance, and remedy pain points.

Skills:

  • Knowledge of phonetics, phonology, sociolinguistics, dialectology, and other areas of linguistics.
  • Ability to analyze waveforms and spectrograms.
  • Knowledge of prescriptive writing and punctuation conventions for at least one language.
  • Excellent verbal and written communication skills.
  • Knowledge of transcription and annotation systems such as SAMPA, IPA, and ToBI.

Education/Experience:

  • Bachelor's degree in linguistics, language technologies, computational linguistics, speech science, or a related field.

Comments: Please make sure to tell candidates that using AI assistance in interviews is grounds for immediate disqualification.

  • Dice Id: 10123373
  • Position Id: SUC_SHA_Lingui