Overview
Remote
$40 - $50
Contract - W2
Contract - 24 Month(s)
Skills
machine translation
ASR
TTS
LLM
NLP
language typology
syntax
morphology
sociolinguistics
corpus linguistics
writing systems
pragmatics
phonology
Python
SQL
Afro-Asiatic
Indo-Aryan
Atlantic-Congo
Austronesian
linguistic
Responsible AI
lexical resources
Job Details
Job Title: Computational Linguist
Duration: 24 months
Location: Remote (PST preference, open however)
Must-Have Skills:
- Perform linguistic error analysis of machine translations and identifying the most frequent and severe error categories
- Experience with Python
- the following families or groups: Afro-Asiatic, Indo-Aryan, Atlantic-Congo, or Austronesian.
Nice-to-have Skills:
- Experience with creating and/or maintaining specialized lexical resources (e.g., profanity dictionaries) a plus
- Ability to independently work through ambiguous requests, based on priorities established by CWAM, and perform under pressure. Able to work cross functionally.
Years of Experience:
- 0-3 years
Degrees/Certifications Required:
- Graduate degree in Linguistics or related field is a must; PhD is a plus
Main duties:
- Perform linguistic analyses on large datasets.
- Perform linguistic error analysis of AI model outputs, determining what the most frequent and severe error categories are.
- Write and revise guidelines for human annotation and translation projects.
- Conduct typological and sociolinguistic research on a large number of languages, highlighting their similarities and differences.
- Perform linguistic analyses for Responsible AI (toxic language, hate speech, gender bias and other cultural biases) in massively multilingual settings.
- Conduct linguistic literature reviews on various NLP-adjacent topics, and summarize findings.
- Compare the quality of human translations between vendors, identify error patterns, and provide actionable feedback.
- Provide information or guidance relative to any aspect of linguistic knowledge (typology, morpho-syntax, sociolinguistics, classification, phonetics/phonology, pragmatics, etc.).
- Reach out to and collaborate with native speakers in various languages.
- Communicate results of linguistic analyses to engineers and research scientists.
Skills:
- Must have strong written and spoken communication skills, especially business and research communication.
- Must be near-native proficient in a language other than English, more specifically a language of the following families or groups: Afro-Asiatic, Indo-Aryan, Atlantic-Congo, or Austronesian.
- Working knowledge in other languages is a plus. Proficiency in a low-resource language is valued.
- Must be able to code in Python (must) and query databases using SQL, other coding languages used for data analysis (e.g., R) are a plus.
- Must be able to independently work through complex requests and perform under pressure.
- Strong ability to work independently, prioritize, plan, and track work, as well as report progress
- education or training in the basics of project management is a plus
- self-motivation is a must
- Working knowledge of international language-classification standards is valued.
Education:
- Graduate degree in Linguistics or related field is a must; PhD is a plus
- a background or specialization in corpus linguistics is a plus
- experience with field work is a plus
- a graduate degree in Literature or English is not an appropriate substitution
- degree in Computer Science with a specialization in NLP is not an appropriate substitution
- Must have a very firm grasp of the following linguistic fields: language typology, syntax, morphology, sociolinguistics (especially dialectology and discourse analysis), corpus linguistics, writing systems, pragmatics, phonology.
- Must have some experience with applying basic Natural Language Processing techniques.
Experience:
- Years of experience: 0-3
- Experience working cross-functionally
- Experience collaborating with machine learning, NLP, or software engineers, or data scientists
- Experience contributing to research papers
- Important: Preferably no known conflicts of interest in the fields of machine translation, ASR, TTS, or LLM research (as FAIR Linguists need to be contributing to research papers)
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.