Computational Linguist - Hindi, Telugu and/or Bengali

Overview

On Site
$40 - $58
Contract - W2
Contract - 09 Month(s)

Skills

Computational Linguistics
Data Analysis
Data Collection
Internationalization And Localization
Large Language Models (LLMs)
Linguistics
Machine Learning (ML)
Natural Language Processing
Prompt Engineering
R
Regular Expression
Science
SQL
Semantics
Translation
Organizational Skills
Python
RegEx

Job Details

Location:

Onsite - New York, NY | Burlingame, CA | Seattle, WA - Either of One Location.

Summary:

  • We are looking for a Linguist to help us develop language components for a variety of voice-enabled technologies and products. We are seeking candidates with native or near-native fluency in Hindi, Telugu and/or Bengali with strong linguistic data analysis and language technology experience to manage data collection, LLM-powered data synthesis and data annotation tasks, prompt engineering, localization and quality evaluations.

Job Responsibilities:

  • Provide linguistic expertise in the areas of syntax, semantics, pragmatics and sociolinguistics
  • Collaborate with other linguists and data operations teams in data collection, data curation, translation, localization and annotation efforts
  • Evaluate and curate data sets for ML models using LLM solutions
  • Assess model and data quality
  • Prompt engineering
  • Collaboratively develop complex and consistent linguistic analyses

Required Qualifications:

  • Master s degree in general Linguistics or Linguistics with an emphasis on Romance languages, Computational Linguistics, Speech Science, or related field
  • Native or near-native fluency in Hindi, Telugu and/or Bengali
  • Awareness of Indian languages and their linguistic, cultural, local nuances
  • Knowledge of syntax, semantics, pragmatics, sociolinguistics, corpus linguistics, and other areas of linguistics
  • Experience working with speech and text data in multiple languages
  • Familiar with Large Language Models (LLMs), prompt engineering and their applications
  • Comfortable working in a fast paced, highly collaborative, dynamic work environment
  • Strong organizational skills and detail oriented
  • Excellent communication skills both verbal and written
  • Experience with database queries and data analysis processes (i.e. SQL, spreadsheets, R, Unix, or others)

Preferred (additional) Qualifications:

  • PhD in Linguistics or Romance languages, language technologies, computational linguistics, speech science, or related field,
  • Proficiency in Python
  • Experience with machine learning frameworks, NLP Libraries and Tools

Must-Have Skills:

  • Proficient in Indian languages
  • Master s degree is good - bachelor s degree with experience
  • data linguistic, RegEx, SQL

Nice-to-have Skills:

  • Python language

Years of Experience:

  • 2-3 years of experience
  • If they have master s degree and good academic background then no experience is fine

Requisition Details:

  • If someone has bachelor's degree, then we need at least 2 years of experience

Degrees/Certifications Required:

  • Masters, bachelors

How many rounds of interviews?

  • 30 min 1st round - Behavioral interview - validation
  • 45 min 2nd round - technical - examples to work through - coder pad link

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.