Job Details
Data Scientist
Experience: 8+ Years
Location: Onsite 3 days/week in Irving, TX or Cape Girardeau, MO
What's in it for you?
As a Data Scientist, you'll join a dynamic team of AI/ML professionals working at the forefront of generative AI innovation. We're seeking individuals with hands-on experience in advanced architectures (Transformers, GANs, VAEs, and Diffusion Models), as well as expertise in fine-tuning Large Language Models (LLMs), building robust NLP pipelines, developing GenAI validation frameworks, and constructing multi-agent reasoning systems using tools like LangChain, AutoGen, or CrewAI.
Responsibilities:
- Design and implement state-of-the-art AI architectures including Transformers, GANs, VAEs, and Diffusion Models for both generative and predictive use cases.
- Fine-tune and optimize performance of LLMs (e.g., GPT, LLaMA, Claude) through prompt engineering and other techniques for specific applications.
- Build and maintain NLP pipelines focused on tokenization, embeddings, attention mechanisms, and integration with vector databases for semantic search and retrieval.
- Develop validation frameworks for GenAI applications, applying suitable evaluation methodologies and metrics to ensure model reliability and effectiveness.
- Design and deploy multi-step reasoning agents using LangChain, AutoGen, or CrewAI to support complex task execution and decision-making.
- Implement tool orchestration and memory management strategies to preserve context and enhance agent performance over time.
- Collaborate closely with cross-functional teams (including data scientists, engineers, and product managers) to integrate AI capabilities into scalable, production-ready applications.
Educational Qualifications:
- Bachelor's or Master's degree in Engineering, Computer Science, or related field (BE/ME/BTech/MTech/BSc/MSc)
- Relevant technical certifications across multiple technologies are a plus
Key Skills (Mandatory):
- Strong knowledge of AI/ML architectures: Transformers, GANs, VAEs, Diffusion Models
- Hands-on experience with LLMs such as OpenAI GPT, LLaMA, and Claude; fine-tuning experience preferred
- Expertise in NLP concepts: tokenization, embeddings, attention mechanisms, vector databases
- Experience designing validation pipelines and evaluation metrics for AI/GenAI applications
- Proven ability to develop multi-step reasoning agents using LangChain, AutoGen, or CrewAI
- Familiarity with tool orchestration and memory management in agent frameworks