Job Description
Title : AI/ML Lead Architect, Large Language Model (LLM) Development
Location : Indiana
Position : Remote
We are a forward-thinking firm embarking on the development of a proprietary, fully owned Large Language Model (LLM) tailored to deliver transformative solutions in three specialized sectors:
Legal Services & Documentation, Healthcare & Medical Analysis, and Business Intelligence & Analytics. The resulting model will be linguistically versatile, supporting both Arabic and English content, and architected for deployment in cloud and on-premises environments.
Role Overview:
As the AI/ML Lead Architect, you will play a central role in steering the technical design, training strategy, and deployment of an advanced transformer-based language model.
You will work across all stages of the LLM lifecycle: from data curation and model design to distributed training, fine-tuning, optimization, and production deployment.
This role demands both technical depth and a vision for innovating specialized, domain-specific AI solutions.
Key Responsibilities for the AI/ML Lead Architect :
Design and architect advanced transformer-based LLMs tailored for legal, healthcare, and business analytics domains.
Lead and mentor a high-caliber team of researchers and engineers throughout the model development lifecycle.
Oversee the preparation and curation of large-scale multilingual (Arabic/English) datasets relevant to target domains.
Spearhead both the fine-tuning of existing LLMs and the training of new models from scratch, with a focus on domain specialization.
Implement and optimize distributed training frameworks (e.g., DeepSpeed, FairScale, Horovod) for scalable model development.
Apply state-of-the-art techniques in attention mechanisms, tokenization, model quantization, pruning, and deployment optimization.
Evaluate and iterate on open-source models such as LLaMA (2/3), Mistral, CodeLlama, and Alpaca, leveraging their architectures and adapting them for proprietary needs.
Work closely with product stakeholders to ensure solutions are deployable in both cloud and on-premises environments.
Establish best practices for model evaluation, benchmarking, and responsible AI deployment, particularly around sensitive legal and medical data.
Document technical designs and processes for knowledge sharing and regulatory compliance.
Required Qualifications for the AI/ML Lead Architect :
5+ years of experience in AI/ML research and development with a specialization in modern transformer architectures (e.g., GPT, BERT, T5, LLaMA, Mistral).
Proven expertise in LLM fine-tuning and original model training.
Robust experience with distributed training frameworks (DeepSpeed, FairScale, Horovod, or similar).
In-depth understanding of attention mechanisms, tokenization, and optimization strategies for large neural models.
Demonstrable hands-on work with major open-source LLMs (LLaMA 2/3, Mistral, CodeLlama, Alpaca, etc.).
Experience with model quantization, pruning, and deployment optimization to achieve efficient inference on diverse hardware.
Record of domain-specific LLM projects within legal, healthcare/medical, or business analytics/finance sectors.
Comfortable working in multilingual environments, especially with datasets and content in Arabic and English.