Overview
Remote
Depends on Experience
Full Time
Skills
LLM
Hugging Face Transformers
distributed training tools (DeepSpeed
Accelerate)
Python
PyTorch/TF
Docker/Kubernetes
Job Details
Title: Transformer Model Builder AI/ML
Looking for an experienced AI/ML Transformer Model Builder to design, finetune, optimize, and deploy transformerbased models (LLMs and related architectures) for production AI services, with strong skills in training frameworks, model optimization, and MLOps.
Responsibilities
- Design and implement training workflows for transformer models including tokenization, dataset pipelines, and augmentation.
- Finetune and optimize models using techniques like LoRA, quantization, pruning, and distillation to meet latency and cost targets.
- Scale training with distributed runtimes and optimizers (e.g., DeepSpeed, ZeRO) and manage mixedprecision training for efficiency.
- Integrate models into production: build inference services, optimize serving (batching, caching, Triton), and implement CI/CD for model artifacts.
- Collaborate with ML engineers and product teams to define evaluation metrics, A/B tests, and monitoring for model drift and performance.
- Document experiments and maintain model registry with reproducible training configs and checkpoints.
Required qualifications
- 5+ years ML engineering experience with 3+ years working on transformer models and LLMs.
- Handson experience with Hugging Face Transformers and model hubs, tokenizers, and finetuning workflows.
- Experience with distributed training tools (DeepSpeed, Accelerate) and optimization techniques for large models.
- Strong software engineering skills in Python, PyTorch/TF, and production deployment (Docker, Kubernetes).
- Proven track record of shipping models to production with observability and rollback plans.
We are proud to be an Equal Employment Opportunity (EEO) and Affirmative Action employer. We at HL Solutions do not discriminate based on Race, Religion, Color, National origin, Sex, Sexual orientation, Gender identity, Gender expression, Age, and Disability status.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.