ML Engineer (Contract 1 Month, Hybrid in San Jose, CA)
Location: San Jose, CA (hybrid, 2 days/week onsite)
Duration: 1 month (potential for extension)
Summary:
We're seeking an experienced LLM Engineer to design, fine-tune, and deploy large language models for production use. The ideal candidate combines deep expertise in machine learning with hands-on experience optimizing inference systems and integrating models into scalable applications. You'll collaborate closely with research, data, and product teams to transform advanced AI models into impactful, real-world solutions.
Responsibilities:
Design, train, and fine-tune large language models (LLMs) to meet performance and alignment goals.
Implement and optimize inference pipelines, ensuring efficiency through techniques like quantization, distillation, and caching.
Integrate models into APIs and production systems for seamless deployment.
Lead experimentation on new architectures, prompt-engineering strategies, and retrieval systems to enhance model capability.
Collaborate cross-functionally with data scientists, ML engineers, and product teams to translate research insights into deployable solutions.
Monitor model performance, conduct evaluations, and iterate based on data-driven feedback.