Overview
Skills
Job Details
LLM Engineer
Location: San Jose, CA (Hybrid 2 days/week onsite)
Looking only for W2
Key Responsibilities:
Model Development & Optimization
-
Design, train, fine-tune, and evaluate large language models (LLMs).
-
Ensure high performance, efficiency, and alignment with product or research goals.
Systems Integration & Deployment
-
Implement scalable inference pipelines.
-
Optimize serving infrastructure (quantization, caching, distillation, etc.).
-
Integrate models into applications, platforms, or APIs.
Research & Cross-Functional Collaboration
-
Drive experimentation with new architectures, retrieval systems, and prompt-engineering techniques.
-
Collaborate with product, data, and MLOps teams to transition research into production-ready features.