Mountain View, California
•
Yesterday
A global consumer device company based in Mountain View, CA is looking forAI Infrastructure & Experience Engineerto join their team! Key Responsibilities Deploy and tune multiple LLMs and generative multimodal models on local inference hardware. Optimize performance metrics (TTFT, tokens/sec) via model quantization, caching strategies, and architecture-specific adjustments. Leverage deep knowledge of the CUDA environment to build custom kernels, ensuring maximum utilization of the low cost GPU
Easy Apply
Contract
Depends on Experience


