Mountain View, California • Yesterday
Job Category: Technical
Job Title: AI Infrastructure & Experience Engineer
Duties:
Key Responsibilities
Inference Optimization: Deploy and tune multiple LLMs and generative multimodal models on local inference hardware. Optimize performance metrics (TTFT, tokens/sec) via model quantization, caching strategies, and architecture-specific adjustments.
Systems Engineering & CUDA: Leverage deep knowledge of the CUDA environment to build custom kernels, ensuring maximum utilization of the l
Third Party, Contract
$79 - $79














