AI Infrastructure & Experience Engineer

Mountain View, CA, US • Posted 9 days ago • Updated 6 days ago

Contract W2

Contract Corp To Corp

6 Months

No Travel Required

On-site

$79/hr

Sorry this job is no longer available. The Similar Jobs shown below might interest you.

Similar Jobs

AI Infrastructure & Experience Engineer

Mountain View, California • 4d ago

Recent experience in model optimization required.
Hardware & Compute: Proven experience with NVIDIA ecosystems and ARM64 architecture.
Systems Programming: Advanced proficiency in C++, Python, and Rust. Deep familiarity with CUDA and the ability to author/debug custom CUDA kernels for compute-intensive tasks.
AI/ML Frameworks: Extensive experience with modern inference engines (llama.cpp, TensorRT-LLM, Ollama) and orchestration frameworks (LiteLLM).
Software Engineering: Robust understanding of asynchro

Contract

Depends on Experience

AI Infrastructure & Experience Engineer

Mountain View, California • 2d ago

Job Category: Technical
Job Title: AI Infrastructure & Experience Engineer
Duties: Key Responsibilities
Inference Optimization: Deploy and tune multiple LLMs and generative multimodal models on local inference hardware. Optimize performance metrics (TTFT, tokens/sec) via model quantization, caching strategies, and architecture-specific adjustments.
Systems Engineering & CUDA: Leverage deep knowledge of the CUDA environment to build custom kernels, ensuring maximum utilization of the l

Third Party, Contract

$79

AI Inference Engineer

San Jose, California • 4d ago

AI Inference Engineer
Location: San Jose, CA
Contract / C2H Duration: 6-12 Months
About the Role
We are seeking a highly skilled AI Inference Engineer to join our team and drive the performance, scalability, and reliability of our large-scale model serving infrastructure. This role sits at the intersection of systems engineering, GPU optimization, and distributed infrastructure, and is ideal for someone who thrives on squeezing maximum performance out of production AI workloads. The ideal candida

Contract

Depends on Experience

Sr. Cloud AI Infrastructure Engineer

Palo Alto, California • Today

Business Unit
What the Role Entails
1. Architecture Research: Conduct in-depth research into the underlying hardware logic of various AI accelerators; evaluate the power-efficiency ratio and suitability of different heterogeneous architectures in the context of Large Language Model (LLM) inference and training.
2. Operator & Performance Optimization: Design and optimize high-performance operator libraries for large-scale cloud computing environments; resolve long-tail latency issues in hardware

Full-time

USD 145,100.00 - 273,200.00 per year
