Sunnyvale, California
•
Today
Define the joint optimization of model compression and silicon architecture for Amazon\'s next generation of edge and cloud inference accelerators. Your work will set the technical targets that propagate across the model, compiler, runtime, and silicon stack.We are hiring a Principal Applied Scientist to be the technical leader who closes the loop between compression science and silicon design. Today\'s generation ships advanced quantization and large-model distillation in production, running mu
Full-time









