Cloud Machine Learning Engineer - US remote

Overview

Remote
Full Time

Skills

GitHub
Art
Open Source
SaaS
MEAN Stack
IBM
Microsoft
Bridging
Technical Writing
Deep Learning
PyTorch
Cloud Computing
Amazon Web Services
Amazon SageMaker
Amazon EC2
Amazon S3
Microsoft Azure
Google Cloud Platform
Google Cloud
Machine Learning Operations (ML Ops)
Docker
TypeScript
Rust
MongoDB
Kubernetes
Documentation
Product Development
SAP BASIS
Training
Collaboration
Machine Learning (ML)
Artificial Intelligence

Job Details

At Hugging Face, we're on a journey to democratize good AI. We are building the fastest growing platform for AI builders with over 5 million users & 100k organizations who collectively shared over 1M models, 300k datasets & 300k apps. Our open-source libraries have more than 400k+ stars on Github.

Hugging Face has become the most popular, community-driven project for training, sharing, and deploying the most advanced machine learning models. Workload efficiency is key to our mission of democratizing state of the art and we are always looking to push the boundaries for faster, and more efficient ways to train and deploy models.

About the Role

We are looking for a Cloud Machine Learning engineer responsible to help build machine learning solutions used by millions leveraging cloud technologies. You will work on integrating Hugging Face's open-source libraries like Transformers and Diffusers, with major cloud platforms or managed SaaS solutions.

You may want to take a look at these announcements to get a better sense of what this role might mean in practice :
Hugging Face and AWS partner to make AI more accessible
Hugging Face and IBM partner on watsonx.ai, the next-generation enterprise studio for AI builders
Introducing SafeCoder
Hugging Face Collaborates with Microsoft to launch Hugging Face Model Catalog on Azure

Responsibilities

We are looking for talented people with deep experience and passion for both Machine Learning (at the framework level) and Cloud Services:
  • Bridging and integrating transformers/diffusers models with a different Cloud provider.
  • Ensuring the above models meet the expected performance
  • Designing & Developing easy-to-use, secure, and robust Developer Experiences & APIs for our users.
  • Write technical documentation, examples and notebooks to demonstrate new features
  • Sharing & Advocating your work and the results with the community.

About You

You'll enjoy working on this team if you have experience with and interest in deploying machine learning systems to production and build great developer experiences. The ideal candidate will have skills including:
  • Deep experience building with Hugging Face Technologies, including Transformers, Diffusers, Accelerate, PEFT, Datasets
  • Expertise in Deep Learning Framework, preferably PyTorch, optionally XLA understanding
  • Strong knowledge of cloud platforms like AWS and services like Amazon SageMaker, EC2, S3, CloudWatch and/or Azure and Google Cloud Platform equivalents.
  • Experience in building MLOps pipelines for containerizing models and solutions with Docker
  • Familiarity with Typescript, Rust, and MongoDB, Kubernetes are helpful
  • Ability to write clear documentation, examples and definition and work across the full product development lifecycle
  • Bonus: Experience with Svelte & TailwindCSS

More about Hugging Face

We are actively working to build a culture that values diversity, equity, and inclusivity.We are intentionally building a workplace where people feel respected and supported-regardless of who you are or where you come from. We believe this is foundational to building a great company and community. Hugging Face is an equal opportunity employer and we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

We value development.You will work with some of the smartest people in our industry. We are an organization that has a bias for impact and is always challenging ourselves to continuously grow. We provide all employees with reimbursement for relevant conferences, training, and education.

We care about your well-being. We offer flexible working hours and remote options. We offer health, dental, and vision benefits for employees and their dependents. We also offer parental leave and flexible paid time off.

We support our employees wherever they are. While we have office spaces in NYC and Paris, we're very distributed and all remote employees have the opportunity to visit our offices. If needed, we'll also outfit your workstation to ensure you succeed.

We want our teammates to be shareholders. All employees have company equity as part of their compensation package. If we succeed in becoming a category-defining platform in machine learning and artificial intelligence, everyone enjoys the upside.

We support the community. We believe major scientific advancements are the result of collaboration across the field. Join a community supporting the ML/AI community.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.