Overview
On Site
$92000 - $100000
Full Time
No Travel Required
Skills
GenAI
AI
ML
Docker
Python
Kubernetes
API
Machine Learning
Job Details
Role: GenAI Model Deployment Engineer
Location: Phoenix, AZ
Type: Full Time
Job Description
Experience Required: 6+ Years
Must-Have Technical/Functional Skills:
- Proven experience in deploying and managing machine learning models in production.
- Strong programming skills in Python.
- Experience with API development and integration.
- In-depth knowledge of generative AI model architectures and GPU architecture.
- Familiarity with cloud platforms and containerization technologies (e.g., Docker, Kubernetes).
- Excellent problem-solving skills and the ability to work independently and as part of a team.
- Strong communication skills and the ability to collaborate effectively with stakeholders.
Roles & Responsibilities:
- Deploy and manage large language models (LLMs) in production environments.
- Perform benchmarking to evaluate model performance and optimize deployment strategies.
- Research and implement new frameworks and technologies for hosting and serving models.
- Collaborate with cross-functional teams to integrate machine learning models into existing systems.
- Develop and maintain APIs for model inference and other machine learning services.
- Monitor and troubleshoot model performance and infrastructure issues.
- Stay up to date with the latest advancements in machine learning, AI, and related technologies.