GenAI DevOps Engineer

Overview

On Site

$92000 - $100000

Full Time

No Travel Required

Skills

GenAI

DOCKER

python

Kubernetes

API

machine learning

Job Details

Role :: GenAI Model Deployment Engineer

Location :: Phoenix, Az

Type :: Fulltime

Job Description

Experience Required - 6+ Years

Must Have Technical/Functional Skills

Proven experience in deploying and managing machine learning models in production.
Strong programming skills in Python.
Experience with API development and integration.
In-depth knowledge of Generative AI model architecture and GPU architecture.
Familiarity with cloud platforms and containerization technologies (e.g., Docker, Kubernetes).
Excellent problem-solving skills and the ability to work independently and as part of a team.
Strong communication skills and the ability to collaborate effectively with stakeholders.

Roles & Responsibilities:

Deploy and manage large language models (LLMs) in production environments.
Perform benchmarking to evaluate model performance and optimize deployment strategies.
Research and implement new frameworks and technologies for hosting and serving models.
Collaborate with cross-functional teams to integrate machine learning models into existing systems.
Develop and maintain APIs for model inference and other machine learning services.
Monitor and troubleshoot model performance and infrastructure issues.
Stay up-to-date with the latest advancements in machine learning, AI, and related technologies.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Job Details

About Stanley David and Associates

Share