GenAI Developer/ML Engineer/Data Scientist | W2 only

Overview

Hybrid
Depends on Experience
Contract - W2
Contract - 12 Month(s)
50% Travel

Skills

GenAI
LLM
RAG
Python

Job Details

Job Title: GenAI Developer/ML Engineer/Data Scientist

Location: Charlotte, NC (Hybrid-3 Days Onsite, Only local)

Duration: 12 Month contract with Possible extension

Position Overview

This is a Software Engineering / AI Engineering role focused on building and scaling LLM-powered applications. The team has already built and deployed an internal chatbot on a RAG (Retrieval Augmented Generation) framework. The chatbot allows users across the bank to query documents and data interactively, with plans to scale from 1,000 users this year to 3,000+ users next year.

It s not a short-term project this is an ongoing initiative with strong business backing, so the role provides both stability and cutting-edge technical exposure.

Key Skills Needed:

  • LLMs & Inference: Hands-on with Llama 3, Mistral, Quinn (if possible), and VLLM inference engine (biggest priority).
  • Model Serving: Nvidia Triton experience is a big plus.
  • Programming: Strong Python (3.10+), with Flask/FastAPI. Java is secondary but useful for REST services.
  • Databases: Redis + Vector DBs (critical for RAG), and strong SQL.
  • MLOps / Infra: Containers, OpenShift/Kubernetes, CI/CD tools (XLR, Datical), GPU resource management.
  • Critical Thinking: Not just coding must be able to challenge product requirements if they re not technically feasible.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.