Job Details
Job Title: GenAI Developer/ML Engineer/Data Scientist
Location: Charlotte, NC (Hybrid, 3 days onsite; local candidates only)
Duration: 12-month contract with possible extension
Position Overview
This is a Software Engineering / AI Engineering role focused on building and scaling LLM-powered applications. The team has already built and deployed an internal chatbot on a RAG (Retrieval-Augmented Generation) framework. The chatbot lets users across the bank query documents and data interactively, with plans to scale from 1,000 users this year to 3,000+ users next year.
It's not a short-term project; this is an ongoing initiative with strong business backing, so the role provides both stability and cutting-edge technical exposure.
Key Skills Needed:
- LLMs & Inference: Hands-on experience with Llama 3, Mistral, Qwen (if possible), and the vLLM inference engine (biggest priority).
- Model Serving: NVIDIA Triton experience is a big plus.
- Programming: Strong Python (3.10+), with Flask/FastAPI. Java is secondary but useful for REST services.
- Databases: Redis + Vector DBs (critical for RAG), and strong SQL.
- MLOps / Infra: Containers, OpenShift/Kubernetes, CI/CD tools (XLR, Datical), GPU resource management.
- Critical Thinking: Not just coding; must be able to challenge product requirements if they're not technically feasible.