AI Engineer Small Language Models (SLM)
Location: Charlotte, NC / Philadelphia, PA (Hybrid)
Duration: Long-Term Contract
Experience Required: 4 5 Years
Job Summary
We are seeking an AI Engineer with hands-on experience building, fine-tuning, and deploying Small Language Models (SLMs) and enterprise AI solutions. The ideal candidate will have strong expertise in model optimization, inference pipelines, Retrieval-Augmented Generation (RAG), and deploying AI applications in cloud environments.
This role offers an opportunity to work on cutting-edge AI initiatives, developing scalable and production-ready intelligent systems that support enterprise business objectives.
Key Responsibilities
- Design, build, fine-tune, and optimize Small Language Models (SLMs) for enterprise use cases.
- Develop and deploy AI/ML solutions using Python and modern AI frameworks.
- Build and maintain Retrieval-Augmented Generation (RAG) pipelines, vector search systems, and knowledge retrieval solutions.
- Fine-tune, evaluate, and improve transformer-based language models.
- Integrate AI-powered solutions into enterprise applications and workflows.
- Develop scalable inference pipelines and optimize model performance.
- Implement MLOps practices, CI/CD pipelines, and cloud-based deployment strategies.
- Collaborate with product managers, software engineers, data scientists, and business stakeholders to deliver AI solutions.
- Monitor and improve model performance, reliability, and scalability in production environments.
- Stay current with emerging AI technologies, frameworks, and best practices.
Required Qualifications
- 4 5 years of experience in Software Engineering, Machine Learning Engineering, or AI Engineering.
- Strong programming expertise in Python.
- Hands-on experience working with Small Language Models (SLMs) and Large Language Models (LLMs).
- Experience building and deploying AI solutions in production environments.
- Strong understanding of machine learning workflows, model evaluation, and deployment strategies.
- Experience developing enterprise-grade AI applications.
Required Technical Skills
- Python
- Small Language Models (SLMs)
- Large Language Models (LLMs)
- Retrieval-Augmented Generation (RAG)
- LangChain
- Hugging Face
- LlamaIndex
- Vector Databases
- Embeddings
- Machine Learning Model Deployment
- AWS Cloud Services
- REST APIs & Application Integration
- CI/CD Pipelines
- MLOps Practices
Preferred Qualifications
- Experience with enterprise AI platforms and AI application frameworks.
- Knowledge of model optimization techniques and inference performance tuning.
- Experience deploying and managing AI workloads in cloud environments.
- Familiarity with scalable AI architecture and distributed systems.
- Exposure to regulated or enterprise-scale environments.
Nice to Have
- Prompt Engineering experience.
- Multi-Agent AI Systems development.
- Conversational AI and chatbot implementation experience.
- Experience working with advanced AI orchestration frameworks.
Key Competencies
- AI Solution Design
- Problem Solving
- Analytical Thinking
- Innovation & Continuous Learning
- Cross-Functional Collaboration
- Communication Skills
- Production AI Deployment
- Performance Optimization