Hi,
Job Title: GenAI Engineer (Java/Python Developer Background)
Location: Sunnyvale, CA/Charlotte, NC - Hybrid 3X per week
Duration: 12 months
Employment Type: W2 Contract
Experience: 5+ Years
Required Skills
5+ years of experience in software development using Java and/or Python
Strong experience with Generative AI and Large Language Models (LLMs)
Hands-on experience with LangChain, LangGraph, LlamaIndex, or similar frameworks
Experience building RAG (Retrieval-Augmented Generation) applications
Experience integrating AI models from OpenAI, Anthropic, Gemini, Azure OpenAI, or open-source LLMs
Strong understanding of prompt engineering and prompt optimization
Experience with vector databases such as Pinecone, ChromaDB, Weaviate, Milvus, or FAISS
Experience developing and consuming REST APIs
Strong knowledge of Spring Boot (Java) or FastAPI/Flask (Python)
Experience with SQL and NoSQL databases
Familiarity with Docker and Kubernetes Experience with cloud platforms such as AWS, Azure, or Google Cloud Platform
Knowledge of Git and CI/CD pipelines
Strong debugging and performance optimization skills
Roles & Responsibilities
Design and develop enterprise-grade GenAI applications using Java or Python.
Build and optimize RAG pipelines leveraging vector databases and embeddings.
Develop AI-powered chatbots, assistants, and intelligent automation solutions.
Integrate enterprise applications with LLM APIs and AI services.
Design scalable backend APIs to support AI-driven products.
Create and maintain prompt templates for optimal model performance.
Develop agentic workflows using modern AI orchestration frameworks.
Collaborate with data engineers and ML teams to build end-to-end AI solutions.
Optimize latency, scalability, and cost of AI applications.
Implement AI monitoring, logging, and observability solutions.
Build secure and production-ready AI services following best practices.
Participate in code reviews, architecture discussions, and technical design sessions.
Troubleshoot production issues and continuously improve AI application performance.
Stay current with the latest advancements in Generative AI, LLMs, and AI frameworks.