Gen AI Engineer

Overview

On Site
$100,000 - $160,000
Full Time

Skills

GenAI
RAG
Python
LLM
Agentic AI

Job Details

Role: Gen AI Engineer

Location: Richardson, TX,Bridgewater, NJ; Sunnyvale, CA; Austin, TX; Raleigh, NC; Richardson, TX; Tempe, AZ; Phoenix, AZ; Charlotte, NC; Houston, TX; Denver, CO; Hartford, CT; New York, NY, Palm Beach, FL; Tampa, FL or Alpharetta, GA, Long Island, NY -Onsite

Job Type : Fulltime

 

 

Required Qualifications

  • Bachelor’s degree in Computer Science, AI/ML, or related field.
  • 4 years of experience in software engineering or data science, with 2–3 years in Gen AI or LLM-based systems.
  • Strong Python programming skills and experience with ML/AI libraries (Hugging Face Transformers, LangChain, PyTorch).
  • Hands-on experience with vector databases (FAISS, Pinecone, Weaviate, Azure AI Search).
  • Familiarity with cloud platforms and Gen AI services (AWS, Azure, Google Cloud Platform).
  • Experience with REST API development (FastAPI, Flask) and containerization (Docker).
  • Solid understanding of AI governance, model safety, and prompt engineering.
  • This position is located in Bridgewater, NJ; Sunnyvale, CA; Austin, TX; Raleigh, NC; Richardson, TX; Tempe, AZ; Phoenix, AZ; Charlotte, NC; Houston, TX; Denver, CO; Hartford, CT; New York, NY, Palm Beach, FL; Tampa, FL or Alpharetta, GA, Long Island, NY or is willing to relocate.
  • Candidates authorized to work for any employer in the United States without employer-based visa sponsorship are welcome to apply. Infosys is unable to provide immigration sponsorship for this role at this time

 

Key Responsibilities

  • Design, develop, and deploy Gen AI applications using LLMs and agentic frameworks (e.g., LangGraph, AutoGen, Crew AI).
  • Fine-tune open-source and proprietary LLMs using techniques like LoRA, QLoRA, and PEFT.
  • Build and optimize RAG pipelines with hybrid retrieval, semantic chunking, and vector search.
  • Integrate Gen AI solutions with cloud-native services (AWS Bedrock, Azure OpenAI, Google Cloud Platform Vertex AI).
  • Work with unstructured data (PDFs, HTML, audio, images) and multimodal models.
  • Implement LLMOps practices including prompt versioning, caching, observability, and cost tracking.
  • Evaluate model performance using tools like RAGAS, DeepEval, and FMeval.
  • Collaborate with product managers, data engineers, and UX teams to deliver production-ready solutions.
  • Mentor junior engineers and contribute to code reviews, design discussions, and best practices.

 

Preferred Qualifications:  

  • Exposure to agentic workflows and autonomous agents.
  • Experience with CI/CD pipelines and DevOps tools (GitHub Actions, Jenkins, Terraform).
  • Familiarity with front-end integration (React, Angular, TypeScript) and GraphQL APIs.
  • Knowledge of model interpretability, bias mitigation, and human-in-the-loop systems.
  • Experience with multimodal models and perception systems (e.g., vision + language).

 

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.