Overview
On Site
USD 120,000.00 - 180,000.00 per year
Full Time
Skills
Prototyping
Cloud Computing
Optimization
Technical Writing
Computer Science
Software Development
Python
Vector Databases
Prompt Engineering
Google Cloud Platform
Google Cloud
API
Microservices
Distributed Computing
Version Control
Git
Docker
Kubernetes
Generative Artificial Intelligence (AI)
Orchestration
LangChain
LlamaIndex
Continuous Integration
Continuous Delivery
Microsoft Azure
DevOps
YAML
Terraform
Communication
Collaboration
Technical Direction
Cross-functional Team
Artificial Intelligence
Workflow
Productivity
Job Details
We are looking for an experienced and innovative Generative AI Engineer to join a growing Data & AI team. This position is ideal for a seasoned software engineer with deep expertise in generative AI and a passion for building robust, scalable AI-powered applications. You will lead efforts in designing, prototyping, and deploying GenAI solutions, while contributing to the evolution of the broader AI platform across cloud environments such as Google Cloud Platform (Google Cloud Platform) and Microsoft Azure. Key Responsibilities:
- Architect and implement scalable GenAI solutions aligned with business needs
- Design and prototype LLM-powered applications using methods like Retrieval-Augmented Generation (RAG), prompt engineering, fine-tuning, and vector search
- Develop APIs and reusable components in Python to support AI services
- Utilize orchestration frameworks (e.g., LangChain, LangGraph, LlamaIndex) to build modular AI workflows
- Deploy, monitor, and maintain GenAI pipelines using cloud-native tools, Kubernetes, and serverless platforms
- Analyze system and model performance, identifying opportunities for optimization
- Maintain clear technical documentation and support materials for deployed systems
- Hybrid schedule: 3 days per week onsite in Houston, TX
- Open office environment
- Bachelor's or Master's degree in Computer Science, Artificial Intelligence, or a related field
- 5+ years of professional software development experience with strong proficiency in Python
- 2+ years of hands-on experience developing GenAI or LLM-based applications
- Experience with agent-based AI workflows using orchestration tools such as LangGraph
- Deep understanding of retrieval pipeline design, including chunking strategies, embedding models, and integration with vector databases
- Strong working knowledge of GenAI concepts such as:
- Retrieval-Augmented Generation (RAG)
- Vector databases (e.g., FAISS, Pinecone, ChromaDB)
- Prompt engineering and LLM fine-tuning
- APIs from platforms like OpenAI, Claude, or Gemini
- Experience deploying AI solutions on Google Cloud Platform and/or Azure
- Solid understanding of API development, microservices, and distributed system architecture
- Familiar with version control systems like Git or Azure DevOps
- Proficient with Docker and Kubernetes for containerization and orchestration
- Hands-on experience with GenAI orchestration tools (LangChain, LlamaIndex, etc.)
- Familiarity with DevOps practices, including CI/CD pipelines (e.g., Azure DevOps, YAML) and infrastructure-as-code (e.g., Terraform)
- Strong communication and collaboration skills, with the ability to influence technical direction
- Comfortable working both independently and in cross-functional team environments
- Interest in leveraging AI to enhance development workflows and productivity
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.