Overview
Skills
Job Details
Title: Generative AI Engineer
Location: Houston TX
Duration: 12+ Months
Position Summary:
About the Role:
We are looking for a highly motivated and skilled Generative AI Engineer to join our AI/ML team. You will be responsible for designing, building, and deploying advanced generative models that leverage cutting-edge techniques in deep learning, NLP, and multimodal learning. Your work will directly contribute to products and solutions that transform user experiences through content generation, automation, and intelligent interaction.
Key Responsibilities:
Develop, fine-tune, and deploy generative models (e.g., GPT, LLaMA, Stable Diffusion, DALL E, etc.)
Build scalable and efficient pipelines for training, evaluation, and inference of generative models
Collaborate with cross-functional teams (engineering, product, design, research) to integrate models into real-world applications
Stay current with research advancements in generative AI and propose innovative applications
Optimize model performance and reduce latency for production environments
Ensure ethical and responsible use of generative technologies, including bias mitigation and model interpretability
Contribute to documentation, code quality, and model governance best practices
Required Qualifications:
Bachelor's or Master's degree in Computer Science, Machine Learning, Mathematics, or related field (PhD preferred for research-focused roles)
Strong experience with deep learning frameworks (e.g., PyTorch, TensorFlow, Hugging Face Transformers)
Hands-on experience with LLMs, diffusion models, or VAEs in production or research
Solid programming skills in Python; familiarity with cloud platforms (AWS, Google Cloud Platform, Azure) is a plus
Familiarity with prompt engineering, fine-tuning, LoRA/QLoRA, and model compression techniques
Strong understanding of NLP, computer vision, and generative model architectures
Experience working with large-scale datasets and distributed training
Preferred Qualifications:
Published research in top AI/ML conferences (NeurIPS, ICML, CVPR, ACL, etc.)
Experience with retrieval-augmented generation (RAG), vector databases (e.g., FAISS, Pinecone), and prompt optimization
Exposure to multi-modal generative systems combining text, image, audio, or video
Experience building tools, APIs, or applications powered by generative AI
Why Join Us:
Work with cutting-edge generative AI technologies and shape the future of intelligent applications
Collaborate with world-class engineers and researchers in a fast-paced, innovative environment
Flexible work arrangements and competitive compensation
Opportunity to drive high-impact projects with real-world applications