Gen AI Architect


QualiTest
Dice Job Match Score™
🔢 Crunching numbers...
Job Details
Skills
- Amazon SageMaker
- C#
- Artificial Intelligence
- Generative Artificial Intelligence (AI)
- Java
- LangChain
- Large Language Models (LLMs)
- Machine Learning (ML)
- Machine Learning Operations (ML Ops)
- Microsoft Azure
- Python
- Semantic Search
- Vector Databases
- Enterprise Architecture
- Computer Science
- API
- GitLab
- RAG
Summary
Are you interested in working with the World’s leading AI-first Quality Engineering Company? Ready to advance your career, team up with global thought leaders across industries and make a difference every day? Join us at QualityAI (Formerly known as Qualitest)!
We are looking for a Gen AI Architect to join our growing team in United States!
Job Overview:
We are seeking a Generative AI Architect to lead the design, architecture, and delivery of enterprise-grade AI solutions powered by Large Language Models (LLMs), multi-modal AI, and Retrieval-Augmented Generation (RAG) pipelines. The ideal candidate will combine deep technical expertise in AI/ML systems with proven experience in enterprise architecture, ensuring solutions are scalable, secure, compliant, and aligned with business goals.
This role involves defining the technical roadmap for Generative AI initiatives, selecting and integrating AI frameworks, orchestrating model lifecycle management, and guiding cross-functional teams to deliver production-ready Gen AI solutions. You will be the go-to expert for translating high-level business needs into robust, future-proof AI architectures.
Key Responsibilities:
- Architect Gen AI Systems - Design and evolve architectures for LLM-powered applications, RAG workflows, multi-agent AI, and vector search integration.
- Technology Evaluation - Select and recommend AI frameworks, vector databases (Weaviate, Pinecone, Milvus), and orchestration tools (LangChain, LangGraph) that meet performance, scalability, and compliance needs.
- Prompt & Model Strategy - Define prompt engineering standards, fine-tuning approaches, and model governance guidelines for consistent and reliable outputs.
- Scalable API Design - Architect secure, high-performance RESTful APIs (e.g., FastAPI) for AI service integration.
- Data Architecture - Oversee the design and preparation of large, complex datasets (structured/unstructured) for training, fine-tuning, and inference.
- Cloud AI Integration - Architect and deploy AI workloads on AWS (Bedrock, SageMaker), Azure (OpenAI, ML), or Google Cloud Platform (Vertex AI) with multi-cloud readiness.
- Security & Compliance - Ensure solutions adhere to enterprise security policies, AI governance frameworks, and data privacy regulations (GDPR, HIPAA, SOC 2).
- Performance Optimization - Implement GPU optimization, model quantization, caching strategies, and distributed inference for real-time workloads.
- Leadership & Mentorship - Guide engineering and data science teams on best practices in Gen AI architecture, scalability, and ethical AI.
Required Skills and Qualifications:
- Bachelor’s or Master’s degree in Computer Science, Engineering, or related technical discipline.
- 10+ years in software development/architecture, with 3+ years in AI/ML and at least 2 years in Generative AI system design.
- Proven experience architecting and deploying enterprise-scale LLM-based applications.
- Expertise in RAG techniques, vector database design, and semantic search optimization.
- Strong Python proficiency and familiarity with other enterprise languages (Java, C#, Go).
- Proficiency with Generative AI libraries and frameworks (LangChain, Hugging Face, Transformers).
- In-depth knowledge of REST API design, microservices, and event-driven architecture.
- Hands-on with multi-cloud AI services (AWS Bedrock, Azure OpenAI, Google Cloud Platform Vertex AI).
- Experience in MLOps, CI/CD automation (Azure DevOps, GitHub Actions, Jenkins, GitLab CI).
- Strong problem-solving, analytical, and communication skills.
Preferred Qualifications:
- Prior work with regulated industry data (finance, healthcare, insurance).
- Experience integrating multi-modal AI (text, image, audio, video) into enterprise solutions.
- Familiarity with open-source LLMs (LLaMA, Mistral, Ollama).
- AI and cloud architecture certifications (AWS ML Specialty, Azure AI Engineer Associate).
- Dice Id: IBASE
- Position Id: CON_GenAI
- Posted 16 hours ago
Company Info
We are the world's largest pure play quality assurance company. Quality assurance is at the core of our business and everything that we do. Our team of focused specialists provides a broad service offer that goes beyond functional testing to encompass automation.
Mission: To enable every client and every brand to navigate an ever- changing world by delivering smarter quality assurance and testing solutions to meet their precise technology needs - mitigated of risk, exceptional to use and ready to perform.

Similar Jobs
It looks like there aren't any Similar Jobs for this job yet.
Search all similar jobs