Role : Principal AI Architect
Location : MountainView, CA (Onsite)
AI Architecture & System Design
AI system architectures: multi-agent orchestration layers, RAG pipelines, hybrid retrieval systems (knowledge graphs + vector search), text-to-SQL engines, and real-time inference APIs.
Define and own technical blueprints for new AI products from data ingestion and embedding pipelines through to response generation, evaluation, and production monitoring.
Solve hard engineering problems: latency, precision/recall trade-offs, context window management, hallucination mitigation, and cost-efficient LLM usage at scale.
Make deliberate, well-documented architecture decisions with clear trade-off analysis (build vs. buy, framework selection, deployment topology).
Implementation
Write production-quality code Python, SQL, API services across the full AI lifecycle: data qualification, model training, evaluation, containerised deployment, and API serving.
Build and own reusable, framework-quality components (chunking pipelines, retrieval layers, agent tool-calling modules) that accelerate team velocity.
Own CI/CD pipelines, Docker-based deployment, and production telemetry for AI services.
AI Market Intelligence & Technology Strategy
Track and evaluate the AI landscape new LLMs, agentic frameworks (LangGraph, Google ADK, CrewAI, AutoGen), retrieval methods, fine-tuning techniques, and emerging tooling.
Translate AI market trends into actionable roadmap inputs surfacing opportunities for step change capability improvements before competitors do.
Must-Have Experience
12+ years of hands-on experience in AI/ML engineering and data science, with significant depth in production system delivery.
Deep, working expertise in LLM application development: LangChain, LangGraph, tool-calling agents, RAG, prompt engineering, embedding pipelines, and hybrid retrieval.
Proven track record architecting and shipping multi-agent systems, knowledge graph-powered retrieval (Neo4j or equivalent), and real-time inference APIs.
Strong ML fundamentals: XGBoost, deep learning, NLP, time-series forecasting, propensity modelling, experimental design, and causal inference.
Experience delivering AI systems in regulated industries (financial services, cybersecurity, healthcare) with SOX, GDPR, or SOC 2 compliance awareness.
Expert-level Python and SQL; fluency with Google Cloud Platform, AWS, Docker, FastAPI, BigQuery, FAISS, and CI/CD tooling.
QUALIFICATIONS
Master's or PhD in Computer Science, Operations Research, Statistics, or a related quantitative field
AWS Certified Machine Learning Engineer or Google Cloud Platform Professional ML Engineer certification.
Completion of an AI Strategy or AI Governance programmed.
Prior experience at a data science / ML services firm, enterprise SaaS, or fintech where you shipped AI to external customers, not just internal tools.
Hands-on experience with Snowflake Cortex or comparable enterprise LLM deployment platforms.
Open-source contributions to AI/ML tooling, published technical writing, or conference presentations.