Job Title: Senior AI Engineer
Location: Santa Clara, CA
Duration: 5+ Months
Agentic Architecture: Design and deploy multi-agent systems that utilize reflection, self-correction, and planning loops to solve open-ended business problems.
Orchestration & State Management: Build complex, cyclic AI workflows using LangGraph to handle long-running tasks, state persistence, and error recovery.
Model Optimization: Implement and fine-tune Gemini (2.5 Pro/Flash) models within the Vertex AI ecosystem, optimizing for latency, token efficiency, and reasoning quality.
Tool Integration: Develop secure, robust interfaces for agents to interact with external APIs, databases (SQL & Vector), and legacy enterprise systems.
Advanced RAG: Architect "Agentic RAG" systems that don't just retrieve data, but intelligently route queries, verify source relevance, and synthesize multi-step answers.
Observability & MLOps: Set up advanced monitoring for agentic behavior, including tracing "chain-of-thought" reasoning, cost tracking, and automated evaluation of agent performance.
Skills:
Core Languages: 7+ years of experience with Python, with deep expertise in asynchronous programming and building scalable APIs (FastAPI/Flask).
Agentic Frameworks: Proven track record of shipping production-grade applications using LangGraph
Google Cloud Platform Ecosystem: Expert-level experience with Vertex AI, including Model Garden, Vertex AI Search & Conversation, and deploying models via Vertex Endpoints.
LLM Expertise: Deep understanding of Gemini capabilities (function calling, multimodal input, long-context window management).
Vector Infrastructure: Hands-on experience with Vertex AI Vector Search (formerly Matching Engine) or similar databases like Pinecone/Weaviate.
Engineering Excellence: Strong grasp of CI/CD for AI, containerization (Docker/GKE), and "eval-driven development" to measure agent accuracy.
Experience with Model Context Protocol (MCP) for standardized tool-calling.