Gen AI Architect

Overview

Hybrid
$160,000 - $170,000
Full Time
Able to Provide Sponsorship

Skills

Gen AI
AI
ML
Architect
Langchain
GCP

Job Details

Responsibilities:
Define and own the GenAI solution architecture, including model selection, fine-tuning strategy, retrieval-augmented generation (RAG), vector stores, and prompt engineering pipelines.
Lead the design, development, and delivery of high-quality, scalable software applications supporting Generative AI initiatives.
Collaborate with Data Scientists, MLOps, Cloud Architects, Product Managers, and Legal teams to deliver safe, ethical, and compliant AI.
Apply hands-on ML engineering experience to deploy models from development to production and troubleshoot any issues that arise.
Develop Minimum Viable Products (MVPs) based on both clearly defined and evolving requirements, iterating quickly to meet rapidly changing needs.
Mentor and guide engineers of all levels, fostering a collaborative and high-performing team environment.
Effectively communicate technical concepts and project updates to engineers, leaders, and executives.
Lead a small squad of engineers, providing technical guidance.
Proactively identify and resolve complex technical challenges within the environment.
Required Skills & Experience:
5+ years of experience in software application development with a strong focus on Python.
Advanced proficiency in data engineering principles and technologies.
Advanced proficiency in SQL and database design.
Strong API development experience, specifically with FastAPI.
Extensive experience developing both analytical and operational applications/systems.
Deep understanding of software development best practices, including testing, deployment, and troubleshooting.
Experience with Apache Kafka.
Working knowledge of Snowflake.
Working knowledge of ML engineering and of deploying models from development to production.
Experience building GenAI apps using Python, PyTorch, LangChain, or equivalent.
Proficiency with cloud AI services: Google Cloud Platform Vertex AI.
Ability to adapt quickly and efficiently to a new and complex environment.
Excellent communication and collaboration skills.
Ability to work on-site in San Antonio or Plano, TX (RTO policy: 4 days in office with manager's discretion for flexibility).
Preferred Skills & Experience:
Java development experience (including Java APIs and Spring/Spring Boot).
Experience with dbt (Data Build Tool).
JavaScript and front-end development experience.
Experience with AWS and Google Cloud Platform.
Qualifications:
- Bachelor's or master's degree in Computer Science, Engineering, or a related field.
- Experience with ML Engineering with additional exposure to Data Engineering
- Certification in software engineering, ML, or a related field is a plus.
- Experience
Must-Have Technical/Functional Skills:
Python, SQL and database design, FastAPI, Kafka, Snowflake, dbt, PyTorch, LangChain or equivalent, Google Cloud Platform Vertex AI, AWS, Google Cloud Platform, Java development experience (including Java APIs and Spring/Spring Boot).
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About Stanley David and Associates