job summary:
Are you a visionary technologist ready to transform experimental AI models into scalable, enterprise-grade applications? We are partnering with a premier technology solutions provider seeking a Senior AI Engineer to lead the architecture and deployment of cutting-edge, cloud-based commercial AI products. In this pivotal role, you will bridge the gap between data science prototypes and production software, ensuring that complex AI solutions are robust, ethical, and highly efficient. This permanent placement opportunity offers an excellent environment to shape the future of AI technology, along with a comprehensive benefits package including medical, dental, and vision coverage to keep you and your family healthy.
location: Cleveland, Ohio
job type: Permanent
salary: $180,000 - 200,000 per year
work hours: 9am to 5pm
education: No Degree Required
responsibilities:
What you'll be doing in this role:
- Lead the implementation of rigorous evaluation frameworks to monitor model performance, drift, and cost in real-time.
- Architect and develop high-performance backend services and APIs using Python (FastAPI) to serve large language models at scale.
- Design advanced Retrieval-Augmented Generation (RAG) systems, selecting and managing vector databases and optimizing embedding strategies for accuracy and speed.
- Establish comprehensive model observability and guardrail systems to monitor real-time performance, detect distribution drift, and implement automated safety filters that mitigate hallucinations, bias, and toxic outputs in production environments.
- Build robust integration layers that connect AI agents securely to external enterprise systems, CRMs, and legacy databases.
- Conduct code reviews, provide technical guidance, and foster a culture of continuous learning and innovation within the engineering team.
- Collaborate with infrastructure teams to define deployment strategies, ensuring solutions scale dynamically under load.
- Define the end-to-end architecture for AI products on cloud platforms (preferably Google Cloud Platform), ensuring high availability, security, and cost-effectiveness.
qualifications:
6+ years of professional software engineering experience, with at least 3 years explicitly dedicated to AI and Machine Learning application development.
Expert-level proficiency in Python application development and modern API architecture (REST, GraphQL, gRPC) utilizing enterprise standards like static type checking.
Extensive hands-on experience building production-grade applications using modern LLM frameworks (such as LangChain, LangGraph, or LlamaIndex).
Deep understanding of vector databases (e.g., Pinecone, Weaviate, PostgreSQL) and advanced search algorithms.
Strong command of LLMOps principles, including model registry, versioning, and serving infrastructure within a cloud environment (specifically Google Cloud).
Familiarity with TypeScript for rapid prototyping and building robust integration layers.
Solid understanding of standard application development lifecycles and Git workflows.
Equal Opportunity Employer: Race, Color, Religion, Sex, Sexual Orientation, Gender Identity, National Origin, Age, Genetic Information, Disability, Protected Veteran Status, or any other legally protected group status.
At Randstad Digital, we welcome people of all abilities and want to ensure that our hiring and interview process meets the needs of all applicants. If you require a reasonable accommodation to make your application or interview experience a great one, please contact
Pay offered to a successful candidate will be based on several factors including the candidate's education, work experience, work location, specific job duties, certifications, etc. In addition, Randstad Digital offers a comprehensive benefits package, including: medical, prescription, dental, vision, AD&D, and life insurance offerings, short-term disability, and a 401K plan (all benefits are based on eligibility).
This posting is open for thirty (30) days.
![]()