Overview
Skills
Job Details
Skills :
4 - 7 years of experience in DevOps, MLOps, platform engineering, or cloud infrastructure.
Strong skills in containerization (Docker, Kubernetes), API hosting, and cloud-native services.
Experience with vector DBs (e.g., FAISS, Pinecone, Weaviate) and model hosting stacks.
Familiarity with logging frameworks, APM tools, tracing layers, and prompt/versioning logs.
Bonus: exposure to LangChain, LangGraph, LLM APIs, and retrieval-based architectures.
Responsibilities :
Set up and manage runtime environments for LLMs, vector DBs, and orchestration flows (e.g., LangGraph).
Support deployments in cloud, hybrid, and client-hosted environments.
Containerize systems for deployment (Docker, Kubernetes, etc.) and manage inference scaling.
Integrate observability tooling: prompt tracing, version logs, eval hooks, error pipelines.
Collaborate on RAG stack deployments (retriever, ranker, vector DB, toolchains).
Support CI/CD, secrets management, error triage, and environment configuration.
Contribute to platform-level IP, including reusable scaffolding and infrastructure accelerators.
Ensure systems are compliant with governance expectations and auditable (esp. in insurance contexts).
Preferred Attributes :
Systems thinker with strong debugging skills..
Able to work across cloud, on-prem, and hybrid client environments.
Comfortable partnering with architects and engineers to ensure smooth delivery.
Proactive about observability, compliance, and runtime reliability.