=== POSTING ===
MLOps Engineer Bolingbrook, IL
Job Description
We are seeking an experienced MLOps Engineer to operationalize, scale, and maintain enterprise AI/ML platforms across cloud, hybrid, and on-prem environments. This role focuses on building reliable, secure, and observable ML systems supporting LLM workloads, Retrieval-Augmented Generation (RAG), document intelligence, multimodal processing, and predictive ML pipelines.
The ideal candidate brings strong engineering discipline, automation mindset, and hands-on experience delivering production-grade AI systems with robust governance, security, and performance optimization.
Key Responsibilities
- Design, build, and automate end-to-end ML pipelines (data ingestion, feature engineering, training, evaluation, packaging, deployment).
- Implement model CI/CD including versioning, automated testing, canary/blue-green deployments, and rollback strategies.
- Operationalize LLM and RAG systems, including embedding workflows, vector indexing, latency optimization, and grounding quality checks.
- Productionize document intelligence and multimodal pipelines (OCR parsing, enrichment, batch and streaming workflows).
- Establish observability and monitoring for data quality, model drift, safety indicators, inference latency, and error conditions.
- Enforce Responsible AI practices including auditability, reproducibility, governance metadata, lineage tracking, and approval workflows.
- Secure model serving environments through container hardening, IAM, secrets management, and network isolation.
- Optimize GPU/CPU utilization, autoscaling, throughput, and cost efficiency.
- Develop reusable templates, reference architectures, starter repositories, and technical documentation.
Required Skills & Experience - Strong proficiency in Python, CI/CD, Docker, and Kubernetes.
- Hands-on experience operationalizing LLM, RAG, and predictive ML systems in production.
- Solid foundation in data engineering, schema governance, and batch/stream processing.
- Strong security mindset, including PII controls, secrets management, network boundaries, and auditability.
- Experience with Google Cloud Platform, including:
- Vertex AI (training, tuning, deployment, orchestration, model registry, monitoring)
- BigQuery / BigQuery ML
- Cloud Composer and Dataflow
- GKE or Cloud Run for scalable model serving
- Artifact Registry, Cloud Build, and Cloud Deploy for CI/CD
Preferred Qualifications - Experience with agentic reasoning patterns and workflow orchestration.
- Familiarity with LLM evaluation frameworks, grounding validation, bias, and safety checks.
- Contributions to open-source ML or MLOps tools.
**ALL successful candidates for this position are required to work directly for PRIMUS. No agencies please only W2**
For immediate consideration, please contact:
Rachna Gaur
PRIMUS Global Services
Phone:
Email: