Apply Now

Senior MLOps Engineer - Full Time - no 3rd parties

Remote • Posted 30+ days ago • Updated 8 days ago

Full Time

Remote

$140,000 - $180,000/yr

Fitment

Dice Job Match Score™

🔢 Crunching numbers...

Job Details

Skills

sagemaker
AWS
MLops
machine learning
Amazon SageMaker
Amazon Web Services
Machine Learning (ML)
Machine Learning Operations (ML Ops)
DevOps
Orchestration

Summary

100% remote role - work on EST

We are seeking a Senior MLOps Engineer to support large-scale production machine learning environments focused on text, image, and video processing workloads in AWS.

This is a highly operational and infrastructure-focused role. The ideal candidate has hands-on experience deploying, monitoring, scaling, and optimizing ML systems in production environments particularly within AWS SageMaker ecosystems.

This is NOT a data science or model research role. The focus is production reliability, deployment governance, infrastructure scalability, observability, and operational efficiency.

Responsibilities

ML Deployment & Operations

Design, deploy, and support end-to-end production ML pipelines
Manage ML promotion across Dev, QA, and Production environments
Implement deployment standards, rollback strategies, and recovery mechanisms
Support containerized inference and orchestration patterns

AWS & Infrastructure Management

Configure and manage AWS SageMaker pipelines, endpoints, and monitoring
Optimize GPU and CPU infrastructure selection and scaling
Benchmark infrastructure performance and tune autoscaling behavior
Perform load testing and production infrastructure optimization

Monitoring & Reliability

Implement monitoring, alerting, observability, and drift detection
Track latency, throughput, error rates, and model/data drift
Build A/B testing and controlled rollout frameworks
Ensure governance, reproducibility, security, and cost efficiency

Large-Scale ML Workloads

Support production ML systems across text, image, and video workloads
Manage high-throughput infrastructure and large-scale data movement
Prevent compute, networking, and storage bottlenecks
Support systems processing hundreds of thousands of requests daily

Collaboration

Partner closely with ML Engineers, Platform Engineering, DevOps, and Data teams
Operationalize ML models into stable production systems
Help drive scalability, reliability, and infrastructure best practices

Required Qualifications

Strong hands-on experience operating production ML systems at scale
Deep AWS SageMaker experience including:
- Pipelines
- Endpoints
- Monitoring
- Multi-environment deployments
Experience operationalizing PyTorch and TensorFlow models
Experience with containerized ML deployment and orchestration
Experience optimizing GPU/CPU infrastructure for ML workloads
Strong monitoring and observability experience
Experience implementing deployment governance and rollback strategies

Strongly Preferred

Experience supporting:
- Transformer-based NLP systems
- Computer vision workloads
- Ranking/reranking systems
Familiarity with:
- ANN systems
- HNSW indexing
- Large-scale neural network operational workloads
Experience supporting high-volume text, image, and video dataset
Candidates must be able to work directly on W2 or approved independent consulting arrangements.

NO 3RD PARTY FIRMS, LAYERED VENDORS, OR STAFFING PASSTHROUGHS.
Third-party submissions will NOT be reviewed or responded to.

If interested, please send:

Updated resume
Current location
Work authorization status
Availability
Salary expectations

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

Dice Id: 10112164
Position Id: 8972935
Posted 30+ days ago

Contact the job poster

Amy Denenberg

SilverSearch, Inc. Recruiter @ SilverSearch, Inc.

View Profile

Create job alert

Never miss an opportunity! Create an alert based on the job you applied for.

Remote or Chicago, Illinois

•

Today

Summary: The Opportunity Hyatt Hotels Corporation seeks an enthusiastic Senior ML Engineer to join our Data Science and Machine Learning department. In this role, you will be collaborating closely with the broader Data and Analytics team, where you'll be instrumental in continuing to make Hyatt a leading hospitality company. You will be part of a team that is passionate about our purpose, committed to nurturing curiosity and new skills, and building connections across the organization with coll

Full-time

USD 133,200.00 - 173,000.00 per year

DevOps Engineer - Senior Vice President

Remote or New York, New York

•

Today

About the Role The Platform Infrastructure team at iCapital plays a critical role in ensuring that both production and development environments operate smoothly, securely, and reliably. This role leverages advanced cloud capabilities to support the Platform Infrastructure strategy of market agility and lean operating principles, with a strong emphasis on quality to meet the ever-growing demands of our clients. We are seeking highly collaborative, creative, and intellectually curious MLOps/DevO

Full-time

USD 180,000.00 - 230,000.00 per year

Machine Learning Software Engineer II

Remote or Dallas, Texas

•

Today

Cambium Learning Group is an award-winning educational technology solutions leader dedicated to helping all students reach their potential through individualized and differentiated instruction. Using a research-based, personalized approach, Cambium Learning Group delivers SaaS resources and instructional products that engage students and support teachers in fun, positive, safe and scalable environments. These solutions are provided through Learning A-Z (online differentiated instruction for el

Full-time

Machine Learning/AI Engineer

Remote

•

Today

Who is Element? We serve as a partner at the intersection of innovation and our clients' needs, efficiently crafting meaningful user experiences for government and commercial customers. By breaking down complex problems to their fundamental elements, we create modern digital solutions that drive efficiencies, maximize taxpayer dollars, and deliver essential outcomes that serve the people. Why Work at Element? Make an impact that resonates-join our vibrant team and discover how you can improve

Full-time

USD 135,000.00 - 145,000.00 per year

Search all similar jobs

Remote jobs at SilverSearch, Inc.

Senior MLOps Engineer - Full Time - no 3rd parties

Dice Job Match Score™

Job Details

Skills

Summary

Responsibilities

ML Deployment & Operations

AWS & Infrastructure Management

Monitoring & Reliability

Large-Scale ML Workloads

Collaboration

Required Qualifications

Strongly Preferred

Amy Denenberg

Similar Jobs