Senior ML Engineer (Document and Image Processing)

Overview

Remote
Depends on Experience
Contract - Independent
Contract - W2
Contract - 5 Year(s)

Skills

machine learning
image processing
docuemnt processing
Microsoft DiT

Job Details

Senior Machine Learning Engineer (Document and Image Processing)
Location: Remote
Employment Type: W2 / C2C
Experience Level: 10+ years (Requires at least 2 years of stay in the U.S.A)


VA Tech Solutions is seeking a Senior Machine Learning Engineer with deep expertise in document and image processing. This role will drive the development of end-to-end machine learning pipelines for scanned documents and unstructured image data, while applying strong ML fundamentals such as feature engineering, model selection, hyperparameter tuning, and evaluation metrics.

  • Key Responsibilities

    • Develop machine learning pipelines for document classification, OCR, layout detection, and key-value extraction using computer vision and NLP techniques

    • Perform data preprocessing, feature extraction, and augmentation tailored to document and image formats

    • Train and optimize models using AWS SageMaker, managing full ML lifecycle from experimentation to deployment

    • Implement and maintain CI/CD pipelines using GitLab for model retraining and integration with production services

    • Build and manage cloud infrastructure using Terraform and Ansible to support scalable document processing workflows

    • Apply model evaluation techniques such as cross-validation, precision/recall, F1-score, and confusion matrices to ensure robust performance

    • Incorporate techniques for model interpretability, drift detection, and continuous improvement

    • Collaborate in Agile teams with data scientists and engineers to ensure ML models are deployed, monitored, and improved over time


    Required Qualifications

    • 10+ years of professional experience in machine learning or data science, with a focus on document/image understanding

    • Good to have MicroSoft DiT and OCR (Document Image Transformers)
    • Strong proficiency in Python, along with libraries such as NumPy, Pandas, Scikit-learn, OpenCV, and TensorFlow or PyTorch

    • Solid understanding of supervised and unsupervised learning, model tuning, feature importance, and bias/variance trade-offs

    • Hands-on experience with AWS ML tools including SageMaker, Lambda, S3, Step Functions, and CloudWatch

    • Familiarity with containerization (Docker) and orchestration tools (Kubernetes on AWS EKS)

    • Experience implementing monitoring and logging for deployed ML models

    • Comfort with code quality tools and DevOps practices for ML pipelines (Terraform, Ansible, GitLab CI/CD)

 

 

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.

About VA Tech Solutions Inc