AWS Textract / Document AI Engineer

Overview

Remote
$30 - $40
Contract - W2
Contract - Independent
Contract - 12 Month(s)

Skills

Amazon Textract
Textract
Document AI Engineer
AI/ML
SageMaker

Job Details

Job Title: AWS Textract Developer / Document AI Engineer

Location: Remote

Employment Type: Full-Time

Job Summary:

We are seeking an experienced AWS Textract Developer to design and implement intelligent document processing solutions using Amazon Textract and related AWS AI/ML services. The ideal candidate will have strong expertise in extracting printed text, handwriting, tables, and form data from various document types while ensuring high accuracy, scalability, and integration with existing enterprise systems.

Key Responsibilities:

Design, develop, and deploy document processing workflows leveraging Amazon Textract for text, handwriting, and structured data extraction.

Integrate Textract outputs with AWS Lambda, S3, Comprehend, and Rekognition for advanced data enrichment and automation.

Build scalable data pipelines and APIs for processing and delivering extracted data into downstream systems (databases, analytics dashboards, or CRMs).

Optimize accuracy and performance of OCR and layout detection models using Textract capabilities.

Work with unstructured and semi-structured documents such as invoices, forms, contracts, and handwritten notes.

Implement automation for document ingestion, validation, and data normalization.

Collaborate with business analysts, data engineers, and data scientists to define use cases and deliver AI-driven document intelligence.

Ensure compliance, security, and performance best practices across all AWS workloads.

Required Skills & Qualifications:

Strong hands-on experience with Amazon Textract and AWS AI/ML services (Comprehend, Lambda, S3, SageMaker, etc.).

Experience in Python or Node.js for developing Textract processing scripts and APIs.

Familiarity with OCR concepts, document layout analysis, and NLP-based text extraction.

Understanding data pipelines, ETL processes, and AWS Step Functions.

Knowledge of JSON, CSV, and structured output formats for downstream integration.

Experience with AWS SDKs, Boto3, and serverless architecture.

Excellent problem-solving, communication, and documentation skills.

Preferred Qualifications:

Experience with machine learning-based document classification or custom model training.

Familiarity with Amazon Comprehend, OpenSearch, or Athena for post-processing and analytics.

Prior experience working with financial, healthcare, or legal document automation projects.

AWS Certified Developer or AWS Machine Learning Specialty certification is a plus.

Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.