Overview
Skills
Job Details
Role- Senior AWS Data Architect (AWS)
Location- Atlanta, GA-Hybrid 3 days
Duration- 12+ Months
Must Have Skills-
- Data Architect, PyTorch, ML experience
- Data Architect
- ETL
- CloudTrail, Cloud Watch
- AWS
Required Qualifications
- 7+ years in data architecture, data engineering, or a related field.
- Expert-level knowledge of AWS Glue, S3, Athena, Lambda, Step Functions, IAM.
- Strong hands-on experience with ETL/ELT, distributed processing, and job optimization.
- Expertise in DAG-based orchestration: Airflow, MWAA, Step Functions, or Glue Workflows.
- Experience supporting ML pipelines and PyTorch-based workflows.
- Strong proficiency in Python, SQL, and scalable data-processing frameworks.
- Solid understanding of OLTP, OLAP, and Lakehouse data-modeling patterns.
- Experience with CI/CD and DevOps practices on AWS.
Preferred Qualifications
- AWS certifications (e.g., Data Analytics Specialty, Solutions Architect).
- Experience with EMR, Redshift, Kinesis, Kafka.
- Knowledge of MLOps tools: SageMaker, MLflow, Feature Stores.
- Familiarity with IaC tools: Terraform, CloudFormation.
- Experience working in enterprise-scale, highly regulated environments.
Overview
We are seeking a highly skilled Senior Data Architect to design, build, and optimize enterprise data platforms on the AWS ecosystem. The ideal candidate brings deep experience in AWS Glue, distributed data processing, DAG-based orchestration, and machine-learning pipeline enablement (including PyTorch).
This role requires a blend of hands-on architecture, strategic planning, and technical leadership.
Key Responsibilities-
Data Architecture & Design
- Design end-to-end architectures across batch, streaming, and real-time use cases.
- Develop scalable data models, lakehouse structures, and metadata strategies aligned with business needs.
- Architect ETL/ELT solutions using AWS Glue Jobs, Glue Data Catalog, Glue Studio, and Glue Workflows.
- Design systems leveraging AWS services: S3, Glue, Lake Formation, Athena, Redshift, IAM, KMS, VPC, CloudWatch, CloudTrail.
- Ensure adherence to the AWS Well-Architected Framework (security, cost, performance, reliability).
Data Engineering & Pipelines
- Build and optimize pipelines using Glue, Lambda, Step Functions, EMR, Athena, and S3.
- Implement DAG-based orchestration with Apache Airflow, AWS MWAA, or Glue Workflows.
- Establish robust processes for data quality, lineage, reliability, and observability.
Machine Learning Pipeline Enablement
- Collaborate with ML teams to productionize PyTorch models.
- Architect feature store solutions, data-prep workflows, and training/inference pipelines.
- Optimize storage and compute architectures for large-scale model training and batch inference.
Security, Governance & Compliance
- Define and enforce data-governance policies, IAM architectures, and encryption standards.
- Ensure compliance with frameworks such as GDPR, HIPAA, etc.
- Implement metadata and lineage strategies via Glue Data Catalog and related tools.
- Utilize AWS KMS, Secrets Manager, Parameter Store for encryption and secrets management.
Technical Leadership
- Establish architectural standards and guide teams on AWS best practices.
- Mentor data engineers, analysts, and ML engineers.
- Evaluate and drive adoption of scalable, cost-efficient cloud solutions.
- Lead cost-optimization initiatives and architecture reviews.
Shivam Kumar
Technical recruiter | Empower Professionals
......................................................................................................................................
| Phone: x 336 | | Fax:
100 Franklin Square Drive Suite 104 | Somerset, NJ 08873
Certified NJ and NY Minority Business Enterprise (NMSDC)
E