Enterprise Data Architect

Overview

On Site
Depends on Experience
Contract - W2
Contract - Independent
Contract - 12 Month(s)

Skills

Data Warehouse/Big Data Architect
Apache Spark processing framework
spark programming languages
functional programming
Spark SQL programming
SQL
MapReduce
HDFS
Hive/Impala
AWS Athena
Redshift
S3
Teradata
HBase
MySQL/Postgres
MongoDB
AWS cloud

Job Details

Job Duties and Responsibilities

  • Deploy enterprise-ready, secure, and compliant data-oriented solutions leveraging Data Warehouse, Big Data and Machine Learning frameworks.
  • Optimizing data engineering and machine learning pipelines
  • Reviews architectural designs to ensure consistency & alignment with defined target architecture and adherence to established architecture standards.
  • Support data and cloud transformation initiative.
  • Contribute to our cloud strategy based on prior experience.
  • Understand the latest technologies in a rapidly innovative marketplace.
  • Independently work with all stakeholders across the organization to deliver point and strategic solutions.
  • Assist solution providers with the definition and implementation of technical and business strategies.

Skills - Experience and Requirements - A successful Architect will have the following:

  • Should have prior experience in working as a Data Warehouse/Big Data Architect.
  • Experience in advanced Apache Spark processing framework, spark programming languages such as Scala/Python/Advanced Java with sound knowledge in shell scripting.
  • Should have experience in both functional programming and Spark SQL programming dealing with processing terabytes of data.
  • Specifically, this experience must be in writing Big Data data engineering jobs for large scale data integration in AWS. Prior experience in writing Machine Learning data pipeline using Spark programming language is an added advantage.
    Advanced SQL experience including SQL performance tuning is a must.
  • Should have worked on other big data frameworks such as MapReduce, HDFS, Hive/Impala, AWS Athena.
  • Experience in logical & physical table design in Big Data environment to suite processing frameworks.
  • Knowledge of using, setting up and tuning resource management frameworks such as Yarn, Mesos or standalone spark.
  • Experience in writing spark streaming jobs (producers/consumers) using Apache Kafka or AWS Kinesis is required.
  • Should have knowledge in variety of data platforms such as Redshift, S3, Teradata, HBase, MySQL/Postgres, MongoDB
  • Experience in AWS services such as EMR, Glue, S3, Athena, DynamoDB, IAM, Lambda, Cloud watch and Data pipeline.
  • Must have used the technologies for deploying specific solutions in the area of Big Data and Machine learning.
  • Experience in AWS cloud transformation projects are required.
  • Telecommunication experience is an added advantage.