Senior Data Engineer

Overview

Remote
Hybrid
Full Time
Accepts corp to corp applications
Contract - W2
Contract - 1 year

Skills

AWS
Python
SQL
Spark
Azure

Job Details

Responsibilities:
  • Design, construct, install, test, and maintain highly scalable data management systems on Azure and AWS, utilizing services such as Azure Data Lake, Azure Data Factory, AWS S3, and AWS Glue.
  • Integrate new data management technologies and software engineering tools into existing structures, focusing on Azure and AWS ecosystems.
  • Use Azure Databricks and AWS EMR for big data processing, building dataset processes for data modeling, mining, and production.
  • Develop analytics applications using Azure Synapse Analytics and AWS Redshift to leverage the data pipeline, providing actionable insights into key business performance metrics.
  • Collaborate with stakeholders across the Executive, Product, Data, and Design teams to support their data infrastructure needs, using Azure and AWS cloud services.
  • Implement data security and compliance procedures using Azure Security Center and AWS Identity and Access Management (IAM).
  • Evaluate and improve existing systems, incorporating new technologies and software engineering tools available in Azure and AWS ecosystems.
  • Monitor analytics and metrics results using tools like Azure Monitor and AWS CloudWatch.
  • Implement systems for tracking data quality and consistency using Azure and AWS services, ensuring data integrity across the pipeline.
  • Automate data workflows and pipeline processes using Azure Logic Apps and AWS Step Functions, integrating with Apache Airflow for orchestration.
Qualifications:
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
  • Proven experience as a Data Engineer or in a similar role, with a deep understanding of Azure and AWS cloud services relevant to data engineering (e.g., Azure Data Lake, Azure Data Factory, AWS S3, AWS Glue).
  • Expertise in SQL and programming languages, especially Python, for data manipulation and analysis.
  • Strong experience with big data tools such as Apache Spark and Hadoop.
  • Proficiency in data pipeline and workflow management tools, particularly Apache Airflow.
  • Experience with data modeling, data warehousing, and building ETL pipelines in both Azure (using Azure Data Factory, Azure Databricks) and AWS environments (using AWS Glue, AWS Data Pipeline).
  • Knowledge of stream-processing systems (e.g., Apache Kafka, Azure Stream Analytics, Amazon Kinesis) is a plus.
  • Familiarity with machine learning algorithms and data science principles is beneficial.
  • Strong problem-solving skills and the ability to manage projects and multitask.
  • Excellent communication and teamwork skills, with the ability to work and negotiate with stakeholders across the organization.

About CodeMax Consulting, LLC