Python Developer - Pyspark

Overview

On Site
BASED ON EXPERIENCE
Contract - W2
Contract - Independent

Skills

SPARK
PYTHON
PYSPARK
ETL
SCALA
JAVA
SPARK CORE
POWER BI
SQL
HADOOP
HIVE
KAFKA
AWS
GOOGLE CLOUD
GCP
AZURE
SNOWFLAKE
DATABRICKS
SPARK SQL
SPARK STREAMING
AB INITIO
DATA FRAMES

Job Details

APN Consulting has an immediate need for a direct client requirement:

Python Developer - Pyspark
Location - Charlotte, NC (Hybrid)
Long-term Contract

Key Responsibilities:

  • Perform data manipulation using Python scripts, Spark, and Scala for faster data processing.
  • Create parameterized queries and generate tabular reports, sub-reports, cross-tabs, and drill-down reports using expressions, functions, charts, and maps, including sorting data and defining data sources and subtotals for the reports.

Required Skills

  • Experience developing Spark applications using Spark SQL in Databricks for data extraction, transformation, and aggregation from multiple file formats, analyzing and transforming the data to uncover insights into customer usage patterns
  • Good understanding of Spark architecture, including Spark Core, Spark SQL, T-SQL, DataFrames, Spark Streaming, driver node, worker node, stages, executors, and tasks
  • Domain knowledge of finance, logistics, and health insurance
  • Strong skills in visualization tools: Power BI, Excel formulas, pivot tables, charts, and DAX commands
  • Expertise in various phases of the project life cycle: design, analysis, implementation, and testing
  • ETL tools: Ab Initio, GDE, Ab Initio Co Op
  • Databases: Teradata, Oracle 10g and 11g, Exadata, MySQL, SQL Server 2008, DB2, Cosmos DB, Hadoop, Hive, Snowflake
  • Other tools: Autosys, Control-M, Client ALM, Jenkins, IBM UDeploy, GitHub, WinSCP, MS Visio, AWS services, ServiceNow, AUTOMIC AROW, Airflow, Windows, PowerShell, Python, IDLE, Kafka, Google Cloud Platform, Azure
  • Big data technologies: Hadoop, Spark