Senior BigData Developer/ Engineer
Location: McLean, VA 22102
Duration: 12+ months with possible extension
Only for 10+ yeras of expereicned candidate with below mandatory Skills-
Strong with Python
Python programming experience with Object oriented and Multithreading concept.
Experience with Cluster area (Batch processing)
Some experience with Spark
Nice to have:
• Bachelor's degree in Computer Science or Information Technology
- Cleanse, manipulate and analyze large datasets (Structured and Unstructured data – XMLs, JSONs, PDFs) using Hadoop platform.
- Develop Python, PySpark, Spark scripts to filter/cleanse/map/aggregate data.
- Manage and implement data processes (Data Quality reports)
- Develop data profiling, deduping logic, matching logic for analysis
- Programming Languages experience in Python, PySpark and Spark for data ingestion
- Programming experience in BigData platform using Hadoop platform