Overview
Skills
Job Details
Location: McLean, VA
Work Setting: Onsite
Key Responsibilities:
Write effective Python code for data extraction, transformation, and loading (ETL)
Perform data manipulation and analysis using Pandas, NumPy, and SQL
Work with structured and unstructured data sources (Excel, CSV, text files) from a Data Lake environment
Implement robust error handling and exception handling in Python scripts
Develop and maintain unit tests and regression tests to ensure data accuracy and code stability
Collaborate with data engineers, analysts, and business teams to gather requirements and deliver data solutions
Optimize and refactor existing code for better performance and scalability
Required Skills:
Strong programming experience in Python
Proficiency in Pandas, NumPy, and working with SQL queries
Experience working with Data Lakes and processing large volumes of data
Ability to parse and consolidate data from Excel, CSV, and plain text formats
Hands-on experience writing unit tests, performing regression testing, and implementing error/exception handling