Overview
Remote
Depends on Experience
Full Time
No Travel Required
Skills
Databricks
Python
PySpark
ETL
Databricks Certification
Databricks certified Data Engineer
SQL
AWS
Job Details
Role: Sr. Databricks Architect/Data Engineering Architect with AWS, Python, PySpark & Databricks Certification.
Location: 1775 Tysons Blvd, Tysons, VA
Duration: Full-time
Job Description:
Satsyil Corp is currently seeking a highly skilled and motivated Senior Databricks Architect to join our team and contribute to the success of our Enterprise Data Services project. As a Databricks Architect, you will play a crucial role in developing and optimizing Spark applications in AWS Databricks, leveraging your expertise in Python, SQL, and PySpark. The ideal candidate will have a solid background in data engineering and demonstrate proficiency in designing and building efficient ETL pipelines using Apache Spark in the Databricks environment.
Roles and Responsibilities:
- Develop Spark applications in AWS Databricks, using Python, PySpark, and SQL to meet project requirements and data processing needs.
- Design and implement robust ETL pipelines using Apache Spark in Databricks, ensuring data integrity, efficiency, and scalability.
- Collaborate with cross-functional teams to understand business requirements and design solutions that leverage structured, semi-structured, and unstructured data effectively.
- Write high-quality code in a timely manner, adhering to coding standards, best practices, and established development processes.
- Use version control systems like Git to manage the codebase and ensure seamless collaboration within the team.
- Merge and consolidate various data sets using PySpark code, enabling streamlined data processing and analysis.
- Work with APIs to facilitate data ingestion from diverse sources and integrate data into the ecosystem.
- Apply expertise in Databricks Delta Lake to optimize data storage, query performance, and overall data processing efficiency.
- Demonstrate knowledge of application development life cycles and promote continuous integration/deployment practices for efficient project delivery.
- Perform query tuning, performance tuning, troubleshooting, and debugging for Spark and other big data solutions to enhance system efficiency and reliability.
- Exhibit expertise in database concepts and SQL to efficiently manipulate, process, and extract insights from complex datasets.
- Apply database engineering and design principles to ensure data infrastructure meets high standards of scalability, reliability, and performance.
- Leverage previous experience in handling large-scale distributed systems to deliver and operate data solutions efficiently.
- Demonstrate a successful track record of extracting value from extensive, disconnected datasets to drive data-driven decision-making.
Required Qualifications:
- A minimum of 10 years of hands-on experience in Spark, with proficiency in either Python or PySpark.
- Databricks Certified Data Engineer Associate or Professional certification required.
- Strong knowledge of the Databricks platform and previous experience working with it.
- Extensive experience with Apache Spark and a proven history of successful development in this environment.
- Proficiency in at least one of Python or PySpark.
- Previous experience in ETL and data application development, coupled with expertise in version control systems like Git.
- Ability to write PySpark code for data merging and transformation.
- Experience working with APIs for data ingestion and integration.
- Familiarity with Databricks Delta Lake and expertise in query optimization techniques.
- Sound understanding of application development life cycles and continuous integration/deployment practices.
- Proven experience in query tuning, performance tuning, troubleshooting, and debugging Spark and other big data solutions.
- Solid knowledge of database concepts and SQL.
- Strong background in handling large and complex datasets from various sources and databases.
- Proficient understanding of database engineering and design principles.
- A successful history of extracting value from extensive, disconnected datasets to drive data-driven decision-making.
Join our dynamic team of data experts and contribute your skills to shaping the future of our Enterprise Data Services project. Apply now and be part of an exciting journey in data architecture and engineering.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.