Data Engineer with PySpark

  • Rocky Hill, CT
  • Posted 21 days ago | Updated moments ago

Overview

On Site
BASED ON EXPERIENCE
Full Time
Contract - W2
Contract - Independent

Skills

JSON
PYSPARK
PYTHON
CI/CD

Job Details

Job Title: Data Engineer with PySpark -W2 only
Location: Rancho Cucamonga,CA/ (Onsite Role)


We are open for Visa Sponsorship



As a Data Engineer, you will play a key role in designing and implementing robust data pipelines and infrastructure to support our data-driven initiatives.
Key Responsibilities:
  • Design, develop, and maintain scalable data pipelines using PySpark.
  • Collaborate with data scientists and analysts to understand data requirements and implement solutions.
  • Parse JSON files to SQL Tables using PySpark.
  • Build automation pipelines for Continuous Integration/Continuous Deployment (CI/CD) using Azure DevOps or similar tools.
  • Optimize data workflows for performance, reliability, and efficiency.
  • Implement data quality checks and monitoring processes to ensure data integrity.
  • Troubleshoot and resolve issues related to data pipeline performance and reliability.
  • Document technical designs, processes, and procedures.
Requirements:
  • Bachelor's degree in Computer Science, Engineering, or related field.
  • Proven experience working as a Data Engineer in a professional setting.
  • Hands-on experience with PySpark for data processing and analysis.
  • Strong proficiency in SQL and relational databases (e.g., PostgreSQL, MySQL).
  • Experience building automation pipelines for CI/CD using Azure DevOps or similar tools (e.g., Jenkins, GitLab CI).
  • Experience Parsing JSON files to SQL Tables suing Python/PySpark.
  • Familiarity with cloud platforms such as Azure, AWS, or Google Cloud Platform.
  • Excellent problem-solving skills and attention to detail.
  • Strong communication and collaboration skills.
Nice to Have:
  • Experience with containerization technologies (e.g., Docker, Kubernetes).
  • Knowledge of big data technologies (e.g., Hadoop, Spark).
  • Familiarity with data warehousing concepts and tools (e.g., Snowflake, Redshift).
  • Understanding of machine learning concepts and algorithms.
Why Join Us:
  • Opportunity to work on cutting-edge data technologies and projects.
  • Collaborative and supportive team environment.
  • Competitive salary and benefits package.
  • Career growth and development opportunities.
If you are passionate about leveraging data to drive business impact and thrive in a fast-paced environment, we would love to hear from you! Apply now to join our team as a Data Engineer and contribute to our mission of transforming data into insights.

Infowave Systems is an equal opportunity employer that is committed to diversity and inclusion in the workplace.