Overview
Hybrid
Depends on Experience
Contract - W2
Contract - 12 Month(s)
Able to Provide Sponsorship
Skills
Hadoop
Pyspar
Kafka
Big data
Job Details
Position title: Big data developer
Location: Charlotte, NC
Onsite: 3 days a week
Contract: 6-24 months to perm
F2F interview needed
Must have: Hadoop, Pyspark and Kafka knowledge
Key Responsibilities:
- Design and implement scalable data ingestion and transformation pipelines using PySpark or Scala, Hadoop, Hive, and Dremio.
- Build and manage Kafka batch pipelines for reliable data streaming and integration.
- Work with on-prem Hadoop ecosystems (Cloudera, Hortonworks, MapR) or cloud-native big data platforms.
- Develop and maintain RESTful APIs using Python (FastAPI, Flask, or Django) to expose data and services.
- Collaborate with data scientists, ML engineers, and platform teams to ensure seamless data flow and system performance.
- Monitor, troubleshoot, and optimize production data pipelines and services.
- Ensure security, scalability, and reliability across all data engineering components.
- (Optional but valuable) Contribute to the design and deployment of AI-driven RAG systems for enterprise use cases.
Required Skills & Qualifications:
- Experience in Big Data Engineering.
- Strong hands-on experience with PySpark or Scala.
- Deep expertise in on-prem Hadoop distributions (Cloudera, Hortonworks, MapR) or cloud-based big data platforms.
- Proficiency in Kafka batch processing, Hive, and Dremio.
- Solid understanding of REST API development using Python frameworks.
- Familiarity with cloud platforms (Google Cloud Platform, AWS, or Azure).
- Experience or exposure to AI and RAG architectures is a plus.
- Excellent problem-solving, communication, and collaboration skills.
Employers have access to artificial intelligence language tools (“AI”) that help generate and enhance job descriptions and AI may have been used to create this description. The position description has been reviewed for accuracy and Dice believes it to correctly reflect the job opportunity.