Overview
Skills
Job Details
Position Title : Big Data Engineer
Location : NJ/NY (Hybrid)
Experience : 10+ Years Preferred
Employee Type : Full Time with Benefits
Job Description:
We are seeking a highly skilled and experienced Big Data Engineer to join our data engineering team. The ideal candidate will have a strong background in building large-scale distributed data processing systems and enterprise-level Big Data solutions. You will play a key role in designing, developing, and optimizing data pipelines and platforms that support critical business operations.
Primary Skills: Java ,Scala, ETL, Spark, Hadoop, Hive, Impala, Sqoop, HBase, Confluent Kafka, Oracle, Linux, Git, Jenkins CI/CD, etc.
Key Responsibilities:
- Design and implement scalable data processing systems using Big Data technologies.
- Lead end-to-end Big Data solution implementations at enterprise scale.
- Develop and maintain data pipelines using Java, Scala, Spark, and Hadoop ecosystem tools.
- Collaborate with cross-functional teams to understand data requirements and deliver solutions.
- Monitor and troubleshoot performance issues across Hadoop clusters and data jobs.
- Work with Cloudera support to resolve open issues and implement cluster configuration changes.
- Ensure data governance, security, and compliance across platforms.
- Manage batch processing workflows and job scheduling using AutoSys.
- Conduct performance tuning and log analysis for distributed systems.
Required Qualifications:
- Bachelor s or Master s degree in Computer Science, Engineering, Software Engineering, or a related field.
- 10+ years of experience in software development and data engineering.
- Minimum 6 years of experience in leading Big Data solutions with at least one full-cycle implementation.
- Strong programming skills in Java, J2EE, Scala.
- Hands-on experience with Spark, Hadoop, HDFS, YARN, Hive, Impala, HBase, Kafka (Confluent).
- Experience with NoSQL databases, SQL, Elasticsearch, and Oracle.
- Familiarity with Cloudera Hadoop logs, cluster configuration, and performance troubleshooting.
- Knowledge of Hadoop Security, Data Management, and Governance.
- Experience with Linux, Git, Jenkins, and CI/CD pipelines.
Disclaimer
Capgemini is an Equal Opportunity Employer encouraging diversity in the workplace. All qualified applicants will receive consideration for employment without regard to race, national origin, gender identity/expression, age, religion, disability, sexual orientation, genetics, veteran status, marital status or any other characteristic protected by law.
This is a general description of the Duties, Responsibilities and Qualifications required for this position. Physical, mental, sensory or environmental demands may be referenced in an attempt to communicate the manner in which this position traditionally is performed. Whenever necessary to provide individuals with disabilities an equal employment opportunity, Capgemini will consider reasonable accommodations that might involve varying job requirements and/or changing the way this job is performed, provided that such accommodations do not pose an undue hardship.
Capgemini is committed to providing reasonable accommodations during our recruitment process. If you need assistance or accommodation, please reach out to your recruiting contact.
Click the following link for more information on your rights as an Applicant ;br />