Req ID: 105432
NTT DATA Services strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.
We are currently seeking a Spark Engineer with Hadoop to join our team in Irving, Texas (US-TX), United States (US).
-Analyze and understand data sources & APIs
-Design and develop methods to connect to and collect data from different data sources
-Design and develop methods to filter and cleanse the data
-Design and develop SQL queries, Hive queries, and APIs to extract data from the store
-Work closely with data scientists to ensure the source data is aggregated and cleansed
-Work with product managers to understand the business objectives
-Work with cloud and data architects to define a robust cloud architecture and set up pipelines and workflows
-Work with DevOps to build automated data pipelines
-4+ years of Spark
-3+ years of Hadoop ecosystem and Big Data technologies
-3+ years of Hadoop (Cloudera) or cloud technologies, building pipelines using Spark/PySpark
-2+ years of the Hadoop ecosystem (HDFS, MapReduce, YARN, Hive, Pig, Impala, Spark, Kafka)
-2+ years of SQL
-Machine learning engineering
-Programming in Scala and Python
-ETL tools like Ab Initio
-HTTP and invoking web APIs
-NLP and text processing
-Work with distributed teams
About NTT DATA Services
NTT DATA Services is a global business and IT services provider specializing in digital, cloud and automation across a comprehensive portfolio of consulting, applications, infrastructure and business process services. We are part of the NTT family of companies, a partner to 85% of the Fortune 100.
NTT DATA Services is an equal opportunity employer and will consider all qualified applicants for employment without regard to race, gender, disability, age, veteran status, sexual orientation, gender identity, or any other class protected by law.