Data Transformation Lead

USD55 - USD65 per hour

Full Time


    • SQL
    • Unix Scripting
    • data flow
    • Big Data

    Job Description

    *W2 only position* This is not open to C2C and No Visa sponsorship is available at this time.

    Data Transformation Lead

    The individual will be part of DPS team. As a Data Transformation Lead. He/she should efficiently handle requirement discussions with clients and responsible for creating detailed technical specifications, developing application and system code and participating in code reviews. Should have good Data background and large data migration program experience.
    Analysing raw data, drawing conclusions & developing recommendations
    Creating business requirements along with traceability tracking
    Candidate will be responsible for the design and development of technical solutions utilizing the big data platform.
    Candidate should exhibit strong understanding of complex business problems to ensure projects are leveraging the appropriate technology and the technical design enables the delivery of a comprehensive solution.
    Responsibilities will include leading discussions with stake partners on their data transformation requirements defining technical requirements, loading data into HDFS, managing HDFS framework, managing Hive databases, data extraction, data transformation, automating jobs, productionalizing jobs, and exploring new big data technologies within a Massively Parallel Processing environment.
    Closely working with information security and audit teams to enable application/process certification.
    Good Hands on knowledge on UNIX scripting.
    Insurance or Banking background preferred.

    Mandatory Skills
    Education : Graduate/Post graduate
    6-8 years of IT experience
    Architecting, developing, implementing and maintaining Big Data solutions using Cloudera Hadoop - HDFS, Sqoop, Map reduce, Spark and Hive
    Extensive Unix scripting
    Extensive SQL
    Data Integration

    Desirable skills
    Good Exposure to Core Java , J2EE Technologies
    ETL tools
    Scheduling tools (Autosys)
    Knowledge of Mainframe data handling (VSAM/IMS/DB2)
    Data Mining/Data Warehousing/Business Intelligence
    MPP systems