Big Data Architect

Data Architect, Scala, Python, Java, SQL, Hadoop Ecosystem, Cloudera
Full Time, Contract Corp-To-Corp, Contract Independent, Contract W2, C2H Corp-To-Corp, C2H Independent, C2H W2, Part Time, Long Term
Market
Telecommuting not available
Travel not required

Job Description

Data Architect
Princeton/NYC
Long Term


Requirements
Our ideal candidate has a background in distributed applications, data warehousing, and databases.
The candidate is highly proficient in the Hadoop ecosystem and the Spark data stack.
The candidate has strong programming skills in Scala, Java, Python, and SQL, and has expertise in statistical algorithms for data analysis.

Project:
Deliver a Data Service Layer that acquires data from multiple financial data sources and manages data sets for various business applications.
The data lake provides high data quality, rich metadata, and on-demand transformations to build data feeds for reporting and risk computations.

Responsibilities
•Responsible for Big Data solution architecture, design, and development, and for delivering production-grade solutions
•Hands-on expertise with various big data technologies and the ability to lead an agile delivery team
•Ability to measure the performance of data solutions, diagnose bottlenecks, and use tools to monitor and tune performance
•Deploy flexible, scalable, and resilient data solutions to meet evolving client data product requirements


Minimum Requirements
•BS/MS degree in Computer Science, Engineering, Applied Mathematics, or a related field, or equivalent experience
•7+ years of hands-on programming expertise in Scala, Java, SQL, and Python
•3+ years of experience with large datasets in the Hadoop (Cloudera) and Spark ecosystems
•Hands-on experience with Hadoop data storage, data stores (HBase, Cassandra), and tools (Oozie, Sqoop, Flume, etc.)
•Well versed in Cloudera (CDH 5.x) to manage security, metadata, lineage, job management, Optimizer, Record Service, etc.
•Expertise in Kafka (distributed logs) and Spark Streaming architecture and development
•Experience in the design and development of SQL-on-Hadoop applications (Spark SQL, Impala) and query optimization
•Ability to troubleshoot, tune, and accelerate data pipelines, data queries, and real-time streaming events
•Passionate, self-motivated, and willing to learn

Nice to Have
•Expertise in leading cloud technologies such as Amazon Web Services
•Certification in Hadoop and Spark a plus

Posted By

Tanvir Ahmed

Contact
Dice Id : 10113363
Position Id : 448570