Solution Architect (Hadoop/ Cloudera)

Architect, hadoop, Cloudera, Flume-Kafka, hive, Big data
Full Time, Contract W2, C2H Corp-To-Corp, C2H Independent, C2H W2, 12+ months
Telecommuting not available Travel not required

Job Description

Solution Architect (Team Lead)

Primary location: Herndon, VA

but client can cosider candidates for their another offices located at Pontiac, Plano, and El Paso


Responsible for the overall Architecture Design of the Cloudera-based Data Lake.  The Architect leverages their knowledge of Cloudera, NoSQL, BI/ Analytics, App Dev and Network to support the DSF platform for FRB customers to enable ingestion, transformation, storage and reporting for analytic and big data projects within the Federal Reserve. 



-          Assist in the support and design of National IT’s DSF service-based big data platform solution that accounts for a variety of National IT users encompassing various  data types that empower FRB staff to take advantage of the big data platform investment.

-          Provide a solution design that both meets the business requirements but also the best technical practices of standing up a Cloudera-based data Lake.

-          Working with the Hadoop Admin, prescribe data ingestion standards that will implement solutions similar to Flume-Kafka integration.

-          Responsible for creating all processing data tenants which include utilization of Spark, Hive, SQL and other Cloudera based frameworks.

-          Work in concert with the Security Administrator to outline data security utilizing core Cloudera platform, such as  Apache Sentry to securely manage authentication by verifying National IT  user credentials.

-          Deploy authorization scheme that limits the user’s access to a given resource as defined by National IT business requirements.

-          Define roles that depict a user of the Big Data Service i.e. a National IT individual identified by underlying National IT official authentication system.

-          Define the user access template that combine multiple access rules that utilize the National IT Active Directory access model combined with the Cloudera security model.

-          Provide expert data model consultation on decisions on NoSQL vs Relational Database vs in- memory data storage.

Dice Id : atitx
Position Id : 011974
Have a Job? Post it