Bachelor*s degree from an accredited college in a related discipline or equivalent experience with 11 years of total professional IT experience
The candidate will build algorithms to meet business requirements and use statistical, mathematical, and predictive modelling skills to build a comprehensive platform for data
analysis; use Business Intelligence tools to create analytical models; integrate data sets and work with multiple systems to extract and use data for deeper analysis; have the ability
to use the Hadoop environment with Hive and MapReduce skills; be able to build programs in programming languages and scripts; have an understanding of and use skills for
Natural Language Processing, Machine Learning, Statistical Analysis, Predictive Modelling, and Hypothesis Testing.
*Experience with statistical analysis.
*Working knowledge of scripting languages.
*Experience working in a Big Data environment.
*Proficiency at transforming data, data classification and translations, as well as resolving data quality and data cleansing
*Intermediate design and use of relational databases including experience working with working knowledge of dimensional modeling, star schemas and working with time-series
data *Experience working with Hadoop (Cloudera, Hortonworks, etc.) and working with MapReduce development, using Pig and Hive
*Experience with NoSQL databases such as Cassandra, Hbase, CouchDB
*Advanced skills/expertise in data mining, text mining or distributed computing
*Experience and proficiency in utilizing statistical/analytic packages such as SAS, R, SPSS, S-Plus, Matlab to develop statistical models
*Proficiency and advanced ability to leverage scripting languages: Python, Perl, Ruby
*Software development skills in Java including working with Mahout, including integrating with search engines/libraries such as Lucene and/or Solr
*Excellent research, analytical, written and oral communications skills
1651 Old Meadow Rd, Suite 205 McLean, VA, 22102