In this article we analyze the NFL play-by-play dataset. The data covers every play in all games from 2002 through 2013. At roughly 600k rows, it hardly qualifies as big data; the main point of this article is to illustrate the use of Cloudera Impala for big data analysis. We also compare its performance against Hive. Check the complete analysis here.
