Homework 5: (40 pts)

On the VM instances where you installed Hadoop in HW3, do the following:

  1. Download, install, and run a stable release of Apache HBase: https://hbase.apache.org/downloads.html
  2. Expand your MapReduce program developed in HW4 to i) store all the missing Poker cards in HBase and ii) count the number of records in the generated HBase table.
  3. Run the expanded MapReduce program in your Hadoop-HBase system in the fully distributed mode.
Submission requirements: A zipped file that contains

  1. A text file that contains a random number (less than 52) of different Poker cards (each card is uniquely represented by both its rank and suit)
  2. A text file that contains all the missing Poker cards identified by your expanded MapReduce program
  3. A Screenshot showing that the Hadoop-HBase system is running properly
  4. A Screenshot showing that new records of all the missing Poker cards have been stored in HBase
  5. The modified Java programs of your MapReduce solution integrated with HBase