Hadoop Session

From Gridkaschool
Revision as of 09:05, 25 August 2013 by Kamir1604

Hadoop: Quickstart

Requirements:

You need a notebook with VMware Player, or VMware Fusion if you work on a Mac. In case this does not work for you, a local Hadoop cluster will be available. The development environment is based on Oracle JDK (version 1.6) and Eclipse. We work with Git and Maven.

Please download the following two VM images:


Quickstart VM

http://www.cloudera.com/content/support/en/downloads/download-components/download-products.html?productID=F6mO278Rvo

Cloud-Connector VM


http://training.cloudera.com/cloudera/VMs/Cloudera-Training-Get2EC2-VM-1.0-vmware.zip



Objectives

1. Build your own Hadoop cluster in the Amazon Cloud.

2. Collect Twitter data with Flume.

3. Create, deploy and execute a MapReduce program using the Java API (MRv1).

4. Learn how to work with your data using Hue, the Hadoop GUI.



Resources

... more content coming soon!