Hadoop Session

From Gridkaschool
Revision as of 20:09, 24 August 2013 by Kamir1604 (talk | contribs)

Hadoop: Quickstart

Requirements:

You need a notebook with VMWare Player or VMWare Fusion (if you work on a Mac). Just in case, this works not for you, we will have a local Hadoop cluster. The Development Environment is based on Oracle JDK (Version 1.6) and Eclipse. We work with Git and Maven.

Please download the following two VM images:



Objectives

1. Build your own Hadoop cluster in the Amazon Cloud.

2. Collect Twitter data with Flume.

3. Create, deploy and execute a MapReduce Program for the Java-API (MRv1).

4. Learn how to work with your data using the Hadoop-GUI HUE.



Ressources

... more content comming soon!