Hadoop Session

From Gridkaschool
Revision as of 20:13, 24 August 2013 by Kamir1604 (talk | contribs)
Jump to navigationJump to search
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.

Hadoop: Quickstart

Requirements:

You need a notebook with VMWare Player or VMWare Fusion (if you work on a Mac). Just in case, this works not for you, we will have a local Hadoop cluster. The Development Environment is based on Oracle JDK (Version 1.6) and Eclipse. We work with Git and Maven.

Please download the following two VM images:


Quickstart VM http://www.cloudera.com/content/support/en/downloads/download-components/download-products.html?productID=F6mO278Rvo


Objectives

1. Build your own Hadoop cluster in the Amazon Cloud.

2. Collect Twitter data with Flume.

3. Create, deploy and execute a MapReduce Program for the Java-API (MRv1).

4. Learn how to work with your data using the Hadoop-GUI HUE.



Ressources

... more content comming soon!