Difference between revisions of "Hadoop Session"

From Gridkaschool
Revision as of 12:59, 25 August 2013

Hadoop: Quickstart

Requirements:

You need a notebook with VMware Player, or VMware Fusion if you work on a Mac. In case this does not work for you, we will have a local Hadoop cluster available. The development environment is based on Oracle JDK (version 1.6), Eclipse, Git, and Maven.


Please download the following two VM images:

Quickstart VM http://www.cloudera.com/content/support/en/downloads/download-components/download-products.html?productID=F6mO278Rvo

Cloud-Connector VM http://training.cloudera.com/cloudera/VMs/Cloudera-Training-Get2EC2-VM-1.0-vmware.zip

Do you have a connection to the internet from inside this VM? It should work out of the box, but sometimes it does not ;-(

So please prepare your VM image on your computer before the session starts. All things related to the cloud setup will be prepared for you.

If you have any trouble, please send your feedback or help request to: mirko@cloudera.com. Thanks, and see you soon.

Objectives

1. Plan and build your own Hadoop cluster in the Amazon Cloud. (1.5 to 2.0 hours)

2. Collect Twitter data with Flume. (0.5 to 1.0 hour)

3. Create, deploy and execute a MapReduce program using the Java API (MRv1) and run benchmarks. (up to 1.0 hour)

4. Learn how to work with your data using Hue, the Hadoop GUI. (0.5 to 1.0 hour)
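
As a warm-up for objective 3, here is a minimal sketch of the MapReduce model in plain Java. It is an illustration only: it does not use the Hadoop Java API, it runs in a single JVM, and the class and method names (WordCountSketch, map, shuffle, reduce, wordCount) as well as the sample input are made up for this example. It only shows the three steps a word count job goes through: map each line to (word, 1) pairs, group the pairs by word, and sum the values per word.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical, Hadoop-free illustration of the map / shuffle / reduce steps.
class WordCountSketch {

    // Map step: emit one (word, 1) pair for every whitespace-separated token.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String token : line.toLowerCase().split("\\s+")) {
            if (!token.isEmpty()) {
                Map.Entry<String, Integer> pair = new AbstractMap.SimpleEntry<>(token, 1);
                pairs.add(pair);
            }
        }
        return pairs;
    }

    // Shuffle step: group all emitted values by their key (sorted by key).
    static Map<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            List<Integer> values = grouped.get(pair.getKey());
            if (values == null) {
                values = new ArrayList<>();
                grouped.put(pair.getKey(), values);
            }
            values.add(pair.getValue());
        }
        return grouped;
    }

    // Reduce step: sum all values that were collected for one key.
    static int reduce(String key, List<Integer> values) {
        int sum = 0;
        for (int value : values) {
            sum += value;
        }
        return sum;
    }

    // Runs the three steps in sequence over all input lines.
    static Map<String, Integer> wordCount(List<String> lines) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String line : lines) {
            pairs.addAll(map(line));
        }
        Map<String, Integer> counts = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> group : shuffle(pairs).entrySet()) {
            counts.put(group.getKey(), reduce(group.getKey(), group.getValue()));
        }
        return counts;
    }

    public static void main(String[] args) {
        // Made-up sample input, standing in for lines of collected text.
        List<String> lines = Arrays.asList("hadoop flume hadoop", "hue hadoop");
        System.out.println(wordCount(lines)); // prints {flume=1, hadoop=3, hue=1}
    }
}
```

In a real MRv1 job the map and reduce steps live in separate classes and the framework performs the grouping across the cluster; the sketch keeps everything in one class so that it can be run without a Hadoop installation.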

Resources

... more content coming soon!