Hadoop Session
From Gridkaschool
Hadoop: Quickstart
Requirements:
You need a notebook with VMWare Player or VMWare Fusion (if you work on a Mac). Just in case, this works not for you, we will have a local Hadoop cluster. The Development Environment is based on Oracle JDK (Version 1.6) and Eclipse. We work with Git and Maven.
Please download the following two VM images:
Quickstart VM
Cloud-Connector VM
http://training.cloudera.com/cloudera/VMs/Cloudera-Training-Get2EC2-VM-1.0-vmware.zip
Objectives
1. Build your own Hadoop cluster in the Amazon Cloud.
2. Collect Twitter data with Flume.
3. Create, deploy and execute a MapReduce Program for the Java-API (MRv1).
4. Learn how to work with your data using the Hadoop-GUI HUE.
Ressources
... more content comming soon!