Hadoop (Cloudera) Training

Cloudera's Distribution Including Apache Hadoop (CDH) adalah pendistribusian Hadoop pertama dalam area komersial maupun non-komersial.

Deliverable CDH :

  • Lengkap, paket yang disediakan untuk semua komponen yang dibutuhkan untuk digunakan ketika menggunakan Apache Hadoop.
  • 100% murni Apache Hadoop, kuat untuk lingkungan produksi
  • The Cloudera Ready Certification Program

Komponen CDH adalah sebagai berikut :

  • Apache Hadoop - reliable, scalable distributed storage and computing
  • Apache Hive - SQL-like language and metadata repository
  • Apache Pig - High-level language for expressing data analysis programs
  • Apache HBase - Hadoop database for random, real-time read/write access
  • Apache Zookeeper - Highly-reliable distributed coordination service
  • Apache Whirr - Library for running Hadoop in the cloud
  • Apache Flume - Distributed service for collecting and aggregating log and event data
  • Apache Sqoop - Integrating Hadoop with RDBMS
  • Hue - Browser-based desktop interface for interacting with Hadoop
  • Oozie - Server-based workflow engine for Hadoop activities

CDHv3 Distribution Details :  

Supported Operation System :

Red Hat RHEL 5, RHEL 6
Centos CentOS 5
Ubuntu Lucid, Maverick

Supported Build Infrastructur and Cloud Platform :

Build Infrastructure Apache Maven
Cloud Platform Rackspace, Amazon EC2, Softlayer

Component Version :

Apache Hadoop v0.20.2 + 923
Apache Hive v0.7.0 +27
Apache Pig  v0.8.0 +20
Apache HBase v0.90.1 +15
Apache Zookeeper v3.3.2 +12
Apache Whirr v0.3.0 +5
Apache Flume v0.9.3 +15
Apache Sqoop v1.2 + 24
Hue v1.2.0 +54
Oozie v2.3.0 +31