Cookbook

This section is a collection of tips/tricks that are useful when working with Hadoopy.

Enabling Verbose Output

Map/Reduce Output Compression

Tweakable Jobconfs

Status/Counters in Jobs

Accessing Jobconfs Inside Jobs

Using Writetb to Write Multiple Parts

Reverse SSH Tunnel

Timing Sections of Code

Skipping Jobs in a Large Workflow

Randomly Sampling Key/Value Pairs