Example Projects¶
Compute Vector Statistics¶
Compute statistics for different vector groups. The first job is a basic implementation, the second uses the “In-mapper” combine design pattern to minimize the amount of data sent to the reducer(s) during the shuffle phase.
Below is the driver and test script for the above jobs.