Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Use Cases:

  • Validator Filter: All records of a transform that are invalid go into one dataset; the remainder go into another.

...

Existing APIs (in MapReduceContext); sets a single Dataset as the output for the MapReduce job:

context.setOutput(String datasetName);

...

Additional APIs (in MapReduceContext); adds a Dataset to the set of output Datasets for the MapReduce job:

context.addOutput(String datasetName);
context.addOutput(String datasetName, Dataset dataset);

 

Example Usage:

public void beforeSubmit(MapReduceContext context) throws Exception {

  

context.setOutput();

}



Approach:
Take an approach similar to org.apache.hadoop.mapreduce.lib.output.MultipleOutputs.

...