Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Goals


Checklist

  •  User stories documented (Albert/Vinisha)
  •  User stories reviewed (Nitin)
  •  Design documented (Albert/Vinisha)
  •  Design reviewed (Terence/Andreas)
  •  Feature merged ()
  •  Examples and guides ()
  •  Integration tests () 
  •  Documentation for feature ()
  •  Blog post

...

Use Cases

  1.  

  2.  

User Stories

  1. As a pipeline developer, I want to be able to join (inner, left outer, right outer, full outer) two or more stage outputs on some common fields, or do a cross join.
  2. As a pipeline developer, I want to be able to get metrics on number of records in and records out of the join.
  3. [UI] As a pipeline developer, I want to be able to see the schema of all input into the join, and the schema output by the join.
  4. As a pipeline developer, I want to be able to choose whether the pipeline with the join runs with mapreduce or spark.
  5. As a plugin developer, I want to be able to write a plugin that gets data from multiple stages joins them.

Design