Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 3 Next »

Goals

To allow users to use the Hydrator drag and drop UI to easily create pipelines that run on Spark Streaming, leveraging built-in capabilities like windowing and machine learning.

Checklist

  • User stories documented (Albert)
  • User stories reviewed (Nitin)
  • Design documented (Albert)
  • Design reviewed (Terence/Andreas)
  • Feature merged ()
  • Examples and guides ()
  • Integration tests () 
  • Documentation for feature ()
  • Blog post

Use Cases

  1.  

  2.  

User Stories

  1. As a pipeline developer, I want to be able to join (inner, left outer, right outer, full outer) two or more stage outputs on some common fields, or do a cross join.
  2. As a pipeline developer, I want to be able to get metrics on number of records in and records out of the join.
  3. [UI] As a pipeline developer, I want to be able to see the schema of all input into the join, and the schema output by the join.
  4. As a pipeline developer, I want to be able to choose whether the pipeline with the join runs with mapreduce or spark.
  5. As a plugin developer, I want to be able to write a plugin that gets data from multiple stages joins them.

Design

 

  • No labels