Introduction
Spark plugins that trains and classify data based on Multinomial/Binary Logistic Regression.
Use-case
Following are the use-cases that the plugin should support:
- User should be able to train the data.
- User should be able to classify the test data using the model build while training the data.
- User should be able to provide the list of columns(features) to use for training.
- User should be able to provide the list of columns(features) to classify.
- User should be able to provide the column to be used as prediction field while training/classification.
- User should be able to provide the number of features to be used while training/classification.
- User should be able to provide the number of classes to be used while training/classification.
- User should be able to provide the file set name to save the training model.
- User should be able to provide the path of the file set.
User Stories
- User should be able to train the data.
- User should be able to classify the test data using the model build while training the data.
Example
Implementation Tips
Design
Table of Contents
Checklist
- User stories documented
- User stories reviewed
- Design documented
- Design reviewed
- Feature merged
- Examples and guides
- Integration tests
- Documentation for feature
- Short video demonstrating the feature