Introduction
This wiki outlines how we plan to orchestrate the execution of CDAP programs on top of Kubernetes.

...

  1. Docker Registry - a stateless server-side application used for storing and distributing Docker images.
  2. Docker Hub - might be too heavyweight and reliant on external services for our use case.
  3. Quay (from CoreOS) - not free or open source, so not high on the list.

 

Miscellaneous

 

TODO:

  • Gather some numbers (for example, how long it takes) around building a Docker image.
  • How can Kubernetes be the runtime under the Twill API, instead of YARN? What are the issues with this integration? What in the Twill API can't be supported?
  • Is there a programmatic (or at least RESTful) API that covers what the Kubernetes command line does?
  • How can the CDAP master talk to the Kubernetes master to get program status (or perform any of the other Kubernetes interactions)?
  • How long does it take to start a CDAP program from a Docker image, with and without a base image that already contains as much of the common content as possible?
  • How can we leverage functionality in Kubernetes to avoid a dependency on ZooKeeper? Or should we just use etcd regardless of whether we're using Kubernetes or not?
  • Do we need provisioner hooks? For instance, to kick off an instance of Docker Registry after provisioning a Kubernetes cluster?
  • Research the relative difficulty of use of YARN vs Kubernetes, and of ZooKeeper vs etcd.
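
As a possible starting point for the API and program-status questions above: the Kubernetes API server exposes a REST API, and a single pod can be read at /api/v1/namespaces/{namespace}/pods/{name}, with its lifecycle state reported in status.phase. The sketch below only builds that URL and interprets a response body. The server address, the pod name, and the mapping to program status names are hypothetical and used for illustration only; they are not CDAP's actual status values, and this is not a proposed design.

```python
import json
from urllib.parse import quote

# Pod phases reported by the Kubernetes API in status.phase.
POD_PHASES = {"Pending", "Running", "Succeeded", "Failed", "Unknown"}

# Hypothetical mapping from pod phase to a program status name.
# These status names are illustrative assumptions, not CDAP's real enum.
PHASE_TO_PROGRAM_STATUS = {
    "Pending": "STARTING",
    "Running": "RUNNING",
    "Succeeded": "COMPLETED",
    "Failed": "FAILED",
    "Unknown": "UNKNOWN",
}


def pod_status_url(api_server: str, namespace: str, pod_name: str) -> str:
    """Build the core v1 REST URL for reading a single pod."""
    base = api_server.rstrip("/")
    ns = quote(namespace, safe="")
    name = quote(pod_name, safe="")
    return f"{base}/api/v1/namespaces/{ns}/pods/{name}"


def pod_phase(pod_json: str) -> str:
    """Extract status.phase from a pod object; fall back to 'Unknown'."""
    pod = json.loads(pod_json)
    phase = pod.get("status", {}).get("phase", "Unknown")
    return phase if phase in POD_PHASES else "Unknown"


def program_status(pod_json: str) -> str:
    """Translate a pod object into the hypothetical program status name."""
    return PHASE_TO_PROGRAM_STATUS[pod_phase(pod_json)]


if __name__ == "__main__":
    # Hypothetical server address and pod name.
    print(pod_status_url("https://k8s.example:6443", "default", "cdap-program-1"))
    print(program_status('{"status": {"phase": "Running"}}'))
```

An actual request against this URL would also need authentication (for example a bearer token) and the cluster's CA certificate, which are left out here.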