Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • (tick) Send supported HBase version by CDAP
  • Gather information about CDH version compatibility changes – Talk to Cloudera and compile 

Infrastructure components used by Cask Data Application Platform (CDAP)

Following are the underlying infrastructure components used by CDAP and/or CDAP Applications running in CDAP.  The components presented below are in no priority order. 

...


Functional use of infrastructure components

This section provides information about how and for what the components underneath are used. 
HDFS
  • CDAP Stream
  • Apache Tephra WAL
  • Deployed Application Artifact and Dataset Artifact
  • Aggregated Logs
  • CDAP Fileset Dataset
  • YARN distributed cache 
  • Coprocessor jars 
HBase
  • CDAP System data/metadata (ex: Preferences, Application, Namespace, Artifact…)
  • Metrics Cube
  • Lineage
  • Workflow Statistics
  • Run Record and Statistics
  • Checkpoint information
  • CDAP Table Dataset
Kafka
  • Logs
  • Metrics
  • Audit Logs (Will be moved to HBase in 4.0)
  • Metadata updates (Will be moved to HBase in 4.0)
  • Notifications (Will be moved to HBase in 4.x)
YARN
  • System Services
  • User applications
Zookeeper
  • Routing Tables
  • Coordination
  • Secret keys 
    • Auth keys
Hive
  • Dataset integration 
    • Schema
    • Properties
    • Serde
KMS
  • User Secrets (Ex: Password, access tokens etc..) 

Failure Scenarios

  • HDFS
    • Upgrade
    • Downgrade
    • Restart
    • Data Node Outage
  • HBase
    • Upgrade
    • Downgrade
    • Restart
    • Region Server Outage
  • Zookeeper
    • Upgrade
    • Downgrade
    • Network Partition 
  • YARN
    • Upgrade
    • Downgrade
    • Node Manager Outage
    • RM Outage
  • Kafka
    • Upgrade
    • Downgrade
    • Disk Outage
  • KMS
    • Upgrade 
    • Downgrade 
    • Outage

...