Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • User specifies how they would like to handle errors during ingesting, depending on option chosen, the errors in processing are handled.
  • User should be able to specify account credentials in configuration.

User Configurations

SectionUser Configuration LabelLabel DescriptionMandatoryMacro-enabledOptionsDefaultVariableUser Widget
StandardReference NameThis will be used to uniquely identify this source for lineage, annotating metadata, etc++

referenceNameText Box

TableDatabase table name++

tableText Box

Column Family

Column family to use for all inserted rows.

++

columnFamilyText Box

Instance IDBigtable instance ID++

instanceText Box

Project IDThe ID of the project in Google Cloud
If not specified, will be automatically read from the cluster environment

+

projectText Box

Service Account File Path

Path on the local file system of the service account key used for
authorization.

If the plugin is run on a Google Cloud Dataproc cluster, the service account key does not need to be provided and can be set to 'auto-detect'.
Credentials will be automatically read from the cluster environment.

When running on other clusters, the file must be present on every node in the cluster.

See Google's documentation on Service account credentials for details.


+

serviceFilePathText Box

Key Alias

Name of the field for row key.


+
__key__keyAliasText Box
Error HandlingOn Record Error

How to handle error in record processing. Error will be thrown if failed to serialize value according to provided input schema.

+
  • Skip error
  • Fail pipeline
Skip erroron-errorRadio Button (layout: block)

Bigtable Overview

Storage model

...

  • Task will be split using org.apache.hadoop.hbase.mapreduce.TableOutputFormat.
  • Values will be converted into bytes using input schema.
  • Supported input field types: boolean, int, long, float, double, bytes, string.
  • All information about logical types will be lost when inserted because of schema-less DB nature.

Reference

https://cloud.google.com/bigtable/docs/