Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Info

This content is now maintained on the CDAP doc wiki, here. Please add any comments to the CDAP wiki.

This page describes how to parse JSONs in wranglerWrangler. If the JSON has arrays, this will be a short tutorial to explain how the arrays can be exploded/flattened into columns for further processing /and cleanup.

Parsing JSON

Steps to follow,:

  1. Go to CDF instance.

  2. Navigate to wrangler Wrangler from the sidebar (appears on by clicking the hamburger icon in the navbar).

  3. From the a wrangler Wrangler source (GCS or BigQuery), read a JSON file to wrangleWrangle.

  4. Once navigated to wrangler Wrangler tab, open the dropdown on the column and choose Parse → JSON.

  5. After Step 4, the fields in the JSON will form the column. Identify the column that has rows.

  6. Open the dropdown drop-down from that specific column and select Explode → Array (by flattening).

  7. This will explode the elements in the JSON array as individual data in the same column in wranglerWrangler.

After step 7 further directives can be applied based on cleanup needed.

...

Page Properties
hiddentrue

Related issues