Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

This document shows you how to use the Google Cloud Speech-to-Text transform plugin to convert audio files into text files.

Before you begin

  1. Ensure that you have enabled the Speech-to-Text API.

  2. In this guide, you build a pipeline that reads data from Google Cloud Storage. Upload the following speech file (hello.wav) to a Cloud Storage bucket.

View file
namehello.wav

Instructions

  1. In the left navigation panel, navigate to the Pipeline Studio.

  2. In the left panel, under the Source section,select a source.In this example, Google Cloud Storage.

  3. Under the Transform section, select Google Cloud Speech-to-Text.

  4. Under the Sink section, select a sink. In this example, Google Cloud Storage.

  5. On the canvas, connect the three items.

  6. Hover over the source, Google Cloud Storage. Click on the Properties button that appears.

  7. In the Google Cloud Storage Properties window, set Path to your Cloud Storage bucket path, and make sure the Format is “blob”.

  8. Click the X button at the top right to save your changes.

  9. Hover over the transform, Google Cloud Speech-to-Text. Click on the Properties button that appears.

  10. Specify the Sampling Rate, Parts, and Text fields.

  11. Click Get Schema, and then Apply, to automatically apply the output schema.

  12. Click the X button at the top right to save your changes.

  13. Hover over the sink, Google Cloud Storage. Click on the Properties button that appears. Set the Path to an output bucket.

  14. Name your pipeline and click Deploy.

  15. Click Run. It takes a few minutes for the pipeline to run.

  16. Once the pipeline succeeds, you can view your transcribed text data in Cloud Storage or whichever sink you chose. 

Filter by label (Content by label)
showLabelsfalse
max5
spacescom.atlassian.confluence.content.render.xhtml.model.resource.identifiers.SpaceResourceIdentifier@957
showSpacefalse
sortmodified
typepage
reversetrue
labelskb-how-to-article
cqllabel = "kb-how-to-article" and type = "page" and space = "KB"

...