This page describes how to grant Cloud Data Fusion the service account user permission. When Cloud Data Fusion provisions a Dataproc cluster, you can specify a user-managed service account, which the virtual machines in that cluster then use. If you don't specify a service account, the default Google-managed Compute Engine service account is used. Whichever service account the Dataproc cluster uses, Cloud Data Fusion must have permission to use it.
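The relationship described above can be sketched in Python as a minimal illustration. The email naming patterns for the default Compute Engine service account and the Cloud Data Fusion service agent, the role ID `roles/iam.serviceAccountUser`, and all function names below are assumptions that do not appear on this page. Verify them against your own project before relying on them. The sketch only builds the binding as a dictionary and does not call any Google Cloud API.

```python
from typing import Optional

# Assumed role ID for the "service account user" permission.
SERVICE_ACCOUNT_USER_ROLE = "roles/iam.serviceAccountUser"


def data_fusion_service_agent(project_number: str) -> str:
    """Return the Cloud Data Fusion service agent email (assumed naming pattern)."""
    return f"service-{project_number}@gcp-sa-datafusion.iam.gserviceaccount.com"


def dataproc_service_account(project_number: str,
                             user_managed: Optional[str] = None) -> str:
    """Return the service account the Dataproc cluster VMs would use.

    If no user-managed account is given, fall back to the default
    Compute Engine service account (assumed naming pattern).
    """
    if user_managed:
        return user_managed
    return f"{project_number}-compute@developer.gserviceaccount.com"


def service_account_user_binding(project_number: str) -> dict:
    """Build the IAM binding that lets Cloud Data Fusion use the account.

    The binding belongs on the IAM policy of the service account
    returned by dataproc_service_account().
    """
    member = f"serviceAccount:{data_fusion_service_agent(project_number)}"
    return {"role": SERVICE_ACCOUNT_USER_ROLE, "members": [member]}
```

For example, `service_account_user_binding("123456789")` yields a binding whose single member is the Cloud Data Fusion service agent for project number 123456789. That binding is needed whether the cluster uses a user-managed account or the default one.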

Info

Without the service account user permission, Cloud Data Fusion cannot provision a Dataproc cluster, and running a data pipeline fails with the following error:

PROVISION task failed in REQUESTING_CREATE state for program run [pipeline-name] due to Dataproc operation failure: INVALID_ARGUMENT: User not authorized to act as service account '[service-account-name]'
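When this error shows up in pipeline logs, it names the service account that is missing the permission. As a small sketch, the account can be pulled out of the message with a regular expression. The function name is hypothetical, and the pattern assumes the message wording shown above stays the same.

```python
import re
from typing import Optional

# Matches the quoted account name in the "not authorized" error text.
_NOT_AUTHORIZED = re.compile(
    r"User not authorized to act as service account '([^']+)'"
)


def unauthorized_service_account(message: str) -> Optional[str]:
    """Return the service account named in the error, or None if absent."""
    match = _NOT_AUTHORIZED.search(message)
    return match.group(1) if match else None
```

The returned value is the account on which Cloud Data Fusion needs the service account user permission.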

Before you begin

If you want to provide a user-managed service account in the compute profile in Cloud Data Fusion, read the Dataproc service accounts documentation first.

...