Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Current »

Overview

This document provides best practises for configuring output directory for file based sink plugins (S3, GCS).

General Tips

  • Ensure the output paths are unique if there are multiple file based sinks in the same pipeline

    • Having same output path in a pipeline (ex: Two error collectors having same paths) will result in a pipeline failure with an error: “User class threw exception: org.apache.hadoop.mapredue.FileAlreadyExistsException: Output directory xxxxxx already exists.

  • No labels