Introduction
Google provides BigQuery for querying massive datasets. It runs fast SQL queries against append-only tables using the processing power of Google's infrastructure. Users can move their data into BigQuery and let it handle the hard work.
CDAP now provides an interface for users to work with their datasets in BigQuery.
Use-Case
1. Users can query their datasets stored in Google BigQuery.
2. Users can specify a time limit for a query.
3. Users can specify a size limit for the dataset being queried.
4. Users can poll for the result.
5. Users can list the query result history for a given period of time.
6. The schema is pulled automatically from the table.
7. Users can pull the field names from the query.
Design
Query:
Inputs | Type | Required | Default |
---|---|---|---|
ProjectId | String | Yes | |
Credential | String | Yes | |
Query | String | Yes | |
Limit Time | Integer (min) | No | 10 |
Limit Size | Integer (GB) | No | 50 |
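The Query inputs above could be captured in a small validated structure. The following is only a sketch, not the CDAP implementation: the class name `QueryInputs`, its field names, and the two derived properties are hypothetical, and treating 1 GB as 2^30 bytes is an assumption. How the limits are ultimately passed to BigQuery is not specified here.

```python
from dataclasses import dataclass

# Assumption: "GB" in the table means 2**30 bytes.
BYTES_PER_GB = 1024 ** 3


@dataclass(frozen=True)
class QueryInputs:
    """Hypothetical container for the Query inputs listed in the table."""
    project_id: str
    credential: str
    query: str
    limit_time_min: int = 10  # default taken from the table
    limit_size_gb: int = 50   # default taken from the table

    def __post_init__(self):
        # The three String inputs are marked as required.
        for name in ("project_id", "credential", "query"):
            value = getattr(self, name)
            if not isinstance(value, str) or not value.strip():
                raise ValueError(f"{name} is required")
        # The optional limits must be positive when supplied.
        for name in ("limit_time_min", "limit_size_gb"):
            if getattr(self, name) <= 0:
                raise ValueError(f"{name} must be a positive integer")

    @property
    def timeout_ms(self) -> int:
        """Time limit converted from minutes to milliseconds."""
        return self.limit_time_min * 60 * 1000

    @property
    def max_bytes(self) -> int:
        """Size limit converted from GB to bytes."""
        return self.limit_size_gb * BYTES_PER_GB
```

With only the required inputs supplied, for example `QueryInputs("my-project", "my-credential", "SELECT 1")`, the defaults from the table apply.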
Poll Results:
Using jobId:
Inputs | Type | Required |
---|---|---|
ProjectId | String | Yes |
JobId | String | Yes |
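Polling by job ID might look roughly like the loop below. This is a minimal sketch under stated assumptions: `poll_job` and the injected `fetch_result` callable are hypothetical names, and `fetch_result` stands in for whatever call actually retrieves results from BigQuery. It is assumed to return `None` while the job is still running and a list of rows once it has finished.

```python
import time
from typing import Callable, Optional


def poll_job(project_id: str,
             job_id: str,
             fetch_result: Callable[[str, str], Optional[list]],
             timeout_s: float = 600.0,
             interval_s: float = 1.0,
             sleep: Callable[[float], None] = time.sleep) -> list:
    """Poll a job until fetch_result returns rows or the timeout expires."""
    # Both inputs are marked as required in the table.
    if not project_id or not job_id:
        raise ValueError("project_id and job_id are both required")

    deadline = time.monotonic() + timeout_s
    while True:
        rows = fetch_result(project_id, job_id)
        if rows is not None:
            return rows  # the job has finished
        if time.monotonic() >= deadline:
            raise TimeoutError(f"job {job_id} did not finish in {timeout_s} s")
        sleep(interval_s)  # wait before asking again
```

Passing the fetch and sleep functions in as parameters keeps the loop independent of any particular client library and makes it easy to exercise with a fake.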
Latest results:
Inputs | Type | Required |
---|---|---|
ProjectId | String | Yes |
Poll Number | Integer | Yes |
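For the latest-results mode, one possible reading is that the numeric input is the count of most recent results to return for the project. That interpretation is an assumption, as are the `JobRecord` type, its fields, and the in-memory `history` list used in this sketch.

```python
from typing import List, NamedTuple


class JobRecord(NamedTuple):
    """Hypothetical record of one finished query job."""
    project_id: str
    job_id: str
    finished_at: float  # completion time, seconds since the epoch


def latest_results(history: List[JobRecord],
                   project_id: str,
                   poll_number: int) -> List[JobRecord]:
    """Return the poll_number most recent records for project_id, newest first."""
    if not project_id:
        raise ValueError("project_id is required")
    if poll_number < 1:
        raise ValueError("poll_number must be at least 1")

    # Keep only this project's jobs, then order them newest first.
    own = [record for record in history if record.project_id == project_id]
    own.sort(key=lambda record: record.finished_at, reverse=True)
    return own[:poll_number]
```

If fewer than `poll_number` records exist for the project, the function simply returns all of them.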