Skip to main content

BigQuery

BigQuery is a highly efficient, serverless, and cost-effective enterprise data warehouse that provides extensive capabilities for BI (Business Intelligence), machine learning, and AI (Artificial Intelligence).

ClickPipes offers seamless support for data synchronization and data transformation tasks using BigQuery as the source database. This article serves as a comprehensive guide, providing step-by-step instructions on adding BigQuery data sources to ClickPipes.

Supported Versions and Architectures

CategoryDescription
VersionCloud Version
ArchitectureSupports Google Cloud BigQuery service

Supported Data Types

CategoryData Types
Numeric TypesINT64, NUMERIC, FLOAT64, BIGNUMERIC
String TypesSTRING
Date & Time TypesDATE, DATETIME, TIME, TIMESTAMP
Boolean TypeBOOL
Binary TypeBYTES
Structured TypesARRAY, STRUCT

Precautions

  • Mirror Engine must have access to Google Cloud Services
  • Appropriate permissions must be configured in Google Cloud for ClickPipes to access BigQuery
  • For optimal performance, it's recommended to use a Mirror Engine deployed in the same region as your BigQuery instance

Preparations

  1. Log in to Google Cloud Role page to create a role that will contain the permissions required for ClickPipes to operate BigQuery.

    1. Click CREATE ROLE.
    2. On the redirected page, enter the role name and click ADD PERMISSIONS.
    3. In the pop-up dialog, search for each permission one by one and grant them accordingly.
Minimum Permissions List (Click to expand)
bigquery.datasets.create
bigquery.datasets.get
bigquery.datasets.update
bigquery.jobs.create
bigquery.jobs.get
bigquery.jobs.list
bigquery.jobs.listAll
bigquery.jobs.delete
bigquery.jobs.update
bigquery.routines.list
bigquery.routines.get
bigquery.tables.create
bigquery.tables.delete
bigquery.tables.get
bigquery.tables.getData
bigquery.tables.list
bigquery.tables.setCategory
bigquery.tables.update
bigquery.tables.updateData
  1. After the permission selection is complete, click CREATE.
  1. Log in to Google Cloud Credentials page to create a service account that will be used for subsequent authentication.

    1. At the top of the page, click CREATE CREDENTIALS > Service Account.

    2. In the Service account details section, provide the name, ID, and description for the service account, and then click CREATE AND CONTINUE.

      Create access account

    3. In the Role drop-down box, select the role we just created (bigquery-role), and click DONE at the bottom of the page.

      Grant access

  2. Create an authentication key for the service account.

    1. On the Credentials page, click the service account you just created at the bottom of the page.

    2. On the KEYS table, click ADD KEY > Create new key.

      Create Key

    3. In the pop-up dialog, select the key type as JSON, and click CREATE.

      After the operation is completed, the key file will be automatically downloaded and saved to your computer. In order to ensure the security of your account, please keep the key file safe.

    4. Log in to Google Cloud Console to create datasets and tables, skipping this step if tables already exists.

      1. Create BigQuery Dataset.

        tip

        To ensure that ClickPipes properly reads the dataset information, when creating the dataset, select the Location type as Multi-region.

      2. Create Tables.

Source Config

  • Connection Settings
    • Connection name: Fill in a unique name that has business significance.
    • Connection type: Currently only supported as a Target.
    • Service Account(JSON): Open the key file that you have downloaded in advance using a text editor. Copy the content of the file and paste it into the text box provided.
    • Table Set ID: Select a dataset that already exists in BigQuery.