Google BigQuery Connectors

Mapping tasks with CDC sources example

Your organization needs to replicate changed data in real time from a mission-critical production system to minimize intrusive, non-critical work on that system, such as offline reporting or analytical operations. You can use Google BigQuery V2 Connector to capture changed data from any CDC source and write the changed data to a Google BigQuery target. Add the CDC sources in mappings, and then run the associated mapping tasks to write the changed data to the target.
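To make the idea of changed data concrete, the following is a minimal Python sketch of how a stream of insert, update, and delete records can be applied to a target identified by key columns. It is an illustration only, not the connector's implementation: the record shape `(operation, key, row)`, the operation names, and the `apply_changes` function are all hypothetical.

```python
from typing import Dict, Iterable, Tuple

# Hypothetical change-record shape: (operation, key, changed columns).
# Real CDC sources carry richer metadata; this is only an illustration.
ChangeRecord = Tuple[str, int, Dict[str, str]]


def apply_changes(target: Dict[int, Dict[str, str]],
                  changes: Iterable[ChangeRecord]) -> Dict[int, Dict[str, str]]:
    """Apply insert, update, and delete records to a keyed target, in order."""
    for op, key, row in changes:
        if op == "INSERT":
            target[key] = dict(row)
        elif op == "UPDATE":
            # Merge the changed columns into the existing row (upsert if missing).
            target.setdefault(key, {}).update(row)
        elif op == "DELETE":
            # Deleting a key that is absent is treated as a no-op.
            target.pop(key, None)
        else:
            raise ValueError(f"unknown operation: {op}")
    return target


changes = [
    ("INSERT", 1, {"name": "Ada", "city": "London"}),
    ("INSERT", 2, {"name": "Lin", "city": "Taipei"}),
    ("UPDATE", 1, {"city": "Paris"}),
    ("DELETE", 2, {}),
]
final_state = apply_changes({}, changes)
print(final_state)
```

In this sketch the key plays the role that the key columns play in the target configuration: every change record is matched to a target row by key before it is inserted, updated, or deleted.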
  1. In Data Integration, click New > Mapping > Create.
    The New Mapping dialog box appears.
  2. Enter a name and description for the mapping.
  3. On the Source transformation, specify a name and description in the general properties.
  4. On the Source tab, select any configured CDC connection and specify the required source properties.
  5. On the Target transformation, specify a name and description in the general properties.
  6. On the Target tab, perform the following steps to configure the target properties:
    1. In the Connection field, select the Google BigQuery V2 connection.
    2. In the Target Type field, select the type of the target object.
    3. In the Object field, select the required target object.
    4. In the Operation field, select Data Driven to properly handle insert, update, and delete records from the source.
    5. Leave the Data Driven Condition field empty.
    6. In the Update Column field, select the key columns used to upsert, update, or delete data in Google BigQuery.
    7. In the Advanced Properties section, set the Write Mode property to CDC.
    8. In CDC mode, you can configure only the following advanced target properties:
      • Target Dataset ID
      • Target Table Name
      • Job Poll Interval in Seconds
      • Pre SQL
      • Pre SQL Configuration
      • Post SQL
      • Post SQL Configuration
  7. On the Field Mapping tab, map the incoming fields to the target fields. You can manually map an incoming field to a target field or automatically map fields based on the field names.
  8. In the Actions menu, click New Mapping Task.
    The New Mapping Task page appears.
  9. On the Definition tab, enter the task name and select the configured mapping.
  10. On the CDC Runtime tab, specify the required properties for the selected CDC source.
    For more information about the CDC Runtime properties, see the source properties for the selected CDC source.
  11. On the Runtime Options tab, add the following properties in the Advanced Session Properties section:
    1. Select Commit on End of File from the menu, and keep the property disabled.
    2. Select Recovery Strategy and set Resume from last checkpoint as the value of the property.
  12. Click Save, and then run the mapping task.
    Alternatively, you can create a schedule that runs the mapping task on a recurring basis without manual intervention. You can define the schedule to minimize the time between mapping task runs.
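Step 11 sets Recovery Strategy to Resume from last checkpoint. As a rough illustration of the general idea behind checkpoint-based recovery (not a description of how the mapping task implements it), the Python sketch below records a position after each change is written, so that a restarted run continues from that position instead of starting over. The names `run_task`, `SimulatedFailure`, `state`, and `fail_at` are hypothetical.

```python
from typing import Dict, List, Optional


class SimulatedFailure(Exception):
    """Stands in for a task interruption in this sketch."""


def run_task(records: List[str],
             state: Dict[str, int],
             written: List[str],
             fail_at: Optional[int] = None) -> None:
    """Write records starting at the saved checkpoint, advancing it as we go."""
    for index in range(state["checkpoint"], len(records)):
        if index == fail_at:
            # Interrupt before this record is written; the checkpoint still
            # points at the first record that has not been written.
            raise SimulatedFailure(f"interrupted before record {index}")
        written.append(records[index])
        state["checkpoint"] = index + 1


records = ["c1", "c2", "c3", "c4", "c5"]
state = {"checkpoint": 0}   # would be persisted between runs in practice
written: List[str] = []

try:
    run_task(records, state, written, fail_at=3)   # first run is interrupted
except SimulatedFailure:
    pass

run_task(records, state, written)                  # restart resumes at record 3
print(written, state)
```

Because the second run starts from the saved checkpoint, each change is written once and none is skipped, which is the behavior a resume-style recovery strategy aims for.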
In Monitor, you can monitor the status of the logs after you run the task.
