Table of Contents

Search

  1. Preface
  2. Data Replication Overview
  3. Understanding Data Replication
  4. Sources - Preparation and Replication Considerations
  5. Targets - Preparation and Replication Considerations
  6. Starting the Server Manager
  7. Getting Started with the Data Replication Console
  8. Defining and Managing Server Manager Main Servers and Subservers
  9. Creating and Managing User Accounts
  10. Creating and Managing Connections
  11. Creating Replication Configurations
  12. Materializing Targets with InitialSync
  13. Scheduling and Running Replication Tasks
  14. Implementing Advanced Replication Topologies
  15. Monitoring Data Replication
  16. Managing Replication Configurations
  17. Handling Replication Environment Changes and Failures
  18. Troubleshooting
  19. Data Replication Files and Subdirectories
  20. Data Replication Runtime Parameters
  21. Command Line Parameters for Data Replication Components
  22. Updating Configurations in the Replication Configuration CLI
  23. DDL Statements for Manually Creating Recovery Tables
  24. Sample Scripts for Enabling or Disabling SQL Server Change Data Capture
  25. Glossary

CDC Publisher

CDC Publisher

The Informatica CDC Publisher is a Java-based tool that is used to stream change data to a target messaging system such as Apache Kafka.
The CDC Publisher contains the following components that move data:
  • The CDC Publisher
    Extractor
    consumes a stream of change data from the source. The incoming data records include schema information, row-based data changes, and transactional boundary metadata. The Extractor performs the following functions:
    • Assigns a sequence ID that is both repeatable and increasing to each change data record.
    • Interacts with the component that supplies the streamed data.
    • Ignores records that are older than the current restart point.
    • Verifies that data is in an expected format.
    • Places the results on an outbound queue for Formatter processing.
  • The
    Filter
    component optionally filters the extracted change data based on lists of source objects to include or exclude that you specify.
  • The
    Formatter
    receives change data from the CDC Publisher Extractor, formats the data based on the generated Avro schema of the selected format (flat, nested, or generic) for inclusion in messages, and sends the formatted messages to the Connector.
  • The
    Connector
    reads the formatted messages from the Formatter and connects to the target messaging system to apply the messages. The Connector applies the message data in a consistent, ordered, and recoverable manner.
The following image shows the basic architecture of the Java-based CDC Publisher:
Informatica CDC Publisher architecture showing the Extractor, Formatter, and Connector subcomponents and data flow.

0 COMMENTS

We’d like to hear from you!