Table of Contents

Search

  1. Preface
  2. Introduction to Informatica Edge Data Streaming
  3. Licenses
  4. Using Informatica Administrator
  5. Creating and Managing the Edge Data Streaming Service
  6. Edge Data Streaming Entity Types
  7. Edge Data Streaming Nodes
  8. Data Connections
  9. Working With Data Flows
  10. Managing the Edge Data Streaming Components
  11. Security
  12. High Availability
  13. Disaster Recovery
  14. Monitoring Edge Data Streaming Entities
  15. Appendix A: Troubleshooting
  16. Appendix B: Frequently Asked Questions
  17. Appendix C: Regular Expressions
  18. Appendix D: Command Line Program
  19. Appendix E: Configuring Edge Data Streaming to Work With a ZooKeeper Observer
  20. Appendix F: Glossary

User Guide

User Guide

Data Duplication

Data Duplication

In a deployment in which each target system receives data from a standalone instance of a target service, load balancing is not a requirement.
EDS
duplicates the data across the data targets.
EDS
delivers all the messages published by the source service to all the data target services so that they have the complete data set for analysis.
For example, you create separate target services to send data to an HDFS cluster, a Cassandra database, and a RulePoint instance.
The following image shows
EDS
data duplication across the data targets:
The data flow has one source service that publishes data as messages 1 and 2 over the topic logs. Three target services receive both messages and write them to the targets.
EDS
duplicates data as follows:
  1. A source service reads data from the data source and publishes it as two messages over a topic called logs.
  2. EDS
    delivers both the messages to all the target services.
  3. The target services receive both messages and send them to the targets.

0 COMMENTS

We’d like to hear from you!