Table of Contents


  1. Preface
  2. Introduction to Big Data Streaming
  3. Big Data Streaming Administration
  4. Sources in a Streaming Mapping
  5. Targets in a Streaming Mapping
  6. Streaming Mappings
  7. Window Transformation
  8. Appendix A: Connections
  9. Appendix B: Sample Files

Third-Party Applications

Third-Party Applications

Big Data Streaming uses third-parties distributions to connect to a Spark engine on a Hadoop cluster.
Big Data Streaming pushes job processing to the Spark engine. It uses YARN to manage the resources on a Spark cluster more efficiently.


We’d like to hear from you!