Table of Contents

Search

  1. Preface
  2. Introduction to Mass Ingestion
  3. Prepare
  4. Create
  5. Deploy
  6. Run
  7. Monitor
  8. Appendix A: infacmd mi Command Reference

Mass Ingestion Guide

Mass Ingestion Guide

Overview

Overview

Use Informatica Mass Ingestion (the Mass Ingestion tool) to ingest large amounts of data from a relational database to a Hive or HDFS target.
The Mass Ingestion tool simplifies the process of ingesting data by providing a wizard that you can use to create a mass ingestion specification. A mass ingestion specification is a configuration that you can design to specify the data that you want to ingest and how you want to ingest it.
The wizard walks you through the steps that you can use to configure each part of the specification, including the relational source and the Hive or HDFS target, and any parameters that you want to configure for the source, such as a parameter to filter certain columns or to mask the data to protect private information.
When you run the mass ingestion specification, the Mass Ingestion tool uses Data Engineering Integration to run the ingestion job on a Hadoop cluster. The specification replaces the need to manually create and run mappings, and it can ingest all of your data in one run. As the schemas in the relational database evolve, the specification can accommodate and ingest only the incremental data.

0 COMMENTS

We’d like to hear from you!