For Data Engineering Integration, you can use Cloudera Data Platform (CDP) as a compute cluster to execute data engineering jobs in the Hadoop environment. Cloudera CDP is supported when you run data engineering jobs on the Spark engine. It is not supported on the Blaze engine.
Cloudera CDP uses a base cluster and workload clusters to execute data engineering jobs. This architecture allows you to deploy workloads and share data among components through a shared catalog, unified security, consistent governance, and data life cycle management.
You can use Cloudera CDP when you run a mapping in the Hadoop environment with the following connections:
PowerExchange for Amazon Redshift
PowerExchange for Amazon S3
PowerExchange for HDFS
PowerExchange for Microsoft Azure Blob Storage
PowerExchange for Microsoft Azure CosmosDB SQL API
PowerExchange for Microsoft Azure Data Lake Storage Gen1
PowerExchange for Microsoft Azure Data Lake Storage Gen2
PowerExchange for Microsoft Azure SQL Data Warehouse
You can also use Cloudera CDP when you run a PowerExchange for HDFS mapping in the native environment.
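The support rules above can be summarized as a small lookup. The following Python sketch is illustrative only: the function name, the environment and engine strings, and the shortened connection names are assumptions made for this example and are not part of any Informatica API.

```python
# Hypothetical summary of the support rules described above.
# All names here are illustrative; none come from an Informatica API.

# Connections listed for mappings that run in the Hadoop environment
# (the "PowerExchange for" prefix is dropped for brevity).
HADOOP_CONNECTIONS = frozenset({
    "Amazon Redshift",
    "Amazon S3",
    "HDFS",
    "Microsoft Azure Blob Storage",
    "Microsoft Azure CosmosDB SQL API",
    "Microsoft Azure Data Lake Storage Gen1",
    "Microsoft Azure Data Lake Storage Gen2",
    "Microsoft Azure SQL Data Warehouse",
})


def cdp_supported(environment, connection, engine=None):
    """Return True if the combination matches the rules stated above."""
    if environment == "native":
        # Only PowerExchange for HDFS is listed for the native environment.
        return connection == "HDFS"
    if environment == "hadoop":
        # The Spark engine is supported; the Blaze engine is not.
        return engine == "spark" and connection in HADOOP_CONNECTIONS
    return False
```

For example, cdp_supported("hadoop", "Amazon S3", engine="spark") is true under these rules, while the same mapping with engine="blaze" is not.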
For more information, see the Informatica® Data Engineering 10.4.1 Integration Guide.