Implementing Data Engineering Integration with Google Dataproc

Implementing Data Engineering Integration with Google Dataproc

Overview

Overview

Google Dataproc is a lightweight implementation of Hadoop and Apache Spark on the Google cloud platform. When you integrate Informatica Data Engineering products with Dataproc, you configure an on-premises Informatica domain to run jobs on the Dataproc cloud cluster.
When configuration is complete, you can run mappings from the Developer tool in the Dataproc environment.
For information about supported security providers, see the topic "Supported Identity Providers" in the
Informatica Security Guide
.
For more information about product requirements and supported platforms, see the Product Availability Matrix on Informatica Network: https://network.informatica.com/community/informatica-network/product-availability-matrices
The following image shows an overview of the integration process:

0 COMMENTS

We’d like to hear from you!