Configure Mappings to Run on Dataproc

To run mappings on the Dataproc cluster, configure mappings with the following properties:
  1. In the
    section, create a parameter with the values shown in the following table:
    Default Value: Use the dropdown control to select the Hadoop connection that the cluster configuration created.
  2. In the
    section, choose the following values:
    • Under Validation Environments, select Spark.
    • For Execution Environment, select Hadoop.
    • Under the Hadoop section, for the Connection property, click the right side of the Value cell to open the dropdown menu, then choose the pushdownConnection parameter.
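
The settings described in the steps above can be summarized as a small checklist. The following Python sketch is illustrative only and does not call any Informatica API: the function name check_mapping_settings, the dictionary keys, and the connection name dataproc_hadoop_conn are hypothetical stand-ins for values you would read from the mapping properties by hand.

```python
# Illustrative checklist only. These names are assumptions made for this
# sketch; they are not part of any Informatica API or file format.

def check_mapping_settings(settings, parameters):
    """Return a list of problems; an empty list means the settings match the steps."""
    problems = []

    # Step 1: the pushdownConnection parameter needs a default Hadoop connection.
    if not parameters.get("pushdownConnection"):
        problems.append("pushdownConnection has no default Hadoop connection.")

    # Step 2: Spark must be among the selected validation environments.
    if "Spark" not in settings.get("validation_environments", set()):
        problems.append("Spark is not selected under Validation Environments.")

    # Step 2: the execution environment must be Hadoop.
    if settings.get("execution_environment") != "Hadoop":
        problems.append("Execution Environment is not Hadoop.")

    # Step 2: the Hadoop Connection property must use the parameter.
    if settings.get("hadoop_connection") != "pushdownConnection":
        problems.append("Hadoop Connection does not use pushdownConnection.")

    return problems


# Hypothetical values, as they might be copied from the mapping properties.
parameters = {"pushdownConnection": "dataproc_hadoop_conn"}
settings = {
    "validation_environments": {"Spark"},
    "execution_environment": "Hadoop",
    "hadoop_connection": "pushdownConnection",
}

for problem in check_mapping_settings(settings, parameters):
    print(problem)
```

With the values shown, the loop prints nothing because every check passes. Changing any value, for example setting execution_environment to "Native", produces one message for each mismatch.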

Updated March 31, 2021