Connections and Cluster Distributions that Support Data Preview
Connections and Cluster Distributions that Support Data Preview
When you preview data using the Spark engine, configure the mapping with a supported connection and Hadoop distribution.
Connections
You can use the Spark engine to preview data on mappings that use the following connections:
HBase
HDFS
Hive
JDBC configured for Sqoop
When you configure a mapping that uses a JDBC connection, you can use a generic JDBC connection, or use the specialized drivers for Oracle or Teradata.
Cluster Distributions
You can preview data in mappings configured to run with the following distributions:
Amazon EMR
Azure HDInsight*
Cloudera CDH
Cloudera CDP
Hortonworks HDP
MapR*
* Effective in version 10.2.2, previewing data on these distributions is supported for technical preview only.
Technical preview functionality is supported for evaluation purposes but is unwarranted and is not production-ready. Informatica recommends that you use in non-production environments only. Informatica intends to include the preview functionality in an upcoming release for production use, but might choose not to in accordance with changing market or technical circumstances. For more information, contact Informatica Global Customer Support.
Before you preview data on Amazon EMR, you must configure the
/etc/hosts
file on all nodes in the cluster to include the machine and IP address of the Data Integration Service.