To read and process data from Hive, HBase, or HDFS sources that use Kerberos authentication, you must configure Kerberos authentication for mappings in the native environment.
To read and process data from Hive, HBase, or HDFS sources, perform the following steps:
Complete the tasks for running mappings in a Kerberos-enabled Hadoop environment.
Complete the tasks for running mappings in the Hadoop environment when Informatica uses Kerberos authentication.
Create matching operating system profile user names on the machine that runs the Data Integration Service and each Hadoop cluster node used to run Informatica mapping jobs.
Create an Active Directory user that matches the operating system profile user you created in step 3.
Create an SPN associated with the user.
Use the following guidelines when you create the SPN and keytab files:
The UPN must be the same as the SPN.
Enable delegation in Active Directory.
Use the ktpass utility to create an SPN associated with the user and generate the keytabs file.
For example, enter the following command:
ktpass -out infa_hadoop.keytab -mapuser joe -pass tempBG@2008 -princ joe/domain12345@HADOOP-AD-REALM -crypto all
The
-out
parameter specifies the name and path of the keytab file. The
-mapuser
parameter is the user to which the SPN is associated. The
-pass
parameter is the password for the SPN in the generated keytab. The