Big Data Streaming User Guide

10.2.1

Prerequisites to Read From or Write to a Kerberized Kafka Cluster

To read from or write to a Kerberized Kafka cluster, configure the default realm, the KDC, the Hadoop connection properties, and the properties of the Kafka data object read or write operation.

Before you read from or write to a Kerberized Kafka cluster, perform the following tasks:

Ensure that you have the krb5.conf file for the Kerberized Kafka server.

Configure the default realm and KDC. If the default /etc/krb5.conf file is not configured or you want to change the configuration, add the following lines to the /etc/krb5.conf file:

[libdefaults]
default_realm = <REALM NAME>
dns_lookup_realm = false
dns_lookup_kdc = false
ticket_lifetime = 24h
renew_lifetime = 7d
forwardable = true

[realms]
<REALM NAME> = {
    kdc = <Location where KDC is installed>
    admin_server = <Location where KDC is installed>
}

[domain_realm]
.<domain name or hostname> = <KERBEROS DOMAIN NAME>
<domain name or hostname> = <KERBEROS DOMAIN NAME>
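
A quick consistency check on the file can catch a default realm that has no matching entry under [realms]. The following is a minimal sketch, not an Informatica or MIT Kerberos tool: the function name `parse_krb5_realms`, the sample realm `EXAMPLE.COM`, and the host `kdc.example.com` are hypothetical, and the parser handles only the simple layout shown above.

```python
import re


def parse_krb5_realms(text):
    """Return (default_realm, [realm names]) from krb5.conf-style text.

    Handles only the simple layout shown above; it is not a full parser.
    """
    section = None
    default_realm = None
    realms = []
    depth = 0  # brace nesting level inside [realms]
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        header = re.fullmatch(r"\[(\w+)\]", line)
        if header:
            section = header.group(1)
            continue
        if section == "libdefaults" and line.startswith("default_realm"):
            default_realm = line.split("=", 1)[1].strip()
        elif section == "realms":
            # A top-level "NAME = {" line opens a realm definition.
            if depth == 0 and line.endswith("{"):
                realms.append(line.split("=", 1)[0].strip())
            depth += line.count("{") - line.count("}")
    return default_realm, realms


# Hypothetical sample values, for illustration only.
sample = """
[libdefaults]
default_realm = EXAMPLE.COM

[realms]
EXAMPLE.COM = {
    kdc = kdc.example.com
    admin_server = kdc.example.com
}

[domain_realm]
.example.com = EXAMPLE.COM
"""

realm, realms = parse_krb5_realms(sample)
if realm not in realms:
    raise SystemExit(f"default_realm {realm!r} has no entry under [realms]")
```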

To pass a static JAAS configuration file into the JVM using the java.security.auth.login.config property at runtime, perform the following tasks:

Ensure that you have a JAAS configuration file.

For information about creating JAAS configuration and configuring Keytab for Kafka clients, see the Apache Kafka documentation at https://kafka.apache.org/0101/documentation/#security

For example, the JAAS configuration file can contain the following lines of configuration:

// Kafka client authentication. Used for the connection from the client to the Kafka broker.
KafkaClient {
com.sun.security.auth.module.Krb5LoginModule required
doNotPrompt=true
useKeyTab=true
storeKey=true
keyTab="<path to keytab file>/<keytab file name>"
principal="<principal name>"
client=true;
};
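
Because the same file has to exist on every node, it can help to generate it from a single template. The following is a minimal sketch that renders an entry with the same options as the example above; the function name `render_kafka_client_jaas`, the keytab path, and the principal are hypothetical placeholders.

```python
def render_kafka_client_jaas(keytab_path, principal):
    """Render a KafkaClient JAAS entry that logs in from a keytab.

    The option list mirrors the example above. The semicolon after the
    last option ends the login module entry; "};" closes the section.
    """
    options = [
        ("doNotPrompt", "true"),
        ("useKeyTab", "true"),
        ("storeKey", "true"),
        ("keyTab", f'"{keytab_path}"'),
        ("principal", f'"{principal}"'),
        ("client", "true"),
    ]
    body = "\n".join(f"  {key}={value}" for key, value in options)
    return (
        "KafkaClient {\n"
        "  com.sun.security.auth.module.Krb5LoginModule required\n"
        f"{body};\n"
        "};\n"
    )


# Hypothetical values, for illustration only.
jaas_text = render_kafka_client_jaas(
    "/etc/kafka_client.keytab", "kafka-client@EXAMPLE.COM"
)
print(jaas_text)
```

Writing `jaas_text` to the same path on each node keeps the files identical across the cluster.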

Place the JAAS config file and keytab file in the same location on all the nodes of the Hadoop cluster.

Informatica recommends that you place the files in a location that is accessible to all the nodes in the cluster, for example /etc or /temp.

On the Spark Engine tab of the Hadoop connection properties, update the extraJavaOptions property of the executor and the driver in the Advanced Properties property. Click Edit and update the properties in the following format:

infaspark.executor.extraJavaOptions=-Djava.security.egd=file:/dev/./urandom
-XX:MaxMetaspaceSize=256M -Djavax.security.auth.useSubjectCredsOnly=true
-Djava.security.krb5.conf=/<path to krb5.conf file>/krb5.conf
-Djava.security.auth.login.config=/<path to JAAS config>/<kafka_client_jaas>.config

infaspark.driver.cluster.mode.extraJavaOptions=-Djava.security.egd=file:/dev/./urandom
-XX:MaxMetaspaceSize=256M -Djavax.security.auth.useSubjectCredsOnly=true
-Djava.security.krb5.conf=/<path to krb5.conf file>/krb5.conf
-Djava.security.auth.login.config=/<path to JAAS config>/<kafka_client_jaas>.config
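
The executor and driver values differ only in the property name, so both can be built from one list of JVM flags. The following is a minimal sketch. It assumes that each value is entered as a single line with the flags separated by spaces; the helper name `build_extra_java_options` and the two file paths are hypothetical.

```python
def build_extra_java_options(krb5_conf, jaas_conf):
    """Join the JVM flags from the format above into one space-separated value."""
    flags = [
        "-Djava.security.egd=file:/dev/./urandom",
        "-XX:MaxMetaspaceSize=256M",
        "-Djavax.security.auth.useSubjectCredsOnly=true",
        f"-Djava.security.krb5.conf={krb5_conf}",
        f"-Djava.security.auth.login.config={jaas_conf}",
    ]
    return " ".join(flags)


# Hypothetical paths, for illustration only.
options = build_extra_java_options(
    "/etc/krb5.conf", "/etc/kafka_client_jaas.config"
)
properties = {
    "infaspark.executor.extraJavaOptions": options,
    "infaspark.driver.cluster.mode.extraJavaOptions": options,
}
for name, value in properties.items():
    print(f"{name}={value}")
```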

Configure the following properties in the data object read or write operation:

Data object read operation. Configure the Consumer Configuration Properties property in the advanced properties.

Data object write operation. Configure the Producer Configuration Properties property in the advanced properties.

Specify the following value:

security.protocol=SASL_PLAINTEXT,sasl.kerberos.service.name=kafka,sasl.mechanism=GSSAPI
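
The value is a comma-separated list of Kafka client properties. The following minimal sketch splits such a value into key-value pairs so that a typo is caught before the mapping runs. How the product itself parses the value is not documented here, so the splitting rule is an assumption, and the function name `parse_kafka_properties` is hypothetical.

```python
def parse_kafka_properties(value):
    """Split 'k1=v1,k2=v2' into a dict; raise if a pair has no '='."""
    props = {}
    for pair in value.split(","):
        pair = pair.strip()
        if not pair:
            continue
        key, sep, val = pair.partition("=")
        if not sep:
            raise ValueError(f"missing '=' in {pair!r}")
        props[key.strip()] = val.strip()
    return props


value = (
    "security.protocol=SASL_PLAINTEXT,"
    "sasl.kerberos.service.name=kafka,"
    "sasl.mechanism=GSSAPI"
)
props = parse_kafka_properties(value)

# The three properties named in this section must all be present.
required = {"security.protocol", "sasl.kerberos.service.name", "sasl.mechanism"}
missing = required - props.keys()
if missing:
    raise SystemExit(f"missing properties: {sorted(missing)}")
```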

To embed the JAAS configuration in the sasl.jaas.config configuration property, perform the following tasks:

On the Spark Engine tab of the Hadoop connection properties, update the extraJavaOptions property of the executor and the driver in the Advanced Properties property. Click Edit and update the properties in the following format:

infaspark.executor.extraJavaOptions = -Djava.security.egd=file:/dev/./urandom 
-XX:MaxMetaspaceSize=256M -XX:+UseG1GC -XX:MaxGCPauseMillis=500 
-Djava.security.krb5.conf=<path to krb5.conf file>

infaspark.driver.cluster.mode.extraJavaOptions = -Djava.security.egd=file:/dev/./urandom 
-XX:MaxMetaspaceSize=256M -XX:+UseG1GC -XX:MaxGCPauseMillis=500 
-Djava.security.krb5.conf=<path to krb5.conf file>

Configure the following properties in the data object read or write operation:

Data object read operation. Configure the Consumer Configuration Properties property in the advanced properties.

Data object write operation. Configure the Producer Configuration Properties property in the advanced properties.

Specify the following value:

security.protocol=SASL_PLAINTEXT,sasl.kerberos.service.name=kafka,sasl.mechanism=GSSAPI,
sasl.jaas.config=com.sun.security.auth.module.Krb5LoginModule required useKeyTab=true 
storeKey=true doNotPrompt=true serviceName="<service_name>" keyTab="<location of keytab file>" 
client=true principal="<principal_name>";
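
With the embedded approach, the whole JAAS entry becomes one property value, so quoting and the trailing semicolon are easy to get wrong by hand. The following is a minimal sketch that assembles the value in the format shown above. The function names, keytab path, and principal are hypothetical, and the sketch assumes that the value is entered as a single line and that the paths contain no commas.

```python
def build_sasl_jaas_config(keytab, principal, service_name):
    """Build the single-line sasl.jaas.config value; it must end with ';'."""
    return (
        "com.sun.security.auth.module.Krb5LoginModule required "
        "useKeyTab=true storeKey=true doNotPrompt=true "
        f'serviceName="{service_name}" keyTab="{keytab}" '
        f'client=true principal="{principal}";'
    )


def build_property_value(keytab, principal, service_name="kafka"):
    """Join the Kafka client properties into one comma-separated value."""
    pairs = [
        ("security.protocol", "SASL_PLAINTEXT"),
        ("sasl.kerberos.service.name", service_name),
        ("sasl.mechanism", "GSSAPI"),
        ("sasl.jaas.config",
         build_sasl_jaas_config(keytab, principal, service_name)),
    ]
    return ",".join(f"{key}={val}" for key, val in pairs)


# Hypothetical values, for illustration only.
embedded_value = build_property_value(
    "/etc/kafka_client.keytab", "kafka-client@EXAMPLE.COM"
)
print(embedded_value)
```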

The following image shows the Advanced Properties property in the Hadoop connection: