To integrate the Python installation with the Hadoop connection, configure the Spark advanced properties that the Spark engine requires to run the Python transformation.
In the Administrator tool, complete the following tasks:
Navigate to
Manage > Connections
.
Select the Hadoop connection that connects to the Hadoop cluster.
Edit the
Spark configuration
.
Edit the
Advanced Properties
.
Configure the following advanced properties:
infaspark.pythontx.exec
Location of the Python executable binary. Set the property to the following value: