Install Python for the Python transformation on Hadoop

Integrate Python with the Hadoop connection

To integrate the Python installation with the Hadoop connection, configure the Spark advanced properties that the Spark engine requires to run the Python transformation.
In the Administrator tool, complete the following tasks:
  1. Navigate to Manage > Connections.
  2. Select the Hadoop connection that connects to the Hadoop cluster.
  3. Edit the Spark configuration.
  4. Edit the Advanced Properties.
  5. Configure the following advanced properties:
    infaspark.pythontx.exec
    Location of the Python executable binary. Set the property to the following value:
    $INFA_HOME/services/shared/spark/python/bin/python3
    infaspark.pythontx.executorEnv.PYTHONHOME
    Location of the Python installation directory. Keep the following default value:
    $INFA_HOME/services/shared/spark/python/
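
Before saving the connection, it can help to confirm that the two property values resolve to real paths. The following is a minimal sketch, not part of the product: the helper names `resolve_properties` and `find_missing_paths` are hypothetical, and the fallback `/opt/informatica` is only an assumed placeholder for `$INFA_HOME`. It checks paths on the machine where it runs, so run it where the Python installation actually resides.

```python
import os
from string import Template

# Property names and values exactly as listed in step 5.
PYTHONTX_PROPERTIES = {
    "infaspark.pythontx.exec":
        "$INFA_HOME/services/shared/spark/python/bin/python3",
    "infaspark.pythontx.executorEnv.PYTHONHOME":
        "$INFA_HOME/services/shared/spark/python/",
}


def resolve_properties(infa_home):
    """Expand $INFA_HOME in each property value (hypothetical helper)."""
    return {
        name: Template(value).substitute(INFA_HOME=infa_home)
        for name, value in PYTHONTX_PROPERTIES.items()
    }


def find_missing_paths(resolved):
    """Return the property names whose resolved path does not exist locally."""
    return [name for name, path in resolved.items() if not os.path.exists(path)]


if __name__ == "__main__":
    # Assumption: INFA_HOME is exported; the fallback is only a placeholder.
    infa_home = os.environ.get("INFA_HOME", "/opt/informatica")
    resolved = resolve_properties(infa_home)
    for name, path in resolved.items():
        print(f"{name}={path}")
    for name in find_missing_paths(resolved):
        print(f"WARNING: path for {name} not found on this machine")
```

A warning here only means the path is absent on the local machine. It does not by itself show that the Hadoop connection is misconfigured.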
