Data Engineering Integration
- Data Engineering Integration 10.2.2 HotFix 1
- All Products
Property
| Description
|
---|---|
Spark Staging Directory
| The HDFS file path of the directory that the Spark engine uses to store temporary files for running jobs. The YARN user, Data Integration Service user, and mapping impersonation user must have write permission on this directory.
/tmp/SPARK_<user name> .
When you run Sqoop jobs on the Spark engine, the Data Integration Service creates a Sqoop staging directory within the Spark staging directory to store temporary files:
<Spark staging directory>/sqoop_staging
|
Spark Event Log Directory
| Optional. The HDFS file path of the directory that the Spark engine uses to log events.
|
YARN Queue Name
| The YARN scheduler queue name used by the Spark engine that specifies available resources on a cluster. The name is case sensitive.
|
Advanced Properties
| List of advanced properties that are unique to the Spark engine. The advanced properties include a list of default properties.
You can configure run-time properties for the Hadoop environment in the Data Integration Service, the Hadoop connection, and in the mapping. You can override a property configured at a high level by setting the value at a lower level. For example, if you configure a property in the Data Integration Service custom properties, you can override it in the Hadoop connection or in the mapping. The Data Integration Service processes property overrides based on the following priorities:
Informatica does not recommend changing these property values before you consult with third-party documentation, Informatica documentation, or Informatica Global Customer Support. If you change a value without knowledge of the property, you might experience performance degradation or other unexpected results.
|