Table of Contents

Search

  1. About the Enterprise Data Preparation Administrator Guide
  2. Introduction to Enterprise Data Preparation Administration
  3. Getting Started
  4. Administration Process
  5. User Account Setup
  6. Search Configuration
  7. Roles, Privileges, and Profiles
  8. Data Asset Access and Publication Management
  9. Masking Sensitive Data
  10. Monitoring Enterprise Data Preparation
  11. Backing Up and Restoring Enterprise Data Preparation
  12. Managing the Data Lakehouse
  13. Schedule Export, Import and Publish Activities
  14. Interactive Data Preparation Service
  15. Enterprise Data Preparation Service

Enterprise Data Preparation Administrator Guide

Enterprise Data Preparation Administrator Guide

Managing Advanced Properties in a Data Lakehouse

Managing Advanced Properties in a Data Lakehouse

You can configure the advanced properties for the Enterprise Data Preparation.
When you add or edit the advanced properties, you might need to log in again to reflect the changes.

Upload and Download Options

Configure the following options:
Max File Size Upload
Maximum size of the files that the users upload. The default value is 1,024 MB.
Max Rows To Download
Maximum rows that are exported to a CSV file for an asset. You can specify a maximum of 2,000,000,000. Enter a value of -1 to export all rows.

Data Preparation Options

Configure the following options:
Default Data Prep Sample Size
The default number of sample rows to fetch for data preparation. You can specify a maximum number of 1,000,000 rows and a minimum of 1 rows.
Max Data Prep Sample Size
Maximum number of rows to fetch for data preparation. You can specify a maximum number of 1,000,000 rows and a minimum of 1 rows.
Execution Engine for Hive Sampling
Engine for Hive sampling such as Hadoop Cluster, MR, Spark, or Tez. The default is Hadoop Cluster.

Visualization Options

Configure the following options:
Zeppelin URL
The URL to access the Zeppelin framework. The URL must be in
http[s]://<Zeppelin host name>:<port>

Catalog Options

Configure the following options:
Max Recommendations to Display
Maximum number of recommended data assets to display on a
Projects
Add Worksheet
page. You can specify a maximum of 50 recommendations. A value of 0 means no recommendations will be displayed.
Enable Business Title
Displays the business glossary terms, display names, and Axon terms in the data preview or preparation of the assets.

Hadoop Options

Configure the following options:
Hive Staging Connection
Hive connection data preview and preparation of complex files.
Hive Table Storage Format
Stores the data in the format such as Hadoop Cluster, ORC, or Parquet. The default is Hadoop Cluster.

Help Options

Configure the following options:
Documentation
Select online or offline help for reference.
Enable Help Video
Enables or disables the help video on the home page.