The Data Preparation Service manages data preparation within Enterprise Data Lake. When an analyst prepares data in a project, the Data Preparation Service stores worksheet metadata in the Data Preparation repository.
The service connects to the Hadoop cluster to read sample data from Hive tables. It also connects to HDFS in the same cluster to store the sample data being prepared in the worksheet.
Create the Data Preparation Service before you create the Enterprise Data Lake Service, because you must associate the Enterprise Data Lake Service with a Data Preparation Service.
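
The ordering requirement above can be illustrated with a minimal sketch. This is not an Informatica API or command. The `PREREQUISITES` table and the `check_creation_order` function are hypothetical names introduced here only to model the rule that the Data Preparation Service must exist before the Enterprise Data Lake Service is created.

```python
# Hypothetical model for illustration only; not part of any Informatica tool.
DATA_PREP = "Data Preparation Service"
DATA_LAKE = "Enterprise Data Lake Service"

# Each service maps to the services that must already exist before it is created.
PREREQUISITES = {
    DATA_PREP: [],
    DATA_LAKE: [DATA_PREP],
}


def check_creation_order(order):
    """Return a list of problems found in a planned service creation order."""
    created = set()
    problems = []
    for service in order:
        for required in PREREQUISITES.get(service, []):
            if required not in created:
                problems.append(
                    f"{service} requires {required} to be created first"
                )
        created.add(service)
    return problems


if __name__ == "__main__":
    # Valid plan: the prerequisite service comes first, so no problems are reported.
    print(check_creation_order([DATA_PREP, DATA_LAKE]))
    # Invalid plan: the Enterprise Data Lake Service is listed too early.
    print(check_creation_order([DATA_LAKE, DATA_PREP]))
```

A check of this kind could be run against a deployment plan before any services are created, so that an out-of-order plan is caught early.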