To perform data movement and data masking operations for Hadoop connections, create a Hadoop plan. You can add groups and data masking components to a Hadoop plan. You cannot perform data subset or data generation operations for Hadoop sources and targets.
Open a project and click Execute.
Click Actions > New.
In the New Plan dialog box, enter a name and optional description for the plan.
Select the Hadoop plan type.
Click Next.
To add a data masking operation to the plan, click Add Masking Components.
Select the policies and rules to add to the plan. Click OK.
Click Next.
To add groups to the plan, click Add Groups. You can add groups to a plan to move data from a source to a target.
Select the groups to add to the plan. Click OK.
Click Next.
Review all the masking components and groups.
You cannot edit the groups.
Click Next.
Configure source and target connections.
If you select an HDFS target connection, you can optionally select a resource format: Avro or Parquet. The default is None.
Configure target properties and error and recovery settings.
Configure advanced settings. You can choose to persist the mapping to store it for future use. You can select the Blaze or Spark execution engine.
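The choices made in the steps above can be written down as a small planning checklist before you open the wizard. The following Python sketch is only an illustration: the class name `HadoopPlanSketch`, its field names, and the `problems` method are hypothetical and are not part of any product API. It encodes only the constraints stated in this topic: the optional Avro or Parquet resource format for an HDFS target, and the Blaze or Spark execution engine.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Hypothetical names throughout: this is a planning checklist, not a product API.
RESOURCE_FORMATS = {None, "Avro", "Parquet"}  # None is the default
EXECUTION_ENGINES = {"Blaze", "Spark"}


@dataclass
class HadoopPlanSketch:
    name: str
    execution_engine: str                      # "Blaze" or "Spark"
    description: str = ""                      # optional
    masking_components: List[str] = field(default_factory=list)
    groups: List[str] = field(default_factory=list)
    target_is_hdfs: bool = False
    resource_format: Optional[str] = None      # only meaningful for an HDFS target
    persist_mapping: bool = False              # store the mapping for future use

    def problems(self) -> List[str]:
        """Return a list of settings that conflict with the steps in this topic."""
        found = []
        if not self.name:
            found.append("Enter a name for the plan.")
        if self.resource_format not in RESOURCE_FORMATS:
            found.append("Resource format must be Avro, Parquet, or None.")
        elif self.resource_format is not None and not self.target_is_hdfs:
            found.append("A resource format applies only to an HDFS target.")
        if self.execution_engine not in EXECUTION_ENGINES:
            found.append("Execution engine must be Blaze or Spark.")
        return found
```

For example, `HadoopPlanSketch(name="mask_customers", execution_engine="Spark", target_is_hdfs=True, resource_format="Parquet").problems()` returns an empty list, while a plan with no name or an unsupported engine returns one message per issue. The plan name `mask_customers` is a made-up value.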