You can configure Fast Clone to unload source data to output files on a Cloudera or Hortonworks distribution of Hadoop or to a Hive data warehouse target.
With these Hadoop target types, Fast Clone does not generate a load script for a target load utility.
On the Target DB tab, select CLOUDERA, HIVE, or HORTONWORKS in the Database Type list.
Enter the version, host name, port number, and user name for connecting to the target.
To specify the directory in the distributed file system for the Cloudera or Hortonworks output files, click Select and then either browse to the directory or enter the path to the directory in the Path field.
The user name that you specify for connecting to the Hadoop target must have write permissions on this directory.
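As a rough illustration of the write-permission requirement, the following Python sketch evaluates a POSIX-style permission string of the kind that HDFS directory listings show. The function name `can_write` and the user, group, and owner values are hypothetical, and the sketch deliberately ignores ACLs and superuser privileges, so treat it as a first check rather than a definitive answer.

```python
def can_write(mode, owner, group, user, user_groups):
    """Return True if `user` has the write bit under a POSIX-style mode string.

    `mode` is the 9-character permission part of a directory listing,
    for example "rwxr-xr-x". ACLs and superuser rights are not considered.
    """
    if len(mode) != 9:
        raise ValueError("expected 9 permission characters, e.g. 'rwxr-xr-x'")
    if user == owner:
        bits = mode[0:3]      # owner permissions apply
    elif group in user_groups:
        bits = mode[3:6]      # group permissions apply
    else:
        bits = mode[6:9]      # permissions for all other users apply
    return bits[1] == "w"


# Hypothetical directory owned by hdfs:hadoop, checked for user "fcuser".
print(can_write("rwxrwxr-x", "hdfs", "hadoop", "fcuser", ["hadoop"]))
```

If the check fails for the connecting user, an administrator would typically change the ownership or mode of the directory before you unload data.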
Click Connect.
The Fast Clone Console connects to the target.
The Fast Clone Console requires the 64-bit Java Runtime Environment (JRE) 1.8 to connect to these Hadoop targets.
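To confirm the JRE requirement before you connect, you can inspect the text that `java -version` prints. The Python sketch below is one possible check, not part of Fast Clone: the function name `is_64bit_java_8` is hypothetical, and the sample text assumes the usual Oracle HotSpot output layout, which may differ for other JVM vendors.

```python
import re


def is_64bit_java_8(version_output):
    """Check `java -version` text for a 1.8 release and a 64-bit VM."""
    match = re.search(r'version "(\d+)\.(\d+)', version_output)
    if match is None:
        return False
    major, minor = int(match.group(1)), int(match.group(2))
    # 64-bit HotSpot and OpenJDK VMs include "64-Bit" in the VM line.
    return (major, minor) == (1, 8) and "64-Bit" in version_output


# Sample text in the layout that a 64-bit Java 8 runtime typically prints.
sample = (
    'java version "1.8.0_281"\n'
    "Java(TM) SE Runtime Environment (build 1.8.0_281-b09)\n"
    "Java HotSpot(TM) 64-Bit Server VM (build 25.281-b09, mixed mode)"
)
print(is_64bit_java_8(sample))
```

On a real system, `java -version` writes this text to standard error, so you could pass in `subprocess.run(["java", "-version"], capture_output=True, text=True).stderr`.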
For Hive targets, select a database in the Database list.
The Fast Clone Console uses the definition of the selected database to determine the location of the flat files that store table data in HDFS.
Before you unload source data to a Hive target, go to the Runtime Settings tab > Format Settings view and, in the Column separator field, enter the field delimiter that the Hive target uses in data files. By default, Hive data warehouses use the ASCII start of heading (SOH) character as the field delimiter.
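The SOH character is a non-printing control character with code point 1 (often written as `\x01`, `\001`, or Ctrl-A), so it is easy to mistype. The following Python sketch shows what an SOH-delimited record looks like and how it splits back into fields. The helper names and sample values are hypothetical, and the sketch does not handle escaping or NULL representation.

```python
SOH = "\x01"  # ASCII start of heading, code point 1 (Ctrl-A)


def join_row(fields):
    """Build one data-file line from a list of string field values."""
    return SOH.join(fields)


def split_row(line):
    """Split one data-file line back into its field values."""
    return line.rstrip("\n").split(SOH)


# A comma inside a field is harmless because the delimiter is SOH.
row = join_row(["1001", "Smith, John", "2024-01-15"])
print(repr(row))
print(split_row(row + "\n"))
```

Because SOH rarely occurs in ordinary text, values that contain commas or tabs do not need quoting, which is one practical reason to keep the default delimiter unless the target tables were created with a different one.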