Hadoop HDFS Connection

Use a Hadoop HDFS connection in a TDM plan to generate PowerCenter mappings and write to the target. Create and manage Hadoop connections from Test Data Manager.
The following table describes the Hadoop HDFS connection properties that you can configure:
Property
Description
Name
Required. Name of the connection. The name is not case sensitive and must be unique within the domain. The name cannot exceed 128 characters, contain spaces, or contain the following special characters:
~ ` ! $ % ^ & * ( ) - + = { [ } ] | \ : ; " ' < , > . ? /
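The naming rules above can be checked before you create the connection. The following Python sketch is illustrative only and is not part of Test Data Manager. The helper name name_problems and the existing_names argument are assumptions made for this example.

```python
# Characters that the connection name may not contain, copied from the list above.
FORBIDDEN_CHARS = set("~`!$%^&*()-+={[}]|\\:;\"'<,>.?/")
MAX_NAME_LENGTH = 128


def name_problems(name, existing_names=()):
    """Return the rule violations for a proposed connection name (hypothetical helper)."""
    problems = []
    if not name:
        problems.append("name is required")
    if len(name) > MAX_NAME_LENGTH:
        problems.append("name exceeds 128 characters")
    if " " in name:
        problems.append("name contains spaces")
    bad = sorted(set(name) & FORBIDDEN_CHARS)
    if bad:
        problems.append("name contains special characters: " + " ".join(bad))
    # The name is not case sensitive, so uniqueness is compared in lower case.
    if name.lower() in {n.lower() for n in existing_names}:
        problems.append("name is not unique within the domain")
    return problems
```

For example, name_problems("HDFS_Target") returns an empty list, while name_problems("hdfs-target") reports the hyphen.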
Connection Type
Required. The connection type. Select Hadoop HDFS.
Description
The description of the connection. The description cannot exceed 4000 characters.
Owner
The owner of the connection. Default is the user who creates the connection. You can change the owner of the connection.
HDFS Connection URI
Required. The URI to access HDFS.
Use the following format to specify the NameNode URI in Cloudera and Hortonworks distributions:
hdfs://<namenode>:<port>
Where
  • <namenode> is the host name or IP address of the NameNode.
  • <port> is the port on which the NameNode listens for remote procedure calls (RPC).
Use one of the following formats to specify the NameNode URI in the MapR distribution:
  • maprfs:///
  • maprfs:///mapr/my.cluster.com/
Where my.cluster.com is the cluster name that you specify in the mapr-clusters.conf file.
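The URI formats above can be assembled programmatically. The following Python sketch is illustrative only. The function names namenode_uri and mapr_uri are assumptions made for this example, and the host name and port in the usage note are placeholders, not values taken from this guide.

```python
def namenode_uri(namenode, port):
    """Build hdfs://<namenode>:<port> for Cloudera and Hortonworks distributions."""
    if not namenode:
        raise ValueError("namenode host name or IP address is required")
    port = int(port)
    if not 1 <= port <= 65535:
        raise ValueError("port must be between 1 and 65535")
    return f"hdfs://{namenode}:{port}"


def mapr_uri(cluster_name=None):
    """Build a MapR URI, optionally scoped to a cluster from mapr-clusters.conf."""
    if cluster_name is None:
        return "maprfs:///"
    return f"maprfs:///mapr/{cluster_name}/"
```

For example, namenode_uri("namenode.example.com", 8020) produces hdfs://namenode.example.com:8020, and mapr_uri("my.cluster.com") produces maprfs:///mapr/my.cluster.com/.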
HIVE URL
Required. The URL to the Hive host.
Specify the URL in the following format:
jdbc:hive://<hostname>:<portnumber>/default
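The Hive URL format above can be built and checked with a short helper. The following Python sketch is illustrative only. The names hive_url and parse_hive_url are assumptions made for this example, and the host name and port in the usage note are placeholders.

```python
import re

# Matches jdbc:hive://<hostname>:<portnumber>/default as described above.
HIVE_URL_PATTERN = re.compile(r"^jdbc:hive://([^:/\s]+):(\d+)/default$")


def hive_url(hostname, portnumber):
    """Build the Hive URL from a host name and port number."""
    if not hostname:
        raise ValueError("hostname is required")
    return f"jdbc:hive://{hostname}:{int(portnumber)}/default"


def parse_hive_url(url):
    """Return (hostname, portnumber) or raise ValueError if the format differs."""
    match = HIVE_URL_PATTERN.match(url)
    if match is None:
        raise ValueError("expected jdbc:hive://<hostname>:<portnumber>/default")
    return match.group(1), int(match.group(2))
```

For example, hive_url("hivehost.example.com", 10000) produces jdbc:hive://hivehost.example.com:10000/default, and parse_hive_url recovers the host name and port from that string.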
HIVE Username
The Hive user name.
Use Parameter in Password
Uses a parameter in the password for the Hive user.
HIVE Password
The password for the Hive user.
Hadoop Distribution
Required. The name of the Hadoop distribution. You can choose one of the following options:
  • Cloudera CDH
  • Hortonworks HDP
  • MapR
  • Microsoft HDInsight
  • IBM BigInsights
  • Amazon EMR
Default is Cloudera CDH.
Directory
Required. The path to the HDFS directory.
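The properties in the table can be summarized as a simple record that flags missing required values. The following Python sketch is illustrative only and does not represent how Test Data Manager stores connections. The class name, attribute names, and the problems method are assumptions made for this example. The required properties, the default distribution, and the description length limit come from the table.

```python
from dataclasses import dataclass

DISTRIBUTIONS = (
    "Cloudera CDH", "Hortonworks HDP", "MapR",
    "Microsoft HDInsight", "IBM BigInsights", "Amazon EMR",
)
REQUIRED = (
    "name", "connection_type", "hdfs_connection_uri",
    "hive_url", "hadoop_distribution", "directory",
)


@dataclass
class HadoopHdfsConnection:
    """Hypothetical record that mirrors the connection properties in the table."""
    name: str
    hdfs_connection_uri: str
    hive_url: str
    directory: str
    connection_type: str = "Hadoop HDFS"
    hadoop_distribution: str = "Cloudera CDH"  # documented default
    description: str = ""
    owner: str = ""
    hive_username: str = ""
    hive_password: str = ""
    use_parameter_in_password: bool = False

    def problems(self):
        """Return a list of missing or invalid property values."""
        found = [f"{attr} is required" for attr in REQUIRED if not getattr(self, attr)]
        if self.hadoop_distribution and self.hadoop_distribution not in DISTRIBUTIONS:
            found.append("unknown Hadoop distribution")
        if len(self.description) > 4000:
            found.append("description exceeds 4000 characters")
        return found
```

A record that supplies the four required values without defaults and leaves the remaining properties unset reports no problems.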