Table of Contents

Search

  1. Preface
  2. Introduction to TDM Administration
  3. User and Role Administration
  4. Security Administration
  5. System Preferences
  6. TDM Server Administration
  7. Connections
  8. Dictionaries
  9. User-Defined Datatypes

HDFS Connections

HDFS Connections

You can add and use Hadoop Distributed File System (HDFS) connections.
Use an HDFS connection to access data in the Hadoop cluster. The HDFS connection is a file system type connection.
In the Administrator tool, you must create a cluster configuration object for the Hadoop clusters. Create and manage HDFS connections in Test Data Manager.
The following table describes HDFS connection properties:
Property
Description
Name
Required. Name of the connection. The name is not case sensitive and must be unique within the domain. The name cannot exceed 128 characters, contain spaces, or contain the following special characters:
~ ` ! $ % ^ & * ( ) - + = { [ } ] | \ : ; " ' < , > . ? /
Connection Type
Required. The connection type. Select HDFS.
Description
The description of the connection. The description cannot exceed 765 characters.
Owner
The owner of the connection. Default is the user who creates the connection. You can change the owner of the connection.
User Name
Required. User name to access HDFS.
NameNode URI
Required. The URI to access HDFS.
Use the following format to specify the NameNode URI in Cloudera and Hortonworks distributions:
hdfs://<namenode>:<port>
Where
  • <namenode> is the host name or IP address of the NameNode.
  • <port> is the port that the NameNode listens for remote procedure calls (RPC).
Use one of the following formats to specify the NameNode URI in MapR distribution:
-
  • maprfs:///
  • maprfs:///mapr/my.cluster.com/
Where my.cluster.com is the cluster name that you specify in the mapr-clusters.conf file.
Directory
Required. The path to the HDFS directory.
Cluster Configuration
The name of the cluster configuration associated with the Hadoop environment.