Creating an HBase Data Object

Create an HBase data object to specify an HBase resource.
  1. Select a project or folder in the Object Explorer view.
  2. Click File > New > Data Object.
  3. Select HBase Data Object and click Next.
    The New HBase Data Object dialog box appears.
  4. Enter a name for the data object.
  5. Click Browse next to the Location option and select the target project or folder.
  6. Click Browse next to the Connection option and select the connection from which you want to import the HBase resource.
  7. To add a resource to the data object, click Add next to the Resource option.
    If a default Metadata Access Service is not set, a message prompts you to configure one. Click OK and set a Metadata Access Service as the default. After you set a default Metadata Access Service, the Add Resource dialog box appears. If no Metadata Access Service exists, contact the Informatica administrator to create one.
  8. Navigate to or search for the resource to add to the data object and click OK.
    You can add one HBase table to the data object.
  9. Click Next.
    The Column Families dialog box appears.
  10. Select a column family and specify how to include its columns:
    • To add columns manually, or to search for columns and add them, select the Add Columns option.
    • To read or write all columns in the column family as a single binary column, select the Get all columns option.
  11. Add the columns to the column family. You can specify the columns directly or search for them:
    • If you know the column name and data type, select the column family to which you want to add the columns and click Add. Configure the add properties.
    • To search for columns in the column family and add them, click Search and Add. The Search and Add dialog box appears.
  12. Specify the following details in the Search and Add dialog box:
    1. Specify the maximum number of rows in the HBase table to include in the search.
    2. Specify the threshold value for the column occurrence percentage.
    3. Click Go.
      The results list each column name and the occurrence percentage of that column in the table.
    4. Select the columns that you want to add to the column family. Configure the add properties.
  13. Click Next.
    The Create Row dialog box appears.
  14. Select the Include Row ID option to generate a row ID for the HBase table.
  15. Specify the data type, precision, and scale for the row ID and click Next.
    The Review Columns dialog box appears. Each column name is prefixed with its column family name so that the name is unique. The default data type of the row ID is String.
  16. Review the columns in the column families and click Finish.
    The data object appears under Data Object in the project or folder in the Object Explorer view. You can also add resources to a data object after you create it.
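
The Search and Add step filters columns by how often they occur in a sample of rows. The following Python sketch illustrates that idea only. It is not how the Developer tool is implemented. The function name `column_occurrence`, the dictionary-based row representation, and the choice to treat the threshold as inclusive are all assumptions made for illustration.

```python
from collections import Counter


def column_occurrence(rows, max_rows, threshold_pct):
    """Return {column name: occurrence %} for columns at or above threshold_pct.

    rows is a hypothetical stand-in for the rows of one column family:
    an iterable of dicts that map a column name to its value.
    Only the first max_rows rows are examined.
    """
    sampled = 0
    counts = Counter()
    for row in rows:
        if sampled >= max_rows:
            break
        sampled += 1
        # Count each column once for every sampled row that contains it.
        counts.update(row.keys())

    if sampled == 0:
        return {}

    result = {}
    for column, count in counts.items():
        pct = 100.0 * count / sampled
        # Assumption: a column that exactly meets the threshold is kept.
        if pct >= threshold_pct:
            result[column] = pct
    return result


# Illustrative data only: four rows with sparsely populated columns.
sample_rows = [
    {"name": b"a", "email": b"x"},
    {"name": b"b"},
    {"name": b"c", "phone": b"1"},
    {"name": b"d", "email": b"y"},
]

result = column_occurrence(sample_rows, max_rows=4, threshold_pct=50)
print(result)
```

With this sample data, `name` occurs in every row and `email` in half of them, so both meet a 50 percent threshold, while `phone` occurs in one row of four and is excluded. A lower maximum row count makes the search faster but bases the percentages on fewer rows.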
