On the Source page of the publication wizard, select the type and location of the publication source from which Data Integration Hub picks up the files.
Select the source type from the
Source Type
list.
If you select the HDFS source type, the
HDFS Connection
field appears on the page.
If you select the HDFS source type, select a connection from the
HDFS Connection
list.
To use file transfer to publish files from remote servers, select
Use File Transfer
.
The
Connection
and
Delete Files from Source After Pickup
fields appear on the page.
If you chose to use file transfer, configure the following file transfer properties:
Connection
Select a connection from the list.
Delete Files from Source After Pickup
Data Integration Hub
deletes the files after it reads them. You must have permissions on the remote server to delete the files.
Enter the location of the files that contain the source data in the
Directory
field.
If you use file transfer, you cannot use patterns to define the directory path. You can, however, use patterns to define the file name.
Click
Configure
.
The
Configure Flat File Source
dialog box appears.
In the
Configure Flat File Source
dialog box, configure the following properties, and then click
OK
:
Logical Name
Descriptive name of the source. The name identifies the source when you configure field mapping for the publication.
File Name
Name of the file that
Data Integration Hub
picks up and publishes to the publication repository.
The file name can include asterisk wildcards and variables. Enter variables in the following format:
($pattern)
. For example:
Input_($PublicationName)_($sequence).in
.
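The following Python sketch illustrates how a file name pattern with asterisk wildcards and ($variable) placeholders could be matched against incoming file names. It is not the Data Integration Hub implementation. The helper names are hypothetical, and the sketch assumes that each variable matches any non-empty run of characters, which this guide does not specify.

```python
import re

# Hypothetical helpers for illustration only; not part of Data Integration Hub.
# A variable is written as ($name) in the file name pattern.
VARIABLE = re.compile(r"\(\$([A-Za-z_]\w*)\)")


def _literal(text):
    # Escape literal text, but let each asterisk match any run of characters.
    return ".*".join(re.escape(chunk) for chunk in text.split("*"))


def pattern_to_regex(pattern):
    """Translate a file name pattern into a compiled regular expression.

    Assumes each variable name appears at most once in the pattern;
    a repeated name would raise re.error.
    """
    parts = []
    pos = 0
    for match in VARIABLE.finditer(pattern):
        parts.append(_literal(pattern[pos:match.start()]))
        # Assumption: a variable matches one or more characters.
        parts.append(f"(?P<{match.group(1)}>.+?)")
        pos = match.end()
    parts.append(_literal(pattern[pos:]))
    return re.compile("".join(parts))


def match_file_name(pattern, file_name):
    """Return the captured variable values, or None if the name does not match."""
    match = pattern_to_regex(pattern).fullmatch(file_name)
    return match.groupdict() if match else None


if __name__ == "__main__":
    print(match_file_name("Input_($PublicationName)_($sequence).in",
                          "Input_Orders_0001.in"))
```

A pattern that contains only wildcards returns an empty dictionary on a match, so callers should compare the result with None rather than test its truth value.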
Configure source based on
Select one of the following options:
Topic Table. Select the topic table that represents the structure of the flat file.
Sample file. Browse to select and upload a sample file that represents the structure of the flat file.
File Format
Expand the
File Format
area to view and edit the format of the file. Changing the file format might affect the column structure.
The
File Format
area can include the following properties:
Code page
Character encoding used in the file.
Number of initial rows to skip
Number of rows at the beginning of the file that
Data Integration Hub
ignores when it reads the file.
Data Integration Hub
publishes only the subsequent rows.
Import column names
Optional. Select
Yes
to use the column names in the file as the default column headers in the table. Enter the number of the row that serves as the file's header row in
From row
.
Delimiter
Delimiter used in the file to separate columns. Select a predefined delimiter or select
Custom
to define a custom delimiter.
For information about the supported column delimiters, see the sections about importing delimited flat files and about updating delimited file properties in the
PowerCenter Designer Guide
.
Text qualifier
Optional. Symbols used in the file to enclose a string.
Thousands Separator
Optional. Symbol used in the file as a thousands separator.
Decimal Separator
Symbol used in the file as a decimal separator.
Datetime Format
Date and time format used in the file. Select a predefined format or select
Other
to define a custom format. For information about the supported datetime formats, see the
PowerCenter Transformation Language Reference
.
The datetime format can contain up to 50 characters.
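As a rough illustration of how the file format properties interact, the following Python sketch reads delimited text with a configurable number of skipped rows, delimiter, and text qualifier, and converts a numeric string by using the thousands and decimal separators. The FileFormat class and the function names are hypothetical. The sketch relies on the Python csv module and does not reproduce how Data Integration Hub parses files.

```python
import csv
import io
from dataclasses import dataclass
from decimal import Decimal


@dataclass
class FileFormat:
    # Hypothetical container that mirrors some of the File Format properties.
    skip_rows: int = 0               # Number of initial rows to skip
    delimiter: str = ","             # Delimiter
    text_qualifier: str = '"'        # Text qualifier (empty string if none)
    thousands_separator: str = ","   # Thousands Separator (empty string if none)
    decimal_separator: str = "."     # Decimal Separator


def read_rows(text, fmt):
    """Split the text into rows of column values and drop the skipped rows."""
    buffer = io.StringIO(text, newline="")
    if fmt.text_qualifier:
        reader = csv.reader(buffer, delimiter=fmt.delimiter,
                            quotechar=fmt.text_qualifier)
    else:
        # No text qualifier: treat every character literally.
        reader = csv.reader(buffer, delimiter=fmt.delimiter,
                            quoting=csv.QUOTE_NONE)
    return list(reader)[fmt.skip_rows:]


def parse_number(value, fmt):
    """Convert a numeric string to Decimal by normalizing its separators."""
    plain = value
    if fmt.thousands_separator:
        plain = plain.replace(fmt.thousands_separator, "")
    plain = plain.replace(fmt.decimal_separator, ".")
    return Decimal(plain)


if __name__ == "__main__":
    fmt = FileFormat(skip_rows=2, delimiter=";", text_qualifier="'",
                     thousands_separator=".", decimal_separator=",")
    sample = "exported 2024\nid;name;amount\n1;'Smith; J';1.234,56\n"
    for row in read_rows(sample, fmt):
        print(row[0], row[1], parse_number(row[2], fmt))
```

In the sample, the text qualifier keeps the semicolon inside the name from being read as a column delimiter, and the first two rows are ignored.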
Column Structure
The column structure depends on the file format settings that you configure in the
File Format
area. The table must contain at least one column.
If you provide a sample file,
Data Integration Hub
reads the file according to the file format settings and presents the columns that it detects in the sample file. Use the
Add Column
button, the up and down arrows, and the remove icon to add, order, and remove table columns.
Click the edit icon to the right of a column to edit column parameters.
Each column must contain the following parameters:
Column Name
Must begin with an alphabetic character or an underscore and can contain only alphanumeric characters and underscores.
If you select the option
Import column names
,
Data Integration Hub
populates the column names with the strings in the row that you defined as the header row. If you do not select the option
Import column names
,
Data Integration Hub
assigns default names to the columns. For example, Field1, Field2, ..., Fieldn.
Data Type
Select from the list of available data types. By default,
Data Integration Hub
reads the data as string.
Precision
Enabled only for data types that support precision.
Scale
Enabled only for data types that support data scaling.
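The column naming rules above can be sketched in Python as follows. The function names are hypothetical, and the sketch assumes that "alphabetic" and "alphanumeric" refer to ASCII letters and digits, which this guide does not state explicitly.

```python
import re

# Assumption: only ASCII letters, digits, and underscores are allowed.
NAME_RULE = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def is_valid_column_name(name):
    """Check that a name starts with a letter or underscore and contains
    only letters, digits, and underscores."""
    return NAME_RULE.fullmatch(name) is not None


def default_column_names(count):
    """Build default names in the form Field1, Field2, and so on."""
    return [f"Field{i}" for i in range(1, count + 1)]


def column_names(column_count, header_row=None):
    """Use the header row strings when provided, else the default names."""
    if header_row is None:
        return default_column_names(column_count)
    return list(header_row)


if __name__ == "__main__":
    print(column_names(3))
    print([is_valid_column_name(n) for n in ("_id", "2nd", "order-id")])
```

A check of this kind is useful before you upload a sample file, because header strings that contain spaces or hyphens do not satisfy the naming rule.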
Sample File Preview
If you select a sample file that represents the structure of the flat file, this area shows the data in the sample file. The
Preview
area presents the data in the structure that
Data Integration Hub
applies when it distributes the data into topic fields in the topic table, according to the
File Format
parameters.
The
Configure Flat File Source
dialog box closes.
Repeat steps 6 and 7 for each topic table that appears on the