Data Vault Administrator Guide

ssapart

Repartitions data files in the Data Vault.
The ssapart command uses the following syntax:
ssapart -t <schema_name.table_name> -c <column_name> [-n <number_of_rows>] [-m <number_of_parallel_processes>] [-r <temporary_folder_location>] [-a <schema_name.new_table_name>] [-d (treat integer as date)] [-h (displays help screen)] <connection> <user>/<password>
You can specify the flags in any order.
If you repartition data files in an indexed table, you must renew the table index.
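As a rough illustration of how the syntax above maps to an argument list, the following Python sketch assembles an ssapart invocation without running it. The function name `build_ssapart_args` and all sample values (`sales.orders`, `order_date`, `dv_conn`, `admin`, `secret`) are hypothetical. The sketch follows the syntax line exactly, so it does not emit the `database` argument that the argument table lists but the syntax line omits.

```python
from typing import List, Optional


def build_ssapart_args(
    table: str,
    column: str,
    connection: str,
    user: str,
    password: str,
    rows_per_file: Optional[int] = None,
    parallel: Optional[int] = None,
    temp_dir: Optional[str] = None,
    new_table: Optional[str] = None,
    integer_as_date: bool = False,
) -> List[str]:
    """Assemble an ssapart argument list from the documented flags.

    Hypothetical helper: it only builds the list and does not run the command.
    """
    # -t and -c are the two required flags.
    args = ["ssapart", "-t", table, "-c", column]
    # Optional flags are added only when a value is supplied.
    if rows_per_file is not None:
        args += ["-n", str(rows_per_file)]
    if parallel is not None:
        args += ["-m", str(parallel)]
    if temp_dir is not None:
        args += ["-r", temp_dir]
    if new_table is not None:
        args += ["-a", new_table]
    if integer_as_date:
        args.append("-d")  # -d takes no argument
    # Positional arguments as shown in the syntax line.
    args += [connection, f"{user}/{password}"]
    return args


if __name__ == "__main__":
    print(" ".join(build_ssapart_args(
        "sales.orders", "order_date", "dv_conn", "admin", "secret",
        rows_per_file=5000, integer_as_date=True,
    )))
```

Because the flags can appear in any order, the order chosen here is only a convention of the sketch.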
The following table describes the ssapart command arguments:
| Option | Argument | Description |
| --- | --- | --- |
| -t | schema_name.table_name | Required. Name of the schema and table that you want to repartition. |
| -c | column_name | Required. Name of the column that you want to use as the partition key to repartition the data files. You can use only one column as the partition key. If you enter multiple columns, the command fails. |
| -n | number_of_rows | Optional. Number of rows in each repartitioned data file. Configure the same row count as the original data files to create repartitioned data files with the same number of rows. You can use the file report to view the row count of the original data files. Configure a smaller row count to increase the number of repartitioned data files. Configure a larger row count to decrease the number of repartitioned data files. Default is 10,000. |
| -m | number_of_parallel_processes | Optional. Number of threads that run in parallel to repartition the data files. Increase the number of threads to improve repartitioning performance. Default is 1. |
| -r | temporary_folder_location | Optional. Directory where the Data Vault creates temporary files and log files. The directory must have free space of up to twice the size of the archived table. Use the table report to view the archived table size. The Data Vault deletes the temporary files when it completes the data repartitioning. The log files remain in the directory. Default is the current working directory. |
| -a | schema_name.new_table_name | Required only if you want the Data Vault to create a new table in which to register the repartitioned data files. Name of the table that the Data Vault creates. If you do not use the -a option, the Data Vault registers the repartitioned data files to the original archived table. |
| -d | - | Required if the partition key column that you specify for the -c option stores dates in an integer datatype. Processes the partition key column value as a date datatype. The Data Vault converts the value from numeric to date only when it creates the temporary files. It does not convert the value to the date datatype in the database. For example, the partition key column contains the value 11302006. If you specify the -d option, the Data Vault temporarily converts the value to 11/30/2006. Default is no conversion. The Data Vault uses the datatype of the partition key column. |
| -h | - | Optional. Displays onscreen help. The help shows the syntax of the command and the possible flags. |
| - | connection | Required. Data Vault connection name. |
| - | database | Required. Name of the database in the Data Vault. The database corresponds to the Data Archive archive folder. |
| - | user/password | Required. Name and password of the user account that connects to the Data Vault. |
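The -n, -r, and -d descriptions above involve some simple arithmetic that may help when planning a run. The Python sketch below is illustrative only and is not part of the Data Vault. All function names are hypothetical. The file-count estimate assumes every file is filled to the configured row count, whereas the actual count depends on how the Data Vault splits data on the partition key. The MMDDYYYY layout for integer dates is inferred solely from the 11302006 example.

```python
import math
from datetime import date

DEFAULT_ROWS_PER_FILE = 10_000  # documented default for -n


def estimate_file_count(total_rows: int,
                        rows_per_file: int = DEFAULT_ROWS_PER_FILE) -> int:
    """Rough number of repartitioned files, assuming each file is filled
    to rows_per_file (an assumption; partition boundaries may add files)."""
    if total_rows < 0 or rows_per_file <= 0:
        raise ValueError("total_rows must be >= 0 and rows_per_file > 0")
    return math.ceil(total_rows / rows_per_file)


def max_temp_space_bytes(table_size_bytes: int) -> int:
    """Upper bound on free space needed in the -r directory:
    up to twice the size of the archived table."""
    if table_size_bytes < 0:
        raise ValueError("table_size_bytes must be >= 0")
    return 2 * table_size_bytes


def integer_to_date(value: int) -> date:
    """Interpret an integer as MMDDYYYY (layout assumed from the
    documented example 11302006 -> 11/30/2006)."""
    text = f"{value:08d}"  # restore a leading zero for months 1 to 9
    month, day, year = int(text[:2]), int(text[2:4]), int(text[4:])
    return date(year, month, day)


if __name__ == "__main__":
    print(estimate_file_count(25_000))                      # 3
    print(max_temp_space_bytes(5 * 1024**3))                # 10737418240
    print(integer_to_date(11302006).strftime("%m/%d/%Y"))   # 11/30/2006
```

A smaller value for `rows_per_file` raises the estimate and a larger value lowers it, which matches the behavior described for the -n option.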
