Data Ingestion and Replication
Property
| Description
|
---|---|
Task Name
| Enter a name to identify the application ingestion and replication task if you do not want to use the generated name. A descriptive name makes the task easier to find later.
Task names can contain Latin alphanumeric characters, spaces, periods (.), commas (,), underscores (_), plus signs (+), and hyphens (-). Task names cannot include other special characters. Task names are not case sensitive. Maximum length is 50 characters.
If you include spaces in the task name, after you deploy the task, the spaces do not appear in the corresponding job name.
|
Location
| The project or project\folder in Explore that will contain the task definition. If you do not specify a project, the "Default" project is used.
|
Runtime Environment
| Select the runtime environment that you want to use to run the task. By default, the runtime environment that you initially entered when you began defining the task is displayed. You can use this runtime environment or select another one.
To refresh the list of runtime environments, click Refresh.
The runtime environment can be a Secure Agent group that consists of one or more Secure Agents. A Secure Agent is a lightweight program that runs tasks and enables secure communication.
Alternatively, for application ingestion and replication initial load jobs that have selected source types, you can use a serverless runtime environment hosted on Microsoft Azure. You cannot choose a serverless runtime environment if a local runtime environment was previously selected.
The Cloud Hosted Agent is not supported.
Select Set as default to use the specified runtime environment as your default environment for all tasks you create. Otherwise, leave this check box cleared. |
Description
| Optionally, enter a description you want to use for the task.
Maximum length is 4,000 characters.
|
Schedule
| If you want to run an initial load task based on a schedule instead of manually starting it, select Run this task based on a schedule. Then select a schedule that was previously defined in Administrator.
The default option is Do not run this task based on a schedule.
This field is not available for incremental load and combined initial and incremental load tasks.
To view and edit the schedule options, go to Administrator. If you edit the schedule, the changes apply to all jobs that use the schedule. If you edit the schedule after deploying the task, you do not need to redeploy the task.
If the schedule criteria for running the job are met but the previous job run is still active, Application Ingestion and Replication skips the new job run. |
Execute in Taskflow | Select this check box to make the task available in Data Integration to add to a taskflow as an event source. You can then include transformations in the taskflow to transform the ingested data. Available for initial load and incremental load tasks with Snowflake targets that don't use the Superpipe option. |
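
The Task Name rules in the table above can be checked before a name is entered. The following is a minimal sketch, not part of the product: the function names are hypothetical, "Latin alphanumeric characters" is assumed to mean ASCII letters and digits, and an empty name is treated as invalid here even though the product falls back to a generated name.

```python
import re

# Allowed characters per the Task Name row: alphanumerics, spaces, periods,
# commas, underscores, plus signs, and hyphens. Maximum length is 50.
# Assumption: "Latin alphanumeric" is interpreted as ASCII A-Z, a-z, 0-9.
_TASK_NAME_RE = re.compile(r"[A-Za-z0-9 .,_+\-]{1,50}")


def is_valid_task_name(name: str) -> bool:
    """Return True if the name uses only allowed characters and is 1-50 long."""
    return _TASK_NAME_RE.fullmatch(name) is not None


def same_task_name(a: str, b: str) -> bool:
    """Task names are not case sensitive, so compare them case-insensitively."""
    return a.casefold() == b.casefold()
```

For example, `is_valid_task_name("Sales load_v1.2")` passes, while a name that contains a slash or exceeds 50 characters does not.
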
Option | Description |
---|---|
Apply Cycle Interval | Specifies the amount of time that must elapse before an application ingestion and replication job ends an apply cycle. You can specify days, hours, minutes, and seconds, or specify values for a subset of these time fields and leave the other fields blank. The default value is 15 minutes. |
Apply Cycle Change Limit | Specifies the number of records that must be processed before an application ingestion and replication job ends an apply cycle. When this record limit is reached, the ingestion job ends the apply cycle and writes the change data to the target. The default value is 10000 records. During startup, jobs might reach this limit more frequently than the apply cycle interval if they need to catch up on processing a backlog of older data. |
Low Activity Flush Interval | Specifies the amount of time, in hours, minutes, or both, that must elapse during a period of no change activity on the source before an application ingestion and replication job ends an apply cycle. When this time limit is reached, the ingestion job ends the apply cycle and writes the change data to the target. If you do not specify a value for this option, an application ingestion and replication job ends apply cycles only after either the Apply Cycle Change Limit or Apply Cycle Interval limit is reached. No default value is provided. |
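
The way the three apply cycle options in the table above interact can be sketched as a single decision function. This is an illustrative model only, assuming the limits are evaluated independently and that reaching any one of them ends the cycle. The class and function names are hypothetical and do not correspond to a product API.

```python
from dataclasses import dataclass
from datetime import timedelta
from typing import Optional


@dataclass
class ApplyCycleLimits:
    # Documented defaults: 15 minutes and 10000 records.
    interval: timedelta = timedelta(minutes=15)
    change_limit: int = 10000
    # No default is provided for the low activity flush interval.
    low_activity_flush: Optional[timedelta] = None


def should_end_apply_cycle(
    limits: ApplyCycleLimits,
    elapsed: timedelta,   # time since the apply cycle started
    records: int,         # records processed in this cycle
    idle: timedelta,      # time with no change activity on the source
) -> bool:
    """Return True when any configured limit has been reached."""
    if elapsed >= limits.interval:
        return True
    if records >= limits.change_limit:
        return True
    # Only considered when a low activity flush interval is specified.
    if limits.low_activity_flush is not None and idle >= limits.low_activity_flush:
        return True
    return False
```

With the default limits, a job that has processed 10000 records after five minutes ends the cycle early, which matches the note that jobs catching up on a backlog can reach the change limit before the interval elapses.
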
Option | Description |
---|---|
Checkpoint All Rows | Indicates whether an application ingestion and replication job performs checkpoint processing for every message that is sent to the Kafka target. If this check box is selected, the Checkpoint Every Commit, Checkpoint Row Count, and Checkpoint Frequency (secs) options are ignored.
|
Checkpoint Every Commit | Indicates whether an application ingestion and replication job performs checkpoint processing for every commit that occurs on the source. |
Checkpoint Row Count | Specifies the maximum number of messages that an application ingestion and replication job sends to the target before adding a checkpoint. If you set this option to 0, the job does not perform checkpoint processing based on the number of messages. If you set this option to 1, the job adds a checkpoint for each message. |
Checkpoint Frequency (secs) | Specifies the maximum number of seconds that must elapse before an application ingestion and replication job adds a checkpoint. If you set this option to 0, the job does not perform checkpoint processing based on elapsed time. |
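
The precedence among the checkpoint options in the table above can be illustrated with a short sketch. It assumes the options combine as described: Checkpoint All Rows overrides the other three, and a value of 0 disables the corresponding trigger. The names are hypothetical, and the field defaults are placeholders rather than documented product defaults.

```python
from dataclasses import dataclass


@dataclass
class CheckpointOptions:
    # Placeholder defaults for illustration; not documented product defaults.
    all_rows: bool = False
    every_commit: bool = False
    row_count: int = 0        # 0 disables message-count checkpoints
    frequency_secs: int = 0   # 0 disables time-based checkpoints


def should_checkpoint(
    opts: CheckpointOptions,
    messages_since_checkpoint: int,
    seconds_since_checkpoint: float,
    at_source_commit: bool,
) -> bool:
    """Decide whether to add a checkpoint after sending a message."""
    if opts.all_rows:
        # Checkpoint every message; the other three options are ignored.
        return True
    if opts.every_commit and at_source_commit:
        return True
    if opts.row_count > 0 and messages_since_checkpoint >= opts.row_count:
        return True
    if opts.frequency_secs > 0 and seconds_since_checkpoint >= opts.frequency_secs:
        return True
    return False
```

Setting `row_count=1` in this model produces a checkpoint for each message, the same outcome as selecting Checkpoint All Rows, except that the commit and time-based triggers remain in effect.
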
Option
| Description
|
---|---|
Ignore
| Do not replicate DDL changes that occur on the source database to the target. |
Replicate
| Allow the application ingestion and replication job to replicate the DDL changes to the target.
The types of supported DDL operations are:
Application ingestion and replication jobs do not support modifying or renaming columns for Google BigQuery targets, or adding columns for Oracle targets. |
Stop Job
| Stop the application ingestion and replication job. |
Stop Table
| Stop processing the source object on which the DDL change occurred. When one or more objects are excluded from replication because of the Stop Object schema drift option, the status of the job changes to Running with Warning.
The application ingestion and replication job cannot retrieve the data changes that occurred on the source object after the job stops processing the changes. This action leads to data loss on the target. To avoid data loss, you must re-synchronize the source and target objects that the job stopped processing before you resume the application ingestion and replication job.
|