Table of Contents

Search

  1. Preface
  2. Data Profiling
  3. Profiles
  4. Profile results
  5. Tuning data profiling task performance
  6. Troubleshooting

Data Profiling

Data Profiling

Source Mapplets

Source Mapplets

You can run a profiling task on the output from a source mapplet that you create in
Data Integration
. A source mapplet is a mapplet that has a source connection and a single output. You can also run a profiling task on a mapplet as a source object.
Before you use the mapplet as a source connection, make sure that source connections used in a mapplet are associated with an active Secure Agent.
The following image shows a source mapplet that has multiple source connections, transformations, and a mapplet output:  The mapplet contains an Oracle source, SQL source, and real-time source. Multiple transformations read the sources and write the transformed data to a single output.

Supported connections for source mapplets

You can run a profile on source mapplets that use the following connections:
  • Amazon Athena
  • Amazon Redshift
  • Amazon S3
  • Azure Data Lake Store
  • Databricks
  • Flat File
  • Google BigQuery
  • Google Cloud Storage
  • JDBC V2
  • Microsoft Azure Synapse
  • Microsoft Fabric OneLake
  • Microsoft SQL Server
  • Oracle
  • ODBC
  • Oracle Object Source
  • PostgreSQL
  • Salesforce
  • SAP BW
  • SAP Table
  • Snowflake

Supported transformations for source mapplets

You can run a profile on source mapplets that use the following transformations:
  • Aggregator
  • Expression
  • Filter
  • Joiner
  • Sorter
  • Union

Supported profiling capabilities

  • Simple dynamic filter
  • Drilldown
  • Export profiling results
  • Compare columns
  • Compare profile runs
  • Sampling all rows
  • Associate rules with profiles
  • Run queries on profiling results
For more information about creating mapplets, see
Components
in the
Data Integration
help.

0 COMMENTS

We’d like to hear from you!