Table of Contents

  1. Preface
  2. Introduction to PowerExchange for Microsoft Azure Blob Storage
  3. PowerExchange for Microsoft Azure Blob Storage Configuration
  4. Microsoft Azure Blob Storage Connections
  5. Microsoft Azure Blob Storage Data Objects
  6. Microsoft Azure Blob Storage Mappings
  7. Data Type Reference

PowerExchange for Microsoft Azure Blob Storage User Guide

Prerequisites

Before you use PowerExchange for Microsoft Azure Blob Storage, perform the following tasks:
  • Verify that the Hadoop Distribution Directory property in the developerCore.ini file is set based on the Hadoop distribution that you use.
  • To run a mapping that processes complex files, configure the INFA_PARSER_HOME environment variable for the Data Integration Service in Informatica Administrator. Set the value of the environment variable to the absolute path of the Hadoop distribution directory on the machine that runs the Data Integration Service, as shown in the sketch after the following note.
Note: When you import a data object in a mapping, do not use the MapR distribution.
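For example, a minimal sketch of the environment variable value, assuming a hypothetical Cloudera distribution directory under the Informatica installation (the actual path depends on your installation and on the Hadoop distribution that you use):

INFA_PARSER_HOME=/opt/informatica/services/shared/hadoop/cloudera_6_1

Set the variable in the environment variable properties of the Data Integration Service in Informatica Administrator.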

Configure Databricks Connection Advanced Properties

Verify that a Databricks connection is created in the domain. To read NULL values from an Azure Blob Storage source or write NULL values to an Azure Blob Storage target, configure the following advanced properties in the Databricks connection:
  • infaspark.flatfile.reader.nullValue=True
  • infaspark.flatfile.writer.nullValue=True
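If you prefer the command line, the following is a minimal sketch that updates an existing connection with infacmd isp UpdateConnection. The domain, user, and connection names are placeholders, and advancedProperties is a hypothetical option name used here for illustration; verify the actual option name for Databricks connections against the infacmd command reference for your version.

infacmd.sh isp UpdateConnection -dn MyDomain -un Administrator -pd <password> -cn my_databricks_connection -o "advancedProperties='infaspark.flatfile.reader.nullValue=True;infaspark.flatfile.writer.nullValue=True'"

You can also enter the two properties directly in the advanced properties of the Databricks connection in the Administrator tool or the Developer tool.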

Configure Azure Blob Storage Access in Azure Databricks Cluster

Verify that a cluster configuration is created in the domain. To access Azure Blob Storage, set your Azure Blob Storage account name and account key under Spark Config in your Databricks cluster configuration. Add "spark.hadoop" as a prefix to the Hadoop configuration key, as shown in the following text:
spark.hadoop.fs.azure.account.key.<your-storage-account-name>.blob.core.windows.net <your-storage-account-access-key>
If you use multiple Azure Blob Storage accounts, configure the account name and account key for each account.
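For example, a sketch for two hypothetical storage accounts named salesdata and auditlogs (the account names and keys are placeholders):

spark.hadoop.fs.azure.account.key.salesdata.blob.core.windows.net <salesdata-account-access-key>
spark.hadoop.fs.azure.account.key.auditlogs.blob.core.windows.net <auditlogs-account-access-key>

Enter each entry on its own line in the Spark Config section of the Databricks cluster settings.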

Configure Azure Blob Storage SAS Access in Azure Databricks Cluster

Verify that a cluster configuration is created in the domain. To access Azure Blob Storage, set your Azure Blob Storage account name and SAS token under Spark Config in your Databricks cluster configuration. Add "spark.hadoop" as a prefix to the Hadoop configuration key, as shown in the following text:
spark.hadoop.fs.azure.sas.<container-name>.<storage-account-name>.blob.core.windows.net <sas-token-for-your-blob-account>
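For example, a sketch that grants SAS access to a hypothetical container named landing in a storage account named salesdata (the token is a placeholder):

spark.hadoop.fs.azure.sas.landing.salesdata.blob.core.windows.net <sas-token-for-the-landing-container>

Because the configuration key is scoped to a single container, add one entry for each container that your mappings access.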

Configure Access to Secure Transfer-Enabled Storage Accounts

Verify that the Secure transfer required option on the Configuration tab of your Azure Blob Storage account is enabled. In addition, set the following custom property for the Data Integration Service:
SecureTransferRequired=True
After you configure the custom property, restart the Data Integration Service.
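A sketch of recycling the service from the command line with infacmd, using placeholder domain, user, and service names; verify the exact options against the infacmd command reference for your version:

infacmd.sh isp DisableService -dn MyDomain -un Administrator -pd <password> -sn MyDataIntegrationService
infacmd.sh isp EnableService -dn MyDomain -un Administrator -pd <password> -sn MyDataIntegrationService

You can also recycle the Data Integration Service from the Administrator tool.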
