The Microsoft Azure marketplace solution, when you deploy on a virtual network, creates and connects the following resources in the network:
Informatica domain server on a virtual machine, with
one additional high availability virtual machine
.
Informatica clients on a bastion server.
Microsoft SQL Server for the repositories in the Informatica domain.
Databricks cluster or Databricks workspace.
The following image shows the architecture of the
Data Engineering Integration
on Microsoft Azure:
The numbers in the architecture diagram correspond to items in the following list:
A resource group on the Azure platform.
A virtual network that includes a subnet.
A subnet to contain specific elements of the deployment.
A network security group that includes the
Data Engineering Integration
deployment.
The Informatica services on Azure Virtual Machine.
Microsoft SQL Server database instance to act as Informatica domain repositories:
Domain configuration repository
Model repository
Informatica services for high availability on Azure Virtual Machine.
Bastion server, if you choose to deploy one.
CIDR IP address range that you use to access the Informatica services URL and virtual machines.
After the completion of the automated deployment, you can create a mapping that connects to data sources and targets in existing Azure storage. To do this, manually create the necessary connections to Azure resources. Then you can create mappings and run them on the Databricks runtime engine. For more information about creating connections and configuring access to Azure storage resources, see the
Connections Reference.