Table of Contents

Search

  1. Abstract
  2. Supported Versions
  3. Integrating Data Engineering Integration on the AWS Platform with Databricks and Delta Lake

Integrating Data Engineering Integration on the AWS Platform with Databricks and Delta Lake

Integrating Data Engineering Integration on the AWS Platform with Databricks and Delta Lake

Manually Creating a JDBC Connection to Databricks Resources

Manually Creating a JDBC Connection to Databricks Resources

To enable the domain to access Databricks resources in the cloud platform environment, you must manually create and configure a JDBC connection.
You can use the Informatica Administrator or Developer tools to create a connection. The following steps tell how to create a JDBC connection using the Administrator tool.
  1. In the Administrator tool, click the
    Connections
    tab.
  2. Select the domain, click the Actions menu, and select
    New
    Connection
    .
    The following image shows the New Connection control:
    The
    New Connection
    dialog box opens.
  3. Expand the Databases section, select
    JDBC
    , and click
    OK
    .
    The following image shows the New Connection dialog box:
    The
    Configure New Connection
    dialog box opens.
  4. Configure the following connection properties:
    Property
    Description
    Name
    Type a name for the connection. Example: DatabricksCxn
    ID
    Default: Automatically populated with the connection name. Changing this default value is optional.
    Username
    Type the following value to use the Databricks user token:
    token
    Password
    Value of the token that you configured for the Databricks user. See Prepare a Databricks User for Token Authentication
    JDBC Driver Class Name
    Type the following value:
    com.simba.spark.jdbc4.Driver
    Connection String
    Connection to the Databricks resource. This connection string contains all the information that the domain needs to connect to the resource.
    The connection string contains the following elements:
    • jdbc:spark://<server host name>.
    • Port number.
    • Transport mode.
    • ssl. Use
      1
      to enable SSL.
    • httpPath.
    • UID. User ID that will be used to run jobs on the cluster. Use
      token
      .
    • PWD. Value of the token that you configured for the Databricks user. See Prepare a Databricks User for Token Authentication.
    Example:
    jdbc:spark://abc-a1b2c3d4-cf1b.cloud.databricks.com:443/default;transportMode=http;ssl=1;httpPath=sql/protocolv1/o/5275227018426481/1234-567890-blued456;AuthMech=3;UID=token;PWD=<personal-access-token>
    To get the value of these parameters from the Advanced Options area of the cluster configuration settings:
    1. In the Databricks environment, select Clusters.
    2. Select the cluster to connect to.
    3. Expand the Advanced Options and click the
      JDBC/ODBC
      tab.
    For more information about the JDBC connection string for Databricks, see the Databricks documentation.
  5. Click
    Finish
    .

0 COMMENTS

We’d like to hear from you!