Developer Workflow Guide

General Options

The following table describes the general options that you can set for a Databricks cluster:
Property
    Description

Cluster Name
    Name of the cluster to create.

Databricks Runtime Version
    The Databricks version to run on the cluster. Default is the latest supported version.
    To manually enter a version, select (Assign to Task Input). The Developer tool creates a Create Cluster Task Input called Databricks Runtime Version. Set the value of the input to the Databricks version tag string. For example, set the value to 5.5.x-scala2.11 to use Databricks version 5.5.

Python Version
    The Python version to run if you include a Python transformation. Default is the latest supported version.

Driver Type
    The type of node that you want to use for the driver node. Default is the worker type ID.

Worker Type
    The type of node that you want to use for the worker node.

Workers
    The number of worker nodes to create for the cluster. If you configure the cluster to scale automatically, this property is ignored. Default is 1.

Autoscale
    Automatically scales the number of worker nodes based on workload.

Min Workers
    The minimum number of worker nodes to use when the cluster is configured to scale automatically. Default is 0.

Max Workers
    The maximum number of worker nodes to use when the cluster is configured to scale automatically. Default is 1.
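
The way these options interact can be easier to follow in code. The following Python sketch models the general options and their documented defaults. It is illustrative only: the class name, field names, method names, and the sample cluster name and node type are all hypothetical and are not part of the Developer tool or any Databricks API. It assumes that an unset driver type falls back to the worker type, and that the Workers value applies only when Autoscale is off.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class ClusterGeneralOptions:
    """Hypothetical container for the general options described above."""
    cluster_name: str
    worker_type: str
    runtime_version: Optional[str] = None  # None stands for "latest supported version"
    python_version: Optional[str] = None   # None stands for "latest supported version"
    driver_type: Optional[str] = None      # None stands for "use the worker type ID"
    workers: int = 1                       # ignored when autoscale is True
    autoscale: bool = False
    min_workers: int = 0                   # used only when autoscale is True
    max_workers: int = 1                   # used only when autoscale is True

    def effective_driver_type(self) -> str:
        # Driver Type defaults to the worker type ID.
        return self.driver_type or self.worker_type

    def worker_range(self) -> Tuple[int, int]:
        # With Autoscale on, Workers is ignored and Min/Max Workers apply.
        if self.autoscale:
            return (self.min_workers, self.max_workers)
        return (self.workers, self.workers)


# Example values are placeholders, apart from the version tag from the text.
opts = ClusterGeneralOptions(
    cluster_name="example-cluster",
    worker_type="example-node-type",
    runtime_version="5.5.x-scala2.11",
    workers=4,
    autoscale=True,
    min_workers=2,
    max_workers=8,
)
print(opts.worker_range())           # (2, 8): Workers=4 is ignored
print(opts.effective_driver_type())  # example-node-type
```

In this sketch, setting `autoscale=False` on the same object would make `worker_range()` return `(4, 4)`, which mirrors the rule that Workers only takes effect for a fixed-size cluster.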
