wb resource create dataproc-cluster

Name

wb-resource-create-dataproc-cluster - Add a controlled GCP Dataproc cluster resource with Jupyter. For a detailed explanation of parameters, see https://cloud.google.com/dataproc/docs/reference/rest/v1/projects.regions.clusters#Cluster

Synopsis

wb resource create dataproc-cluster [--quiet] [--autoscaling-policy=<autoscalingPolicy>] [--bucket=<configBucket>] [--cluster-id=<clusterId>] [--description=<description>] [--format=<format>] [--idle-delete-ttl=<idleDeleteTtl>] [--image-version=<imageVersion>] [--region=<region>] [--software-framework=<softwareFrameworkType>] [--temp-bucket=<tempBucket>] [--workspace=<id>] [--components=<components>[,<components>...]]... [--initialization-actions=<initializationActions>[,<initializationActions>...]]... [-M=<String=String>[,<String=String>...]]... [--properties=<String=String>[,<String=String>...]]... (--id=<id>) [[--manager-machine-type=<machineType>] [--manager-image-uri=<imageUri>] [[--manager-accelerator-type=<type>] [--manager-accelerator-count=<count>]] [[--manager-boot-disk-type=<bootDiskType>] [--manager-boot-disk-size=<bootDiskSizeGb>] [--manager-num-local-ssds=<numLocalSsds>] [--manager-local-ssd-interface=<localSsdInterface>]]] [[--num-workers=<numNodes>] [--worker-machine-type=<machineType>] [--worker-image-uri=<imageUri>] [[--worker-accelerator-type=<type>] [--worker-accelerator-count=<count>]] [[--worker-boot-disk-type=<bootDiskType>] [--worker-boot-disk-size=<bootDiskSizeGb>] [--worker-num-local-ssds=<numLocalSsds>] [--worker-local-ssd-interface=<localSsdInterface>]]] [[--num-secondary-workers=<numNodes>] [--secondary-worker-machine-type=<machineType>] [--secondary-worker-image-uri=<imageUri>] [--secondary-worker-type=<type>] [[--secondary-worker-accelerator-type=<type>] [--secondary-worker-accelerator-count=<count>]] [[--secondary-worker-boot-disk-type=<bootDiskType>] [--secondary-worker-boot-disk-size=<bootDiskSizeGb>] [--secondary-worker-num-local-ssds=<numLocalSsds>] [--secondary-worker-local-ssd-interface=<localSsdInterface>]]]

Description

Add a controlled GCP Dataproc cluster resource with Jupyter. For a detailed explanation of parameters, see https://cloud.google.com/dataproc/docs/reference/rest/v1/projects.regions.clusters#Cluster

Options

  • --id=<id>
    ID of the resource, scoped to the workspace. Only use letters, numbers, dashes, and underscores.

  • --description=<description>
    Description of the resource.

  • --workspace=<id>
    Workspace id to use for this command only.

  • --format=<format>
    Set the format for printing command output: JSON, TEXT. Defaults to the config format property.

    Default: null
    
  • --quiet
    Suppress interactive prompt.

  • --cluster-id=<clusterId>
    The unique name to give to the Dataproc cluster. Cannot be changed later. The cluster name must be 1 to 52 characters long and contain only lowercase letters, numeric characters, and dashes. The first character must be a lowercase letter and the last character cannot be a dash. If not specified, a value will be auto-generated for you.

  • --region=<region>
    The Google Cloud region of the cluster.

  • --image-version=<imageVersion>
    The Dataproc cluster image version, which determines the versions of its software components. See https://cloud.google.com/dataproc/docs/concepts/versioning/dataproc-version-clusters for the full list of image versions and their bundled software components.

  • --initialization-actions=<initializationActions>[,<initializationActions>...]
    A comma-separated list of initialization scripts to run during cluster creation. Each path must be a URL or Cloud Storage path, e.g. 'gs://path-to-file/file-name'.

  • --components=<components>[,<components>...]
    Comma-separated list of components.

  • --properties=<String=String>[,<String=String>...]
    Properties in the format key=value.

  • --software-framework=<softwareFrameworkType>
    Software framework for the cluster. Available frameworks are: NONE, HAIL.

    Default: NONE
    
  • --bucket=<configBucket>
    Resource name of the cluster staging bucket. If not specified, a default staging bucket will be created.

  • --temp-bucket=<tempBucket>
    Resource name of the cluster temp bucket. If not specified, a default temp bucket will be created.

  • --autoscaling-policy=<autoscalingPolicy>
    Autoscaling policy id to attach to the cluster.

  • -M, --metadata=<String=String>[,<String=String>...]
    Custom metadata to apply to this cluster.

    Specify multiple metadata entries by repeating the option, e.g. --metadata="key1=value1" -Mkey2=value2,

    or by separating entries with commas, e.g. --metadata=key1=value1,key2=value2.

    By default, the Workbench CLI server (terra-cli-server=[CLI_SERVER_ID]) and the Workbench workspace id (terra-workspace-id=[WORKSPACE_ID]) are set.

  • --idle-delete-ttl=<idleDeleteTtl>
    Time-to-live for an idle cluster: how long the cluster may remain idle before it is deleted.
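
The naming rule given for --cluster-id above can be checked locally before calling wb. The following is a minimal sketch in POSIX shell; the function name is_valid_cluster_id is hypothetical and not part of the Workbench CLI, and the server may apply further checks that are not reproduced here.

```shell
#!/bin/sh
# Hypothetical helper (not part of wb): returns 0 if the argument satisfies
# the documented --cluster-id rule, 1 otherwise.
is_valid_cluster_id() (
  # The body runs in a subshell, so these assignments do not leak to the caller.
  LC_ALL=C   # force ASCII ranges so [a-z] never matches uppercase letters
  id=$1

  # Length must be between 1 and 52 characters.
  [ "${#id}" -ge 1 ] && [ "${#id}" -le 52 ] || return 1

  case $id in
    *[!a-z0-9-]*) return 1 ;;  # only lowercase letters, digits, and dashes
    [!a-z]*)      return 1 ;;  # first character must be a lowercase letter
    *-)           return 1 ;;  # last character cannot be a dash
  esac
  return 0
)
```

For example, is_valid_cluster_id "my-cluster-1" succeeds, while "My-cluster", "1cluster", and "cluster-" are all rejected.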

Manager node configurations

  • --manager-machine-type=<machineType>
    The machine type of the manager node.

    Default: n2-standard-2
    
  • --manager-image-uri=<imageUri>
    The image URI for the manager node.

  • --manager-accelerator-type=<type>
    The type of accelerator for the manager.

  • --manager-accelerator-count=<count>
    The count of accelerators for the manager.

    Default: 0
    
  • --manager-boot-disk-type=<bootDiskType>
    The type of boot disk for the manager node.

  • --manager-boot-disk-size=<bootDiskSizeGb>
    The size of the boot disk in GB for the manager node.

    Default: 500
    
  • --manager-num-local-ssds=<numLocalSsds>
    The number of local SSDs for the manager node.

    Default: 0
    
  • --manager-local-ssd-interface=<localSsdInterface>
    The interface type of local SSDs for the manager node.

    Default: scsi
    

Worker node configurations

  • --num-workers=<numNodes>
    The number of worker nodes.

    Default: 2
    
  • --worker-machine-type=<machineType>
    The machine type of the worker node.

    Default: n2-standard-2
    
  • --worker-image-uri=<imageUri>
    The image URI for the worker node.

  • --worker-accelerator-type=<type>
    The type of accelerator for the worker.

  • --worker-accelerator-count=<count>
    The count of accelerators for the worker.

    Default: 0
    
  • --worker-boot-disk-type=<bootDiskType>
    The type of boot disk for the worker node.

  • --worker-boot-disk-size=<bootDiskSizeGb>
    The size of the boot disk in GB for the worker node.

    Default: 500
    
  • --worker-num-local-ssds=<numLocalSsds>
    The number of local SSDs for the worker node.

    Default: 0
    
  • --worker-local-ssd-interface=<localSsdInterface>
    The interface type of local SSDs for the worker node.

    Default: scsi
    

Secondary worker node configurations

  • --num-secondary-workers=<numNodes>
    The number of secondary worker nodes.

    Default: 0
    
  • --secondary-worker-machine-type=<machineType>
    The machine type of the secondary worker node.

    Default: n2-standard-2
    
  • --secondary-worker-image-uri=<imageUri>
    The image URI for the secondary worker node.

  • --secondary-worker-type=<type>
    The type of the secondary worker. Valid values are preemptible, non-preemptible, and spot.

    Default: spot
    
  • --secondary-worker-accelerator-type=<type>
    The type of accelerator for the secondary worker.

  • --secondary-worker-accelerator-count=<count>
    The count of accelerators for the secondary worker.

    Default: 0
    
  • --secondary-worker-boot-disk-type=<bootDiskType>
    The type of boot disk for the secondary worker node.

  • --secondary-worker-boot-disk-size=<bootDiskSizeGb>
    The size of the boot disk in GB for the secondary worker node.

    Default: 500
    
  • --secondary-worker-num-local-ssds=<numLocalSsds>
    The number of local SSDs for the secondary worker node.

    Default: 0
    
  • --secondary-worker-local-ssd-interface=<localSsdInterface>
    The interface type of local SSDs for the secondary worker node.

    Default: scsi
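
The node group options above can be combined in a single invocation. The following is a dry-run sketch in POSIX shell that prints the command rather than running it. The function name print_create_cluster_cmd is hypothetical, and the machine types, disk size, node counts, and metadata keys are illustrative assumptions, not recommendations from this page.

```shell
#!/bin/sh
# Dry-run sketch: prints a wb invocation instead of executing it.
# All option values below are illustrative; adjust them to your workspace.
print_create_cluster_cmd() {
  resource_id=$1   # value for --id (letters, numbers, dashes, underscores)
  region=$2        # value for --region

  echo wb resource create dataproc-cluster \
    --id="$resource_id" \
    --region="$region" \
    --manager-machine-type=n2-standard-4 \
    --manager-boot-disk-size=200 \
    --num-workers=3 \
    --worker-machine-type=n2-standard-4 \
    --num-secondary-workers=2 \
    --secondary-worker-type=spot \
    --metadata=team=genomics,env=dev
}
```

Calling print_create_cluster_cmd my_cluster us-central1 prints the full command on one line so it can be reviewed first; any option left out keeps the default listed on this page.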
    

Last Modified: 16 January 2025