Deploying an Instant Cluster
Deploying an instant cluster
Deploy an instant cluster by clicking the Deploy Cluster button on the top right of the Clusters page. In the next screen, you can choose your contract duration (starting from 1 day) and the number of GPUs (depending on the available resources) you would like to have in your cluster.
Select the operating system and CUDA version you want to use (we recommend CUDA 13.0):

Next, select your shared filesystem size. File systems are mounted as follows:
- Local storage is mounted to - /mnt/local_diskon each worker node.
- SFS is mounted to - /homeon all nodes, including the jump host.

You also need to supply your SSH public key before you deploy. We recommend you choose the cluster hostname appropriately, since your worker nodes will inherit the hostname as the prefix.
Once the above steps are done, you deploy the cluster, just like you would an ordinary DataCrunch cloud instance.
Accessing your cluster
Once deployment has been done on DataCrunch cloud dashboard, please give the cluster around 20 minutes to start.
Please also note that the jump host node will become accessible a few minutes before the worker nodes are ready, when starting the cluster for the first time.
Once the cluster has been created, you can proceed to log in by copying the ssh ubuntu@CLUSTER_IP command from the Clusters screen in your dashboard.
You can login to the individual worker nodes from your jumphost by running ssh WORKER_NAME
Running jobs
We recommend using Slurm to run jobs on the cluster.
Last updated
Was this helpful?