ssh-Colab is a Python module to facilitate remote access to Google Colaboratory (Colab) through Secure Shell (SSH) connections, secured by a third-party software, ngrok. The module automates the tedious routine to set up ngrok tunnels needed for TPU runtime applications and services like TensorBoard. It also provides subroutines for (1) Kaggle Data API installation, (2) Kaggle competition data downloads, (3) data transfers between Colab and Google Cloud Storage (GCS), and (4) Google Drive mounting.
-
Launch a Colab notebook. Choose a runtime type you prefer.
-
Install ssh-Colab. Type and run the following command in a new notebook cell:
!pip install ssh-Colab
Or you can use this command:
!pip install git+https://github.com/libinruan/ssh_Colab.git#egg=ssh_Colab
Another way to install this package is to Git clone its repository to Colab. Run in a new notebook cell:
!git clone https://github.com/libinruan/ssh_Colab.git
%cd ssh_Colab
!sudo python setup.py install
-
Initiate the establishment of tunnels:
import sshColab
sshColab.connect([LOG_DIR='/path/to/log/'])
The default TensorBoard log directory is /log/fit
.
-
Retrieve information that is used for establishing the SSH connection:
sshColab.info()
If you are running a non-TPU-enabled notebook, the setup instruction of TPU resolver is skipped.
-
To activate Kaggle API installation/authentication and download competition data, run:
sshColab.kaggle([data=<name-of-competition>, output=<output-directory>])
Note that the default competition name is tabular-playground-series-mar-2021
. The data is unzipped to the destination folder /kaggle/input
by default.
-
To mount a google drive, run:
-
To connect with GCS, initiate the connection:
sshColab.GCSconnect()
To create a GCS Bucket, run:
sshColab.create_bucket(<project_id>, <bucket_name>)
To list blobs in a GCS bucket, run:
sshColab.list_blobs(<project_id>, <bucket_name>)
To upload files from Colab to a GCS Bucket, run:
sshColab.upload_to_gcs(<project_id>, <bucket_name>, [file=<local_file> ,ext=<file_extension>])
To download files from a GCS Bucket to Colab, run:
sshColab.download_to_colab(<project_id>, <bucket_name>, [file=<local_file>])
-
To disable ngrok tunnels created, run the command below:
sshColab.kill()
A short Colab notebook is provided in the link below. Users can
find a simple end-to-end application starting from ssh-Colab installation, SSH
tunnel creation, to the use of TensorBoard after training a 3-layer MNIST
convolutional neural network.
What's missed in this quick start guide is how to may our way to Colab instances from
local machines. The reference listed below can be a start point for interested
users:
version 0.3.3: Addition of the output argument for function kaggle().
version 0.3.0: Addition of functions for communicating with Google Cloud Storage.
version 0.2.0: Addition of Google Drive mounting function.
version 0.1.3: Addition of Kaggle API installation/authentication and competition data downloading function.