For TAs

What to Tell Your Students

Students need to know that the cluster resources should only be used for the course. If students run out of GPU time for a course they need to contact a TA. TAs can ask our service desk to increase time for given students.

Before the new semester starts, access to the cluster is disabled and all home directories are deleted as well as all data under /cluster/courses. Tell your students to copy away their data in time and make sure that data in your course directory is also copied away for use then the course is held again.

Running Jobs

For running jobs your students need to be aware of the following:

  • The course tag(s).
  • Information an anything that you have prepared under /cluster/courses and how to use it.

Jobs can be canceled by JupyterHub sessions if the cluster is full. These jobs will automatically restart when there are enough nodes free again. This is very unlikely though as the cluster in the past has never been used to 100% capacity.

To avoid running the whole job from the beginning students need to know techniques to use checkpoints or save state in order to resume from somewhere shortly before the job was killed. It is up to you to help them.

JupyterHub

For running Jupyter notebooks on our JupyterHub (if it was requested by your head-TA) students need to know the following:

  • The course tag(s).
  • Details about the environment(s) under /cluster/courses that you have enabled in the hub.

To use Jupyter with https://student-jupyter.inf.ethz.ch requires you to setup a Jupyter environment under /cluster/courses/{course tag} that we will enable in the chooser for environment of the hub. Instruction can be found here.

Python and Anaconda Environments

We recommend that you set up a python virtual environment or anaconda installation under /cluster/courses/{course tag} with all the packages installed for your particular course. This is especially preferred if you need something like pytorch that requires multiple gigabytes. Then tell your students how to activate the environment.

Students can of course set up their own python environment or install anaconda in their home directory, but that will use up a lot of disk space.

In /cluster/data we keep recent anaconda and miniconda installers ready. Please use these for your class instead of letting each student download them, which would again use unneeded space.

Storage

By default pip and conda cache downloaded packages in the home directory of a user, which quickly fills the home directory. Students only have 20GB of storage.

Please tell your students to use the --no-cache-dir option when using pip. Alternatively pip cache purge can be used to clear the cache after installation.

For conda there is not way to avoid the cache but with conda clean --all the cache can be flushed.

If you provide scripts to your students then please integrate the above to save space in the home directories.

Sharing Data with Students

You can create a folder

/cluster/courses/{course tag}/data_for_users

which will be automatically linked as {course tag}_data into the home/work directory of any participant that logs in to a login node or starts a Jupyter notebook for your course. Examples, templates and solutions can be placed there for all of your participants. Only participants and TAs of the course can access the data. The data will be read only - participants can copy data such as Jupyter notebooks into their home directory from there to alter them.

By default all files in the data_for_users folder are readable by the participants of the course. Be careful what you put there and when.

Page URL: https://isg.inf.ethz.ch/bin/view/Main/ServicesClusterComputingStudentClusterTAs
2024-11-21
© 2024 Eidgenössische Technische Hochschule Zürich