How to handle long running jobs¶
If you want to run long running jobs on our Linux workstations you have to abide by a few rules.
General rules¶
- Do not bother anyone with your jobs. Keep the machine usable. Use nice or renice with at least nice level 15, better 18.
- Do not run RAM or CPU time intensive jobs at all on our login server login.phys.ethz.ch as there are usually dozens of users logged in and try to do their daily work on it. Clogging up those machines causes many users not being able to work anymore, so no long running or heavily resource consuming computing jobs are allowed on these two machines. Please use one of the computing nodes instead.
- You can use ps, top or htop to monitor CPU and memory usage of the running jobs. Use nvtop to monitor GPU and GPU memory.
- Keep an eye on the RAM usage of your programs and avoid using too much memory.
- Do not run computational jobs on a network filesystem (group share, nor home share)! Use the local disks (e.g. in /scratch/) instead for such cases.
- You should not run several jobs at once unless you know what you are doing. In many cases, it's less efficient than running jobs in sequence.
Personal Workstations¶
If you run long running jobs on your personal workstation you do not need to look after keeping the machine usable for other users. Though