“interactive” script for SLURM

I recently rolled out a new distributed model for our research computing cluster at work. We’re using GlusterFS for networked home directories and SLURM for job/resource scheduling. GlusterFS allows us to scale storage with minimal downtime or service disruption, and SLURM allows us to treat compute nodes as generic resources for running users’ jobs (ie, […]

Sysadmin is happy when users use SLURM

I recently rolled out a new distributed model for our research computing cluster at work. We’re using GlusterFS for networked home directories and SLURM for job/resource scheduling. GlusterFS allows us to scale storage with minimal downtime or service disruption, and SLURM allows us to treat compute nodes as generic resources for running users’ jobs (ie, […]

This server should be on fire

Sometimes users run scripts without checking the parameters first, often recycling scripts they’ve used elsewhere, at former jobs or at other institutes. Sometimes this works out, sometimes it doesn’t… With a load of ~75 and 32 physical cores, the CPU usage on this server is oversubscribed 2.3 times a “normal” 100% load (where 100% of […]

When I’m not compiling kernels at home…

Contrary to popular belief, hacking on Linux kernels and recompiling Android is not my day job… I am gainfully employed as a Systems Analyst/Administrator/whatever at the International Livestock Research Institute (ILRI) in Nairobi, Kenya. See? We use high-performance Linux servers to analyze research data, including genetic sequence data, remote sensing imagery, GIS layers, etc. I’m […]