Learn HPC schedulingby running real experiments.
A living lab notebook of SLURM experiments, field notes, and hands-on challenges for curious cluster engineers.
Built in the open, for curious operators
Engineering notes, field reports, and new drills as we learn.
In the lab now
Working experiments today
Challenges
ExploreHands-on challenges exploring SLURM internals, edge cases, and advanced scheduling algorithms
Tutorials
ExploreExpert deep-dives and comprehensive guides for mastering HPC cluster management
Video Library
ExploreVisual walkthroughs and screencast tutorials for hands-on learning
Resources
ExploreProduction-ready scripts, templates, and curated GitHub repositories
On deck
Next on the bench
Challenges
Practice real-world SLURM problems in a playful sandbox. Submit jobs, break queues, fix scheduling failures — all inside your browser.
Want to help shape it? Drop your email in the lab notes below and we'll invite small waves for fast feedback.
Learn SLURM by tinkering together
Short lab notes with experiments, failures, and fixes from real clusters. Built for the curious and the hands-on.
Runbook autopsies
Post-mortems, fixes, and what we learned.
Queue-first heuristics
Backfill recipes, fair-share tuning, and GPU-aware tips.
Notes + telemetry
Short lessons paired with interactive terminals.