Jobstats is an open-source tool that helps researchers and system administrators understand how efficiently their HPC jobs use resources on Slurm clusters. It collects per-job CPU, GPU, and memory usage with Prometheus and displays it through Grafana dashboards. With Jobstats, you can watch how a running job is performing, compare the resources you requested with what the job actually used, and get recommendations for improving efficiency. It also keeps a history of past jobs so you can spot trends and make better-informed choices in future runs. For researchers, that can mean shorter queue waits and fewer wasted resources; for admins, it means better overall cluster efficiency and easier capacity planning. In short, Jobstats turns raw usage data into actionable information that helps everyone make the most of HPC resources.
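To make the "requested versus used" comparison concrete, here is a minimal sketch of how CPU and memory efficiency are commonly computed for a finished job. It is not Jobstats code: the `JobUsage` class, its field names, and the sample numbers are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class JobUsage:
    """Hypothetical per-job summary; field names are illustrative, not Jobstats output."""
    cores_requested: int       # CPU cores allocated to the job
    elapsed_seconds: float     # wall-clock run time
    cpu_seconds_used: float    # total CPU time consumed across all cores
    mem_requested_gb: float    # memory requested
    mem_peak_gb: float         # peak memory actually used


def cpu_efficiency(job: JobUsage) -> float:
    """Fraction of the allocated core-seconds that the job actually used."""
    available = job.cores_requested * job.elapsed_seconds
    return job.cpu_seconds_used / available if available > 0 else 0.0


def memory_efficiency(job: JobUsage) -> float:
    """Fraction of the requested memory that the job actually used at its peak."""
    if job.mem_requested_gb <= 0:
        return 0.0
    return job.mem_peak_gb / job.mem_requested_gb


# Made-up example: 8 cores for 1 hour, 4 core-hours of CPU time, 8 GB peak of 32 GB requested.
job = JobUsage(cores_requested=8, elapsed_seconds=3600,
               cpu_seconds_used=14400, mem_requested_gb=32, mem_peak_gb=8)
print(f"CPU efficiency:    {cpu_efficiency(job):.0%}")
print(f"Memory efficiency: {memory_efficiency(job):.0%}")
```

A job like this one, using half of its allocated CPU time and a quarter of its requested memory, would be a candidate for smaller requests on the next run.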