Job Metrics Reporting
Overview of the Job Metrics Dashboard for Slurm
Job Metrics Reporting
The Job Metrics Dashboard provides a comprehensive view of job details tailored specifically for Slurm. It serves as a lightweight analytics tool to provide insights into cluster usage, efficiency, and user activity.
This feature allows administrators and users to visualize job metrics, analyze historical data, and generate reports based on Slurm job accounting data collected by the Slurm History Ingestor.
Key Features
- Job Details: Detailed metrics for individual jobs including wait times and exit codes.
- Cluster Usage: Overview of core hours, job counts, and wait times over time.
- User Activity: Insights into active users, groups, and account usage.
- Visualizations: Graphs and charts for core hours over time, usage by group, and more.
Architecture
The system consists of two main parts:
- Slurm History Ingestor (Go Scraper): A standalone service that fetches job history from the Slurm REST API and stores it in a PostgreSQL database.
- Dashboard Plugin: A frontend component within this Next.js application that queries the database to visualize the metrics.
Requirements
To enable the Job Metrics Dashboard, you need:
- Go Scraper: Installed and running. See Go Scraper.
- PostgreSQL Database: To store the collected metrics.
- Plugin Configuration: Enabling the plugin in the Next.js application.
Please refer to the Installation and Configuration guides for detailed setup instructions.