Research Cyberinfrastructure

High-Performance Computing (HPC) Documentation.

Categories (7)

Innovator Cluster

The Innovator HPC cluster combines advanced CPU and GPU architectures for parallel computing and accelerated machine learning. Its NVIDIA A100 GPUs give the computational tasks of AI researchers and large-scale simulations a significant performance boost.

Slurm (Cluster Resource Manager)

Slurm is an open source, fault-tolerant, and highly scalable cluster management and job scheduling system for large and small Linux clusters. The Slurm resource manager has three key functions. First, it allocates exclusive and/or non-exclusive access to resources (compute nodes) to users for some duration of time so they can perform work. Second, it provides a framework for starting, executing, and monitoring work (normally a parallel job) on the set of allocated nodes. Finally, it arbitrates contention for resources by managing a queue of pending work.
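The allocation and execution steps described above come together in a batch script. Below is a minimal sketch; the job name, resource values, and commands are illustrative placeholders, and the partitions and limits on your cluster will differ:

```shell
#!/bin/bash
#SBATCH --job-name=demo          # name shown in the queue
#SBATCH --nodes=1                # number of compute nodes to allocate
#SBATCH --ntasks=4               # total tasks (processes) to launch
#SBATCH --mem=8G                 # memory per node
#SBATCH --time=01:00:00          # wall-clock limit (HH:MM:SS)
#SBATCH --output=slurm-%j.out    # output file (%j expands to the job ID)

# Commands below run on the allocated node(s)
srun hostname
```

Submit the script with `sbatch job.sh`, watch the queue with `squeue -u $USER`, and cancel a pending or running job with `scancel <jobid>`. The `#SBATCH` directives are how Slurm learns what resources to allocate before it starts, monitors, and (if necessary) queues the work.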

Software

Research software tutorials and information.

Workshops

Research Cyberinfrastructure workshop tutorials and other information.

Discovery Cluster

This MRI-funded high-performance computing (HPC) cluster is designed to accelerate advanced research in GPU-intensive fields such as machine learning, deep learning, data analytics, and scientific simulation. Featuring the latest NVIDIA H100 GPUs, the system provides researchers with cutting-edge capabilities for massively parallel workloads and AI model training. This resource supports the SDSU research community by enabling scalable, high-throughput computation for projects that demand significant GPU acceleration.

Jobstats (Job Performance Monitoring)

Jobstats is an open-source tool that helps researchers and system administrators understand how efficiently their HPC jobs use resources on Slurm clusters. It collects detailed information about CPU, GPU, and memory usage for each job using Prometheus and displays it through Grafana dashboards. With Jobstats, you can see in real time how your job is performing, compare what you requested against what you actually used, and get recommendations for right-sizing future requests. It also keeps a history of past jobs so you can spot trends and make smarter choices in future runs. For researchers, that means faster queues and fewer wasted resources; for admins, it means better overall cluster efficiency and easier planning. Jobstats turns raw usage data into clear, actionable insights that help everyone make the most of HPC resources.
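As a sketch of day-to-day use, assuming the `jobstats` command-line client is available on the login nodes (the exact invocation may differ on this cluster):

```shell
# Show the efficiency report for a single completed job
# (the job ID below is a placeholder)
jobstats 1234567

# The report summarizes CPU/GPU utilization and memory used versus
# requested, which is the "requested vs. actually used" comparison
# described above
```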

Articles (10)

Globus

Documentation on how to use Globus for data transfers.
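Globus transfers can also be driven from the Globus CLI. A minimal sketch, where the endpoint UUIDs and paths are placeholders you would replace with your own:

```shell
# Authenticate the Globus CLI (opens a browser window)
globus login

# Find a collection/endpoint by name
globus endpoint search "SDSU"

# Start an asynchronous, managed transfer between two endpoints
# (UUIDs and paths below are placeholders)
SRC_ID="<source-endpoint-uuid>"
DST_ID="<destination-endpoint-uuid>"
globus transfer "$SRC_ID:/home/user/data.tar" "$DST_ID:/project/data.tar" \
    --label "example-transfer"
```

Globus queues the transfer and retries on failure, so you can log off while it runs.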

Linux Command Line (Basic Commands)

Basic commands to help users navigate the Linux command line without a GUI.
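A few of the essentials, as a self-contained session you can try in any Linux shell:

```shell
pwd                       # print the current working directory
mkdir -p projects         # create a directory (no error if it exists)
cd projects               # change into it
echo "hello" > notes.txt  # create a small text file
cat notes.txt             # print the file's contents
ls -l                     # list files with details
cp notes.txt backup.txt   # copy a file
rm backup.txt             # delete a file
```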

Secure Shell (SSH) Connections

Article on how to SSH into RCI Linux systems.
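The basic connection pattern looks like the sketch below; the username and hostname are placeholders for your own account and the cluster's login node:

```shell
# Connect to a remote Linux system over SSH
ssh username@login.example.edu

# Generate an SSH key pair for key-based (passwordless) login
ssh-keygen -t ed25519

# Install your public key on the remote host
ssh-copy-id username@login.example.edu
```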

Transferring Files (FTP/SCP)

Article explaining how to use FTP and SCP to move files from one system to another.
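Typical SCP invocations follow one pattern: `scp <source> <destination>`, where either side may be remote. A sketch with placeholder host and paths:

```shell
# Copy a local file to a remote system
scp report.txt username@login.example.edu:/home/username/

# Copy a file from the remote system to the current local directory
scp username@login.example.edu:/home/username/results.csv .

# Recursively copy an entire directory
scp -r data/ username@login.example.edu:/home/username/data/
```

Unlike plain FTP, SCP runs over SSH, so the transfer is encrypted end to end.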

Virtual Network Connection (VNC)

Article showing how to use VNC in our research environment.
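VNC traffic is usually tunneled through SSH rather than exposed directly. A minimal sketch, assuming a VNC server on display `:1` (port 5901); the hostname and display number are placeholders:

```shell
# Forward the VNC port through SSH (display :1 maps to port 5901)
ssh -L 5901:localhost:5901 username@login.example.edu

# While the SSH session is open, point a VNC viewer on your own
# machine at the forwarded port:
#   localhost:5901
```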

DCV (NICE Desktop Cloud Visualization)

Step-by-step guide to using visualization on the cluster with DCV.

SDSU Open OnDemand

Details on how to log into Open OnDemand and access the Jupyter software suite.