Anatomy of a Resource Management System for HPC Clusters
A. Keller, A. Reinefeld, Annual Review of Scalable Computing 3 (2001) 1–31.
Download
No fulltext has been uploaded.
Journal Article
| Published
| English
Author
Keller, AxelLibreCat;
Reinefeld, Alexander
Abstract
Workstation clusters are often not only used for high-throughput computing in time-sharing mode but also for running complex parallel jobs in space-sharing mode. This poses several difficulties to the resource management system, which must be able to reserve computing resources for exclusive use and also to determine an optimal process mapping for a given system topology.
On the basis of our CCS software, we describe the anatomy of a modern resource management system. Like Codine, Condor, and LSF, CCS provides mechanisms for the user-friendly system access and management of clusters. But unlike them, CCS is targeted at the effective support of space-sharing parallel computers and even metacomputers. Among other features, CCS provides a versatile resource description facility, topology-based process mapping, pluggable schedulers, and hooks to metacomputer management.
Publishing Year
Journal Title
Annual Review of Scalable Computing
Volume
3
Page
1-31
LibreCat-ID
Cite this
Keller A, Reinefeld A. Anatomy of a Resource Management System for HPC Clusters. Annual Review of Scalable Computing. 2001;3:1-31. doi:10.1142/9789812810182_0001
Keller, A., & Reinefeld, A. (2001). Anatomy of a Resource Management System for HPC Clusters. Annual Review of Scalable Computing, 3, 1–31. https://doi.org/10.1142/9789812810182_0001
@article{Keller_Reinefeld_2001, title={Anatomy of a Resource Management System for HPC Clusters}, volume={3}, DOI={10.1142/9789812810182_0001}, journal={Annual Review of Scalable Computing}, author={Keller, Axel and Reinefeld, Alexander}, year={2001}, pages={1–31} }
Keller, Axel, and Alexander Reinefeld. “Anatomy of a Resource Management System for HPC Clusters.” Annual Review of Scalable Computing 3 (2001): 1–31. https://doi.org/10.1142/9789812810182_0001.
A. Keller and A. Reinefeld, “Anatomy of a Resource Management System for HPC Clusters,” Annual Review of Scalable Computing, vol. 3, pp. 1–31, 2001.
Keller, Axel, and Alexander Reinefeld. “Anatomy of a Resource Management System for HPC Clusters.” Annual Review of Scalable Computing, vol. 3, 2001, pp. 1–31, doi:10.1142/9789812810182_0001.