TY - CONF AU - Lietsch, Stefan AU - Kao, Odej ID - 2413 T2 - Proc. Intelligence in Communication Systems (INTELLCOMM) TI - CoLoS - A System for Device Unaware and Position Dependent Communication Based on the Session Initiation Protocol VL - 190 ER - TY - CONF AU - Birkenheuer, Georg AU - Hagelweide, Wilke AU - Hagemeier, Björn AU - Japs, Viktor AU - Keller, Matthias AU - Mayr, Nikolas AU - Meyer, Jan AU - Schumacher, Tobias AU - Voß, Kerstin AU - Zajac, Markus ID - 2414 T2 - Proc. GI Informatiktage TI - PIRANHA – Hunter of Idle Resources VL - 2 ER - TY - CONF AU - Kao, Odej AU - Hovestadt, Matthias AU - Keller, Axel ID - 1993 T2 - Proc. Advanced Research Workshop on High Perfomance Computing: Technology and Applications TI - SLA-aware Job Migration in Grid Environments ER - TY - CONF AU - Burchard, Lars-Olof AU - Heiss, Hans-Ulrich AU - Hovestadt, Matthias AU - Kao, Odej AU - Keller, Axel AU - Linnert, Barry ID - 1994 T2 - Proceedings of the GI-Meeting on Operating Systems TI - An Architecture for SLA-aware Resource Management ER - TY - CONF AB - The next generation Grid will demand the Grid middleware to provide flexibility, transparency, and reliability. This implies the appliance of service level agreements to guarantee a negotiated level of quality of service. These requirements also affect the local resource management systems providing resources for the Grid. At this a gap between these demands and the features of today's resource management systems becomes apparent. In this paper we present an approach which closes this gap. Introducing the architecture of the virtual resource manager we highlight its main features of runtime responsibility, resource virtualization, information hiding, autonomy provision, and smooth integration of existing resource management system installations. AU - Burchard, Lars-Olof AU - Hovestadt, Matthias AU - Kao, Odej AU - Keller, Axel AU - Linnert, Barry ID - 1995 T2 - Proc. Int. Symposium on Cluster Computing and the Grid (CCGRID) TI - Virtual Resource Manager: An Architecture for SLA-aware Resource Management ER - TY - CONF AU - Groppe, Sven AU - Böttcher, Stefan AU - Birkenheuer, Georg ID - 2416 T2 - Proc. Int. Conf. on Enterprise Information Systems (ICEIS) TI - Efficient Querying of Transformed XML Documents ER - TY - CONF AU - Groppe, Sven AU - Böttcher, Stefan AU - Heckel, Reiko AU - Birkenheuer, Georg ID - 2417 T2 - Proc. East-European Conf. on Advances in Databases and Information Systems (ADBIS) TI - Using XSLT Stylesheets to Transform XPath Queries ER - TY - CONF AB - Nearly all existing HPC systems are operated by resource management systems based on the queuing approach. With the increasing acceptance of grid middleware like Globus, new requirements for the underlying local resource management systems arise. Features like advanced reservation or quality of service are needed to implement high level functions like co-allocation. However it is difficult to realize these features with a resource management system based on the queuing concept since it considers only the present resource usage. In this paper we present an approach which closes this gap. By assigning start times to each resource request, a complete schedule is planned. Advanced reservations are now easily possible. Based on this planning approach functions like diffuse requests, automatic duration extension, or service level agreements are described. We think they are useful to increase the usability, acceptance and performance of HPC machines. In the second part of this paper we present a planning based resource management system which already covers some of the mentioned features. AU - Hovestadt, Matthias AU - Kao, Odej AU - Keller, Axel AU - Streit, Achim ID - 1998 KW - High Performance Computing KW - Service Level Agreement KW - Grid Resource KW - Resource Management System KW - Advance Reservation T2 - Proc. Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP) TI - Scheduling in HPC Resource Management Systems: Queuing vs. Planning VL - 2862 ER - TY - CONF AU - P. Miller, Barton AU - Labarta, Jesús AU - Schintke, Florian AU - Simon, Jens ID - 2426 SN - 978-3-540-45706-0 T2 - Proc. European Conf. on Parallel Processing (Euro-Par) TI - Performance Evaluation, Analysis and Optimization VL - 2400 ER - TY - JOUR AB - Workstation clusters are often not only used for high-throughput computing in time-sharing mode but also for running complex parallel jobs in space-sharing mode. This poses several difficulties to the resource management system, which must be able to reserve computing resources for exclusive use and also to determine an optimal process mapping for a given system topology. On the basis of our CCS software, we describe the anatomy of a modern resource management system. Like Codine, Condor, and LSF, CCS provides mechanisms for the user-friendly system access and management of clusters. But unlike them, CCS is targeted at the effective support of space-sharing parallel computers and even metacomputers. Among other features, CCS provides a versatile resource description facility, topology-based process mapping, pluggable schedulers, and hooks to metacomputer management. AU - Keller, Axel AU - Reinefeld, Alexander ID - 1999 JF - Annual Review of Scalable Computing TI - Anatomy of a Resource Management System for HPC Clusters VL - 3 ER - TY - CONF AB - The Testbed and Applications working group of the European Grid Forum (EGrid) is actively building and experimenting with a grid infrastructure connecting several research-based supercomputing sites located in Europe. The paper reports on our first feasibility study: running a self-migrating version of the Cactus simulation code across the European grid testbed, including "live" remote data visualization and steering from different demonstration booths at Supercomputing 2000, in Dallas, TX. We report on the problems that had to be resolved for this endeavour and identify open research challenges for building production-grade grid environments. AU - Gehring, Jörn AU - Keller, Axel AU - Reinefeld, Alexander AU - Streit, Achim ID - 2000 T2 - Proc. Int. Symposium on Cluster Computing and the Grid (CCGRID) TI - Early Experiences with the EGrid Testbed ER - TY - CONF AB - The availability of commodity high performance components for workstations and networks made it possible to build up large, PC based compute clusters at modest costs. These clusters seem to be a realistic alternative to proprietary, massively parallel systems with respect to the price/performance ratio. However, from the administration point of view, those systems are still often solely a collection of autonomous nodes, connected by a fast short area network. Therefore, aiming at providing the best possible performance in daily work to all users, a lot of work has to be done before obtaining the expected result. The paper describes the problem areas we had to cope with during the integration of two large SCI clusters (one with 64 and one with 192 processors) in the environment of the Paderborn Center for Parallel Computing. AU - Keller, Axel AU - Krawinkel, Andreas ID - 2002 T2 - Proc. Int. Symposium on Cluster Computing and the Grid (CCGRID) TI - Lessons Learned While Operating Two Large SCI Clusters ER - TY - GEN AU - Hungershöfer, Jan AU - Streit, Achim AU - Wierum, Jens-Michael ID - 2427 TI - Efficient Resource Management for Malleable Applications ER - TY - CONF AU - Schintke, Florian AU - Simon, Jens AU - Reinefeld, Alexander ID - 2431 T2 - Proc. Int. Conf. on Computational Science (ICCS) TI - A Cache Simulator for Shared Memory Systems VL - 2074 ER - TY - CONF AB - RsdEditor is a graphical user interface which produces specifications of computational resources. It is used in the RSD (Resource and Service Description) environment for specifying, registering, requesting and accessing resources and services in a metacomputer. RsdEditor was designed to be used by the administrators and users of metacomputing environments. At the administrator level, the GUI is used to describe the available computing and networking components of a metacomputer. At the user level, RsdEditor can be used to specify which characteristics of the computational resources are needed to execute a meta-application. This paper is organized as follows: it first introduces RsdEditor. It then briefly describes the RSD environment, and finally, it highlights various features and implementation issues of RsdEditor. AU - Baraglia, Ranieri AU - Keller, Axel AU - Laforenza, Domenico AU - Reinefeld, Alexander ID - 2003 T2 - Proc. Heterogenous Computing Workshop HCW at IPDPS TI - RsdEditor: A Graphical User Interface for Specifying Metacomputer Components ER - TY - THES AU - Simon, Jens ID - 2434 SN - 3-934445-03-9 TI - Werkzeugunterstützte effiziente Nutzung von Hochleistungsrechnern ER - TY - CONF AB - With the recent availability of cost-effective network cards for the PCI bus, researchers have been tempted to build up large compute clusters with standard PCs. Many of them are operated with workstation cluster management software in high-throughput or single user mode. For very large clusters with more than 100 PEs, however, it becomes necessary to implement a full fledged resource management software that allows to partition the system for multi-user access. In this paper, we present our Computing Center Software (CCS), which was originally designed for managing massively parallel high-performance computers, and now adapted to modern workstation clusters. It provides - partitioning of exclusive and non-exclusive resources, - hardware-independent scheduling of interactive and batch jobs, - open, extensible interfaces to other resource management systems, - a high degree of reliability. AU - Brune, Matthias AU - Keller, Axel AU - Reinefeld, Alexander ID - 2004 T2 - Proc. Int. Conf. on High-Performance Computing and Networking (HPCN) TI - Resource Management for High-Performance PC Clusters ER - TY - CHAP AB - With a steadily increasing number of services, metacomputing is now gaining importance in science and industry. Virtual organizations, autonomous agents, mobile computing services, and high-performance client–server applications are among the many examples of metacomputing services. For all of them, resource description plays a major role in organizing access, use, and administration of the computing components and software services. We present a generic Resource and Service Description (RSD) for specifying the hardware and software components of (meta-) computing environments. Its graphical interface allows metacomputer users to specify their resource requests. Its textual counterpart gives service providers the necessary flexibility to specify topology and properties of the available system and software resources. Finally, its internal object-oriented representation is used to link different resource management systems and service tools. With these three representations, our generic RSD approach is a key component for building metacomputer environments. AU - Brune, Matthias AU - Gehring, Jörn AU - Keller, Axel AU - Reinefeld, Alexander ED - Buya, R. ID - 2005 T2 - High-Performance Cluster Computing: Architecture and Systems TI - Specifying Resources and Services in Metacomputing Systems ER - TY - JOUR AB - We present a software system for the management of geographically distributed high‐performance computers. It consists of three components: 1. The Computing Center Software (CCS) is a vendor‐independent resource management software for local HPC systems. It controls the mapping and scheduling of interactive and batch jobs on massively parallel systems; 2. The Resource and Service Description (RSD) is used by CCS for specifying and mapping hardware and software components of (meta‐)computing environments. It has a graphical user interface, a textual representation and an object‐oriented API; 3. The Service Coordination Layer (SCL) co‐ordinates the co‐operative use of resources in autonomous computing sites. It negotiates between the applications' requirements and the available system services. AU - Brune, Matthias AU - Gehring, Jörn AU - Keller, Axel AU - Reinefeld, Alexander ID - 2007 JF - Concurrency, Practice, and Experience TI - Managing Clusters of Geographically Distributed High-Performance Computers VL - II(15) ER - TY - CHAP AB - The growing maturity of hardware and software components has tempted researchers to build very large SCI clusters with several hundred processors that are operated as high-performance compute servers in multi-user mode. In this chapter, we present a resource management software for the user access and system administration of high-performance compute clusters named Computing Center Software (CCS). It is in day-to-day use since 1992 on various parallel systems and has recently been adapted to the management of SCI clusters. CCS provides pluggable schedulers, optimal space partitioning for multiple users, reliable user access, and powerful tools for specifying resources and services by means of a specification language and a graphical user interface. After a brief introduction in the remainder of this section, we describe the CCS system architecture and the characteristics of its resource description facilities. AU - Brune, Matthias AU - Keller, Axel AU - Reinefeld, Alexander ED - Hellwagner, Hermann ED - Reinefeld, Alexander ID - 2008 T2 - SCI - Scalable Coherent Interface: Architecture and Software for High Performance Compute Clusters TI - Multi-User System Management on SCI Cluster ER -