TY - CONF AU - Lerch, Nicolas AU - Nitsche, Holger AU - Voss, Kerstin AU - Hovestadt, Matthias ID - 2408 T2 - Proc. Cracow Grid Workshop (CGW) TI - First Steps of a Monitoring Framework to Empower Risk Assessment on Grids ER - TY - CONF AU - Birkenheuer, Georg AU - Döhre, Sven AU - Hovestadt, Matthias AU - Kao, Odej AU - Voss, Kerstin ID - 2409 T2 - Proc. Cracow Grid Workshop (CGW) TI - On Similarities of Grid Resources for Identifying Potential Migration Targets ER - TY - CONF AU - Birkenheuer, Georg AU - Djemame, Karim AU - Gourlay, Iain AU - Kao, Odej AU - Padgett, James AU - Voß, Kerstin ID - 2410 T2 - Proc. WS-Agreement Workshop (Open Grid Forum 18) TI - Using WS-Agreement for Risk Management in the Grid ER - TY - CONF AB - The next generation grid applications demand grid middleware for a flexible negotiation mechanism supporting various ways of quality-of-service (QoS) guarantees. In this context, a QoS guarantee covers simultaneous allocations of various kinds of different resources, such as processor runtime, storage capacity, or network bandwidth, which are specified in the form of service level agreements (SLA). Currently, a gap exists between the capabilities of grid middleware and the underlying resource management systems concerning their support for QoS and SLA negotiation. In this paper we present an approach which closes this gap. Introducing the architecture of the virtual resource manager, we highlight its main QoS management features like run-time responsibility, co-allocation, and fault tolerance. AU - Burchard, Lars-Olof AU - Heine, Felix AU - Hovestadt, Matthias AU - Kao, Odej AU - Keller, Axel AU - Linnert, Barry ID - 1992 T2 - Proc. IEEE Int. Parallel & Distributed Processing Symposium (IPDPS) TI - A Quality-of-Service Architecture for Future Grid Computing Applications. ER - TY - CONF AU - Lietsch, Stefan AU - Kao, Odej ID - 2413 T2 - Proc. Intelligence in Communication Systems (INTELLCOMM) TI - CoLoS - A System for Device Unaware and Position Dependent Communication Based on the Session Initiation Protocol VL - 190 ER - TY - CONF AU - Birkenheuer, Georg AU - Hagelweide, Wilke AU - Hagemeier, Björn AU - Japs, Viktor AU - Keller, Matthias AU - Mayr, Nikolas AU - Meyer, Jan AU - Schumacher, Tobias AU - Voß, Kerstin AU - Zajac, Markus ID - 2414 T2 - Proc. GI Informatiktage TI - PIRANHA – Hunter of Idle Resources VL - 2 ER - TY - CONF AU - Kao, Odej AU - Hovestadt, Matthias AU - Keller, Axel ID - 1993 T2 - Proc. Advanced Research Workshop on High Perfomance Computing: Technology and Applications TI - SLA-aware Job Migration in Grid Environments ER - TY - CONF AU - Burchard, Lars-Olof AU - Heiss, Hans-Ulrich AU - Hovestadt, Matthias AU - Kao, Odej AU - Keller, Axel AU - Linnert, Barry ID - 1994 T2 - Proceedings of the GI-Meeting on Operating Systems TI - An Architecture for SLA-aware Resource Management ER - TY - CONF AB - The next generation Grid will demand the Grid middleware to provide flexibility, transparency, and reliability. This implies the appliance of service level agreements to guarantee a negotiated level of quality of service. These requirements also affect the local resource management systems providing resources for the Grid. At this a gap between these demands and the features of today's resource management systems becomes apparent. In this paper we present an approach which closes this gap. Introducing the architecture of the virtual resource manager we highlight its main features of runtime responsibility, resource virtualization, information hiding, autonomy provision, and smooth integration of existing resource management system installations. AU - Burchard, Lars-Olof AU - Hovestadt, Matthias AU - Kao, Odej AU - Keller, Axel AU - Linnert, Barry ID - 1995 T2 - Proc. Int. Symposium on Cluster Computing and the Grid (CCGRID) TI - Virtual Resource Manager: An Architecture for SLA-aware Resource Management ER - TY - CONF AU - Groppe, Sven AU - Böttcher, Stefan AU - Birkenheuer, Georg ID - 2416 T2 - Proc. Int. Conf. on Enterprise Information Systems (ICEIS) TI - Efficient Querying of Transformed XML Documents ER - TY - CONF AU - Groppe, Sven AU - Böttcher, Stefan AU - Heckel, Reiko AU - Birkenheuer, Georg ID - 2417 T2 - Proc. East-European Conf. on Advances in Databases and Information Systems (ADBIS) TI - Using XSLT Stylesheets to Transform XPath Queries ER - TY - CONF AB - Nearly all existing HPC systems are operated by resource management systems based on the queuing approach. With the increasing acceptance of grid middleware like Globus, new requirements for the underlying local resource management systems arise. Features like advanced reservation or quality of service are needed to implement high level functions like co-allocation. However it is difficult to realize these features with a resource management system based on the queuing concept since it considers only the present resource usage. In this paper we present an approach which closes this gap. By assigning start times to each resource request, a complete schedule is planned. Advanced reservations are now easily possible. Based on this planning approach functions like diffuse requests, automatic duration extension, or service level agreements are described. We think they are useful to increase the usability, acceptance and performance of HPC machines. In the second part of this paper we present a planning based resource management system which already covers some of the mentioned features. AU - Hovestadt, Matthias AU - Kao, Odej AU - Keller, Axel AU - Streit, Achim ID - 1998 KW - High Performance Computing KW - Service Level Agreement KW - Grid Resource KW - Resource Management System KW - Advance Reservation T2 - Proc. Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP) TI - Scheduling in HPC Resource Management Systems: Queuing vs. Planning VL - 2862 ER - TY - CONF AU - P. Miller, Barton AU - Labarta, Jesús AU - Schintke, Florian AU - Simon, Jens ID - 2426 SN - 978-3-540-45706-0 T2 - Proc. European Conf. on Parallel Processing (Euro-Par) TI - Performance Evaluation, Analysis and Optimization VL - 2400 ER - TY - CONF AB - The Testbed and Applications working group of the European Grid Forum (EGrid) is actively building and experimenting with a grid infrastructure connecting several research-based supercomputing sites located in Europe. The paper reports on our first feasibility study: running a self-migrating version of the Cactus simulation code across the European grid testbed, including "live" remote data visualization and steering from different demonstration booths at Supercomputing 2000, in Dallas, TX. We report on the problems that had to be resolved for this endeavour and identify open research challenges for building production-grade grid environments. AU - Gehring, Jörn AU - Keller, Axel AU - Reinefeld, Alexander AU - Streit, Achim ID - 2000 T2 - Proc. Int. Symposium on Cluster Computing and the Grid (CCGRID) TI - Early Experiences with the EGrid Testbed ER - TY - CONF AB - The availability of commodity high performance components for workstations and networks made it possible to build up large, PC based compute clusters at modest costs. These clusters seem to be a realistic alternative to proprietary, massively parallel systems with respect to the price/performance ratio. However, from the administration point of view, those systems are still often solely a collection of autonomous nodes, connected by a fast short area network. Therefore, aiming at providing the best possible performance in daily work to all users, a lot of work has to be done before obtaining the expected result. The paper describes the problem areas we had to cope with during the integration of two large SCI clusters (one with 64 and one with 192 processors) in the environment of the Paderborn Center for Parallel Computing. AU - Keller, Axel AU - Krawinkel, Andreas ID - 2002 T2 - Proc. Int. Symposium on Cluster Computing and the Grid (CCGRID) TI - Lessons Learned While Operating Two Large SCI Clusters ER - TY - CONF AU - Schintke, Florian AU - Simon, Jens AU - Reinefeld, Alexander ID - 2431 T2 - Proc. Int. Conf. on Computational Science (ICCS) TI - A Cache Simulator for Shared Memory Systems VL - 2074 ER - TY - CONF AB - RsdEditor is a graphical user interface which produces specifications of computational resources. It is used in the RSD (Resource and Service Description) environment for specifying, registering, requesting and accessing resources and services in a metacomputer. RsdEditor was designed to be used by the administrators and users of metacomputing environments. At the administrator level, the GUI is used to describe the available computing and networking components of a metacomputer. At the user level, RsdEditor can be used to specify which characteristics of the computational resources are needed to execute a meta-application. This paper is organized as follows: it first introduces RsdEditor. It then briefly describes the RSD environment, and finally, it highlights various features and implementation issues of RsdEditor. AU - Baraglia, Ranieri AU - Keller, Axel AU - Laforenza, Domenico AU - Reinefeld, Alexander ID - 2003 T2 - Proc. Heterogenous Computing Workshop HCW at IPDPS TI - RsdEditor: A Graphical User Interface for Specifying Metacomputer Components ER - TY - CONF AB - With the recent availability of cost-effective network cards for the PCI bus, researchers have been tempted to build up large compute clusters with standard PCs. Many of them are operated with workstation cluster management software in high-throughput or single user mode. For very large clusters with more than 100 PEs, however, it becomes necessary to implement a full fledged resource management software that allows to partition the system for multi-user access. In this paper, we present our Computing Center Software (CCS), which was originally designed for managing massively parallel high-performance computers, and now adapted to modern workstation clusters. It provides - partitioning of exclusive and non-exclusive resources, - hardware-independent scheduling of interactive and batch jobs, - open, extensible interfaces to other resource management systems, - a high degree of reliability. AU - Brune, Matthias AU - Keller, Axel AU - Reinefeld, Alexander ID - 2004 T2 - Proc. Int. Conf. on High-Performance Computing and Networking (HPCN) TI - Resource Management for High-Performance PC Clusters ER - TY - CONF AU - Brune, Matthias AU - Reinefeld, Alexander AU - Varnholt, Jörg ID - 2436 T2 - Proc. Int. Symp. High-Performance Distributed Computing (HPDC) TI - A Resource Description Environment for Distributed Computing Systems ER - TY - CONF AB - RSD (Resource and Service Description) is a scheme for specifying resources and services in complex heterogeneous computing systems and metacomputing environments. At the system administrator level, RSD is used to specify the available system components, such as the number of nodes, their interconnection topology, CPU speeds, and available software packages. At the user level, a GUI provides a comfortable, high-level interface for specifying system requests. A textual editor can be used for defining repetitive and recursive structures. This gives service providers the necessary flexibility for fine-grained specification of system topologies, interconnection networks, system and software dependent properties. All these representations are mapped onto a single, coherent internal object-oriented resource representation. Dynamic aspects (like network performance, availability of compute nodes, and compute node loads) are traced at runtime and included in the resource description to allow for optimal process mapping and dynamic task load balancing at runtime at the metacomputer level. This is done in a self-organizing way, with human system operators becoming only involved when new hardware/software components are installed. AU - Brune, Matthias AU - Gehring, Jörn AU - Keller, Axel AU - Reinefeld, Alexander ID - 2009 T2 - Proc. Int. Conf. on High-Performance Computing Systems (HPCS) TI - RSD - Resource and Service Description ER -