TY - CONF AU - Platzner, Marco AU - Döhre, Sven AU - Happe, Markus AU - Kenter, Tobias AU - Lorenz, Ulf AU - Schumacher, Tobias AU - Send, Andre AU - Warkentin, Alexander ID - 2365 SN - 1-60132-064-7 T2 - Proc. Int. Conf. on Engineering of Reconfigurable Systems and Algorithms (ERSA) TI - The GOmputer: Accelerating GO with FPGAs ER - TY - CONF AB - Service Level Agreements (SLAs) have focal importance if the commercial customer should be attracted to the Grid. An SLA-aware resource management system has already been realize, able to fulfill the SLA of jobs even in the case of resource failures. For this, it is able to migrate checkpointed jobs over the Grid. At this, virtual execution environments allow to increase the number of potential migration targets significantly. In this paper we outline the concept of such virtual execution environments and focus on the SLA negotiation aspects. AU - Battré, Dominic AU - Hovestadt, Matthias AU - Kao, Odej AU - Keller, Axel AU - Voss, Kerstin ID - 1975 T2 - Proc. Int. DMTF Academic Alliance Workshop on Systems and Virtualization Management: Standards and New Technologies TI - Virtual Execution Environments and the Negotiation of Service Level Agreements in Grid Systems ER - TY - CONF AB - OpenCCS is an SLA-aware resource management system which uses transparent checkpointing of applications and migration of checkpoint datasets for ensuring SLA-compliance also in case of resource outages. Migration of checkpoints presumes a high grade of compatibility between source and target resource. Hence, even in large Grid systems only a small number of resources are eligible migration targets. This short paper describes the concept of virtual execution environments and how they increase the number of potential migration targets. It will also outline an implementation within OpenCCS. AU - Battré, Dominic AU - Hovestadt, Matthias AU - Kao, Odej AU - Keller, Axel AU - Voss, Kerstin ID - 1980 T2 - Proc. Int. Conf. on Services Computing (SCC) TI - Virtual Execution Environments for ensuring SLA-compliant Job Migration in Grids ER - TY - CONF AU - Birkenheuer, Georg AU - Brinkmann, André AU - Dömer, Hubert AU - Effert, Sascha AU - Konersmann, Christoph AU - Niehörster, Oliver AU - Simon, Jens ID - 2357 T2 - Proc. Gemeinsamer Workshop der GI/ITG Fachgruppen "Betriebssysteme" und "KuVS": Virtualized IT infrastructures and their management TI - Virtual Supercomputer for HPC and HTC ER - TY - CONF AU - Lietsch, Stefan AU - Marquardt, Oliver ID - 2389 SN - 978-3-540-76857-9 T2 - Proc. Int. Symp. on Visual Computing (ISVC) TI - A CUDA-Supported Approach to Remote Rendering VL - 4841 ER - TY - CONF AU - Voss, Kerstin AU - Djemame, Karim AU - Gourlay, Iain AU - Padgett, James ED - Altmann, Jörn ED - Veit, Daniel ID - 2396 T2 - Proc. Int. Worksh. on Grid Economics and Business Models (GECON) TI - AssessGrid, Economic Issues Underlying Risk Awareness in Grids VL - 4685 ER - TY - CONF AU - Beutel, Jan AU - Dyer, Matthias AU - Lim, Roman AU - Plessl, Christian AU - Woehrle, Matthias AU - Yuecel, Mustafa AU - Thiele, Lothar ID - 2393 KW - WSN KW - testing KW - verification SN - 1-4244-1231-5 T2 - Proc. Int. Conf. Networked Sensing Systems (INSS) TI - Automated Wireless Sensor Network Testing ER - TY - CONF AU - Voss, Kerstin ID - 2390 SN - 0-7695-3007-9 T2 - Proc. Int. Conf. on Semantics, Knowledge and Grid (SKG) TI - Comparing Fault Tolerance Mechanisms for Self-Organizing Resource Management in Grids ER - TY - CONF AU - Lietsch, Stefan AU - Zabel, Henning AU - Berssenbruegge, Jan ID - 2398 SN - 0-7918-3806-4 T2 - Proc. ASME Computers and Information in Engineering Conference (CIE) TI - Computational Steering of Interactive and Distributed Virtual Reality Applications ER - TY - CONF AU - Voss, Kerstin ID - 2397 SN - 0-7695-2858-9 T2 - Proc. Int. Conf. on Networking and Services (ICNS) TI - Enhance Self-managing Grids by Risk Management ER - TY - CONF AU - Battré, Dominic AU - Djemame, Karim AU - Kao, Odej AU - Voss, Kerstin ID - 2391 T2 - Proc. Int. Conf. on Security and Privacy in Communications Networks (SecureComm) TI - Gaining Users' Trust by Publishing Failure Probabilities ER - TY - CONF AU - Birkenheuer, Georg AU - Majlender, Peter AU - Nitsche, Holger AU - Voss, Kerstin AU - Weber, Elmar ID - 2399 T2 - Proc. Cracow Grid Workshop (CGW) TI - Gather and Prepare Monitoring Data for Estimating Resource Stability ER - TY - CONF AU - Woehrle, Matthias AU - Plessl, Christian AU - Beutel, Jan AU - Thiele, Lothar ID - 2392 KW - WSN KW - testing KW - distributed KW - embedded SN - 978-1-59593-694-3 T2 - Proc. Workshop on Embedded Networked Sensors (EmNets) TI - Increasing the Reliability of Wireless Sensor Networks with a Distributed Testing Framework ER - TY - CONF AB - Service level agreements (SLAs) are powerful instruments for describing all obligations and expectations in a business relationship. It is of focal importance for deploying Grid technology to commercial applications. The EC-funded project HPC4U (Highly Predictable Clusters for Internet Grids) aimed at introducing SLA-awareness in local resource management systems, while the EC-funded project AssessGrid introduced the notion of risk, which is associated with every business contract. This paper highlights the concept of planning based resource management and describes the SLA-aware scheduler developed and used in these projects. AU - Battré, Dominic AU - Hovestadt, Matthias AU - Kao, Odej AU - Keller, Axel AU - Voss, Kerstin ID - 1986 T2 - Proc. Workshop of the UK PLANNING AND SCHEDULING Special Interest Group (PlanSIG) TI - Planning-based Scheduling for SLA-awareness and Grid Integration ER - TY - CONF AU - Battré, Dominic AU - Hovestadt, Matthias AU - Kao, Odej AU - Keller, Axel AU - Voss, Kerstin ID - 1988 T2 - Proc. Cracow Grid Workshop, Academic Computer Center CYFRNET TI - Transparent Cross Border Migration of Parallel Multi Node Applications ER - TY - CONF AU - Berssenbrügge, Jan AU - Lietsch, Stefan ID - 2400 SN - 978-3-939350-28-6 T2 - Proc. Worksh. Augmented & Virtual Reality in der Produktentstehung TI - Verteilte Berechnung und Darstellung automobiler Scheinwerfer VL - 209 ER - TY - CONF AU - Lerch, Nicolas AU - Nitsche, Holger AU - Voss, Kerstin AU - Hovestadt, Matthias ID - 2408 T2 - Proc. Cracow Grid Workshop (CGW) TI - First Steps of a Monitoring Framework to Empower Risk Assessment on Grids ER - TY - CONF AU - Djemame, Karim AU - Gourlay, Iain AU - Padgett, James AU - Birkenheuer, Georg AU - Hovestadt, Matthias AU - Kao, Odej AU - Voss, Kerstin ID - 2402 SN - 0-7695-2734-5 T2 - Proc. Int. Conf. on e-Science and Grid Computing TI - Introducing Risk Management into the Grid ER - TY - CONF AU - Lietsch, Stefan AU - Zabel, Henning AU - Berssenbruegge, Jan AU - Wittenberg, Veit AU - Eikermann, Martin ID - 2407 SN - 3-540-48628-3 T2 - Proc. Int. Symp. on Visual Computing (ISVC) TI - Light Simulation in a Distributed Driving Simulator VL - 4291 ER - TY - CONF AU - Birkenheuer, Georg AU - Döhre, Sven AU - Hovestadt, Matthias AU - Kao, Odej AU - Voss, Kerstin ID - 2409 T2 - Proc. Cracow Grid Workshop (CGW) TI - On Similarities of Grid Resources for Identifying Potential Migration Targets ER - TY - CONF AU - Voss, Kerstin ID - 2406 SN - 0-7695-2622-5 T2 - Proc. Int. Conf. on Networking and Services (ICNS) TI - Risk Aware Migrations for Prepossessing SLAs ER - TY - CONF AU - Hovestadt, Matthias AU - Kao, Odej AU - Voss, Kerstin ID - 2403 SN - 0-7695-2670-5 T2 - Proc. Int. Conf. on Services Computing (SCC) TI - The First Step of Introducing Risk Management for Prepossessing SLAs ER - TY - CONF AU - Birkenheuer, Georg AU - Djemame, Karim AU - Gourlay, Iain AU - Kao, Odej AU - Padgett, James AU - Voß, Kerstin ID - 2410 T2 - Proc. WS-Agreement Workshop (Open Grid Forum 18) TI - Using WS-Agreement for Risk Management in the Grid ER - TY - CONF AB - The next generation grid applications demand grid middleware for a flexible negotiation mechanism supporting various ways of quality-of-service (QoS) guarantees. In this context, a QoS guarantee covers simultaneous allocations of various kinds of different resources, such as processor runtime, storage capacity, or network bandwidth, which are specified in the form of service level agreements (SLA). Currently, a gap exists between the capabilities of grid middleware and the underlying resource management systems concerning their support for QoS and SLA negotiation. In this paper we present an approach which closes this gap. Introducing the architecture of the virtual resource manager, we highlight its main QoS management features like run-time responsibility, co-allocation, and fault tolerance. AU - Burchard, Lars-Olof AU - Heine, Felix AU - Hovestadt, Matthias AU - Kao, Odej AU - Keller, Axel AU - Linnert, Barry ID - 1992 T2 - Proc. IEEE Int. Parallel & Distributed Processing Symposium (IPDPS) TI - A Quality-of-Service Architecture for Future Grid Computing Applications. ER - TY - CONF AU - Lietsch, Stefan AU - Kao, Odej ID - 2413 T2 - Proc. Intelligence in Communication Systems (INTELLCOMM) TI - CoLoS - A System for Device Unaware and Position Dependent Communication Based on the Session Initiation Protocol VL - 190 ER - TY - CONF AU - Birkenheuer, Georg AU - Hagelweide, Wilke AU - Hagemeier, Björn AU - Japs, Viktor AU - Keller, Matthias AU - Mayr, Nikolas AU - Meyer, Jan AU - Schumacher, Tobias AU - Voß, Kerstin AU - Zajac, Markus ID - 2414 T2 - Proc. GI Informatiktage TI - PIRANHA – Hunter of Idle Resources VL - 2 ER - TY - CONF AU - Burchard, Lars-Olof AU - Heiss, Hans-Ulrich AU - Hovestadt, Matthias AU - Kao, Odej AU - Keller, Axel AU - Linnert, Barry ID - 1994 T2 - Proceedings of the GI-Meeting on Operating Systems TI - An Architecture for SLA-aware Resource Management ER - TY - CONF AU - Groppe, Sven AU - Böttcher, Stefan AU - Birkenheuer, Georg ID - 2416 T2 - Proc. Int. Conf. on Enterprise Information Systems (ICEIS) TI - Efficient Querying of Transformed XML Documents ER - TY - CONF AU - Kao, Odej AU - Hovestadt, Matthias AU - Keller, Axel ID - 1993 T2 - Proc. Advanced Research Workshop on High Perfomance Computing: Technology and Applications TI - SLA-aware Job Migration in Grid Environments ER - TY - CONF AU - Groppe, Sven AU - Böttcher, Stefan AU - Heckel, Reiko AU - Birkenheuer, Georg ID - 2417 T2 - Proc. East-European Conf. on Advances in Databases and Information Systems (ADBIS) TI - Using XSLT Stylesheets to Transform XPath Queries ER - TY - CONF AB - The next generation Grid will demand the Grid middleware to provide flexibility, transparency, and reliability. This implies the appliance of service level agreements to guarantee a negotiated level of quality of service. These requirements also affect the local resource management systems providing resources for the Grid. At this a gap between these demands and the features of today's resource management systems becomes apparent. In this paper we present an approach which closes this gap. Introducing the architecture of the virtual resource manager we highlight its main features of runtime responsibility, resource virtualization, information hiding, autonomy provision, and smooth integration of existing resource management system installations. AU - Burchard, Lars-Olof AU - Hovestadt, Matthias AU - Kao, Odej AU - Keller, Axel AU - Linnert, Barry ID - 1995 T2 - Proc. Int. Symposium on Cluster Computing and the Grid (CCGRID) TI - Virtual Resource Manager: An Architecture for SLA-aware Resource Management ER - TY - CONF AB - Nearly all existing HPC systems are operated by resource management systems based on the queuing approach. With the increasing acceptance of grid middleware like Globus, new requirements for the underlying local resource management systems arise. Features like advanced reservation or quality of service are needed to implement high level functions like co-allocation. However it is difficult to realize these features with a resource management system based on the queuing concept since it considers only the present resource usage. In this paper we present an approach which closes this gap. By assigning start times to each resource request, a complete schedule is planned. Advanced reservations are now easily possible. Based on this planning approach functions like diffuse requests, automatic duration extension, or service level agreements are described. We think they are useful to increase the usability, acceptance and performance of HPC machines. In the second part of this paper we present a planning based resource management system which already covers some of the mentioned features. AU - Hovestadt, Matthias AU - Kao, Odej AU - Keller, Axel AU - Streit, Achim ID - 1998 KW - High Performance Computing KW - Service Level Agreement KW - Grid Resource KW - Resource Management System KW - Advance Reservation T2 - Proc. Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP) TI - Scheduling in HPC Resource Management Systems: Queuing vs. Planning VL - 2862 ER - TY - CONF AU - P. Miller, Barton AU - Labarta, Jesús AU - Schintke, Florian AU - Simon, Jens ID - 2426 SN - 978-3-540-45706-0 T2 - Proc. European Conf. on Parallel Processing (Euro-Par) TI - Performance Evaluation, Analysis and Optimization VL - 2400 ER - TY - CONF AU - Schintke, Florian AU - Simon, Jens AU - Reinefeld, Alexander ID - 2431 T2 - Proc. Int. Conf. on Computational Science (ICCS) TI - A Cache Simulator for Shared Memory Systems VL - 2074 ER - TY - CONF AB - The Testbed and Applications working group of the European Grid Forum (EGrid) is actively building and experimenting with a grid infrastructure connecting several research-based supercomputing sites located in Europe. The paper reports on our first feasibility study: running a self-migrating version of the Cactus simulation code across the European grid testbed, including "live" remote data visualization and steering from different demonstration booths at Supercomputing 2000, in Dallas, TX. We report on the problems that had to be resolved for this endeavour and identify open research challenges for building production-grade grid environments. AU - Gehring, Jörn AU - Keller, Axel AU - Reinefeld, Alexander AU - Streit, Achim ID - 2000 T2 - Proc. Int. Symposium on Cluster Computing and the Grid (CCGRID) TI - Early Experiences with the EGrid Testbed ER - TY - CONF AB - The availability of commodity high performance components for workstations and networks made it possible to build up large, PC based compute clusters at modest costs. These clusters seem to be a realistic alternative to proprietary, massively parallel systems with respect to the price/performance ratio. However, from the administration point of view, those systems are still often solely a collection of autonomous nodes, connected by a fast short area network. Therefore, aiming at providing the best possible performance in daily work to all users, a lot of work has to be done before obtaining the expected result. The paper describes the problem areas we had to cope with during the integration of two large SCI clusters (one with 64 and one with 192 processors) in the environment of the Paderborn Center for Parallel Computing. AU - Keller, Axel AU - Krawinkel, Andreas ID - 2002 T2 - Proc. Int. Symposium on Cluster Computing and the Grid (CCGRID) TI - Lessons Learned While Operating Two Large SCI Clusters ER - TY - CONF AB - RsdEditor is a graphical user interface which produces specifications of computational resources. It is used in the RSD (Resource and Service Description) environment for specifying, registering, requesting and accessing resources and services in a metacomputer. RsdEditor was designed to be used by the administrators and users of metacomputing environments. At the administrator level, the GUI is used to describe the available computing and networking components of a metacomputer. At the user level, RsdEditor can be used to specify which characteristics of the computational resources are needed to execute a meta-application. This paper is organized as follows: it first introduces RsdEditor. It then briefly describes the RSD environment, and finally, it highlights various features and implementation issues of RsdEditor. AU - Baraglia, Ranieri AU - Keller, Axel AU - Laforenza, Domenico AU - Reinefeld, Alexander ID - 2003 T2 - Proc. Heterogenous Computing Workshop HCW at IPDPS TI - RsdEditor: A Graphical User Interface for Specifying Metacomputer Components ER - TY - CONF AU - Brune, Matthias AU - Reinefeld, Alexander AU - Varnholt, Jörg ID - 2436 T2 - Proc. Int. Symp. High-Performance Distributed Computing (HPDC) TI - A Resource Description Environment for Distributed Computing Systems ER - TY - CONF AB - With the recent availability of cost-effective network cards for the PCI bus, researchers have been tempted to build up large compute clusters with standard PCs. Many of them are operated with workstation cluster management software in high-throughput or single user mode. For very large clusters with more than 100 PEs, however, it becomes necessary to implement a full fledged resource management software that allows to partition the system for multi-user access. In this paper, we present our Computing Center Software (CCS), which was originally designed for managing massively parallel high-performance computers, and now adapted to modern workstation clusters. It provides - partitioning of exclusive and non-exclusive resources, - hardware-independent scheduling of interactive and batch jobs, - open, extensible interfaces to other resource management systems, - a high degree of reliability. AU - Brune, Matthias AU - Keller, Axel AU - Reinefeld, Alexander ID - 2004 T2 - Proc. Int. Conf. on High-Performance Computing and Networking (HPCN) TI - Resource Management for High-Performance PC Clusters ER - TY - CONF AB - CCS is a resource management system for parallel high-performance computers. At the user level, CCS provides vendor-independent access to parallel systems. At the system administrator level, CCS offers tools for controlling (i.e, specifying, configuring and scheduling) the system components that are operated in a computing center. Hence the name "Computing Center Software". CCS provides: hardware-independent scheduling of interactive and batch jobs; partitioning of exclusive and non-exclusive resources; open, extensible interfaces to other resource management systems; a high degree of reliability (e.g. automatic restart of crashed daemons); fault tolerance in the case of network breakdowns. The authors describe CCS as one important component for the access, job distribution, and administration of networked HPC systems in a metacomputing environment. AU - Keller, Axel AU - Reinefeld, Alexander ID - 2011 T2 - Proc. Heterogenous Computing Workshop (HCW) at IPPS TI - CCS Resource Management in Networked HPC Systems ER - TY - CONF AB - RSD (Resource and Service Description) is a scheme for specifying resources and services in complex heterogeneous computing systems and metacomputing environments. At the system administrator level, RSD is used to specify the available system components, such as the number of nodes, their interconnection topology, CPU speeds, and available software packages. At the user level, a GUI provides a comfortable, high-level interface for specifying system requests. A textual editor can be used for defining repetitive and recursive structures. This gives service providers the necessary flexibility for fine-grained specification of system topologies, interconnection networks, system and software dependent properties. All these representations are mapped onto a single, coherent internal object-oriented resource representation. Dynamic aspects (like network performance, availability of compute nodes, and compute node loads) are traced at runtime and included in the resource description to allow for optimal process mapping and dynamic task load balancing at runtime at the metacomputer level. This is done in a self-organizing way, with human system operators becoming only involved when new hardware/software components are installed. AU - Brune, Matthias AU - Gehring, Jörn AU - Keller, Axel AU - Reinefeld, Alexander ID - 2009 T2 - Proc. Int. Conf. on High-Performance Computing Systems (HPCS) TI - RSD - Resource and Service Description ER - TY - CONF AU - Brune, Matthias AU - Hellmann, Christian AU - Keller, Axel ID - 2013 T2 - Proc. Workshop Hypercomputing at ITG/GI-Conference Architekur von Rechensystemen TI - A Closer Step towards Management of Metacomputing-Resources ER - TY - CONF AU - Fischer, Markus AU - Simon, Jens ID - 2441 T2 - Proc. European Parallel Virtual Machine / Message Passing Interface Users’ Group Meeting (EuroPVM/MPI) TI - Embedding SCI into PVM VL - 1332 ER - TY - CONF AU - Heinz, Oliver AU - Simon, Jens ID - 2439 T2 - Proc. Int. Conf. on Architecture of Computing Systems (ARCS) TI - Experiences with a SCI Multiprocessor Workstation Cluster ER - TY - CONF AU - Simon, Jens AU - Heinz, Oliver ID - 2440 T2 - Proc. Workshops im Rahmen der 14. ITG/GI-Fachtagung Architektur von Rechensystemen TI - SCI multiprocessor PC cluster in a WindowsNT environment ER - TY - CONF AU - Reinefeld, Alexander AU - Baraglia, Ranieri AU - Decker, Thomas AU - Gehring, Jörn AU - Laforenza, Domenico AU - Ramme, Friedhelm AU - Römke, Thomas AU - Simon, Jens ID - 2442 T2 - Proc. Heterogenous Computing Workshop (HCW) TI - The MOL Project: An Open, Extensible Metacomputer ER - TY - CONF AU - Simon, Jens AU - Weicker, Reinhold AU - Vieth, Marco ID - 2438 SN - 978-3-540-69549-3 T2 - Proc. European Conf. on Parallel Processing (Euro-Par) TI - Workload Analysis of Computation Intensive Tasks: Case Study on SPEC CPU95 Benchmarks VL - 1300 ER - TY - CONF AU - Simon, Jens AU - Wierum, Jens-Michael ID - 2445 T2 - Proc. European Conf. on Parallel Processing (Euro-Par) TI - Accurate Performance Prediction for Massively Parallel Systems and its Applications VL - 1124 ER - TY - CONF AU - Simon, Jens AU - Wierum, Jens-Michael ID - 2444 T2 - Proc. Annual Int. Conf. on High-Performance Computers (HPCS) TI - Performance Prediction of Benchmark Programs for Massively Parallel Architectures ER - TY - CONF AU - Simon, Jens AU - Wierum, Jens-Michael ID - 2443 SN - 978-3-540-61142-4 T2 - Proc. Int. Conf. on High-Performance Computing and Networking (HPCN-Europe) TI - Sequential Performance versus Scalability: Optimizing Parallel LU-Decomposition VL - 1067 ER -