TY - JOUR AB - Intel recently introduced the Heterogeneous Architecture Research Platform, HARP. In this platform, the Central Processing Unit and a Field-Programmable Gate Array are connected through a high-bandwidth, low-latency interconnect and both share DRAM memory. For this platform, Open Computing Language (OpenCL), a High-Level Synthesis (HLS) language, is made available. By making use of HLS, a faster design cycle can be achieved compared to programming in a traditional hardware description language. This, however, comes at the cost of having less control over the hardware implementation. We will investigate how OpenCL can be applied to implement a real-time guided image filter on the HARP platform. In the first phase, the performance-critical parameters of the OpenCL programming model are defined using several specialized benchmarks. In a second phase, the guided image filter algorithm is implemented using the insights gained in the first phase. Both a floating-point and a fixed-point implementation were developed for this algorithm, based on a sliding window implementation. This resulted in a maximum floating-point performance of 135 GFLOPS, a maximum fixed-point performance of 430 GOPS and a throughput of HD color images at 74 frames per second. AU - Faict, Thomas AU - D’Hollander, Erik H. AU - Goossens, Bart ID - 16422 JF - Algorithms KW - pc2-harp-ressources SN - 1999-4893 TI - Mapping a Guided Image Filter on the HARP Reconfigurable Architecture Using OpenCL ER - TY - JOUR AB - Heterogeneous computing that exploits simultaneous co-processing with different device types has been shown to be effective at both increasing performance and reducing energy consumption. In this paper, we extend a scheduling framework encapsulated in a high-level C++ template and previously developed for heterogeneous chips comprising CPU and GPU cores, to new high-performance platforms for the data center, which include a cache coherent FPGA fabric and many-core CPU resources. Our goal is to evaluate the suitability of our framework with these new FPGA-based platforms, identifying performance benefits and limitations.We target the state-of-the-art HARP processor that includes 14 high-end Xeon classes tightly coupled to a FPGA device located in the same package. We select eight benchmarks from the high-performance computing domain that have been ported and optimized for this heterogeneous platform. The results show that a dynamic and adaptive scheduler that exploits simultaneous processing among the devices can improve performance up to a factor of 8 × compared to the best alternative solutions that only use the CPU cores or the FPGA fabric. Moreover, our proposal achieves up to 15% and 37% of improvement compared to the best heterogeneous solutions found with a dynamic and static schedulers, respectively. AU - Rodríguez, Andrés AU - Navarro, Angeles AU - Asenjo, Rafael AU - Corbera, Francisco AU - Gran, Rubén AU - Suárez, Darío AU - Nunez-Yanez, Jose ID - 16423 JF - The Journal of Supercomputing KW - pc2-harp-ressources SN - 0920-8542 TI - Parallel multiprocessing and scheduling on the heterogeneous Xeon+FPGA platform ER - TY - CONF AB - Transactional Memory (TM) has been considered as a promising alternative to existing synchronization operations, which are often the largest stumbling block to unleashing parallelism of applications. Efficient implementations of TM, however, are challenging due to the tension between lowering performance overhead and avoiding unnecessary aborts. In this paper, we present Reachability-based Optimistic Concurrency Control for Transactional Memory (ROCoCoTM), a novel scheme which offloads concurrency control (CC) algorithms, the central building blocks of TM systems, to reconfigurable hardware. To reduce the abort rate, an innovative formalization of mainstream CC algorithms is developed to reveal a common restriction that leads to unnecessary aborts. This restriction is resolved by the ROCoCo algorithm with a centralized validation phase, which can be efficiently pipelined in hardware. Thanks to a high-performance offloading engine implemented in reconfigurable hardware, ROCoCo algorithm results in decreased abort rates and reduced performance overhead. The whole system is implemented on Intel's HARP2 platform and evaluated with the STAMP benchmark suite. Experiments show 1.55x and 8.05x geomean speedup over TinySTM and an HTM based on Intel TSX, respectively. Given the fast-growing deployment of commodity CPU-FPGA platforms, ROCoCoTM paves the way for software programmers to exploit heterogeneous computing resources with a high-level transactional abstraction to effectively extract the parallelism in modern applications. AU - Li, Zhaoshi AU - Liu, Leibo AU - Deng, Yangdong AU - Wang, Jiawei AU - Liu, Zhiwei AU - Yin, Shouyi AU - Wei, Shaojun ID - 16427 KW - pc2-harp-ressources SN - 9781450369381 T2 - Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture TI - FPGA-Accelerated Optimistic Concurrency Control for Transactional Memory ER - TY - CONF AU - Rehlaender, Philipp AU - Grote, Tobias AU - Tikhonov, Sergey AU - Niejende, Hugues AU - Schafmeister, Frank AU - Bocker, Joachim AU - Thiemann, Peter ID - 16433 SN - 9789075815313 T2 - 2019 21st European Conference on Power Electronics and Applications (EPE '19 ECCE Europe) TI - A PCB Integrated Winding Using a Litz Structure for a Wireless Charging Coil ER - TY - CONF AU - Rehlaender, Philipp AU - Schafmeister, Frank AU - Bocker, Joachim AU - Grote, Tobias ID - 16438 SN - 9781728136660 T2 - 2019 IEEE 28th International Symposium on Industrial Electronics (ISIE) TI - Analytical Topology Comparison for a Single Stage On-Board EV-Battery Converter ER - TY - CHAP AU - Rehlaender, Philipp AU - Schroeer, Maik AU - Chadha, Gavneet AU - Schwung, Andreas ID - 16443 SN - 2661-8141 T2 - Proceedings of the International Neural Networks Society TI - Traffic Sign Detection Using R-CNN ER - TY - JOUR AU - Sahai, Tuhin AU - Ziessler, Adrian AU - Klus, Stefan AU - Dellnitz, Michael ID - 16709 JF - Nonlinear Dynamics SN - 0924-090X TI - Continuous relaxations for the traveling salesman problem ER - TY - CONF AU - Pfeifer, Florian AU - Dietrich, André AU - Marten, Thorsten AU - Tröster, Thomas AU - Nacke, Bernard ID - 16793 SN - 978-3-95735-104-3 T2 - Proceedings of 7th International Conference on Hot Sheet Metal Forming of High-Performance Steel TI - Investigation on Inductive Heating of Sheet Metal for an Industrial Hot Stamping Process ER - TY - CONF AU - Striewe, Jan André AU - Thomas, Robert AU - Fischer, Fabian AU - Wiens, Timo AU - Tröster, Thomas ID - 16794 TI - Energieabsorptions- und Versagensverhalten eines automobilen Seitenschwellers mit lokaler Verstärkung aus kohlenstofffaserverstärktem Kunststoff nach Alterung ER - TY - GEN AU - Ahlers, Dominik AU - Tröster, Thomas ID - 16825 TI - Performance Parameters and HIP Routes for additively manufactured titanium alloy Ti6Al4V ER - TY - CONF AU - Camberg, Alan Adam AU - Hielscher, Christian ID - 16826 T2 - Aachen Body Engineering Days 2019 TI - A holistic approach to the lightweight design of tailored structural components using the example of a hybrid A-pillar ER - TY - CONF AU - Camberg, Alan Adam AU - Tröster, Thomas ID - 16827 T2 - 26. Sächsische Fachtagung Umformtechnik TI - Challenges in fracture modeling under non-isothermal forming conditions using the example of a new forming process for aluminum blanks ER - TY - CONF AU - Tinkloh, Steffen Rainer AU - Wu, Tao AU - Tröster, Thomas AU - Niendorf, Thomas ID - 16831 TI - A micromechanical based finite element simulation of process induced residual stresses in metal-CFRP-hybrid structures ER - TY - GEN AB - In this work we describe our results achieved in the ProtestNews Lab at CLEF 2019. To tackle the problems of event sentence detection and event extraction we decided to use contextualized string embeddings. The models were trained on a data corpus collected from Indian news sources, but evaluated on data obtained from news sources from other countries as well, such as China. Our models have obtained competitive results and have scored 3rd in the event sentence detection task and 1st in the event extraction task based on average F1-scores for different test datasets. AU - Skitalinskaya, Gabriella AU - Klaff, Jonas AU - Spliethöver, Maximilian ID - 16847 TI - CLEF ProtestNews Lab 2019: Contextualized Word Embeddings for Event Sentence Detection and Event Extraction VL - 2380 ER - TY - GEN AB - State-of-the-art frameworks for generating approximate circuits usually rely on information gained through circuit synthesis and/or verification to explore the search space and to find an optimal solution. Throughout the process, a large number of circuits may be subject to processing, leading to considerable runtimes. In this work, we propose a search which takes error bounds and pre-computed impact factors into account to reduce the number of invoked synthesis and verification processes. In our experimental results, we achieved speed-ups of up to 76x while area savings remain comparable to the reference search method, simulated annealing. AU - Witschen, Linus Matthias AU - Ghasemzadeh Mohammadi, Hassan AU - Artmann, Matthias AU - Platzner, Marco ID - 16853 KW - Approximate computing KW - parameter selection KW - search space exploration KW - verification KW - circuit synthesis T2 - Fourth Workshop on Approximate Computing (AxC 2019) TI - Jump Search: A Fast Technique for the Synthesis of Approximate Circuits ER - TY - JOUR AU - Atkins, Marc AU - Gilroy, Bernard Michael AU - Seiler, Volker ID - 16882 IS - 2 JF - Intereconomics TI - New Dimensions of Service Offshoring in World Trade VL - 54 ER - TY - CHAP AU - Gilroy, Bernard Michael AU - Golderbein, Alexander AU - Peitz, Christian AU - Stöckmann, Nico ED - Fortz, B. ED - Labbé, M. ID - 16883 T2 - Operations Research Proceedings 2018 TI - The Impact of Monetary Policy on Investment Bank Profitability in Unequal Economies ER - TY - JOUR AU - Krimphove, Dieter AU - Peitz, Christian ID - 16884 JF - Fintechs: Rechtliche Grundlagen moderner Finanztechnologien TI - Social-Trading und Copy-Trading ER - TY - JOUR AU - Heinen, Matthias AU - Vrabec, Jadran ID - 16955 JF - The Journal of Chemical Physics KW - pc2-ressources SN - 0021-9606 TI - Evaporation sampled by stationary molecular dynamics simulation ER - TY - JOUR AU - Fingerhut, Robin AU - Herres, Gerhard AU - Vrabec, Jadran ID - 16958 JF - Molecular Physics KW - pc2-ressources SN - 0026-8976 TI - Thermodynamic factor of quaternary mixtures from Kirkwood–Buff integration ER -