TY  - JOUR
AB  - Branch and bound (B&B) algorithms structure the search space as a tree and eliminate infeasible solutions early by pruning subtrees that cannot lead to a valid or optimal solution. Custom hardware designs significantly accelerate the execution of these algorithms. In this article, we demonstrate a high-performance B&B implementation on FPGAs. First, we identify general elements of B&B algorithms and describe their implementation as a finite state machine. Then, we introduce workers that autonomously cooperate using work stealing to allow parallel execution and full utilization of the target FPGA. Finally, we explore advantages of instance-specific designs that target a specific problem instance to improve performance.

We evaluate our concepts by applying them to a branch and bound problem, the reconstruction of corrupted AES keys obtained from cold-boot attacks. The evaluation shows that our work stealing approach is scalable with the available resources and provides speedups proportional to the number of workers. Instance-specific designs allow us to achieve an overall speedup of 47× compared to the fastest implementation of AES key reconstruction so far. Finally, we demonstrate how instance-specific designs can be generated just-in-time such that the provided speedups outweigh the additional time required for design synthesis.
AU  - Riebler, Heinrich
AU  - Lass, Michael
AU  - Mittendorf, Robert
AU  - Löcke, Thomas
AU  - Plessl, Christian
ID  - 18
IS  - 3
JF  - ACM Transactions on Reconfigurable Technology and Systems (TRETS)
KW  - coldboot
SN  - 1936-7406
TI  - Efficient Branch and Bound on FPGAs Using Work Stealing and Instance-Specific Designs
VL  - 10
ER  - 
TY  - CONF
AB  - Compared to classical HDL designs, generating FPGA designs with high-level synthesis from an OpenCL specification promises easier exploration of different design alternatives and, through ready-to-use infrastructure and common abstractions for host and memory interfaces, easier portability between different FPGA families. In this work, we evaluate the extent of this promise. To this end, we present a parameterized FDTD implementation for photonic microcavity simulations. Our design can trade off different forms of parallelism and works for two independent OpenCL-based FPGA design flows. Hence, we can target FPGAs from different vendors and different FPGA families. We describe how we used pre-processor macros to achieve this flexibility and to work around different shortcomings of the current tools. Choosing the right design configurations, we are able to present two extremely competitive solutions for very different FPGA targets, reaching up to 172 GFLOPS sustained performance. With the portability and flexibility demonstrated, code developers not only avoid vendor lock-in, but can even make best use of real trade-offs between different architectures.
AU  - Kenter, Tobias
AU  - Förstner, Jens
AU  - Plessl, Christian
ID  - 1592
KW  - tet_topic_hpc
T2  - Proc. Int. Conf. on Field Programmable Logic and Applications (FPL)
TI  - Flexible FPGA design for FDTD using OpenCL
ER  - 
TY  - JOUR
AU  - Schumacher, Jörn
AU  - Plessl, Christian
AU  - Vandelli, Wainer
ID  - 1589
JF  - Journal of Physics: Conference Series
TI  - High-Throughput and Low-Latency Network Communication with NetIO
VL  - 898
ER  - 
TY  - CONF
AB  - Version Control Systems (VCS) are a valuable tool for software development
and document management. Both client/server and distributed (Peer-to-Peer)
models exist, with the latter (e.g., Git and Mercurial) becoming
increasingly popular. Their distributed nature introduces complications,
especially concerning security: it is hard to control the dissemination of
contents stored in distributed VCS as they rely on replication of complete
repositories to any involved user.

We overcome this issue by designing and implementing a concept for
cryptography-enforced access control which is transparent to the user. Use
of field-tested schemes (end-to-end encryption, digital signatures) allows
for strong security, while adoption of convergent encryption and
content-defined chunking retains storage efficiency. The concept is
seamlessly integrated into Mercurial, respecting its distributed storage
concept, to ensure practical usability and compatibility with existing
deployments.
AU  - Lass, Michael
AU  - Leibenger, Dominik
AU  - Sorge, Christoph
ID  - 19
KW  - access control
KW  - distributed version control systems
KW  - mercurial
KW  - peer-to-peer
KW  - convergent encryption
KW  - confidentiality
KW  - authenticity
SN  - 978-1-5090-2054-6
T2  - Proc. 41st Conference on Local Computer Networks (LCN)
TI  - Confidentiality and Authenticity for Distributed Version Control Systems - A Mercurial Extension
ER  - 
TY  - GEN
AU  - Tölke, Christian
ID  - 5418
TI  - Sicherheit von hybriden FPGA-Systemen in der industriellen Automatisierungstechnik -- Anforderungen und Umsetzung
ER  - 
TY  - GEN
AU  - Wüllrich, Gunnar
ID  - 5420
TI  - Dynamic OpenCL Task Scheduling for Energy and Performance in a Heterogeneous Environment
ER  - 
TY  - THES
AU  - Kenter, Tobias
ID  - 161
TI  - Reconfigurable Accelerators in the World of General-Purpose Computing
ER  - 
TY  - CHAP
AB  - In this chapter, we present an introduction to the ReconOS operating system for reconfigurable computing. ReconOS offers a unified multi-threaded programming model and operating system services for threads executing in software and threads mapped to reconfigurable hardware. By supporting standard POSIX operating system functions for both software and hardware threads, ReconOS particularly caters to developers with a software background, because developers can use well-known mechanisms such as semaphores, mutexes, condition variables, and message queues for developing hybrid applications with threads running on the CPU and FPGA concurrently. Through the semantic integration of hardware accelerators into a standard operating system environment, ReconOS allows for rapid design space exploration, supports a structured application development process and improves the portability of applications between different reconfigurable computing systems.
AU  - Agne, Andreas
AU  - Platzner, Marco
AU  - Plessl, Christian
AU  - Happe, Markus
AU  - Lübbers, Enno
ED  - Koch, Dirk
ED  - Hannig, Frank
ED  - Ziener, Daniel
ID  - 29
SN  - 978-3-319-26406-6
T2  - FPGAs for Software Programmers
TI  - ReconOS
ER  - 
TY  - CONF
AU  - Riebler, Heinrich
AU  - Vaz, Gavin Francis
AU  - Plessl, Christian
AU  - Trainiti, Ettore M. G.
AU  - Durelli, Gianluca C.
AU  - Bolchini, Cristiana
ID  - 31
T2  - Proc. HiPEAC Workshop on Reconfigurable Computing (WRC)
TI  - Using Just-in-Time Code Generation for Transparent Resource Management in Heterogeneous Systems
ER  - 
TY  - CONF
AU  - Kenter, Tobias
AU  - Plessl, Christian
ID  - 24
T2  - Proc. Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC)
TI  - Microdisk Cavity FDTD Simulation on FPGA using OpenCL
ER  -