--- _id: '46120' abstract: - lang: eng text: The rise of exascale supercomputers has fueled competition among GPU vendors, driving lattice QCD developers to write code that supports multiple APIs. Moreover, new developments in algorithms and physics research require frequent updates to existing software. These challenges have to be balanced against constantly changing personnel. At the same time, there is a wide range of applications for HISQ fermions in QCD studies. This situation encourages the development of software featuring a HISQ action that is flexible, high-performing, open source, easy to use, and easy to adapt. In this technical paper, we explain the design strategy, provide implementation details, list available algorithms and modules, and show key performance indicators for SIMULATeQCD, a simple multi-GPU lattice code for large-scale QCD calculations, mainly developed and used by the HotQCD collaboration. The code is publicly available on GitHub. author: - first_name: Lukas full_name: Mazur, Lukas id: '90492' last_name: Mazur orcid: ' 0000-0001-6304-7082' - first_name: Dennis full_name: Bollweg, Dennis last_name: Bollweg - first_name: David A. full_name: Clarke, David A. last_name: Clarke - first_name: Luis full_name: Altenkort, Luis last_name: Altenkort - first_name: Olaf full_name: Kaczmarek, Olaf last_name: Kaczmarek - first_name: Rasmus full_name: Larsen, Rasmus last_name: Larsen - first_name: Hai-Tao full_name: Shu, Hai-Tao last_name: Shu - first_name: Jishnu full_name: Goswami, Jishnu last_name: Goswami - first_name: Philipp full_name: Scior, Philipp last_name: Scior - first_name: Hauke full_name: Sandmeyer, Hauke last_name: Sandmeyer - first_name: Marius full_name: Neumann, Marius last_name: Neumann - first_name: Henrik full_name: Dick, Henrik last_name: Dick - first_name: Sajid full_name: Ali, Sajid last_name: Ali - first_name: Jangho full_name: Kim, Jangho last_name: Kim - first_name: Christian full_name: Schmidt, Christian last_name: Schmidt - first_name: Peter full_name: Petreczky, Peter last_name: Petreczky - first_name: Swagato full_name: Mukherjee, Swagato last_name: Mukherjee citation: ama: 'Mazur L, Bollweg D, Clarke DA, et al. SIMULATeQCD: A simple multi-GPU lattice code for QCD calculations. Computer Physics Communications. Published online 2023. doi:10.48550/ARXIV.2306.01098' apa: 'Mazur, L., Bollweg, D., Clarke, D. A., Altenkort, L., Kaczmarek, O., Larsen, R., Shu, H.-T., Goswami, J., Scior, P., Sandmeyer, H., Neumann, M., Dick, H., Ali, S., Kim, J., Schmidt, C., Petreczky, P., & Mukherjee, S. (2023). SIMULATeQCD: A simple multi-GPU lattice code for QCD calculations. Computer Physics Communications. https://doi.org/10.48550/ARXIV.2306.01098' bibtex: '@article{Mazur_Bollweg_Clarke_Altenkort_Kaczmarek_Larsen_Shu_Goswami_Scior_Sandmeyer_et al._2023, title={SIMULATeQCD: A simple multi-GPU lattice code for QCD calculations}, DOI={10.48550/ARXIV.2306.01098}, journal={Computer Physics Communications}, author={Mazur, Lukas and Bollweg, Dennis and Clarke, David A. and Altenkort, Luis and Kaczmarek, Olaf and Larsen, Rasmus and Shu, Hai-Tao and Goswami, Jishnu and Scior, Philipp and Sandmeyer, Hauke and et al.}, year={2023} }' chicago: 'Mazur, Lukas, Dennis Bollweg, David A. Clarke, Luis Altenkort, Olaf Kaczmarek, Rasmus Larsen, Hai-Tao Shu, et al. “SIMULATeQCD: A Simple Multi-GPU Lattice Code for QCD Calculations.” Computer Physics Communications, 2023. https://doi.org/10.48550/ARXIV.2306.01098.' ieee: 'L. Mazur et al., “SIMULATeQCD: A simple multi-GPU lattice code for QCD calculations,” Computer Physics Communications, 2023, doi: 10.48550/ARXIV.2306.01098.' mla: 'Mazur, Lukas, et al. “SIMULATeQCD: A Simple Multi-GPU Lattice Code for QCD Calculations.” Computer Physics Communications, 2023, doi:10.48550/ARXIV.2306.01098.' short: L. Mazur, D. Bollweg, D.A. Clarke, L. Altenkort, O. Kaczmarek, R. Larsen, H.-T. Shu, J. Goswami, P. Scior, H. Sandmeyer, M. Neumann, H. Dick, S. Ali, J. Kim, C. Schmidt, P. Petreczky, S. Mukherjee, Computer Physics Communications (2023). date_created: 2023-07-24T10:55:25Z date_updated: 2023-07-26T09:21:35Z department: - _id: '27' doi: 10.48550/ARXIV.2306.01098 language: - iso: eng publication: Computer Physics Communications status: public title: 'SIMULATeQCD: A simple multi-GPU lattice code for QCD calculations' type: journal_article user_id: '90492' year: '2023' ... --- _id: '46119' article_number: '014503' author: - first_name: Luis full_name: Altenkort, Luis last_name: Altenkort - first_name: Alexander M. full_name: Eller, Alexander M. last_name: Eller - first_name: Anthony full_name: Francis, Anthony last_name: Francis - first_name: Olaf full_name: Kaczmarek, Olaf last_name: Kaczmarek - first_name: Lukas full_name: Mazur, Lukas id: '90492' last_name: Mazur orcid: ' 0000-0001-6304-7082' - first_name: Guy D. full_name: Moore, Guy D. last_name: Moore - first_name: Hai-Tao full_name: Shu, Hai-Tao last_name: Shu citation: ama: Altenkort L, Eller AM, Francis A, et al. Viscosity of pure-glue QCD from the lattice. Physical Review D. 2023;108(1). doi:10.1103/physrevd.108.014503 apa: Altenkort, L., Eller, A. M., Francis, A., Kaczmarek, O., Mazur, L., Moore, G. D., & Shu, H.-T. (2023). Viscosity of pure-glue QCD from the lattice. Physical Review D, 108(1), Article 014503. https://doi.org/10.1103/physrevd.108.014503 bibtex: '@article{Altenkort_Eller_Francis_Kaczmarek_Mazur_Moore_Shu_2023, title={Viscosity of pure-glue QCD from the lattice}, volume={108}, DOI={10.1103/physrevd.108.014503}, number={1014503}, journal={Physical Review D}, publisher={American Physical Society (APS)}, author={Altenkort, Luis and Eller, Alexander M. and Francis, Anthony and Kaczmarek, Olaf and Mazur, Lukas and Moore, Guy D. and Shu, Hai-Tao}, year={2023} }' chicago: Altenkort, Luis, Alexander M. Eller, Anthony Francis, Olaf Kaczmarek, Lukas Mazur, Guy D. Moore, and Hai-Tao Shu. “Viscosity of Pure-Glue QCD from the Lattice.” Physical Review D 108, no. 1 (2023). https://doi.org/10.1103/physrevd.108.014503. ieee: 'L. Altenkort et al., “Viscosity of pure-glue QCD from the lattice,” Physical Review D, vol. 108, no. 1, Art. no. 014503, 2023, doi: 10.1103/physrevd.108.014503.' mla: Altenkort, Luis, et al. “Viscosity of Pure-Glue QCD from the Lattice.” Physical Review D, vol. 108, no. 1, 014503, American Physical Society (APS), 2023, doi:10.1103/physrevd.108.014503. short: L. Altenkort, A.M. Eller, A. Francis, O. Kaczmarek, L. Mazur, G.D. Moore, H.-T. Shu, Physical Review D 108 (2023). date_created: 2023-07-24T10:54:18Z date_updated: 2023-07-26T09:23:32Z department: - _id: '27' doi: 10.1103/physrevd.108.014503 intvolume: ' 108' issue: '1' language: - iso: eng publication: Physical Review D publication_identifier: issn: - 2470-0010 - 2470-0029 publication_status: published publisher: American Physical Society (APS) quality_controlled: '1' status: public title: Viscosity of pure-glue QCD from the lattice type: journal_article user_id: '90492' volume: 108 year: '2023' ... --- _id: '38041' abstract: - lang: eng text: "While FPGA accelerator boards and their respective high-level design tools are maturing, there is still a lack of multi-FPGA applications, libraries, and not least, benchmarks and reference implementations towards sustained HPC usage of these devices. As in the early days of GPUs in HPC, for workloads that can reasonably be decoupled into loosely coupled working sets, multi-accelerator support can be achieved by using standard communication interfaces like MPI on the host side. However, for performance and productivity, some applications can profit from a tighter coupling of the accelerators. FPGAs offer unique opportunities here when extending the dataflow characteristics to their communication interfaces.\r\n \ In this work, we extend the HPCC FPGA benchmark suite by multi-FPGA support and three missing benchmarks that particularly characterize or stress inter-device communication: b_eff, PTRANS, and LINPACK. With all benchmarks implemented for current boards with Intel and Xilinx FPGAs, we established a baseline for multi-FPGA performance. Additionally, for the communication-centric benchmarks, we explored the potential of direct FPGA-to-FPGA communication with a circuit-switched inter-FPGA network that is currently only available for one of the boards. The evaluation with parallel execution on up to 26 FPGA boards makes use of one of the largest academic FPGA installations." author: - first_name: Marius full_name: Meyer, Marius id: '40778' last_name: Meyer - first_name: Tobias full_name: Kenter, Tobias id: '3145' last_name: Kenter - first_name: Christian full_name: Plessl, Christian id: '16153' last_name: Plessl orcid: 0000-0001-5728-9982 citation: ama: Meyer M, Kenter T, Plessl C. Multi-FPGA Designs and Scaling of HPC Challenge Benchmarks via MPI and Circuit-Switched Inter-FPGA Networks. ACM Transactions on Reconfigurable Technology and Systems. Published online 2023. doi:10.1145/3576200 apa: Meyer, M., Kenter, T., & Plessl, C. (2023). Multi-FPGA Designs and Scaling of HPC Challenge Benchmarks via MPI and Circuit-Switched Inter-FPGA Networks. ACM Transactions on Reconfigurable Technology and Systems. https://doi.org/10.1145/3576200 bibtex: '@article{Meyer_Kenter_Plessl_2023, title={Multi-FPGA Designs and Scaling of HPC Challenge Benchmarks via MPI and Circuit-Switched Inter-FPGA Networks}, DOI={10.1145/3576200}, journal={ACM Transactions on Reconfigurable Technology and Systems}, publisher={Association for Computing Machinery (ACM)}, author={Meyer, Marius and Kenter, Tobias and Plessl, Christian}, year={2023} }' chicago: Meyer, Marius, Tobias Kenter, and Christian Plessl. “Multi-FPGA Designs and Scaling of HPC Challenge Benchmarks via MPI and Circuit-Switched Inter-FPGA Networks.” ACM Transactions on Reconfigurable Technology and Systems, 2023. https://doi.org/10.1145/3576200. ieee: 'M. Meyer, T. Kenter, and C. Plessl, “Multi-FPGA Designs and Scaling of HPC Challenge Benchmarks via MPI and Circuit-Switched Inter-FPGA Networks,” ACM Transactions on Reconfigurable Technology and Systems, 2023, doi: 10.1145/3576200.' mla: Meyer, Marius, et al. “Multi-FPGA Designs and Scaling of HPC Challenge Benchmarks via MPI and Circuit-Switched Inter-FPGA Networks.” ACM Transactions on Reconfigurable Technology and Systems, Association for Computing Machinery (ACM), 2023, doi:10.1145/3576200. short: M. Meyer, T. Kenter, C. Plessl, ACM Transactions on Reconfigurable Technology and Systems (2023). date_created: 2023-01-23T08:40:42Z date_updated: 2023-07-28T08:02:05Z department: - _id: '27' - _id: '518' doi: 10.1145/3576200 keyword: - General Computer Science language: - iso: eng main_file_link: - open_access: '1' url: https://dl.acm.org/doi/10.1145/3576200 oa: '1' project: - _id: '52' name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing' - _id: '4' name: 'SFB 901 - C: SFB 901 - Project Area C' - _id: '1' grant_number: '160364472' name: 'SFB 901: SFB 901' - _id: '14' grant_number: '160364472' name: 'SFB 901 - C2: SFB 901 - Subproject C2' publication: ACM Transactions on Reconfigurable Technology and Systems publication_identifier: issn: - 1936-7406 - 1936-7414 publication_status: published publisher: Association for Computing Machinery (ACM) quality_controlled: '1' status: public title: Multi-FPGA Designs and Scaling of HPC Challenge Benchmarks via MPI and Circuit-Switched Inter-FPGA Networks type: journal_article user_id: '24135' year: '2023' ... --- _id: '45893' author: - first_name: Tim full_name: Hansmeier, Tim id: '49992' last_name: Hansmeier orcid: 0000-0003-1377-3339 - first_name: Tobias full_name: Kenter, Tobias id: '3145' last_name: Kenter - first_name: Marius full_name: Meyer, Marius id: '40778' last_name: Meyer - first_name: Heinrich full_name: Riebler, Heinrich id: '8961' last_name: Riebler - first_name: Marco full_name: Platzner, Marco id: '398' last_name: Platzner - first_name: Christian full_name: Plessl, Christian id: '16153' last_name: Plessl orcid: 0000-0001-5728-9982 citation: ama: 'Hansmeier T, Kenter T, Meyer M, Riebler H, Platzner M, Plessl C. Compute Centers I: Heterogeneous Execution Environments. In: Haake C-J, Meyer auf der Heide F, Platzner M, Wachsmuth H, Wehrheim H, eds. On-The-Fly Computing -- Individualized IT-Services in Dynamic Markets. Vol 412. Verlagsschriftenreihe des Heinz Nixdorf Instituts. Heinz Nixdorf Institut, Universität Paderborn; 2023:165-182. doi:10.5281/zenodo.8068642' apa: 'Hansmeier, T., Kenter, T., Meyer, M., Riebler, H., Platzner, M., & Plessl, C. (2023). Compute Centers I: Heterogeneous Execution Environments. In C.-J. Haake, F. Meyer auf der Heide, M. Platzner, H. Wachsmuth, & H. Wehrheim (Eds.), On-The-Fly Computing -- Individualized IT-services in dynamic markets (Vol. 412, pp. 165–182). Heinz Nixdorf Institut, Universität Paderborn. https://doi.org/10.5281/zenodo.8068642' bibtex: '@inbook{Hansmeier_Kenter_Meyer_Riebler_Platzner_Plessl_2023, place={Paderborn}, series={Verlagsschriftenreihe des Heinz Nixdorf Instituts}, title={Compute Centers I: Heterogeneous Execution Environments}, volume={412}, DOI={10.5281/zenodo.8068642}, booktitle={On-The-Fly Computing -- Individualized IT-services in dynamic markets}, publisher={Heinz Nixdorf Institut, Universität Paderborn}, author={Hansmeier, Tim and Kenter, Tobias and Meyer, Marius and Riebler, Heinrich and Platzner, Marco and Plessl, Christian}, editor={Haake, Claus-Jochen and Meyer auf der Heide, Friedhelm and Platzner, Marco and Wachsmuth, Henning and Wehrheim, Heike}, year={2023}, pages={165–182}, collection={Verlagsschriftenreihe des Heinz Nixdorf Instituts} }' chicago: 'Hansmeier, Tim, Tobias Kenter, Marius Meyer, Heinrich Riebler, Marco Platzner, and Christian Plessl. “Compute Centers I: Heterogeneous Execution Environments.” In On-The-Fly Computing -- Individualized IT-Services in Dynamic Markets, edited by Claus-Jochen Haake, Friedhelm Meyer auf der Heide, Marco Platzner, Henning Wachsmuth, and Heike Wehrheim, 412:165–82. Verlagsschriftenreihe Des Heinz Nixdorf Instituts. Paderborn: Heinz Nixdorf Institut, Universität Paderborn, 2023. https://doi.org/10.5281/zenodo.8068642.' ieee: 'T. Hansmeier, T. Kenter, M. Meyer, H. Riebler, M. Platzner, and C. Plessl, “Compute Centers I: Heterogeneous Execution Environments,” in On-The-Fly Computing -- Individualized IT-services in dynamic markets, vol. 412, C.-J. Haake, F. Meyer auf der Heide, M. Platzner, H. Wachsmuth, and H. Wehrheim, Eds. Paderborn: Heinz Nixdorf Institut, Universität Paderborn, 2023, pp. 165–182.' mla: 'Hansmeier, Tim, et al. “Compute Centers I: Heterogeneous Execution Environments.” On-The-Fly Computing -- Individualized IT-Services in Dynamic Markets, edited by Claus-Jochen Haake et al., vol. 412, Heinz Nixdorf Institut, Universität Paderborn, 2023, pp. 165–82, doi:10.5281/zenodo.8068642.' short: 'T. Hansmeier, T. Kenter, M. Meyer, H. Riebler, M. Platzner, C. Plessl, in: C.-J. Haake, F. Meyer auf der Heide, M. Platzner, H. Wachsmuth, H. Wehrheim (Eds.), On-The-Fly Computing -- Individualized IT-Services in Dynamic Markets, Heinz Nixdorf Institut, Universität Paderborn, Paderborn, 2023, pp. 165–182.' date_created: 2023-07-07T08:15:45Z date_updated: 2023-07-28T09:38:14Z ddc: - '004' department: - _id: '7' - _id: '27' - _id: '518' doi: 10.5281/zenodo.8068642 editor: - first_name: Claus-Jochen full_name: Haake, Claus-Jochen last_name: Haake - first_name: Friedhelm full_name: Meyer auf der Heide, Friedhelm last_name: Meyer auf der Heide - first_name: Marco full_name: Platzner, Marco last_name: Platzner - first_name: Henning full_name: Wachsmuth, Henning last_name: Wachsmuth - first_name: Heike full_name: Wehrheim, Heike last_name: Wehrheim file: - access_level: open_access content_type: application/pdf creator: florida date_created: 2023-07-07T08:15:35Z date_updated: 2023-07-07T11:17:33Z file_id: '45894' file_name: C2-Chapter-SFB-Buch-Final.pdf file_size: 2288788 relation: main_file file_date_updated: 2023-07-07T11:17:33Z has_accepted_license: '1' intvolume: ' 412' language: - iso: eng oa: '1' page: 165-182 place: Paderborn project: - _id: '1' grant_number: '160364472' name: 'SFB 901: SFB 901: On-The-Fly Computing - Individualisierte IT-Dienstleistungen in dynamischen Märkten ' - _id: '4' name: 'SFB 901 - C: SFB 901 - Project Area C' - _id: '14' grant_number: '160364472' name: 'SFB 901 - C2: SFB 901 - On-The-Fly Compute Centers I: Heterogene Ausführungsumgebungen (Subproject C2)' publication: On-The-Fly Computing -- Individualized IT-services in dynamic markets publisher: Heinz Nixdorf Institut, Universität Paderborn series_title: Verlagsschriftenreihe des Heinz Nixdorf Instituts status: public title: 'Compute Centers I: Heterogeneous Execution Environments' type: book_chapter user_id: '3145' volume: 412 year: '2023' ... --- _id: '46190' author: - first_name: Jan-Oliver full_name: Opdenhövel, Jan-Oliver last_name: Opdenhövel - first_name: Christian full_name: Plessl, Christian id: '16153' last_name: Plessl orcid: 0000-0001-5728-9982 - first_name: Tobias full_name: Kenter, Tobias id: '3145' last_name: Kenter citation: ama: 'Opdenhövel J-O, Plessl C, Kenter T. Mutation Tree Reconstruction of Tumor Cells on FPGAs Using a Bit-Level Matrix Representation. In: Proceedings of the 13th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies. ACM; 2023. doi:10.1145/3597031.3597050' apa: Opdenhövel, J.-O., Plessl, C., & Kenter, T. (2023). Mutation Tree Reconstruction of Tumor Cells on FPGAs Using a Bit-Level Matrix Representation. Proceedings of the 13th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies. https://doi.org/10.1145/3597031.3597050 bibtex: '@inproceedings{Opdenhövel_Plessl_Kenter_2023, title={Mutation Tree Reconstruction of Tumor Cells on FPGAs Using a Bit-Level Matrix Representation}, DOI={10.1145/3597031.3597050}, booktitle={Proceedings of the 13th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies}, publisher={ACM}, author={Opdenhövel, Jan-Oliver and Plessl, Christian and Kenter, Tobias}, year={2023} }' chicago: Opdenhövel, Jan-Oliver, Christian Plessl, and Tobias Kenter. “Mutation Tree Reconstruction of Tumor Cells on FPGAs Using a Bit-Level Matrix Representation.” In Proceedings of the 13th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies. ACM, 2023. https://doi.org/10.1145/3597031.3597050. ieee: 'J.-O. Opdenhövel, C. Plessl, and T. Kenter, “Mutation Tree Reconstruction of Tumor Cells on FPGAs Using a Bit-Level Matrix Representation,” 2023, doi: 10.1145/3597031.3597050.' mla: Opdenhövel, Jan-Oliver, et al. “Mutation Tree Reconstruction of Tumor Cells on FPGAs Using a Bit-Level Matrix Representation.” Proceedings of the 13th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies, ACM, 2023, doi:10.1145/3597031.3597050. short: 'J.-O. Opdenhövel, C. Plessl, T. Kenter, in: Proceedings of the 13th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies, ACM, 2023.' date_created: 2023-07-28T09:49:23Z date_updated: 2023-07-28T09:58:06Z department: - _id: '27' - _id: '518' doi: 10.1145/3597031.3597050 language: - iso: eng main_file_link: - open_access: '1' url: https://dl.acm.org/doi/pdf/10.1145/3597031.3597050 oa: '1' project: - _id: '52' name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing' publication: Proceedings of the 13th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies publication_status: published publisher: ACM quality_controlled: '1' status: public title: Mutation Tree Reconstruction of Tumor Cells on FPGAs Using a Bit-Level Matrix Representation type: conference user_id: '3145' year: '2023' ... --- _id: '46188' author: - first_name: Jennifer full_name: Faj, Jennifer id: '78722' last_name: Faj - first_name: Tobias full_name: Kenter, Tobias id: '3145' last_name: Kenter - first_name: Sara full_name: Faghih-Naini, Sara last_name: Faghih-Naini - first_name: Christian full_name: Plessl, Christian id: '16153' last_name: Plessl orcid: 0000-0001-5728-9982 - first_name: Vadym full_name: Aizinger, Vadym last_name: Aizinger citation: ama: 'Faj J, Kenter T, Faghih-Naini S, Plessl C, Aizinger V. Scalable Multi-FPGA Design of a Discontinuous Galerkin Shallow-Water Model on Unstructured Meshes. In: Proceedings of the Platform for Advanced Scientific Computing Conference. ACM; 2023. doi:10.1145/3592979.3593407' apa: Faj, J., Kenter, T., Faghih-Naini, S., Plessl, C., & Aizinger, V. (2023). Scalable Multi-FPGA Design of a Discontinuous Galerkin Shallow-Water Model on Unstructured Meshes. Proceedings of the Platform for Advanced Scientific Computing Conference. https://doi.org/10.1145/3592979.3593407 bibtex: '@inproceedings{Faj_Kenter_Faghih-Naini_Plessl_Aizinger_2023, title={Scalable Multi-FPGA Design of a Discontinuous Galerkin Shallow-Water Model on Unstructured Meshes}, DOI={10.1145/3592979.3593407}, booktitle={Proceedings of the Platform for Advanced Scientific Computing Conference}, publisher={ACM}, author={Faj, Jennifer and Kenter, Tobias and Faghih-Naini, Sara and Plessl, Christian and Aizinger, Vadym}, year={2023} }' chicago: Faj, Jennifer, Tobias Kenter, Sara Faghih-Naini, Christian Plessl, and Vadym Aizinger. “Scalable Multi-FPGA Design of a Discontinuous Galerkin Shallow-Water Model on Unstructured Meshes.” In Proceedings of the Platform for Advanced Scientific Computing Conference. ACM, 2023. https://doi.org/10.1145/3592979.3593407. ieee: 'J. Faj, T. Kenter, S. Faghih-Naini, C. Plessl, and V. Aizinger, “Scalable Multi-FPGA Design of a Discontinuous Galerkin Shallow-Water Model on Unstructured Meshes,” 2023, doi: 10.1145/3592979.3593407.' mla: Faj, Jennifer, et al. “Scalable Multi-FPGA Design of a Discontinuous Galerkin Shallow-Water Model on Unstructured Meshes.” Proceedings of the Platform for Advanced Scientific Computing Conference, ACM, 2023, doi:10.1145/3592979.3593407. short: 'J. Faj, T. Kenter, S. Faghih-Naini, C. Plessl, V. Aizinger, in: Proceedings of the Platform for Advanced Scientific Computing Conference, ACM, 2023.' date_created: 2023-07-28T09:42:14Z date_updated: 2023-07-28T09:48:19Z department: - _id: '27' - _id: '518' doi: 10.1145/3592979.3593407 language: - iso: eng main_file_link: - open_access: '1' url: https://dl.acm.org/doi/pdf/10.1145/3592979.3593407 oa: '1' project: - _id: '52' name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing' publication: Proceedings of the Platform for Advanced Scientific Computing Conference publication_status: published publisher: ACM quality_controlled: '1' status: public title: Scalable Multi-FPGA Design of a Discontinuous Galerkin Shallow-Water Model on Unstructured Meshes type: conference user_id: '3145' year: '2023' ... --- _id: '46189' author: - first_name: Charles full_name: Prouveur, Charles last_name: Prouveur - first_name: Matthieu full_name: Haefele, Matthieu last_name: Haefele - first_name: Tobias full_name: Kenter, Tobias id: '3145' last_name: Kenter - first_name: Nils full_name: Voss, Nils last_name: Voss citation: ama: 'Prouveur C, Haefele M, Kenter T, Voss N. FPGA Acceleration for HPC Supercapacitor Simulations. In: Proceedings of the Platform for Advanced Scientific Computing Conference. ACM; 2023. doi:10.1145/3592979.3593419' apa: Prouveur, C., Haefele, M., Kenter, T., & Voss, N. (2023). FPGA Acceleration for HPC Supercapacitor Simulations. Proceedings of the Platform for Advanced Scientific Computing Conference. https://doi.org/10.1145/3592979.3593419 bibtex: '@inproceedings{Prouveur_Haefele_Kenter_Voss_2023, title={FPGA Acceleration for HPC Supercapacitor Simulations}, DOI={10.1145/3592979.3593419}, booktitle={Proceedings of the Platform for Advanced Scientific Computing Conference}, publisher={ACM}, author={Prouveur, Charles and Haefele, Matthieu and Kenter, Tobias and Voss, Nils}, year={2023} }' chicago: Prouveur, Charles, Matthieu Haefele, Tobias Kenter, and Nils Voss. “FPGA Acceleration for HPC Supercapacitor Simulations.” In Proceedings of the Platform for Advanced Scientific Computing Conference. ACM, 2023. https://doi.org/10.1145/3592979.3593419. ieee: 'C. Prouveur, M. Haefele, T. Kenter, and N. Voss, “FPGA Acceleration for HPC Supercapacitor Simulations,” 2023, doi: 10.1145/3592979.3593419.' mla: Prouveur, Charles, et al. “FPGA Acceleration for HPC Supercapacitor Simulations.” Proceedings of the Platform for Advanced Scientific Computing Conference, ACM, 2023, doi:10.1145/3592979.3593419. short: 'C. Prouveur, M. Haefele, T. Kenter, N. Voss, in: Proceedings of the Platform for Advanced Scientific Computing Conference, ACM, 2023.' date_created: 2023-07-28T09:46:25Z date_updated: 2023-07-28T09:58:16Z department: - _id: '27' - _id: '518' doi: 10.1145/3592979.3593419 language: - iso: eng main_file_link: - open_access: '1' url: https://dl.acm.org/doi/pdf/10.1145/3592979.3593419 oa: '1' project: - _id: '52' name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing' publication: Proceedings of the Platform for Advanced Scientific Computing Conference publication_status: published publisher: ACM quality_controlled: '1' status: public title: FPGA Acceleration for HPC Supercapacitor Simulations type: conference user_id: '3145' year: '2023' ... --- _id: '43228' abstract: - lang: eng text: "The computation of electron repulsion integrals (ERIs) over Gaussian-type orbitals (GTOs) is a challenging problem in quantum-mechanics-based atomistic simulations. In practical simulations, several trillions of ERIs may have to be\r\ncomputed for every time step.\r\nIn this work, we investigate FPGAs as accelerators for the ERI computation. We use template parameters, here within the Intel oneAPI tool flow, to create customized designs for 256 different ERI quartet classes, based on their orbitals. To maximize data reuse, all intermediates are buffered in FPGA on-chip memory with customized layout. The pre-calculation of intermediates also helps to overcome data dependencies caused by multi-dimensional recurrence\r\nrelations. The involved loop structures are partially or even fully unrolled for high throughput of FPGA kernels. Furthermore, a lossy compression algorithm utilizing arbitrary bitwidth integers is integrated in the FPGA kernels. To our\r\nbest knowledge, this is the first work on ERI computation on FPGAs that supports more than just the single most basic quartet class. Also, the integration of ERI computation and compression it a novelty that is not even covered by CPU or GPU libraries so far.\r\nOur evaluation shows that using 16-bit integer for the ERI compression, the fastest FPGA kernels exceed the performance of 10 GERIS ($10 \\times 10^9$ ERIs per second) on one Intel Stratix 10 GX 2800 FPGA, with maximum absolute errors around $10^{-7}$ - $10^{-5}$ Hartree. The measured throughput can be accurately explained by a performance model. The FPGA kernels deployed on 2 FPGAs outperform similar computations using the widely used libint reference on a two-socket server with 40 Xeon Gold 6148 CPU cores of the same process technology by factors up to 6.0x and on a new two-socket server with 128 EPYC 7713 CPU cores by up to 1.9x." author: - first_name: Xin full_name: Wu, Xin id: '77439' last_name: Wu - first_name: Tobias full_name: Kenter, Tobias id: '3145' last_name: Kenter - first_name: Robert full_name: Schade, Robert id: '75963' last_name: Schade orcid: 0000-0002-6268-539 - first_name: Thomas full_name: Kühne, Thomas id: '49079' last_name: Kühne - first_name: Christian full_name: Plessl, Christian id: '16153' last_name: Plessl orcid: 0000-0001-5728-9982 citation: ama: 'Wu X, Kenter T, Schade R, Kühne T, Plessl C. Computing and Compressing Electron Repulsion Integrals on FPGAs. In: 2023 IEEE 31st Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM). ; 2023:162-173. doi:10.1109/FCCM57271.2023.00026' apa: Wu, X., Kenter, T., Schade, R., Kühne, T., & Plessl, C. (2023). Computing and Compressing Electron Repulsion Integrals on FPGAs. 2023 IEEE 31st Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), 162–173. https://doi.org/10.1109/FCCM57271.2023.00026 bibtex: '@inproceedings{Wu_Kenter_Schade_Kühne_Plessl_2023, title={Computing and Compressing Electron Repulsion Integrals on FPGAs}, DOI={10.1109/FCCM57271.2023.00026}, booktitle={2023 IEEE 31st Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)}, author={Wu, Xin and Kenter, Tobias and Schade, Robert and Kühne, Thomas and Plessl, Christian}, year={2023}, pages={162–173} }' chicago: Wu, Xin, Tobias Kenter, Robert Schade, Thomas Kühne, and Christian Plessl. “Computing and Compressing Electron Repulsion Integrals on FPGAs.” In 2023 IEEE 31st Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), 162–73, 2023. https://doi.org/10.1109/FCCM57271.2023.00026. ieee: 'X. Wu, T. Kenter, R. Schade, T. Kühne, and C. Plessl, “Computing and Compressing Electron Repulsion Integrals on FPGAs,” in 2023 IEEE 31st Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), 2023, pp. 162–173, doi: 10.1109/FCCM57271.2023.00026.' mla: Wu, Xin, et al. “Computing and Compressing Electron Repulsion Integrals on FPGAs.” 2023 IEEE 31st Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), 2023, pp. 162–73, doi:10.1109/FCCM57271.2023.00026. short: 'X. Wu, T. Kenter, R. Schade, T. Kühne, C. Plessl, in: 2023 IEEE 31st Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), 2023, pp. 162–173.' date_created: 2023-03-30T11:15:40Z date_updated: 2023-08-02T15:05:42Z department: - _id: '27' - _id: '518' doi: 10.1109/FCCM57271.2023.00026 external_id: arxiv: - '2303.13632' language: - iso: eng main_file_link: - url: https://ieeexplore.ieee.org/document/10171537 page: 162-173 project: - _id: '52' name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing' publication: 2023 IEEE 31st Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) quality_controlled: '1' status: public title: Computing and Compressing Electron Repulsion Integrals on FPGAs type: conference user_id: '75963' year: '2023' ... --- _id: '45361' abstract: - lang: eng text: The non-orthogonal local submatrix method applied to electronic structure–based molecular dynamics simulations is shown to exceed 1.1 EFLOP/s in FP16/FP32-mixed floating-point arithmetic when using 4400 NVIDIA A100 GPUs of the Perlmutter system. This is enabled by a modification of the original method that pushes the sustained fraction of the peak performance to about 80%. Example calculations are performed for SARS-CoV-2 spike proteins with up to 83 million atoms. article_number: '109434202311776' article_type: original author: - first_name: Robert full_name: Schade, Robert id: '75963' last_name: Schade orcid: 0000-0002-6268-539 - first_name: Tobias full_name: Kenter, Tobias id: '3145' last_name: Kenter - first_name: Hossam full_name: Elgabarty, Hossam id: '60250' last_name: Elgabarty orcid: 0000-0002-4945-1481 - first_name: Michael full_name: Lass, Michael id: '24135' last_name: Lass orcid: 0000-0002-5708-7632 - first_name: Thomas full_name: Kühne, Thomas id: '49079' last_name: Kühne - first_name: Christian full_name: Plessl, Christian id: '16153' last_name: Plessl orcid: 0000-0001-5728-9982 citation: ama: Schade R, Kenter T, Elgabarty H, Lass M, Kühne T, Plessl C. Breaking the exascale barrier for the electronic structure problem in ab-initio molecular dynamics. The International Journal of High Performance Computing Applications. Published online 2023. doi:10.1177/10943420231177631 apa: Schade, R., Kenter, T., Elgabarty, H., Lass, M., Kühne, T., & Plessl, C. (2023). Breaking the exascale barrier for the electronic structure problem in ab-initio molecular dynamics. The International Journal of High Performance Computing Applications, Article 109434202311776. https://doi.org/10.1177/10943420231177631 bibtex: '@article{Schade_Kenter_Elgabarty_Lass_Kühne_Plessl_2023, title={Breaking the exascale barrier for the electronic structure problem in ab-initio molecular dynamics}, DOI={10.1177/10943420231177631}, number={109434202311776}, journal={The International Journal of High Performance Computing Applications}, publisher={SAGE Publications}, author={Schade, Robert and Kenter, Tobias and Elgabarty, Hossam and Lass, Michael and Kühne, Thomas and Plessl, Christian}, year={2023} }' chicago: Schade, Robert, Tobias Kenter, Hossam Elgabarty, Michael Lass, Thomas Kühne, and Christian Plessl. “Breaking the Exascale Barrier for the Electronic Structure Problem in Ab-Initio Molecular Dynamics.” The International Journal of High Performance Computing Applications, 2023. https://doi.org/10.1177/10943420231177631. ieee: 'R. Schade, T. Kenter, H. Elgabarty, M. Lass, T. Kühne, and C. Plessl, “Breaking the exascale barrier for the electronic structure problem in ab-initio molecular dynamics,” The International Journal of High Performance Computing Applications, Art. no. 109434202311776, 2023, doi: 10.1177/10943420231177631.' mla: Schade, Robert, et al. “Breaking the Exascale Barrier for the Electronic Structure Problem in Ab-Initio Molecular Dynamics.” The International Journal of High Performance Computing Applications, 109434202311776, SAGE Publications, 2023, doi:10.1177/10943420231177631. short: R. Schade, T. Kenter, H. Elgabarty, M. Lass, T. Kühne, C. Plessl, The International Journal of High Performance Computing Applications (2023). date_created: 2023-05-30T09:19:09Z date_updated: 2023-08-02T15:04:53Z department: - _id: '27' - _id: '518' doi: 10.1177/10943420231177631 keyword: - Hardware and Architecture - Theoretical Computer Science - Software language: - iso: eng main_file_link: - open_access: '1' url: https://journals.sagepub.com/doi/10.1177/10943420231177631 oa: '1' project: - _id: '52' name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing' publication: The International Journal of High Performance Computing Applications publication_identifier: issn: - 1094-3420 - 1741-2846 publication_status: published publisher: SAGE Publications quality_controlled: '1' status: public title: Breaking the exascale barrier for the electronic structure problem in ab-initio molecular dynamics type: journal_article user_id: '75963' year: '2023' ... --- _id: '50172' abstract: - lang: eng text: "Viscous hydrodynamics serves as a successful mesoscopic description of the\r\nQuark-Gluon Plasma produced in relativistic heavy-ion collisions. In order to\r\ninvestigate, how such an effective description emerges from the underlying\r\nmicroscopic dynamics we calculate the hydrodynamic and non-hydrodynamic modes\r\nof linear response in the sound channel from a first-principle calculation in\r\nkinetic theory. We do this with a new approach wherein we discretize the\r\ncollision kernel to directly calculate eigenvalues and eigenmodes of the\r\nevolution operator. This allows us to study the Green's functions at any point\r\nin the complex frequency space. Our study focuses on scalar theory with quartic\r\ninteraction and we find that the analytic structure of Green's functions in the\r\ncomplex plane is far more complicated than just poles or cuts which is a first\r\nstep towards an equivalent study in QCD kinetic theory." author: - first_name: Stephan full_name: Ochsenfeld, Stephan last_name: Ochsenfeld - first_name: Sören full_name: Schlichting, Sören last_name: Schlichting citation: ama: Ochsenfeld S, Schlichting S. Hydrodynamic and Non-hydrodynamic Excitations in Kinetic Theory -- A  Numerical Analysis in Scalar Field Theory. arXiv:230804491. Published online 2023. apa: Ochsenfeld, S., & Schlichting, S. (2023). Hydrodynamic and Non-hydrodynamic Excitations in Kinetic Theory -- A  Numerical Analysis in Scalar Field Theory. In arXiv:2308.04491. bibtex: '@article{Ochsenfeld_Schlichting_2023, title={Hydrodynamic and Non-hydrodynamic Excitations in Kinetic Theory -- A  Numerical Analysis in Scalar Field Theory}, journal={arXiv:2308.04491}, author={Ochsenfeld, Stephan and Schlichting, Sören}, year={2023} }' chicago: Ochsenfeld, Stephan, and Sören Schlichting. “Hydrodynamic and Non-Hydrodynamic Excitations in Kinetic Theory -- A  Numerical Analysis in Scalar Field Theory.” ArXiv:2308.04491, 2023. ieee: S. Ochsenfeld and S. Schlichting, “Hydrodynamic and Non-hydrodynamic Excitations in Kinetic Theory -- A  Numerical Analysis in Scalar Field Theory,” arXiv:2308.04491. 2023. mla: Ochsenfeld, Stephan, and Sören Schlichting. “Hydrodynamic and Non-Hydrodynamic Excitations in Kinetic Theory -- A  Numerical Analysis in Scalar Field Theory.” ArXiv:2308.04491, 2023. short: S. Ochsenfeld, S. Schlichting, ArXiv:2308.04491 (2023). date_created: 2024-01-04T08:47:38Z date_updated: 2024-01-04T08:47:47Z department: - _id: '27' external_id: arxiv: - '2308.04491' language: - iso: eng project: - _id: '52' name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing' publication: arXiv:2308.04491 status: public title: Hydrodynamic and Non-hydrodynamic Excitations in Kinetic Theory -- A Numerical Analysis in Scalar Field Theory type: preprint user_id: '67287' year: '2023' ... --- _id: '50221' abstract: - lang: eng text: "Memory Gym presents a suite of 2D partially observable environments, namely\r\nMortar Mayhem, Mystery Path, and Searing Spotlights, designed to benchmark\r\nmemory capabilities in decision-making agents. These environments, originally\r\nwith finite tasks, are expanded into innovative, endless formats, mirroring the\r\nescalating challenges of cumulative memory games such as ``I packed my bag''.\r\nThis progression in task design shifts the focus from merely assessing sample\r\nefficiency to also probing the levels of memory effectiveness in dynamic,\r\nprolonged scenarios. To address the gap in available memory-based Deep\r\nReinforcement Learning baselines, we introduce an implementation that\r\nintegrates Transformer-XL (TrXL) with Proximal Policy Optimization. This\r\napproach utilizes TrXL as a form of episodic memory, employing a sliding window\r\ntechnique. Our comparative study between the Gated Recurrent Unit (GRU) and\r\nTrXL reveals varied performances across different settings. TrXL, on the finite\r\nenvironments, demonstrates superior sample efficiency in Mystery Path and\r\noutperforms in Mortar Mayhem. However, GRU is more efficient on Searing\r\nSpotlights. Most notably, in all endless tasks, GRU makes a remarkable\r\nresurgence, consistently outperforming TrXL by significant margins. Website and\r\nSource Code: https://github.com/MarcoMeter/endless-memory-gym/" author: - first_name: Marco full_name: Pleines, Marco last_name: Pleines - first_name: Matthias full_name: Pallasch, Matthias last_name: Pallasch - first_name: Frank full_name: Zimmer, Frank last_name: Zimmer - first_name: Mike full_name: Preuss, Mike last_name: Preuss citation: ama: 'Pleines M, Pallasch M, Zimmer F, Preuss M. Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of  Agents. arXiv:230917207. Published online 2023.' apa: 'Pleines, M., Pallasch, M., Zimmer, F., & Preuss, M. (2023). Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of  Agents. In arXiv:2309.17207.' bibtex: '@article{Pleines_Pallasch_Zimmer_Preuss_2023, title={Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of  Agents}, journal={arXiv:2309.17207}, author={Pleines, Marco and Pallasch, Matthias and Zimmer, Frank and Preuss, Mike}, year={2023} }' chicago: 'Pleines, Marco, Matthias Pallasch, Frank Zimmer, and Mike Preuss. “Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of  Agents.” ArXiv:2309.17207, 2023.' ieee: 'M. Pleines, M. Pallasch, F. Zimmer, and M. Preuss, “Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of  Agents,” arXiv:2309.17207. 2023.' mla: 'Pleines, Marco, et al. “Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of  Agents.” ArXiv:2309.17207, 2023.' short: M. Pleines, M. Pallasch, F. Zimmer, M. Preuss, ArXiv:2309.17207 (2023). date_created: 2024-01-05T12:38:42Z date_updated: 2024-01-05T12:39:50Z department: - _id: '27' external_id: arxiv: - '2309.17207' language: - iso: eng project: - _id: '52' name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing' publication: arXiv:2309.17207 status: public title: 'Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of Agents' type: preprint user_id: '67287' year: '2023' ... --- _id: '46191' author: - first_name: Christoph full_name: Alt, Christoph id: '100625' last_name: Alt - first_name: Tobias full_name: Kenter, Tobias id: '3145' last_name: Kenter - first_name: Sara full_name: Faghih-Naini, Sara last_name: Faghih-Naini - first_name: Jennifer full_name: Faj, Jennifer id: '78722' last_name: Faj - first_name: Jan-Oliver full_name: Opdenhövel, Jan-Oliver last_name: Opdenhövel - first_name: Christian full_name: Plessl, Christian id: '16153' last_name: Plessl orcid: 0000-0001-5728-9982 - first_name: Vadym full_name: Aizinger, Vadym last_name: Aizinger - first_name: Jan full_name: Hönig, Jan last_name: Hönig - first_name: Harald full_name: Köstler, Harald last_name: Köstler citation: ama: 'Alt C, Kenter T, Faghih-Naini S, et al. Shallow Water DG Simulations on FPGAs: Design and Comparison of a Novel Code Generation Pipeline. In: Lecture Notes in Computer Science. Springer Nature Switzerland; 2023. doi:10.1007/978-3-031-32041-5_5' apa: 'Alt, C., Kenter, T., Faghih-Naini, S., Faj, J., Opdenhövel, J.-O., Plessl, C., Aizinger, V., Hönig, J., & Köstler, H. (2023). Shallow Water DG Simulations on FPGAs: Design and Comparison of a Novel Code Generation Pipeline. In Lecture Notes in Computer Science. Springer Nature Switzerland. https://doi.org/10.1007/978-3-031-32041-5_5' bibtex: '@inbook{Alt_Kenter_Faghih-Naini_Faj_Opdenhövel_Plessl_Aizinger_Hönig_Köstler_2023, place={Cham}, title={Shallow Water DG Simulations on FPGAs: Design and Comparison of a Novel Code Generation Pipeline}, DOI={10.1007/978-3-031-32041-5_5}, booktitle={Lecture Notes in Computer Science}, publisher={Springer Nature Switzerland}, author={Alt, Christoph and Kenter, Tobias and Faghih-Naini, Sara and Faj, Jennifer and Opdenhövel, Jan-Oliver and Plessl, Christian and Aizinger, Vadym and Hönig, Jan and Köstler, Harald}, year={2023} }' chicago: 'Alt, Christoph, Tobias Kenter, Sara Faghih-Naini, Jennifer Faj, Jan-Oliver Opdenhövel, Christian Plessl, Vadym Aizinger, Jan Hönig, and Harald Köstler. “Shallow Water DG Simulations on FPGAs: Design and Comparison of a Novel Code Generation Pipeline.” In Lecture Notes in Computer Science. Cham: Springer Nature Switzerland, 2023. https://doi.org/10.1007/978-3-031-32041-5_5.' ieee: 'C. Alt et al., “Shallow Water DG Simulations on FPGAs: Design and Comparison of a Novel Code Generation Pipeline,” in Lecture Notes in Computer Science, Cham: Springer Nature Switzerland, 2023.' mla: 'Alt, Christoph, et al. “Shallow Water DG Simulations on FPGAs: Design and Comparison of a Novel Code Generation Pipeline.” Lecture Notes in Computer Science, Springer Nature Switzerland, 2023, doi:10.1007/978-3-031-32041-5_5.' short: 'C. Alt, T. Kenter, S. Faghih-Naini, J. Faj, J.-O. Opdenhövel, C. Plessl, V. Aizinger, J. Hönig, H. Köstler, in: Lecture Notes in Computer Science, Springer Nature Switzerland, Cham, 2023.' date_created: 2023-07-28T09:53:21Z date_updated: 2024-01-22T09:58:49Z department: - _id: '27' - _id: '518' doi: 10.1007/978-3-031-32041-5_5 language: - iso: eng place: Cham project: - _id: '52' name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing' publication: Lecture Notes in Computer Science publication_identifier: isbn: - '9783031320408' - '9783031320415' issn: - 0302-9743 - 1611-3349 publication_status: published publisher: Springer Nature Switzerland quality_controlled: '1' status: public title: 'Shallow Water DG Simulations on FPGAs: Design and Comparison of a Novel Code Generation Pipeline' type: book_chapter user_id: '3145' year: '2023' ... --- _id: '43439' abstract: - lang: eng text: "This preprint makes the claim of having computed the $9^{th}$ Dedekind\r\nNumber. This was done by building an efficient FPGA Accelerator for the core\r\noperation of the process, and parallelizing it on the Noctua 2 Supercluster at\r\nPaderborn University. The resulting value is\r\n286386577668298411128469151667598498812366. This value can be verified in two\r\nsteps. We have made the data file containing the 490M results available, each\r\nof which can be verified separately on CPU, and the whole file sums to our\r\nproposed value." author: - first_name: Lennart full_name: Van Hirtum, Lennart last_name: Van Hirtum - first_name: Patrick full_name: De Causmaecker, Patrick last_name: De Causmaecker - first_name: Jens full_name: Goemaere, Jens last_name: Goemaere - first_name: Tobias full_name: Kenter, Tobias id: '3145' last_name: Kenter - first_name: Heinrich full_name: Riebler, Heinrich id: '8961' last_name: Riebler - first_name: Michael full_name: Lass, Michael id: '24135' last_name: Lass orcid: 0000-0002-5708-7632 - first_name: Christian full_name: Plessl, Christian id: '16153' last_name: Plessl orcid: 0000-0001-5728-9982 citation: ama: Van Hirtum L, De Causmaecker P, Goemaere J, et al. A computation of D(9) using FPGA Supercomputing. arXiv:230403039. Published online 2023. apa: Van Hirtum, L., De Causmaecker, P., Goemaere, J., Kenter, T., Riebler, H., Lass, M., & Plessl, C. (2023). A computation of D(9) using FPGA Supercomputing. In arXiv:2304.03039. bibtex: '@article{Van Hirtum_De Causmaecker_Goemaere_Kenter_Riebler_Lass_Plessl_2023, title={A computation of D(9) using FPGA Supercomputing}, journal={arXiv:2304.03039}, author={Van Hirtum, Lennart and De Causmaecker, Patrick and Goemaere, Jens and Kenter, Tobias and Riebler, Heinrich and Lass, Michael and Plessl, Christian}, year={2023} }' chicago: Van Hirtum, Lennart, Patrick De Causmaecker, Jens Goemaere, Tobias Kenter, Heinrich Riebler, Michael Lass, and Christian Plessl. “A Computation of D(9) Using FPGA Supercomputing.” ArXiv:2304.03039, 2023. ieee: L. Van Hirtum et al., “A computation of D(9) using FPGA Supercomputing,” arXiv:2304.03039. 2023. mla: Van Hirtum, Lennart, et al. “A Computation of D(9) Using FPGA Supercomputing.” ArXiv:2304.03039, 2023. short: L. Van Hirtum, P. De Causmaecker, J. Goemaere, T. Kenter, H. Riebler, M. Lass, C. Plessl, ArXiv:2304.03039 (2023). date_created: 2023-04-08T11:05:29Z date_updated: 2024-01-22T09:56:42Z department: - _id: '27' - _id: '518' external_id: arxiv: - '2304.03039' language: - iso: eng project: - _id: '52' name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing' publication: arXiv:2304.03039 status: public title: A computation of D(9) using FPGA Supercomputing type: preprint user_id: '3145' year: '2023' ... --- _id: '32177' abstract: - lang: eng text: "We investigate the early time development of the anisotropic transverse flow\r\nand spatial eccentricities of a fireball with various particle-based transport\r\napproaches using a fixed initial condition. In numerical simulations ranging\r\nfrom the quasi-collisionless case to the hydrodynamic regime, we find that the\r\nonset of $v_n$ and of related measures of anisotropic flow can be described\r\nwith a simple power-law ansatz, with an exponent that depends on the amount of\r\nrescatterings in the system. In the few-rescatterings regime we perform\r\nsemi-analytical calculations, based on a systematic expansion in powers of time\r\nand the cross section, which can reproduce the numerical findings." author: - first_name: Nicolas full_name: Borghini, Nicolas last_name: Borghini - first_name: Marc full_name: Borrell, Marc last_name: Borrell - first_name: Hendrik full_name: Roch, Hendrik last_name: Roch citation: ama: Borghini N, Borrell M, Roch H. Early time behavior of spatial and momentum anisotropies in kinetic  theory across different Knudsen numbers. arXiv:220113294. Published online 2022. apa: Borghini, N., Borrell, M., & Roch, H. (2022). Early time behavior of spatial and momentum anisotropies in kinetic  theory across different Knudsen numbers. In arXiv:2201.13294. bibtex: '@article{Borghini_Borrell_Roch_2022, title={Early time behavior of spatial and momentum anisotropies in kinetic  theory across different Knudsen numbers}, journal={arXiv:2201.13294}, author={Borghini, Nicolas and Borrell, Marc and Roch, Hendrik}, year={2022} }' chicago: Borghini, Nicolas, Marc Borrell, and Hendrik Roch. “Early Time Behavior of Spatial and Momentum Anisotropies in Kinetic  Theory across Different Knudsen Numbers.” ArXiv:2201.13294, 2022. ieee: N. Borghini, M. Borrell, and H. Roch, “Early time behavior of spatial and momentum anisotropies in kinetic  theory across different Knudsen numbers,” arXiv:2201.13294. 2022. mla: Borghini, Nicolas, et al. “Early Time Behavior of Spatial and Momentum Anisotropies in Kinetic  Theory across Different Knudsen Numbers.” ArXiv:2201.13294, 2022. short: N. Borghini, M. Borrell, H. Roch, ArXiv:2201.13294 (2022). date_created: 2022-06-27T09:08:04Z date_updated: 2022-06-27T09:35:53Z department: - _id: '27' external_id: arxiv: - '2201.13294' language: - iso: eng project: - _id: '52' name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing' publication: arXiv:2201.13294 status: public title: Early time behavior of spatial and momentum anisotropies in kinetic theory across different Knudsen numbers type: preprint user_id: '67287' year: '2022' ... --- _id: '32178' abstract: - lang: eng text: "We test the ability of the \"escape mechanism\" to create the anisotropic flow\r\nobserved in high-energy nuclear collisions. We compare the flow harmonics $v_n$\r\nin the few-rescatterings regime from two types of transport simulations, with\r\n$2\\to 2$ and $2\\to 0$ collision kernels respectively, and from analytical\r\ncalculations neglecting the gain term of the Boltzmann equation. We find that\r\nthe even flow harmonics are similar in the three approaches, while the odd\r\nharmonics differ significantly." author: - first_name: Benedikt full_name: Bachmann, Benedikt last_name: Bachmann - first_name: Nicolas full_name: Borghini, Nicolas last_name: Borghini - first_name: Nina full_name: Feld, Nina last_name: Feld - first_name: Hendrik full_name: Roch, Hendrik last_name: Roch citation: ama: Bachmann B, Borghini N, Feld N, Roch H. Even anisotropic-flow harmonics are from Venus, odd ones are from Mars. arXiv:220313306. Published online 2022. apa: Bachmann, B., Borghini, N., Feld, N., & Roch, H. (2022). Even anisotropic-flow harmonics are from Venus, odd ones are from Mars. In arXiv:2203.13306. bibtex: '@article{Bachmann_Borghini_Feld_Roch_2022, title={Even anisotropic-flow harmonics are from Venus, odd ones are from Mars}, journal={arXiv:2203.13306}, author={Bachmann, Benedikt and Borghini, Nicolas and Feld, Nina and Roch, Hendrik}, year={2022} }' chicago: Bachmann, Benedikt, Nicolas Borghini, Nina Feld, and Hendrik Roch. “Even Anisotropic-Flow Harmonics Are from Venus, Odd Ones Are from Mars.” ArXiv:2203.13306, 2022. ieee: B. Bachmann, N. Borghini, N. Feld, and H. Roch, “Even anisotropic-flow harmonics are from Venus, odd ones are from Mars,” arXiv:2203.13306. 2022. mla: Bachmann, Benedikt, et al. “Even Anisotropic-Flow Harmonics Are from Venus, Odd Ones Are from Mars.” ArXiv:2203.13306, 2022. short: B. Bachmann, N. Borghini, N. Feld, H. Roch, ArXiv:2203.13306 (2022). date_created: 2022-06-27T09:12:26Z date_updated: 2022-06-27T09:35:34Z department: - _id: '27' external_id: arxiv: - '2203.13306' language: - iso: eng project: - _id: '52' name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing' publication: arXiv:2203.13306 status: public title: Even anisotropic-flow harmonics are from Venus, odd ones are from Mars type: preprint user_id: '67287' year: '2022' ... --- _id: '32183' author: - first_name: W full_name: Hou, W last_name: Hou - first_name: Y full_name: Yao, Y last_name: Yao - first_name: Y full_name: Li, Y last_name: Li - first_name: B full_name: Peng, B last_name: Peng - first_name: K full_name: Shi, K last_name: Shi - first_name: Z full_name: Zhou, Z last_name: Zhou - first_name: J full_name: Pan, J last_name: Pan - first_name: M full_name: Liu, M last_name: Liu - first_name: J full_name: Hu, J last_name: Hu citation: ama: Hou W, Yao Y, Li Y, et al. Linearly shifting ferromagnetic resonance response of La0.7Sr0.3MnO3 thin film for body temperature sensors. Frontiers of materials science. 2022;16(1). apa: Hou, W., Yao, Y., Li, Y., Peng, B., Shi, K., Zhou, Z., Pan, J., Liu, M., & Hu, J. (2022). Linearly shifting ferromagnetic resonance response of La0.7Sr0.3MnO3 thin film for body temperature sensors. Frontiers of Materials Science, 16(1). bibtex: '@article{Hou_Yao_Li_Peng_Shi_Zhou_Pan_Liu_Hu_2022, title={Linearly shifting ferromagnetic resonance response of La0.7Sr0.3MnO3 thin film for body temperature sensors}, volume={16}, number={1}, journal={Frontiers of materials science}, author={Hou, W and Yao, Y and Li, Y and Peng, B and Shi, K and Zhou, Z and Pan, J and Liu, M and Hu, J}, year={2022} }' chicago: Hou, W, Y Yao, Y Li, B Peng, K Shi, Z Zhou, J Pan, M Liu, and J Hu. “Linearly Shifting Ferromagnetic Resonance Response of La0.7Sr0.3MnO3 Thin Film for Body Temperature Sensors.” Frontiers of Materials Science 16, no. 1 (2022). ieee: W. Hou et al., “Linearly shifting ferromagnetic resonance response of La0.7Sr0.3MnO3 thin film for body temperature sensors,” Frontiers of materials science, vol. 16, no. 1, 2022. mla: Hou, W., et al. “Linearly Shifting Ferromagnetic Resonance Response of La0.7Sr0.3MnO3 Thin Film for Body Temperature Sensors.” Frontiers of Materials Science, vol. 16, no. 1, 2022. short: W. Hou, Y. Yao, Y. Li, B. Peng, K. Shi, Z. Zhou, J. Pan, M. Liu, J. Hu, Frontiers of Materials Science 16 (2022). date_created: 2022-06-27T09:43:47Z date_updated: 2022-06-27T12:49:59Z department: - _id: '27' intvolume: ' 16' issue: '1' language: - iso: eng project: - _id: '52' name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing' publication: Frontiers of materials science publication_identifier: issn: - 2095-025x status: public title: Linearly shifting ferromagnetic resonance response of La0.7Sr0.3MnO3 thin film for body temperature sensors type: journal_article user_id: '15278' volume: 16 year: '2022' ... --- _id: '32234' author: - first_name: M full_name: Wojciechowski, M last_name: Wojciechowski citation: ama: Wojciechowski M. Dataset for random uniform distributions of 2D circles and 3D spheres. Data Brief. 2022;43:108318. apa: Wojciechowski, M. (2022). Dataset for random uniform distributions of 2D circles and 3D spheres. Data Brief, 43, 108318. bibtex: '@article{Wojciechowski_2022, title={Dataset for random uniform distributions of 2D circles and 3D spheres.}, volume={43}, journal={Data Brief}, author={Wojciechowski, M}, year={2022}, pages={108318} }' chicago: 'Wojciechowski, M. “Dataset for Random Uniform Distributions of 2D Circles and 3D Spheres.” Data Brief 43 (2022): 108318.' ieee: M. Wojciechowski, “Dataset for random uniform distributions of 2D circles and 3D spheres.,” Data Brief, vol. 43, p. 108318, 2022. mla: Wojciechowski, M. “Dataset for Random Uniform Distributions of 2D Circles and 3D Spheres.” Data Brief, vol. 43, 2022, p. 108318. short: M. Wojciechowski, Data Brief 43 (2022) 108318. date_created: 2022-06-28T06:53:33Z date_updated: 2022-06-28T06:54:00Z department: - _id: '27' external_id: pmid: - '35677623' intvolume: ' 43' language: - iso: eng page: '108318' pmid: '1' project: - _id: '52' name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing' publication: Data Brief publication_identifier: issn: - 2352-3409 status: public title: Dataset for random uniform distributions of 2D circles and 3D spheres. type: journal_article user_id: '15278' volume: 43 year: '2022' ... --- _id: '32414' author: - first_name: Michael full_name: Lass, Michael id: '24135' last_name: Lass orcid: 0000-0002-5708-7632 citation: ama: Lass M. Bringing Massive Parallelism and Hardware Acceleration to Linear Scaling Density Functional Theory Through Targeted Approximations. Universität Paderborn; 2022. doi:10.17619/UNIPB/1-1281 apa: Lass, M. (2022). Bringing Massive Parallelism and Hardware Acceleration to Linear Scaling Density Functional Theory Through Targeted Approximations. Universität Paderborn. https://doi.org/10.17619/UNIPB/1-1281 bibtex: '@book{Lass_2022, place={Paderborn}, title={Bringing Massive Parallelism and Hardware Acceleration to Linear Scaling Density Functional Theory Through Targeted Approximations}, DOI={10.17619/UNIPB/1-1281}, publisher={Universität Paderborn}, author={Lass, Michael}, year={2022} }' chicago: 'Lass, Michael. Bringing Massive Parallelism and Hardware Acceleration to Linear Scaling Density Functional Theory Through Targeted Approximations. Paderborn: Universität Paderborn, 2022. https://doi.org/10.17619/UNIPB/1-1281.' ieee: 'M. Lass, Bringing Massive Parallelism and Hardware Acceleration to Linear Scaling Density Functional Theory Through Targeted Approximations. Paderborn: Universität Paderborn, 2022.' mla: Lass, Michael. Bringing Massive Parallelism and Hardware Acceleration to Linear Scaling Density Functional Theory Through Targeted Approximations. Universität Paderborn, 2022, doi:10.17619/UNIPB/1-1281. short: M. Lass, Bringing Massive Parallelism and Hardware Acceleration to Linear Scaling Density Functional Theory Through Targeted Approximations, Universität Paderborn, Paderborn, 2022. date_created: 2022-07-25T18:13:51Z date_updated: 2022-07-25T18:14:23Z department: - _id: '27' - _id: '518' doi: 10.17619/UNIPB/1-1281 language: - iso: eng place: Paderborn publisher: Universität Paderborn status: public supervisor: - first_name: Christian full_name: Plessl, Christian id: '16153' last_name: Plessl orcid: 0000-0001-5728-9982 title: Bringing Massive Parallelism and Hardware Acceleration to Linear Scaling Density Functional Theory Through Targeted Approximations type: dissertation user_id: '24135' year: '2022' ... --- _id: '36879' abstract: - lang: eng text: The Julia programming language has evolved into a modern alternative to fill existing gaps in scientific computing and data science applications. Julia leverages a unified and coordinated single-language and ecosystem paradigm and has a proven track record of achieving high performance without sacrificing user productivity. These aspects make Julia a viable alternative to high-performance computing's (HPC's) existing and increasingly costly many-body workflow composition strategy in which traditional HPC languages (e.g., Fortran, C, C++) are used for simulations, and higher-level languages (e.g., Python, R, MATLAB) are used for data analysis and interactive computing. Julia's rapid growth in language capabilities, package ecosystem, and community make it a promising universal language for HPC. This paper presents the views of a multidisciplinary group of researchers from academia, government, and industry that advocate for an HPC software development paradigm that emphasizes developer productivity, workflow portability, and low barriers for entry. We believe that the Julia programming language, its ecosystem, and its community provide modern and powerful capabilities that enable this group's objectives. Crucially, we believe that Julia can provide a feasible and less costly approach to programming scientific applications and workflows that target HPC facilities. In this work, we examine the current practice and role of Julia as a common, end-to-end programming model to address major challenges in scientific reproducibility, data-driven AI/machine learning, co-design and workflows, scalability and performance portability in heterogeneous computing, network communication, data management, and community education. As a result, the diversification of current investments to fulfill the needs of the upcoming decade is crucial as more supercomputing centers prepare for the exascale era. author: - first_name: Valentin full_name: Churavy, Valentin last_name: Churavy - first_name: William F full_name: Godoy, William F last_name: Godoy - first_name: Carsten full_name: Bauer, Carsten id: '90082' last_name: Bauer - first_name: Hendrik full_name: Ranocha, Hendrik last_name: Ranocha - first_name: Michael full_name: Schlottke-Lakemper, Michael last_name: Schlottke-Lakemper - first_name: Ludovic full_name: Räss, Ludovic last_name: Räss - first_name: Johannes full_name: Blaschke, Johannes last_name: Blaschke - first_name: Mosè full_name: Giordano, Mosè last_name: Giordano - first_name: Erik full_name: Schnetter, Erik last_name: Schnetter - first_name: Samuel full_name: Omlin, Samuel last_name: Omlin - first_name: Jeffrey S full_name: Vetter, Jeffrey S last_name: Vetter - first_name: Alan full_name: Edelman, Alan last_name: Edelman citation: ama: Churavy V, Godoy WF, Bauer C, et al. Bridging HPC Communities through the Julia Programming Language. Published online 2022. apa: Churavy, V., Godoy, W. F., Bauer, C., Ranocha, H., Schlottke-Lakemper, M., Räss, L., Blaschke, J., Giordano, M., Schnetter, E., Omlin, S., Vetter, J. S., & Edelman, A. (2022). Bridging HPC Communities through the Julia Programming Language. bibtex: '@article{Churavy_Godoy_Bauer_Ranocha_Schlottke-Lakemper_Räss_Blaschke_Giordano_Schnetter_Omlin_et al._2022, title={Bridging HPC Communities through the Julia Programming Language}, author={Churavy, Valentin and Godoy, William F and Bauer, Carsten and Ranocha, Hendrik and Schlottke-Lakemper, Michael and Räss, Ludovic and Blaschke, Johannes and Giordano, Mosè and Schnetter, Erik and Omlin, Samuel and et al.}, year={2022} }' chicago: Churavy, Valentin, William F Godoy, Carsten Bauer, Hendrik Ranocha, Michael Schlottke-Lakemper, Ludovic Räss, Johannes Blaschke, et al. “Bridging HPC Communities through the Julia Programming Language,” 2022. ieee: V. Churavy et al., “Bridging HPC Communities through the Julia Programming Language.” 2022. mla: Churavy, Valentin, et al. Bridging HPC Communities through the Julia Programming Language. 2022. short: V. Churavy, W.F. Godoy, C. Bauer, H. Ranocha, M. Schlottke-Lakemper, L. Räss, J. Blaschke, M. Giordano, E. Schnetter, S. Omlin, J.S. Vetter, A. Edelman, (2022). date_created: 2023-01-16T09:10:48Z date_updated: 2023-01-16T09:16:20Z department: - _id: '27' language: - iso: eng main_file_link: - open_access: '1' url: https://arxiv.org/abs/2211.02740 oa: '1' status: public title: Bridging HPC Communities through the Julia Programming Language type: preprint user_id: '90082' year: '2022' ... --- _id: '40523' abstract: - lang: eng text: AbstractTailored nanoscale quantum light sources, matching the specific needs of use cases, are crucial building blocks for photonic quantum technologies. Several different approaches to realize solid-state quantum emitters with high performance have been pursued and different concepts for energy tuning have been established. However, the properties of the emitted photons are always defined by the individual quantum emitter and can therefore not be controlled with full flexibility. Here we introduce an all-optical nonlinear method to tailor and control the single photon emission. We demonstrate a laser-controlled down-conversion process from an excited state of a semiconductor quantum three-level system. Based on this concept, we realize energy tuning and polarization control of the single photon emission with a control-laser field. Our results mark an important step towards tailored single photon emission from a photonic quantum system based on quantum optical principles. article_number: '1387' author: - first_name: B. full_name: Jonas, B. last_name: Jonas - first_name: Dirk Florian full_name: Heinze, Dirk Florian id: '10904' last_name: Heinze - first_name: E. full_name: Schöll, E. last_name: Schöll - first_name: P. full_name: Kallert, P. last_name: Kallert - first_name: T. full_name: Langer, T. last_name: Langer - first_name: S. full_name: Krehs, S. last_name: Krehs - first_name: A. full_name: Widhalm, A. last_name: Widhalm - first_name: Klaus full_name: Jöns, Klaus id: '85353' last_name: Jöns - first_name: Dirk full_name: Reuter, Dirk id: '37763' last_name: Reuter - first_name: Stefan full_name: Schumacher, Stefan id: '27271' last_name: Schumacher orcid: 0000-0003-4042-4951 - first_name: Artur full_name: Zrenner, Artur id: '606' last_name: Zrenner orcid: 0000-0002-5190-0944 citation: ama: Jonas B, Heinze DF, Schöll E, et al. Nonlinear down-conversion in a single quantum dot. Nature Communications. 2022;13(1). doi:10.1038/s41467-022-28993-3 apa: Jonas, B., Heinze, D. F., Schöll, E., Kallert, P., Langer, T., Krehs, S., Widhalm, A., Jöns, K., Reuter, D., Schumacher, S., & Zrenner, A. (2022). Nonlinear down-conversion in a single quantum dot. Nature Communications, 13(1), Article 1387. https://doi.org/10.1038/s41467-022-28993-3 bibtex: '@article{Jonas_Heinze_Schöll_Kallert_Langer_Krehs_Widhalm_Jöns_Reuter_Schumacher_et al._2022, title={Nonlinear down-conversion in a single quantum dot}, volume={13}, DOI={10.1038/s41467-022-28993-3}, number={11387}, journal={Nature Communications}, publisher={Springer Science and Business Media LLC}, author={Jonas, B. and Heinze, Dirk Florian and Schöll, E. and Kallert, P. and Langer, T. and Krehs, S. and Widhalm, A. and Jöns, Klaus and Reuter, Dirk and Schumacher, Stefan and et al.}, year={2022} }' chicago: Jonas, B., Dirk Florian Heinze, E. Schöll, P. Kallert, T. Langer, S. Krehs, A. Widhalm, et al. “Nonlinear Down-Conversion in a Single Quantum Dot.” Nature Communications 13, no. 1 (2022). https://doi.org/10.1038/s41467-022-28993-3. ieee: 'B. Jonas et al., “Nonlinear down-conversion in a single quantum dot,” Nature Communications, vol. 13, no. 1, Art. no. 1387, 2022, doi: 10.1038/s41467-022-28993-3.' mla: Jonas, B., et al. “Nonlinear Down-Conversion in a Single Quantum Dot.” Nature Communications, vol. 13, no. 1, 1387, Springer Science and Business Media LLC, 2022, doi:10.1038/s41467-022-28993-3. short: B. Jonas, D.F. Heinze, E. Schöll, P. Kallert, T. Langer, S. Krehs, A. Widhalm, K. Jöns, D. Reuter, S. Schumacher, A. Zrenner, Nature Communications 13 (2022). date_created: 2023-01-27T13:41:42Z date_updated: 2023-04-20T15:18:31Z department: - _id: '15' - _id: '297' - _id: '230' - _id: '429' - _id: '27' - _id: '623' - _id: '170' - _id: '35' doi: 10.1038/s41467-022-28993-3 intvolume: ' 13' issue: '1' keyword: - General Physics and Astronomy - General Biochemistry - Genetics and Molecular Biology - General Chemistry - Multidisciplinary language: - iso: eng project: - _id: '53' name: 'TRR 142: TRR 142' - _id: '54' name: 'TRR 142 - A: TRR 142 - Project Area A' - _id: '60' name: 'TRR 142 - A03: TRR 142 - Subproject A03' - _id: '52' name: 'PC2: Computing Resources Provided by the Paderborn Center for Parallel Computing' publication: Nature Communications publication_identifier: issn: - 2041-1723 publication_status: published publisher: Springer Science and Business Media LLC status: public title: Nonlinear down-conversion in a single quantum dot type: journal_article user_id: '16199' volume: 13 year: '2022' ...