---
res:
  bibo_abstract:
  - N-body methods are one of the essential algorithmic building blocks of high-performance
    and parallel computing. Previous research has shown promising performance for
    implementing n-body simulations with pairwise force calculations on FPGAs. However,
    to avoid challenges with accumulation and memory access patterns, the presented
    designs calculate each pair of forces twice, along with both force sums of the
    involved particles. Also, they require large problem instances with hundreds of
    thousands of particles to reach their respective peak performance, limiting the
    applicability for strong scaling scenarios. This work addresses both issues by
    presenting a novel FPGA design that uses each calculated force twice and overlaps
    data transfers and computations in a way that allows it to reach peak performance
    even for small problem instances, outperforming previous single-precision results
    even in double precision, and scaling linearly over multiple interconnected FPGAs.
    For a comparison across architectures, we provide an equally optimized CPU reference,
    which for large problems actually achieves higher peak performance per device.
    However, given the strong scaling advantages of the FPGA design, in parallel setups
    with a few thousand particles per device, the FPGA platform achieves the highest
    performance and power efficiency.@eng
  bibo_authorlist:
  - foaf_Person:
      foaf_givenName: Johannes
      foaf_name: Menzel, Johannes
      foaf_surname: Menzel
  - foaf_Person:
      foaf_givenName: Christian
      foaf_name: Plessl, Christian
      foaf_surname: Plessl
      foaf_workInfoHomepage: http://www.librecat.org/personId=16153
    orcid: 0000-0001-5728-9982
  - foaf_Person:
      foaf_givenName: Tobias
      foaf_name: Kenter, Tobias
      foaf_surname: Kenter
      foaf_workInfoHomepage: http://www.librecat.org/personId=3145
  bibo_doi: 10.1145/3491235
  bibo_issue: '1'
  bibo_volume: 15
  dct_date: 2021^xs_gYear
  dct_isPartOf:
  - http://id.crossref.org/issn/1936-7406
  - http://id.crossref.org/issn/1936-7414
  dct_language: eng
  dct_title: The Strong Scaling Advantage of FPGAs in HPC for N-body Simulations@
...
