{"user_id":"61189","citation":{"apa":"Wang, H., Thiagaraj, P., & Sinnen, O. (2019). Combining Multiple Optimized FPGA-based Pulsar Search Modules Using OpenCL. Journal of Astronomical Instrumentation. https://doi.org/10.1142/s2251171719500089","ama":"Wang H, Thiagaraj P, Sinnen O. Combining Multiple Optimized FPGA-based Pulsar Search Modules Using OpenCL. Journal of Astronomical Instrumentation. 2019. doi:10.1142/s2251171719500089","mla":"Wang, Haomiao, et al. “Combining Multiple Optimized FPGA-Based Pulsar Search Modules Using OpenCL.” Journal of Astronomical Instrumentation, 1950008, 2019, doi:10.1142/s2251171719500089.","ieee":"H. Wang, P. Thiagaraj, and O. Sinnen, “Combining Multiple Optimized FPGA-based Pulsar Search Modules Using OpenCL,” Journal of Astronomical Instrumentation, 2019.","short":"H. Wang, P. Thiagaraj, O. Sinnen, Journal of Astronomical Instrumentation (2019).","chicago":"Wang, Haomiao, Prabu Thiagaraj, and Oliver Sinnen. “Combining Multiple Optimized FPGA-Based Pulsar Search Modules Using OpenCL.” Journal of Astronomical Instrumentation, 2019. https://doi.org/10.1142/s2251171719500089.","bibtex":"@article{Wang_Thiagaraj_Sinnen_2019, title={Combining Multiple Optimized FPGA-based Pulsar Search Modules Using OpenCL}, DOI={10.1142/s2251171719500089}, number={1950008}, journal={Journal of Astronomical Instrumentation}, author={Wang, Haomiao and Thiagaraj, Prabu and Sinnen, Oliver}, year={2019} }"},"date_created":"2020-04-06T12:00:06Z","_id":"16420","title":"Combining Multiple Optimized FPGA-based Pulsar Search Modules Using OpenCL","keyword":["pc2-harp-ressources"],"publication_identifier":{"issn":["2251-1717","2251-1725"]},"abstract":[{"text":" Field-Programmable Gate Arrays (FPGAs) are widely used in the central signal processing design of the Square Kilometer Array (SKA) as hardware accelerators. The frequency domain acceleration search (FDAS) module is an important part of the SKA1-MID pulsar search engine. To develop for a yet to be finalized hardware, for cross-discipline interoperability and to achieve fast prototyping, OpenCL as a high-level FPGA synthesis approaches employed to create the sub-modules of FDAS. The FT convolution and the harmonic-summing plus some other minor sub-modules are elements in the FDAS module that have been well-optimized separately before. In this paper, we explore the design space of combining well-optimized designs, dealing with the ensuing need to trade-off and compromise. Pipeline computing is employed to handle multiple input arrays at high speed. The hardware target is to employ multiple high-end FPGAs to process the combined FDAS module. The results show interesting consequences, where the best individual solutions are not necessarily the best solutions for the speed of a pipeline where FPGA resources and memory bandwidth need to be shared. By proposing multiple buffering techniques to the pipeline, the combined FDAS module can achieve up to 2[Formula: see text] speedup over implementations without pipeline computing. We perform an extensive experimental evaluation on multiple high-end FPGA cards hosted in a workstation and compare to a technology comparable mid-range GPU. ","lang":"eng"}],"author":[{"full_name":"Wang, Haomiao","last_name":"Wang","first_name":"Haomiao"},{"first_name":"Prabu","last_name":"Thiagaraj","full_name":"Thiagaraj, Prabu"},{"first_name":"Oliver","last_name":"Sinnen","full_name":"Sinnen, Oliver"}],"status":"public","year":"2019","date_updated":"2022-01-06T06:52:50Z","type":"journal_article","publication":"Journal of Astronomical Instrumentation","article_number":"1950008","doi":"10.1142/s2251171719500089","publication_status":"published","language":[{"iso":"eng"}]}