TY - GEN
T1 - Comparison of Direct and Indirect Networks for High-Performance FPGA Clusters
AU - Mondigo, Antoniette
AU - Ueno, Tomohiro
AU - Sano, Kentaro
AU - Takizawa, Hiroyuki
N1 - Publisher Copyright:
© 2020, Springer Nature Switzerland AG.
PY - 2020
Y1 - 2020
N2 - As field programmable gate arrays (FPGAs) become a favorable choice in exploring new computing architectures for the post-Moore era, a flexible network architecture for scalable FPGA clusters becomes increasingly important in high performance computing (HPC). In this paper, we introduce a scalable platform of indirectly-connected FPGAs, where its Ethernet-switching network allows flexibly customized inter-FPGA connectivity. However, for certain applications such as in stream computing, it is necessary to establish a connection-oriented datapath with backpressure between FPGAs. Due to the lack of physical backpressure channel in the network, we utilized our existing credit-based network protocol with flow control to provide receiver FPGA awareness and tailored it to minimize overall communication overhead for the proposed framework. To know its performance characteristics, we implemented necessary data transfer hardware on Intel Arria 10 FPGAs, modeled and obtained its communication performance, and compared it to a direct network. Results show that our proposed indirect framework achieves approximately 3% higher effective network bandwidth than our existing direct inter-FPGA network, which demonstrates good performance and scalability for large HPC applications.
AB - As field programmable gate arrays (FPGAs) become a favorable choice in exploring new computing architectures for the post-Moore era, a flexible network architecture for scalable FPGA clusters becomes increasingly important in high performance computing (HPC). In this paper, we introduce a scalable platform of indirectly-connected FPGAs, where its Ethernet-switching network allows flexibly customized inter-FPGA connectivity. However, for certain applications such as in stream computing, it is necessary to establish a connection-oriented datapath with backpressure between FPGAs. Due to the lack of physical backpressure channel in the network, we utilized our existing credit-based network protocol with flow control to provide receiver FPGA awareness and tailored it to minimize overall communication overhead for the proposed framework. To know its performance characteristics, we implemented necessary data transfer hardware on Intel Arria 10 FPGAs, modeled and obtained its communication performance, and compared it to a direct network. Results show that our proposed indirect framework achieves approximately 3% higher effective network bandwidth than our existing direct inter-FPGA network, which demonstrates good performance and scalability for large HPC applications.
KW - Direct network
KW - FPGA cluster
KW - Flexibility
KW - Flow control
KW - Indirect network
KW - Scalability
UR - http://www.scopus.com/inward/record.url?scp=85083031692&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85083031692&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-44534-8_24
DO - 10.1007/978-3-030-44534-8_24
M3 - Conference contribution
AN - SCOPUS:85083031692
SN - 9783030445331
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 314
EP - 329
BT - Applied Reconfigurable Computing. Architectures, Tools, and Applications - 16th International Symposium, ARC 2020, Proceedings
A2 - Rincón, Fernando
A2 - Barba, Jesús
A2 - Caba, Julián
A2 - So, Hayden K.H.
A2 - Diniz, Pedro
PB - Springer
T2 - 16th International Symposium on Applied Reconfigurable Computing, ARC 2020
Y2 - 1 April 2020 through 3 April 2020
ER -