Implications of memory performance for highly efficient supercomputing of scientific applications

Akihiro Musa, Hiroyuki Takizawa, Koki Okabe, Takashi Soga, Hiroaki Kobayashi

研究成果: Conference contribution

3 被引用数 (Scopus)

抄録

This paper examines the memory performance of the vectorparallel and scalar-parallel computing platforms across five applications of three scientific areas; electromagnetic analysis, CFD/heat analysis, and seismology. Our evaluation results show that the vector platforms can achieve the high computational efficiency and hence significantly outperform the scalar platforms in the areas of these applications. We did exhaustive experiments and quantitatively evaluated representative scalar and vector platforms using real applications from the viewpoint of the system designers and developers. These results demonstrate that the ratio of memory bandwidth to floating-point operation rate needs to reach 4-bytes/flop to preserve the computational performance with hiding the memory access latencies by pipelined vector operations in the vector platforms. We also confirm that the enough number of memory banks to handle stride memory accesses leads to an increase in the execution efficiency. On the scalar platforms, the cache hit rate needs to be almost 100% to achieve the high computational efficiency.

本文言語English
ホスト出版物のタイトルParallel and Distributed Processing and Applications - 4th International Symposium, ISPA 2006, Proceedings
編集者Feilong Tang, Minyi Guo, Beniamino Di Martino, Hans P. Zima, Hans P. Zima, Laurence T Yang, Jack Dongarra
出版社Springer-Verlag
ページ845-858
ページ数14
ISBN(印刷版)9783540680673
出版ステータスPublished - 2006 1月 1
イベント4th International Symposium on Parallel and Distributed Processing and Applications, ISPA 2006 - Sorrento, Italy
継続期間: 2006 12月 42006 12月 6

出版物シリーズ

名前Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
4330
ISSN(印刷版)0302-9743
ISSN(電子版)1611-3349

Other

Other4th International Symposium on Parallel and Distributed Processing and Applications, ISPA 2006
国/地域Italy
CitySorrento
Period06/12/406/12/6

ASJC Scopus subject areas

  • 理論的コンピュータサイエンス
  • コンピュータ サイエンス(全般)

フィンガープリント

「Implications of memory performance for highly efficient supercomputing of scientific applications」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル