Multi-grain parallel processing of data-clustering on programmable graphics hardware

Research output: Contribution to journalArticlepeer-review

3 Citations (Scopus)


This paper presents an effective scheme for clustering a huge data set using a commodity programmable graphics processing unit (GPU). Due to GPU's application-specific architecture, one of the current research issues is how to bind the rendering pipeline with the data-clustering process. By taking advantage of GPU's parallel processing capability, our implementation scheme is devised to exploit the multigrain single-instruction multiple-data (SIMD) parallelism of the nearest neighbor search, which is the most computationally-intensive part of the data-clustering process. The performance of our scheme is discussed in comparison with that of the implementation entirely running on CPU. Experimental results clearly show that the parallelism of the nearest neighbor search allows our scheme to efficiently execute the data-clustering process. Although data-transfer from GPU to CPU is generally costly, acceleration by GPU is significant to save the total execution time of data-clustering.

Original languageEnglish
Pages (from-to)16-27
Number of pages12
JournalLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Publication statusPublished - 2004 Dec 1

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)


Dive into the research topics of 'Multi-grain parallel processing of data-clustering on programmable graphics hardware'. Together they form a unique fingerprint.

Cite this