ParaHaplo 3.0: A program package for imputation and a haplotype-based whole-genome association study using hybrid parallel computing

Kazuharu Misawa, Naoyuki Kamatani

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

Background: Use of missing genotype imputations and haplotype reconstructions are valuable in genome-wide association studies (GWASs). By modeling the patterns of linkage disequilibrium in a reference panel, genotypes not directly measured in the study samples can be imputed and used for GWASs. Since millions of single nucleotide polymorphisms need to be imputed in a GWAS, faster methods for genotype imputation and haplotype reconstruction are required.Results: We developed a program package for parallel computation of genotype imputation and haplotype reconstruction. Our program package, ParaHaplo 3.0, is intended for use in workstation clusters using the Intel Message Passing Interface. We compared the performance of ParaHaplo 3.0 on the Japanese in Tokyo, Japan and Han Chinese in Beijing, and Chinese in the HapMap dataset. A parallel version of ParaHaplo 3.0 can conduct genotype imputation 20 times faster than a non-parallel version of ParaHaplo.Conclusions: ParaHaplo 3.0 is an invaluable tool for conducting haplotype-based GWASs. The need for faster genotype imputation and haplotype reconstruction using parallel computing will become increasingly important as the data sizes of such projects continue to increase. ParaHaplo executable binaries and program sources are available at http://en.sourceforge.jp/projects/parallelgwas/releases/.

Original languageEnglish
Article number10
JournalSource Code for Biology and Medicine
Volume6
DOIs
Publication statusPublished - 2011 May 24

Keywords

  • GWAS
  • Genotype imputation
  • HapMap
  • Haplotype reconstruction
  • ParaHaplo
  • Parallel computing

ASJC Scopus subject areas

  • Information Systems
  • Computer Science Applications
  • Health Informatics
  • Information Systems and Management

Fingerprint Dive into the research topics of 'ParaHaplo 3.0: A program package for imputation and a haplotype-based whole-genome association study using hybrid parallel computing'. Together they form a unique fingerprint.

  • Cite this