Population-scale whole genome sequencing identifies 271 highly polymorphic short tandem repeats from Japanese population

Satoshi Hirata, Kaname Kojima, Kazuharu Misawa, Olivier Gervais, Yosuke Kawai, Masao Nagasaki

Research output: Contribution to journalArticlepeer-review


Forensic DNA typing is widely used to identify missing persons and plays a central role in forensic profiling. DNA typing usually uses capillary electrophoresis fragment analysis of PCR amplification products to detect the length of short tandem repeat (STR) markers. Here, we analyzed whole genome data from 1,070 Japanese individuals generated using massively parallel short-read sequencing of 162 paired-end bases. We have analyzed 843,473 STR loci with two to six basepair repeat units and cataloged highly polymorphic STR loci in the Japanese population. To evaluate the performance of the cataloged STR loci, we compared 23 STR loci, widely used in forensic DNA typing, with capillary electrophoresis based STR genotyping results in the Japanese population. Seventeen loci had high correlations and high call rates. The other six loci had low call rates or low correlations due to either the limitations of short-read sequencing technology, the bioinformatics tool used, or the complexity of repeat patterns. With these analyses, we have also purified the suitable 218 STR loci with four basepair repeat units and 53 loci with five basepair repeat units both for short read sequencing and PCR based technologies, which would be candidates to the actual forensic DNA typing in Japanese population.

Original languageEnglish
Article numbere00625
Issue number5
Publication statusPublished - 2018 May


  • Genetics

ASJC Scopus subject areas

  • General


Dive into the research topics of 'Population-scale whole genome sequencing identifies 271 highly polymorphic short tandem repeats from Japanese population'. Together they form a unique fingerprint.

Cite this