TY - JOUR
T1 - Characterization of transcription start sites of putative non-coding RNAs by multifaceted use of massively paralleled sequencer
AU - Sathira, Nuankanya
AU - Yamashita, Riu
AU - Tanimoto, Kousuke
AU - Kanai, Akinori
AU - Arauchi, Takako
AU - Kanematsu, Soutaro
AU - Nakai, Kenta
AU - Suzuki, Yutaka
AU - Sugano, Sumio
PY - 2010/6
Y1 - 2010/6
N2 - On the basis of integrated transcriptome analysis, we show that not all transcriptional start site clusters (TSCs) in the intergenic regions (iTSCs) have the same properties; thus, it is possible to discriminate the iTSCs that are likely to have biological relevance from the other noise-level iTSCs. We used a total of 251 933 381 short-read sequence tags generated from various types of transcriptome analyses in order to characterize 6039 iTSCs, which have significant expression levels. We analyzed and found that 23 of these iTSCs were located in the proximal regions of the RefSeq genes. These RefSeq-linked iTSCs showed similar expression patterns with the neighboring RefSeq genes, had widely fluctuating transcription start sites and lacked ordered nucleosome positioning. These iTSCs seemed not to form independent transcriptional units, simply representing the by-products of the neighboring RefSeq genes, in spite of their significant expression levels. Similar features were also observed for the TSCs located in the antisense regions of the RefSeq genes. Furthermore, for the remaining iTSCs that were not associated with any RefSeq genes, we demonstrate that integrative interpretation of the transcriptome data provides essential information to specify their biological functions in the hypoxic responses of the cells.
AB - On the basis of integrated transcriptome analysis, we show that not all transcriptional start site clusters (TSCs) in the intergenic regions (iTSCs) have the same properties; thus, it is possible to discriminate the iTSCs that are likely to have biological relevance from the other noise-level iTSCs. We used a total of 251 933 381 short-read sequence tags generated from various types of transcriptome analyses in order to characterize 6039 iTSCs, which have significant expression levels. We analyzed and found that 23 of these iTSCs were located in the proximal regions of the RefSeq genes. These RefSeq-linked iTSCs showed similar expression patterns with the neighboring RefSeq genes, had widely fluctuating transcription start sites and lacked ordered nucleosome positioning. These iTSCs seemed not to form independent transcriptional units, simply representing the by-products of the neighboring RefSeq genes, in spite of their significant expression levels. Similar features were also observed for the TSCs located in the antisense regions of the RefSeq genes. Furthermore, for the remaining iTSCs that were not associated with any RefSeq genes, we demonstrate that integrative interpretation of the transcriptome data provides essential information to specify their biological functions in the hypoxic responses of the cells.
KW - antisense transcript
KW - integrated transcriptome analysis
KW - intergenic transcript
KW - non-coding RNA
KW - transcriptional start site cluster (TSC)
UR - http://www.scopus.com/inward/record.url?scp=77955953730&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77955953730&partnerID=8YFLogxK
U2 - 10.1093/dnares/dsq007
DO - 10.1093/dnares/dsq007
M3 - Article
C2 - 20400770
AN - SCOPUS:77955953730
VL - 17
SP - 169
EP - 183
JO - DNA Research
JF - DNA Research
SN - 1340-2838
IS - 3
ER -