TY - JOUR
T1 - Diversification of transcriptional modulation
T2 - Large-scale identification and characterization of putative alternative promoters of human genes
AU - Kimura, Kouichi
AU - Wakamatsu, Ai
AU - Suzuki, Yutaka
AU - Ota, Toshio
AU - Nishikawa, Tetsuo
AU - Yamashita, Riu
AU - Yamamoto, Jun Ichi
AU - Sekine, Mitsuo
AU - Tsuritani, Katsuki
AU - Wakaguri, Hiroyuki
AU - Ishii, Shizuko
AU - Sugiyama, Tomoyasu
AU - Saito, Kaoru
AU - Isono, Yuko
AU - Irie, Ryotaro
AU - Kushida, Norihiro
AU - Yoneyama, Takahiro
AU - Otsuka, Rie
AU - Kanda, Katsuhiro
AU - Yokoi, Takahide
AU - Kondo, Hiroshi
AU - Wagatsuma, Masako
AU - Murakawa, Katsuji
AU - Ishida, Shinichi
AU - Ishibashi, Tadashi
AU - Takahashi-Fujii, Asako
AU - Tanase, Tomoo
AU - Nagai, Keiichi
AU - Kikuchi, Hisashi
AU - Nakai, Kenta
AU - Isogai, Takao
AU - Sugano, Sumio
PY - 2006/1
Y1 - 2006/1
N2 - By analyzing 1,780,295 5′-end sequences of human full-length cDNAs derived from 164 kinds of oligo-cap cDNA libraries, we identified 269,774 independent positions of transcriptional start sites (TSSs) for 14,628 human RefSeq genes. These TSSs were clustered into 30,964 clusters that were separated from each other by more than 500 bp and thus are very likely to constitute mutually distinct alternative promoters. To our surprise, at least 7674 (52%) human RefSeq genes were subject to regulation by putative alternative promoters (PAPs). On average, there were 3.1 PAPs per gene, with the composition of one CpG-island-containing promoter per 2.6 CpG-less promoters. In 17% of the PAP-containing loci, tissue-specific use of the PAPs was observed. The richest tissue sources of the tissue-specific PAPs were testis and brain. It was also intriguing that the PAP-containing promoters were enriched in the genes encoding signal transduction-related proteins and were rarer in the genes encoding extracellular proteins, possibly reflecting the varied functional requirement for and the restricted expression of those categories of genes, respectively. The patterns of the first exons were highly diverse as well. On average, there were 7.7 different splicing types of first exons per locus partly produced by the PAPs, suggesting that a wide variety of transcripts can be achieved by this mechanism. Our findings suggest that use of alternate promoters and consequent alternative use of first exons should play a pivotal role in generating the complexity required for the highly elaborated molecular systems in humans.
AB - By analyzing 1,780,295 5′-end sequences of human full-length cDNAs derived from 164 kinds of oligo-cap cDNA libraries, we identified 269,774 independent positions of transcriptional start sites (TSSs) for 14,628 human RefSeq genes. These TSSs were clustered into 30,964 clusters that were separated from each other by more than 500 bp and thus are very likely to constitute mutually distinct alternative promoters. To our surprise, at least 7674 (52%) human RefSeq genes were subject to regulation by putative alternative promoters (PAPs). On average, there were 3.1 PAPs per gene, with the composition of one CpG-island-containing promoter per 2.6 CpG-less promoters. In 17% of the PAP-containing loci, tissue-specific use of the PAPs was observed. The richest tissue sources of the tissue-specific PAPs were testis and brain. It was also intriguing that the PAP-containing promoters were enriched in the genes encoding signal transduction-related proteins and were rarer in the genes encoding extracellular proteins, possibly reflecting the varied functional requirement for and the restricted expression of those categories of genes, respectively. The patterns of the first exons were highly diverse as well. On average, there were 7.7 different splicing types of first exons per locus partly produced by the PAPs, suggesting that a wide variety of transcripts can be achieved by this mechanism. Our findings suggest that use of alternate promoters and consequent alternative use of first exons should play a pivotal role in generating the complexity required for the highly elaborated molecular systems in humans.
UR - http://www.scopus.com/inward/record.url?scp=30044442791&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=30044442791&partnerID=8YFLogxK
U2 - 10.1101/gr.4039406
DO - 10.1101/gr.4039406
M3 - Article
C2 - 16344560
AN - SCOPUS:30044442791
SN - 1088-9051
VL - 16
SP - 55
EP - 65
JO - Genome Research
JF - Genome Research
IS - 1
ER -