TY - JOUR
T1 - FREP
T2 - A database of functional repeats in mouse cDNAs
AU - Nagashima, Takeshi
AU - Matsuda, Hideo
AU - Silva, Diego G.
AU - Petrovsky, Nikolai
AU - Kasukawa, Takeya
AU - Arakawa, Takahiro
AU - Carninci, Piero
AU - Kawai, Jun
AU - Hayashizaki, Yoshihide
AU - Konagaya, Akihiko
AU - Schönbach, Christian
PY - 2004/1/1
Y1 - 2004/1/1
N2 - The FREP database (http://facts.gse.riken.go.jp/FREP/) contains 31 396 RepeatMasker-identified non-redundant variant repeat sequences derived from 16 527 mouse cDNAs with protein-coding potential. The repeats were computationally associated with potential effects on transcriptional variation, translation, protein function or involvement in disease to identify Functional REPeats (FREPs). FREPs are defined by the (i) occurrence of exon-exon boundaries in repeats, (ii) presence of poly-adenylation sites in 3′UTR-located repeats, (iii) effect on translation, (iv) position in the protein-coding region or protein domains or (v) conditional association with disease MeSH terms. Currently the database contains 9261 (29.5%) inferred FREPs derived from 6861 (41.5%) mouse cDNAs. Integrated evidence of the functional assignments and dynamically generated sequence similarity search results support the exploration and annotation of functional, ancestral or taxon-specific repeats. Keyword and pre-selected feature searches (e.g. coding sequence-repeat or splice site-repeat relations) support intuitive database querying as well as the retrieval of repeat sequences. Integrated sequence search and alignment tools allow the analysis of known or identification of new functional repeat candidates. FREP is a unique resource for illuminating the role of transposons and repetitive sequences in shaping the coding part of the mouse transcriptome and for selecting the appropriate experimental model to study diseases with suspected repeat etiology contributions.
AB - The FREP database (http://facts.gse.riken.go.jp/FREP/) contains 31 396 RepeatMasker-identified non-redundant variant repeat sequences derived from 16 527 mouse cDNAs with protein-coding potential. The repeats were computationally associated with potential effects on transcriptional variation, translation, protein function or involvement in disease to identify Functional REPeats (FREPs). FREPs are defined by the (i) occurrence of exon-exon boundaries in repeats, (ii) presence of poly-adenylation sites in 3′UTR-located repeats, (iii) effect on translation, (iv) position in the protein-coding region or protein domains or (v) conditional association with disease MeSH terms. Currently the database contains 9261 (29.5%) inferred FREPs derived from 6861 (41.5%) mouse cDNAs. Integrated evidence of the functional assignments and dynamically generated sequence similarity search results support the exploration and annotation of functional, ancestral or taxon-specific repeats. Keyword and pre-selected feature searches (e.g. coding sequence-repeat or splice site-repeat relations) support intuitive database querying as well as the retrieval of repeat sequences. Integrated sequence search and alignment tools allow the analysis of known or identification of new functional repeat candidates. FREP is a unique resource for illuminating the role of transposons and repetitive sequences in shaping the coding part of the mouse transcriptome and for selecting the appropriate experimental model to study diseases with suspected repeat etiology contributions.
UR - http://www.scopus.com/inward/record.url?scp=9144274362&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=9144274362&partnerID=8YFLogxK
M3 - Article
C2 - 14681460
AN - SCOPUS:9144274362
VL - 32
SP - D471-D475
JO - Nucleic Acids Research
JF - Nucleic Acids Research
SN - 0305-1048
IS - DATABASE ISS.
ER -