Skip to main content
Database: The Journal of Biological Databases and Curation logoLink to Database: The Journal of Biological Databases and Curation
. 2013 Feb 7;2013:bat001. doi: 10.1093/database/bat001

pseudoMap: an innovative and comprehensive resource for identification of siRNA-mediated mechanisms in human transcribed pseudogenes

Wen-Ling Chan 1,2, Wen-Kuang Yang 3,4, Hsien-Da Huang 1,2,*, Jan-Gowth Chang 5,6,7,*
PMCID: PMC3567485  PMID: 23396300

Abstract

RNA interference (RNAi) is a gene silencing process within living cells, which is controlled by the RNA-induced silencing complex with a sequence-specific manner. In flies and mice, the pseudogene transcripts can be processed into short interfering RNAs (siRNAs) that regulate protein-coding genes through the RNAi pathway. Following these findings, we construct an innovative and comprehensive database to elucidate siRNA-mediated mechanism in human transcribed pseudogenes (TPGs). To investigate TPG producing siRNAs that regulate protein-coding genes, we mapped the TPGs to small RNAs (sRNAs) that were supported by publicly deep sequencing data from various sRNA libraries and constructed the TPG-derived siRNA-target interactions. In addition, we also presented that TPGs can act as a target for miRNAs that actually regulate the parental gene. To enable the systematic compilation and updating of these results and additional information, we have developed a database, pseudoMap, capturing various types of information, including sequence data, TPG and cognate annotation, deep sequencing data, RNA-folding structure, gene expression profiles, miRNA annotation and target prediction. As our knowledge, pseudoMap is the first database to demonstrate two mechanisms of human TPGs: encoding siRNAs and decoying miRNAs that target the parental gene. pseudoMap is freely accessible at http://pseudomap.mbc.nctu.edu.tw/.

Database URL: http://pseudomap.mbc.nctu.edu.tw/

Introduction

Pseudogenes are genomic DNA sequences homologous to functional genes yet are not translated into proteins (1). Although pseudogenes are often considered the structurally defective non-functional copies of protein-coding genes, the human genome comprises more numbers of pseudogenes than corresponding functional genes (2). Despite the previous assumption of pseudogenes as genomic fossils, the genome-wide investigations have demonstrated actively transcribed pseudogenes (TPGs) with functional potential (3–12). For instant, TPG of nitric oxide synthase (ψNOS) acts as an antisense regulator of neuronal NOS protein synthesis in snails (13, 14). Another study has established that binding of transcriptional repressor to receptor of ψmakorin1-p1 could activate the homologous parental gene Mkrn1 (15), despite contradictory result was also reported (16). In addition, the TPG of PTENP1PTEN), a highly conserved processed pseudogene of tumour suppressor PTEN, acts as a miRNA-decoy by binding to PTEN-targeting miRNAs (17). Moreover, human pseudogene myosin light chain kinase pseudogene 1 is partially duplicated from the original MYLK gene and promotes cancer cell proliferation (18). Above findings clearly suggest that the non-coding RNA products of TPGs may play an important role in biogenesis pathway and functional processes.

The RNA interference (RNAi) is an important component of the RNA modulation pathway and is incorporated into the RNA-induced silencing complex (RISC) with a sequence-specific manner (19). In mice and fruit flies, double-stranded RNAs arising from the antisense/sense transcripts of processed pseudogene, and its cognate gene, or hairpin structures from inversion and duplication, are cut by Dicer into 21 nt endogenous short interfering RNAs (esiRNAs) with the ability to bind RISC and regulate the expression of parental gene (20–25). Such regulatory mechanism in human remains unclear.

To demonstrate that in human, as in animal models, TPGs may generate naturally occurring siRNAs and Piwi interacting RNAs (piRNAs) to regulate the expression of protein-coding genes, we have developed a computational pipeline and constructed a database-pseudoMap, the map for studying pseudogenes. pseudoMap pre-processes the raw data of public microarray and deep sequencing data into gene expression profiles for both TPG and its cognate gene and small RNA (sRNA) profiles for TPG-derived esiRNAs. pseudoMap further combined the gene expression profiles to construct the TPG-derived esiRNA-target interactions (eSTIs). In addition, according to the previous study of pseudogene, PTENP1 exerts a miRNA decoy by binding to cognate-targeting miRNAs (17), and pseudoMap also provided the ‘miRNA regulator’ to elucidate the relationship of TPG and its cognate gene with miRNA target regulation.

Data generation

In total, more than 20 000 human pseudogenes and their cognate genes were obtained from the Ensembl Genome Browser (Ensembl 63, GRCH37) (26) using BioMart (http://www.ensembl.org/index.html). Affymetrix GeneChip® Human Genome U133A/U133Plus2 is a microarray composed of oligonucleotide probes to measure the level of transcription of each sequence represented, which included transcribed pseudogenes. 1404 pseudogenes have been detectable by this chip, thus considered being transcribed and referred as TPGs. Functional sRNAs (fsRNAs) with sequence length between 18 to 40 nt were collected from the Functional RNA Database (27), which hosts a large collection of known/predicted non-coding RNA sequences from public databases: H-invDB v5.0 (6), FANTOM3 (28), miRBase 17.0 (29, 30), NONCODE v1.0 (31), Rfam v8.1 (32), RNAdb v2.0 (33) and snoRNA-LBME-db rel. 3 (34). The public deep sequencing data from sRNA libraries (35–38) were experimented with on human embryo stem cells, liver tissues or hepatocellular carcinoma (HCC) tissues. Supplementary Table 1 summarizes the statistics of the deep sequencing data from various sRNA libraries. The genomic sequences were obtained from UCSC hg19 (39). Table 1 lists the integrated databases and tools for mining potential regulators and functions of human TPGs.

Table 1.

Supported databases and tools in pseudoMap

Integrated database or tools Dataset Description
miRBase (29, 30) miRNA annotation This database not only provides published miRNA sequences and annotations but also supplies known/predict targets
Functional RNA Database (27) sRNA annotation A database to support mining and annotation of functional RNAs
Ensembl Genome Browser (26) Pseudogene, protein-coding gene It produces genome databases for vertebrates and other eukaryotic species
UCSC Genome Browser (39) Conserved region and Genomic view of genes This browser provides a rapid and reliable display of any requested portion of genomes at any scale, together with dozens of aligned annotation tracks
GeneCards (52) Gene annotation GeneCards is a searchable, integrated, database of human genes that provides concise genomic-related information of all known and predicted human genes
Mfold (40) RNA folding tool Folding RNA structure
GEO (47) Gene expression profiles and deep sequencing data A public functional genomics data
BLAST (51) Sequence alignment tool BLAST finds regions of similarity between biological sequences

System flow of pseudoMap

The system flow of pseudoMap is shown in Figure 1, mainly including the collection of datasets such as TPGs, parental genes, fsRNAs, sRNA deep sequencing data, expression profiles, integration of various tools and identification of functions and regulations of TPGs. Based on a genome-wide computational pipeline of sequence-alignment approaches, this work constructed pseudoMap database for elucidation of two major discoveries: TPG-derived eSTI and miRNA-decoy mechanism of TPGs. The detailed analyses are described below.

Figure 1.

Figure 1

System flow of pseudoMap. The system flow of pseudoMap mainly includes the collection of datasets such as TPGs, parental genes, miRNAs, piRNAs, sRNA deep sequencing data and expression profiles; integration of various tools and identification of functions and regulations of TPGs. Based on a genome-wide computational pipeline of sequence-alignment approaches and gene expression profiles, this work constructed pseudoMap database for elucidation of two major discoveries: TPG-derived esiRNA-target interaction and miRNA-decoy mechanism of TPGs.

Identification of TPG-derived esiRNAs by public next-generation sequencing data

A computational pipeline was developed to verify the hypotheses that human TPGs may generate esiRNAs to regulate protein-coding genes (Figure 2). An attempt was made to identify the candidates of TPG-derived esiRNAs, by aligning the sequences of TPGs and fsRNAs. These candidates were verified using the deep sequencing data from various sRNA libraries (35–38) experimented with on human embryo stem cells, liver tissues or HCC tissues. The hairpin structure by Mfold (40) was then determined by using the extended sequences of these candidates of esiRNAs. In pseudoMap, a total of 1232 TPGs may produce esiRNAs, which were profiling by deep sequencing data, within 1404 human TPGs were characterized. The information of these TPGs is shown in Supplementary File 1. The results showed that 4 miRNAs and 326 piRNAs may derive from TPGs. We also found that miRNA has-miR-622 was identified, which was derived from keratin 18 pseudogene 27, located on nt 858879, as similar as miRBase database. Table 2 summarizes the entire statistical analysis of pseudoMap.

Figure 2.

Figure 2

Computational pipeline for identification of TPG-derived esiRNA-target interactions.

Table 2.

Summarizes the entire statistical analysis of pseudoMap

Dataset Counts
No. of miRNA regulators 5771/1014a
No. of TPG-derived miRNAs 4
No. of TPG-derived piRNAs 326
Deep sequencing data for profiling TPG-derived esiRNAs
    Human embryo stem cell—hB 247
    Human embryo stem cell—hESC 553
    Human embryo stem cell—hues6 190
    Human embryo stem cell—hues6NP 81
    Human embryo stem cell—hues6Neuron 16
    HBV(+) adjacent tissue sample 1 917
    HBV(+) adjacent tissue sample 2 4377
    HBV(+) distal tissue sample 1 1011
    HBV(+) HCC tissue sample 1 1281
    HBV(+) HCC tissue sample 2 2649
    HBV-infected liver tissue 3056
    HBV(+) side tissue sample 1 1087
    HCV(+) adjacent tissue sample 14 297
    HCV(+) HCC tissue sample 9277
    HBV(−) HCV(−) adjacent tissue sample 2324
    HBV(−) HCV(−) HCC tissue sample 6579
    Human normal liver tissue sample 1 1220
    Human normal liver tissue sample 2 1290
    Human normal liver tissue sample 3 1209
    Severe chronic hepatitis B liver tissue 1247

a1014 distinct miRNAs involved in 5771 miRNA regulators.

Identification of TPG-derived esiRNA-target interactions

Our previous approach (41) was modified to identify TPG-derived esiRNA targets. Briefly, the esiRNA target sites within the conserved regions of coding, 5’-UTR and 3’-UTR of genes were identified in 12 metazoan genomes by using three computational approaches, TargetScan (42–44), miRanda (45) and RNAhybrid (46). The minima free energy (MFE) threshold was −20 kcal/mol with a score more than or equal to 150 for miRanda and default parameters for TargetScan and RNAhybrid. The targets were identified using the following criteria: (i) the potential target sites were determined by at least two approaches; (ii) multiple target sites were prioritized and (iii) target sites must be located in accessible regions. Finally, we provided the gene expression profiles of TPG and its cognate gene to construct the eSTIs.

Gene expression analysis

The mRNA abundances of TPGs and protein-coding genes were obtained from Gene Expression Omnibus (47), such as GDS596 examined from 79 human physiologically normal tissues (48), GSE2109 examined from 2158 samples with 61 tumour tissues, GSE3526 examined from 353 samples with 65 normal tissues (49) and GSE5364 examined from primary human tumours and adjacent non-tumour tissues, which include 270 tumours and 71 normal-cancer pairs from patients with breast, colon, liver, lung, oesophagal and thyroid cancers (50). Moreover, the Pearson correlation coefficient was computed from TPGs and protein-coding genes.

Determination of miRNA-target interactions

According to the study by Poliseno et al. (17), pseudogenes PTENP1 and KRAS1P act as a ‘miRNA decoy’, binding to and thereby reducing the effective cellular concentration of miRNAs, therefore resulting their cognate genes to escape miRNA-mediated repression. In this study, we analyse the relationships between TPG and its cognate gene with miRNA decoys mechanism to examine miRNA-target interactions (MTIs) by performing a pipeline. First, the parental genes were obtained by mapping the TPGs and genomic sequences with the BLAST (51) program. The MTIs with TPGs and parental genes were then investigated using our previous approach (41). The MFE threshold was −20 kcal/mol with a score more than or equal to 150 for miRanda and default parameters for TargetScan and RNAhybrid. Finally, the TPGs and their cognates co-regulated by miRNAs were obtained. The miRNA and 3’UTR sequences were obtained from miRBase R18 (29, 30) and Ensembl Genome Browser release 63 (26), respectively. Analysis results indicated that 874 miRNAs with MFE ≤ −20 and Score ≥ 150 interact with many possible target sites in 248 TPGs and their cognate genes and might potentially co-regulate this pair of TPG and parental gene (Supplementary File 2).

Web interface

As a web-based system, pseudoMap can thoroughly identify TPGs, including TPGs act as a miRNA regulators and TPGs-derived eSTIs in humans. There are two ways to access pseudoMap: by browsing the database content or by searching for a particular TPG. Figure 3A displays the interface of output results of the browse gateway. The interface contains general information of TPGs, the relationships of TPG and its cognate gene with miRNA-mediated repression termed as ‘miRNA Regulator’, TPG-derived eSTIs named as ‘esiRNA’, and ‘Expression’ showed the gene expression profiles. Figure 3B provides a detailed view of miRNA regulator, which displays more fine-grained information. Above results indicated the relationships between TPGs and cognate genes by a miRNA decoy mechanism such as that observed by Poliseno et al. (17). The ‘Expression’ presents the gene expression profiles of not only distinct TPG and corresponding parental gene but also TPG referenced by cognate in various experimental conditions (Figure 3C). Moreover, the view of esiRNA indicates the TPG-derived esiRNAs and graphical display of deep sequencing data (Figure 3D). The red line represents the TPG and the blue line refers to esiRNAs. We also estimate the eSTIs (Figure 3E) and the RNA folding structure of TPG-derived esiRNA (Figure 3F). All the results and sequences can be downloaded for further experimental tests. In pseudoMap, we also incorporate the external sources, such as UCSC genome browser (39) for a genomic view, GeneCards (52) for gene annotation and miRBase (29) for miRNA annotation (Figure 3G). In addition, pseudoMap also consists of a tutorial and knowledge of pseudogenes.

Figure 3.

Figure 3

Web interface of pseudoMap. (A) Browse interface of pseudoMap illustrates general information of TPGs, miRNA regulators, esiRNAs and gene expression profiles. (B) The miRNA regulator indicates the miRNA decoys mechanisms between TPG and its cognate. (C) Gene expression profiles of TPG and its cognate gene in various experimental conditions. (D) The diagram of esiRNA represents TPG-derived siRNAs as profiled by deep sequencing data. It displays the more fine-grained information of (E) esiRNA-target interaction and (F) RNA folding structure of TPG-derived esiRNA. In addition, pseudoMap also incorporates the external sources, such as (G) UCSC genome browser for a genomic view, GeneCards for gene annotation and miRBase for miRNA annotation.

In the search gateway, the TPG ID, Ensembl ID, TPG symbol and parental gene symbol are allowed for further analysis. Figure 4 displays the interface of output results with the search a particular TPG ID/Ensembl ID/TPG symbol/parental gene symbol. The interface contains the general information of TPG, miRNA regulators, gene expression profiles and TPG-derived esiRNAs.

Figure 4.

Figure 4

Search interface of pseudoMap.

Construction and content

In pseudoMap, various databases are integrated and maintained with MySQL (http://www.mysql.com/) relational database management system. While operating on an Apache HTTP server (http://www.apache.org/) and PHP (http://www.php.net/) on a Linux operation system (http://www.linux.com), pseudoMap was constructed using the Smarty template engine (http://www.smarty.net). Based on PHP, JavaScript (http://www.javascriptsource.com/), CSS (http://www.w3schools.com/css/) and HTML (http://www.w3schools.com/html/) languages, the web interface enables dynamic MySQL queries with user-friendly graphics. Above software are open source technologies.

Discussion and conclusions

Comparison with other previous databases related to pseudogenes

A few databases have been constructed to explore pseudogenes. In particular, PseudoGene database (53) identifies pseudogenes using various computational methods in genomes; HOPPSIGEN (54) represents the homologous processed pseudogenes shared between the mouse and human genomes that contains location information and potential function; as a web-based system, PseudoGeneQuest (55) identifies novel human pseudogenes based on a user-provided protein sequence; in addition, the University of Iowa’s UI Pseudogenes website contains human pseudogenes and the candidates for gene conversion (56). However, these databases focus on automatic detection of pseudogenes by using a variety of homology-based approaches. Our database, pseudoMap, aims at providing comprehensive resource for genome-wide identifying the functions and regulators of human TPGs. In briefly, there are three major differentiating features from currently public databases of pseudogenes. First, pseudoMap elucidates the relationships of TPG and its cognate gene with miRNA decoys mechanism. Second, to explore the interaction of TPG and its parental gene, pseudoMap provides the gene expression profiles of TPG and its cognate gene in various experimental conditions. Third, pseudoMap curates the TPG-derived esiRNAs, which supported by deep sequencing data, as well as their interacting gene targets in the human genome. Table 3 lists the detailed comparisons of pseudoMap with other previous databases related to pseudogenes.

Table 3.

Comparisons of pseudoMap with currently public databases of pseudogenes

Supported features pseudoMap (our database) PseudoGene database UI pseudogene Hoppsigen
Web interface http://pseudomap.mbc.nctu.edu.tw/ http://www.pseudogene.org/ https://genome.uiowa.edu/pseudogenes/ http://pbil.univ-lyon1.fr/databases/hoppsigen.html
Description pseudoMap provides a comprehensive resource for genome-wide identifying the functions and regulators of human pseudogenes. This site contains a comprehensive database of identified pseudogenes, utilities used to find pseudogenes, various publication data sets and a pseudogene knowledgebase. This site serves as a repository for all pseudogenes in the human genome and provides a ranked list of human pseudogenes that have been identified as candidates for gene conversion. Hoppsigen is a nucleic database of homologous processed pseudogenes.
Species supported Human Eukaryote and prokaryote Human Human
Sequence download Yes Yes Yes Yes
Pseudogene information Yes Yes Yes Yes
Parental gene information Yes Yes
Knowledge of pseudogenes Yes Yes Yes
miRNA–pseudogene interactions Yes
miRNA–parental gene interactions Yes
Gene expression profiles Yes (both pseudogene and its parental gene)
Pseudogene-derived siRNAs Yes
Deep sequencing data for profiling TPG-derived siRNAs Yes

Applications

PseudoMap provides two major applications. One is the non-coding RNA products of TPGs, as like animal models, that may generate esiRNAs to regulate protein-coding genes in humans. In this process, pseudoMap supplies next-generation sequencing data from sRNA libraries to support the candidates of TPG-derived esiRNAs and gene expression profiles to verify the target interactions, respectively. Another application is that both the gene and pseudogene contain miRNA target sites, if the pseudogene competes for the freely available repressor molecules that would be free the gene to reduce the miRNA-mediated repression. Another words, the pseudogene may act as a ‘miRNA decoy’ to release the repression of its cognate gene. pseudoMap provides another insight into the pathway of MTIs with TPG-mediated mechanism.

Conclusion

In this study, we performed a computational pipeline to identify TPG-derived esiRNAs-target interactions and constructed a comprehensive database to represent the potential functions and regulators of TPGs in human. To our knowledge, the pseudoMap is the first database to identify TPGs to enable biologists and bioinformaticians to elucidate two major discoveries, the relationships between TPG and its cognate gene with miRNA decoyed mechanisms and TPG-derived eSTIs. Efforts are underway in our laboratory to expand the methods used in pseudoMap to other species such as mice, fruit flies and plants. The pseudoMap will be updated frequently by continuingly surveying experimentally validated sRNAs and will be maintained with a long-term support from National Chiao Tung University and National Science Council at Taiwan. This novel and creative resource is now freely available at http://pseudomap.mbc.nctu.edu.tw/.

Supplementary data

Supplementary Data are available at Database online.

Funding

National Science Council of the Republic of China [NSC 99-2320-B-037-006-MY3 to J.G.C., NSC 98-2314-B039-010MY3 to W.K.Y., NSC 98-2311-B-009-004-MY3 to H.D.H., NSC 99-2627-B-009-003 to H.D.H., NSC 101-2311-B-009-003-MY3 to H.D.H. and NSC 100-2627-B-009-002 to H.D.H.]; UST-UCSD International Center of Excellence in Advanced Bio-engineering sponsored by the Taiwan National Science Council I-RiCE Program [NSC 101-2911-I-009 -101 - to H.D.H., in part]; Veterans General Hospitals and University System of Taiwan (VGHUST) Joint Research Program [VGHUST101-G5-1-1 to H.D.H., in part]; MOE ATU [in part]. Funding for open access charge: National Science Council of the Republic of China.

Conflict of interest statement. None declared.

Supplementary Material

Supplementary Data

References

  • 1.Mighell AJ, Smith NR, Robinson PA, et al. Vertebrate pseudogenes. FEBS Lett. 2000;468:109–114. doi: 10.1016/s0014-5793(00)01199-6. [DOI] [PubMed] [Google Scholar]
  • 2.Torrents D, Suyama M, Zdobnov E, et al. A genome-wide survey of human pseudogenes. Genome Res. 2003;13:2559–2567. doi: 10.1101/gr.1455503. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Balasubramanian S, Zheng D, Liu YJ, et al. Comparative analysis of processed ribosomal protein pseudogenes in four mammalian genomes. Genome Biol. 2009;10:R2. doi: 10.1186/gb-2009-10-1-r2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Harrison P, Yu Z. Frame disruptions in human mRNA transcripts, and their relationship with splicing and protein structures. BMC Genomics. 2007;8:371. doi: 10.1186/1471-2164-8-371. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Harrison PM, Zheng D, Zhang Z, et al. Transcribed processed pseudogenes in the human genome: an intermediate form of expressed retrosequence lacking protein-coding ability. Nucleic Acids Res. 2005;33:2374–2383. doi: 10.1093/nar/gki531. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Imanishi T, Itoh T, Suzuki Y, et al. Integrative annotation of 21,037 human genes validated by full-length cDNA clones. PLoS Biol. 2004;2:e162. doi: 10.1371/journal.pbio.0020162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Khachane AN, Harrison PM. Assessing the genomic evidence for conserved transcribed pseudogenes under selection. BMC Genomics. 2009;10:435. doi: 10.1186/1471-2164-10-435. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Vinckenbosch N, Dupanloup I, Kaessmann H. Evolutionary fate of retroposed gene copies in the human genome. Proc. Natl Acad. Sci. USA. 2006;103:3220–3225. doi: 10.1073/pnas.0511307103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Zheng D, Zhang Z, Harrison PM, et al. Integrated pseudogene annotation for human chromosome 22: evidence for transcription. J. Mol. Biol. 2005;349:27–45. doi: 10.1016/j.jmb.2005.02.072. [DOI] [PubMed] [Google Scholar]
  • 10.Zheng D, Frankish A, Baertsch R, et al. Pseudogenes in the ENCODE regions: consensus annotation, analysis of transcription, and evolution. Genome Res. 2007;17:839–851. doi: 10.1101/gr.5586307. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.McCarrey JR, Riggs AD. Determinator-inhibitor pairs as a mechanism for threshold setting in development: a possible function for pseudogenes. Proc. Natl Acad. Sci. USA. 1986;83:679–683. doi: 10.1073/pnas.83.3.679. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Zou C, Lehti-Shiu MD, Thibaud-Nissen F, et al. Evolutionary and expression signatures of pseudogenes in Arabidopsis and rice. Plant Physiol. 2009;151:3–15. doi: 10.1104/pp.109.140632. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Korneev S, O'Shea M. Evolution of nitric oxide synthase regulatory genes by DNA inversion. Mol. Biol. Evol. 2002;19:1228–1233. doi: 10.1093/oxfordjournals.molbev.a004183. [DOI] [PubMed] [Google Scholar]
  • 14.Korneev SA, Park JH, O'Shea M. Neuronal expression of neural nitric oxide synthase (nNOS) protein is suppressed by an antisense RNA transcribed from an NOS pseudogene. J. Neurosci. 1999;19:7711–7720. doi: 10.1523/JNEUROSCI.19-18-07711.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Hirotsune S, Yoshida N, Chen A, et al. An expressed pseudogene regulates the messenger-RNA stability of its homologous coding gene. Nature. 2003;423:91–96. doi: 10.1038/nature01535. [DOI] [PubMed] [Google Scholar]
  • 16.Gray TA, Wilson A, Fortin PJ, et al. The putatively functional Mkrn1-p1 pseudogene is neither expressed nor imprinted, nor does it regulate its source gene in trans. Proc. Natl Acad. Sci. USA. 2006;103:12039–12044. doi: 10.1073/pnas.0602216103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Poliseno L, Salmena L, Zhang J, et al. A coding-independent function of gene and pseudogene mRNAs regulates tumour biology. Nature. 2010;465:1033–1038. doi: 10.1038/nature09144. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Han YJ, Ma SF, Yourek G, et al. A transcribed pseudogene of MYLK promotes cell proliferation. FASEB J. 2011;25:2305–2312. doi: 10.1096/fj.10-177808. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Kim DH, Rossi JJ. Strategies for silencing human disease using RNA interference. Nat. Rev. Genet. 2007;8:173–184. doi: 10.1038/nrg2006. [DOI] [PubMed] [Google Scholar]
  • 20.Czech B, Malone CD, Zhou R, et al. An endogenous small interfering RNA pathway in Drosophila. Nature. 2008;453:798–802. doi: 10.1038/nature07007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Ghildiyal M, Seitz H, Horwich MD, et al. Endogenous siRNAs derived from transposons and mRNAs in Drosophila somatic cells. Science. 2008;320:1077–1081. doi: 10.1126/science.1157396. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Kawamura Y, Saito K, Kin T, et al. Drosophila endogenous small RNAs bind to Argonaute 2 in somatic cells. Nature. 2008;453:793–797. doi: 10.1038/nature06938. [DOI] [PubMed] [Google Scholar]
  • 23.Okamura K, Chung WJ, Ruby JG, et al. The Drosophila hairpin RNA pathway generates endogenous short interfering RNAs. Nature. 2008;453:803–806. doi: 10.1038/nature07015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Tam OH, Aravin AA, Stein P, et al. Pseudogene-derived small interfering RNAs regulate gene expression in mouse oocytes. Nature. 2008;453:534–538. doi: 10.1038/nature06904. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Watanabe T, Totoki Y, Toyoda A, et al. Endogenous siRNAs from naturally formed dsRNAs regulate transcripts in mouse oocytes. Nature. 2008;453:539–543. doi: 10.1038/nature06908. [DOI] [PubMed] [Google Scholar]
  • 26.Flicek P, Amode MR, Barrell D, et al. Ensembl 2012. Nucleic Acids Res. 2012;40:D84–D90. doi: 10.1093/nar/gkr991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Mituyama T, Yamada K, Hattori E, et al. The Functional RNA Database 3.0: databases to support mining and annotation of functional RNAs. Nucleic Acids Res. 2009;37:D89–D92. doi: 10.1093/nar/gkn805. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Carninci P, Kasukawa T, Katayama S, et al. The transcriptional landscape of the mammalian genome. Science. 2005;309:1559–1563. doi: 10.1126/science.1112014. [DOI] [PubMed] [Google Scholar]
  • 29.Kozomara A, Griffiths-Jones S. miRBase: integrating microRNA annotation and deep-sequencing data. Nucleic Acids Res. 2011;39:D152–D157. doi: 10.1093/nar/gkq1027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Griffiths-Jones S, Saini HK, van Dongen S, et al. miRBase: tools for microRNA genomics. Nucleic Acids Res. 2008;36:D154–D158. doi: 10.1093/nar/gkm952. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.He S, Liu C, Skogerbo G, et al. NONCODE v2.0: decoding the non-coding. Nucleic Acids Res. 2008;36:D170–D172. doi: 10.1093/nar/gkm1011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Griffiths-Jones S, Moxon S, Marshall M, et al. Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Res. 2005;33:D121–D124. doi: 10.1093/nar/gki081. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Pang KC, Stephen S, Dinger ME, et al. RNAdb 2.0—an expanded database of mammalian non-coding RNAs. Nucleic Acids Res. 2007;35:D178–D182. doi: 10.1093/nar/gkl926. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Lestrade L, Weber MJ. snoRNA-LBME-db, a comprehensive database of human H/ACA and C/D box snoRNAs. Nucleic Acids Res. 2006;34:D158–D162. doi: 10.1093/nar/gkj002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Morin RD, O'Connor MD, Griffith M, et al. Application of massively parallel sequencing to microRNA profiling and discovery in human embryonic stem cells. Genome Res. 2008;18:610–621. doi: 10.1101/gr.7179508. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Seila AC, Calabrese JM, Levine SS, et al. Divergent transcription from active promoters. Science. 2008;322:1849–1851. doi: 10.1126/science.1162253. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Yeo GW, Xu X, Liang TY, et al. Alternative splicing events identified in human embryonic stem cells and neural progenitors. PLoS Comput. Biol. 2007;3:1951–1967. doi: 10.1371/journal.pcbi.0030196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Hou J, Lin L, Zhou W, et al. Identification of miRNomes in human liver and hepatocellular carcinoma reveals miR-199a/b-3p as therapeutic target for hepatocellular carcinoma. Cancer Cell. 2011;19:232–243. doi: 10.1016/j.ccr.2011.01.001. [DOI] [PubMed] [Google Scholar]
  • 39.Kent WJ, Sugnet CW, Furey TS, et al. The human genome browser at UCSC. Genome Res. 2002;12:996–1006. doi: 10.1101/gr.229102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Zuker M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res. 2003;31:3406–3415. doi: 10.1093/nar/gkg595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Hsu SD, Chu CH, Tsou AP, et al. miRNAMap 2.0: genomic maps of microRNAs in metazoan genomes. Nucleic Acids Res. 2008;36:D165–D169. doi: 10.1093/nar/gkm1012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Friedman RC, Farh KK, Burge CB, et al. Most mammalian mRNAs are conserved targets of microRNAs. Genome Res. 2009;19:92–105. doi: 10.1101/gr.082701.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Grimson A, Farh KK, Johnston WK, et al. MicroRNA targeting specificity in mammals: determinants beyond seed pairing. Mol. Cell. 2007;27:91–105. doi: 10.1016/j.molcel.2007.06.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Lewis BP, Burge CB, Bartel DP. Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets. Cell. 2005;120:15–20. doi: 10.1016/j.cell.2004.12.035. [DOI] [PubMed] [Google Scholar]
  • 45.John B, Enright AJ, Aravin A, et al. Human MicroRNA targets. PLoS Biol. 2004;2:e363. doi: 10.1371/journal.pbio.0020363. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Kruger J, Rehmsmeier M. RNAhybrid: microRNA target prediction easy, fast and flexible. Nucleic Acids Res. 2006;34:W451–W454. doi: 10.1093/nar/gkl243. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Barrett T, Edgar R. Gene expression omnibus: microarray data storage, submission, retrieval, and analysis. Methods Enzymol. 2006;411:352–369. doi: 10.1016/S0076-6879(06)11019-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Su AI, Wiltshire T, Batalov S, et al. A gene atlas of the mouse and human protein-encoding transcriptomes. Proc. Natl Acad. Sci. USA. 2004;101:6062–6067. doi: 10.1073/pnas.0400782101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Roth RB, Hevezi P, Lee J, et al. Gene expression analyses reveal molecular relationships among 20 regions of the human CNS. Neurogenetics. 2006;7:67–80. doi: 10.1007/s10048-006-0032-6. [DOI] [PubMed] [Google Scholar]
  • 50.Yu K, Ganesan K, Tan LK, et al. A precisely regulated gene expression cassette potently modulates metastasis and survival in multiple solid cancers. PLoS Genet. 2008;4:e1000129. doi: 10.1371/journal.pgen.1000129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Altschul SF, Gish W, Miller W, et al. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
  • 52.Stelzer G, Dalah I, Stein TI, et al. In-silico human genomics with GeneCards. Hum. Genomics. 2011;5:709–717. doi: 10.1186/1479-7364-5-6-709. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Karro JE, Yan Y, Zheng D, et al. Pseudogene.org: a comprehensive database and comparison platform for pseudogene annotation. Nucleic Acids Res. 2007;35:D55–D60. doi: 10.1093/nar/gkl851. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Khelifi A, Duret L, Mouchiroud D. HOPPSIGEN: a database of human and mouse processed pseudogenes. Nucleic Acids Res. 2005;33:D59–D66. doi: 10.1093/nar/gki084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Ortutay C, Vihinen M. PseudoGeneQuest—service for identification of different pseudogene types in the human genome. BMC Bioinformatics. 2008;9:299. doi: 10.1186/1471-2105-9-299. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Bischof JM, Chiang AP, Scheetz TE, et al. Genome-wide identification of pseudogenes capable of disease-causing gene conversion. Hum. Mutat. 2006;27:545–552. doi: 10.1002/humu.20335. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

Articles from Database: The Journal of Biological Databases and Curation are provided here courtesy of Oxford University Press

RESOURCES