Complex organisation and structure of the ghrelin antisense strand gene GHRLOS, a candidate non-coding RNA gene
© Seim et al; licensee BioMed Central Ltd. 2008
Received: 06 June 2008
Accepted: 28 October 2008
Published: 28 October 2008
The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS.
We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed.
GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading frames. The gene is also non-conserved, with differential and tissue-restricted expression. The overlapping genomic arrangement of GHRLOS with the ghrelin gene indicates that it is likely to have interesting regulatory and functional roles in the ghrelin axis.
Ghrelin, a hormone with many physiological and pathophysiological roles, was initially described as the endogenous ligand for the growth hormone secretagogue receptor (GHSR 1a), through which it stimulates the release of growth hormone from the anterior pituitary . Ghrelin is primarily produced in the stomach and plays a key role in regulating appetite, gut motility and energy balance [2–6]. Ghrelin is also an autocrine factor in a number of tissues, as it regulates insulin release and has therapeutic potential for inflammatory diseases, heart disease, cancer cachexia, diabetes mellitus and obesity . Despite the importance of ghrelin in a range of physiological systems and pathophysiological conditions, little is known about the regulation of ghrelin synthesis and secretion. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL) . However, the genomic structure, expression pattern and potential function of GHRLOS remains to be investigated. It is not known whether GHRLOS RNA species with open reading frames exist, or whether GHRLOS is a non-coding RNA gene.
There is strong support for the hypothesis that antisense transcripts provide a widespread and important mechanism for the regulation of the human genome [9, 10]. Our understanding of the genome is currently undergoing a paradigm shift, as a previously hidden and complex layer of antisense and non-coding RNAs is emerging, which controls gene transcription and translation through a diverse range of mechanisms . Much phenotypic diversity between humans and other species is likely to be due to regulation by RNA .
In this study, we examined the genomic structure and organisation of GHRLOS. We have found that GHRLOS spans approximately 44 kb of genomic DNA and transcribes long, 5' capped, polyadenlyated RNA species that are extensively spliced and differentially expressed. High levels of GHRLOS expression occur in the emerging non-coding RNA tissues, the brain, testis and thymus. We have also examined GHRLOS RNA species in silico, revealing that GHRLOS is a candidate non-coding RNA gene. These data provide a strong basis for further functional studies to determine whether GHRLOS plays a role in the regulation of ghrelin gene expression.
Characterisation of GHRLOS start sites and alternative splicing
In order to determine the polyadenylation site(s) of GHRLOS transcripts, 3' RACE and inverse PCR were conducted using normal stomach and prostate tissues, the RWPE-1 prostate cell line and the PC3 prostate cancer cell line (which is derived from prostate cancer cells that have metastasised to the bone). We obtained 3' RACE clones from the PC3 prostate cancer cell line, which demonstrated a 1.4 kb exon 4a and a 1.1 kb exon 4b, with the putative polyadenylation signal AAATTA [GenBank:EU789531, and EU789532] (Fig. 1). The 3' RACE data matches a number of Expressed Sequence Tags (ESTs) from subchondral bone [GenBank:BM991802], foetal liver [GenBank:AI056187], pooled liver and spleen [GenBank:BX118778; R01027], Wilms' tumour (a paediatric cancer of the kidney) [GenBank:AI174574], pooled brain cancers [GenBank:AI863058] and high grade serous carcinoma of the uterus [GenBank:AI289483]. Moreover, we performed uncapped inverse PCR, and sequencing of amplicons from the PC3 cell line revealed a 90 bp exon I and exon 4 sequence with a 3' end differing by less than 20 base pairs when compared to the 3' RACE and reported ESTs [GenBank:EU789533]. Interestingly, the putative 3' end of exon 4 corresponds to publicly available Affymetrix Human Exon 1.0 ST array data (Exon Cluster ID 183613). Furthermore, all of the amplicons that we obtained were followed by a stretch of adenosines at the 3' ends that are not present in the genomic sequence. We, therefore, concluded that we had reached a genuine 3' end of GHRLOS.
Complex pattern of GHRLOS variant exon expression
GHRLOS exons are highly polymorphic
Extending the known length of GHRLOS
Northern blotting demonstrated GHRLOS transcripts 1–5.5 kb in size, however, further RT-PCR and 3' RACE analysis with a number of different primer combinations failed to reveal GHRLOS transcripts larger than 2 kb (data not shown). For this reason, we hypothesised that the ghrelin antisense gene may extend further upstream. Capped RLM-RACE of stomach tissue and uncapped inverse PCR of the PC3 prostate cancer cell line (Fig. 1) indicated that a GHRLOS promoter (exon I) was present at the very end of the 2 kb 3' untranslated exon 8 of TATDN2. Moreover, we discovered that multiple Cap Analysis of Gene Expression (CAGE) tags are clustered approximately 1.5 kb upstream of the GHRLOS transcription start sites found via 5' RLM-RACE. CAGE tags are an average of 20–21 nucleotides and are produced by large-scale sequencing of concatemers derived from the 5' ends of capped mRNA [16, 17]. The CAGE method, therefore, detects the most 5' site of the mRNA transcripts (the transcription start site) and gives an unbiased and comprehensive picture of the positions and usage of transcription start sites . To confirm if this region belongs to GHRLOS, we employed nested RT-PCR using thymus tissue, foetal brain tissue and Hep G2 hepatocarcinoma cell line cDNA (with RNA reverse transcribed using oligo(dT) primers). The forward primers were present immediately downstream of the CAGE tag cluster (which is 1.5 kb upstream of the 51 bp exon I of GHRLOS) and the reverse primers spanned exon 1 (which is common to all known GHRLOS variants). Sequence analysis revealed several novel exon I variants, which were approximately 1601 bp, 994 bp and 526 bp in length. All of these variants spliced into the expected acceptor site of the 106 bp exon 1 (Fig. 1 and [Additional File 4]). The end of the 1601 bp exon I (termed exon Ib) spanned the 51 bp exon I (exon Ia) identified via capped RLM-RACE of the stomach, while the other exons (Ic-d) employed different upstream donor sites, suggesting that they are alternative first exons. The data demonstrate that there are at least two first GHRLOS exon regions in the untranslated region of TATDN2. This suggests that the GHRLOS promoter is a broad-type, TATA-less promoter that initiates transcription at many sites .
We have identified several novel GHRLOS transcripts. This includes overlapping GHRLOS transcripts initiating in the TATDN2 3' UTR and putative transcription-induced chimaeras of TATDN2 and GHRLOS. These findings extend the previously reported length of GHRLOS by ~37 kb. We propose that GHRLOS harbour several promoters with start sites in exon 1 (of GHRLOS), and in the 3' UTR and the first exon of TATDN2.
Results of in silico analysis indicate that GHRLOS is a non-coding RNA gene
Our study suggests that GHRLOS is a non-coding RNA. In silico translation revealed that the heterogeneous GHRLOS RNAs contain multiple stop codons, resulting in lack of extensive reading frames, and the putative ORFs do not span conserved regions (data not shown). Moreover, no significant sequence similarity to any known proteins was observed (data not shown). Finally, screening of GHRLOS sequence against a reference collection of repeats (RepbasE) using CENSOR  identified a 203 bp overlap of exon 4 with the extinct 224 bp MIR3 SINE element, which is present in all vertebrates  (Fig. 5). Interestingly, the presence of repeat elements in exons of non-coding RNAs has been reported previously [24–27]
The GHRLOS terminal exon 4 and a putative SEC13 exon overlap in an antisense manner
GHRLOS is expressed in many tissues and cell lines and the level of expression shows great variability
The level of total GHRLOS transcript expression varied greatly in different human tissues, with high levels in the thymus, testis, foetal brain, uterus, cerebellum, ovary, thyroid, and whole brain. Very low levels of total GHRLOS RNA expression were detected in the stomach, foetal liver and pancreas (Fig. 7B). In the thymus the level of expression was approximately 133 fold higher than in the stomach (P < 0.01) and 110-fold higher than in the foetal liver (P < 0.01). Furthermore, the level of expression in the adult liver was 9-fold higher than in the foetal liver (P < 0.05), indicating differential expression according to developmental stage in this tissue. The level of GHRLOS in the foetal brain was two-fold higher than the adult brain, but this was not statistically significant (P > 0.05).
Expression of GHRLOS in a number of continuous cell lines was also examined using real-time RT-PCR (Fig. 7C). Levels of GHRLOS expression in the U-937 (Non-Hodgkin's lymphoma) and SW1353 (chondrosarcoma) cell lines were similar to the cerebellum, uterus, foetal brain, testis and thymus. High levels of expression were observed in the HEK293 human embryonic kidney cell line and in the CaCo-2 (colon adenocarcinoma), SAOS-2 (osteosarcoma), U-87 MG and U251-MG (glioblastoma), OVCAR-3 (ovarian adenocarcinoma) and Hep G2 (hepatocarcinoma) cancer cell lines. It is interesting to note that multiple Hep G2 CAGE tags have been identified in the exon I region of GHRLOS (see [Additional File 4]). GHRLOS RNA appears to be significantly upregulated in the Hep G2 cell line compared to adult and foetal liver (P < 0.05) (Fig. 7B and 7C). This observation could indicate that GHRLOS may be a specific target in the development of liver and other cancers.
Comparison of total GHRLOS and total GHRL expression
First reported in 1987  and originally thought to be a rarity, it has recently been established that promoters in 3' untranslated regions (3' UTRs) are not uncommon and may be independently transcribed and regulated from their upstream "host gene" . The ENCODE (ENCyclopedia Of DNA Elements) consortium has recently demonstrated that two-thirds of the loci in their dataset contain new putative first exons, which frequently overlap upstream genes . However, it is currently not known how promoters that overlap 3' UTRs are regulated and coordinated . We have demonstrated GHRLOS transcription start sites in the 3' UTR of TATDN2 via RLM-RACE. The sequence upstream of exon I of GHRLOS contains no apparent TATA boxes (data not shown), indicating that GHRLOS has a broad type promoter, with many potential transcription start sites in the 2.1 kb 3' UTR of TATDN2. This may allow the transcription of numerous tissue-specific and developmental stage-specific transcripts . In addition, multiple CAGE tags are present in the 3' UTR of TATDN2, indicating that GHRLOS transcripts initiate in this region.
Interestingly, we also report the joining of exons of the neighbouring genes TATDN2 and GHRLOS. Similar chimaeric transcripts (not caused by chromosomal translocation) have been reported in lower eukaryotes [38, 39], but were until recently assumed to be relatively rare in mammals [20, 40–43]. It is not known how chimaeric transcripts arise, but transcriptional read-through, followed by canonical cis-splicing is the most likely mechanism . Alternatively, chimaeric transcripts could arise through trans- splicing, but the existence of this mechanism has not been well-established in mammals . We, therefore, suggest that the chimaeric TATDN2-GHRLOS transcripts probably result from a single pre-mRNA, creating transcription-induced chimaeras (TICs) which are spliced in cis . The TATDN2-GHRLOS TIC which was identified in the normal thymus harbours premature stop codons (PTCs) and this is likely to result in nonsense-mediated decay (NMD). NMD is an RNA surveillance mechanism (present in eukaryotes) which targets mRNA with premature stop codons that, if translated, encode truncated proteins with dominant-negative or deleterious gain-of function activities . Whilst we are not aware of any studies of the translated product of TATDN2 (TatD DNase domain containing 2), TatD is a cytoplasmic protein that harbours magnesium-dependent DNase activity in E. coli . Proteins with a TatD domain, therefore, belong to the large superfamily of metalloenzymes. This suggests that the chimaeric transcripts may play a role in auto-regulation of the TATDN2 protein product, which we hypothesise is likely to be a DNase associated with apoptosis. We speculate that TATDN2-GHRLOS chimaeras may be used to tightly regulate the programmed cell death of T-cells by downregulating the translation of the putative DNase TATDN2. Hillman and colleagues have termed this system of auto-regulation, where alternative splicing results in mRNA isoforms that are not translated, regulated unproductive splicing and translation (RUST) . Interestingly, it has very recently been reported that transcripts of the tumour suppressor protein E-cadherin that result in non-sense mediated decay are upregulated in gastric cancer, suggesting that NMD may also promote disease .
We previously reported that GHRLOS completely overlaps the ghrelin (GHRL) gene . Here we also show that the 3' terminal exon 4 of GHRLOS is present on the opposite (antisense) strand to a novel, 3' terminal SEC13 exon (Fig. 8). SEC13 is a protein that forms a part of the coat protein complex II (COPII) . COPII proteins are required for the trafficking of nascent proteins from the endoplasmic reticulum (ER) to the Golgi apparatus. It also plays a role in the selection and concentration of cargo proteins for transport . SEC13 , therefore, has a core endocrine function. GHRLOS overlaps a novel SEC13 variant, SEC13-tentative (SEC13-T). SEC13-T may encode a 372 amino acid protein with a unique C-terminus. Future studies are required to confirm the presence of full-length transcripts and to elucidate the functional role of SEC13-T protein. We suggest that transcription of SEC13-T may have implications for COPII function in health and disease states. As GHRLOS overlaps the terminal exons of both GHRL and SEC13-T, the GHRLOS natural antisense transcripts may regulate one or both of these genes.
GHRLOS, a candidate non-coding RNA
Non-coding RNAs are frequently not conserved between species, suggesting that they are either biological noise (non-functional transcription), or that they have species specific-functions. Species-specific non-coding transcripts have been observed and there is strong evidence that non-coding transcripts are functionally significant [48–52]. Interestingly it has recently been observed, using in silico analysis, that even between closely related Drosophila species non-coding RNAs are not conserved . While the number of protein coding genes in distant eukaryotes (such as worms, mice, and humans) is approximately equal, the relative amount of non-coding DNA increases in proportion to eukaryotic complexity [54, 55]. Mattick and colleagues explain this paradox by hypothesising that non-coding RNAs have evolved to enable the emergence of organisms with increasingly complex higher levels functions [55, 56]. Our in silico analysis suggests that GHRLOS exons show very low sequence conservation in vertebrate species. Furthermore, exon 4 of GHRLOS contains a transposable element, a feature observed in many non-coding RNAs [25–27].
Our bioinformatic studies indicate that GHRLOS does not encode a protein and, therefore, is a non-coding RNA. Taking into account the full-length sequence of GHRLOS, the length of the open reading frames is very short. Moreover, GHRLOS RNAs are numerous (due to extensive splicing), contain a large number of stop codons and there is little nucleotide and putative amino acid conservation between vertebrates, making it less likely that this gene encodes proteins. Although it has been suggested that many small peptides may be translated , the majority of small peptides are processed from larger precursor proteins, as is ghrelin itself which is processed from preproghrelin. Therefore, while it cannot be excluded that GHRLOS encodes short ORFs (that are not conserved in the mouse) it appears unlikely that GHRLOS encodes biologically active peptides.
We examined the GHRLOS expression profile to strengthen the hypothesis that it is a candidate non-coding RNA. It has been demonstrated that mammalian long non-coding RNAs are expressed in a tissue-specific manner, indicating that they are biologically significant [50, 58–61]. In both humans and mice, the major tissues of non-coding expression are the complex organs; the brain, testis, and thymus [50, 61]. This also holds true in Drosophila, where the majority of the candidate non-coding RNAs are expressed in the central nervous system . Indeed, non-coding RNAs are emerging as important regulators of complex systems, such as the central nervous system (brain) and intricate processes, including spermatogenesis in the testis [62–65]. We demonstrated high levels of GHRLOS in the thymus, brain, testis, uterus, ovary and thymus, while the expression levels in the stomach, where GHRL is highly expressed , were almost undetectable. Our data demonstrate that GHRLOS is predominantly expressed in a limited number of tissues and cell types, suggesting that these transcripts have physiological functions in distinct cell types and tissues. Moreover, real-time RT-PCR showed extremely low levels of GHRLOS in the foetal and adult liver and high levels in the Hep G2 hepatocarcinoma cell line, suggesting that GHRLOS expression may be altered in liver cancer. Indeed, it has recently been reported that non-coding RNA expression is frequently altered in cancer [58, 66]. This indicates that non-coding RNAs may have specific functions in normal cells. Much like protein-coding transcripts, ncRNAs may act as tumour suppressors, or be upregulated in cancer and act as oncogenes. An examination of GHRLOS expression in cancer would, therefore, be of great interest.
Here we have characterised the structure and organisation of GHRLOS, a ghrelin antisense gene, suggesting that GHRLOS has multiple first exons. Therefore, it is possible that GHRLOS could be a part of very large, continuous ncRNA species in the 3p25 chromosomal region and beyond. In the absence of hallmarks, such as large open reading frames, mapping complex non-coding RNA genes remains a complex task. For example, two RNAs in the FMR1 locus, the cis-NAT ASFMR1  and the non-coding RNA gene FMR4 (found just upstream of FMR1 ) may be one continuous RNA isoform .
What is the function of GHRLOS?
It is currently difficult and costly to determine the mechanism of function of long non-coding RNAs. The roles of non-coding RNAs are likely to be diverse, and there is strong evidence that they play a role in regulating important pathways. Many ncRNAs are expressed during development, neural differentiation, during macrophage activation and in cancer, indicating that they have key functions in these processes [12, 63, 68]. Non-coding RNAs have been found to play a role in the silencing of overlapping genes in cis , in the silencing of distant chromosome regions in trans , in nuclear trafficking , apoptosis [51, 72], promoter repression , and can act as tumour suppressors . Moreover, ncRNAs are emerging as markers for complex human disease, including lung cancer , heart disease , and a range of other pathologies [68, 77]. GHRLOS may also serve as a host gene for snoRNA (small nucleolar RNAs) genes  or GHRLOS RNA transcripts may be precursors for short RNAs, such as micoRNAs , endogenous siRNAs [79, 80], piRNAs  and other novel, short non-coding RNA species [82, 83].
Although the understanding of natural antisense transcripts (NATs) remains in its infancy, they have been associated with a range of regulatory mechanisms that are not necessarily mutually exclusive. This includes transcriptional interference, RNA masking and dsRNA mediated gene-silencing via direct interaction between the sense and antisense transcripts [9, 84, 85].
While it is difficult to predict GHRLOS function, the fact that all spliced GHRLOS variants share exon 1, which overlaps the 3' untranslated exon 4 of GHRL is striking. Our findings suggest that GHRLOS functions as a non-coding RNA. There are a few examples that suggest that antisense transcripts are important in the regulation of endocrine hormone receptors, including a thyroid hormone receptor [86, 87], and the luteinising hormone/choriogonadotropin receptor  gene and in the regulation of growth factors . Interestingly, it has been recently suggested that the invertebrate (insect) polypeptide hormone allatostatin may be regulated by cis-NATs . To date, however the physiological and pathophysiological roles of natural sense/antisense pairs have not been elucidated for any vertebrate endocrine hormone. Further studies are necessary to reveal the function and molecular mechanisms regulating the candidate non-coding RNA gene GHRLOS.
In the present study, we have characterised GHRLOS, which gives rise to endogenous ghrelin natural antisense transcripts. GHRLOS exhibits features which are common to many non-coding RNA genes, including extensive splicing, lack of significant and conserved open reading frames, differential expression and lack of conservation in vertebrates. Our data also reveal that GHRLOS contains multiple first exons and that it overlaps both GHRL and a novel SEC13 exon in the antisense direction, suggesting that GHRLOS may have a role in regulating these genes. Moreover, we report TATDN2-GHRLOS chimaeras that may function to regulate the translation of the putative DNase TATDN2. Additional studies are underway to elucidate the functions of GHRLOS and to investigate, in particular, its overlapping genomic arrangement with the ghrelin gene. These studies may provide a new, physiologically relevant model system for investigating the roles of antisense gene and non-coding RNA regulation and the mechanisms involved, as well as establishing whether GHRLOS RNAs may be useful markers for diagnosis and prognosis of complex disease.
Multiple sequence alignments were generated using the MUltiple sequence Local AligNment and conservation visualization tool (Mulan) . Human [UCSC hg18], chicken [UCSC galGal3], frog [UCSC xenTro2], opossum [UCSC monDom4], mouse [UCSC mm9], rat [UCSC rn4], dog [UCSC canFam2], rhesus macaque [UCSC rheMac2], and chimpanzee [UCSC PanTro2] genomic sequences were obtained via the Evolutionary Conserved Regions (ECRs) Browser  and forwarded to Mulan to generate a full local alignment of the GHRLOS locus. GHRLOS was annotated based on the exons sequenced in this study.
To identify putative antisense exons and transcripts, we examined the publicly available Affymetrix Human Exon 1.0 ST Array tissue panel dataset consisting of 11 tissues (breast, cerebellum, heart, kidney, liver, muscle, pancreas, prostate, spleen, testes, and thyroid) using the Affymetrix All Exon track in the UCSC Genome Browser  and the Integrated Genome Browser (IGB) from Affymetrix .
To locate transcription start sites in the putative first exons of GHRLOS, CAGE (Cap Analysis of Gene Expression) tags (deposited by the RIKEN consortium and its collaborators) were obtained via the Genome Network Platform Viewer . We then recovered the RNA library information for each CAGE tag starting site. Briefly, each CAGE tag was individually queried against the 1.4 GB CAGE tag sequencing file, (release date 13.11.2006) available on the Genome Network Platform website using the UNIX grep command .
The exon-intron-structure of ESTs and mRNA entries identified from BLAST searches, as well as sequenced PCR amplicons obtained in this study, were analysed against the human genome (NCBI release 35) using GMAP . Presence of open reading frames was analysed by NCBI ORF Finder , Fickett's TestCode [98, 99] and ESTScan2 [100, 101]. The presence of transposable elements in GHRLOS sequence was examined using CENSOR v4.2.8 . Protein domain analysis was performed using the SMART database .
Cell culture and RNA extraction
The following cell lines (originally obtained from the American Type Culture Collection/ATCC, Rockville, MD unless specified) were cultured in their recommended media: Prostate and/or prostate cancer derived cell lines DU145 (ATCC HTB-81), RWPE-1 (ATCC CRL-11609), RWPE-2 (ATCC CRL-11610), LNCaP (ATCC CRL-1740), 22Rv1 (ATCC CRL-2505) and PC3 (ATCC CRL-1435), HEK293 human embryonic kidney (ATCC CRL-1573), SAOS-2 osteosarcoma (ATCC HTB-85), Hep G2 hepatocarcinoma (ATCC HB-8065), U-87 MG and U-251 MG glioblastoma (ATCC HTB-14 and JCRB Cell Bank # IFO50288, respectively), CaCo-2 colorectal adenocarcinoma (ATCC HTB-37), SW1353 chondrosarcoma (ATCC HTB-94), and OVCAR-3 ovarian cancer (ATCC HTB-161). All cells were grown in T80 or T175 flasks (Nagle Nunc International, Roskilde, Denmark) in 95% CO2 in a Sanyo incubator at 37°C. Total RNA was harvested from cultured cells at 70% confluence using TRIzol reagent (Invitrogen, Carlsbad, CA) according to the manufacturer's instructions.
Human RNA samples
Tissue Total RNA was obtained from the stomach, prostate (FirstChoice, Ambion, Austin, TX), foetal liver, adrenal gland, liver, trachea, salivary gland, spinal cord, skeletal muscle, lung, placenta, bone marrow, kidney, heart, whole brain, thyroid, cerebellum, uterus, foetal brain, testis, thymus (Human total RNA Master Panel II, Clontech, Mountain View, CA) and pancreas (Clontech).
5' and 3' RACE mapping of GHRLOS transcripts
Designations and sequences of oligonucleotides
For GHRLOS 3' RACE, human stomach and PC3 prostate cancer cell line total RNA was reverse transcribed using Transcriptor reverse transcriptase (Roche Applied Science Applied Science, Penzberg, Germany) and 10 μM adapter primer (3'-RACE-adapter, Table 1) from the FirstChoice RLM-RACE Kit (Ambion). 3' RACE was performed with 2 μl of this cDNA. Two 3' RACE reactions were performed – one combined an exon 4 GHRLOS-specific forward primer and an adapter-specific reverse primer (3'4F and 3'2OR, Table 1), and the other used an adapter-specific reverse primer and an exon 2-specific forward primer 3'2OR/F, Table 1). PCR products were then diluted and used in a secondary, nested PCR with a gene-specific forward and a reverse adapter primer (3'2IF/R, Table 1). PCR products were purified using a High Pure PCR purification kit (Roche Applied Science), cloned into pCR-XL-TOPO (Invitrogen) or pGEM-T Easy (Promega, Madison, WI), transformed into DH5α chemical competent cells (Invitrogen) and sequenced at the Australian Genome Research Facility (AGRF, Brisbane, Australia) using BigDye III (Applied Biosystems, AB, Foster City, CA).
Determination of GHRLOS transcription start and polyadenylation sites by Rolling Circle Amplification Rapid Amplification of cDNA Ends (RCA-RACE)
To simultaneously obtain the 5' and 3' ends of GHRLOS transcripts, we employed Rolling Circle Amplification-RACE (Rapid Amplification of cDNA Ends , an improved inverse PCR approach. Briefly, 3 μg stomach, prostate, RWPE-1 cell line and PC3 prostate cancer cell line total RNA were reverse transcribed using 10 U of Transcriptor reverse transcriptase (Roche Applied Science) and 100 μM HPLC-purified 5'-end phosphorylated oligo d(T)-adapter primer (Phospo-dT, Table 1) (Proligo, Boulder, CO) according to the manufacturer's instructions. The single-stranded cDNA was purified using a High Pure PCR purification kit (Roche Applied Science) and eluted in 50 μl elution buffer (10 mM Tris-HCl, pH 8.5). Next, 25 μl purified linear cDNA was circularised using 100 U of CircLigase (EPICENTRE Biotechnologies, Madison, WI) and purified as before. After self-ligation, 15 μl circular cDNA was added to a rolling circle amplification reaction with 10 U of φ 29 DNA polymerase (NEB) and 10 μM HPLC-purified random hexamer primers with two phosphothioate linkages on their 3'ends (Pthioate-hex, Table 1) (Proligo). Following a 21 h incubation at 30°C in a waterbath, RCA-products were subjected to two rounds of inverse PCR with 1 U of Platinum Taq HIFI polymerase (Invitrogen), as per manufacturer's instructions. PCRs were performed in a reaction volume of 50 μl using a PTC-200 thermocycler (MJ Research, Waltham, MA). For the first round amplification, the RCA reaction was diluted 1/100 in water, and 1 μl used in an outer PCR with a forward primer at the start of exon 4 and a reverse primer in exon I (IPCR-out-F and IPCR-ALL-R, Table 1). After 35 cycles at a 60°C annealing temperature, the outer PCR product was diluted 100 times in water and 1 μl was used in a hemi-nested PCR of 20 cycles, with annealing at 60°C (IPCR-in-F and IPCR-ALL-R, Table 1). Amplification products were eluted from agarose gels in 50 μl water overnight, reamplified, cloned into pCR-XL-TOPO (Invitrogen), transformed into One Shot MAX Efficiency DH5α-T1R chemically competent cells (Invitrogen) and sequenced at the Australian Genome Research Facility (AGRF, Brisbane, Australia).
Northern blot hybridisation
Initially, a cRNA probe spanning exon I, II, 1 and 2 of GHRLOS was employed. Briefly, a 5' RACE clone in pGEM-T Easy was linearised with SalI restriction enzyme and a cRNA probe was synthesised using T7 RNA polymerase and a digoxigenin (DIG) RNA labelling kit (Roche Applied Science). Probe concentration was estimated by dot blot comparison with digoxigenin-labelled standards. 500 ng stomach poly(A)+ RNA (FirstChoice, Ambion) was separated on a 1.2% formaldehyde gel and blotted, as described previously . Samples were electrophoresed with 50 ng RNA Molecular Weight Marker II (Roche Applied Science). The blot was hybridised to 50 ng/mL DIG-labelled cRNA probe overnight. Prehybridisation and hybridisation was performed with DIG-Easy Hyb (Roche Applied Science) at 65°C. The membranes were washed twice for 5 min at room temperature with 1 × Saline-Sodium Citrate (SSC), 0.1% sodium dodecyl sulfate (SDS) and then washed three times for 10 min at 65°C with 0.1 × SSC, 0.1% SDS. The membrane was then reacted with an alkaline phosphatase (AP)-conjugated anti-DIG antibody (Roche Applied Science). AP activity was detected using a chemiluminescence method using CDP-Star (Roche Applied Science).
A second cRNA probe, which spanned exon 1 (which is common to all known GHRLOS mRNA isoforms) was synthesised from 100 ng human stomach genomic DNA (BioChain, Hayward, CA) using the PCR method  (Ex1-cRNA-F/R, Table 1). The PCR product was purified using a High Pure PCR Product Purification Kit (Roche Applied Science) and the DIG-labelled cRNA probe synthesised and quantified as detailed above. A multi-tissue membrane containing poly(A)+ RNA from 12 human tissues (brain, duodenum, oesophagus, pancreas, PBL/leukocytes, prostate, salivary gland, testis, thymus, thyroid, urinary bladder and uterus) was purchased from OriGene (Rockville, MD). Prehybridisation and hybridisation were performed as described above, except that ULTRAhyb Ultrasensitive Hybridization Buffer (Ambion) was used instead of DIG-Easy Hyb (Roche Applied Science). Equivalent loading between tissues on the blot was determined by rehybridising with 20 ng/mL DIG-labelled β-actin cRNA probe (Roche Applied Science).
Isolation of alternatively spliced GHRLOS mRNAs via non-quantitative RT-PCR
For non-quantitative RT-PCR analysis of GHRLOS splicing, RT-PCRs were performed with a forward primer in a region common to the 5' terminal exon Ia/b and a reverse primer in the 3' terminal exon 4 of GHRLOS (Ito4-F/R, Table 1). cDNA was synthesised in a final volume of 20 μl from 3 μg total RNA from tissues and cell lines using 10 U of Transcriptor reverse transcriptase (Roche Applied Science), 20 U of RNasin Plus RNase Inhibitor (Promega) and a 3' RACE adapter primer (3'-RACE-adapter, Table 1) at 55°C according to the manufacturer's instructions. PCR amplicons from the stomach, prostate, foetal brain, heart, thymus, testis, and pancreas were purified, sub-cloned and sequenced as described above.
Long-range RT-PCR to detect putative chimaeric TATDN2-GHRLOS transcripts
To detect long, chimaeric transcripts, we employed RT-PCR with a forward primer in exon 2 of TATDN2 (ChiOut-F, Table 1) and a reverse primer in exon 1 of GHRLOS (ChiOut-R, Table 1). PCR was carried out with 1 U of Platinum Taq HIFI polymerase (Invitrogen) as per manufacturer's instructions, extending at 68°C for 2.5 minutes per cycle. cDNA was synthesised as above in a final volume of 20 μl from 2 μg total RNA, from the Hep G2 hepatocarcinoma cell line, CaCo-2 colorectal adenocarcinoma cell line, OVCAR-3 ovarian cancer cell line, and from a range of normal tissues (testis, prostate, pancreas, thymus, and foetal brain). RT-PCR products were sub-cloned and sequenced as described above.
CAGE-aided cDNA primer walking
To determine if the identified upstream CAGE tag starting sites transcribe exons that belong to GHRLOS, we employed RT-PCR using a forward primer designed to the region immediately after a CAGE cluster in the ~2 kb 3' untranslated region of the adjacent gene TATDN2 (TSS ID T03F009D1927) and a reverse primer in exon 1, an exon which is common to all known GHRLOS variants (F_CAGE and R_CAGEout in Table 1, respectively). cDNA was synthesised in a final volume of 20 μl from 2 μg total RNA, from the Hep G2 hepatocarcinoma cell line and the thymus and foetal brain, using 10 units Transcriptor reverse transcriptase (Roche Applied Science), 20 U of RNasin Plus RNase Inhibitor (Promega) and oligo(dT)18 primers (Proligo) according to the manufacturer's instructions. The cDNAs were subjected to 35 cycles of PCR with a two-minute extension time per cycle, and then diluted 1/100 in water and subjected to a hemi-nested 30-cycle PCR with a nested primer in exon 1 (R_CAGEin, Table 1). PCRs were performed in a total reaction volume of 50 μl using 10 U of Platinum Taq Polymerase High Fidelity (Invitrogen) according to the manufacturer's instructions. Entire PCR products were purified using the High Pure PCR Product Purification Kit (Roche Applied Science), subcloned into pCR-XL-TOPO (Invitrogen) and transformed into One Shot MAX Efficiency DH5α-T1R chemical competent cells (Invitrogen). Insert-positive, purified clones were sequenced by the Australian Genome Research Facility (AGRF, Brisbane, Australia) using the AB PRISM BigDye Terminator Cycle Sequencing Kit v3.1 protocol (AB).
Identification of novel SEC13 exon
To verify the presence of a novel SEC13 exon identified in a brain tumour EST [GenBank:BF931280], cDNAs reverse transcribed with an oligo(dT) primer (as described above) were challenged by RT-PCR with primers in exon 8 of SEC13 and a reverse primer in the novel exon (231-F/R, Table 1, respectively).
Strand-specific, quantitative real-time RT-PCR
To allow strand-specific and RNA-specific amplification [105–107] of GHRLOS transcripts, reverse transcription was performed using a gene-specific primer in exon 4 with a linker (LK)  sequence attached to the 5' end of the primer (GHRLOS-Real-RT-LK, Table 1). cDNA was generated from 1 μg total RNA using 40 U of AMV reverse transcriptase (Roche Applied Science) at 42°C, according to the manufacturer's instructions. The strand-specific, real-time RT-PCR was performed with an exon 4 specific forward primer, a reverse primer with the LK sequence only (GHRLOS-Real-F and LK, Table 1) and a TaqMan probe (Ex4-TaqMan, Table 1). To detect sense GHRL transcripts, we employed a strand-specific RT-PCR approach, with a reverse transcription primer spanning the 3' terminal exon 4 of the ghrelin gene (GHRLex4_RT_LK, Table 1) followed by PCR with an exon 4 specific forward primer (GHRLex4_F, Table 1) and a linker-specific reverse primer. (LK, Table 1). The relative quantification of GHRLOS and GHRL transcripts was estimated by direct normalisation to the threshold cycle (CT) of the housekeeping gene, 18S ribosomal RNA (18S-Real-F/R, Table 1). 18S PCRs were used to normalise real-time data. As reported for GAPDH , 18S RNAs self-primes efficiently in reverse transcription reactions without the addition of random or gene-specific primers. All primers were designed using the Primer Express version 2.0 software (AB).
PCRs were performed in a total reaction volume of 20 μl using Platinum Quantitative PCR SuperMix-UDG w/ROX (Invitrogen) for GHRLOS, while GHRL and the housekeeping gene 18S ribosomal RNA were amplified using 2 × SYBR green master mix (AB). Controls included the use of cDNA, which was reverse transcribed using random hexamers as primers, as well as the reverse transcription of RNA in the absence of primer. Real-time RT-PCR was performed using the AB 7000 sequence detection system (AB) and data analysed using the absolute standard curve method (User Bulletin #2, AB) to determine expression levels in a range of tissues and cell lines. Briefly, we calculated values from duplicate reactions for each sample from standards, which were constructed from PCR products. Statistical significance was determined using the Student's t-test and, where applicable, one-way analysis of variance (ANOVA) with Tukey post-hoc analysis. P-values of < 0.05 were considered to be statistically significant. Data are represented as mean ± standard deviation (S.D.).
This work was supported by grants from the Cancer Council Queensland (to LKC and ACH), the Faculty of Science, Queensland University of Technology and a QUT International Doctoral Scholarship (to IS). We thank Dr Yaowu He (IHBI, Kelvin Grove, Australia) for the U-937, U-87 MG and U-251 MG cell lines; Carina Walpole (IHBI) for the OVCAR-3 cell line; and Dr Penny Jeffery (Mater Medical Research Institute, South Brisbane, Australia) for the CaCo-2 colorectal adenocarcinoma cell line.
- Kojima M, Hosoda H, Date Y, Nakazato M, Matsuo H, Kangawa K: Ghrelin is a growth-hormone-releasing acylated peptide from stomach. Nature 1999, 402(6762):656-660. 10.1038/45230.PubMedGoogle Scholar
- Heijboer AC, Hoek AM, Parlevliet ET, Havekes LM, Romijn JA, Pijl H, Corssmit EP: Ghrelin differentially affects hepatic and peripheral insulin sensitivity in mice. Diabetologia 2006, 49(4):732-738. 10.1007/s00125-006-0138-2.PubMedGoogle Scholar
- Bednarek MA, Feighner SD, Pong SS, McKee KK, Hreniuk DL, Silva MV, Warren VA, Howard AD, Ploeg LH, Heck JV: Structure-function studies on the new growth hormone-releasing peptide, ghrelin: minimal sequence of ghrelin necessary for activation of growth hormone secretagogue receptor 1a. J Med Chem 2000, 43(23):4370-4376. 10.1021/jm0001727.PubMedGoogle Scholar
- Neary NM, Druce MR, Small CJ, Bloom SR: Acylated ghrelin stimulates food intake in the fed and fasted states but desacylated ghrelin has no effect. Gut 2006, 55(1):135.PubMed CentralPubMedGoogle Scholar
- Tack J, Depoortere I, Bisschops R, Delporte C, Coulie B, Meulemans A, Janssens J, Peeters T: Influence of ghrelin on interdigestive gastrointestinal motility in humans. Gut 2006, 55(3):327-333. 10.1136/gut.2004.060426.PubMed CentralPubMedGoogle Scholar
- Zigman JM, Nakano Y, Coppari R, Balthasar N, Marcus JN, Lee CE, Jones JE, Deysher AE, Waxman AR, White RD, Williams TD, Lachey JL, Seeley RJ, Lowell BB, Elmquist JK: Mice lacking ghrelin receptors resist the development of diet-induced obesity. J Clin Invest 2005, 115(12):3564-3572. 10.1172/JCI26002.PubMed CentralPubMedGoogle Scholar
- Tritos NA, Kokkotou EG: The physiology and potential clinical applications of ghrelin, a novel peptide hormone. Mayo Clin Proc 2006, 81(5):653-660.PubMedGoogle Scholar
- Seim I, Collet C, Herington AC, Chopin LK: Revised genomic structure of the human ghrelin gene and identification of novel exons, alternative splice variants and natural antisense transcripts. BMC Genomics 2007, 8: 298. 10.1186/1471-2164-8-298.PubMed CentralPubMedGoogle Scholar
- Lapidot M, Pilpel Y: Genome-wide natural antisense transcription: coupling its regulation to its different regulatory mechanisms. EMBO Rep 2006, 7(12):1216-1222. 10.1038/sj.embor.7400857.PubMed CentralPubMedGoogle Scholar
- RIKEN Genome Exploration Research Group and Genome Science Group (Genome Network Project Core Group) and the FANTOM Consortium: The transcriptional landscape of the mammalian genome. Science 2005, 309(5740):1559-1563. 10.1126/science.1112014.Google Scholar
- Mattick JS, Makunin IV: Non-coding RNA. Hum Mol Genet 2006, 15(Spec No 1):R17-29. 10.1093/hmg/ddl046.PubMedGoogle Scholar
- Mattick JS: A new paradigm for developmental biology. J Exp Biol 2007, 210(Pt 9):1526-1547. 10.1242/jeb.005017.PubMedGoogle Scholar
- Nagase T, Seki N, Ishikawa K, Ohira M, Kawarabayasi Y, Ohara O, Tanaka A, Kotani H, Miyajima N, Nomura N: Prediction of the coding sequences of unidentified human genes. VI. The coding sequences of 80 new genes (KIAA0201-KIAA0280) deduced by analysis of cDNA clones from cell line KG-1 and brain. DNA Res 1996, 3(5):321-329. 341–354. 10.1093/dnares/3.5.321.PubMedGoogle Scholar
- Wu TD, Watanabe CK: GMAP: a genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics 2005, 21(9):1859-1875. 10.1093/bioinformatics/bti310.PubMedGoogle Scholar
- Burset M, Seledtsov IA, Solovyev VV: Analysis of canonical and non-canonical splice sites in mammalian genomes. Nucleic Acids Res 2000, 28(21):4364-4375. 10.1093/nar/28.21.4364.PubMed CentralPubMedGoogle Scholar
- Shiraki T, Kondo S, Katayama S, Waki K, Kasukawa T, Kawaji H, Kodzius R, Watahiki A, Nakamura M, Arakawa T, Fukuda S, Sasaki D, Podhajska A, Harbers M, Kawai J, Carninci P, Hayashizaki Y: Cap analysis gene expression for high-throughput analysis of transcriptional starting point and identification of promoter usage. PNAS 2003, 100(26):15776-15781. 10.1073/pnas.2136655100.PubMed CentralPubMedGoogle Scholar
- Harbers M, Carninci P: Tag-based approaches for transcriptome research and genome annotation. Nat Methods 2005, 2(7):495-502. 10.1038/nmeth768.PubMedGoogle Scholar
- Frith MC, Ponjavic J, Fredman D, Kai C, Kawai J, Carninci P, Hayashizaki Y, Sandelin A: Evolutionary turnover of mammalian transcription start sites. Genome Res 2006, 16(6):713-722. 10.1101/gr.5031006.PubMed CentralPubMedGoogle Scholar
- Sandelin A, Carninci P, Lenhard B, Ponjavic J, Hayashizaki Y, Hume DA: Mammalian RNA polymerase II core promoters: insights from genome-wide studies. Nat Rev Genet 2007, 8(6):424-436. 10.1038/nrg2026.PubMedGoogle Scholar
- Akiva P, Toporik A, Edelheit S, Peretz Y, Diber A, Shemesh R, Novik A, Sorek R: Transcription-mediated gene fusion in the human genome. Genome Res 2006, 16(1):30-36. 10.1101/gr.4137606.PubMed CentralPubMedGoogle Scholar
- Chang YF, Imam JS, Wilkinson MF: The nonsense-mediated decay RNA surveillance pathway. Annu Rev Biochem 2007, 76: 51-74. 10.1146/annurev.biochem.76.050106.093909.PubMedGoogle Scholar
- Kohany O, Gentles AJ, Hankus L, Jurka J: Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor. BMC Bioinformatics 2006, 7: 474. 10.1186/1471-2105-7-474.PubMed CentralPubMedGoogle Scholar
- Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, et al.: Initial sequencing and analysis of the human genome. Nature 2001, 409(6822):860-921. 10.1038/35057062.PubMedGoogle Scholar
- Chen W, Bocker W, Brosius J, Tiedge H: Expression of neural BC200 RNA in human tumours. J Pathol 1997, 183(3):345-351. 10.1002/(SICI)1096-9896(199711)183:3<345::AID-PATH930>3.0.CO;2-8.PubMedGoogle Scholar
- Yoon H, He H, Nagy R, Davuluri R, Suster S, Schoenberg D, Pellegata N, Chapelle Ade L: Identification of a novel noncoding RNA gene, NAMA, that is downregulated in papillary thyroid carcinoma with BRAF mutation and associated with growth arrest. Int J Cancer 2007, 121(4):767-775. 10.1002/ijc.22701.PubMedGoogle Scholar
- Sonkoly E, Bata-Csorgo Z, Pivarcsi A, Polyanka H, Kenderessy-Szabo A, Molnar G, Szentpali K, Bari L, Megyeri K, Mandi Y, Dobozy A, Kemeny L, Szell M: Identification and characterization of a novel, psoriasis susceptibility-related noncoding RNA gene, PRINS. J Biol Chem 2005, 280(25):24159-24167. 10.1074/jbc.M501704200.PubMedGoogle Scholar
- Moh MC, Lee LH, Yang X, Shen S: Identification of a novel gene HEPT3 that is overexpressed in human hepatocellular carcinoma and may function through its noncoding RNA. Int J Oncol 2007, 31(2):293-301.PubMedGoogle Scholar
- Hughes H, Stephens DJ: Assembly, organization, and function of the COPII coat. Histochem Cell Biol 2008, 129(2):129-151. 10.1007/s00418-007-0363-x.PubMed CentralPubMedGoogle Scholar
- Gurkan C, Stagg SM, Lapointe P, Balch WE: The COPII cage: unifying principles of vesicle coat assembly. Nat Rev Mol Cell Biol 2006, 7(10):727-738. 10.1038/nrm2025.PubMedGoogle Scholar
- Dias Neto E, Correa RG, Verjovski-Almeida S, Briones MR, Nagai MA, da Silva W Jr, Zago MA, Bordin S, Costa FF, Goldman GH, Carvalho AF, Matsukuma A, Baia GS, Simpson DH, Brunstein A, de Oliveira PS, Bucher P, Jongeneel CV, O'Hare MJ, Soares F, Brentani RR, Reis LF, de Souza SJ, Simpson AJ: Shotgun sequencing of the human transcriptome with ORF expressed sequence tags. Proc Natl Acad Sci USA 2000, 97(7):3491-3496. 10.1073/pnas.97.7.3491.PubMed CentralPubMedGoogle Scholar
- Gnanapavan S, Kola B, Bustin SA, Morris DG, McGee P, Fairclough P, Bhattacharya S, Carpenter R, Grossman AB, Korbonits M: The tissue distribution of the mRNA of ghrelin and subtypes of its receptor, GHS-R, in humans. J Clin Endocrinol Metab 2002, 87(6):2988. 10.1210/jc.87.6.2988.PubMedGoogle Scholar
- McCracken S, Fong N, Rosonina E, Yankulov K, Brothers G, Siderovski D, Hessel A, Foster S, Shuman S, Bentley DL: 5'-Capping enzymes are targeted to pre-mRNA by binding to the phosphorylated carboxy-terminal domain of RNA polymerase II. Genes Dev 1997, 11(24):3306-3318. 10.1101/gad.11.24.3306.PubMed CentralPubMedGoogle Scholar
- Holmes R, Williamson C, Peters J, Denny P, Wells C: A comprehensive transcript map of the mouse Gnas imprinted complex. Genome Res 2003, 13(6B):1410-1415. 10.1101/gr.955503.PubMed CentralPubMedGoogle Scholar
- Hoshimaru M, Nakanishi S: Identification of a new type of mammalian myosin heavy chain by molecular cloning. Overlap of its mRNA with preprotachykinin B mRNA. J Biol Chem 1987, 262(30):14625-14632.PubMedGoogle Scholar
- Carninci P: Constructing the landscape of the mammalian transcriptome. J Exp Biol 2007, 210(Pt 9):1497-1506. 10.1242/jeb.000406.PubMedGoogle Scholar
- Denoeud F, Kapranov P, Ucla C, Frankish A, Castelo R, Drenkow J, Lagarde J, Alioto T, Manzano C, Chrast J, Dike S, Wyss C, Henrichsen CN, Holroyd N, Dickson MC, Taylor R, Hance Z, Foissac S, Myers RM, Rogers J, Hubbard T, Harrow J, Guigo R, Gingeras TR, Antonarakis SE, Reymond A: Prominent use of distal 5' transcription start sites and discovery of a large number of additional exons in ENCODE regions. Genome Res 2007, 17(6):746-759. 10.1101/gr.5660607.PubMed CentralPubMedGoogle Scholar
- Gingeras TR: The multitasking genome. Nat Genet 2006, 38(6):608-609. 10.1038/ng0606-608.PubMedGoogle Scholar
- Manak JR, Dike S, Sementchenko V, Kapranov P, Biemar F, Long J, Cheng J, Bell I, Ghosh S, Piccolboni A, Gingeras TR: Biological function of unannotated transcription during the early development of Drosophila melanogaster. Nat Genet 2006, 38(10):1151-1158. 10.1038/ng1875.PubMedGoogle Scholar
- Williams BA, Slamovits CH, Patron NJ, Fast NM, Keeling PJ: A high frequency of overlapping gene expression in compacted eukaryotic genomes. Proc Natl Acad Sci USA 2005, 102(31):10936-10941. 10.1073/pnas.0501321102.PubMed CentralPubMedGoogle Scholar
- Parra G, Reymond A, Dabbouseh N, Dermitzakis ET, Castelo R, Thomson TM, Antonarakis SE, Guigo R: Tandem chimerism as a means to increase protein complexity in the human genome. Genome Res 2006, 16(1):37-44. 10.1101/gr.4145906.PubMed CentralPubMedGoogle Scholar
- Kapranov P, Drenkow J, Cheng J, Long J, Helt G, Dike S, Gingeras TR: Examples of the complex architecture of the human transcriptome revealed by RACE and high-density tiling arrays. Genome Res 2005, 15(7):987-997. 10.1101/gr.3455305.PubMed CentralPubMedGoogle Scholar
- Kapranov P, Willingham AT, Gingeras TR: Genome-wide transcription and the implications for genomic organization. Nat Rev Genet 2007, 8(6):413-423. 10.1038/nrg2083.PubMedGoogle Scholar
- Carninci P, Kasukawa T, Katayama S, Gough J, Frith MC, Maeda N, Oyama R, Ravasi T, Lenhard B, Wells C, Kodzius R, Shimokawa K, Bajic VB, Brenner SE, Batalov S, Forrest AR, Zavolan M, Davis MJ, Wilming LG, Aidinis V, Allen JE, Ambesi-Impiombato A, Apweiler R, Aturaliya RN, Bailey TL, Bansal M, Baxter L, Beisel KW, Bersano T, Bono H, et al.: The transcriptional landscape of the mammalian genome. Science 2005, 309(5740):1559-1563. 10.1126/science.1112014.PubMedGoogle Scholar
- Horiuchi T, Aigaki T: Alternative trans-splicing: a novel mode of pre-mRNA processing. Biol Cell 2006, 98(2):135-140. 10.1042/BC20050002.PubMedGoogle Scholar
- Wexler M, Sargent F, Jack RL, Stanley NR, Bogsch EG, Robinson C, Berks BC, Palmer T: TatD is a cytoplasmic protein with DNase activity. No requirement for TatD family proteins in sec-independent protein export. J Biol Chem 2000, 275(22):16717-16722. 10.1074/jbc.M000800200.PubMedGoogle Scholar
- Hillman RT, Green RE, Brenner SE: An unappreciated role for RNA surveillance. Genome Biol 2004, 5(2):R8. 10.1186/gb-2004-5-2-r8.PubMed CentralPubMedGoogle Scholar
- Karam R, Carvalho J, Bruno I, Graziadio C, Senz J, Huntsman D, Carneiro F, Seruca R, Wilkinson MF, Oliveira C: The NMD mRNA surveillance pathway downregulates aberrant E-cadherin transcripts in gastric cancer cells and in CDH1 mutation carriers. Oncogene 2008.Google Scholar
- Pang KC, Frith MC, Mattick JS: Rapid evolution of noncoding RNAs: lack of conservation does not mean lack of function. Trends Genet 2006, 22(1):1-5. 10.1016/j.tig.2005.10.003.PubMedGoogle Scholar
- Pollard KS, Salama SR, Lambert N, Lambot MA, Coppens S, Pedersen JS, Katzman S, King B, Onodera C, Siepel A, Kern AD, Dehay C, Igel H, Ares M Jr, Vanderhaeghen P, Haussler D: An RNA gene expressed during cortical development evolved rapidly in humans. Nature 2006, 443(7108):167-172. 10.1038/nature05113.PubMedGoogle Scholar
- Sasaki YT, Sano M, Ideue T, Kin T, Asai K, Hirose T: Identification and characterization of human non-coding RNAs with tissue-specific expression. Biochem Biophys Res Commun 2007, 357(4):991-996. 10.1016/j.bbrc.2007.04.034.PubMedGoogle Scholar
- Khalil AM, Faghihi MA, Modarresi F, Brothers SP, Wahlestedt C: A novel RNA transcript with antiapoptotic function is silenced in fragile × syndrome. PLoS ONE 2008, 3(1):e1486. 10.1371/journal.pone.0001486.PubMed CentralPubMedGoogle Scholar
- Yan MD, Hong CC, Lai GM, Cheng AL, Lin YW, Chuang SE: Identification and characterization of a novel gene Saf transcribed from the opposite strand of Fas. Hum Mol Genet 2005, 14(11):1465-1474. 10.1093/hmg/ddi156.PubMedGoogle Scholar
- Inagaki S, Numata K, Kondo T, Tomita M, Yasuda K, Kanai A, Kageyama Y: Identification and expression analysis of putative mRNA-like non-coding RNA in Drosophila. Genes Cells 2005, 10(12):1163-1173. 10.1111/j.1365-2443.2005.00910.x.PubMedGoogle Scholar
- Taft R, Mattick J: Increasing biological complexity is positively correlated with the relative genome-wide expansion of non-protein-coding DNA sequences. Genome Biology 2003, 5(1):P1. 10.1186/gb-2003-5-1-p1.Google Scholar
- Mattick JS: RNA regulation: a new genetics? Nat Rev Genet 2004, 5(4):316-323. 10.1038/nrg1321.PubMedGoogle Scholar
- Mattick JS: Non-coding RNAs: the architects of eukaryotic complexity. EMBO Rep 2001, 2(11):986-991. 10.1093/embo-reports/kve230.PubMed CentralPubMedGoogle Scholar
- Frith MC, Forrest AR, Nourbakhsh E, Pang KC, Kai C, Kawai J, Carninci P, Hayashizaki Y, Bailey TL, Grimmond SM: The abundance of short proteins in the mammalian proteome. PLoS Genet 2006, 2(4):e52. 10.1371/journal.pgen.0020052.PubMed CentralPubMedGoogle Scholar
- Perez DS, Hoage TR, Pritchett JR, Ducharme-Smith AL, Halling ML, Ganapathiraju SC, Streng PS, Smith DI: Long, abundantly expressed non-coding transcripts are altered in cancer. Hum Mol Genet 2008, 17(5):642-655. 10.1093/hmg/ddm336.PubMedGoogle Scholar
- Ravasi T, Suzuki H, Pang KC, Katayama S, Furuno M, Okunishi R, Fukuda S, Ru K, Frith MC, Gongora MM, Grimmond SM, Hume DA, Hayashizaki Y, Mattick JS: Experimental validation of the regulated expression of large numbers of non-coding RNAs from the mouse genome. Genome Res 2006, 16(1):11-19. 10.1101/gr.4200206.PubMed CentralPubMedGoogle Scholar
- Furuno M, Pang KC, Ninomiya N, Fukuda S, Frith MC, Bult C, Kai C, Kawai J, Carninci P, Hayashizaki Y, Mattick JS, Suzuki H: Clusters of internally primed transcripts reveal novel long noncoding RNAs. PLoS Genet 2006, 2(4):e37. 10.1371/journal.pgen.0020037.PubMed CentralPubMedGoogle Scholar
- Mercer TR, Dinger ME, Sunkin SM, Mehler MF, Mattick JS: Specific expression of long noncoding RNAs in the mouse brain. Proc Natl Acad Sci USA 2008, 105(2):716-721. 10.1073/pnas.0706729105.PubMed CentralPubMedGoogle Scholar
- St Laurent G 3rd, Wahlestedt C: Noncoding RNAs: couplers of analog and digital information in nervous system function? Trends Neurosci 2007, 30(12):612-621. 10.1016/j.tins.2007.10.002.PubMedGoogle Scholar
- Mehler MF, Mattick JS: Non-coding RNAs in the nervous system. J Physiol 2006, 575(Pt 2):333-341. 10.1113/jphysiol.2006.113191.PubMed CentralPubMedGoogle Scholar
- Mehler MF, Mattick JS: Noncoding RNAs and RNA editing in brain development, functional diversification, and neurological disease. Physiol Rev 2007, 87(3):799-823. 10.1152/physrev.00036.2006.PubMedGoogle Scholar
- Okada Y, Tashiro C, Numata K, Watanabe K, Nakaoka H, Yamamoto N, Okubo K, Ikeda R, Saito R, Kanai A, Abe K, Tomita M, Kiyosawa H: Comparative expression analysis uncovers novel features of endogenous antisense transcription. Hum Mol Genet 2008.Google Scholar
- Calin GA, Liu CG, Ferracin M, Hyslop T, Spizzo R, Sevignani C, Fabbri M, Cimmino A, Lee EJ, Wojcik SE, Shimizu M, Tili E, Rossi S, Taccioli C, Pichiorri F, Liu X, Zupo S, Herlea V, Gramantieri L, Lanza G, Alder H, Rassenti L, Volinia S, Schmittgen TD, Kipps TJ, Negrini M, Croce CM: Ultraconserved regions encoding ncRNAs are altered in human leukemias and carcinomas. Cancer Cell 2007, 12(3):215-229. 10.1016/j.ccr.2007.07.027.PubMedGoogle Scholar
- Ladd PD, Smith LE, Rabaia NA, Moore JM, Georges SA, Hansen RS, Hagerman RJ, Tassone F, Tapscott SJ, Filippova GN: An antisense transcript spanning the CGG repeat region of FMR1 is upregulated in premutation carriers but silenced in full mutation individuals. Hum Mol Genet 2007, 16(24):3174-3187. 10.1093/hmg/ddm293.PubMedGoogle Scholar
- Prasanth KV, Spector DL: Eukaryotic regulatory RNAs: an answer to the 'genome complexity' conundrum. Genes Dev 2007, 21(1):11-42. 10.1101/gad.1484207.PubMedGoogle Scholar
- Sleutels F, Zwart R, Barlow DP: The non-coding Air RNA is required for silencing autosomal imprinted genes. Nature 2002, 415(6873):810-813.PubMedGoogle Scholar
- Rinn JL, Kertesz M, Wang JK, Squazzo SL, Xu X, Brugmann SA, Goodnough LH, Helms JA, Farnham PJ, Segal E, Chang HY: Functional demarcation of active and silent chromatin domains in human HOX loci by noncoding RNAs. Cell 2007, 129(7):1311-1323. 10.1016/j.cell.2007.05.022.PubMed CentralPubMedGoogle Scholar
- Willingham AT, Orth AP, Batalov S, Peters EC, Wen BG, Aza-Blanc P, Hogenesch JB, Schultz PG: A strategy for probing the function of noncoding RNAs finds a repressor of NFAT. Science 2005, 309(5740):1570-1573. 10.1126/science.1115901.PubMedGoogle Scholar
- Tsang WP, Wong TW, Cheung AH, Co CN, Kwok TT: Induction of drug resistance and transformation in human cancer cells by the noncoding RNA CUDR. Rna 2007, 13(6):890-898. 10.1261/rna.359007.PubMed CentralPubMedGoogle Scholar
- Martianov I, Ramadass A, Serra Barros A, Chow N, Akoulitchev A: Repression of the human dihydrofolate reductase gene by a non-coding interfering transcript. Nature 2007, 445(7128):666-670. 10.1038/nature05519.PubMedGoogle Scholar
- Zhou Y, Zhong Y, Wang Y, Zhang X, Batista DL, Gejman R, Ansell PJ, Zhao J, Weng C, Klibanski A: Activation of p53 by MEG3 non-coding RNA. J Biol Chem 2007, 282(34):24731-24742. 10.1074/jbc.M702029200.PubMedGoogle Scholar
- Ji P, Diederichs S, Wang W, Boing S, Metzger R, Schneider PM, Tidow N, Brandt B, Buerger H, Bulk E, Thomas M, Berdel WE, Serve H, Muller-Tidow C: MALAT-1, a novel noncoding RNA, and thymosin beta4 predict metastasis and survival in early-stage non-small cell lung cancer. Oncogene 2003, 22(39):8031-8041. 10.1038/sj.onc.1206928.PubMedGoogle Scholar
- Ishii N, Ozaki K, Sato H, Mizuno H, Saito S, Takahashi A, Miyamoto Y, Ikegawa S, Kamatani N, Hori M, Nakamura Y, Tanaka T: Identification of a novel non-coding RNA, MIAT, that confers risk of myocardial infarction. J Hum Genet 2006, 51(12):1087-1099. 10.1007/s10038-006-0070-9.PubMedGoogle Scholar
- Szymanski M, Barciszewska MZ, Erdmann VA, Barciszewski J: A new frontier for molecular medicine: noncoding RNAs. Biochim Biophys Acta 2005, 1756(1):65-75.PubMedGoogle Scholar
- Matera AG, Terns RM, Terns MP: Non-coding RNAs: lessons from the small nuclear and small nucleolar RNAs. Nat Rev Mol Cell Biol 2007, 8(3):209-220. 10.1038/nrm2124.PubMedGoogle Scholar
- Tam OH, Aravin AA, Stein P, Girard A, Murchison EP, Cheloufi S, Hodges E, Anger M, Sachidanandam R, Schultz RM, Hannon GJ: Pseudogene-derived small interfering RNAs regulate gene expression in mouse oocytes. Nature 2008, 453(7194):534-538. 10.1038/nature06904.PubMed CentralPubMedGoogle Scholar
- Watanabe T, Totoki Y, Toyoda A, Kaneda M, Kuramochi-Miyagawa S, Obata Y, Chiba H, Kohara Y, Kono T, Nakano T, Surani MA, Sakaki Y, Sasaki H: Endogenous siRNAs from naturally formed dsRNAs regulate transcripts in mouse oocytes. Nature 2008, 453(7194):539-543. 10.1038/nature06908.PubMedGoogle Scholar
- Kim M, Patel B, Schroeder KE, Raza A, Dejong J: Organization and transcriptional output of a novel mRNA-like piRNA gene (mpiR) located on mouse chromosome 10. Rna 2008, 14(6):1005-1011. 10.1261/rna.974608.PubMed CentralPubMedGoogle Scholar
- Kapranov P, Cheng J, Dike S, Nix DA, Duttagupta R, Willingham AT, Stadler PF, Hertel J, Hackermuller J, Hofacker IL, Bell I, Cheung E, Drenkow J, Dumais E, Patel S, Helt G, Ganesh M, Ghosh S, Piccolboni A, Sementchenko V, Tammana H, Gingeras TR: RNA maps reveal new RNA classes and a possible function for pervasive transcription. Science 2007, 316(5830):1484-1488. 10.1126/science.1138341.PubMedGoogle Scholar
- Ogawa Y, Sun BK, Lee JT: Intersection of the RNA interference and X-inactivation pathways. Science 2008, 320(5881):1336-1341. 10.1126/science.1157676.PubMed CentralPubMedGoogle Scholar
- Lavorgna G, Dahary D, Lehner B, Sorek R, Sanderson CM, Casari G: In search of antisense. Trends Biochem Sci 2004, 29(2):88-94. 10.1016/j.tibs.2003.12.002.PubMedGoogle Scholar
- Osato N, Suzuki Y, Ikeo K, Gojobori T: Transcriptional interferences in cis natural antisense transcripts of humans and mice. Genetics 2007, 176(2):1299-1306. 10.1534/genetics.106.069484.PubMed CentralPubMedGoogle Scholar
- Lazar MA, Hodin RA, Cardona G, Chin WW: Gene expression from the c-erbA alpha/Rev-ErbA alpha genomic locus. Potential regulation of alternative splicing by opposite strand transcription. J Biol Chem 1990, 265(22):12859-12863.PubMedGoogle Scholar
- Hastings ML, Ingle HA, Lazar MA, Munroe SH: Post-transcriptional regulation of thyroid hormone receptor expression by cis-acting sequences and a naturally occurring antisense RNA. J Biol Chem 2000, 275(15):11507-11513. 10.1074/jbc.275.15.11507.PubMedGoogle Scholar
- Chambers AE, Banerjee S: Natural antisense LHCGR could make sense of hypogonadism, male-limited precocious puberty and pre-eclampsia. Mol Cell Endocrinol 2005, 241(1–2):1-9. 10.1016/j.mce.2005.06.007.PubMedGoogle Scholar
- Barclay C, Li AW, Geldenhuys L, Baguma-Nibasheka M, Porter GA, Veugelers PJ, Murphy PR, Casson AG: Basic fibroblast growth factor (FGF-2) overexpression is a risk factor for esophageal cancer recurrence and reduced survival, which is ameliorated by coexpression of the FGF-2 antisense gene. Clin Cancer Res 2005, 11(21):7683-7691. 10.1158/1078-0432.CCR-05-0771.PubMedGoogle Scholar
- Yin GL, Chen Q, Yang WJ: Naturally occurring antisense RNA of allatostatin gene in the prawn, Macrobrachium rosenbergii. Comp Biochem Physiol B Biochem Mol Biol 2007, 146(1):20-25. 10.1016/j.cbpb.2006.08.010.PubMedGoogle Scholar
- Ovcharenko I, Loots GG, Giardine BM, Hou M, Ma J, Hardison RC, Stubbs L, Miller W: Mulan: multiple-sequence local alignment and visualization for studying function and evolution. Genome Res 2005, 15(1):184-194. 10.1101/gr.3007205.PubMed CentralPubMedGoogle Scholar
- Ovcharenko I, Nobrega MA, Loots GG, Stubbs L: ECR Browser: a tool for visualizing and accessing data from comparisons of multiple vertebrate genomes. Nucleic Acids Res 2004, (32 Web Server):W280-286. 10.1093/nar/gkh355Google Scholar
- Kuhn RM, Karolchik D, Zweig AS, Trumbower H, Thomas DJ, Thakkapallayil A, Sugnet CW, Stanke M, Smith KE, Siepel A, Rosenbloom KR, Rhead B, Raney BJ, Pohl A, Pedersen JS, Hsu F, Hinrichs AS, Harte RA, Diekhans M, Clawson H, Bejerano G, Barber GP, Baertsch R, Haussler D, Kent WJ: The UCSC genome browser database: update 2007 10.1093/nar/gkl928. Nucl Acids Res 2007, 35(suppl_1):D668-673. 10.1093/nar/gkl928.PubMed CentralPubMedGoogle Scholar
- Affymetrix Integrated Genome Browser[https://www.affymetrix.com/support/developer/tools/IGB_User_Guide.pdf]
- Genome Network Platform Viewer[http://genomenetwork.nig.ac.jp/public/sys/gnppub/]
- Haertel M: GNU grep-2.0. Internet Archive: Usenet archive comp. sources.reviewed. 1996., 3:Google Scholar
- NCBI ORF Finder[http://www.ncbi.nlm.nih.gov/projects/gorf/]
- Fickett JW: Recognition of protein coding regions in DNA sequences. Nucleic Acids Res 1982, 10(17):5303-5318. 10.1093/nar/10.17.5303.PubMed CentralPubMedGoogle Scholar
- Lottaz C, Iseli C, Jongeneel CV, Bucher P: Modeling sequencing errors by combining Hidden Markov models. Bioinformatics 2003, 19(Suppl 2):ii103-112.PubMedGoogle Scholar
- Iseli C, Jongeneel CV, Bucher P: ESTScan: a program for detecting, evaluating, and reconstructing potential coding regions in EST sequences. Proc Int Conf Intell Syst Mol Biol 1999, 138-148.Google Scholar
- Letunic I, Copley RR, Pils B, Pinkert S, Schultz J, Bork P: SMART 5: domains in the context of genomes and networks. Nucleic Acids Res 2006, (34 Database):D257-260. 10.1093/nar/gkj079Google Scholar
- Polidoros AN, Pasentsis K, Tsaftaris AS: Rolling circle amplification-RACE: a method for simultaneous isolation of 5' and 3' cDNA ends from amplified cDNA templates. Biotechniques 2006, 41(1):35-36, 38, 40 passim.. 10.2144/000112205.PubMedGoogle Scholar
- Chopin LK, Veveris-Lowe TL, Philipps AF, Herington AC: Co-expression of GH and GHR isoforms in prostate cancer cell lines. Growth Horm IGF Res 2002, 12(2):126-136. 10.1054/ghir.2002.0271.PubMedGoogle Scholar
- Lin L, Fevery J, Hiem Yap S: A novel strand-specific RT-PCR for detection of hepatitis C virus negative-strand RNA (replicative intermediate): evidence of absence or very low level of HCV replication in peripheral blood mononuclear cells. J Virol Methods 2002, 100(1–2):97-105. 10.1016/S0166-0934(01)00399-8.PubMedGoogle Scholar
- Pinto FL, Svensson H, Lindblad P: Generation of non-genomic oligonucleotide tag sequences for RNA template-specific PCR. BMC Biotechnol 2006, 6: 31. 10.1186/1472-6750-6-31.PubMed CentralPubMedGoogle Scholar
- Cho DH, Thienes CP, Mahoney SE, Analau E, Filippova GN, Tapscott SJ: Antisense transcription and heterochromatin at the DM1 CTG repeats are constrained by CTCF. Mol Cell 2005, 20(3):483-489. 10.1016/j.molcel.2005.09.002.PubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.