Bmc Molecular Biology Characterization of Novel Alternative Splicing Sites in Human Telomerase Reverse Transcriptase (htert): Analysis of Expression and Mutual Correlation in Mrna Isoforms from Normal and Tumour Tissues

Background: Human telomerase reverse transcriptase (hTERT) is a key component for synthesis and maintenance of telomeres on chromosome ends and is required for the continued proliferation of cells. Estimation of hTERT expression therefore has broad relevance in oncology and stem cell research. Several splicing variants of hTERT have been described whose regulated expression contributes to the control of telomerase activity. Knowledge of the different hTERT mRNA isoforms and the ability to distinguish between them is an important issue when evaluating telomerase expression.


Background
The ends of eukaryotic chromosomes are usually capped by telomeres which consist of repeated copies of a short DNA sequence and several associated proteins [reviewed in [1]]. The telomeres protect the chromosomes from damage and degradation and from being fused together by DNA repair mechanisms and are essential for genomic integrity and cell viability. Since the conventional DNA replication process is unable to completely synthesize chromosome ends, proliferating cells lacking de novo synthesis of telomeric DNA will eventually lose their telomeres and enter a growth-arrest state called replicative senescence. The continued proliferation of eukaryotic cells is ensured by the telomerase enzyme which maintains and synthesizes telomeric repeats on to chromosome ends [2,3]. In humans, telomerase activity has been detected in many highly proliferative cells and tissues, such as early stage embryos, reproductive tissues in testis and ovary, stem cells, fibroblasts and activated lymphocytes [reviewed in [4]]. Most somatic tissues contain undetectable levels of telomerase activity, but restoration of telomerase activity is required for immortalization and continued growth of cancer cells. Thus, regulation of telomerase activity has important implications for many developmental processes including cell proliferation, differentiation, ageing and tumorigenesis.
The human telomerase holoenzyme is composed of two core subunits, the telomerase RNA component hTERC [5], which contains a template for telomere elongation, and the telomerase reverse transcriptase catalytic subunit hTERT [6][7][8][9]. While hTERC is widely expressed in human tissues irrespective of telomerase status, the expression of normal full-length hTERT correlates well with telomerase activity and seems to be the rate-limiting factor for telomerase activity in human cells [7,8,[10][11][12][13]. The hTERT gene consists of 16 exons and spans ~37 kb of genomic DNA, of which ~33 kb is intronic sequences and the remaining 4 kb corresponds to the hTERT mRNA transcript [14]. Since processing of hTERT pre-mRNA also yields nonfunctional alternatively spliced products, the correlation between hTERT gene expression and telomerase activity is complicated. To date, seven alternatively spliced sites (ASPSs) in the hTERT mRNA have been described [9,14,15]. Two ASPSs, α-deletion and γ-deletion, result from in-frame deletions of exonic sequences in exon 6 and 11, respectively. The α-deletion isoform appear to be a dominant inhibitor of telomerase activity when overexpressed [16,17]. The remaining ASPSs represent exonic deletions and/or insertion of intronic sequences that cause frame shift and premature termination of the open reading frame (ORF). Alternative splicing of hTERT has implications for the regulation of telomerase activity [10,[18][19][20]. In particular, telomerase is down-regulated in many tissues by a shift to β-deletion splicing mode, in which exons 7 and 8 are deleted. Alternative splicing reconciles many of the inconsistencies observed between hTERT mRNA levels and lack of telomerase activity, but telomerase-negative cells frequently contain hTERT mRNA of which a fraction apparently is normal fulllength [11,13,21,22]. It has been suggested that this may be due to downstream regulatory mechanisms, such as inhibitory factors or post-translational modification of the hTERT protein. It is also possible that the putative fulllength hTERT mRNA in such cases contained unknown ASPSs, which consequently were not screened for. Here we report the characterization of six novel hTERT ASPSs detected in primary tissues from lung and colon that may be important for the regulation of telomerase activity in human cells.

Characterization of hTERT ASPSs
To identify novel hTERT ASPSs and splice patterns, a systematic search was set up by establishing panels of cDNA plasmid clones from five different tissues: lung tumour and adjacent tissue, colon tumour, K562 and HL60 cell lines. The cDNA clones were generated by RT-PCR using two primers (p1255 and m3652; Table 1) that produce a 2409-bp fragment from normal full-length hTERT mRNA, encompassing the positions of intron 2 to 15 (Fig. 1A). A total of 134 cDNA clones were analysed individually by screening with a panel of primer pairs (Table 1) covering different sub-regions of the hTERT cDNA. Clones that contained insertions and/or deletions compared to the normal full-length isoform were analysed by DNA sequencing. The screening detected ten different hTERT ASPSs in addition to the normal full-length isoform (Fig.  1B) and six of these have not been described earlier. The donor and acceptor splice sequences used to produce the different ASPSs conform to the GT-AG consensus for intron splicing in eukaryotes [23].
Three novel ASPSs were found that involve insertion of sequences from intron 2. The ASPS named Ins-i2 [1-349] (see Figure 1 for description of the naming convention used) has inserted the first 349 nucleotides of intron 2 and results from the use of an alternative 5'-splice donor sequence within intron 2 (Fig. 1D). The same splice site has also been used to produce the ASPS Ins-i2 . But in this ASPS an additional splicing event has removed the first 181 nucleotides of intron 2 by use of the normal Genomic organization and ASPSs of the hTERT gene   5'-splice donor sequence and an alternative 3'-splice acceptor sequence, leaving nucleotides 182-349 of intron 2 in the spliced product. The third ASPS involving intron 2, Ins-i2 [?-5273], results from the use of an alternative 5'splice donor sequence within intron 2, but we were not able to characterize this ASPS completely. The RT-PCR cDNA clones containing this ASPS result from internal priming of the 14 3'-most nucleotides of the p1255 primer at position 5212-5225 of intron 2. Consequently, the sequences located 5' to nucleotide position 5212 of intron 2 were missing from the clones. In principle, the insertion could involve nucleotides 1-5273 of intron 2, or other sub-regions further upstream may be spliced out.
The complete lack of Ins-i2 [?-5273] clones produced by normal priming in exon 2 indicates that it either lacks part of exon 2, or is too long to be efficiently amplified by PCR together with other isoforms. The screening also detected several ASPSs that have already been described in the literature (see Figure 1B-C). These include Ins-i4 [1-38], Del-e6 [1-36], Del-e7e8 and Ins-i14 [623-781] [9]. Two ASPSs that have been described by others were not observed among the 134 cDNA clones. These include Del-e11 [15] and Ins-i11 [1-?] [9] (see Figure 1B-C). No ASPSs were found that are associated with alternative splicing of intron 3, 5, 9, 10, 11, 12, 13 or 15. Since the primers used to generate RT-PCR cDNA clones anneal in exon 2 and 16 respectively, potential ASPSs involving intron 1 or deletions in exon 1, 2 or 16 could not be addressed in this screening, including [14].

Quantitative analysis of RT-PCR cDNA-clone panels
Assuming that the RT-PCR cDNA clones represent a random selection of mRNA isoforms present in the sample analysed, statistical analyses were employed to address quantitative and correlative relationships. The quantitative data obtained by this approach relates to the percentage-wise distribution of ASPSs within a sample and does not address absolute expression levels. Figure 2 (black bars) summarizes the ASPSs found in RT-PCR cDNA clones from four different samples. Distinct differences were observed in splice pattern between the samples.  Since many ASPSs result in frame shift and premature termination of the ORF, the expected frequency of a particular protein isoform does not depend on the frequency of the corresponding ASPS in mRNA only, but also on the presence of additional ASPSs located further upstream in the same mRNAs. To address this question, the cDNA clones were mapped individually for the presence of alternative 5'-splice donor sites being continuous with the main ORF (Fig. 2, cross-hatched bars). As may be expected, the probability of a particular ASPS being expressed at the protein level depends on its position on the 5' to 3' axis. ASPSs involving intron 2 will always be continuous with the main ORF, while those involving intron 14 have a low probability of becoming part of a protein.

Quantitative analysis of cDNA samples
To consolidate the data obtained by screening of RT-PCR cDNA clones, five lung normal/adjacent tissue cDNA samples and seven colon tumour/adjacent tissue cDNA samples were screened by PCR using primer pairs (Table 1) covering smaller sub-regions of hTERT. The results (Table  3) confirmed the presence of most ASPSs detected in RT-PCR cDNA clones. The exception was the ASPS Ins-i14 which was not detected in any of the samples. Ins-i14 contains a 781-bp insert and is probably subject to negative selection when co-amplified with considerably shorter products. The ASPS Ins-i14 [1-600]/Del-e15e16 [1-492], which could not be addressed in the analysis of clones, was not detected in any of the samples. Several ASPSs were detected in tissues for which there are no corresponding RT-PCR cDNA clones (see Figure 2). These include Ins-i2 [  1) Correlation coefficient calculated according to the formula: ρ X, Y = cov(X, Y)/(σ X ·σ Y ), where σ x 2 = Σ(X i -µ x ) 2 /n, σ y 2 = Σ(Y i -µ y ) 2 /n (ρ; correlation coefficient, cov; co-variance, σ; standard deviation). The correlation coefficient is a number between -1 and 1: -1, mutually exclusive (all clones contain one or the other); 0, random distribution or both not present; 1, always occur together (independently of frequency).    centage of Del-e11 increased 5-fold from 35 to 55 cycles of PCR, which implies an amplification rate of 1.087 per cycle compared to normal full-length. Extrapolation of quantitative data back to PCR cycle 0 ( It was apparent from the PCR analysis that cDNA samples from lung tumour yielded more hTERT PCR products compared to the other tissues. To provide a rough estimate of these differences, all hTERT PCR products produced for each respective tissue were normalized to αtubulin levels and pooled (Table 3, lower right). The cDNA samples included were limited to those numbered from A through E, which are all pre-normalized with respect to cDNA concentration based on the levels of two housekeeping genes. Compared to lung adjacent tissue, which had the lowest level of hTERT products, a 14-fold increase was observed in lung tumour tissue. Colon tumour and adjacent tissues, on the other hand, had very similar hTERT levels, about 1.5 to 2-fold higher than in lung adjacent tissue. In a theoretical perspective, these levels are indicative of hTERT mRNA abundance (all isoforms included) in the respective tissues.

Discussion
Regulation of telomerase activity is a complex process involving transcriptional control, post-translational modification, positively and negatively acting factors and alternative splicing [reviewed in [4,24]]. Telomerase activity in cells can be measured directly by TRAP analysis [25]. When addressing telomerase expression by other methods, such as detection of mRNA or protein, caution must be taken to distinguish between functional and non-functional isoforms. In this respect, detailed knowledge about the different hTERT isoforms is a prerequisite for proper experimental set-up and correct interpretation of data. By screening a number of RT-PCR-generated cDNA clones from human primary tissues, we identified six novel hTERT ASPSs and also four out of six previously described ASPSs. Three novel ASPSs involve alternative splicing of intron 2 and thus have a dominant relationship to other ASPSs located further down-stream in the hTERT mRNA. The ASPSs add to the complexity of telomerase regulation and should be taken into consideration when interpreting new and past studies on hTERT expression.
In addition to being a tool for detecting sequence differences, the panels of cDNA clones may also be used to address quantitative relationships and mutual correlation of ASPSs in mRNA isoforms. Since the PCR amplicon used to generate RT-PCR cDNA clones is rather large, the differences in size between the different isoforms due to insertions/deletions becomes comparatively smaller. This may promote a more equal amplification rate of the different isoforms being co-amplified in the PCR reaction, compared to PCR covering smaller sub-regions of the hTERT cDNA. The situation with over-amplification of the deletion ASPS Del-e11 (Table 3) may serve as an example. Accordingly, Del-e11 was not found among the 134 cDNA clones, which implies that such over-amplification has not occurred when synthesizing the long fragment used for cloning. In this respect, statistical analysis of cDNA clones may represent the most accurate method for quantification of ASPSs in mRNA. The number of clones analysed here (Fig. 2) is not sufficient to meet statistical requirements of sample size. Sampling errors are therefore unavoidable. As seen in the screening of cDNA samples with primer pairs covering smaller sub-regions of hTERT (Table 3), several ASPSs were detected in tissues for which there are no corresponding RT-PCR cDNA clones. Thus, many ASPSs appear to be more widely expressed than identified by analysis of clones. The variation in ASPS content observed between cDNA clones and cDNA samples is most likely related to a generally low abundance of hTERT mRNA. Some uncertainty also exist regarding the two cDNA clones that contained an unspliced intron 14 (Ins-i14). It is possible that these clones have originated from nuclear mRNA in which intron 14 is retained because of incomplete processing.
A useful consequence of mapping individual cDNA clones is the information obtained on correlation, i.e. the degree by which the different ASPSs tend to occur together in the same mRNAs. This information is necessary for predicting which protein isoforms may be produced in cells since the majority of ASPSs result in frame shift and premature termination of translation (Fig. 1B), which will mask other features located further down-stream in the same mRNA. The RT-PCR cDNA clones were generated from poly(A) + mRNA and it is reasonable to assume that the corresponding mRNA isoforms are processed by the translation machinery with similar efficiency as the normal fulllength isoform. If not, specific mechanisms must exist that can recognize the different internal insertions/deletions in these mRNAs and suppress transport/translation. Thus, the expected frequency of a particular protein isoform (frame shift, insertion or deletion of aa) depends in part on the frequency of the corresponding ASPS in mRNA, but also on the presence of additional ASPSs located further upstream. Since the majority of hTERT transcripts in lung and colon tissues contained two or more ASPSs, ASPSs located downstream (and in particular 3' to exon 8) are rarely continuous with the ORF (Fig. 2). Instead, the pool of hTERT protein in human lung and colon tissues seems to be dominated by truncated isoforms corresponding to the ASPSs Ins-i2 [1-349], Ins-i2 [?-5273] and Del-e7e8.
The correlation analysis ( Table 2) defined two groups of ASPSs with a tendency to occur together in the same mRNAs, but no pair of ASPSs showed a strict positive correlation. With regard to negative correlation the results were less conclusive, since some ASPSs occur at a low frequency. However, for the more abundant ASPSs being present in the same tissue samples (Ins-i2 [?-5273], Ins-i4 [1-38], Del-e6 [1-36], Del-e7e8), any single ASPS were found both combined with and separate from other ASPSs in individual cDNA clones, which implies that the alternative splicing occurring at these sites are more or less independent events. If no ASPSs are strictly correlated, the hTERT gene may give rise to as many as 384 different hTERT mRNA isoforms. Many of these are similar in size to normal full-length hTERT. Considering the number and location of in-frame insertion/deletion ASPSs (Fig.  1B) and the combination of these with the other ASPSs, the 384 mRNA isoforms may give rise to 64 different hTERT protein isoforms. As explained above, the abundance of the different protein isoforms will vary greatly according to ASPS frequency and position on the 5' to 3' axis.
RT-PCR analysis of a sub-region in hTERT containing the Del-e6 [1-36] and Del-e7e8 deletion splice sites is often used to estimate telomerase expression, since mRNA lacking this alternative splicing is considered to be normal full-length. Our data showing that a large percentage of hTERT mRNA may be alternatively spliced in intron 2 implies that this analysis is insufficient for estimation of hTERT isoforms. The increased complexity in telomerase expression necessitates a more detailed analysis of hTERT isoforms, which can measure both upstream and downstream splicing at the single-transcript level. Analysing the different sub-regions in separate PCRs (Table 3) fails to detect whether the different ASPSs are present in the same transcripts or not (correlation), which is required for estimation of normal full-length hTERT. Direct analysis of a PCR product covering all ASPSs is also problematic, since many isoforms (different combinations of ASPSs) are similar in size to normal full-length. It would require a method with perfect one-nucleotide resolution for fragments above 2 kbp and an acceptable quantitative readout. One solution to these problems is to map long PCR products (or cDNA clones) covering all ASPSs on individual basis as shown in Figure 2, which gives a direct measurement of all different isoforms being present.

Conclusion
Previous work addressing identification of hTERT ASPSs [9,14,15] have focused on cell lines, which normally contain high telomerase activity and a high percentage of normal full-length hTERT mRNA. In the present study, by analysing a modest number of RT-PCR-generated cDNA clones from two different human tissues, six novel ASPSs were identified of which two represented dominating hTERT isoforms. A better understanding of hTERT mRNA processing and its implications for telomerase regulation will require more studies on human primary tissues, including those that are low or negative for telomerase activity. It is also necessary to use methods that do not discriminate between mRNA isoforms based on size, as opposed to PCR, since ASPSs may involve intronic insertions of considerable length. Detailed knowledge of the different hTERT isoforms present in human tissues and the ability to distinguish between them is required to obtain a proper understanding of telomerase function and regulation, and may identify hTERT sequences suitable for targeted cancer immuno therapy.

Tissue samples and cDNA synthesis
Lung and colon tumour/adjacent tissue matched pair cDNA samples A-E were obtained from Clontech (Catalog #: K1433-1, K1434-1). Additional tissue samples were removed by surgery (The Norwegian Radium Hospital, Oslo, Norway) and stored in RNAlater (Sigma-Aldrich) for preservation of RNA. The samples were homogenized in Trizol Reagent (Invitrogen, Basel, Switzerland) using the Ultra-Turrax T8 homogenizer (IKA-Werke, Staufen, Germany) and total RNA was isolated and purified according to the Trizol Reagent manual. Poly(A) + mRNA was isolated and double-selected from total RNA using the GenoPrep Direct mRNA kit (GenoVision, Oslo, Norway) and cDNA synthesis was performed by using the StrataScript Reverse Transcriptase (Stratagene, La Jolla, CA) with oligo(dT) priming as described by the manufacturer.

Generation of hTERT RT-PCR cDNA clones
PCR fragments encompassing intron positions 2-15 in the hTERT gene was produced in two consecutive PCR reactions using two primers (p1255 and m3652; Table 1) that hybridize within exon 2 and 16, respectively. The first PCR was performed using the HotStarTaq kit (Qiagen) with 10% vol/vol cDNA-synthesis reaction as template, 2% DMSO, 30 cycles of PCR, annealing at 58°C and extension for 6 minutes. The second PCR was run using the Pfu Turbo kit (Stratagene) with 10% vol/vol of the first PCR as template and other modifications as described for the first PCR. The product was then separated in an agarose gel and a segment containing PCR products in the range 1 to 10 kb was cut out and purified using the Wizard PCR Preps DNA Purification System (Promega). The PCR products were digested with EcoR I and Xba I and ligated between the same sites in the pBluescript SK+ vector (Stratagene). Following transfection of E. coli DH10B and growth on LA-amp, colonies were picked at random for production of plasmid DNA (Wizard Plus SV Minipreps, Promega).

Mapping of hTERT RT-PCR cDNA clones
Individual hTERT RT-PCR plasmid clones were screened for the presence of ASPSs by running PCR with a panel of 6-fam-labelled primer pairs (Table 1) covering different regions of the hTERT cDNA. The products were analysed with the MegaBACE 1000 capillary electrophoresis unit (Amersham Biosciences). Clones that contained insertions and/or deletions compared to normal full-length hTERT cDNA were sequenced through the relevant region using the MegaBACE sequencing system (Amersham).

PCR analysis of cDNA samples
cDNA samples were screened for the presence of hTERT ASPSs by performing two consecutive PCR reactions with a panel of 6-fam-labelled primer pairs (Table 1) covering different regions of the hTERT gene. The first PCR was run using the HotStarTaq kit with 10% vol/vol cDNA-synthesis reaction as template, 35 cycles of PCR, annealing at 58°C and extension for 2 minutes. The second PCR was run using the Pfu Turbo kit (Stratagene) with 10% vol/vol of the first PCR as template, 20 cycles of PCR and other parameters as described for the first PCR. The products were analysed with the MegaBACE 1000 capillary electrophoresis unit.