Relative quantification of mRNA: comparison of methods currently used for real-time PCR data analysis
- Štefan Čikoš^{1}Email author,
- Alexandra Bukovská^{1}Email author and
- Juraj Koppel^{1}Email author
https://doi.org/10.1186/1471-2199-8-113
© Čikoš et al; licensee BioMed Central Ltd. 2007
Received: 12 September 2007
Accepted: 20 December 2007
Published: 20 December 2007
Abstract
Background
Fluorescent data obtained from real-time PCR must be processed by some method of data analysis to obtain the relative quantity of target mRNA. The method chosen for data analysis can strongly influence results of the quantification.
Results
To compare the performance of six techniques which are currently used for analysing fluorescent data in real-time PCR relative quantification, we quantified four cytokine transcripts (IL-1β, IL-6 TNF-α, and GM-CSF) in an in vivo model of colonic inflammation. Accuracy of the methods was tested by quantification on samples with known relative amounts of target mRNAs. Reproducibility of the methods was estimated by the determination of the intra-assay and inter-assay variability. Cytokine expression normalized to the expression of three reference genes (ACTB, HPRT, SDHA) was then determined using the six methods for data analysis. The best results were obtained with the relative standard curve method, comparative Ct method and with DART-PCR, LinRegPCR and Liu & Saint exponential methods when average amplification efficiency was used. The use of individual amplification efficiencies in DART-PCR, LinRegPCR and Liu & Saint exponential methods significantly impaired the results. The sigmoid curve-fitting (SCF) method produced medium performance; the results indicate that the use of appropriate type of fluorescence data and in some instances manual selection of the number of amplification cycles included in the analysis is necessary when the SCF method is applied. We also compared amplification efficiencies (E) and found that although the E values determined by different methods of analysis were not identical, all the methods were capable to identify two genes whose E values significantly differed from other genes.
Conclusion
Our results show that all the tested methods can provide quantitative values reflecting the amounts of measured mRNA in samples, but they differ in their accuracy and reproducibility. Selection of the appropriate method can also depend on the design of a particular experiment. The advantages and disadvantages of the methods in different applications are discussed.
Background
Reverse transcription (RT) followed by polymerase chain reaction (PCR) is at present the most sensitive method for the detection of specific RNA molecules. Quantification of nucleic acids using the PCR has been significantly simplified by the development of the real-time PCR technique, where the fluorescent signal reflecting the PCR product accumulation is detected in every amplification cycle. In biological applications examining gene expression, it is mostly not necessary to know the absolute amount of the measured mRNA (number of molecules in a sample). Relative mRNA quantification is an approach determining the amount of target mRNA in samples relative each to other. To compensate for differences in the RT-PCR input quality and quantity, the target mRNA amount in each sample is normalized to one or more internal controls. Selection of an optimal normalization strategy has been widely discussed [1, 2] and is out of the scope of the present study.
Fluorescent data obtained from real-time PCR must be processed by some method of data analysis to obtain the relative quantity of target mRNA. There are several techniques used for real-time PCR data analysis and adequate attention should be paid to the selection of the appropriate method. Skern et al.[3] demonstrated that quantification results can vary dramatically depending on the method chosen for data analysis, and different analytical approaches may even lead to opposing biological conclusions. Methods for analysis of fluorescent real-time PCR data used in relative mRNA quantification can be classified in various ways, depending on the criteria applied. All methods determine the RT-PCR template quantity (designated "R_{0}" throughout the present study) from the accumulation of the PCR product during the amplification process. Most techniques utilize exclusively the exponential phase of PCR to determine the amplification efficiency (designated "E" throughout the present study) and the R_{0} value [4, 5]. The methods can be based on the determination of a "crossing point" between the PCR product fluorescence and a chosen benchmark. The benchmark is a point in the amplification curve (a graph of PCR product fluorescence versus amplification cycle number) that represents the same amounts of PCR product in every amplification. The number of amplification cycles needed to reach the benchmark is usually denoted as CP [6]. The most commonly used form of the benchmark is the threshold fluorescence (which can be set manually by the user or automatically by the software of a real-time PCR instrument), and the number of amplification cycles needed for reaching the threshold fluorescence is usually denoted as "Ct". The basic principle of the "threshold-based" methods is the same – the lower the RT-PCR template amount, the more amplification cycles are needed to reach the threshold fluorescence. Apart from the "threshold-based" methodologies that currently predominate, there are methods which use linear regression analysis of the fluorescent data from the exponential phase of PCR to determine the E and/or R_{0} values [7, 8]. Moreover, a method that utilizes fluorescent data from the whole course of the amplification curve has been developed [9–11].
To compare the performance of six techniques which are currently used for analyzing fluorescent data in real-time PCR relative quantification, we determined the mRNA levels of four pro-inflammatory cytokines (IL-1β, IL-6 TNF-α, and GM-CSF) in mice with trinitrobenzene sulphonic acid (TNBS) – induced colitis. TNBS-induced colitis is a widely used experimental model for studying gut inflammatory processes such as ulcerative colitis and Crohn's disease [12].
Results
Relative standard curves, determination of amplification efficiency
Amplification of the added luciferase mRNA showed very similar Ct values in all six dilutions of the standard RNA (arithmetical mean: 19.4, coefficient of variation: 1.28) indicating a similar efficiency of reverse transcription in all dilutions. Moreover, high correlation coefficients of the standard curves (Fig. 1) indicate that both, the efficiencies of reverse transcription and the PCR amplification efficiencies are similar in all dilutions.
Amplification efficiency determined by various methods of real-time PCR data analysis. Arithmetical means ± SD and coefficients of variation (in parentheses) are shown.
Gene/Method | Standard curve | DART-PCR | LinRegPCR | Liu&Saint-exp |
---|---|---|---|---|
IL-1β | 0.998 | 1.033 ± 0.061 (5.95) | 0.950 ± 0.074 (7.76) | 0.974 ± 0.033 (3.42) |
IL-6 | 0.900 | 0.912 ± 0.046 (5.05) | 0.881 ± 0.075 (8.51) | 0.870 ± 0.029 (3.35) |
TNF-α | 1.012 | 1.033 ± 0.024 (2.32) | 0.974 ± 0.072 (7.45) | 0.991 ± 0.032 (3.32) |
GM-CSF | 0.918 | 0.978 ± 0.044 (4.49) | 0.932 ± 0.063 (6.75) | 0.898 ± 0.013 (1.44) |
ACTB | 0.981 | 1.086 ± 0.046 (4.21) | 0.999 ± 0.031 (3.10) | 0.978 ± 0.044 (4.53) |
SDHA | 0.967 | 1.072 ± 0.052 (4.88) | 0.980 ± 0.064 (6.55) | 1.017 ± 0.044 (4.27) |
HPRT | 1.022 | 1.069 ± 0.064 (5.95) | 0.978 ± 0.109 (11.1) | 0.976 ± 0.048 (4.49) |
Determination of target mRNA quantity in known sample dilutions
Pearson's correlation coefficients obtained from the linear regression plotting R_{0} values against diluting factors of the total RNA (RT-PCR template). The R_{0} values were obtained by transformation of fluorescence data using the following methods for real-time PCR data analysis: St cur, relative standard curve; Comp, comparative Ct; SCF, sigmoid curve-fitting; DART ind E, DART-PCR with individual E values; DART av E, DART-PCR with average E values; Liu&S ind E, Liu & Saint-exp with individual E values, Liu&S av E, Liu & Saint-exp with average E values; LinReg ind E, LinRegPCR (using individual E values); LinReg-Ct av E, LinRegPCR combined with Ct (using average E values).
Gene/Method | St cur | Comp | SCF | DART ind E | DART av E | Liu&S ind E | Liu&S av E | LinReg ind E | LinReg-Ct av E |
---|---|---|---|---|---|---|---|---|---|
IL-1β | 0.9993 | 0.9996 | 0.9960 | 0.9924 | 0.9997 | 0.9697 | 0.9994 | 0.9113 | 0.9994 |
IL-6 | 0.9998 | 0.9998 | 0.9951 | 0.9745 | 0.9993 | 0.9510 | 0.9996 | 0.9391 | 0.9997 |
TNF-α | 0.9996 | 0.9998 | 0.9987 | 0.9910 | 0.9989 | 0.9622 | 0.9996 | 0.9835 | 0.9996 |
GM-CSF | 0.9980 | 0.9980 | 0.9803 | 0.9620 | 0.9985 | 0.9277 | 0.9975 | 0.9426 | 0.9980 |
ACTB | 0.9991 | 0.9992 | 0.9973 | 0.9435 | 0.9972 | 0.9915 | 0.9998 | 0.9828 | 0.9984 |
SDHA | 0.9998 | 0.9999 | 0.9999 | 0.9799 | 0.9999 | 0.9872 | 0.9999 | 0.9902 | 0.9999 |
HPRT | 0.9996 | 0.9995 | 0.9997 | 0.9699 | 0.9990 | 0.9662 | 0.9994 | 0.9545 | 0.9997 |
Average | 0.9991 | 0.9994 | 0.9953 | 0.9733 | 0.9990 | 0.9651 | 0.9993 | 0.9577 | 0.9992 |
Parameters of the geNorm software for ACTB, SDHA, and HPRT. The parameters were determined from quantities (R_{0} values) of the three genes in the RT-PCR template dilutions. M – measure of the gene expression stability, V2/V3 – pairwise variations determining the optimal number of reference genes. Indicated methods for real-time PCR data analysis were used for the transformation of fluorescence data to the R_{0} values. Designation of the methods is the same as in Table 2.
Parameter/Method | St cur | Comp | SCF | DART ind E | DART av E | Liu&S ind E | Liu&S av E | LinReg ind E | LinReg-Ct av E |
---|---|---|---|---|---|---|---|---|---|
M for ACTB | 0.159 | 0.161 | 0.315 | 0.598 | 0.200 | 0.544 | 0.213 | 1.772 | 0.276 |
M for SDHA | 0.187 | 0.191 | 0.293 | 0.578 | 0.183 | 0.538 | 0.326 | 1.549 | 0.246 |
M for HPRT | 0.181 | 0.165 | 0.324 | 0.631 | 0.226 | 0.626 | 0.228 | 1.349 | 0.338 |
V2/V3 | 0.057 | 0.060 | 0.097 | 0.190 | 0.071 | 0.195 | 0.107 | 0.564 | 0.109 |
Intra-assay and inter-assay variability
Intra-assay variability. Coefficients of variation calculated from R_{0} values of 15 replicate PCR reactions. Indicated methods for real-time PCR data analysis were used for the transformation of fluorescence data to the R_{0} values. Designation of the methods is the same as in Table 2.
Gene/Method | St cur | Comp | SCF | DART ind E | DART av E | Liu&S ind E | Liu&S av E | LinReg ind E | LinReg-Ct av E |
---|---|---|---|---|---|---|---|---|---|
IL-1β | 5.95 | 5.95 | 17.1 | 60.6 | 7.63 | 25.03 | 7.33 | 49.08 | 5.71 |
IL-6 | 9.94 | 9.94 | 22.3 | 83.6 | 10.2 | 43.7 | 9.28 | 53.5 | 9.73 |
Inter-assay variability. Coefficients of variation calculated from R_{0} values of 8 PCR reactions. Indicated methods for real-time PCR data analysis were used for the transformation of fluorescence data to the R_{0} values. Designation of the methods is the same as in Table 2.
Gene/Method | St cur | Comp | SCF | DART ind E | DART av E | Liu&S ind E | Liu&S av E | LinReg ind E | LinReg-Ct av E |
---|---|---|---|---|---|---|---|---|---|
IL-1β | 12.4 | 12.4 | 15.3 | 79.7 | 18.3 | 30.3 | 17.7 | 68.8 | 11.2 |
IL-6 | 18.6 | 17.1 | 20.5 | 65.5 | 17.7 | 40.2 | 17.5 | 53.8 | 16.2 |
In the standard curve method, comparative Ct method, and the three methods using average E values (DART-PCR, Liu & Saint-exp, LinRegPCR-Ct – with average E), the intra-assay variability was higher in IL-6 than in IL-1β; a similar trend was found for the inter-assay variability (except of two methods – DART-PCR and Liu & Saint-exp, which showed similar values for both genes). The three methods (DART-PCR, Liu & Saint-exp, LinRegPCR-Ct) utilizing individual E values (and showing high CV values) gave inconsistent results and in some cases, the determined intra-assay variability was even higher than the inter-assay variability. The CV values obtained in the intra- and inter-assay experiment with the SCF method were comparable, and they were higher in IL-6 than in IL-1β which is in accordance with the five well-working methods (Table 4, Table 5).
Normalized cytokine expression in the experimental samples
In TNF-α and GM-CSF, significant differences were found only between the group of untreated colitic animals (Un) and the group of sham control animals (Sh). In TNF-α, all the tested methods of real-time PCR data analysis showed significantly higher amount of the cytokine in the group Un than in the group Sh. In GM-CSF, the difference between the group Un and Sh was detected as significant only with the standard curve method, comparative Ct method, and the three methods utilizing average E values (DART-PCR, Liu & Saint-exp, LinRegPCR-Ct – with average E); the methods which used individual E values (DART-PCR, Liu & Saint-exp, LinRegPCR-Ct – with individual E) did not detect the difference as statistically significant (data not shown).
Discussion
To compare the methods currently used for analyzing fluorescence data in real-time PCR relative quantification, we determined mRNA levels of four pro-inflammatory cytokines in the in vivo model of colonic inflammation, using six different techniques. Three housekeeping genes were also quantified to serve as reference genes for the normalization of cytokine mRNA quantity. The compared methods differ principally in the mathematical function used for modeling the PCR process, in the necessity to create a dilution series of the RT-PCR template, in the necessity and means of amplification efficiency (E) determination, in the way of calculation of the target mRNA quantity (R_{0}).
Most of the methods utilize fluorescence data only from the exponential phase of PCR amplification (the exponential model of PCR) and require setting a threshold fluorescence (common for all compared samples) to determine the Ct value ("threshold-based" methodologies). The relative standard curve method determines target mRNA relative quantities in samples from known relative quantities of standard RNA or cDNA. To obtain correct results, amplification efficiency in the dilutions of the standard preparation and in samples must be similar. The E value determined from the slope of the standard curve (or from a dilution series of a representative sample) can be utilized for the calculation of the target mRNA quantity (R_{0}) in techniques of relative quantification, which can be denoted as "comparative Ct methods". Equation 5 can be used for the transformation of Ct values to the R_{0} values; the only parameter which differs among the compared samples is Ct. The term "comparative Ct method" is most frequently used for the " 2^{-ΔΔCt} " method introduced by Livak [14]; in this method E is assumed to be equal to 1 and the formula for the R_{0} calculation can be modified to R_{0} = 2^{-Ct}. Pfaffl's model of the comparative Ct method [15] incorporates a correction for amplification efficiency differing from the optimal value 1. Most published versions of the comparative Ct method use normalization of the target gene quantity to a reference gene quantity, very often referring to one of the samples (a "calibrator" sample), and the formula for calculation of the target gene relative quantity is more complicated [14–19]. In any case, formulas used in comparative Ct methods for calculation of the normalized relative amount of target gene quantity can be derived from equation 5. Pfaffl et al.[20] developed the software named REST (Relative Expression Software Tool) which can perform the comparative Ct quantification in two experimental groups (with or without the E value correction) followed by a statistical test. In the present study, we used E values determined for each gene from the relative standard curves and calculated the R_{0} value for each gene in each sample using the equation 5. The cytokine genes quantity was then normalized with the factor obtained from the quantities of three reference genes.
In the method proposed by Liu and Saint [21], the amplification efficiency for each sample is determined from the amount of fluorescence and the number of cycles at two arbitrary fluorescence thresholds along the exponential phase of the PCR amplification (Eq 11). The fluorescence thresholds can be set individually for each amplification, or a threshold level common for all reactions can be set. In the first case equation 4 is used for the R_{0} calculation, and in the second case equation 4 or equation 5 can be used. We compared the quantification results obtained with this method using either individual E values (determined for each amplification) or using an average E value.
Ramakers et al.[7] developed a computer program entitled LinRegPCR, which determines the target mRNA quantity (R_{0}) and amplification efficiency (E) by linear regression analysis (Eq 9, Eq 10) of fluorescence data obtained from real-time PCR. Like the above-mentioned methods, linear regression exploits only the exponential phase of PCR amplification, but the method is not "threshold-based" – no benchmark (threshold fluorescence) is needed for the calculations. We analyzed our fluorescence data with the LinRegPCR software and with a technique which combines linear regression analysis and the threshold-based methodology. The combined technique (we have designated it "LinRegPCR-Ct") utilizes E values determined by the LinRegPCR software (for calculation of an average E) and Ct values determined by the Mx3000P real-time PCR instrument. A similar strategy was applied by Karlen et al.[22] and Schefe et al.[23]. The DART-PCR program developed by Peirson et al.[8] provides an automated analysis of real-time PCR fluorescence data utilizing the combined approach (linear regression for E determination and threshold fluorescence for Ct determination). Similarly as for LinRegPCR, we compared the quantification results obtained with DART-PCR using individual or average E values for the R_{0} calculation.
The method in which a sigmoid mathematical model that fits the kinetics of the whole real-time PCR process is applied [9, 10], represents an approach completely different from the above-mentioned techniques. The sigmoid curve-fitting (SCF) method utilizes all fluorescence data recorded during the amplification process (not only the data from the exponential phase) for determination of the R_{0} value. Moreover, the method can carry out quantification without the knowledge of amplification efficiency and without determination of Ct. Rutledge [11] found that amplification cycles within the plateau phase of PCR deviate from that predicted by sigmoid curve-fitting, and their exclusion from the curve-fitting process is necessary. He proposed the selection of a cut-off cycle beyond which further cycles are excluded from the fitting of the amplification curve. The criterion used for the selection of the cut-off cycle was based on repetitive curve-fitting in which the last cycle was sequentially excluded and the R_{0} value was calculated at each individual curve-fitting. Plotting the calculated R_{0} values against the cut-off cycle revealed a highly regular pattern in which the calculated R_{0} value decreased with subsequent cycle removal, and after reaching a minimum a small increase in the R_{0} value appeared; the minimum-calculated R_{0} value was selected as the resulting R_{0}. In our work, we found that the shape of the graph of R_{0} dependency on the cut-off cycle can be influenced by fluorescence data provided by the real-time PCR system. Using background-subtracted fluorescence data from the Mx 3000P system (Stratagene), we were not able to identify the regular trend of the curve R_{0} vs cut-off cycle as described by Rutledge [11]; background-subtracted data from Opticon2 DNA Engine (MJ research Inc.) were used in Rutledge's study. On the other hand, raw fluorescence data from Mx 3000P provided a regular pattern of the R_{0} value as a function of the cut-off cycle (R_{0} decrease followed by a small increase after reaching a minimum) in most curves. In some instances the cut-off cycle with the minimal R_{0} value was determined in the region of the amplification curve which contained an insufficient amount of fluorescent data, and manual selection was necessary. Karlen et al.[22] testing the performance of several methods for real-time PCR data analysis found the sigmoid curve-fitting method (together with the method fitting PCR amplification to the exponential function) as the least suitable for quantitative PCR analysis. On the contrary, our results indicate that the SCF method can provide reasonable results. Similarly, Qiu et al.[24] obtained comparable results using the SCF method and a classic threshold-based method. The differences in the SCF method performance were probably caused by different number of amplification cycles included into the fitting process. An appropriate selection of the optimal cycle number (exclusion of late cycles) is probably the key factor for obtaining satisfactory performance of the SCF method, but the choice of a suitable criterion for determination of the "cut-off cycle" can be difficult (as discussed above).
In our study, we determined amplification efficiencies (E) of seven genes using four methods of real-time PCR data analysis, and found some differences in the determined E values. On the other hand, all the methods were capable to identify the two genes whose E values significantly differed from the others. Interestingly, the amplification efficiencies determined by the two methods which employ linear regression for the calculation (DART-PCR and LinRegPCR) were less close each to other than to efficiencies determined by the relative standard curve or Liu & Saint-exp method. This can be caused by differences in the way the two methods determine the exponential phase of amplification. Lower amplification efficiency found by all tested methods in two genes with high Ct values (IL-6 and GM-CSF) could suggest some influence of Ct value on the determination of the E value. However, the comparison of E and Ct values in the two „high-Ct genes“ indicates that a higher Ct value is not necessarily leading to obtaining of a lower E value. This finding is in accordance with results of Karlen et al.[22] who did not find a dependency of E value on Ct value; the authors defined the amplicon and primer sequences as the main factor influencing the efficiency of amplification.
To compare the accuracy of relative quantification conducted with application of different methods for real-time PCR data analysis, we determined quantities of seven target mRNAs in serially diluted preparation of total RNA. We found that R_{0} values determined with the relative standard curve method, comparative Ct method and with the three methods using an average E value for the calculations (DART-PCR, Liu & Saint-exp, LinRegPCR-Ct – with average E) most accurately reflected the RT-PCR template dilutions. Less effective was the SCF method, and the worst results were obtained with the three methods which used individual E values (DART-PCR, Liu & Saint-exp, LinRegPCR – with individual E). Normalization of cytokines mRNA quantity (to a single reference gene or to a normalization factor calculated from reference genes with comparable expression stability) did not influence the differences in the performance of the methods for real-time PCR analysis tested in our study. On the other hand, our results indicate that targets with high Ct values can be quantified with a lower accuracy than targets with medium and low Ct values. Reproducibility of the tested methods was estimated by determination of the intra- and inter-assay variability and showed the same result as the accuracy test. The highest reproducibility was found in the relative standard curve method, comparative Ct method and in the three methods using an average E value for calculations. The SCF method was less precise, and the worst results were obtained with the three methods which used individual E values. Our results also showed a negative effect of higher Ct values on the reproducibility of the tested methods.
Comparing results of normalized expression of IL-1β and IL-6 in the experimental samples, all the tested methods were able to detect significant changes between the control animals, untreated colitic animals, and animals undergoing treatment B. The effects of treatments A or C on the IL-1β mRNA level were detected as statistically significant only by some of the tested methods. In TNF-α and GM-CSF, all the tested methods showed higher amount of the cytokines in untreated colitic animals than in control animals, hovewer the three methods which utilized individual E values (DART-PCR, Liu & Saint-exp, LinRegPCR-Ct – with individual E) were not able to detect the difference in GM-CSF expression as statistically significant.
Application of corrections for individual sample efficiency should theoretically improve the accuracy and reproducibility of the quantification. But our results showed that, on the contrary, the use of individual E values for the R_{0} calculation impaired the quantification. Similar findings were presented in studies where linear regression was utilized for the calculation of individual amplification efficiencies [8, 22, 23]. We used two approaches for determination of the individual amplification efficiencies – linear regression (in DART-PCR method and LinReg PCR method which differ in the way of exponential phase determination) and the method which utilize setting of two fluorescence thresholds along the exponential phase (Eq 11). Independently on the way used for the calculation of E values, all the three methods showed the negative effect of the individual E values on the quantification which indicates that this is probably a feature connected with the limited precision of individual data and not with the mathematical approach used for E value determination. The individual E values (determined with linear regression or with the Eq 11) are derived from individual sample kinetics represented by the fluorescence values obtained in particular amplification cycles. Above mentioned results suggest that fluorescence values detected by real-time systems do not reflect the reaction kinetics with the precision which would be sufficient for reliable E value determination from the exponential phase of the individual amplification. The average E value obtained from the group of amplifications eliminates individual imprecisions enabling to find a reliable value of the amplification efficiency.
Conclusion
Choosing the appropriate method for real-time PCR data analysis can depend on conditions in a particular application. The relative standard curve method is widely used and can provide reliable results, especially in the case when the same quality of RT-PCR template is ensured in standards and samples. Sometimes this is not possible, for instance if the quantification is performed on different tissue types, or if the amount of tissue is limited (e.g. tissue biopsies, preimplantation embryos). Similar restrictions apply also for comparative Ct methods. These methods require serial dilutions of a representative sample to determine the E value which is usually done in a validation experiment preceding a series of measurements. This approach enables to measure more samples in a PCR run (no standard curve in the run), but identical conditions for all measurements must be ensured. The other three methods tested in the present study (Liu & Saint-exp, DART-PCR, LinRegPCR) do not require serial dilutions of the RT-PCR template for E value determination. We found that these methods provide more reliable results when an average E value and not individual E values (determined for each amplification) are used for the R_{0}calculation. The Liu & Saint-exp method is simple – the formula for E value calculation can be easily implemented into spreadsheet programs such as Microsoft Excel and the selection of the exponential phase of amplification is not difficult (the human eye is good enough for distinguishing a straight line). Our results with the Liu & Saint-exp quantification showed that although the criterion for selection of the exponential phase is subjective, reasonable results can be obtained if the same criterion is applied for all compared samples. The LinRegPCR and DART-PCR methods use a more complicated calculation (based on linear regression) for the E value determination. The DART-PCR software combining the linear regression analysis with threshold-based methodology enables R_{0} values to be calculated using an average E value; the LinRegPCR software do not enable the automated use of an average E value, but manual combination with a threshold-based technique is possible. The SCF method differs from all the other methods in the mathematical model used for the calculations – it is not necessary to look for the exponential phase of amplification and to determine the E value. For reliable quantification with this method, fluorescence data including at least the beginning of the plateau phase are needed, which can be a disadvantage when genes with low expression are quantified or when low sample amounts are available. In summary, our results show that all the tested methods for real-time PCR data analysis can provide quantitative values reflecting the amounts of measured mRNA in samples, but they differ in their accuracy and reproducibility. Although selection of the appropriate method can be limited by the design of a particular experiment (e.g. tissue type, gene abundance, number of experimental groups) the use of more than one analytical method is recommended for validation of results.
Methods
Animal experiment, sample preparation, and real-time PCR
All the fluorescent data used in this study were obtained from a previous study examining the effects of plant essential oils thyme and oregano on trinitrobenzene sulphonic acid (TNBS)-induced colitis in mice [25]. Briefly, colitis was induced in male Balb/c mice by administration of TNBS and the animals were treated with three increasing concentrations of the plant oils (treatments A, B and C; the group of untreated colitic animals was designated as Un and sham control group as Sh in the present study). Total RNA was isolated from the strips of colonic tissue with TRIzol Reagent (Invitrogen Life Technologies, Karlsruhe, Germany). The RNA preparations were then cleaned and DNase I treated with an RNeasy Micro kit (Qiagen, Hilden, Germany). Complementary DNA (cDNA) was then synthesized from the RNA (0.75 μg from each sample) using Superscript™ II Rnase H-Reverse Transcriptase (Invitrogen Life Technologies; for more details see [25])
Serial dilutions of total RNA (1× – i.e. no dilution, 2×, 4×, 8×, 80×, 800×) were prepared from the pool of colon RNA obtained by combining aliquots of samples from the colitic animals. Complementary DNA was then synthesized from each dilution as described above. To compensate for different RNA amounts in the reverse transcription reactions yeast total RNA was added in appropriate amounts to the colon RNA dilutions (all oligonucleotide primers were checked so as not to create any PCR product on the yeast cDNA template). The cDNAs then served as a PCR template for construction of relative standard curves. Amplifications performed on the standard cDNAs were also utilized for comparison of the accuracy of quantification by the tested methods for real-time PCR data analysis (determination of known sample dilutions); in the relative standard curve method two sets of total RNA dilutions (and cDNA preparations) were used – one set was utilized for the construction of standard curves and the other set served as known sample dilutions.
To test the efficiency of reverse transcription in the serial dilutions of standard RNA, identical amount (10 pg) of luciferase mRNA (Promega, Madison, WI) was added into the six RNA dilutios (1×, 2×, 4×, 8×, 80×, 800×) and then cDNA was synthesized from each dilution (as described above).
PCR reactions were carried out in duplicates using SYBRGreen I as a fluorescent detection dye and they were performed in the real-time PCR system Mx 3000P (Stratagene, La Jolla, CA). Background-subtracted fluorescences were used for data analysis by all tested methods except for the sigmoid curve-fitting method, where raw fluorescences were used (see below). Specific oligonucleotide primers for amplification of mouse interleukin 1 beta (IL-1β), interleukin 6 (IL-6), tumor necrosis factor alpha (TNF-α), granulocyte macrophage-macrophage colony stimulating factor (GM-CSF), beta actin (ACTB), hypoxanthine guanine phosphoribosyl transferase 1 (HPRT) and succinate dehydrogenase complex subunit A (SDHA) were used (for sequences of the primers and PCR reaction conditions see [25]).
Amplification of luciferase mRNA: 1 μl of each cDNA (six cDNAs was prepared, see above) was amplified in 25 μl PCR containing 1× SYBR Green/ROX PCR Master Mix (PA-012, SuperArray Bioscience Corp., Frederick, MD), and 0.4 μM primers; 40 amplification cycles at 95°C for 20 s, 60°C for 60 s, and 82°C for 20 s (fluorescence acquiring) was used. Primers specific for the luciferase sequence (5'GCTTACTGGGACGAAGACGAAC3', 5'CTTGACTGGCGACGTAATCCAC3') amplified a 247 bp PCR product.
Normalization of cytokine mRNAs quantity
To ensure correctness of the quantification we normalized cytokine (IL-1β, IL-6 TNF-α, and GM-CSF) expression to the three reference genes (ACTB, SDHA, and HPRT) whose expression was found to be stable in our previous work [25]. After determining quantities of mRNAs (R_{0} values) of the three reference genes in each sample (using tested methods of real-time PCR data analysis) the sample normalization factor was calculated as a geometric mean of the three R_{0} values. GeNorm software was utilized for the calculation [13]. The amount of cytokine mRNA in each sample (cytokine R_{0} value) was then divided by the normalization factor of the sample.
Data analysis
The mathematical equations used in most methods for analyzing data obtained from real-time PCR are derived from the basic formula describing the PCR amplification in the exponential phase of the reaction:
X_{ n }= X_{0} × (E + 1)^{ n }
where X_{ n }is the amount of PCR product at cycle n, X_{0} is the starting amount of PCR template (which we are interested in) and E is the amplification efficiency which can have a value between 0 (no amplification) and 1 (doubling of the PCR product in each amplification cycle). In fluorescent real-time PCR it is assumed that accumulation of reporter dye fluorescence (R, fluorescence readings after background subtraction) is proportional to the accumulation of PCR amplification product, and equation can then be written as:
R_{ n }= R_{0} × (E + 1)^{ n }
and starting fluorescence R_{0} can be then calculated as:
R_{0} = R_{ n }/(E + 1)^{ n }
where R_{ n }is the intensity of reporter dye fluorescence (proportional to the amount of PCR product) at cycle n, and R_{0} is the theoretical starting fluorescence which is proportional to the amount of starting PCR template. Thus, the R_{0} value represents the target quantity expressed in arbitrary fluorescence units. There are several techniques based on this equation which are used for calculating the R_{0} value. In threshold-based techniques where "Ct" (the number of amplification cycles needed to reach the fluorescence threshold) is measured, "n" in Eq 3 can be replaced by "Ct":
R_{0} = R_{ Ct }/(E + 1)^{ Ct }
"R_{ Ct }" then represents the threshold fluorescence which can be set for each of the compared amplifications individually, or a threshold value (R_{ Ct }) common for all compared amplifications can be used. In the latter case, the numerical value of R_{ Ct }in Eq 4 can be ignored (replaced for example by 1.0) and the R_{0} value can be calculated:
R_{0} = 1/(E + 1)^{ Ct }or R_{0} = (E + 1)^{-Ct}
Other possibilities to obtain the R_{0} and E values are techniques which use relative standard curve, linear regression (utilizing equations derived from the basic formula – Eq 1) or fitting the PCR process to the sigmoid function (see below). The quantities determined from the relative standard curve as well as the quantities determined with the other methods were designated as „R_{0}“ throughout the manuscript.
Relative standard curve method
For each gene, standard cDNAs (see above) were amplified along with sample cDNAs in the same PCR run. Standard curves were generated by Mx 3000P 2.0 software (Stratagene). The threshold fluorescence common for all compared samples was set into the exponential phase of the amplifications by the Mx 3000P system. The target mRNA quantity in each sample (R_{0}) was determined from the relative standard curve (using sample Ct values) and expressed in arbitrary units corresponding to the dilution factors of the standard RNA preparation. Amplification efficiency (E) representative for each gene was determined using equation of the standard curve:
Ct = -1/log (E + 1) × log Ro + log R/log (E + 1)
E = 10^{-1/Slope}- 1; slope = -1/log [E + 1]
Comparative Ct method
Amplification efficiency (E) for each gene was determined from the relative standard curve (see above). The Ct value for each reaction was determined by the real-time PCR system Mx3000P setting the threshold fluorescence (common for all compared samples) into the exponential phase of the amplifications). The target mRNA quantity in each sample (R_{0}) was then calculated from equation 5.
LinRegPCR method
A computer program entitled LinRegPCR developed by Ramakers et al.[7] utilizes linear regression analysis of fluorescence data from the exponential phase of PCR amplification to determine the target mRNA quantity (R_{0}) as well as the amplification efficiency (E). Following equations were used:
log R = log (E + 1) × n + log R_{0}; intercept = log R_{0}, slope = log [E + 1]
R_{0} = 10^{ Intercept }
E = 10^{ Slope }- 1
LinRegPCR software utilizes an iterative algorithm (considering the number of data points, regression coefficient and slope of the regression line) for the selection of the exponential phase in each PCR amplification. We analyzed our fluorescence data using this software and obtained the R_{0} and E values for each reaction. Since ANOVA detected no significant differences in amplification efficiencies between the sample groups (Sh, Un, A, B, C) we also applied a combined analysis using an average E value (arithmetical mean of E values of all samples). In the combined analysis ("LinRegPCR-Ct method"), the R_{0} was calculated from equation 4 using the average E value and threshold fluorescence values (R) with corresponding Ct values (determined by the real-time PCR system Mx3000P).
DART-PCR method
DART-PCR (Data Analysis for Real-Time PCR) Excel workbook developed by Peirson et al.[8] determines the E value of each individual reaction using linear regression analysis of the fluorescence data from the exponential phase of each amplification (Eq 10). For the selection of the exponential phase a midpoint (M) for each PCR amplification is calculated, using maximal and minimal fluorescence levels. DART-PCR determines the Ct value for each reaction, offering the possibility of using individual threshold fluorescences or a threshold fluorescence common for all compared reactions (the second possibility was used in this study). Target mRNA quantity (R_{0}) is then determined using equation 4. One-way analysis of variance (included as a component of DART-PCR workbook) detected no significant differences in amplification efficiencies between the sample groups (Sh, Un, A, B, C), so we also used the average E value (arithmetical mean of E values of all samples) for the R_{0} calculation in each sample. Since the DART-PCR requires inputting of fluorescence data in triplicates and our PCR reactions were carried out in duplicates, we created the "third replicate" as the mean value of our duplicate fluorescence readings.
Liu and Saint method using exponential model of PCR ("Liu & Saint-exp method")
Two arbitrary fluorescence thresholds (lower R_{1} and higher R_{2}, common for all compared samples) were manually set in the exponential phase of amplification curves and corresponding Ct values (provided by the Mx 3000P) were recorded. For the thresholds selection the semilogarithmic graph of the amplification curves (log of fluorescences – log R, against cycle number – n) was used because the exponential phase of PCR amplification can be simply identified on the graph (fluorescent data acquired in the exponential phase of the reaction produce a straight line on the semilogarithmic graph). Amplification efficiency (E) of each reaction was calculated using equation:
E = (R_{2}/R_{1})^{1/(Ct 2-Ct 1)}- 1
The target mRNA quantity (R_{0}) was calculated using equation 4. One-way analysis of variance (ANOVA) detected no significant differences in amplification efficiencies between the sample groups (Sh, Un, A, B, C), so the average E value (arithmetical mean of E values of all samples) was also used for the R_{0} calculation in each sample (using equation 4).
Sigmoid curve-fitting (SCF) method
Raw fluorescences (i.e. fluorescences without background subtraction) from Mx 3000P were fitted to the four-parametric sigmoid function using the nonlinear regression function of SigmaPlot (Version 10, Systat Software, Richmond, CA, USA). Following equations were used:
R_{ na }= R_{ b }+ R_{ n }= R_{ b }+ R_{ max }/1 + e^{-((n-n 1/2)/k)}
R_{0} = R_{ max }/1 + e^{(n 1/2/k)}
where R_{ na }is the aggregate reaction fluorescence at cycle n, R_{ b }is the background reaction fluorescence, R_{ n }is the fluorescence generated by the PCR product at cycle n, R_{ max }is the maximal fluorescence generated by the PCR product, n_{1/2} is the cycle number at which fluorescence reaches half of the R_{ max }, k describes the slope of the sigmoid curve.
Repetitive regression analyses with sequential removal of the last amplification cycle (the cut-off cycle) were performed until the curve fitting failed, due to insufficient data. The R_{0} value was calculated at each curve-fitting using equation 13. For all amplification curves, the graph of dependence of the calculated R_{0} values on the cut-off-cycle was constructed and the minimum R_{0} value was selected as the resulting R_{0} value – the target mRNA relative quantity [11]. For the data treatment a macro for SigmaPlot provided by Qiu et al.[24] was utilized. In some instances the cut-off cycle with the minimal R_{0} value was located in the linear region of the amplification curve. The resulting R_{0} value was then selected manually by shifting the cut-off cycle into the region located between end of the linear amplification and entry of the reaction into the plateau phase (i.e. into "upper arc" of the amplification curve).
Intra-assay and inter-assay variability
In the intra-assay variability experiment, fifteen replicate PCR reactions amplifying IL-1β or IL-6 were ran. In the inter-assay variability experiment, eight PCR reactions amplifying IL-1β or IL-6 were ran in eight separate PCR (Mx 3000P) runs. The fluorescent data obtained from each reaction replicate were then transformed into mRNA quantity (the R_{0} value) by the tested methods for real-time PCR data analysis. Arithmetical means, standard deviations (SD) and coefficients of variation (CV) were then calculated from the determined R_{0} values (15 and 8 replicates for each gene, respectively).
Statistics
All statistics were performed using Statistica (StatSoft, Tulsa, OK). To enable better comparison of the results obtained from the different methods of real-time PCR data analysis the values of target mRNA quantities (R_{0}) were transformed percentually (setting the median of R_{0} values of all samples as 100%). Differences in amplification efficiencies between the sample groups were assessed using one-way analysis of variance (ANOVA). The Kruskal-Wallis test was used for the comparison of differences in normalized cytokine expression between groups, and the Mann-Whitney U test was used to compare differences between the group of untreated colitic animals (Un) and other groups of animals. Values of P < 0.05 were considered as significant.
Declarations
Acknowledgements
This work was supported by the Slovak Research and Development Agency under Contract No. APVT-51-015404.
Authors’ Affiliations
References
- Bustin SA: Quantification of mRNA using real-time reverse transcription PCR (RT-PCR): trends and problems. J Mol Endocrinol 2002, 29: 23-39. 10.1677/jme.0.0290023View ArticlePubMedGoogle Scholar
- Huggett J, Dheda K, Bustin SA, Zumla A: Real-time RT-PCR normalisation; strategies and considerations. Genes Immun 2005, 6: 279-284. 10.1038/sj.gene.6364190View ArticlePubMedGoogle Scholar
- Skern R, Frost P, Nilsen F: Relative transcript quantification by quantitative PCR: Roughly right or precisely wrong? BMC Mol Biol 2005, 6: 10. DOI 10.1186/1471-2199-6-10PubMed CentralView ArticlePubMedGoogle Scholar
- Pfaffl MW: Quantification strategies in real-time PCR. In A-Z of Quantitative PCR. Edited by: Bustin SA. International University Line (IUL), La Jolla; 2004:86-120.Google Scholar
- Wong ML, Medrano JF: Real-time PCR for mRNA quantitation. BioTechniques 2005, 39: 75-85.View ArticlePubMedGoogle Scholar
- Rasmussen R: Quantification on the LightCycler. In Rapid cycle real-time PCR Methods and applications. Edited by: Meuer S, Wittwer C, Nakagawara K. Springer-Verlag, Heidelberg; 2001:21-34.View ArticleGoogle Scholar
- Ramakers Ch, Ruijter JM, Deprez RHL, Moorman AFM: Assumption-free analysis of quantitative real-time polymerase chain reaction (PCR) data. Neuroscience Letters 2003, 339: 62-66. 10.1016/S0304-3940(02)01423-4View ArticlePubMedGoogle Scholar
- Peirson SN, Butler JN, Foster RG: Experimental validation of novel and conventional approaches to quantitative real-time PCR data analysis. Nucleic Acids Res 2003, 31: e73. DOI 101093/nar/gng073. 10.1093/nar/gng073PubMed CentralView ArticlePubMedGoogle Scholar
- Liu W, Saint DA: Validation of a quantitative method for real time PCR kinetics. Bioch Biophys Res Comm 2002, 294: 347-353. 10.1016/S0006-291X(02)00478-3View ArticleGoogle Scholar
- Tichopad A, Dzidic A, Pfaffl MW: Improving quantitative real-time RT-PCR reproducibility by boosting primer-linked amplification efficiency. Biotech Lett 2002, 24: 2053-2056. 10.1023/A:1021319421153View ArticleGoogle Scholar
- Rutlege RG: Sigmoidal curve-fitting redefines quantitative real-time PCR with the prospective of developing automated high-throughput applications. Nucleic Acids Res 2004, 32: e178. DOI 101093/nar/gnh177. 10.1093/nar/gnh177View ArticleGoogle Scholar
- Panes J: Inflammatory bowel disease: pathogenesis and targets for therapeutic interventions. Acta Physiol Scand 2001, 173: 159-165. 10.1046/j.1365-201X.2001.00905.xView ArticlePubMedGoogle Scholar
- Vandesompele J, De Peter K, Pattyn F, Poppe B, Van Roy N, De Paepe A, Speleman F: Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes. Genome Biol 2002, 3: RESEARCH0034. 10.1186/gb-2002-3-7-research0034PubMed CentralView ArticlePubMedGoogle Scholar
- Livak KJ: ABI Prism 7700 Sequence Detection System. User Bulletin no. 2. PE Applied Biosystems 1997, AB website, bulletin reference: 4303859B 777802-002 [http://docs.appliedbiosystems.com/pebiodocs/04303859.pdf]Google Scholar
- Pfaffl MW: A new mathematical model for relative quantification in real-time RT-PCR. Nucleic Acids Res 2001, 29: e45. 10.1093/nar/29.9.e45PubMed CentralView ArticlePubMedGoogle Scholar
- Livak KJ, Schmittgen TD: Analysis of relative gene expression data using real-time quantitative PCR and the 2^{ -ΔΔc }_{ T }method. Methods 2001, 25: 402-408. DOI 10.1006/meth.2001.1262View ArticlePubMedGoogle Scholar
- Zimmermann AK, Simon P, Seeburger J, Hoffmann J, Ziemer G, Aebert H, Wendel HP: Cytokine gene expression in monocytes of patients undergoing cardiopulmonary bypass surgery evaluated by real-time PCR. J Cel Mol Med 2003, 7: 146-156. 10.1111/j.1582-4934.2003.tb00213.xView ArticleGoogle Scholar
- Meijerink J, Mandigers C, van de Locht L, Tönnissen E, Goodsaid F, Raemaekers J: A novel method to compensate for different amplification efficiencies between patient DNA samples in quantitative real-time PCR. J Mol Diagn 2001,3(2):55-61.PubMed CentralView ArticlePubMedGoogle Scholar
- Kamphuis W, Schneemann A, van Beek LM, Smit AB, Hoyng PF, Koya E: Prostanoid receptor gene expression profile in human trabecular meshwork: a quantitative real-time PCR approach. Invest Ophthalmol Vis Sci 2001,42(13):3209-3215.PubMedGoogle Scholar
- Pfaffl MW, Horgan GW, Dempfle L: Relative expression software tool (REST) for group-wise comparison and statistical analysis of relative expression results in real-time PCR. Nucleic Acids Res 2002, 30: e36. 10.1093/nar/30.9.e36PubMed CentralView ArticlePubMedGoogle Scholar
- Liu W, Saint DA: A new quantitave method of real time reverse transcription polymerase chain reaction assay based on simulation of polymerase chain reaction kinetics. Anal Biochem 2002, 302: 52-59. DOI 10.1006/abio.2001.5530.View ArticlePubMedGoogle Scholar
- Karlen Y, McNair A, Perseguers S, Mazza Ch, Mermod N: Statistical significance of quantitative PCR. BMC Bioinformatics 2007, 8: 131. DOI 10.1186/147-2105-8-131.PubMed CentralView ArticlePubMedGoogle Scholar
- Schefe JH, Lehmann KE, Buschmann IR, Unger T, Funke-Kaiser H: Quantitave real-timeRT-PCR data analysis: current concepts and the novel "gene expression's C_{ T }difference" formula. J Mol Med 2006, 84: 901-910. DOI 09-006-0097-6. 10.1007/s00109-006-0097-6View ArticlePubMedGoogle Scholar
- Qiu H, Durand K, Rabinovitch-Chable H, Rigaud M, Gazaille V, Clavere P, Sturtz FG: Gene expression of HIF-1α and XRCC4 measured in human samples by real-time RT-PCR using the sigmoidal curve-fitting method. BioTechniques 2007, 42: 355-362. DOI 10.2144/000112331.View ArticlePubMedGoogle Scholar
- Bukovská A, Čikoš Š, Juhás Š, Il'ková G, Rehák P, Koppel J: Effects of a combination of thyme and oregano essential oils on TNBS-induced colitis in mice. Med Inflam, in press.Google Scholar
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.