- Research article
- Open Access
Evaluation of suitable reference genes for gene expression studies in bovine muscular tissue
BMC Molecular Biologyvolume 9, Article number: 79 (2008)
Real-time reverse transcriptase quantitative polymerase chain reaction (real-time RTqPCR) is a technique used to measure mRNA species copy number as a way to determine key genes involved in different biological processes. However, the expression level of these key genes may vary among tissues or cells not only as a consequence of differential expression but also due to different factors, including choice of reference genes to normalize the expression levels of the target genes; thus the selection of reference genes is critical for expression studies. For this purpose, ten candidate reference genes were investigated in bovine muscular tissue.
The value of stability of ten candidate reference genes included in three groups was estimated: the so called 'classical housekeeping' genes (18S, GAPDH and ACTB), a second set of genes used in expression studies conducted on other tissues (B2M, RPII, UBC and HMBS) and a third set of novel genes (SF3A1, EEF1A2 and CASC3). Three different statistical algorithms were used to rank the genes by their stability measures as produced by geNorm, NormFinder and Bestkeeper. The three methods tend to agree on the most stably expressed genes and the least in muscular tissue. EEF1A2 and HMBS followed by SF3A1, ACTB, and CASC3 can be considered as stable reference genes, and B2M, RPII, UBC and GAPDH would not be appropriate. Although the rRNA-18S stability measure seems to be within the range of acceptance, its use is not recommended because its synthesis regulation is not representative of mRNA levels.
Based on geNorm algorithm, we propose the use of three genes SF3A1, EEF1A2 and HMBS as references for normalization of real-time RTqPCR in muscle expression studies.
In the last few years, Real-time reverse transcriptase quantitative polymerase chain reaction (Real-time RTqPCR) has been successfully used to measure mRNA species copy number as a way to determine key genes involved in different biological processes: disease, economic traits, etc. [e.g. [1–4]]. This technique shows a high sensitivity over a wide range of transcript expression levels and enables high throughput capabilities . Nevertheless, it is subject to substantial technical variability in expression measures due to different factors such as type and quality of samples [6, 7], starting cell number, RNA extraction and reverse transcription methods [8–10]. Moreover, the biological interpretations of expression results critically depend on normalization of transcript signals to mRNA standards before statistical evaluation, which will allow the control of the variability produced by all the mentioned factors . Normalization of the expression levels of the target genes is performed through reference genes [12, 10] also called housekeeping , which are internal endogenous controls that should be constitutively expressed in a tissue, across samples and treatments. Misinterpretation of data occurs when expression measures are erroneously normalized to a subset of mRNAs that are subject to strong regulation [14, 15]. The correct reference genes can be selected by evaluating data from Real-time RTqPCR with statistical algorithms such as geNorm , Bestkeeper  or Normfinder . Common reference genes for normalization of qRT-PCR data in skeletal muscle include ACTB, β2-microglobulin, GAPDH, PPIA, and 18S and 28S rRNAs [19–21]. Most studies use only one reference gene, generally 18S rRNA, ACTB or GAPDH [see [20, 22, 23], respectively], and more rarely B2M or PPIA [see [24, 25] respectively]; however, the analysis of the stability of these genes in muscle shows contradictory conclusions [19, 26, 27]. Erkens and co-workers  checked 10 different reference genes for pig muscle expression studies, proposing ACTB, TBP and TOP2B as good references in measures by Real-time RTqPCR to contrast between different muscle fibers and between muscle and adipose tissue. Also Nygard et al.  select high quality reference genes for real-time qPCR data interpretation in muscle tissue and others.
In the present study, the expression stability and level of ten candidate reference genes is measured with the aim of creating a set of genes which can be used in bovine skeletal muscle tissue for normalization of mRNA measures by Real-time RTqPCR. For this purpose we evaluate a set of "classical housekeeping" genes (18S, GAPDH and ACTB), a second set of genes used in expression studies conducted on other tissues (B2M, RPII, UBC and HMBS) and a third set of other genes (SF3A1, EEF1A2 and CASC3) on samples of Longissimus dorsi for which fatty acid profiles have been measured, in an effort to avoid misinterpretation of expression data produced in transcription studies of bovine skeletal muscle samples.
• RNA source, total RNA extraction and cDNA synthesis
A total of 120 bovine individuals were measured for long chain omega 3 fatty acids [%LCω3 = % eicosapentaenoic acid (20:5, ω3; EPA) + % docosapentaenoic acid (22:5, ω3; DPA) + % docosahexaenoic acid (22:6, ω3; DHA)]. Ten of these individuals showing the highest and ten showing the lowest percentages (differences p < 0.001) for this phenotype were selected. For each, a sample of 25 mg from Longissimus dorsi taken shortly after slaughtering was homogenized and RNA was extracted using commercial spin-columns (RNeasy® Fibrous Tissue Mini Kit, QIAGEN), yielding around 10–20 μg of total RNA protected against RNase degradation with RNA secure™ Reagent 1× (Ambion).
Two μl of total RNA were used to produce a retro-transcription reaction using an iScript™ cDNA Synthesis Kit (Bio-Rad), following the manufacturer's recommendations. To perform gene testing, part of the reaction was diluted 1/10 and part was pooled and serial diluted to construct the standard curves; all the aliquots were stored at -70°C until use. The quality and concentration of total RNA representing each sample was assessed by conventional agarose electrophoresis and through absorbance measurements (ratio 260/280 ≥ 2). Intact 28S and 18S rRNA subunit were observed on the gel indicating minimal degradation of the RNA.
• Selection of genes and primer design
Known sequences were used to design primers for eight genes (Table 1) using primer 3 , preventing possible secondary structures with QIAGEN Oligo Toolkit http://www.operon.com/ and Dinamelt server , and ensuring the specificity of the sequence by BLAST [32, 33]. Two primer pair sequences were defined by Chitko-McKown and co-workers  and Goossens et al. 
• Real-time RTqPCR
Real-time RTqPCR reactions were performed in an iCycler IQ Real-Time PCR Detection System (Bio-Rad) and a master mix was prepared using Dynamo™ HS SYBR® Green qPCR Kit (Finnzymes), 0.4 mM of each primer and 2.7 μl of 1/10 diluted RT (regardless of initial concentration) in 15 μl reaction volumes. After the selection of the most adequate annealing temperature, standard curves and no-template controls were produced in triplicate for each gene, together with the sample assays. The following experimental run protocol was used: quantification program consisting of 45 cycles of 95°C for 25 sec, 10s at annealing temperature and 15 s at 72°C, ending with a melting program ranging from 68°C to 95°C with a heating rate of 0.1°C/10 sec and continuous fluorescence measurement.
The results were exported from the iCycler IQ Real-Time PCR Detection System into Microsoft Excel files for further analysis.
• Data Analysis
Real-time RTqPCR data were exported into an Excel datasheet (Microsoft Excel 2003) and analyzed using three separate reference gene stability analysis software packages; geNorm , Bestkeeper©  and NormFinder . The three methods generate a measure of reference gene stability, which can be used to rank the reference genes; GeNorm generates an M value for each gene which is arbitrarily suggested to be lower than 1.5 (with a lower value indicating increased gene stability across samples), and a pairwise stability measure to determine the benefit of adding extra reference genes for the normalization process (again with a lower value indicating greater stability of the normalization factor). An arbitrary cut off value of 0.15 indicates acceptable stability of the reference gene combination. Similarly, NormFinder generates a stability measure of which a lower value indicates increased stability in gene expression and groups samples to allow direct estimation of expression variation, ranking genes according to the similarity of their expression profiles by using a model-based approach. Bestkeeper© generates a pairwise correlation co-efficient between each gene and the Bestkeeper index (the geometric mean of the threshold cycle values of all the reference genes grouped together).
Results and discussion
When faced with any tissue expression experiment, the lack of really stable reference genes makes misinterpretation of results a high risk . It is considered necessary to have a list of genes which will enable satisfactory normalization of the expression data of the experiment and permit biological conclusions. There are many different ways to test the aptitude of a control gene for normalization in an experimental design and they have been compared with contradictory results [e.g.  or ]; although all give an order of candidate reference genes on the basis of the relative expression stabilities through mathematical evaluation of expression data, in our opinion geNorm  seems the best option as it proposes the number of reference genes necessary for accurate normalization, helping to choose the most stable genes by expressing a stability measure (M), which is the average pairwise variation for a given gene compared to all the other tested genes. Stepwise exclusion of the gene with the highest M value allows ranking of the tested genes according to their expression stability. Bestkeeper  and Normfinder  also rank the candidate reference genes and Normfinder is able to identify the single gene with the most stable expression, whereas geNorm detects the two genes whose expression ratios show least variation from those of the other genes tested. On the other hand, Bestkeeper does not restart pair-wise correlation after one gene is eliminated; nevertheless, results with geNorm are quite similar. Results of the three algorithms comparison and identification of the set of reference genes that offers reliable results for Real-time RTqPCR data normalization for use in gene expression studies involving muscular tissue from bovine individuals are listed in Table 2.
The need to validate a collection of reference genes in every tissue and between different treatments to ensure correct normalization has led us to validate a list of genes as reference for studies by Real-time RTqPCR of skeletal muscle fatty acid metabolism. Information on reference genes for use in expression normalization of samples from skeletal muscle tissue is scarce, although different papers address the same problem in other tissues [e.g. [29, 37]]. The ten different candidate genes were checked for different reasons: GAPDH, ACTB and 18S rRNA are used as single control genes in more then 90% of the published expression studies  and are specifically used as reference genes in Longissimus dorsi in the pig . However, it has been reported that ACTB is most relevant for high abundant transcript  and together with GAPDH, it fluctuates dramatically  and should be rejected. Although the 18S rRNA measure of stability seems to be within the range of acceptance (when using geNorm but also for Bestkeeper), it has repeatedly been documented that it is not a good control gene [see e.g. [38, 15, 39]], as its synthesis regulation is not representative of mRNA levels .
B2M, RPII and UBC are frequently used for mRNA measures as reference genes in other tissues but not in skeletal muscle . We added SF3A1, EEF1A2, HMBS and CASC3 as genes with validated stability in several cellular classes [16, 42, 43], but not in mammalian skeletal muscle. All the genes chosen belong to different functional classes to avoid co-variation between them.
Melting curves generated (not shown) ensure the correct amplification of all genes tested in this work when using the primers shown in Table 1, with PCR amplification efficiency values near to 100% [8, 9]; correlation coefficients (r2) between the logarithm of the cDNA starting quantity and the Ct were, at least, 0.95 for all genes (Table 2). Negative controls lacking template show no amplification or a very late exponential growth and, in this case, the melting curve reveals clearly identified negligible peaks.
GeNorm calculation of the internal control gene-stability measure (M) ranks our gene expression variations (Figure 1), SF3A1 and EEF1A2 being the most stable with an M value around 0.66. When ranking the reference genes by different methods (Table 2), we found that EEF1A2 appears as the best reference gene both in geNorm and Normfinder. Genes HMBS, ACTB, CASC3 and B2M can be accepted as stable with M values between 0.7 – 1, and to a lesser extent RPII and 18S with M values below 1.5.
Use of GAPDH as a housekeeping gene would not be appropriate, as it appears to be regulated in muscle tissue showing an M value of 1.63 and is thus not recommended as a reference gene. Actually this gene is ranked as the worst with all algorithms used here and the inadequate use of this gene has also been documented in other studies [43, 28]. Our assessment is that GAPDH, 18S and RPII are not able to make conclusions obtained by expression measures normalized with them, as these genes show expression differences between the two sets of samples.
As regards the number of genes to be used in expression studies, de Jonge et al.  report that no single gene qualifies as a 'real' housekeeping gene. The calculation of V values by geNorm for the proposed genes (Figure 2) is useful for deciding their optimal number to be used in an expression study ; pairwise variation between samples is reduced by the inclusion of additional reference genes and thus indicates the number of genes required to achieve an arbitrarily selected threshold of reference gene stability; a recommended cut-off value of 0.15 shows genes SF3A1 and EEF1A2 as having good stability in relative quantification. If HMBS, ACTB and/or CASC3 are added, a more accurate normalization can be performed, but the use of more than three control genes is unnecessary for the analyses .
Using the geometric mean of the three most stable genes listed, we conclude that SF3A1, EEF1A2 and HMBS would lead to powerful results in bovine skeletal muscle tissue. These genes have already shown stability in breast cancer , salmon muscle  and human fibroblasts , and now show to be stable in mammalian skeletal muscle transcriptome studies.
Puntschart A, Wey E, Jostarndt K, Vogt M, Wittwer M, Widmer HR, Hoppeler H, Billeter R: Expression of fos and jun genes in human skeletal muscle after exercise. Am J Physiol 1998, 274(1 Pt 1):C129-C137.
Pattison JS, Folk LC, Madsen RW, Childs TE, Spangenburg EE, Booth FW: Expression profiling identifies dysregulation of myosin heavy chains IIb and IIx during limb immobilization in the soleus muscles of old rats. J Physiol 2003, 553(Pt 2):357-68. 10.1113/jphysiol.2003.047233.
Bernard C, Cassar-Malek I, Le Cunff M, Dubroeucq H, Renand G, Hocquette JF: New indicators of beef sensory quality revealed by expression of specific genes. J Agric Food Chem 2007, 55(13):5229-37. 10.1021/jf063372l.
Pal NR, Aguan K, Sharma A, Amari S: Discovering biomarkers from gene expression data for predicting cancer subgroups using neural networks and relational fuzzy clustering. BMC Bioinformatics 2007, 8: 5. 10.1186/1471-2105-8-5.
Heid CA, Stevens J, Livak KJ, Williams PM: Real time quantitative PCR. Genome Res 1996, 6(10):986-94. 10.1101/gr.6.10.986.
Bustin SA, Benes V, Nolan T, Pfaffl MW: Quantitative real-time RT-PCR – a perspective. J Mol Endocrinol 2005, 34(3):597-601. Review. 10.1677/jme.1.01755.
Imbeaud S, Graudens E, Boulanger V, Barlet X, Zaborski P, Eveno E, Mueller O, Schroeder A, Auffray C: Towards standardization of RNA quality assessment using user-independent classifiers of microcapillary electrophoresis traces. Nucleic Acids Res 2005, 33: e56. 10.1093/nar/gni054.
Bustin SA: Tenth annual nucleic acid-based technologies: time to stop and think. Expert Rev Mol Diagn 2002, 2(5):405-8. 10.1586/14737184.108.40.2065.
Lekanne Deprez RH, Fijnvandraat AC, Ruijter JM, Moorman AFM: Sensitivity and accuracy of quantitative real-time polymerase chain reaction using SYBR green I depends on cDNA synthesis conditions. Anal Biochem 2002, 307: 63-69. 10.1016/S0003-2697(02)00021-0.
Stahlberg A, Hakansson J, Xian X, Semb H, Kubista M: Properties of the reverse transcription reaction in mRNA quantification. Clin Chem 2004, 50(3):509-15. 10.1373/clinchem.2003.026161.
Huggett J, Dheda K, Bustin S, Zumla A: Real-time RT-PCR normalisation; strategies and considerations. Genes Immun 2005, 6: 279-284. 10.1038/sj.gene.6364190.
Bustin SA, Nolan T: Pitfalls of Quantitative Real-Time Reverse-Transcription Polymerase Chain Reaction. J Biomol Tech 2004, 15: 155-166.
Dheda K, Huggett JF, Bustin SA, Johnson MA, Rook G, Zumla A: Validation of housekeeping genes for normalizing RNA expression in real-time PCR. Biotechniques 2004, 37: 112-119.
Schmittgen TD, Zakrajsek BA: Effect of experimental treatment on housekeeping gene expression: validation by real-time, quantitative RT-PCR. J Biochem Biophys Methods 2000, 20;46(1–2):69-81. 10.1016/S0165-022X(00)00129-9.
Tricarico C, Pinzani P, Bianchi S, Paglierani M, Distante V, Pazzagli M, Bustin SA, Orlando C: Quantitative real-time reverse transcription polymerase chain reaction: normalization to rRNA or single housekeeping genes is inappropriate for human tissue biopsies. Anal Biochem 2002, 309(2):293-300. 10.1016/S0003-2697(02)00311-1.
Vandesompele J, De Preter K, Pattyn F, Poppe B, Van Roy N, De Paepe A, Speleman F: Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes. Genome Biology 2002, 3: 0034.1-0034.11. 10.1186/gb-2002-3-7-research0034.
Pfaffl MW, Tichopad A, Prgomet C, Neuvians TP: Determination of stable housekeeping genes, differentially regulated target genes and sample integrity: BestKeeper- Excel-based tool using pair-wise correlations. Biotechnology Letters 2004, 26: 509-515. 10.1023/B:BILE.0000019559.84305.47.
Andersen CL, Jensen JL, Orntoft TF: Normalization of Real-Time Quantitative Reverse Transcription-PCR Data: A Model-Based Variance Estimation Approach to Identify Genes Suited for Normalization, Applied to Bladder and Colon Cancer Data Sets. Cancer Res 2004, 64: 5245-5250. 10.1158/0008-5472.CAN-04-0496.
Murphy RM, Watt KK, Cameron-Smith D, Gibbons CJ, Snow RJ: Effects of creatine supplementation on housekeeping genes in human skeletal muscle using real-time RT-PCR. Physiol Genomics 2003, 12(2):163-74.
Timmons JA, Jansson E, Fischer H, Gustafsson T, Greenhaff PL, Ridden J, Rachman J, Sundberg CJ: Modulation of extracellular matrix genes reflects the magnitude of physiological adaptation to aerobic exercise training in humans. BMC Biol 2005, 3: 19. 10.1186/1741-7007-3-19.
de Jonge HJ, Fehrmann RS, de Bont ES, Hofstra RM, Gerbens F, Kamps WA, de Vries EG, Zee AG, te Meerman GJ, ter Elst A: Evidence based selection of housekeeping genes. PLoS ONE 2(9):e898. 2007 Sep 19 10.1371/journal.pone.0000898 10.1371/journal.pone.0000898
Razeghi P, Young ME, Abbasi S, Taegtmeyer H: Hypoxia in vivo decreases peroxisome proliferator-activated receptor alpha-regulated gene expression in rat heart. Biochem Biophys Res Commun 287(1):5-10. 2001 Sep 14 10.1006/bbrc.2001.5541 10.1006/bbrc.2001.5541
Marcell TJ, Harman SM, Urban RJ, Metz DD, Rodgers BD, Blackman MR: Comparison of GH, IGF-I, and testosterone with mRNA of receptors and myostatin in skeletal muscle in older men. Am J Physiol Endocrinol Metab 2001, 281(6):E1159-64.
Mahoney DJ, Parise G, Melov S, Safdar A, Tarnopolsky MA: Analysis of global mRNA expression in human skeletal muscle during recovery from endurance exercise. FASEB J 2005, 19(11):1498-500.
Birot OJ, Koulmann N, Peinnequin A, Bigard XA: Exercise induced expression of vascular endothelial growth factor mRNA in rat skeletal muscle is dependent on fibre type. J Physiol 2003, 552: 213-221. 10.1113/jphysiol.2003.043026.
Mahoney DJ, Carey K, Fu MH, Snow R, Cameron-Smith D, Parise G, Tarnopolsky MA: Real-time RT-PCR analysis of housekeeping genes in human skeletal muscle following acute exercise. Physiol Genomics 2004, 18: 226-231. 10.1152/physiolgenomics.00067.2004.
Jemiolo B, Trappe S: Single muscle fiber gene expression in human skeletal muscle: validation of internal control with exercise. Biochem Biophys Res Commun 2004, 320: 1043-1050. 10.1016/j.bbrc.2004.05.223.
Erkens T, Van Poucke M, Vandesompele J, Goossens K, Van Zeveren A, Peelman LJ: Development of a new set of reference genes for normalization of real-time RT-PCR data of porcine backfat and longissimus dorsi muscle, and evaluation with PPARGC1A. BMC Biotechnol 2006, 6: 41. 10.1186/1472-6750-6-41.
Nygard AB, Jørgensen CB, Cirera S, Fredholm M: Selection of reference genes for gene expression studies in pig tissues using SYBR green qPCR. BMC Molecular Biology 2007, 8: 67. 10.1186/1471-2199-8-67.
Rozen S, Skaletsky H: Primer3 on the WWW for general users and for biologist programmers. Methods Mol Biol 2000, 132: 365-86.
Markham NR, Zuker M: DINAMelt web server for nucleic acid melting prediction. Nucleic Acids Res 2005, (33 Web Server):W577-81. 10.1093/nar/gki591.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol 1990, 215: 403-410.
Wheeler D, Bhagwat M: BLAST QuickStart: Example-Driven Web-Based BLAST Tutorial. Methods Mol Biol 2007, 395: 149-76.
Chitko-McKown CG, Fox JM, Miller LC, Heaton MP, Bono JL, Keen JE, Grosse WM, Laegreid WW: Gene expression profiling of bovine macrophages in response to Escherichia coli O157:H7 lipopolysaccharide. Dev Comp Immunol 2004, 28(6):635-45. 10.1016/j.dci.2003.10.002.
Goossens K, Van Poucke M, Van Soom A, Vandesompele J, Van Zeveren A, Peelman LJ: Selection of reference genes for quantitative real-time PCR in bovine preimplantation embryos. BMC Dev Biol 2005, 5: 27. 10.1186/1471-213X-5-27.
Spinsanti G, Panti C, Lazzeri E, Marsili L, Casini S, Frati F, Fossi C: Selection of reference genes for quantitative RT-PCR studies) in striped dolphin: Stenella coeruleoalba) skin biopsies. BMC Molecular Biology 2006, 7: 32. 10.1186/1471-2199-7-32.
Maccoux LJ, Clements DN, Salway F, Day PJ: Identification of new reference genes for the normalisation of canine osteoarthritic joint tissue transcripts from microarray data. BMC Mol Biol 8: 62. 2007 Jul 25 10.1186/1471-2199-8-62 10.1186/1471-2199-8-62
Solanas M, Moral R, Escrich E: Unsuitability of using ribosomal RNA as loading control for Northern blot analyses related to the imbalance between messenger and ribosomal RNA content in rat mammary tumors. Anal Biochem 2001, 288(1):99-102. 10.1006/abio.2000.4889.
Mogal A, Abdulkadir SA: Effects of Histone Deacetylase Inhibitor: HDACi); Trichostatin-A: TSA) on the expression of housekeeping genes. Mol Cell Probes 2006, 20(2):81-6. 10.1016/j.mcp.2005.09.008.
Radonic A, Thulke S, Mackay IM, Landt O, Siegert W, Nitsche A: Guideline to reference gene selection for quantitative real-time PCR. Biochem Biophys Res Commun 313(4):856-62. 2004 Jan 23 10.1016/j.bbrc.2003.11.177 10.1016/j.bbrc.2003.11.177
Ohl F, Jung M, Xu C, Stephan C, Rabien A, Burkhardt M, Nitsche A, Kristiansen G, Loening SA, Radoniæ A, Jung K: Gene expression studies in prostate cancer tissue: which reference gene should be selected for normalization? J Mol Med 2005, 83(12):1014-24. 10.1007/s00109-005-0703-z.
Szabo A, Perou CM, Karaca M, Perreard L, Quackenbush JF, Bernard PS: Statistical modeling for selecting housekeeper genes. Genome Biol 2004, 5(8):R59. 10.1186/gb-2004-5-8-r59.
Olsvik PA, Lie KK, Jordal AE, Nilsen TO, Hordvik I: Evaluation of potential reference genes in real-time RT-PCR studies of Atlantic salmon. BMC Mol Biol 6: 21. 2005 Nov 17 10.1186/1471-2199-6-21 10.1186/1471-2199-6-21
The samples and phenotypes used in this study and funding belong to the EU project: GeMQual QLRT-CT2000-0147.
RP and IT carried out the qPCR experiments, participated in the election of genes and drafted the manuscript. RP and SD participated in the design of the study and the statistical analysis. SD contributed to drafting the manuscript and supervised the process. All authors read and approved the final manuscript.
About this article
- Reference Gene
- Candidate Reference Gene
- Reference Gene Stability
- Reverse Transcriptase Quantitative Polymerase Chain
- Mammalian Skeletal Muscle