Reference genes for normalization of gene expression studies in human osteoarthritic articular cartilage
© Pombo-Suarez et al; licensee BioMed Central Ltd. 2008
Received: 11 May 2007
Accepted: 29 January 2008
Published: 29 January 2008
Assessment of gene expression is an important component of osteoarthritis (OA) research, greatly improved by the development of quantitative real-time PCR (qPCR). This technique requires normalization for precise results, yet no suitable reference genes have been identified in human articular cartilage. We have examined ten well-known reference genes to determine the most adequate for this application.
Analyses of expression stability in cartilage from 10 patients with hip OA, 8 patients with knee OA and 10 controls without OA were done with classical statistical tests and the software programs geNorm and NormFinder. Results from the three methods of analysis were broadly concordant. Some of the commonly used reference genes, GAPDH, ACTB and 18S RNA, performed poorly in our analysis. In contrast, the rarely used TBP, RPL13A and B2M genes were the best. Two or three of these genes had to be used together to obtain the best results. The specific combination depended, to some extent, on the type of samples being compared.
Our results provide a satisfactory set of previously unused reference genes for qPCR in hip and knee OA. This confirms the need to evaluate the suitability of reference genes in every tissue and experimental situation before starting the quantitative assessment of gene expression by qPCR.
Osteoarthritis (OA) is the most common rheumatic disease and a leading cause of disability in the elderly. It involves ligaments, subchondral bone, synovium and cartilage [2, 3]. Most research in OA has focused on articular cartilage, where the disease becomes highly evident in its late stages. Biochemical changes in chondrocytes and extracellular matrix components are followed by macroscopic lesions including thinning, fibrillation, fissuring and erosion of cartilage that eventually lead to denudation of subchondral bone. These changes result from active processes that involve matrix destruction and inefficient repair [4–6]. Progress in the management of OA requires better knowledge of the regulation of these processes, as they could have a different impact depending on the etiology of the disease. The commonest form of OA is idiopathic and appears only in the elderly. Nevertheless, some forms of OA have a genetic cause or are secondary to rheumatic, endocrine, metabolic or neuropathic diseases or to local factors like trauma, infection or avascular necrosis. This variety of etiologies, together with the chronic evolution of OA, its heterogeneity in different joints, and the possibility of wide differences in gene expression between different disease stages or areas of cartilage, complicates OA research [8–10]. There are no generally accepted methods to address these issues.
In recent years, it has become possible to study gene expression in cartilage satisfactorily. A major problem has been the difficulty in obtaining RNA due to the unique characteristics of human cartilage, such as low cell content, collagenous matrix and richness in proteoglycans that co-purify with RNA. Methods improving RNA yield and quality, as well as methods of cDNA amplification by in vitro transcription that can compensate for the poor content of RNA in human cartilage, have been reported [11–14]. Techniques allowing precise quantification of gene expression are also available: microarrays for a large number of genes or quantitative real-time PCR (qPCR) for individual genes [5, 9, 10, 15–18]. The latter has a main role in studies focused on a few genes and in the validation of results from microarray studies. However, the full potential of qPCR cannot be realized unless specific care is taken [19, 20].
The absolute amount of a gene RNA can be quantified by qPCR, but this is seldom done because between-sample variation in RNA extraction, reverse transcription and PCR efficiency makes the procedure inaccurate. Also, the requirement for gene-specific calibration curves makes it too complex for many laboratories [20, 21]. The commonest alternative is relative quantification, in which normalization by endogenous reference genes allows comparison between samples but not between genes. In this approach, selection of the reference gene is critical because its expression should be invariable under the conditions of study [22–25]. The suitability of reference genes in cartilage has not been addressed previously, though a study on chondrocytes has recently been published, and this lack of analysis is not without risks. Most reference genes in common use were selected because they are housekeeping genes, that is, they are widely and constitutively expressed in many tissues and stages of development. However, these genes are also regulated. They were useful for Northern blot and RNase protection assays but not for the more sensitive real-time qPCR. Small changes in expression of the reference genes will lead to wrong conclusions, as has been shown in many areas of research [25, 27–29]. Therefore, expression stability of the prospective reference genes should be explored in each specific tissue and type of experiment [22, 30, 31]. In fact, most authors who have investigated this area agree on the need to use more than a single reference gene to obtain high-quality data [30, 32]. In our study, we have explored ten well-known reference genes to identify the most suitable for normalization of qPCR data from human cartilage obtained from the hips and knees of elderly healthy subjects and OA patients.
The prospective reference genes included in this study covered a wide range of expression levels in articular cartilage (mean Ct values ranging from 18 to 36). Results from individual samples showed a uniform dispersion around the mean without any marked skewing (not shown). qPCR replicates showed very low variability, with a mean coefficient of variation (CV) of 1.08% ± 1.2 (standard deviation, SD). When the raw individual values were stratified by group of samples (hip OA, knee OA, or hip controls) there were differences in the HPRT1 and 18S RNA values. After data transformation, which involved correction by well-specific efficiency and determination of the relative value of each sample in relation to the gene-specific median of all samples, the Mann-Whitney U test showed that 18S RNA was expressed at significantly lower levels in hip and knee OA cartilage than in control hip cartilage (p = 0.03). This difference indicated a possible source of spurious results if 18S RNA is used for normalization in qPCR across the mentioned groups of samples.
We were concerned by the possibility that sex differences between control and OA samples could affect the choice of the most stable genes. However, separate geNorm analyses in men and women provided the same set of three most stable genes, TBP, RPL13A and B2M. The geNorm analysis was also repeated after excluding the HPRT1 and 18S RNA genes, given the mentioned differences in expression between groups of samples. In this new analysis, the most stable genes were RPL13A and TBP, followed by B2M, in agreement with the previous findings.
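The group comparison mentioned above rests on the Mann-Whitney U statistic. The following is a minimal Python sketch of that statistic only, using midranks for tied values; the function name and the example values are hypothetical, and the p-value (which the study obtained from a statistics package) is deliberately not computed here.

```python
def mann_whitney_u(group_a, group_b):
    """Return (U_a, U_b) for two independent samples, using midranks for ties.

    U_a counts the pairs in which a value from group_a exceeds a value
    from group_b (ties count as 0.5); U_b is the complementary count.
    """
    # Pool the observations, remembering which group each came from.
    pooled = sorted(
        (value, group_index)
        for group_index, values in enumerate((group_a, group_b))
        for value in values
    )

    rank_sum_a = 0.0
    i = 0
    n = len(pooled)
    while i < n:
        # Find the run of tied values starting at position i.
        j = i
        while j < n and pooled[j][0] == pooled[i][0]:
            j += 1
        # Ranks i+1 .. j (1-based) share the same midrank.
        midrank = (i + 1 + j) / 2
        for k in range(i, j):
            if pooled[k][1] == 0:
                rank_sum_a += midrank
        i = j

    n_a, n_b = len(group_a), len(group_b)
    u_a = rank_sum_a - n_a * (n_a + 1) / 2
    u_b = n_a * n_b - u_a
    return u_a, u_b


# Hypothetical relative expression values, for illustration only.
oa_values = [0.6, 0.8, 0.9]
control_values = [1.1, 1.3, 1.6]
u_oa, u_control = mann_whitney_u(oa_values, control_values)
```

A small U for one group means its values rank consistently below those of the other group, which is the pattern reported for 18S RNA in OA versus control cartilage.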
A first consideration in the analysis of our results is that the prospective reference genes that we have analyzed had already been selected in previous studies because of their utility for this function [25, 28, 30, 32]: they have relatively stable expression and we did not expect large differences between them. Also, these genes are from different functional families and are not known to be coregulated. This implies that each one provides independent and complementary information, which is an important requisite for geNorm analysis. A second important consideration is that we have made efforts to minimize every known source of experimental variation by DNase digestion, adjusting the amount of input RNA, using two-step reverse transcription polymerase chain reaction (RT-PCR) and correcting raw results for PCR efficiency [19–21, 24, 33]. These steps are a requisite for the assumption that observed results reflect true gene expression. We found it especially necessary to include a DNase digestion step because the human genome has many processed pseudogenes inserted by retrotransposition that are amplified in PCR even with intron-spanning primers.
The systematic selection of the best reference genes for real-time qPCR has been approached with different methods. All of them look for stability of the expression levels, assessed by absence of differences between clinically relevant groups, by relative stability in relation to other reference genes [30, 34], or by stability in relation to clinically relevant groups. These analyses have been facilitated by the free availability of programs and by the description of their principles and use [30, 32, 34]. There are no definitive reasons to prefer one method over the others, as their relative strengths depend on the circumstances, and we have used three of the best-grounded: a conventional statistical test to compare clinically relevant groups and the geNorm and NormFinder programs. Results from the three were broadly similar, though the conventional statistical analysis lacked sensitivity. The independence of our results from the analysis method gives credence to the conclusions.
The most striking result was the poor performance of some commonly used reference genes. A special case was GAPDH, which is widely used in many areas of research and is one of the best reference genes in many tissues. Nevertheless, there have also been previous examples of this gene leading to wrong results due to its lack of stability in specific experimental conditions. In our study, GAPDH was not among the best reference genes in any of the analyses done. Two other commonly used reference genes, ACTB and 18S RNA, performed better in our tests but they were not among the more stable genes in most comparisons. These results confirm, once more, the need to evaluate the reference genes in each experimental setting. A particularly striking example in this regard is the contrast between our results and those reported in prostate cancer tissue, where HPRT1 was the most stable gene, and RPL13A and ACTB were the most unstable. In our experiments, their ranks were reversed, i.e. HPRT1 was the most unstable and RPL13A one of the most stable genes.
Best reference genes in articular cartilage from elderly subjects were among the less commonly used: TBP, RPL13A and B2M. Best results will be obtained by combining two or three reference genes as emphasized by several authors [30, 32]. We propose that for general studies of cartilage from elderly subjects a combination of TBP and RPL13A could be a good starting point, with the inclusion of B2M if practical. For specific comparisons other combinations could be more appropriate.
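One common way to combine several reference genes into a single normalization factor is the geometric mean of their relative quantities, the approach named in the title of the Vandesompele et al. geNorm reference listed below. The following Python sketch assumes that approach; the function names and the numeric values are hypothetical and are not data from this study.

```python
from statistics import geometric_mean  # available in Python 3.8+


def normalization_factor(reference_quantities):
    """Geometric mean of the relative quantities of the chosen reference genes."""
    return geometric_mean(reference_quantities)


def normalized_expression(target_quantity, reference_quantities):
    """Relative quantity of a target gene divided by the normalization factor."""
    return target_quantity / normalization_factor(reference_quantities)


# Hypothetical relative quantities for one sample (illustration only).
sample_references = {"TBP": 2.0, "RPL13A": 8.0}
factor = normalization_factor(sample_references.values())
target_normalized = normalized_expression(8.0, sample_references.values())
```

The geometric mean is less sensitive than the arithmetic mean to one reference gene with a much larger quantity than the others, which is the usual argument for using it when genes with very different abundance are combined.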
Finally, it is necessary to take into account some limitations of our study. First, we have included a limited array of prospective reference genes. Other genes have been proposed for use in qPCR, and it is possible that some of them are better candidates for articular cartilage studies in elderly subjects. Microarray data from cartilage, which are now starting to be published [5, 10], will provide clues for the identification of the best candidates. Second, our results only apply directly to articular cartilage, with a focus on OA of large joints. In particular, collection of samples from surgical procedures dictated that all donors were older than 60 years and that OA samples were of an advanced disease stage. This mimics most of the studies of cartilage in OA. However, it is unclear how well our results could be extended to other joint areas, patients of different ages or OA at early stages. Nevertheless, our study can serve as a guide for any kind of cartilage study, and reference genes could be used once tested for low M values. It is also unclear to what extent results obtained with SYBR Green quantification will be applicable to other relative qPCR techniques.
Precise assessment of gene expression in cartilage samples from elderly subjects requires selection of suitable reference genes. Some of the commonly used genes performed poorly, questioning the accuracy of previous reports. Combinations of the previously unused genes TBP, RPL13A and B2M were found to perform adequately and are recommended to improve evaluation of gene expression in OA research. In studies involving only the hip joint, TBP and RPL13A are the best choice.
Characteristics of the cartilage donors included in the study (table fields: No. of patients; Age, median (range) years; Collins grade, average (range))
Cartilage dissection and evaluation
Intact femoral heads and knees were washed and kept in sterile PBS at 4°C. The surface of the cartilage was carefully examined and graded by the macroscopic visual Collins' scale modified by Muehleman. Briefly, grade 0: no signs of cartilage lesions; grade 1: very limited disruptions of the articular surface with no changes in surface geometry; grade 2: deep fibrillation and fissuring, early marginal hyperplasia and, possibly, small osteophytes; grade 3: extensive fibrillation and fissuring, 30% or less of the articular cartilage surface eroded down to the subchondral bone, and osteophytes; and grade 4: lips or shelves at the articular margin, greater than 30% of the articular surface eroded down to the subchondral bone and gross geometric changes and osteophytes. Cartilages with Collins grades 0 and 1 are considered normal, while cartilages of grade 2 and higher are considered degenerated. Given the advanced stage of disease in the OA samples, there were areas of the joint surface without cartilage. All the remaining cartilage was removed from the bone using a scalpel, chopped into 2–5 mm pieces and snap-frozen in liquid nitrogen within 6 hours of surgery. For consistency, we also took all available cartilage from control donors. Special care was taken to exclude fibrotic tissue or any subchondral bone contamination. Tissue pieces were stored at -80°C until further processing.
RNA extractions from articular cartilage were performed following the method of Price et al. with the addition of a DNase digestion step. Frozen cartilage was weighed and 1 g was ground using a stainless-steel mortar and pestle cooled in liquid nitrogen. After initial extraction in TRI Reagent (Sigma, Saint Louis, MO), the aqueous phase was mixed with a half volume of 100% ethanol and further purified on silica-gel-based membranes using the RNeasy Plant Mini Kit (Qiagen, Valencia, CA) according to the manufacturer's instructions. A DNase I (Qiagen, Valencia, CA) digestion step was performed on the spin column. Concentration of the isolated RNA and the 260/280 nm absorbance ratio were measured with the NanoDrop® ND-1000 spectrophotometer (NanoDrop Technologies, Wilmington, DE, USA). Samples with a 260/280 ratio < 1.90 were discarded.
Real-time quantitative PCR characteristics (table field: Amplicon size (bp))
PCR efficiency and Cycle threshold (Ct) determination
PCR efficiency was calculated with the LinRegPCR program from raw fluorescence data taken from the Chromo4™ real-time PCR detection system. According to this method, PCR efficiency is derived from the slope of the straight line that best fits the log-linear part of the amplification curve. Mean efficiencies were determined in sample duplicates and used to adjust Ct values. Ct values, the cycle number at which the fluorescence signal of the sample exceeds background fluorescence, were used for the quantitative comparison of the amplification rates. They were obtained using Opticon Monitor™ version 3.0 software, provided with the Chromo4™ real-time PCR detection system. After baseline subtraction, threshold lines were manually established for each gene to cross the log-linear part of the fluorescence curves. Mean Ct values of the duplicates were determined and transformed into relative quantities.
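The two calculations described above can be sketched in Python under stated assumptions. The first function estimates efficiency as 10 raised to the slope of a least-squares line through log10(fluorescence) versus cycle number within the log-linear window; this is the idea behind LinRegPCR, not that program's actual implementation, and the choice of window is left to the caller. The second converts Ct values into relative quantities as E^(-Ct) scaled to the gene-specific median, which is one plausible reading of the transformation used in this study. All names and numbers are hypothetical.

```python
import math
from statistics import median


def efficiency_from_log_linear(cycles, fluorescence):
    """Estimate amplification efficiency (fold increase per cycle).

    Fits log10(fluorescence) against cycle number by ordinary least squares
    and returns 10**slope. Fluorescence values must be positive and should
    come only from the log-linear part of the curve.
    """
    logs = [math.log10(f) for f in fluorescence]
    n = len(cycles)
    mean_x = sum(cycles) / n
    mean_y = sum(logs) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(cycles, logs))
    sxx = sum((x - mean_x) ** 2 for x in cycles)
    return 10 ** (sxy / sxx)


def relative_quantities(ct_values, efficiencies):
    """Efficiency-corrected quantities for one gene, relative to their median.

    Each sample's starting amount is taken as proportional to E**(-Ct),
    with E the efficiency measured for that sample.
    """
    raw = [e ** (-ct) for ct, e in zip(ct_values, efficiencies)]
    gene_median = median(raw)
    return [q / gene_median for q in raw]


# Idealized curve that doubles every cycle (illustration only).
window_cycles = [10, 11, 12, 13, 14]
window_signal = [2.0 ** c for c in window_cycles]
estimated_e = efficiency_from_log_linear(window_cycles, window_signal)

# Three hypothetical samples of one gene, one cycle apart, efficiency 2.
example_rq = relative_quantities([20, 21, 22], [2.0, 2.0, 2.0])
```

With an efficiency of 2, a sample that crosses the threshold one cycle earlier than the median sample has twice its relative quantity, and one that crosses a cycle later has half.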
Mann-Whitney U tests were performed with Statistica, version 7 (Statsoft, Tulsa, OK). The programs geNorm™, version 3.4, and NormFinder were used to calculate stability of the candidate reference genes. The first, geNorm, relies on the principle that the expression ratio of two reference genes should be identical in all samples, regardless of the experimental condition. It calculates the expression stability measure (M) for the set of candidate reference genes and, by stepwise exclusion of the least stable gene in each step, arrives at the most stable pair of reference genes. It also provides a way to estimate the best number of required reference genes. NormFinder follows a different approach: it calculates a stability value for each individual candidate reference gene, taking into account separation of samples in the different groups that are of interest in the specific area of research. In this case, the stability value is based on the combined estimate of intra- and intergroup variation of gene expression.
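The geNorm principle described above can be illustrated with a short Python sketch. It follows the published definition of M (for each gene, the average over all other candidates of the standard deviation of the log2 expression ratio across samples) and the stepwise exclusion of the gene with the highest M. It is not the geNorm program itself; function names and the toy data are hypothetical, and the sample standard deviation is an assumption of this sketch.

```python
import math
from statistics import stdev


def genorm_m(expression):
    """Stability measure M per gene.

    expression maps gene name -> list of relative quantities, with samples
    in the same order for every gene. Lower M means more stable.
    """
    genes = list(expression)
    m_values = {}
    for gene in genes:
        pairwise_sd = []
        for other in genes:
            if other == gene:
                continue
            log_ratios = [
                math.log2(a / b)
                for a, b in zip(expression[gene], expression[other])
            ]
            pairwise_sd.append(stdev(log_ratios))
        m_values[gene] = sum(pairwise_sd) / len(pairwise_sd)
    return m_values


def stepwise_exclusion(expression):
    """Drop the least stable gene repeatedly until two genes remain.

    Returns (genes in order of exclusion, the final most stable pair).
    """
    remaining = dict(expression)
    excluded = []
    while len(remaining) > 2:
        m_values = genorm_m(remaining)
        worst = max(m_values, key=m_values.get)
        excluded.append(worst)
        del remaining[worst]
    return excluded, sorted(remaining)


# Toy data: gene_a and gene_b keep a constant ratio, gene_c does not.
toy = {
    "gene_a": [1.0, 2.0, 4.0, 8.0],
    "gene_b": [2.0, 4.0, 8.0, 16.0],
    "gene_c": [1.0, 8.0, 2.0, 4.0],
}
toy_m = genorm_m(toy)
toy_excluded, toy_best_pair = stepwise_exclusion(toy)
```

In the toy data the ratio of gene_a to gene_b is identical in every sample, so their pairwise variation is zero and gene_c is the first to be excluded. The last two genes cannot be ranked against each other by this procedure, which is why geNorm reports a most stable pair rather than a single best gene.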
List of abbreviations
RT-PCR: Reverse transcription polymerase chain reaction
qPCR: Quantitative real-time PCR
M: Expression stability measure
TBP: TATA box binding protein
HPRT1: Hypoxanthine phosphoribosyltransferase 1
RPL13A: Ribosomal protein L13a
Succinate dehydrogenase complex
Ribosomal protein S18
We thank sample donors for their generosity, Dr Fernando Baltar-Tojo for providing access to surgery samples, Isabel Castro-Perez and Maria Dolores Alvarez-Vilariño for collecting surgery material and Cristina Fernández for outstanding technical support. This project was financed by the MMA Foundation (Madrid, Spain). MP-S received a bursary from the Fundacion Española de Reumatologia. AG was supported by the Instituto de Salud Carlos III (Spain).
- Woolf AD, Pfleger B: Burden of major musculoskeletal conditions. Bull World Health Organ 2003, 81: 646-656.
- Felson DT: Risk factors for osteoarthritis: understanding joint vulnerability. Clin Orthop Relat Res 2004, S16-21. 10.1097/01.blo.0000144971.12731.a2
- Cimmino MA, Parodi M: Risk factors for osteoarthritis. Semin Arthritis Rheum 2005, 34: 29-34. 10.1016/j.semarthrit.2004.03.009
- Lapadula G, Iannone F: Metabolic activity of chondrocytes in human osteoarthritis as a result of cell-extracellular matrix interactions. Semin Arthritis Rheum 2005, 34: 9-12. 10.1016/j.semarthrit.2004.03.004
- Aigner T, Fundel K, Saas J, Gebhard PM, Haag J, Weiss T, Zien A, Obermayr F, Zimmer R, Bartnik E: Large-scale gene expression profiling reveals major pathogenetic pathways of cartilage degeneration in osteoarthritis. Arthritis Rheum 2006, 54: 3533-3544. 10.1002/art.22174
- Smith GN Jr: The role of collagenolytic matrix metalloproteinases in the loss of articular cartilage in osteoarthritis. Front Biosci 2006, 11: 3081-3095. 10.2741/2034
- Poole ARGF, Abramson SB: Etiopathogenesis of osteoarthritis. In Osteoarthritis. 4th edition. Edited by: Moskovitz RWAR, Hochberg MC, Buckwalter JA, Goldberg VM. Philadelphia: Lippincott Williams & Wilkins; 2007:27-49.
- Yagi R, McBurney D, Laverty D, Weiner S, Horton WE Jr: Intrajoint comparisons of gene expression patterns in human osteoarthritis suggest a change in chondrocyte phenotype. J Orthop Res 2005, 23: 1128-1138. 10.1016/j.orthres.2004.12.016
- Eid K, Thornhill TS, Glowacki J: Chondrocyte gene expression in osteoarthritis: Correlation with disease severity. J Orthop Res 2006, 24: 1062-1068. 10.1002/jor.20137
- Sato T, Konomi K, Yamasaki S, Aratani S, Tsuchimochi K, Yokouchi M, Masuko-Hongo K, Yagishita N, Nakamura H, Komiya S, Beppu M, Aoki H, Nishioka K, Nakajima T: Comparative analysis of gene expression profiles in intact and damaged regions of human osteoarthritic cartilage. Arthritis Rheum 2006, 54: 808-817. 10.1002/art.21638
- Mallein-Gerin F, Gouttenoire J: RNA extraction from cartilage. Methods Mol Med 2004, 100: 101-104.
- Price JS, Waters JG, Darrah C, Pennington C, Edwards DR, Donell ST, Clark IM: The role of chondrocyte senescence in osteoarthritis. Aging Cell 2002, 1: 57-65. 10.1046/j.1474-9728.2002.00008.x
- Wang J, Hu L, Hamilton SR, Coombes KR, Zhang W: RNA amplification strategies for cDNA microarray experiments. Biotechniques 2003, 34: 394-400.
- Subkhankulova T, Livesey FJ: Comparative evaluation of linear and exponential amplification techniques for expression profiling at the single-cell level. Genome Biol 2006, 7: R18. 10.1186/gb-2006-7-3-r18
- Dell'Accio F, De Bari C, El Tawil NM, Barone F, Mitsiadis TA, O'dowd J, Pitzalis C: Activation of WNT and BMP signaling in adult human articular cartilage following mechanical injury. Arthritis Res Ther 2006, 8: R139. 10.1186/ar2029
- Gosset M, Berenbaum F, Levy A, Pigenet A, Thirion S, Saffar JL, Jacques C: Prostaglandin E2 synthesis in cartilage explants under compression: mPGES-1 is a mechanosensitive gene. Arthritis Res Ther 2006., 8:
- Soder S, Roach HI, Oehler S, Bau B, Haag J, Aigner T: MMP-9/gelatinase B is a gene product of human adult articular chondrocytes and increased in osteoarthritic cartilage. Clin Exp Rheumatol 2006, 24: 302-304.
- Stove J, Gremmes C, Gunther KP, Scharf HP, Schwarz M: Metabolic activity and gene expression of osteoarthritic chondrocytes in correlation with radiological and histological characteristics. Biomed Pharmacother 2006, 60: 644-647. 10.1016/j.biopha.2006.09.005
- Bustin SA, Nolan T: Pitfalls of quantitative real-time reverse-transcription polymerase chain reaction. J Biomol Tech 2004, 15: 155-166.
- Wong ML, Medrano JF: Real-time PCR for mRNA quantitation. Biotechniques 2005, 39: 75-85.
- Huggett J, Dheda K, Bustin S, Zumla A: Real-time RT-PCR normalisation; strategies and considerations. Genes Immun 2005, 6: 279-284. 10.1038/sj.gene.6364190
- Suzuki T, Higgins PJ, Crawford DR: Control selection for RNA quantitation. Biotechniques 2000, 29: 332-337.
- Lee PD, Sladek R, Greenwood CM, Hudson TJ: Control genes and variability: absence of ubiquitous reference transcripts in diverse mammalian expression studies. Genome Res 2002, 12: 292-297. 10.1101/gr.217802
- Vandesompele J, De Paepe A, Speleman F: Elimination of primer-dimer artifacts and genomic coamplification using a two-step SYBR green I real-time RT-PCR. Anal Biochem 2002, 303: 95-98. 10.1006/abio.2001.5564
- Dheda K, Huggett JF, Bustin SA, Johnson MA, Rook G, Zumla A: Validation of housekeeping genes for normalizing RNA expression in real-time PCR. Biotechniques 2004, 37: 112-119.
- Toegel S, Huang W, Piana C, Unger F, Wirth M, Goldring M, Gabor F, Viernstein H: Selection of reliable reference genes for qPCR studies on chondroprotective action. BMC Molecular Biology 2007, 8: R13. 10.1186/1471-2199-8-13
- Glare EM, Divjak M, Bailey MJ, Walters EH: beta-Actin and GAPDH housekeeping gene expression in asthmatic airways is variable and not suitable for normalising mRNA levels. Thorax 2002, 57: 765-770. 10.1136/thorax.57.9.765
- Ohl F, Jung M, Xu C, Stephan C, Rabien A, Burkhardt M, Nitsche A, Kristiansen G, Loening SA, Radonic A, Jung K: Gene expression studies in prostate cancer tissue: which reference gene should be selected for normalization? J Mol Med 2005, 83: 1014-1024. 10.1007/s00109-005-0703-z
- Laidlaw AM, Copeland B, Ross CM, Hardingham JE: Extent of over-expression of hepatocyte growth factor receptor in colorectal tumours is dependent on the choice of normaliser. Biochem Biophys Res Commun 2006, 341: 1017-1021. 10.1016/j.bbrc.2006.01.060
- Vandesompele J, De Preter K, Pattyn F, Poppe B, Van Roy N, De Paepe A, Speleman F: Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes. Genome Biol 2002, 3: R34. 10.1186/gb-2002-3-7-research0034
- Dheda K, Huggett JF, Chang JS, Kim LU, Bustin SA, Johnson MA, Rook GA, Zumla A: The implications of using an inappropriate reference gene for real-time reverse transcription PCR data normalization. Anal Biochem 2005, 344: 141-143. 10.1016/j.ab.2005.05.022
- Andersen CL, Jensen JL, Orntoft TF: Normalization of real-time quantitative reverse transcription-PCR data: a model-based variance estimation approach to identify genes suited for normalization, applied to bladder and colon cancer data sets. Cancer Res 2004, 64: 5245-5250. 10.1158/0008-5472.CAN-04-0496
- Ramakers C, Ruijter JM, Deprez RH, Moorman AF: Assumption-free analysis of quantitative real-time polymerase chain reaction (PCR) data. Neurosci Lett 2003, 339: 62-66. 10.1016/S0304-3940(02)01423-4
- Pfaffl MW, Tichopad A, Prgomet C, Neuvians TP: Determination of stable housekeeping genes, differentially regulated target genes and sample integrity: BestKeeper – Excel-based tool using pair-wise correlations. Biotechnol Lett 2004, 26: 509-515. 10.1023/B:BILE.0000019559.84305.47
- Muehleman C, Bareither D, Huch K, Cole AA, Kuettner KE: Prevalence of degenerative morphological changes in the joints of the lower extremity. Osteoarthritis Cartilage 1997, 5: 23-37. 10.1016/S1063-4584(97)80029-5
- Rozen S, Skaletsky H: Primer3 on the WWW for general users and for biologist programmers. In Bioinformatics Methods and Protocols: Methods in Molecular Biology. Edited by: Krawetz S, Misener S. Totowa, NJ: Humana Press; 2000:365-382.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.