Quantification of mRNA in single cells and modelling of RT-qPCR induced noise
© Bengtsson et al; licensee BioMed Central Ltd. 2008
Received: 19 March 2008
Accepted: 17 July 2008
Published: 17 July 2008
Skip to main content
© Bengtsson et al; licensee BioMed Central Ltd. 2008
Received: 19 March 2008
Accepted: 17 July 2008
Published: 17 July 2008
Gene expression has a strong stochastic element resulting in highly variable mRNA levels between individual cells, even in a seemingly homogeneous cell population. Access to fundamental information about cellular mechanisms, such as correlated gene expression, motivates measurements of multiple genes in individual cells. Quantitative reverse transcription PCR (RT-qPCR) is the most accessible method which provides sufficiently accurate measurements of mRNA in single cells.
Low concentration of guanidine thiocyanate was used to fully lyse single pancreatic β-cells followed by RT-qPCR without the need for purification. The accuracy of the measurements was determined by a quantitative noise-model of the reverse transcription and PCR. The noise is insignificant for initial copy numbers >100 while at lower copy numbers the noise intrinsic of the PCR increases sharply, eventually obscuring quantitative measurements. Importantly, the model allows us to determine the RT efficiency without using artificial RNA as a standard. The experimental setup was applied on single endocrine cells, where the technical and biological noise levels were determined.
Noise in single-cell RT-qPCR is insignificant compared to biological cell-to-cell variation in mRNA levels for medium and high abundance transcripts. To minimize the technical noise in single-cell RT-qPCR, the mRNA should be analyzed with a single RT reaction, and a single qPCR reaction per gene.
Cells in a population are in many aspects unique in their characteristics, even in a seemingly homogenous culture or tissue. Single cell gene expression levels–protein as well as mRNA–show large cell-to-cell variations both in a resting state and when exposed to stimuli, stemming in part from the stochastic nature of gene expression [1–6]. This implies that data obtained from a population of cells cannot be assumed to reflect the behaviour of the individual cell. Instead, cells must be assayed one at a time which, in the case of mRNA measurements, requires characterization of femtograms of mRNA .
Quantitative reverse transcription polymerase chain reaction (RT-qPCR) offers sufficient sensitivity and it is the gold standard for mRNA quantification [8, 9]. Coupled with a method to handle and collect individual cells, quantification of mRNA in single cells is feasible for most laboratories today. Previously published protocols for single-cell RT-PCR are generally non-quantitative and specialized for a particular cell-type or involve laborious purification steps [10–13]. For single-cell mRNA quantification to become common practice, simple and generally applicable methods are needed. Indeed, non-proprietary protocols not requiring purification have been presented, but they involve only weak detergents serving as lysis agents that do not inhibit nucleases [14–19]. Lack of appropriate controls for single-cell measurements makes it difficult to guarantee mRNA integrity during collection and handling of cells.
Noise in RT-qPCR measurements increases with decreasing initial copy numbers  and quantification of rare transcripts pushes the measurement to a point where it no longer can be considered quantitative; at best the presence of mRNA will be detected. The details of this phenomenon–including the relative contribution of RT and qPCR respectively–are poorly investigated [20, 21]. Previous reports utilizing single-cell RT-qPCR have left these questions largely unanswered or with inconsistent results: qPCR induced variation ranging from ~8% (CV) at 4000 cDNA copies  down to <1% at two copies  have been shown. Without knowledge of the distribution and impact of the technical noise, it is hard to assess the validity of the measurements.
In this study, we present a single-cell RT-qPCR protocol without purification, with maintained mRNA integrity, high reproducibility and no inhibition of RT or PCR reactions. In addition, we investigate the nature of noise in RT-qPCR and through mathematical modelling we determine the accuracy in measurements of single mouse endocrine cells. Our model predicts how the noise in RT and PCR varies with different expression levels, which will aid in planning and design of single-cell RT-qPCR measurements. We conclude that the inherent variability of RT-qPCR is negligible compared to naturally occurring variation between cells for abundantly expressed genes, while uncertainties in the PCR may obfuscate measurements for rare transcripts.
The presence of potential inhibitors and RNases from the tissue or culture medium may reduce the cDNA yield. We addressed this concern by adding a known amount of purified total islet RNA to tubes containing one of the following: 1) extracellular solution (see the Methods section for details); 2) one cell in 0.5 M GuSCN in water; or 3) ten cells in 0.5 M GuSCN in water (Additional file 1: Table 1). The amount of RNA in the cells was negligible compared to the externally added RNA. The yield was unaffected by the extracellular solution and in fact increased in the presence of cells in GuSCN, a phenomenon investigated further (see below). As an additional control for inhibitors, we added an equal amount (~2000 copies) of in vitro transcribed artificial RNA, based on the cyclophilin E (Ppie) gene, to the lysis solution. Ppie was measured in every single-cell sample and deviation >1% led to exclusion from further analysis (data not shown).
In Figure 2B, an equal amount of purified RNA was reverse transcribed in the presence of 0, 40, 80 and 120 mM GuSCN, followed by qPCR using five different primer pairs. The RT-reaction efficiency was significantly improved (2- to 6-fold) for all tested genes (Ins1, Gcg, Sst, Gapdh and Rps29) using 40 mM GuSCN. By contrast, 80 mM GuSCN had no effect whereas 120 mM GuSCN was strongly inhibitory. We also evaluated the addition of 0.5–1.5% 2-mercapto ethanol, but no effect on the RT yield, either alone or in concert with GuSCN, was observed (data not shown). Formation of correct PCR-products was confirmed by agarose gel electrophoresis. Neither of the other two lysis conditions, NP-40 or proteinase K, had any discernible effect on RT yield (Additional file 1: Figure 2B ). Taken together, we chose to use ~0.5 M GuSCN in the single-cell lysis solution due to superior cell lysis ability and positive effect on the RT reaction at the concentration of 40 mM.
The total technical noise level of RT-qPCR was estimated by diluting purified total RNA from four to six different concentrations (equivalent of ~5-5·103 copies of chromogranin B (Chgb), ~10-104 copies of ribosomal protein S29 (Rps29), ~50-103 copies of SRY-box containing gene 2 (Sox2), nestin (Nes) and glial fibrillary acidic protein (Gfap) and ~500-5·105 copies of Ins2 transcripts). For each dilution, 24 identical RT-qPCR reactions were run. The mRNA concentrations were chosen to primarily cover low-medium abundance transcripts, i.e. Chgb, Gfap, Nes, Rps29 and Sox2. Consequently, the concentration of the high abundance Ins2 transcript is 50–100 times higher. The qPCR-specific noise was calculated from 30 qPCR replicates (Rps29) at five different concentrations of mouse cDNA. The measurements were combined to create a model of RT- and qPCR-induced noise, as well as errors introduced by the liquid handling. The technical reproducibility of RT and qPCR is represented here by the noise strength (η2 = SD2/mean2), for different initial copy numbers. Assuming that the noise from each reaction is additive, we can write the total noise as η2Total = η2Dilution + η2RT+ η2PCR and by fitting this model to the data, we can quantify the contribution of each component. η2Dilution corresponds to Poisson-type noise which arises when a sample with a low number of molecules is imperfectly diluted. Additional noise is introduced by the RT-reaction and the final term corresponds to the noise introduced by the qPCR step. In Additional file 2, we derive functional forms for each component and we show how they can be assessed from replicate measurements.
Finally, we investigated PCR inhibition caused by overloading the reaction with cDNA [11, 22–24]. We tried to mitigate the PCR inhibition by decreasing the ratio between reverse transcriptase and Taq polymerase, and found that a ratio ≤ 2 (U/U) was required to completely remove inhibitory effects. At low concentrations of RNA, decreasing the RT-enzyme concentration did not have a negative effect on RT-yield (Additional file 1: Table 2).
Individual cells have transcript levels which vary significantly from the population mean [1–6, 14, 19]. The biological significance of this heterogeneity is still not clearly understood and it is an active area of research today. To characterize this cell-to-cell variation, high precision measurements of single cells are required. Technically, such measurements are still very challenging and we have presented an RT-qPCR protocol which should be accessible for most laboratories.
By using a purification-free protocol we are both simplifying single-cell measurements and eliminating RNA losses associated with current available purification protocols. High concentrations (≥ 4 M) of guanidine salts are commonly used in standard RNA purification protocols, where it denatures nucleases rapidly and liberates nucleic acids from bound proteins [25–29]. Since GuSCN strongly inhibits downstream reactions, the RNA is normally purified. At lower concentrations of GuSCN the effect diminishes, but RNases are completely inhibited even at 1 M GuSCN  and we show that the cell lysis ability remains strong at 0.5 M. By ensuring sufficient dilution (10-fold) of the GuSCN prior reverse transcription, the concentration is brought down to regions where it does not interfere with the enzymatic reactions. Indeed, we demonstrate that low concentrations (~40 mM) even had a favourable effect on the RT yield and reproducibility. This effect may be a result from reduced secondary structures of the mRNA , enhanced enzyme activity , and/or that GuSCN protects the mRNA from degradation.
We have also developed a mathematical model which allows us to quantify the contributions from the RT and PCR steps to the technical noise. The only parameter in our model which is not directly obtained from the experimental data is the RT efficiency. The RT efficiency can be estimated experimentally by RT-qPCR using known amounts of synthetic RNA mimicking the properties of the native mRNA . However, producing full length mRNAs is challenging and it is common to use a truncated RNA molecule which may not be reverse transcribed with the same efficiency as the native mRNA. Instead, we use our model to determine the RT efficiency indirectly; using non-linear regression we find the RT efficiency which best fits the data. We find that this indirect method gives realistic results for Chgb, Rps29, Gfap, Nes and Sox2 and we believe that it could be a useful way of estimating the RT efficiency for a given gene.
In summary, the single-cell mRNA should be reverse transcribed in a single RT reaction followed by one qPCR reaction per gene. It is advantageous to use more of the single-cell cDNA for low-abundance genes and a smaller part for abundantly expressed genes. These conclusions are only valid for low mRNA levels. However, loading large volumes of cDNA to the qPCR may cause inhibition. In an effort to reduce this effect, we confirm previous findings suggesting a lowered reverse transcriptase concentration and an increased Taq enzyme concentration [22, 23]. An alternate solution is to precipitate the single-cell cDNA in ethanol, thus utilizing a larger fraction of the total amount in the qPCR . Yet another option is multiplex pre-amplification RT-PCR for 10–15 cycles, followed by singleplex qPCR on the resulting pool of PCR-products[33, 34]. This could decrease technical noise levels, provided that the multiplex PCR is efficient, reproducible and unbiased.
Our model of the technical noise suggests that most of the noise in the method stems from the qPCR, at least when the initial template copy number is low and the RT-efficiency is relatively high. However, our analyses show that the RT-efficiencies vary significantly across genes [21, 35] and for low efficiencies, the RT reaction will constitute a larger proportion of the total noise. Compared to the biological cell-to-cell variation, the technical noise is insignificant down to ~1000 copies, but becomes considerable at <100 RNA copies. The model includes only three experimental factors; the RT-reaction, dilution of the sample and the qPCR while in reality there are many more steps contributing to the total noise. Thus, our estimates of the noise components should be considered upper bounds since they will most likely become smaller if additional factors are included in the model.
We found that the strong denaturant guanidine thiocyanate can serve as lysis agent and RNase inhibitor and improve reverse transcription yield in purification-free single-cell RT-qPCR. Our protocol allows fast and streamlined measurements of single-cell gene expression. Analysis of technical noise caused by the RT- and PCR-reaction showed that this noise is insignificant compared to biological cell-to-cell variation at mRNA copy numbers >~100. Below this level technical noise stemming from the PCR, which increases dramatically when initial copy number is <~20 cDNA copies, becomes significant. In addition, to achieve a high technical reproducibility, genes should be analyzed with lowest possible dilution between the RT and qPCR. That is, a single qPCR measurement should be performed for each gene, where the number of genes to be analyzed determines the total dilution factor between RT and PCR.
Pancreatic islets were prepared from healthy female National Maritime Research Institute (NMRI) mice aged 3–4 months (Bomholtgaard) and fed a normal diet ad libitum. The mice were sacrificed by cervical dislocation, and pancreatic islets were isolated by collagenase P digestion (Roche) followed by manual collection of islets . To prepare dispersed single cells the collected islets were gently shaken at low extracellular Ca2+ concentration to dissolve the structure of the islet . Dispersed cells were plated in plastic 35 mm Petri dishes (Nunc) in RPMI 1640 medium (SVA) supplemented with 10% FCS, 100 U ml-1 penicillin, and 10 μgml-1 streptomycin (all from Invitrogen) in the presence of 5 mM glucose (Sigma-Aldrich). The cells were maintained in culture for 2–6 hours, allowing them to attach to the surface of the dish
Primary astrocytes were prepared and cultured as described . Cells were grown in Dulbecco's modified Eagle's medium (Sigma-Aldrich) containing 10% fetal calf serum, 2 mmol/L L-glutamine, and penicillin-streptomycin (all Invitrogen).
Dispersed attached cells were washed twice with a buffer (referred to as extracellular solution, EC) containing 138 mM NaCl, 5.6 mM KCl, 1.2 mM MgCl2, 2.6 mM CaCl2, 5 mM HEPES (pH 7.4 with NaOH) and 3, 6, 10 or 20 mM glucose to remove dead and loose debris prior to cell collection with patch-clamp pipettes. The dish, containing adhered cells and approximately 1 ml EC-solution, was mounted in a standard inverted light-microscope (Zeiss Axiovert 135). Borosilicate glass capillaries (Hilgenberg GmbH) with outer diameter of 1.5–1.6 mm and wall thickness of 0.16 mm were pulled to pipettes using a patch-clamp pipette puller (Heka PIP5). The diameter of the tip was approximately 10 μm on average, substantially wider than standard patch-clamp pipettes and large enough to allow passage of an intact cell. The glass pipette was mounted on a hydraulic micromanipulator (Narishige) on the microscope. By manually controlling the pressure inside the pipette it was possible to collect intact or nearly intact cells with minimum volume of extracellular solution (<<0.1 μl). In total, 158 cells were collected, and 126 (80%) were identified as β-cells based on their expression of insulin while most of the remaining cells were α-cells (16%). Six samples were negative for all measured genes and they were categorized as technical failures (96% success rate).
Single pancreatic islets of roughly the same size were placed in 10 μl of various lysis buffers. The detergents Nonidet P-40 (NP-40, also known as Igepal CA-630, Sigma-Aldrich) and guanidine thiocyanate (GuSCN, Sigma-Aldrich) were used. Samples were incubated at 60°C or 80°C for 15 minutes (60°C for samples containing 0.4 mg/ml proteinase K (Invitrogen)) followed by 5 min incubation at 95°C and frozen at -25°C for subsequent analysis. Samples were diluted 1:20 with mQ water (ELGA Labwater) prior reverse transcription.
Primary astrocytes were washed twice in PBS followed by single cell dissociation with 0.25% Trypsin/EDTA (Invitrogen) treatment for 3 min. Trypsin was inactivated by addition of cell medium and washed once in PBS. Cells were then filtrated using a 40 μm cell strainer (Becton Dickinson). Cells were evenly distributed between sample tubes and all liquid were eliminated after a short centrifugation (3000 g for 1 min) followed by addition of 50 μl lysis buffer. 2 U/μl RNaseOut (Invitrogen) was used as RNase inhibitor. Samples were incubated at 80°C for 5 minutes and frozen at -25°C for subsequent analysis. Samples were diluted 1:4 with mQ water prior reverse transcription.
In single-cell experiments, the glass pipettes were emptied in 0.2 ml plastic tubes containing either 2 μl of 0.5 M GuSCN or 1 μl of 1 M GuSCN in water. The emptying required a custom-made device consisting of a tube holder lined up with a coarse micromanipulator on which the pipette was mounted. The glass pipette was carefully flushed with lysis solution a few times to make sure the cell entered the tube. In most cases, the tip of the pipette was gently broken in the tube thereby facilitating the flushing of the pipette. Tubes were then immediately frozen on dry ice (-78°C) and stored at -80°C for subsequent reverse transcription. Short-term storage on wet ice (0°C) did not affect degradation or subsequence reaction performance.
Total RNA was purified with GenElute Mammalian Total RNA Kit (Sigma-Aldrich) or RNeasy Mini Kit (Qiagen) and concentrations were measured with a NanoDrop ND-1000 spectrophotometer (Nanodrop Technologies).
An artificial RNA control was generated with the T7 RNA Polymerase in vitro transcription system (Takara). A PCR assay for cyclophilin E (Ppie) was used as template for the in vitro transcription. Ppie PCR product was generated using the protocol as for the SYBR Green-based real-time PCR assays (see below), except that all fluorophores were excluded, and purified using QIAquick PCR purification kit (Qiagen). The PCR product was then amplified using an extended forward PCR primer including the promoter sequence for T7 RNA Polymerase. All primer sequences are found in Additional file 1: Table 3. The resulting (extended) PCR product was purified as above and used in the in vitro transcription reaction, according to the manufacturer's instruction. The reaction mixture contained: 50 U T7 RNA polymerase, 40 mM Tris-HCl (pH 8.0), 8 mM MgCl2, 2 mM spermidine, 5 mM dithiothreitol (all Takara), 2 mM NTP (Invitrogen), 20 U RNaseOut (Invitrogen) and ~40 ng template DNA. The reaction was carried out at 42°C for 1 h.). In samples containing ~2000 copies of the Ppie RNA, the average intra-assay coefficient of variation was 0.3% and the inter-assay variation was 1.2%.
Due to stochastic fluctuations, relative gene expression levels provide little information, leaving absolute quantification using standard curves as the best option to compare transcript levels within and between cells. For all assays, PCR-products were purified (QIAquick PCR purification kit, Qiagen) and quantified spectrophotometrically (NanoDrop ND-1000) using the following molar absorptivity values (in Moles-1cm-1): dAMP, 15200; dTMP, 8400; dGMP, 12010; dCMP, 7050. The PCR-products were diluted in series to generate a standard curve ranging 10-106 copies with PCR-efficiencies spanning 77–95% (Additional file 1: Figure 3). qPCR data analysis was performed essentially as described . Since PCR-products are double-stranded and cDNA is single-stranded, one cycle was subtracted from single-cell Ct-values for correct absolute quantification.
The reverse transcriptase SuperScript III (Invitrogen) was used throughout the study. 6.5 μl containing total RNA or lysed single cells, 0.5 mM dNTP (Sigma-Aldrich), 2.5 μM oligo(dT) (15 bp, Invitrogen), 2.5 μM random hexamers (Invitrogen) were incubated at 65°C for 5 min. We then added 50 mM Tris-HCl (pH 8.3), 75 mM KCl, 3 mM MgCl2, 5 mM dithiothreitol, 20 U RNaseOut and 100, 40 or 10 U (lower amount for single-cell-samples) SuperScript III (all Invitrogen) to total volume of 10 μl. Final reaction concentrations are shown. The temperature profiles used were 25°C for 5 min, 37°C for 10 min, 50°C for 80 min. All reactions were terminated at 70°C for 15 min. Throughout this paper we refer to absolute copy numbers estimated by DNA-based standard curves; assuming an RT efficiency of ≤ 100%, the true number of mRNA copies is equal to or higher than our estimate.
ABI PRISM 7900 HT Sequence Detection System (Applied Biosystems) and LightCycler 480 (Roche Diagnostics) equipped with 384-well blocks were used for qPCR measurements. Reactions (10 μl) with SYBR Green as fluorescent reporter contained 10 mM Tris (pH 8.3), 50 mM KCl, 3 mM MgCl2, 0.3 mM dNTP, 1 U JumpStart Taq polymerase, 1 × Reference Dye (all Sigma-Aldrich), 0.5 × SYBR Green I (Invitrogen) and 400 nM of each primer (TAGC Copenhagen). The Reference Dye was excluded using LightCycler 480. The iQ™ SYBR® Green Supermix (BioRad) was used for the Nes assay. The primer sequences are available in Additional file 1: Table 3. All qPCR assays were designed using Primer3Plus http://www.bioinformatics.nl/cgi-bin/primer3plus/primer3plus.cgi and spanned at least one intron when possible. They were optimized to not generate any unspecific products, such as primer-dimers, that could interfere with the quantification of the desired target. Formation of expected PCR products was confirmed by agarose gel electrophoresis (2%) for all assays, and melting curve analysis for all samples.
For RT-qPCR noise calculations, 24-plicate RT-qPCR on diluted purified total mouse islet RNA was used and 30-plicate qPCR reactions on total islet cDNA.
The theoretical model is based on an empirical characterization of the noise in the experiments. First, we consider the noise in the qPCR reaction only and determine how it scales as a function of the number of cDNAs based on replicates which have been diluted to different degrees. The qPCR noise is assumed to be the same for all genes. To model the RT-noise a parameter for the RT efficiency is fitted from replicates which have been diluted to different degrees. Using our empirically determined parameters for the model, we can calculate the contributions of the RT and qPCRs and the total measurement noise can be compared to the biological variation in a population of cells. A full description of the model is given in Additional file 2.
quantitative reverse transcription PCR.
We thank K Borglid and B-M Nilsson for technical assistance and M Kubista for critically reading the manuscript. MB's fellowship was initially funded by the Goran Gustafsson Foundation for Research in the Natural Sciences and Medicine and later by the Wellcome Trust. AS is supported by a postdoctoral fellowship award from the Swedish Research Council. MH was funded by the Engineering and Physical Sciences Research Council of the UK and Children's Hospital Ophthalmology Foundation. Additional funding was provided by The Royal Physiographic Society, Lund; The Foundation Blanceflor Boncompagni-Ludovisi, née Bildt; and Wilhelm and Martina Lundgren's Foundation.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.