- Methodology article
- Open Access
An improved genetic system for detection and analysis of protein nuclear import signals
BMC Molecular Biology volume 8, Article number: 6 (2007)
Nuclear import of proteins is typically mediated by their physical interaction with soluble cytosolic receptor proteins via a nuclear localization signal (NLS). A simple genetic assay to detect active NLSs based on their function in the yeast Saccharomyces cerevisiae has been previously described. In that system, a chimera consisting of a modified bacterial LexA DNA binding domain and the transcriptional activation domain of the yeast Gal4 protein is fused to a candidate NLS. A functional NLS will redirect the chimeric fusion to the yeast cell nucleus and activate transcription of a reporter gene.
We have reengineered this nuclear import system to expand its utility and tested it using known NLS sequences from adenovirus E1A. Firstly, the vector has been reconstructed to reduce the level of chimera expression. Secondly, an irrelevant "stuffer" sequence from the E. coli maltose binding protein was used to increase the size of the chimera above the passive diffusion limit of the nuclear pore complex. The improved vector also contains an expanded multiple cloning site and a hemagglutinin epitope tag to allow confirmation of expression.
The alterations in expression level and composition of the fusions used in this nuclear import system greatly reduce background activity in β-galactosidase assays, improving sensitivity and allowing more quantitative analysis of NLS bearing sequences.
The proper function of many proteins requires that they are targeted to their correct sub-cellular compartments via specific localization signals. Nuclear import of many cellular and viral proteins is typically mediated by their physical interaction with soluble cytosolic receptor proteins via nuclear localization signals (NLS) . A canonical NLS is characterized by a short single stretch of basic amino acids, as exemplified by the NLS sequence of the large T antigen of the simian virus 40 (PKKKRKV) . These monopartite signals generally contain at least three basic amino acids (B) with a consensus sequence fitting B4, P(B3X), PXX(B3X), B3(H/P), where P is proline, H is histidine, X is any amino acid and letters in parentheses can be in any order . Alternatively, bipartite NLSs contain two short stretches of basic amino acids separated by a non-conserved sequence, as found in the cellular nucleoplasmin protein (KRPAATKKAGQAKKKK) .
Proteins containing NLSs are imported into the nucleus by interacting in the cytosol with members of the importin α family of NLS receptors (also known as karyopherin α). Subsequent heterodimerization of importin α with importin β (also known as karyopherin β) and interaction with components of the nuclear pore complex (NPC) leads to translocation through the nuclear envelope in a GTP dependent fashion. Inside the nucleus, the importin α-substrate complex dissociates from importin β and the substrate subsequently dissociates from importin α . The yeast Saccharomyces cerevisiae express a single importin α, Srp1 , whereas multiple isoforms have been isolated in higher eukaryotes. Each isoform of importin α has distinct substrate specificities, suggesting that this may contribute to the regulation of nuclear import . Despite the fact that many thousands of cellular proteins are transported into the nucleus, very few NLSs have been characterized in detail, and existing work has been biased towards classical monopartite or bipartite signals. As nuclear transport regulates, at least in part, an extraordinarily diverse range of cellular processes including cell cycle, signal transduction, apoptosis and circadian rhythm, the identification of mechanisms regulating nuclear import is a valuable area of investigation.
The largest protein encoded by the human adenovirus type 5 (Ad5) early region 1A (E1A) gene contains 289 amino acid residues (289R). A highly related 243 amino acid (243R) protein is produced by alternative splicing . E1A is present in roughly equal amounts in the nucleus and cytoplasm and both the 289R and 243R proteins contain a well characterized monopartite NLS (KRPRP) located at their C-terminus . This NLS mediates nuclear import in vitro and in vivo and shows a distinct preference for human importin α3 in vitro . A second non-conventional NLS with the consensus sequence FV(X)7–20MXSLXYM(X)4MF, spans residues 142–182 and is unique to the larger 289 residue E1A protein [9, 10]. This sequence does not resemble other known NLSs and appears to be regulated developmentally . At least one other non-canonical NLS is present elsewhere within the E1A protein, as residues 23–120 of E1A have been reported to be sufficient to mediate nuclear accumulation via an unknown pathway in microinjected Xenopus laevis oocytes . Another study, which used indirect immunofluorescence to examine the subcellular localization of 243R E1A in virally infected baby rat kidney cells also demonstrated that the N-terminal region spanning residues 30–85 were involved in nuclear localization. In addition, this latter study confirmed that the well characterized C-terminal NLS was necessary for efficient nuclear localization and further suggested a role for a region encompassing residues 186–220 in infected cells . It is not currently known why E1A contains so many different NLSs. However, it is likely that the actions of E1A, like those of some cellular proteins , are closely regulated through modulation of the level of nuclear import. As the E1A proteins possess such an array of non-conventional nuclear import functions, they provide an excellent model system to identify and characterize novel mechanisms that contribute to nuclear localization.
The lack of clearly defined and consistent NLS consensus motifs makes it difficult to predict their presence in a protein of interest. Until recently, the only practical way to identify a functional NLS was by microinjecting or otherwise expressing the test protein in eukaryotic cells, forming heterokaryons or using in vitro transport systems. As an alternative, an easier and more sensitive method that exploited yeast genetics was devised to detect an active NLS, which is based on the expression of the test protein fused to a modified LexA DNA binding domain (DBD) and the Gal4 transcriptional activation domain (AD) in the yeast S. cerevisiae . This transcription based assay relies on the ability of a functional NLS to allow the chimera to enter the yeast nucleus and activate transcription of a LexA responsive β-galactosidase or HIS3 reporter gene. In the absence of a functional NLS, the fusion protein is not efficiently imported into the nucleus, and is unable to activate transcription. As a result, this assay provides a simple qualitative measure of NLS function based on β-galactosidase activity assays or by monitoring yeast growth on medium lacking histidine respectively. This system is not limited to the identification of yeast NLSs, as the nuclear import apparatus is highly conserved between yeast and higher eukaryotic cells [1, 15].
We have obtained this system and verified that Ad5 E1A actively functions to promote nuclear import under these conditions. However, the original system had two major problems that have now been corrected. A high level of background activity was initially observed, which was in part related to the high levels of test protein expression from the original multicopy pNIA vector. To solve this issue, we reconstructed the expression vector as a single copy yeast plasmid. The small size of the initial pNIA LexA fusion allows the chimera to passively diffuse into the nucleus, increasing the background activity. We have re-engineered the vector with an additional "stuffer" sequence that encodes a portion of the E. coli maltose binding protein that contains no NLS activity. This increases the size of the chimera well above the commonly accepted 50 kDa passive diffusion limit of the nuclear pore complex  and greatly reduces background activity in the β-galactosidase assays. The improved vector also contains an expanded multiple cloning site that simplifies cloning of target sequences and a hemagglutinin (HA) epitope tag to allow confirmation of expression.
Using this improved system, we demonstrate that it faithfully detects all the known E1A NLS functions in a quantitative fashion. Thus, this improved system may aid in the identification and characterization of novel NLS activities.
Results and discussion
Construction and evaluation of an improved vector for detecting nuclear import
A simple transcription based assay in yeast for detecting nuclear import has been described . Using this system, we evaluated nuclear import conferred by fusing the full length Ad5 289R E1A protein or the N-terminal 82 amino acids (1–82) in frame to the modified LexA DBD-Gal4 AD chimera (Fig. 2). Although a substantial increase in β-galactosidase activity was detected with E1A 1–82 as compared to the empty pNIA vector, surprisingly little change was observed with the full length 289R E1A protein or the well studied SV40 large T antigen NLS. Based on these results, we pursued a series of modifications to improve the quantitative functionality of this system and its general utility.
As described in the Materials and Methods and illustrated in Fig. 1, we first reconstructed the pNIA LexA DBD-Gal4 AD into another multicopy plasmid backbone originally derived from YEplac181 . This yielded pNIA-2μ, which has a known nucleotide sequence, contains an improved polylinker and has an HA epitope tag to allow convenient detection of expression of the LexA fusion. However, this vector contains the entire ADH1 promoter, which is substantially more active than the truncated ADH1 promoter in the original pNIA. Tests with the same candidate NLSs in pNIA-2μ showed a uniformly high level of activity for all constructs, including the empty vector (Fig. 2). This was likely caused by the very high level of expression of the LexA DBD fusions from the strong promoter in a multicopy plasmid. Indeed, Western blot analysis of the expression levels of the various fusions using an anti-HA antibody confirmed abundant expression, with the exception of E1A 1–82 (Fig. 3). Recalculation of the nuclear import based on relative levels of protein expression suggested that E1A 1–82 continues to display import activity above that of the corresponding empty vector (Fig. 2B).
To reduce the high level of protein expression from pNIA-2μ, we shifted the entire expression cassette into the single copy YCplac111 vector  and tested the same candidate NLSs in pNIA-CEN. Western blot analysis with the HA tag confirmed that expression levels of the fusions were lower in yeast transformed with the single copy plasmid than the multicopy plasmid (Fig. 3). Again, E1A 1–82 was consistently expressed at a lower level than the other constructs. The background activity of the empty pNIA-CEN vector was dramatically lower than that of pNIA-2μ (Fig. 2A). Importantly, all candidate NLSs conferred a strong increase in β-galactosidase activity in pNIA-CEN, regardless of whether they are normalized to protein expression (Fig. 2B). These results clearly show that overexpression of the LexA DBD fusions by the multicopy plasmid obscures nuclear import.
It is commonly accepted that the nuclear pore complex has a passive diffusion limit of approximately 50 kDa . The LexA DBD-Gal4 AD-HA fusion produced by the empty pNIA-2μ and pNIA-CEN vectors is 363 amino acids in size with a predicted molecular weight of about 40 kDa (Fig. 3). Thus, passive diffusion of the test fusion into the nucleus likely occurs and this could contribute to the relatively high background activity in the β-galactosidase assays. To overcome this issue, we inserted a "stuffer" fragment corresponding to a portion of the E. coli maltose binding protein (MBP) in frame between the LexA DBD and Gal4 AD to create pNIA-CEN-MBP. Analysis of this fragment of MBP with PSORT did not detect any sequences predicted to function as an NLS , and this was confirmed experimentally (Fig. 2). Insertion of this fragment increases the molecular weight of the test fusion to about 80 kDa (Fig. 3), well above the passive diffusion limit. All candidate NLSs confer a strong increase in β-galactosidase activity in pNIA-CEN-MBP. Importantly, the background activity of the empty vector decreased to nearly undetectable levels, yielding very large fold differences between the empty vector and vectors containing control NLSs (Fig. 2B). When compared with pNIA-CEN, these results suggest that passive diffusion of the smaller LexA DBD fusions obscures active nuclear import conferred by the candidate NLSs. Thus, the pNIA-CEN-MBP vector appears ideally suited for the analysis of short sequences that may function as NLSs. This is exemplified by the short SV40 large T antigen NLS, which showed the most dramatic increase in import activity between pNIA-CEN and pNIA-CEN-MBP (Fig. 2).
Analysis of nuclear import conferred by various portions of E1A
As described in the introduction, E1A contains a canonical NLS with the sequence KRPRP spanning residues 285–289. In addition, a second non-conventional NLS with the consensus sequence FV(X)7–20MXSLXYM(X)4MF, spanning residues 142–182 has been described and a third activity is localized within residues 30–85. PSORT analysis  of E1A also predicts the presence of a second canonical NLS with the sequence RRPK spanning residues 205–208, which is located within a region previously identified as influencing nuclear localization of E1A in infected baby rat kidney cells . Although the modified pNIA can detect the SV40 large T antigen NLS, and at least the N-terminal NLS in E1A (Fig. 2), it was unclear whether this yeast system can faithfully detect other NLSs that function in mammalian cells. To address this issue, we tested a number of fragments of E1A in pNIA-CEN-MBP to see if the various regions previously shown to confer nuclear import in a variety of other test systems were detected using this yeast system (Fig. 4). As previously shown in Fig. 2, full length 289R E1A or E1A 1–82 function to direct nuclear localization of the LexA chimera. In addition, E1A 139–204, which contains a developmentally regulated non-canonical NLS, also functions in the yeast system. Similarly, E1A 187–289, which encompasses the well characterized canonical NLS, as well as the PSORT predicted NLS, confers nuclear import. Deletion of this well characterized NLS, to generate E1A 187–281, greatly reduces import, suggesting that the major activity resides in the well characterized canonical NLS. In agreement, fusion of just this canonical NLS, contained within E1A 282–289, is sufficient to induce strong import of the chimera. Fusion of E1A 201–218, which contains a canonical NLS predicted by PSORT, exhibited the lowest level of import activity. As such, this signal may not function efficiently, although it is located within a region of E1A known to influence nuclear localization in infected cells . Thus, the yeast nuclear import system appears quite versatile and is capable of recognizing all the known functional NLSs within E1A, whether they represent canonical or non-canonical sequences. Furthermore, this system may be sensitive enough to uncover weak import sequences. However, one obvious caveat of this technology, or any other approach that uses small fragments of proteins for identifying sequences directing nuclear import, is that these activities may not function in the context of the full length folded protein. In many cases, the effect of mutating the putative NLS on nuclear localization of the full length protein could be tested, although this is not feasible in proteins such as E1A that contain multiple nuclear localization signals.
The availability of a simple and quantitative transcription based method for the identification and functional analysis of nuclear import in yeast provides a powerful alternative method for studying the complexities of nucleocytoplasmic transport. Although useful for analysis of individual proteins, the availability of this technology may allow high throughput analysis of nuclear import potential and could be readily applied to large scale screens of short random sequence to identify novel potential NLSs.
The construction of pNIA-2μ, pNIA-CEN and pNIA-CEN-MBP is illustrated in Fig. 1. Briefly, pNIA  was digested with EcoRI and ligated with a NotI linker to create JMB726 (A). JMB726 was digested with BamHI, blunt ended by fill in with the Klenow fragment of E. coli DNA polymerase I, and further digested with PstI. This fragment was ligated into a compatible HpaI-PstI digested pJG4-5 (Clontech, Mountain View, CA) containing the hemagglutinin (HA) epitope tag and polylinker to form JMB941 (B). A HindIII-EcoRI fragment from JMB941, encoding the mutant LexA-Gal4 activation domain-HA tag, was ligated into similarly digested pBAIT  to construct pNIA-2μ, which contains an expanded polylinker (C). pBAIT is a derivative of YEplac181 , which contains the ADH1 promoter and CYC1 terminator from pRS423ADH . A PvuII fragment encoding the entire nuclear import expression cassette, including transcriptional promoter and terminator was transferred from pNIA-2μ into YCplac111  to generate the single copy pNIA-CEN vector (D). Finally, an 1104 base pair stuffer fragment encoding a portion of the E. coli maltose binding protein was amplified by PCR and inserted in frame into the unique NotI site of pNIA-CEN to create pNIA-CEN-MBP, which expresses a 731 amino acid fusion with a molecular weight of 80.3 kDa (E). BamHI-PstI fragments corresponding to the full length 289R E1A protein or residues 1–82 were inserted in frame into pNIA. pNEA contains the SV40 large T antigen NLS inserted into the EcoRI site of pNIA . EcoRI-SalI or EcoRI-XhoI fragments corresponding to the full length 289R E1A protein, residues 1–82, 139–204, 187–289, 187–282, 201–218, 282–289 or the SV40 large T antigen NLS were inserted into the corresponding sites of pNIA-2μ, pNIA-CEN or pNIA-CEN-MBP as indicated in the text.
L40 strain yeast (Invitrogen, Carlsbad, CA) were transformed with pNIA or the derived plasmids by the lithium acetate procedure as described . Cells were plated on glucose supplemented synthetic complete medium lacking either tryptophan or leucine and grown at 30°C. Transformed yeast colonies were used to inoculate 5 ml of glucose supplemented synthetic complete medium lacking either tryptophan or leucine and grown overnight at 30°C with agitation. Cells were collected by centrifugation at 4000 × g for 30 seconds. β-galactosidase assays were performed in triplicate as described previously .
Western blot analysis of protein expression
Expression levels in yeast were determined by Western blot analysis. L40 strain yeast were transformed with pNIA or derivatives and plated on SC selective medium containing glucose. Colonies of transformed yeast were picked from each plate and used to inoculate 5 ml of liquid SC selective medium containing glucose, which were then grown in a cell culture rotator at 30°C overnight. Cells were harvested by centrifugation, washed with sterile distilled H2O, washed twice in 1 ml of buffer (20 mM HEPES pH 7.5), resuspended in 200 μl of buffer containing Complete Protease Inhibitor Cocktail (Roche Diagnostics, Laval, PQ) and transferred into fresh tubes containing 0.3 g of 425–600 micron acid-washed glass beads (Sigma-Aldrich, Oakville, ON). Cells were lysed by 3 cycles of vortexing for 3 minutes followed by 3 minutes of incubation on ice. After lysis, samples were centrifuged at 14,000 × g for 30 min. 200 μl of each supernatant were transferred to fresh microcentrifuge tubes, recentrifuged as above and the protein concentration of the supernatants was measured using Bio-Rad DC Protein Assay (Bio-Rad Laboratories, Mississauga, ON). 30 μg of protein from each extract were separated on NuPAGE 4–12% gradient SDS-polyacrylamide gels (Invitrogen, Carlsbad, CA) and transferred to a PVDF membrane (GE Healthcare Life Sciences, Piscataway, NJ). Western blot analyses were performed with the ECL Plus system (Amersham Pharmacia Biotech, Piscataway, NJ), using the 3F10 anti-HA rat monoclonal antibody (Roche Diagnostics, Laval, PQ) and goat anti-rat IgG horseradish peroxidase conjugated antibody (Pierce, Rockford, IL). Blots were processed concurrently and exposed to film for the same amount of time. Quantitation of protein expression was performed using ImageQuant (GE Healthcare Life Sciences, Piscataway, NJ).
Macara IG: Transport into and out of the nucleus. Microbiol Mol Biol Rev. 2001, 65: 570-594. 10.1128/MMBR.65.4.570-594.2001
Kalderon D, Richardson WD, Markham AF, Smith AE: Sequence requirements for nuclear location of simian virus 40 large-T antigen. Nature. 1984, 311: 33-38. 10.1038/311033a0
Robbins J, Dilworth SM, Laskey RA, Dingwall C: Two interdependent basic domains in nucleoplasmin nuclear targeting sequence: identification of a class of bipartite nuclear targeting sequence. Cell. 1991, 64: 615-623. 10.1016/0092-8674(91)90245-T
Nakai K, Horton P: PSORT: a program for detecting sorting signals in proteins and predicting their subcellular localization. Trends Biochem Sci. 1999, 24: 34-36. 10.1016/S0968-0004(98)01336-X
Enenkel C, Blobel G, Rexach M: Identification of a yeast karyopherin heterodimer that targets import substrate to mammalian nuclear pore complexes. J Biol Chem. 1995, 270: 16499-16502. 10.1074/jbc.270.28.16499
Frisch SM, Mymryk JS: Adenovirus-5 e1a: paradox and paradigm. Nat Rev Mol Cell Biol. 2002, 3: 441-452. 10.1038/nrm827
Lyons RH, Ferguson BQ, Rosenberg M: Pentapeptide nuclear localization signal in adenovirus E1a. Mol Cell Biol. 1987, 7: 2451-2456.
Kohler M, Gorlich D, Hartmann E, Franke J: Adenoviral E1A protein nuclear import is preferentially mediated by importin alpha3 in vitro. Virology. 2001, 289: 186-191. 10.1006/viro.2001.1151
Slavicek JM, Jones NC, Richter JD: A karyophilic signal sequence in adenovirus type 5 E1A is functional in Xenopus oocytes but not in somatic cells. J Virol. 1989, 63: 4047-4050.
Standiford DM, Richter JD: Analysis of a developmentally regulated nuclear localization signal in Xenopus. J Cell Biol. 1992, 118: 991-1002. 10.1083/jcb.118.5.991
Madison DL, Yaciuk P, Kwok RP, Lundblad JR: Acetylation of the adenovirus-transforming protein E1A determines nuclear localization by disrupting association with importin-alpha. J Biol Chem. 2002, 277: 38755-38763. 10.1074/jbc.M207512200
Richter JD, Young P, Jones NC, Krippl B, Rosenberg M, Ferguson B: A first exon-encoded domain of E1A sufficient for posttranslational modification, nuclear-localization, and induction of adenovirus E3 promoter expression in Xenopus oocytes. Proc Natl Acad Sci USA. 1985, 82: 8434-8438. 10.1073/pnas.82.24.8434
Quinlan MP, Whyte P, Grodzicker T: Growth factor induction by the adenovirus type 5 E1A 12S protein is required for immortalization of primary epithelial cells. Mol Cell Biol. 1988, 8: 3191-3203.
Luo M, Pang CW, Gerken AE, Brock TG: Multiple nuclear localization sequences allow modulation of 5-lipoxygenase nuclear import. Traffic. 2004, 5: 847-854. 10.1111/j.1600-0854.2004.00227.x
Rhee Y, Gurel F, Gafni Y, Dingwall C, Citovsky V: A genetic system for detection of protein nuclear import and export. Nat Biotechnol. 2000, 18: 433-437. 10.1038/74500
Gietz RD, Sugino A: New yeast-Escherichia coli shuttle vectors constructed with in vitro mutagenized yeast genes lacking six-base pair restriction sites. Gene. 1988, 74: 527-534. 10.1016/0378-1119(88)90185-0
Zhang Z, Smith MM, Mymryk JS: Interaction of the E1A oncoprotein with Yak1p, a novel regulator of yeast pseudohyphal differentiation, and related mammalian kinases. Mol Biol Cell. 2001, 12: 699-710.
Mumberg D, Müller R, Funk M: Yeast vectors for the controlled expression of heterologous proteins in different genetic backgrounds. Gene. 1995, 156: 119-122. 10.1016/0378-1119(95)00037-7
Gietz RD, Schiestl RH, Willems AR, Woods RA: Studies on the transformation of intact yeast cells by the LiAc/SS- DNA/PEG procedure. Yeast. 1995, 11: 355-360. 10.1002/yea.320110408
Adams A, Gottschling DE, Kaiser CA, Stearns T: Methods in yeast genetics. Edited by: Adams A, Gottschling DE, Kaiser CA and Stearns T. 1998, Plainview N.Y., Cold Spring Harbor Laboratory Press, 1997 Edition
We thank Jay Loftus for technical assistance and Dr. Peter Pelka for proofreading the manuscript. We thank Drs. Y. Rhee, V. Citovsky and D. Mangroo for generous gifts of plasmids and yeast strains. This work was supported by a grant from the National Sciences and Engineering Research Council of Canada.
All experimental procedures were carried out by KM, ZZ, JC and SD. JM contributed to the design of the study. All authors read and approved the final manuscript.
About this article
Cite this article
Marshall, K.S., Zhang, Z., Curran, J. et al. An improved genetic system for detection and analysis of protein nuclear import signals. BMC Molecular Biol 8, 6 (2007). https://doi.org/10.1186/1471-2199-8-6
- Nuclear Localization Signal
- Nuclear Import
- Nuclear Pore Complex
- Multicopy Plasmid
- ADH1 Promoter