Characterization of the octamer, a cis-regulatory element that modulates excretory cell gene-expression in Caenorhabditis elegans
© Mah et al; licensee BioMed Central Ltd. 2010
Received: 6 July 2009
Accepted: 8 March 2010
Published: 8 March 2010
We have previously demonstrated that the POU transcription factor CEH-6 is required for driving aqp-8 expression in the C. elegans excretory (canal) cell, an osmotic regulatory organ that is functionally analogous to the kidney. This transcriptional regulation occurs through a CEH-6 binding to a cis-regulatory element called the octamer (ATTTGCAT), which is located in the aqp-8 promoter.
Here, we further characterize octamer driven transcription in C. elegans. First, we analyzed the positional requirements of the octamer. To do so, we assayed the effects on excretory cell expression by placing the octamer within the well-characterized promoter of vit-2. Second, using phylogenetic footprinting between three Caenorhabditis species, we identified a set of 165 genes that contain conserved upstream octamers in their promoters. Third, we used promoter::GFP fusions to examine the expression patterns of 107 of the 165 genes. This analysis demonstrated that conservation of octamers in promoters increases the likelihood that the gene is expressed in the excretory cell. Furthermore, we found that the sequences flanking the octamers may have functional importance. Finally, we altered the octamer using site-directed mutagenesis. Thus, we demonstrated that some nucleotide substitutions within the octamer do not affect the expression pattern of nearby genes, but change their overall expression was changed. Therefore, we have expanded the core octamer to include flanking regions and variants of the motif.
Taken together, we have demonstrated that octamer-containing regions are associated with excretory cell expression of several genes that have putative roles in osmoregulation. Moreover, our analysis of the octamer sequence and its sequence variants could aid in the identification of additional genes that are expressed in the excretory cell and that may also be regulated by CEH-6.
The Caenorhabditis elegans excretory system is composed of four cells: the excretory duct cell, the bi-nucleate excretory gland cell, the excretory pore cell, and the excretory (canal) cell (EC). Each of these cells are descendents of the AB cell lineage . The EC in particular has a unique H-shaped structure consisting of two pairs of bilaterally-symmetrical projections that protrude anteriorly and posteriorly from the central cell body. The EC forms approximately 270 minutes after the first cellular division near the centre of the embryo . Subsequently, two processes extend dorso-laterally, which then bifurcate to form the anterior and posterior canal branches. By the end of the first larval stage, the EC canals have reached their full length relative to the length of the nematode . Further growth of the canals is influenced by their attachment to the hypodermis, which promotes extension of the EC canals following the first larval stage . Not surprisingly, because of similarities in their structures, the EC and neurons share many developmental cues that dictate elongation, guidance and outgrowth .
Even though the EC shares developmental cues with neurons, it has a distinct function. The EC plays a role in maintaining osmotic balance by collecting soluble organic and inorganic metabolic waste substances and expelling these to the environment . For C. elegans, osmoregulation is a critical function due to the continuous and unpredictable stresses placed upon worms in their native soil habitat. To facilitate the exchange of dissolved material, the fluid-filled EC arms are exposed to pseudocoelomic fluid. The pseudocoelomic fluid in turn is in contact with most of the cells in C. elegans, presumably having a function analogous with circulatory fluid. The pseudocoelomic fluid also provides the turgor for the hydrostatic skeleton of C. elegans, which is essential for locomotion. In addition to maintaining osmotic balance, the excretory system is responsible for secreting hormones  and fluids required for molting . Notably, the osmoregulatory function of the EC resembles the role of the mammalian kidneys. Thus, characterizing conserved mechanisms that govern EC transcription may provide insight into regulatory circuits that control kidney-specific transcription.
We are interested in the mechanisms that govern transcriptional regulation in the EC of C. elegans. Our laboratory has previously characterized two distinct transcriptional regulatory mechanisms affecting EC gene expression. One of these mechanisms relies upon binding of the transcriptional regulator DCP-66 to the Ex-1 cis- regulatory element (CCATACATTA). Together, DCP-66 and Ex-1 drive EC-exclusive expression of pgp-12, an ABC transporter-encoding gene . However, DCP-66 is a component of the transcriptional inhibitory nucleosomal remodeling and deacetylase (NuRD) complex, which is typically associated with gene repression . The second mechanism, involves CEH-6, a class III POU homeobox transcription factor (TF), which binds to a cis- regulatory element called the octamer (ATTTGCAT) . We originally characterized the octamer as an element required for EC-expression of the aquaporin-encoding gene aqp-8. Of note, CEH-6 is also expressed in the excretory cell (EC), thus fulfilling the spatial requirements of a TF responsible for EC-selective transcription. Additionally, CEH-6 is detected in the bilaterally symmetric neurons (RMDDLR, RMDVLR, AUALR and AVHLR), P.na cells (ventral nerve cord), five rectal cells (B, Y, U, F and K) , two tail nerve cells, ventral hypoderm, anterior body wall muscle, body wall muscle cells, and intestine . The broad expression of CEH-6 indicates that it likely regulates transcription in a several ectodermal cells as well.
Because CEH-6 interacts with the octamer to drive aqp-8 expression in the EC, we wanted to determine whether the octamer is generally linked to EC-expression. If so, octamer regulated genes could represent novel candidates that function in the EC. Furthermore, we also attempted to define the role of the octamer cis-regulatory element as a driver of EC-selective transcription. Specifically, we determined whether the octamer is under spatial restrictions within promoter regions, identified genes that require the octamer for EC expression, and identified variants of the octamer that are able to drive EC-expression. Thus, this work identified a set of candidate genes that could be relevant to kidney function in vertebrates. Overall, our data reinforce the role of octamers and presumably their cognate transcription factors such as CEH-6 in directing osmoregulatory organ gene expression.
The Location of the Octamer Sequence in the Promoter can be Flexible
We observed that the octamer was not able to drive EC expression in constructs that contained less than 448 bp upstream of the vit-2 ATG. However, these constructs retained the ability to drive intestinal expression indicating that the transgene was successfully generated and functional (Figure 1B). Placing the octamer at the 5'-end of vit-2 promoter constructs larger than 448 bp upstream of the ATG led to ectopic GFP expression in the EC (Figure 1C). An exception to this was from a transgene containing octamers 652 bp upstream of vit-2's ATG, which did not drive assayable levels of GFP. This transgenic strain drove intestinal GFP, but failed to drive GFP in the EC. The lack of EC-expression indicates that there may be a cis-linked element located between 600 bp and 652 bp upstream of the vit-2 ATG that represses EC expression. Alternatively, it is possible that this transgene construct formed a concatemer in vivo that is incompatible with the expression of the GFP reporter. This second scenario is unlikely because we did observe intestinal expression. In any case, these data suggest that octamer function may be spatially restricted in some promoters. Taken together, although the above data did not allow us to conclude whether the octamer has an optimal distance from the ATG for influencing EC-expression, the data do suggest that functional octamers can be present at various places in different promoters.
Many EC-expressed Genes are not regulated by Octamers
Because functional octamers may be located at various distances upstream of the ATG, we searched for genes with upstream octamers within 1,200 bp upstream of their ATGs; WormBase (WS195). Thus, we identified 1,340 candidate genes including promoters that contain either the forward and/or the inverse of the octamer (ATGCAAAT). To assess the function of these octamers, we selected genes from this set that are expressed in the EC [14, 15]. In addition, we assessed the function of the octamer in the promoters of hlh-8 and ZC395.10, genes with no known EC-expression. For these seventeen genes, we tested if regions containing the octamer are required for EC-expression by truncating the promoter from the 5'-end and observing whether the octamer regions affect the level of EC expression (Additional file 1). However, we could not conclude whether the octamers were functional in promoters of Y69E1A.6, F36H1.2, Y48A6B.8, F29F11.6, B0334.4, C02B8.6, H23N18.3, R13F6.3, and, Y53G8AR.3 because EC expression was lost in worms carrying promoter constructs that still had the octamer, suggesting that there are other cis- linked elements that drive EC-expression of these genes (Additional file 1). Amongst the other eight genes, our promoter truncation analysis revealed three genes that are likely dependent upon the octamer containing region for EC expression: ZC395.10, C01B12.3, and hlh-8/C02B8.4. These results are not surprising as previous reports indicate that 12% of EC-expressed genes are predicted to be regulated by the DCP-66/Ex-1 mechanism  Additionally, the nuclear hormone receptor NHR-31 regulates most of the vacuolar ATPase (vATPase) components in the EC  pointing to multiple regulatory mechanisms that specify EC-expression.
An octamer is located 1,055 bp upstream of the ATG in C01B12.3's promoter region. A transcriptional reporter construct with a 5'-end 2,853 bp upstream of the ATG leads to expression in the EC (Figure 2C), hypoderm, spermatheca, and the anal depressor muscle [14, 15]. However, the EC-expression level is greatly decreased when the promoter is truncated (879 bp upstream of the ATG) corresponding to removal of a large portion of the promoter and including an octamer (Figure 2D). Interestingly, the orthologs of C01B12.3 in C. briggsae and C. remanei also have octamers in their promoters. Moreover, the distance of the octamers from the ATG are similar (1,108 bp and 1,100 bp upstream of the ATG respectively), suggesting that this element might also regulate EC-expression of C01B12.3 orthologs.
The hlh-8 promoter contains an octamer located 582 bp upstream of the ATG. It was previously shown that hlh-8 is expressed in the intestine, anterior neurons, and vulva [14, 15]. A 5' truncation that limits the promoter to only seven bases upstream of the octamer (589 bp upstream of the ATG) results in expression localized to the EC and the second pharyngeal bulb (Figure 2E). This change in expression pattern indicates the likely presence of a repressor element(s) that blocks EC and pharyngeal expression. Expression of hlh-8 in the EC is completely abolished upon deletion of the octamer-containing fragment. In sum, our promoter deletion analysis identified octamer-containing promoter regions required for the EC-specific expression for the above three genes.
Conserved Octamer sequences in Promoter Regions Strongly Bias for EC Expression
Due to the low success rate in finding octamer containing regions associated with EC-expression with the above approach (only 3/17), we turned to phylogenetic footprinting. Using this comparative method, we identified promoters with perfectly conserved octamers between three closely related Caenorhabditis species (C. elegans, C. briggsae, and C. remanei). This resulted in the identification of 165 promoters that contain conserved octamers (Additional file 2).
Of the 165 genes, nineteen genes had previously characterized expression patterns [14, 15]. To obtain a larger sample size, we analyzed the expression patterns of an additional 88 candidates (Additional file 3). From these 107 promoters, we identified 64 that could drive detectable levels of GFP expression, including 25 (39%) that were EC-expressed. This represents a significant enrichment of genes expressed in the EC when compared to a control dataset of 1,885 expression patterns, within which only 10.2% (193/1,885) of all genes are expressed in the EC [14, 15]. Strikingly, twelve genes (19%) in our set were expressed only in the EC. This is a vast enrichment over the control set which contains only 0.3% (6/1,885) genes with EC-exclusive expression [14, 15] (significance P < 0.01 as determined by 1-tailed Z-test).
Sequences Flanking Functional Octamers are Likely Conserved
Intra-octamer Nucleotide Substitutions Have Different Effects on EC Expression
At -264 bp, a G→A residue change (aqp-8promoter(-264G→A)::GFP) did not alter the expression level or pattern (Figure 5B). However, a G→T change (aqp-8promoter(-264G→T)::GFP led to a significant decrease in the EC expression level (Figure 5C). A G→C substitution (aqp-8promoter(-264G→C)::GFP) led to a complete loss of EC-expression. At -263 bp, a C→A residue substitution in aqp-8promoter(-263C→A)::GFP led to decreased EC-expression (Figure 5D). The C→G and C→T substitutions (aqp-8promoter(-263C→G)::GFP and aqp-8promoter(-263C→T)::GFP) both led to a complete losses of GFP expression. Finally, replacing the GC pair at -264 with an AG pair (aqp-8promoter(-264GC→AG)::GFP) led to decreased EC-expression (Figure 5E). In all constructs, the GFP signal remained localized to the EC suggesting that some octamer sequence variants have the capacity to influence EC expression.
In this study, we demonstrate that octamer containing regions are involved in driving EC-expression of several genes. This builds upon our previous data demonstrating that aqp-8 is dependent upon the POU homeobox TF CEH-6 and the octamer for EC-expression . In several nematodes, the position of the octamer relative to the ATG is fairly well conserved among aqp-8 orthologs . This is interesting as some cis- regulatory elements are spatially restricted within promoters. For example, in C. elegans functional X-box motifs cluster roughly 100 bp upstream of ATGs to drive neuronal expression . Likewise, the GC-box cis- regulatory element has an optimal distance in relation to the TATA box. Moving the GC-box away from its optimal distance leads to decreased expression levels of nearby genes even though its cognate TF, Sp1, still bind with similar affinities . In addition to spatial restriction relative to ATGs, relative positioning between cis- regulatory elements within the same promoter can affect expression. For instance, the β-actin promoter contains the so-called CCAAT and CCArGG boxes. These regulatory elements are binding sites for the TFs nuclear factor Y (NF-Y) and serum response factor (SRF), respectively. Manipulating the intra-element distance between these two cis- regulatory sequences accordingly affects β-actin message levels . However, in the present study we had also observed that the octamer could be placed at different positions in a heterologous promoter and still drive EC-specific expression. Therefore, unlike the above examples, the octamer does not appear to be as spatially restricted within promoter although there still might be some tight limitations to octamer location.
In this study, we used different strategies to identify novel genes that require upstream octamer containing regions for expression in the EC. First, we identified genes with octamer sequences in their promoters. Using this strategy we identified a small number of genes that are modulated by upstream octamer-containing fragments. Secondly, we used a more stringent approach to identify octamers by relying on interspecies conservation. We discovered that the expression patterns of genes in this filtered promoter set had a higher than expected incidence of EC-expression. In these two screens, we identified nine genes that are likely octamer-modulated. Several of these genes likely have osmoregulatory functions, agreeing with the notion the EC is analogous to the kidney. In general, the genes we identified fell into four categories: transmembrane channels/pores, Hsp90 co-chaperones, proteins with unknown functions, and TFs.
The largest group of genes encodes transmembrane channels/pores, indicating that many genes regulated by the octamer participate in substrate transport. The five channels/pores are:
1) twk-36 encodes a C. elegans TWIK potassium ion channel protein. In vertebrates, TWIKs are commonly expressed in neuronal tissues and, to a lesser extent in lungs, skeletal muscle , and tubular portions of the kidney (proximal tubule, ascending limbs, distal convoluted tubules, and medullary collecting duct) . The mammalian TWIK, TASK, is sensitive to changes in extracellular pH, indicating that some of these proteins have roles in modulating cellular responses to pH flux . Also, because their conductance is osmotically regulated, TWIKs can influence cellular volume . The C. elegans genome has 42 twk genes. As in other organisms, most of the C. elegans twks are expressed in neurons ; however, twk-36 is the only twk expressed in the EC . Interestingly, another group has demonstrated that twk-36 is directly regulated by CEH-6 . This independent study strengthens the notion CEH-6 is a bona fide regulator of EC expression that likely acts through the octamer in the promoter of twk-36. Therefore, it is possible that CEH-6 regulation impacts at least some of the octamer-dependent candidates identified here.
3) C05D12.1 is a homolog of the cytochrome b561/ferric reductase SDR-2. In mammals, SDR-2 is expressed in the brain  and kidney where it aids in iron reabsorption via the accessory transporter, divalent-cation transporter 1 (DCT-1) . Cytochrome b561 proteins transport electrons in an ascorbate-dependent manner. Due to the role of SDR-2 in ascorbate regeneration, C05D12.1 could be involved in vitamin C homeostasis and/or oxidative stress responses.
4) R02F2.8 encodes a solute carrier (SLC) protein that is most similar to the mammalian SLC36 subfamily. SLC36 proteins are localized to intracellular and plasma membranes  where they function as symporters. SLC36 proteins transport small neutral amino acids such as glycine, alanine, and proline. Because SLC36 proteins affect proton flux, they also contribute to intracellular pH homeostasis . In mammals there are four SLC36 genes, two of which are expressed in the kidney (SLC36A1 and SLC36A2) .
5) C01B12.3 encodes a C. elegans Bestrophin 3 homolog. Bestrophins are transmembrane proteins that modulate calcium dependent transport of chloride ions across cellular membranes. Bestrophins are enriched in the plasma membranes of epithelial cells where they manage cellular volume . Bestrophins are also expressed in exocrine gland tissues (e.g. pancreas, lacrimal and salivary glands), lung, testis and kidney . In these tissues they facilitate trans-epithelial movement of chloride ions leading to water and electrolyte movement .
In addition to transmembrane channels and pores, we uncovered several other genes that have less obvious links to osmoregulation and kidney biology. One of these, ZC395.10, is homologous to thehighly conserved Hsp90 co-chaperone protein, p23. p23 interactions with Hsp90 to ensure the proper folding and maturation of many proteins including steroid receptors , telomerase , and proteins that are upregulated in cancers . We also identified M176.5, a gene with little prior functional data. M176.5 is a nematode-specific gene that is mainly composed of hydrophobic amino acids and is therefore likely to be localized to cell membranes and/or forms a globular protein.
Finally, we identified two TFs. The first, F16F9.1 is a homolog of the mammalian protein lipopolysaccharide-induced tumor necrosis factor-alpha factor (LITAF; a.k.a. SIMPLE/Small Integral Membrane Protein of Lysosome/Late Endosome) . LITAF is linked to Charcot-Marie-Tooth (CMT1C) disease, a heritable neuropathy characterized by loss of muscle tissue and touch sensation . In CMT1C, LITAF is implicated in protein degradation . LITAF also functions in cytokine production . We speculate that because F16F9.1 is expressed in the EC, a tissue exposed to the environment, it could be involved in innate immune responses. Also, because C. elegans F16F9.1 is expressed in neurons, it is possible that the nematode could act as a model for CMT1C.
The other TF we identified is hlh-8, a helix-loop-helix TF related to human TWIST. TWIST was originally characterized in Drosophila as a gene involved in dorsal-ventral patterning . TWIST TFs bind E-box cis- regulatory elements. In C. elegans, HLH-8 is important for regulating muscle, intestinal and anal muscle development. Consequently hlh-8 mutants exhibit defecation and egg-laying defects . Several transcriptional targets of HLH-8 are known, including cdh-4, egl-15, C18B12.6, F08D12.7, rbc-1, npr-10, dhs-5, sgcb-1, erv-46, M60.6, R02E4.1, rev-1, and myo-3 . Most of these genes are unlikely to be transcriptional targets of HLH-8 in the EC as they are not expressed in this cell. An exception is cdh-4 [14, 15], which encodes a widely expressed cadherin, which is also expressed in the EC. Due to the limited number of HLH-8 targets in the EC, we can envisage a model where CEH-6 plays a role in directing the precise transcriptional outcomes of downstream TFs (e.g. hlh-8). In this role, CEH-6 could modulate target genes (e.g. cdh-4) specifically in a subset of ectodermal tissues including the EC.
Several of the genes that depend on their upstream octamer containing fragments for EC expression are also expressed in additional tissues. An interesting consequence of assaying the activity of truncated promoters is that loss of the octamer containing fragment sometimes led to loss of expression in multiple tissues including neurons as indicated in the cases of ZC395.10, twk-36, M176.5, and F16F9.1 (Additional file 4). This is not surprising as vertebrate orthologs of CEH-6 including Brn1 are involved in neuronal and kidney development [44, 45].
Our strategy for studying the above genes involved comparing the expression patterns resulting from promoter truncations that either contain or remove the octamer. Analyzing promoter function by means of these truncations imposes some significant drawbacks; for example, we could not address the function of several candidate octamer elements because removing regions upstream of the octamer led to loss of EC expression. Also, because our promoter truncations deleted the octamer and some flanking regions, we cannot be certain that loss of EC-selective expression is the consequence of removing the octamer. However, because the 5' ends of the truncations were in general fairly close to the octamer, and because loss of EC-expression correlated with octamer deletion, we believe that these genes are likely dependent on octamers for their expression in the EC. To address the issue of whether these are indeed functional octamers, one could introduce point mutations within the octamer and assess the resulting consequences on EC-expression. However, we demonstrated in our mutagenesis experiments that certain point mutations are not sufficient to abolish EC expression in the context of the aqp-8 promoter. Another potential drawback from our approach is the fact that much of our study is based on expression patterns arising from transgenic C. elegans strains containing extrachromosomal arrays. Such arrays are susceptible to somatic transgene loss, which results in mosaic reporter expression. This mosaiscism has the potential to confound our analysis by under-representing the expression pattern. However, expression patterns resulting from genome-integrated transgenes (aqp-8promoter::GFP) are identical to the expression patterns in aqp-8promoter::GFP strains carrying extrachromosomal transgenes. Therefore, mosaic loss of the extrachromosomal transgene array is not likely an issue for analyzing changes in EC expression. Despite the potential shortcomings detailed above, our study revealed a set of genes whose expression likely depends on the octamer for expression in the EC; these genes are excellent CEH-6 candidate targets. In agreement with this notion, twk-23, one of the genes that we identified as dependent on an upstream octamer fragement, has been demonstrated, independently, to be regulated by CEH-6 .
Because our bioinformatic search did not bias the direction of the octamer, we discovered promoters in the forward and inverted orientation can be associated with EC-expression. Octamers in a forward orientation occur in the promoters of aqp-8, M176.5, twk-36, ZC395.10, hlh-8, C01B12.3, and C05D12.1, whereas inverted octamers are present in the promoters of F16F9.4 and R02F2.8. Although we could not determine from our small sample set whether the direction of the octamer has a functional consequence, there are possible implications related to direction of the octamer sequence. For example, octamers upstream of immunoglobulin light and heavy chain genes have directional preferences (ATGCAAAT and ATTTGCAT respectively) . In addition, the human POU homeobox gene Oct1 is auto-regulated by two upstream octamers, which are also situated in inverted orientations. Although each site binds Oct1 with equal affinity, each of these sites has different effects on Oct1 expression .
It appears that transcriptional auto-regulation is a common mode of regulation among POU TFs [47–49]. Interestingly, we detected an octamer upstream of ceh-6's ATG in C. elegans. Likewise, there is an octamer located in the regulatory region of the C. briggsae gene encoding a putative CEH-6 ortholog, providing evidence that auto-regulation might be conserved. However, these octamer are located within a predicted non-coding RNA gene (class RNAz) . Nevertheless, it would be interesting to determine whether CEH-6, like other POU TFs is auto-regulated.
With our set of nine candidate octamers, we had the opportunity to determine whether residues flanking the element are conserved. Globally, the G/C content of C. elegans is 31% . However, the regions flanking the octamers-associated with EC-expression contain a slightly higher G/C content (38%). Our alignments of these octamers revealed that despite the higher G/C content, some positions have preferences for A/T residues. Strikingly, directly 3' to the octamer, an A or T is always present. The conservation of this residue is consistent with the results of a previous study, which identified Oct1 binding preferences using a Systematic Evolution of Ligands by Exponential Environment (SELEX)-based in vitro binding approach . Because of the enriched G/C content in octamer adjacent regions, it is likely that the observed preference for A/T at certain positions have relevance.
We tested the effects of targeted-octamer mutations on EC-expression. We found that several single-base substitutions did not affect the cis- regulatory element's ability to drive expression in the EC. Interestingly, we did not observe a change in expression level or pattern when position five of the octamer was mutated from a purine to purine (ATTTG CAT→ATTTA CAT). This variant of the octamer was demonstrated to be a binding site for the catfish class III POU TF, Oct2 . Therefore, this residue change results in an octamer variant which retains the ability to interact with the POUS sub-domain binding consensus sequence (TG(C/A)ATattc) . At the same residue position, a thymine replacement (ATTTG CAT→ATTTT CAT), led to weak GFP expression also restricted to the EC. This sequence was able to bind to Oct1 in vitro in an electrophoretic mobility shift assay (EMSA) . However, in another study this variant of the octamer was not able to drive reporter expression in human cells . These results, taken together, indicate that this motif variant is a sub-optimal POU TF binding site that can drive weak EC expression in C. elegans. A mutation at the sixth residue (ATTTGC AT→ATTTGA AT) also led to weak EC-localized expression . This octamer variant is functional in the promoter of the Drosophila gene, Choline Acetyltransferase (ChAT). In the ChAT promoter, ATTTGAAT interacts with the POU homeobox TF, dPOU-19 . All other single nucleotide substitutions at these two locations led to loss of GFP expression. However, a double residue replacement of these residues (ATTTGC AT→ATTTAG AT) could still drive expression, as indicated by weak GFP expression in the EC. There have been no previous reports of this dual nucleotide substitution variant associating with POU TFs and it is therefore a novel POU TF binding site variant. Overall, our mutagenesis assays indicate that the octamer cis- regulatory element could have a range of functional variants in C. elegans. Thus, identifying and characterizing promoters containing these octamer variants may reveal a larger group genes expressed in the EC.
Because variants of the octamer can influence EC-expression, we examined the pgp-12 promoter region in C. elegans more closely. Previously, it was demonstrated that pgp-12 expression is regulated by DCP-66/Ex-1 . In this report, the TF/cis- regulatory element interaction was confirmed using in vitro and genetic approaches. Loss of either Ex-1 (located 238 bp upstream pgp-12's ATG) or DCP-66 resulted in loss of EC expression . We detected an octamer like sequence (ATTTCCAT) that partially overlaps the Ex-1 (-241 bp). We also identified this octamer-like sequence in the orthologous regions of C. briggsae. Using the Transcriptional Element Search System (TESS)  to identify predicted cis- regulatory elements, we found that the octamer-like sequence is indeed a potential target for octamer binding proteins. Additionally this sequence binds Oct1 in vitro . In fact, Zhao et al. reported that promoter constructs encompassing Ex-1 at -241 bp results in strong reporter expression during all developmental stages . They also studied the expression pattern resulting from a promoter region defined by a 5'-end 238 bp upstream of the ATG, thereby removing three nucleotides from the octamer-like sequence. Although this promoter still drove EC-expression, the intensity of the GFP reporter was greatly decreased in adult and larval worms. Additionally, embryonic expression was almost eliminated. Finally, a construct with a 5'-end 228 bp upstream of the ATG was not able to drive expression of GFP indicating the necessity of the Ex-1 (and the octamer-like sequence) for EC expression. Therefore, not only did this prior study define the role of the Ex-1 for EC-expression, but it also indirectly provided evidence that the octamer upstream of pgp-12 might affect EC-expression. This suggests that the DCP-66/Ex-1 and the octamer-directed transcriptional regulatory mechanisms co-operatively modulate the expression of pgp-12 in the EC. This model of concerted and redundant regulation of EC-expression may have relevance in other genes including, possibly, several genes within our phylogenetically defined candidate set.
Overall, we determined that the octamer is likely responsible for the expression of several genes within the EC, an osmoregulatory organ analogous to the kidney. Because one of our candidate genes, twk-36, has been demonstrated to be a bona fide target of CEH-6 regulation, it would be interesting to determine whether CEH-6 is involved in the regulation of four other candidates. Although our candidates, for the most part, were chosen based upon perfect conservation of the octamer, we determined that several variants of the octamer can drive EC-expression. The existence of functional octamer variants indicates that future searches for octamer-driven genes should use a loosely defined octamer sequence. Overall, understanding conserved mechanisms of gene regulation that determine appropriate EC expression may provide insight into underlying transcriptional mechanisms that regulate transcription in analogous organs including the kidney.
Nematode strains and maintenance
C. elegans strains were maintained at 20°C on nematode growth media (NGM) plates inoculated with E. coli OP50. All manipulations were conducted using standard procedures . For the list of promoter::reporter constructs used in this study, refer to Additional file 5.
Generation of transgene constructs and strains
DNA constructs were generated using fusion PCR as previously described . Promoter-containing sequences were fused upstream of the GFP-coding region in the pPD95.67 GFP-coding cassette. The octamer::vit-2 promoter ::GFP chimeric constructs were generated by PCR as follows. The forward PCR primers contain three tandem repeats of the octamer at the 5' end of a vit-2-promoterspecific sequence. The right primer of the vit-2 chimeric promoter constructs remained consistent between strains (vit2reverse -AGT CGA CCT GCA GGC ATG CAA GCT CGA CCT GAT GGC TGA ACC G). The chimeric promoters were fused to the GFP-coding region in the pPD95.67. The mutagenized octamer constructs were generated by substituting target nucleotides in the forward PCR primer.
All C. elegans microinjections were conducted on either an Olympus BH2-HLSH or a Zeiss 47 3016 invert microscope. The PCR constructs were injected into the syncitial region of the gonad. The final concentrations of the injection mix are 30 ng/μl of the target construct along with 100 ng/μl of the marker construct, pCeh361 (dpy-5(+)) , into the target strain dpy-5(e907) (CB907). Transgenic F1s (Dpy-5 rescued) were individually plated. Wild type F2 lines were selected to establish the transgenic lines. When available, we analyzed a second independently segregating transgenic line.
Identification of all C. elegans genes with upstream octamers
All genes containing an octamer (ATTTGCAT or ATGCAAAT) within 1,200 bp upstream of a protein-coding gene in C. elegans were identified in WormBase, WS195.
Identification of all genes with interspecies conserved upstream octamer
1,000 bp upstream of the ATG of all orthologous gene groups in the nematodes:C. elegans, C. briggsae, and C. remanei (WormBase WS195), were searched for the presence of octamers. A C. elegans promoter was considered if its C. briggsae and C. remanei counterparts both contain one or more upstream octamers. Sequences flanking the ATGs from these three Caenorhabditis species and the predicated motifs were loaded into a MySQL database using the GFF3 format http://www.sequenceontology.org/gff3.shtml. The comparative analysis was performed in programs written in Perl. We used the Bio::DB::GFF Perl module .
All GFP-expression analyses were conducted on a Zeiss Axioscope equipped with a QImaging camera and the appropriate GFP optical filter sets. Worms were immobilized with 100 mM sodium azide (in water) immediately prior to imaging. All images were captured at 400× with identical camera and fluorescence settings for all images (exposure times are indicated in the Figures) using QCapture software. The GFP images from each transgenic strain are representative of their populations.
The authors thank the members of our laboratories (DLB and NSC) for productive discussions. A. K. Mah is supported by an NSERC doctoral scholarship. J. S. Chu is supported by an NSERC doctoral scholarship. N. S. Chen is supported by a grant from NSERC Canada and a Faculty start-up fund provided by Simon Fraser University. D. L. Baillie is a Canada Research Chair in Genomics and is supported by grants from NSERC and CIHR. Additionally, we would like to thank Stefan Taubert for productive comments on the manuscript.
- Sulston JE, Horvitz HR: Post-embryonic cell lineages of the nematode, Caenorhabditis elegans. Dev Biol. 1977, 56 (1): 110-156. 10.1016/0012-1606(77)90158-0View ArticlePubMedGoogle Scholar
- Sulston JE, Schierenberg E, White JG, Thomson JN: The embryonic cell lineage of the nematode Caenorhabditis elegans. Dev Biol. 1983, 100 (1): 64-119. 10.1016/0012-1606(83)90201-4View ArticlePubMedGoogle Scholar
- Buechner M: Tubes and the single C. elegans excretory cell. Trends Cell Biol. 2002, 12 (10): 479-484. 10.1016/S0962-8924(02)02364-4View ArticlePubMedGoogle Scholar
- Buechner M, Hall DH, Bhatt H, Hedgecock EM: Cystic canal mutants in Caenorhabditis elegans are defective in the apical membrane domain of the renal (excretory) cell. Dev Biol. 1999, 214 (1): 227-241. 10.1006/dbio.1999.9398View ArticlePubMedGoogle Scholar
- Haskins WT, Weinstein PP: Nitrogenous excretory products of Trichinella spiralis larvae. J Parasitol. 1957, 43 (1): 19-24. 10.2307/3274744View ArticlePubMedGoogle Scholar
- Davey KG, Kan SP: Molting in a parasitic nematode, Phocanema decipiens. IV. Ecdysis and its control. Can J Zool. 1968, 46 (5): 893-898. 10.1139/z68-125View ArticlePubMedGoogle Scholar
- Nelson FK, Albert PS, Riddle DL: Fine structure of the Caenorhabditis elegans secretory-excretory system. J Ultrastruct Res. 1983, 82 (2): 156-171. 10.1016/S0022-5320(83)90050-3View ArticlePubMedGoogle Scholar
- Zhao Z, Fang L, Chen N, Johnsen RC, Stein L, Baillie DL: Distinct regulatory elements mediate similar expression patterns in the excretory cell of Caenorhabditis elegans. J Biol Chem. 2005, 280 (46): 38787-38794. 10.1074/jbc.M505701200View ArticlePubMedGoogle Scholar
- Xue Y, Wong J, Moreno GT, Young MK, Cote J, Wang W: NURD, a novel complex with both ATP-dependent chromatin-remodeling and histone deacetylase activities. Mol Cell. 1998, 2 (6): 851-861. 10.1016/S1097-2765(00)80299-3View ArticlePubMedGoogle Scholar
- Mah AK, Armstrong KR, Chew DS, Chu JS, Tu DK, Johnsen RC, Chen N, Chamberlin HM, Baillie DL: Transcriptional regulation of AQP-8, a Caenorhabditis elegans aquaporin exclusively expressed in the excretory system, by the POU homeobox transcription factor CEH-6. J Biol Chem. 2007, 282 (38): 28074-28086. 10.1074/jbc.M703305200View ArticlePubMedGoogle Scholar
- Burglin TR, Ruvkun G: Regulation of ectodermal and excretory function by the C. elegans POU homeobox gene ceh-6. Development. 2001, 128 (5): 779-790.PubMedGoogle Scholar
- Reece-Hoyes JS, Shingles J, Dupuy D, Grove CA, Walhout AJ, Vidal M, Hope IA: Insight into transcription factor gene duplication from Caenorhabditis elegans Promoterome-driven expression patterns. BMC Genomics. 2007, 8: 27- 10.1186/1471-2164-8-27View ArticlePubMedPubMed CentralGoogle Scholar
- MacMorris M, Broverman S, Greenspoon S, Lea K, Madej C, Blumenthal T, Spieth J: Regulation of vitellogenin gene expression in transgenic Caenorhabditis elegans: short sequences required for activation of the vit-2 promoter. Mol Cell Biol. 1992, 12 (4): 1652-1662.View ArticlePubMedPubMed CentralGoogle Scholar
- McKay SJ, Johnsen R, Khattra J, Asano J, Baillie DL, Chan S, Dube N, Fang L, Goszczynski B, Ha E, et al: Gene expression profiling of cells, tissues, and developmental stages of the nematode C. elegans. Cold Spring Harb Symp Quant Biol. 2003, 68: 159-169. 10.1101/sqb.2003.68.159View ArticlePubMedGoogle Scholar
- Hunt-Newbury R, Viveiros R, Johnsen R, Mah A, Anastas D, Fang L, Halfnight E, Lee D, Lin J, Lorch A, et al: High-throughput in vivo analysis of gene expression in Caenorhabditis elegans. PLoS Biol. 2007, 5 (9): e237- 10.1371/journal.pbio.0050237View ArticlePubMedPubMed CentralGoogle Scholar
- Hahn-Windgassen A, Van Gilst MR: The Caenorhabditis elegans HNF4alpha Homolog, NHR-31, mediates excretory tube growth and function through coordinate regulation of the vacuolar ATPase. PLoS Genet. 2009, 5 (7): e1000553- 10.1371/journal.pgen.1000553View ArticlePubMedPubMed CentralGoogle Scholar
- Schneider TD, Stephens RM: Sequence logos: a new way to display consensus sequences. Nucleic Acids Res. 1990, 18 (20): 6097-6100. 10.1093/nar/18.20.6097View ArticlePubMedPubMed CentralGoogle Scholar
- Efimenko E, Bubb K, Mak HY, Holzman T, Leroux MR, Ruvkun G, Thomas JH, Swoboda P: Analysis of xbx genes in C. elegans. Development. 2005, 132 (8): 1923-1934. 10.1242/dev.01775View ArticlePubMedGoogle Scholar
- Wu L, Berk A: Constraints on spacing between transcription factor binding sites in a simple adenovirus promoter. Genes Dev. 1988, 2 (4): 403-411. 10.1101/gad.2.4.403View ArticlePubMedGoogle Scholar
- Danilition SL, Frederickson RM, Taylor CY, Miyamoto NG: Transcription factor binding and spacing constraints in the human beta-actin proximal promoter. Nucleic Acids Res. 1991, 19 (24): 6913-6922. 10.1093/nar/19.24.6913View ArticlePubMedPubMed CentralGoogle Scholar
- Lesage F, Lauritzen I, Duprat F, Reyes R, Fink M, Heurteaux C, Lazdunski M: The structure, function and distribution of the mouse TWIK-1 K+ channel. FEBS Lett. 1997, 402 (1): 28-32. 10.1016/S0014-5793(96)01491-3View ArticlePubMedGoogle Scholar
- Nie X, Arrighi I, Kaissling B, Pfaff I, Mann J, Barhanin J, Vallon V: Expression and insights on function of potassium channel TWIK-1 in mouse kidney. Pflugers Arch. 2005, 451 (3): 479-488. 10.1007/s00424-005-1480-9View ArticlePubMedGoogle Scholar
- Duprat F, Lesage F, Fink M, Reyes R, Heurteaux C, Lazdunski M: TASK, a human background K+ channel to sense external pH variations near physiological pH. Embo J. 1997, 16 (17): 5464-5471. 10.1093/emboj/16.17.5464View ArticlePubMedPubMed CentralGoogle Scholar
- Niemeyer MI, Cid LP, Barros LF, Sepulveda FV: Modulation of the two-pore domain acid-sensitive K+ channel TASK-2 (KCNK5) by changes in cell volume. J Biol Chem. 2001, 276 (46): 43166-43174. 10.1074/jbc.M107192200View ArticlePubMedGoogle Scholar
- Salkoff L, Butler A, Fawcett G, Kunkel M, McArdle C, Paz-y-Mino G, Nonet M, Walton N, Wang ZW, Yuan A, et al: Evolution tunes the excitability of individual neurons. Neuroscience. 2001, 103 (4): 853-859. 10.1016/S0306-4522(01)00079-3View ArticlePubMedGoogle Scholar
- Armstrong KR, Chamberlin HM: Coordinate regulation of gene expression in the C. elegans excretory cell by the POU domain protein CEH-6. Mol Genet Genomics. 283 (1): 73-87.
- Huang CG, Lamitina T, Agre P, Strange K: Functional analysis of the aquaporin gene family in Caenorhabditis elegans. Am J Physiol Cell Physiol. 2007, 292 (5): C1867-1873. 10.1152/ajpcell.00514.2006View ArticlePubMedGoogle Scholar
- Ponting CP: Domain homologues of dopamine beta-hydroxylase and ferric reductase: roles for iron metabolism in neurodegenerative disorders?. Hum Mol Genet. 2001, 10 (17): 1853-1858. 10.1093/hmg/10.17.1853View ArticlePubMedGoogle Scholar
- Ferguson CJ, Wareing M, Ward DT, Green R, Smith CP, Riccardi D: Cellular localization of divalent metal transporter DMT-1 in rat kidney. Am J Physiol Renal Physiol. 2001, 280 (5): F803-814.PubMedGoogle Scholar
- Boll M, Daniel H, Gasnier B: The SLC36 family: proton-coupled transporters for the absorption of selected amino acids from extracellular and intracellular proteolysis. Pflugers Arch. 2004, 447 (5): 776-779. 10.1007/s00424-003-1073-4View ArticlePubMedGoogle Scholar
- Bermingham JR, Shumas S, Whisenhunt T, Sirkowski EE, O'Connell S, Scherer SS, Rosenfeld MG: Identification of genes that are downregulated in the absence of the POU domain transcription factor pou3f1 (Oct-6, Tst-1, SCIP) in sciatic nerve. J Neurosci. 2002, 22 (23): 10217-10231.PubMedGoogle Scholar
- Fischmeister R, Hartzell HC: Volume sensitivity of the bestrophin family of chloride channels. J Physiol. 2005, 562 (Pt 2): 477-491.View ArticlePubMedPubMed CentralGoogle Scholar
- Srivastava A, Romanenko VG, Gonzalez-Begne M, Catalan MA, Melvin JE: A variant of the Ca2+-activated Cl channel Best3 is expressed in mouse exocrine glands. J Membr Biol. 2008, 222 (1): 43-54. 10.1007/s00232-008-9098-4View ArticlePubMedGoogle Scholar
- Martinez-Yamout MA, Venkitakrishnan RP, Preece NE, Kroon G, Wright PE, Dyson HJ: Localization of sites of interaction between p23 and Hsp90 in solution. J Biol Chem. 2006, 281 (20): 14457-14464. 10.1074/jbc.M601759200View ArticlePubMedGoogle Scholar
- Keppler BR, Grady AT, Jarstfer MB: The biochemical role of the heat shock protein 90 chaperone complex in establishing human telomerase activity. J Biol Chem. 2006, 281 (29): 19840-19848. 10.1074/jbc.M511067200View ArticlePubMedGoogle Scholar
- Drysdale MJ, Brough PA, Massey A, Jensen MR, Schoepfer J: Targeting Hsp90 for the treatment of cancer. Curr Opin Drug Discov Devel. 2006, 9 (4): 483-495.PubMedGoogle Scholar
- Myokai F, Takashiba S, Lebo R, Amar S: A novel lipopolysaccharide-induced transcription factor regulating tumor necrosis factor alpha gene expression: molecular cloning, sequencing, characterization, and chromosomal assignment. Proc Natl Acad Sci USA. 1999, 96 (8): 4518-4523. 10.1073/pnas.96.8.4518View ArticlePubMedPubMed CentralGoogle Scholar
- Bennett CL, Shirk AJ, Huynh HM, Street VA, Nelis E, Van Maldergem L, De Jonghe P, Jordanova A, Guergueltcheva V, Tournev I, et al: SIMPLE mutation in demyelinating neuropathy and distribution in sciatic nerve. Ann Neurol. 2004, 55 (5): 713-720. 10.1002/ana.20094View ArticlePubMedGoogle Scholar
- Saifi GM, Szigeti K, Wiszniewski W, Shy ME, Krajewski K, Hausmanowa-Petrusewicz I, Kochanski A, Reeser S, Mancias P, Butler I, et al: SIMPLE mutations in Charcot-Marie-Tooth disease and the potential role of its protein product in protein degradation. Hum Mutat. 2005, 25 (4): 372-383. 10.1002/humu.20153View ArticlePubMedGoogle Scholar
- Tang X, Metzger D, Leeman S, Amar S: LPS-induced TNF-alpha factor (LITAF)-deficient mice express reduced LPS-induced cytokine: Evidence for LITAF-dependent LPS signaling pathways. Proc Natl Acad Sci USA. 2006, 103 (37): 13777-13782. 10.1073/pnas.0605988103View ArticlePubMedPubMed CentralGoogle Scholar
- Thisse B, el Messal M, Perrin-Schmitt F: The twist gene: isolation of a Drosophila zygotic gene necessary for the establishment of dorsoventral pattern. Nucleic Acids Res. 1987, 15 (8): 3439-3453. 10.1093/nar/15.8.3439View ArticlePubMedPubMed CentralGoogle Scholar
- Harfe BD, Fire A: Muscle and nerve-specific regulation of a novel NK-2 class homeodomain factor in Caenorhabditis elegans. Development. 1998, 125 (3): 421-429.PubMedGoogle Scholar
- Wang P, Zhao J, Corsi AK: Identification of novel target genes of CeTwist and CeE/DA. Dev Biol. 2006, 293 (2): 486-498. 10.1016/j.ydbio.2005.10.011View ArticlePubMedGoogle Scholar
- Lan L, Liu M, Liu Y, Liu Y, Zhang W, Xue J, Xue Z, He R: Expression of qBrn-1, a new member of the POU gene family, in the early developing nervous system and embryonic kidney. Dev Dyn. 2006, 235 (4): 1107-1114. 10.1002/dvdy.20703View ArticlePubMedGoogle Scholar
- Hauptmann G, Gerster T: Combinatorial expression of zebrafish Brn-1- and Brn-2-related POU genes in the embryonic brain, pronephric primordium, and pharyngeal arches. Dev Dyn. 2000, 218 (2): 345-358. 10.1002/(SICI)1097-0177(200006)218:2<345::AID-DVDY8>3.0.CO;2-VView ArticlePubMedGoogle Scholar
- Parslow TG, Blair DL, Murphy WJ, Granner DK: Structure of the 5' ends of immunoglobulin genes: a novel conserved sequence. Proc Natl Acad Sci USA. 1984, 81 (9): 2650-2654. 10.1073/pnas.81.9.2650View ArticlePubMedPubMed CentralGoogle Scholar
- Pankratova E, Sytina E, Polanovsky O: Autoregulation of Oct-1 gene expression is mediated by two octa-sites in alternative promoter. Biochimie. 2006, 88 (10): 1323-1329. 10.1016/j.biochi.2006.04.012View ArticlePubMedGoogle Scholar
- Chen RP, Ingraham HA, Treacy MN, Albert VR, Wilson L, Rosenfeld MG: Autoregulation of pit-1 gene expression mediated by two cis-active promoter elements. Nature. 1990, 346 (6284): 583-586. 10.1038/346583a0View ArticlePubMedGoogle Scholar
- Trieu M, Ma A, Eng SR, Fedtsova N, Turner EE: Direct autoregulation and gene dosage compensation by POU-domain transcription factor Brn3a. Development. 2003, 130 (1): 111-121. 10.1242/dev.00194View ArticlePubMedGoogle Scholar
- Washietl S, Hofacker IL, Stadler PF: Fast and reliable prediction of noncoding RNAs. Proc Natl Acad Sci USA. 2005, 102 (7): 2454-2459. 10.1073/pnas.0409169102View ArticlePubMedPubMed CentralGoogle Scholar
- Webb CT, Shabalina SA, Ogurtsov AY, Kondrashov AS: Analysis of similarity within 142 pairs of orthologous intergenic regions of Caenorhabditis elegans and Caenorhabditis briggsae. Nucleic Acids Res. 2002, 30 (5): 1233-1239. 10.1093/nar/30.5.1233View ArticlePubMedPubMed CentralGoogle Scholar
- Bendall AJ, Sturm RA, Danoy PA, Molloy PL: Broad binding-site specificity and affinity properties of octamer 1 and brain octamer-binding proteins. Eur J Biochem. 1993, 217 (3): 799-811. 10.1111/j.1432-1033.1993.tb18308.xView ArticlePubMedGoogle Scholar
- Hikima J, Lennard ML, Wilson MR, Miller NW, Warr GW: Regulation of the immunoglobulin heavy chain locus expression at the phylogenetic level of a bony fish: transcription factor interaction with two variant octamer motifs. Gene. 2006, 377: 119-129. 10.1016/j.gene.2006.03.014View ArticlePubMedGoogle Scholar
- Verrijzer CP, Alkema MJ, van Weperen WW, Van Leeuwen HC, Strating MJ, Vliet van der PC: The DNA binding specificity of the bipartite POU domain and its subdomains. Embo J. 1992, 11 (13): 4993-5003.PubMedPubMed CentralGoogle Scholar
- Givens ML, Kurotani R, Rave-Harel N, Miller NL, Mellon PL: Phylogenetic footprinting reveals evolutionarily conserved regions of the gonadotropin-releasing hormone gene that enhance cell-specific expression. Mol Endocrinol. 2004, 18 (12): 2950-2966. 10.1210/me.2003-0437View ArticlePubMedPubMed CentralGoogle Scholar
- Brabletz T, Pfeuffer I, Schorr E, Siebelt F, Wirth T, Serfling E: Transforming growth factor beta and cyclosporin A inhibit the inducible activity of the interleukin-2 gene in T cells through a noncanonical octamer-binding site. Mol Cell Biol. 1993, 13 (2): 1155-1162.View ArticlePubMedPubMed CentralGoogle Scholar
- Kitamoto T, Salvaterra PM: A POU homeo domain protein related to dPOU-19/pdm-1 binds to the regulatory DNA necessary for vital expression of the Drosophila choline acetyltransferase gene. J Neurosci. 1995, 15 (5 Pt 1): 3509-3518.PubMedGoogle Scholar
- Schug JOG: Current Protocols in Bioinformatics. 1996, Chapter 2.6:Google Scholar
- Brenner S: The genetics of Caenorhabditis elegans. Genetics. 1974, 77 (1): 71-94.PubMedPubMed CentralGoogle Scholar
- Hobert O: PCR fusion-based approach to create reporter gene constructs for expression analysis in transgenic C. elegans. Biotechniques. 2002, 32 (4): 728-730.PubMedGoogle Scholar
- Thacker C, Sheps JA, Rose AM: Caenorhabditis elegans dpy-5 is a cuticle procollagen processed by a proprotein convertase. Cell Mol Life Sci. 2006, 63 (10): 1193-1204. 10.1007/s00018-006-6012-zView ArticlePubMedGoogle Scholar
- Stein LD, Mungall C, Shu S, Caudy M, Mangone M, Day A, Nickerson E, Stajich JE, Harris TW, Arva A, et al: The generic genome browser: a building block for a model organism system database. Genome Res. 2002, 12 (10): 1599-1610. 10.1101/gr.403602View ArticlePubMedPubMed CentralGoogle Scholar