How the Dorsal gradient works: Insights from postgenome technologies

  1. Joung-Woo Hong,
  2. David A. Hendrix,
  3. Dmitri Papatsenko and
  4. Michael S. Levine,1
  1. Division of Genetics, Genomics, and Development, Department of Molecular and Cell Biology, Center for Integrative Genomics, University of California, Berkeley, CA 94720
  1. Edited by Eric H. Davidson, California Institute of Technology, Pasadena, CA, and approved September 2, 2008 (received for review July 21, 2008)

Abstract

Gradients of extracellular signaling molecules and transcription factors are used in a variety of developmental processes, including the patterning of the Drosophila embryo, the establishment of diverse neuronal cell types in the vertebrate neural tube, and the anterior–posterior patterning of vertebrate limbs. Here, we discuss how a gradient of the maternal transcription factor Dorsal produces complex patterns of gene expression across the dorsal–ventral (DV) axis of the early Drosophila embryo. The identification of 60–70 Dorsal target genes, along with the characterization of ≈35 associated regulatory DNAs, suggests that there are at least six different regulatory codes driving diverse DV expression profiles.

Keywords:

The dorsal–ventral (DV) patterning of the early Drosophila embryo is controlled by a sequence-specific transcription factor, Dorsal, which is related to mammalian NF-κB (1,3). Differential activation of the Toll receptor leads to the formation of a broad Dorsal nuclear gradient, with peak levels present in ventral regions and progressively lower levels in lateral and dorsal regions (,Fig. 1) (,4, ,5).

Fig. 1.

The Dorsal nuclear gradient generates diverse thresholds of gene expression. The Dorsal gradient regulates 60–70 target genes in a concentration-dependent manner across the DV axis of the early embryo. The expression domains of these genes are depicted on a diagram representing a cross-section through an early embryo. Filled green circles represent high levels of nuclear Dorsal protein, and shaded green and yellow circles represent intermediate and low levels, respectively. At least six different expression patterns have been identified. High levels of Dorsal activate Type 1 genes such as snail in the presumptive mesoderm. Type 2 genes such as rho and vnd are activated by intermediate levels of Dorsal gradient in ventral regions of the presumptive neurogenic ectoderm. In addition, type 1A and 2A expression profiles, represented by sim and ind, depend on Notch and EGF signaling and high and intermediate levels of the Dorsal gradient, respectively. Two type 3 expression patterns, those represented by sog and zen, are generated by the lowest levels of the gradient. The same low levels that are sufficient to activate sog repress the expression of dpp, zen, and tolloid. Thus, the Dorsal gradient generates three basic transcription responses, and these three thresholds produce a total of six different patterns of gene expression across the DV axis of the early embryo. This figure is adapted from the review by Stathopoulos and Levine (50).

The Dorsal gradient initiates DV patterning by regulating 60–70 target genes in a concentration-dependent manner (Fig. 1) (,6, ,7). Altogether, the Dorsal gradient generates six discrete patterns of gene expression across the DV axis of cellularizing embryos. Most of the target genes are activated only by the highest levels of the Dorsal gradient, in the ventral third of the embryo forming the mesoderm. Intermediate levels of the gradient lead to restricted patterns of gene expression within ventral regions of the presumptive neurogenic ectoderm. The lowest levels of the gradient activate gene expression throughout the neurogenic ectoderm; these same low levels also restrict expression to the dorsal ectoderm.

In addition to these four basic patterns of gene expression, produced by high, intermediate, and low levels of the Dorsal gradient, there are at least two other expression profiles, represented by the target genes sim and ind. Insights into the regulatory codes governing their expression were obtained from the DV gene regulatory network.

A combination of genetic analysis, transgenesis, bioinformatics, and microarray assays led to the elucidation of a comprehensive gene regulatory network for the DV patterning of the early embryo (6,11). This network contains >40 genes encoding sequence-specific transcription factors or cell signaling components that impinge on the activities of these factors (,12, ,13). Altogether, the network comprises >200 functional interconnections among the 40 major determinants of DV patterning.

Cell–Cell Interactions Produce Additional Dorsal Gradient Readouts

The sim expression pattern depends on the prior activation of snail by high levels of the Dorsal gradient within the presumptive mesoderm (Fig. 2A). snail encodes a zinc finger repressor of tom, which encodes an inhibitor of the Notch signaling ligand, Delta (14, ,15). As a result of this repression, Delta is stably expressed in the ventral mesoderm. All of the mesectoderm cells in contact with the mesoderm, the ventral-most cells of the presumptive neurogenic ectoderm, receive this Delta signal and experience Notch signaling (,16, ,17). Notch, in concert with the Dorsal and Twist gradients, triggers the expression of sim in the presumptive mesoderm and mesectoderm. sim expression is restricted to the mesectoderm because of repression by Snail in the presumptive mesoderm.

Fig. 2.

sim and ind expression patterns are variants of type 1 and type 2 threshold responses. (A) Colocalization of snail (green) and sim (red) mRNAs in a pregastrular embryo. Sharp boundaries of snail expression delineate dorsal limits of the presumptive mesoderm that directly contact the ventral-most cells of the presumptive neurogenic ectoderm, the mesectoderm, where sim is expressed. The Snail repressor leads to stable presentation of the Notch signaling ligand, Delta, in the presumptive mesoderm by inhibiting Tom expression. The adjacent mesectoderm cells receive this Delta signal and experience Notch signaling. Notch, in conjunction with the Dorsal and Twist gradients, triggers the expression of sim. Thus, the sim expression pattern arises from the type 1 snail pattern, which produces a localized source of Notch signaling. (G) Expression of ind depends on EGF signaling generated by the products of two type 2 Dorsal target genes, vein and rho. (D) vein encodes a secreted EGF signaling ligand, whereas rho encodes a membrane-bound protease that releases the Spitz EGF signaling ligand from ventral regions of the neurogenic ectoderm. This notion is consistent with two experimental observations. (B and C) ind expression (B) is lost in rho;vn double mutants (C). (E and F) Furthermore, misexpression of rho via the eve stripe2 enhancer (E) induces ectopic expression of ind in the presumptive neurogenic ectoderm (F). Ectopic staining is restricted to lateral regions where there are limiting amounts of the Dorsal.

Cell signaling also works in concert with the Dorsal gradient to establish the ind expression pattern (Fig. 2B). ind expression depends on EGF signaling, which is produced by vein and rho, two target genes activated by intermediate levels of the Dorsal gradient (Fig. 1). vein encodes a secreted EGF signaling molecule (18, ,19), whereas rho encodes a membrane protease that releases the Spitz EGF ligand from ventral regions of the neurogenic ectoderm (20, ,21). These EGF signals activate MAP kinase in both ventral and lateral regions of the neurogenic ectoderm (,22). In principle, ind can be activated in both regions by the combination of low levels of the Dorsal gradient and EGF signaling, but expression is kept off in the ventral neurogenic ectoderm by Vnd, a sequence-specific transcriptional repressor that is activated by intermediate levels of Dorsal (Fig. 1) (,23,25). As a result of this repression, ind expression is restricted to lateral regions of the neurogenic ectoderm. Thus, just as the sim expression profile is defined by the combination of Dorsal and Notch signaling (Fig. 3 C and D), the ind expression pattern is regulated by the combination of Dorsal and EGF signaling (Figs. 2G and 3 G and H).

Fig. 3.

Regulatory codes for diverse DV patterns. (A) The highest levels of Dorsal activate type 1 genes, such as snail, thereby restricting the expression of the associated target genes to the presumptive mesoderm. (B) Most of type 1 enhancers contain a disordered series of low-affinity Dorsal and/or Twist binding sites. (C) Expression of the type 1A gene, sim, in the mesectoderm depends on Notch signaling, which is produced by the Snail-dependent inhibition of Tom. (D) The sim enhancer contains a high-affinity Dorsal binding site and Suppressor of Hairless Su(H) sites, which mediate Notch signaling. (E) Type 2 genes, such as vnd, are activated by intermediate levels of the Dorsal gradient, and low levels of Twist. (F) All five of the known type 2 enhancers contain a fixed arrangement of Dorsal and Twist binding sites. An optimal Dorsal site is closely linked to an asymmetric Twist site, which is positioned in a convergent orientation relative to the Dorsal site. This arrangement plays a critical role for cooperative DNA binding interactions between Dorsal and Twist/Da heterodimers. These cooperative interactions generate a stable pattern of type 2 expression in ventral regions of the neurogenic ectoderm. (G) Intermediate levels of the Dorsal gradient induce EGF signaling, which is important for the activation of the Type 2A gene, ind, in the neurogenic ectoderm. (H) The ind enhancer contains a high-affinity Dorsal binding site and ETS sites, which mediate EGF signaling. (I and K) Two different classes of type 3 enhancers are activated (type 3activation) and repressed (type 3repression) by the same low levels of the Dorsal gradient. (I and J) Optimal Dorsal binding sites are closely linked to CAGGTAG motifs in the intronic enhancer that regulates the type 3activation gene, sog. This CAGGTAG motif is probably recognized by a ubiquitous activator distributed throughout the early embryo. It is conceivable that cooperative DNA binding interactions between CAGGTAG binding factor and Dorsal would permit both intermediate and low levels of the Dorsal protein to activate sog expression throughout the neurogenic ectoderm. (K) zen is repressed by the same low levels of the Dorsal gradient that activate sog. (L) The zen enhancer contains optimal Dorsal binding sites and a closely linked motif, WTCKTTCAT, that binds one or more ubiquitous factors that convert Dorsal into a silencer. Dorsal–corepressor interactions appear to induce the exposure of a cryptic peptide motif in Dorsal that recruits the Groucho corepressor (42).

Two lines of evidence support the view that ind expression requires EGF signaling. First, the endogenous ind expression pattern is completely lost in rho;vn double mutants (Fig. 2 B and C) (23). Second, misexpression of rho in transgenic embryos using the eve stripe 2 enhancer induces ectopic activation of ind in the presumptive neurogenic ectoderm. However, this ectopic expression is restricted to a pyramid pattern, suggesting that EGF signaling is not sufficient for induction, but is limited by diminishing levels of the Dorsal gradient (Fig. 2 D–F).

Whole-Genome Identification of Dorsal Target Enhancers

We have argued that four of the DV expression profiles, those represented by snail, sim, vnd, and ind, are produced by high and intermediate levels of the Dorsal gradient. High levels of Dorsal activate snail, which leads to the localized sim pattern via Notch signaling, whereas intermediate levels activate vnd, vn, and rho, which delineate the ind pattern. The final two expression patterns, those represented by sog and zen, are generated by the lowest levels of the gradient (Fig. 1). The same low levels that are sufficient to activate sog repress the expression of dpp, zen, and tolloid (26). Thus, the Dorsal gradient generates a total of six different patterns of gene expression across the DV axis of the early embryo (,Fig. 3).

Of course, Dorsal does not work alone in regulating these differential patterns of gene expression. Many target genes are regulated by a combination of Dorsal, Twist, and/or Snail. High levels of Dorsal activate twist, and the encoded Twist basic helix–loop–helix (bHLH) activator is distributed in a steeper gradient than the input Dorsal gradient. Dorsal and Twist jointly activate more than half of all Dorsal target genes (27, ,28). High levels of both activators are required for induction and maintenance of snail expression, which exhibits a tight pattern of expression within the presumptive mesoderm (29). The encoded zinc finger Snail repressor establishes a sharp boundary between the presumptive mesoderm and neurogenic ectoderm (e.g., ,Fig. 3A) (30, ,31). Several genes are activated by low levels of Dorsal and Twist in lateral regions of early embryos (e.g., ,Fig. 3E). The Snail repressor keeps these genes off in the mesoderm, thereby restricting their expression to the neurogenic ectoderm where they are important for the differentiation of the CNS.

To understand the regulatory basis for the six DV expression patterns, a large number of Dorsal target enhancers have been identified and characterized. Several methods have been used, including conventional gene fusion assays (e.g., refs. 27,29), bioinformatics methods (,9, ,10), and most recently, ChIP-chip assays (,7). The latter method proved to be the most effective means for the genomewide identification of Dorsal target enhancers.

The experiments were done by using Toll10b mutant embryos (32), which possess high levels of constitutive Toll signaling across the entire DV axis. As a result, there are high levels of Dorsal, Twist, and Snail throughout the embryo in place of the normal gradients. High levels of Dorsal and Twist activate mesoderm genes throughout the embryos; the Snail repressor keeps ectodermal genes off. As a result, all of the cells of Toll10b embryos form mesoderm at the expense of ectodermal derivatives.

Toll10b embryos were cross-linked at 2–4 h after fertilization because this is the time when the Dorsal gradient regulates its various target genes. Chromosomal DNA was isolated from the mutants, sonicated into small pieces, and then immunoprecipitated with antibodies directed against Dorsal, Twist, and Snail. There are at least 200–300 significant clusters of Dorsal, Twist, and Snail binding sites scattered throughout the Drosophila genome (7). These clusters identified 22 of the 24 known Dorsal target enhancers (,7) and another 10 enhancers that had not been previously identified (WntD, kni, mes5, pnr, tup, and shadow enhancers for vnd, miR-1, brk and sog; see refs. 7 and ,33). Altogether, the 34 experimentally confirmed Dorsal target enhancers regulate 29 of the 60–70 known Dorsal target genes (,6, ,11). The ChIP-chip assays also provide clear predictions for at least another ≈20 putative DV enhancers, including those associated with Mes4 (5′), Neu3 (intron), SoxN (3′), CadN (3′), and Doc 1 (5′).

Regulatory Codes for DV Expression Patterns

The highest levels of the Dorsal gradient activate gene expression in the presumptive mesoderm. Most of the associated enhancers (16 in total) contain a disordered series of low affinity Dorsal and/or Twist binding sites that function in an additive fashion to activate gene expression (Fig. 3 A and B) (6, ,10). A very different scenario is observed for those enhancers that mediate gene expression in ventral regions of the neurogenic ectoderm. These enhancers are activated by intermediate levels of the Dorsal gradient and low levels of Twist (e.g., ,Fig. 3E).

All five of the known type 2 enhancers contain a fixed arrangement of Dorsal and Twist binding sites (Fig. 3F) (34,36). An optimal Dorsal site is closely linked to an asymmetric Twist site, which is positioned in a convergent orientation relative to the Dorsal site. This organization is conserved in all 12 of the sequenced drosophilids (,36). Inverting the Twist site causes a severe reduction in the activities of an otherwise normal vein-lacZ fusion gene (35). These enhancers also contain Snail binding sites, which inhibit their activities in the ventral mesoderm. Thus, there are two differences in the regulatory codes of enhancers mediating expression in the mesoderm and ventral neurogenic ectoderm. The latter enhancers contain a fixed arrangement of Dorsal and Twist binding sites and also contain Snail repressor sites. Enhancers mediating expression in the mesoderm lack an intrinsic organization, or grammar, and contain low-affinity Dorsal and/or Twist sites.

The fixed arrangement of Dorsal and Twist sites seen in enhancers active in the ventral neurogenic ectoderm is probably essential for cooperative DNA binding interactions between Dorsal and the Daughterless (Da) subunit of Da/Twist heterodimers bound to the asymmetric Twist site (28). Da is a ubiquitous bHLH protein that is expressed throughout the early embryo and related to the mammalian E12/E47 protein (,38, ,39). In general, Da does not function as a homodimer, but forms heterodimers with tissue-specific transcription factors such as Twist (mesoderm) and Achaete (neurogenic ectoderm). Dl-Da/Twi cooperative binding interactions produce a stable pattern of type 2 expression in ventral regions of the neurogenic ectoderm, even though there is a 100-fold reduction in the levels of the Twist gradient (,35, ,36, ,40).

A similar principle, cooperative DNA binding interactions to a fixed arrangement of sites, might also apply to those enhancers responding to the lowest levels of the Dorsal gradient. The sog enhancer contains a series of optimal Dorsal binding sites and linked CAGGTAG sequences (Fig. 3 I and J). This latter motif has been identified in a number of genes expressed in the early, precellular Drosophila embryo (41). The CAGGTAG motif is probably recognized by a ubiquitous, maternal activator distributed throughout the early embryo. Linked Dorsal and CAGGTAG sites are conserved in the sog intronic enhancer among all 12 drosophilids. It is possible that the binding of the activator to CAGGTAG facilitates the binding of Dorsal to linked sites via cooperative DNA binding interactions, similar to those seen for Dorsal and Twist. This cooperative interaction would permit both intermediate and low levels of the Dorsal protein to activate sog expression throughout the neurogenic ectoderm.

The same low levels of the Dorsal gradient that activate sog (e.g., Fig. 3I) also repress target genes such as zen (e.g., Fig. 3K) (26, ,42). The zen enhancer contains a series of optimal Dorsal binding sites, similar to those seen in the sog enhancer, which is activated by Dorsal. However, unlike sog, the zen enhancer contains linked binding sites that convert Dorsal into a transcriptional repressor (Fig. 3L).

Dorsal contains a cryptic peptide motif that can interact with the Groucho corepressor (42, ,43). The exposure of this motif depends on protein–protein interactions between Dorsal and neighboring corepressor proteins. One of these proteins recognizes a highly conserved sequence motif, WTCKTTCAT. Previous studies in Drosophila melanogaster demonstrated that Dorsal–corepressor interactions depend on helical phasing of Dorsal and the linked corepressor binding site (44). Helical phasing is classical evidence for direct protein–protein interactions on the DNA template (,45,47), and this phasing is highly conserved among the zen enhancers in all 12 sequenced drosophilids. Thus, the Dorsal-CAGGTAG linkage seen in the sog enhancer is replaced by Dorsal-WTCKTTCAT linkage in zen (44). These distinct regulatory codes determine whether the target genes are activated or repressed by the same low levels of the Dorsal gradient.

Conclusions and Implications

A combination of classical genetic screens, computational methods, conventional gene fusion assays, along with the advent of more recent postgenome technologies such as whole-genome tiling arrays and ChIP-chip assays, has permitted the systematic identification of Dorsal target genes and their associated regulatory DNAs (e.g., refs. 7 and ,11; reviewed in refs. ,8 and ,13). This information was used to create an extensive gene regulatory network for the DV patterning of the Drosophila embryo (12, ,13). The network helped define six regulatory codes underlying diverse DV patterns of gene activity. The recent determination of 12 different drosophilid whole-genome assemblies helped confirm the notion that stable patterns of gene expression (e.g., in the ventral neurogenic ectoderm) in response to sharply diminishing levels of the Dorsal and Twist gradients depend on a fixed arrangement of linked activator sites. Target enhancers that respond to peak levels of the Dorsal and/or Twist gradients lack an intrinsic organization, whereas enhancers responding to low levels exhibit a “grammar” that is conserved among divergent drosophilids (,7, ,35, ,36). This organization probably fosters cooperative occupancy of Dorsal binding sites.

We propose that other regulatory gradients function like Dorsal to produce complex patterns of gene expression. For example, the floorplate organizer specifies at least four different neuronal cell types in the developing vertebrate neural tube (48). As seen for the Dorsal gradient, they might arise from the combination of Gli activator thresholds and cell–cell interactions similar to the role of Notch and EGF signaling in producing the sim and ind expression patterns, respectively (Fig. 3 C, D, G, and H). Similar rules might also apply to the zone of polarizing activity (ZPA) and anterior–posterior compartment organizers that pattern vertebrate and insect limbs (49).

Acknowledgments

This study was funded by grants from the Moore Foundation and the National Institutes of Health.

Footnotes

  • 1To whom correspondence should be addressed. E-mail: mlevine{at}berkeley.edu
  • Author contributions: J.-W.H. and M.S.L. designed research; J.-W.H. performed research; J.-W.H., D.A.H., D.P., and M.S.L. analyzed data; and J.-W.H. and M.S.L. wrote the paper.

  • This paper results from the Arthur M. Sackler Colloquium of the National Academy of Sciences, “Gene Networks in Animal Development and Evolution,” held February 15–16, 2008, at the Arnold and Mabel Beckman Center of the National Academies of Sciences and Engineering in Irvine, CA. The complete program and audio files of most presentations are available on the NAS web site at http://www.nasonline.org/SACKLER_Gene_Networks.

  • The authors declare no conflict of interest.

  • This article is a PNAS Direct Submission.

References

« Previous | Next Article »Table of Contents