Previous Article |
Table of Contents
| Next Article
BIOLOGICAL SCIENCES / PLANT BIOLOGY
Omics-based identification of Arabidopsis Myb transcription factors regulating aliphatic glucosinolate biosynthesis

,




,||
*RIKEN Plant Science Center, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan;
Kazusa DNA Research Institute, 2-6-7 Kazusakamatari, Kisarazu, Chiba 292-0818, Japan;
Graduate School of Pharmaceutical Sciences, Chiba University, 1-33 Yayoi-cho, Inage-ku, Chiba, Chiba 263-8522, Japan;
Core Research for Evolutional Science and Technology, Japan Science and Technology Agency, Saitama 332-0012, Japan; and ¶Central Laboratories for Frontier Technology, Kirin Brewery Company, Ltd., 1-13-5 Fukuura, Kanazawa-ku, Yokohama, Kanagawa 236-0004, Japan
Edited by Marc C. E. Van Montagu, Ghent University, Ghent, Belgium, and approved February 13, 2007 (received for review December 28, 2006)
| Abstract |
|---|
|
|
|---|
coexpression | functional genomics | transcriptomics
120 GSLs that differ in side-chain structures [supporting information (SI) Fig. 6]. GSLs are synthesized from several amino acids, including Met, Trp, and Phe (5, 6). Met-derived 4-methylsulfinylbutyl GSL (4MSOB) deserves special attention because its degraded product, sulforaphane, which was first isolated from broccoli, exhibits pronounced anticarcinogenic activity (1–3). The model plant Arabidopsis thaliana (L.) Heynh. also contains GSLs, including 4MSOB. For biotechnological applications that require increasing 4MSOB production, it is important to identify all of the genes involved in GSL biosynthesis and to elucidate the entire regulatory mechanism in Arabidopsis. To date, however, there has been no report on the genes regulating Met-derived aliphatic GSL biosynthesis. Along with the massive accumulation of microarray data sets, transcriptome coexpression analysis has proven to be a powerful tool for identifying regulatory relationships in the transcriptional networks of model organisms, including Escherichia coli (7) and yeast (8). Assuming that a set of genes coexpressed under a given experimental regimen is involved in the same or related metabolic pathway, candidate genes involved in the regulation or synthesis steps of a particular metabolic pathway can be comprehensively identified, or at least predicted with some confidence, by using publicly available transcriptome databases (9–13). Although the strategy of coexpression analysis has great potential for versatility, its application has thus far been limited in the actual discovery of useful genes in important pathways in Arabidopsis (11). If a coexpression profile for a specific condition (e.g., nutrient-deficiency stress) is compared with the set of genes that is always coexpressed in all tissues under all experimental conditions that have been tested, or "condition-independent" profile derived from public data sets, the reliability and feasibility of predicting a gene function would greatly increase. In this study, we identified two previously unrecognized genes, Myb28 and Myb29, which encode R2R3-Myb transcription factors involved in the regulation of aliphatic GSL biosynthesis, by using an integrated strategy based on transcriptome coexpression analysis for both public data sets and our own data, along with metabolic profiling. We also concurrently predicted many of the unknown structural genes encoding enzymes of aliphatic GSL biosynthesis and clarified their regulatory network. Overexpression of Myb28 in Arabidopsis cell suspension cultures resulted in the production of large amounts of GSLs, indicating the usefulness of these transcription factors for the production of GSLs in biotechnological applications.
| Results and Discussion |
|---|
|
|
|---|
|
Functional Analysis of Myb28 and Myb29 by Gene-Knockout Plants. To confirm the predicted function of Myb28 and Myb29, we analyzed transcript (i.e., transcriptome analysis by microarray) and GSL accumulation in gene-knockout and -knockdown plants, designated as myb28 and myb29, respectively, both of which grew normally. As shown in Fig. 2A, the expression of most of the genes involved in aliphatic GSL biosynthesis was repressed in myb28, whereas there were no significant changes in indole and Phe-derived aromatic GSL biosynthetic gene expression or in the expression of three nuclear factor genes reported to be involved in GSL metabolism (18–20). The other genes whose expression changed in myb28 are shown in SI Table 1. Most of the genes down-regulated in myb28 other than GSL biosynthetic genes were sulfur-deficiency-inducible genes (15), suggesting that a decrease of GSL amount (see below) might result in perturbation of sulfur metabolism, i.e., a sulfur-surplus status in myb28, to repress these genes secondarily.
|
However, there was no apparent change in the expression of GSL biosynthetic genes or GSL content in myb29 gene-knockdown plants (SI Fig. 9), suggesting that Myb29 may not be essential for GSL biosynthesis. Because GSL biosynthesis is known to be enhanced by the plant hormone methyl jasmonate (MeJA) (21), we measured changes of gene expression in wild-type plants in response to MeJA application (Fig. 3). Under control conditions (without MeJA), Myb28 expression was higher than Myb29. MeJA application, however, induced expression of Myb29 but not Myb28. Expression of the aliphatic GSL biosynthetic genes CYP79F1, CYP79F2, and CYP83A1 was also up-regulated with the induction of Myb29. These results suggest that Myb28 is essential for basal-level synthesis of aliphatic GSLs, and Myb29 presumably has a function in the induction of aliphatic GSL biosynthetic genes in response to MeJA signaling.
|
2-fold (data not shown). These results indicate that ectopic expression of Myb28 specifically up-regulates the expression of aliphatic GSL biosynthetic genes, but the indole and aromatic GSL biosynthetic genes, as well as Myb29, are unaffected. The other affected genes in Myb28-overexpressing cell cultures are shown in SI Table 2, which indicated that most of the genes up-regulated in Myb28-overexpressing cell cultures other than GSL biosynthetic genes were sulfur-deficiency-inducible genes.
|
Functional Description of the Genes Involved in GSL Biosynthesis. On the basis of transcriptome analysis of the Myb28-knockout and -overexpressing lines, the genes involved in GSL biosynthesis can be described in some detail.
Genes involved in Met side-chain elongation. Met is subjected to side-chain elongation cycles before entering the GSL core biosynthetic pathway. Elongation proceeds through the four-step reactions as in Leu biosynthesis (SI Fig. 10). Thus, the enzymes committed to Met side-chain elongation and Leu biosynthesis are presumably encoded by homologous genes belonging to the same gene families as follows: methylthioalkylmalate synthase (MAM) and isopropylmalate synthase (IPM-S) by four genes; MAM isomerase (MAM-I) and isopropylmalate isomerase (IPM-I) by three genes (AtLeuCs) for a large subunit and three genes (AtLeuDs) for a small subunit; MAM dehydrogenase (MAM-D) and isopropylmalate dehydrogenase (IPM-D) by three genes; and methionine-analog aminotransferase (MAAT) and branched-chain aminotransferase (BCAT) by six genes (SI Fig. 10). Of these 19 genes, only MAM1 and MAM3 have been functionally identified as coding for the methylthioalkylmalate synthase involved in Met side-chain elongation (22, 23). AtBCAT-1 has been shown to initiate degradation of the branched-chain amino acids Leu, Ile, and Val (24). Transcriptome analyses of the myb28-knockout and Myb28-overexpressing cell cultures indicated that MAM1, MAM3, AtLeuC1, AtLeuD1, AtLeuD2, AtIMD1, AtBCAT-3, and AtBCAT-4 were all positively regulated by Myb28 (SI Fig. 10), suggesting that these regulated genes are committed to aliphatic GSL biosynthesis; thus, the remainder of the 19 genes are not likely related to GSL but rather to Leu biosynthesis. AtBCAT-4 has recently been reported to be involved in Met side-chain elongation (25), confirming our methodology for predicting gene function.
Methionine biosynthetic genes.
Met is synthesized in plants from Cys by sequential reactions catalyzed by cystathionine
-synthase (CGS), cystathionine
-lyase (CBL), and Met synthase (MS) (SI Fig. 10). Arabidopsis possesses three MS genes, AtMS1, AtMS2, and AtMS3. Of these genes, AtMS3, which encodes a chloroplastic isoform, is postulated to be responsible for the de novo synthesis of Met, whereas AtMS1 and AtMS2 are assumed to be involved in the recycling of cytosolic S-adenosyl-homoCys into Met (26). Transcriptome analyses of Myb28-engineered cells suggest that CGS, CBL, and AtMS2 are up-regulated by Myb28, which is indicative of the connection of these genes to GSL biosynthesis. These structural genes might be induced to compensate for the decrease in Met concentrations as the plant synthesizes aliphatic GSLs.
Other genes involved in GSL biosynthesis. We suggested in ref. 15 that two GST genes, At1g78370 (ATGSTU20) and At3g03190 (ATGSTF11), are involved in GSL biosynthesis. Transcriptome analyses of myb28 and Myb28-overexpressing cell cultures indicated that both of them are regulated by Myb28, further supporting the specific involvement of these genes in aliphatic GSL biosynthesis.
We also assumed that At5g36160, which is annotated as a putative Tyr aminotransferase, is an additional C-S lyase gene functioning in GSL biosynthesis under certain conditions (15). However, the expression of this gene was not regulated by Myb28 (data not shown), suggesting that it may encode a C-S lyase involved in indole/aromatic GSL biosynthesis, or it encodes a Phe aminotransferase in side-chain elongation of Phe, directing the synthesis of 2-phenylethyl GSL derived from homoPhe (SI Fig. 6).
PMSR2, a gene encoding a putative peptide methionine sulfoxide reductase, was coexpressed with the GSL biosynthetic genes under sulfur deficiency, and was regulated by Myb28. Supposing that this enzyme could recognize the methylsulfinyl moiety of methylsulfinylalkyl GSL and that of methionine sulfoxide, this enzyme could be involved in side-chain conversion of aliphatic GSLs (Fig. 5).
|
A Working Model for Regulation of GSL Biosynthesis. The present study identified Myb28 and Myb29 as positive regulatory factors for aliphatic GSL biosynthesis under certain conditions. A regulatory network for this pathway could thus be proposed, as summarized in Fig. 5. Myb28 and Myb29 belong to the R2R3-Myb gene family, clustered into a small subgroup with Myb34 and Myb76 (see below) in a molecular phylogenetic tree drawn by the relationship of their amino acid sequences (28). The subtle differences in hormonal or stress-signal recognition by these regulatory elements suggest an evolutionary conservation of their primary structures and functions with fine tuning of their distinct roles in GSL metabolism. On the basis of the results discussed above, we rename these genes Production of Methionine-Derived Glucosinolate (PMG) 1 and 2, that is, PMG1/Myb28 and PMG2/Myb29.
Because GSLs play important roles as storage forms of sulfur and defense compounds against herbivores and microorganisms, Arabidopsis has evolved sophisticated regulatory mechanisms to control GSL biosynthesis responding to changes in nutritional status and biotic/abiotic stresses. PMG1/Myb28 is apparently a master transcription factor, generally regulating the pathway from Met to aliphatic GSLs, and is necessary and sufficient for the biosynthesis of aliphatic GSLs at a basal level in Arabidopsis. In contrast, PMG2/Myb29 plays an accessory role in MeJA-mediated induction of aliphatic GSL biosynthesis. Because the expression of ATR1/Myb34, together with the indole GSL biosynthetic genes, was also induced by MeJA application (29), ATR1/Myb34 participates in the MeJA-mediated induction of indole GSL biosynthesis, as PMG2/Myb29 does for aliphatic GSLs.
GSLs allow the nontoxic storage of sulfur in plants, because GSLs contain two or three sulfur atoms per molecule. Under sulfur-deficiency conditions, the expression of PMG1/Myb28, PMG2/Myb29, and ATR1/Myb34 was down-regulated, presumably to shut down GSL biosynthesis, so that the limited available sulfur would not be diverted from the production of sulfur-containing primary metabolites such as Cys, Met, and glutathione (30). In this context, the relation of sulfur-deficiency gene regulation to the recently identified SLIM1 factor that regulates sulfate uptake and assimilation is particularly intriguing (31). Although ATR1/Myb34 is apparently negatively regulated by SLIM1 in roots (31), the effect of SLIM1 on PMG1/Myb28 expression is unclear, probably due to the relatively minor importance of PMG1/Myb28 in roots compared with ATR1/Myb34 (SI Fig. 11). ATR1/Myb34 was repressed in the PMG1/Myb28-overexpressing plants (Fig. 4A). In an ATR1 loss-of-function mutant, the expression of CYP79B2, CYP79B3, and CYP83B1 as part of indole GSL biosynthesis and thus presumed to be controlled by ATR1/Myb34 was repressed in adult leaves but not in seedlings (17), suggesting the presence of a complicated regulatory mechanism for indole GSL biosynthesis. Biosynthesis of indole GSL could be finely regulated by several regulatory networks in response to environmental stimuli and the developmental stage, because indole GSL metabolism is closely related to the biosynthesis of indole-3-acetic acid, an important plant hormone.
A gene expression database analysis revealed that PMG1/Myb28 and PMG2/Myb29 are expressed preferentially in leaves and nodes along with AtBCAT-4 and MAM-1, whereas ATR1/Myb34 and CYP79B2 are expressed more strongly in roots than in leaves (SI Fig. 11). Interestingly, SUR1, which is involved in both aliphatic and indole GSL biosyntheses, seems to be positively regulated by both PMG1/Myb28 and ATR1/Myb34 in a tissue-specific manner (i.e., in leaves and inflorescences by PMG1/Myb28 and in root by ATR1/Myb34) (SI Fig. 11).
Perspectives on the Application of PMG1/Myb28 and PMG2/Myb29 and Coexpression Strategy. In terms of biotechnological application for GSL metabolic engineering, PMG1/Myb28 and PMG2/Myb29 are quite promising targets of genetic engineering for improved production of aliphatic GSLs on an industrial scale, because the ectopic expression of PMG1/Myb28 resulted in the production of GSLs at levels comparable to differentiated plants, even in dedifferentiated suspension cells. This case study using Arabidopsis leads to further applicable studies aimed at industrial production of human health-beneficial GSLs and at engineering potent pest-resistant rapeseed for biodiesel production.
Besides PMG1/Myb28 and PMG2/Myb29, our strategy of combining omics analyses of public transcriptome coexpression data sets with condition-specific (i.e., sulfur deficiency) transcriptome and metabolome profiles in-house, lead to the prediction of additional transcription factors, Myb76 and Myb59, which exhibit weaker correlation with GSL biosynthetic genes, and a number of structural genes presumably encoding the enzymes involved in GSL biosynthesis. This result indicates that a strategy based on transcriptome coexpression analysis is highly versatile for the comprehensive identification of genes involved in plant metabolism. Especially when condition-independent coexpression profiles from public databases are combined with condition-specific transcriptome and metabolome profiles, whole regulatory frameworks can be outlined (15, 32), leading to a greater understanding of metabolic systems and subsequent biotechnological applications in plant production.
| Methods |
|---|
|
|
|---|
Vector Construction and Plant Materials.
For overexpression lines, full-length Myb28 cDNA was amplified by PCR using Arabidopsis leaf cDNA as a template. The cDNA was introduced into binary vector pGWB2 by TOPO and the Gateway system (Invitrogen, Carlsbad, CA), in which the expression of cDNA is under the control of the CaMV35S promoter. For the genetic complementation study, an
4-kb fragment spanning the upstream sequence and coding region of Myb28 was amplified by PCR using Arabidopsis leaf DNA as a template. This genomic fragment was introduced in pGWB1 by TOPO and the Gateway system. The resulting vectors were introduced into Agrobacterium tumefaciens EHA101 by the method of An et al. (39).
Wild-type Arabidopsis accession Columbia was transformed with full-length Myb28 cDNA by the floral dip method (40) to obtain Myb28-overexpressing plants. The T-DNA insertion mutant myb28 (see below) was complemented with a genomic fragment containing an intact copy of Myb28. Arabidopsis T87 cultured suspension cells (41) were transformed with the fusion construct of CaMV35S promoter linked to Myb28 cDNA to obtain Myb28 overexpressing suspension cell lines. Details of suspension cell culture and transformation are described in SI Methods.
Myb28-knockout plants, in which T-DNA was inserted into the 5' UTR of Myb28 (SALK_136312) (42), was obtained from the Arabidopsis Biological Resource Center (Ohio State University, Columbus, OH). Homozygous lines of the T-DNA insertion mutant were selected and designated as myb28. A homozygous T-DNA-inserted line of Myb29, designated as myb29, in which T-DNA is inserted in the 5' UTR of Myb29 (CS121027 [GenBank] ), was a kind gift from Mitsuhiro Aida (Nara Institute of Science and Technology, Ikoma, Japan).
T2 and T3 generations of mutants and transgenic plants were used for analysis. Plants were grown for
3 weeks on soil [PRO-MIX BX (Premier Horticulture Inc., Quakertown, PA): vermiculite = 2:1, supplemented with fertilizer] in a greenhouse at 22°C under natural and fluorescent light (16 h light/8 h dark cycle). Rosette leaves were harvested, immediately frozen in liquid nitrogen, and stored at –80°C.
MeJA Treatment. Wild-type Arabidopsis plants were grown for 7 days in liquid culture (43). Plants were treated with MeJA for 3 h by direct addition to the liquid medium (final concentration of 10 µM). DMSO was used for mock-treatments (final concentration of 0.1%, vol/vol). The seedlings were harvested, immediately frozen in liquid nitrogen, and stored at –80°C until use.
Transcriptome Analysis. Total RNA was extracted with an RNeasy Plant mini kit (Qiagen, Valencia, CA). In the case of myb28 and myb29, three independent hybridizations with an Affymetrix (Santa Clara, CA) ATH1 microarray using two biological replicates were conducted according to the manufacturer's instructions. In the case of Myb28-overexprssing plants, four hybridizations with an Affymetrix ATH1 microarray were conducted by using T2 hygromycin-resistant individuals and T3 homozygous lines of two independent transformants. To compare expression levels in myb28 and overexpressing plants with wild-type plants, comparison analysis of GeneChip Operating Software (GCOS) ver1.4 (Affymetrix) was conducted. For the transcriptome analysis of cell suspension cultures, nine hybridizations were conducted with Agilent Arabidopsis2 Oligo DNA Microarray (Agilent Technologies, Palo Alto, CA) using nine combinations of one of three overexpressing lines (Cy5 label) and one of three control lines transformed with an empty vector (Cy3 label).
GSL Analysis. GSLs were analyzed by liquid chromatography–mass spectrometry using sinigrin as an internal standard for quantification (31).
| Acknowledgements |
|---|
|
|
|---|
| Footnotes |
|---|
Abbreviations: GSL, glucosinolate; MAM, methylthioalkylmalate synthase; MeJA, methyl jasmonate.
||To whom correspondence should be addressed. E-mail: ksaito{at}psc.riken.jp
Freely available online through the PNAS open access option.
Author contributions: M.Y.H., K. Sugiyama, and Y.S. contributed equally to this work; M.Y.H. and K. Saito designed research; K. Sugiyama, Y.S., T.T., A.S., R.A., and H.G. performed research; N.S., H.S., K.A., and D.S. contributed new reagents/analytic tools; M.Y.H., K. Sugiyama, Y.S., T.T., T.O., and O.I.N. analyzed data; and M.Y.H. and K. Saito wrote the paper.
The authors declare no conflict of interest.
This article is a PNAS Direct Submission.
Data deposition: Microarray data have been deposited in ArrayExpress database (accession nos. E-ATMX-6, E-ATMX-7, and E-ATMX-8).
This article contains supporting information online at www.pnas.org/cgi/content/full/0611629104/DC1.
© 2007 by The National Academy of Sciences of the USA
| References |
|---|
|
|
|---|
This article has been cited by other articles in HighWire Press-hosted journals:
![]() |
S. Alvarez, Y. He, and S. Chen Comparative Investigations of the Glucosinolate-Myrosinase System in Arabidopsis Suspension Cells and Hypocotyls Plant Cell Physiol., March 1, 2008; 49(3): 324 - 333. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Knill, J. Schuster, M. Reichelt, J. Gershenzon, and S. Binder Arabidopsis Branched-Chain Aminotransferase 3 Functions in Both Amino Acid and Glucosinolate Biosynthesis Plant Physiology, March 1, 2008; 146(3): 1028 - 1039. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Ogawa, T. Dansako, K. Yano, N. Sakurai, H. Suzuki, K. Aoki, M. Noji, K. Saito, and D. Shibata Efficient and High-Throughput Vector Construction and Agrobacterium-Mediated Transformation of Arabidopsis thaliana Suspension-Cultured Cells for Functional Genomics Plant Cell Physiol., February 1, 2008; 49(2): 242 - 250. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Pauwels, K. Morreel, E. De Witte, F. Lammertyn, M. Van Montagu, W. Boerjan, D. Inze, and A. Goossens Mapping methyl jasmonate-mediated transcriptional reprogramming of metabolism and cell cycle progression in cultured Arabidopsis cells PNAS, January 29, 2008; 105(4): 1380 - 1385. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Ma, Q. Gong, and H. J. Bohnert An Arabidopsis gene network based on the graphical Gaussian model Genome Res., November 1, 2007; 17(11): 1614 - 1625. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||