Previous Article |
Table of Contents
| Next Article
Vol. 96, Issue 24, 13603-13610, November 23, 1999
Contributed by Tony Hunter, September 7, 1999
Caenorhabditis elegans should soon be
the first multicellular organism whose complete genomic sequence has
been determined. This achievement provides a unique opportunity for a
comprehensive assessment of the signal transduction molecules required
for the existence of a multicellular animal. Although the worm C. elegans may not much resemble humans, the molecules that regulate
signal transduction in these two organisms prove to be quite similar. We focus here on the content and diversity of protein kinases present
in worms, together with an assessment of other classes of proteins that
regulate protein phosphorylation. By systematic analysis of the 19,099 predicted C. elegans proteins, and thorough analysis of the
finished and unfinished genomic sequences, we have identified 411 full
length protein kinases and 21 partial kinase fragments. We also
describe 82 additional proteins that are predicted to be structurally
similar to conventional protein kinases even though they share minimal
primary sequence identity. Finally, the richness of
phosphorylation-dependent signaling pathways in worms is further
supported with the identification of 185 protein phosphatases and 128 phosphoprotein-binding domains (SH2, PTB, STYX, SBF, 14-3-3, FHA, and
WW) in the worm genome.
Reversible protein phosphorylation plays a
central role in regulating basic functions of all eukaryotes such as
DNA replication, cell cycle control, gene transcription, protein
translation, and energy metabolism. Protein phosphorylation is also
required for more advanced functions in higher eukaryotes such as cell,
organ, and limb differentiation, cell survival, synaptic transmission, cell-substratum and cell-cell communication, and to mediate complex interactions with the external environment. Because aberrant protein phosphorylation is commonly the cause of cancer and other human diseases, a comprehensive knowledge of the key enzymes that regulate these functions can provide the basis for novel therapeutic
intervention strategies.
The genomic revolution promises to provide a new paradigm for
drug discovery, allowing one to selectively target the molecular basis
of human disease. The completion of the
Caenorhabditis elegans genome sequence gives
us an opportunity to decipher the molecular nature of its signal
transduction machinery. Several global analyses of proteins and protein
domains present in C. elegans have been presented elsewhere
(1-4), revealing that protein kinases comprise the second largest
family of protein domains in worms. The three most frequently occurring
protein domains found in worms are seven transmembrane chemoreceptors
(650 domains, 3.5% of genome), protein kinases (496 domains, 2.6% of
genome), and zinc finger C4 domains, including nuclear hormone
receptors (275 domains, 1.4% of genome). A more in-depth analysis has
been performed on the 535 worm proteins containing zinc-binding
domains, including the C4, C2H2, and C3HC4 ring finger types (3), and
on the 83 worm homeobox transcription factors (4). Here, we present a
comparative analysis of the enzymes and adaptor molecules that are the
key components of the protein phosphorylation signaling network present
in C. elegans.
Identification and Classification of C. elegans Protein
Kinases.
To identify worm protein kinases, we first used an
HMMER 2.1.1 (http://hmmer.wustl.edu/)
profile search against the 19,099 predicted worm proteins, the finished
and unfinished C. elegans genomic sequence, and the worm
chromosome assemblies. The nucleic acid databases were first translated
in all six frames, and ORFs longer than 30 amino acids were parsed into
a relational database. We generated a hidden Markov model based on 70 representative yeast and human protein kinases whose catalytic domains
share <50% sequence identity with each other (5). Using a similar strategy, additional profiles were generated for other protein kinase-like domains (phosphoinositide kinases, atypical A6 kinases, diacylglycerol kinases, aminoglycoside resistance
kinases, and microbial kinases), protein phosphatases, and domains
capable of specifically binding to phosphotyrosine (P.Tyr) or
phosphoserine/threonine residues (SH2, PTB, STYX, SBF, 14-3-3, FHA,
and WW domains). Scripts were written for reassembly of contiguous
exons identified from genomic sequence to generate the predicted
catalytic domain sequence of each kinase. Pairwise BLAST
2.0 (ftp://ncbi.nlm.nih.gov/blast/executables/) analysis was performed to identify redundant entries, and putative protein kinases with low profile scores were manually inspected to
determine whether they should be included in subsequent analyses.
Inaugural Article
The protein kinases of Caenorhabditis
elegans: A model for signal transduction in
multicellular organisms
,
The Salk Institute, 10010 North Torrey Pines Road, La
Jolla, CA 92037; and * SUGEN, 230 East Grand Avenue, South San
Francisco, CA 94080
![]()
Abstract
Top
Abstract
Article
References
![]()
Article
Top
Abstract
Article
References
|
|
Yeast- and Fungal-Specific Kinases. The first complete eukaryote sequence, that of the budding yeast Saccharomyces cerevisiae, was reported in 1996 (6). Shortly thereafter, we presented a comprehensive analysis and classification of yeast protein kinases (7). Now, with the availability of a second eukaryotic genome, C. elegans, we can perform a similar analysis and make more informed generalizations on which of these protein kinases are unique to yeast or fungi, and also on which protein kinases evolved during the emergence of multicellular organisms and are therefore not represented in yeast or fungi.
We now identify a total of 24 yeast-specific protein kinases and an additional 3 that are currently restricted to yeast and worms. Originally we defined four protein kinase subfamilies, containing a total of 18 members, to be yeast specific [protein kinase A (PKA)-related, RAN, ELM, and NPR/HAL5 families]. These remain yeast- or fungal-specific, as no close homologues are present in worms, and none have yet been described in vertebrates. However, the ELM family could be considered as a subfamily of the CAMK group. Rim15 is a yeast-specific kinase that is related to Schizosaccharomyces pombe Cek1, and its similarity to budding yeast YNL161w places it as a distant member of the NDR family kinases. Two other protein kinase subfamilies, containing a total of five members, were originally recognized as having only distant homologues in higher organisms (NEK-like and PIM-like families). The prototype of the NEK-like family, YNL020C, has a homologue in worms, but not in mammals, although its C-terminal tail has a predicted coiled-coil structure related to numerous mammalian protein kinases (e.g., SLK/PLKK, TAK1). The two yeast PIM-like family members have catalytic domains related to worm and mammalian protein kinases, but have a unique N-terminal domain. Members of the NPR/HAL5 family are involved in ion homeostasis, polyamine transport, nutrient uptake, and response to nitrogen starvation, whereas Elm1 initiates a protein kinase cascade controlling pseudohyphal growth (8). Members of the RAN family are related to fission yeast Ran1/Pat1, which regulates the switch between vegetative growth and meiosis. Because these are fungal-specific responses, it is not surprising that these protein kinases are restricted to lower eukaryotes. A second set of "unique" yeast protein kinases was originally defined because they had no close homologues in other species (7). Most of these yeast protein kinases now have both worm and vertebrate orthologues (Cdc5, Ipl1, Ire1, Vps15, YGL180W/Apg1, Swe1, Spk1, Gcn2, YBR274W, YGR262C, and Bub1). Exceptions among this list of unique yeast protein kinases are YPL236C and Mps1, which have orthologues in humans, but not in worms; YKL116C, which is distantly related to the EMK-family, yet has only weak homologues in worms and humans; and YKL171W, YGR052W, and YPR106W, which remain yeast specific protein kinases. Two sequences that were excluded from our previous analysis of yeast protein kinases deserve mention. The budding yeast protein Iks1 can be classified as a yeast-specific protein kinase because it still has no homologues in worms or other species whereas another yeast kinase-like sequence, SCY1, has orthologues in C. elegans and Arabidopsis, but none thus far in vertebrates. A S. pombe protein, which is distantly related to SCY1, also has a single worm orthologue.Worm-Specific Protein Kinases. Which protein kinases are specific to worms? Protein kinases that are absent from yeast yet present in worms are likely to be involved in the complex signal transduction pathways that are required for the existence of multicellular organisms. These might include protein kinases involved in cell-substratum and cell-cell adhesion, transmembrane signaling in response to humoral factors, protein kinases involved in cell survival or programmed cell death, and protein kinases whose signals regulate metazoan-specific transcription factors, particularly those containing Zn-finger domains.
In the absence of complete genome sequences of other multicellular eukaryotes, we tentatively classify 165 protein kinases (plus 9 protein kinase fragments) as worm-specific. The majority (134, 80%) fall into three groups (CK1, FER, and KIN-15) whereas the others are distant members of common protein kinase families or belong to worm specific subfamilies. Five protein kinase subfamilies, containing a total of 12 members, can tentatively be defined as worm-specific (C04G2.10, K08B4.5, K09C6.7, R107.4, and ZK177.2-families). An additional 15 unique worm protein kinases are also identified, which to date have no close homologues in yeast, worms, or in higher organisms. However, mammalian homologues of some of these worm protein kinases are already beginning to appear in publicly available expressed sequence tag databases, and assignment of a protein kinase as being truly worm-specific will have to await the completion of the Drosophila and human genome sequences. Members of four other protein kinase or kinase-like subfamilies are disproportionately represented in worms compared with humans. Clusters of 5-9 members of each of these families are localized to short regions (<1 megabase) of chromosomes II and IV, suggesting they may each have expanded as a result of extensive tandem gene duplication. The chromosomal density of protein kinases is graphically depicted on our web site at www.kinase.com. The four gene families are the CK1-family, the KIN-15-family of receptor protein-tyrosine kinases, the FER-family of cytoplasmic protein-tyrosine kinases, and the kinase-like domains of the receptor guanylyl cyclases. CK1 family. The worm genome contains 87 CK1 (casein kinase I) members (plus 7 additional partial catalytic domains) whereas there are only 4 known members in budding yeast and 6 in humans. Genetic evidence from the yeast homologues suggests CK1s may be involved in DNA repair and cell division, and mammalian CK1s have been shown to phosphorylate p53 in G1 and G2, possibly affecting cell sensitivity to DNA damage at these checkpoints (9). Little is known regarding the function of CK1s in worms, but the enormous arborization and diversification of this kinase family may be an adaptation allowing for enhanced DNA repair in response to excessive exposure to environmental mutagens. KIN-15/16 family. C. elegans contains 16 members of a unique family of receptor protein-tyrosine kinases whose presence to date is restricted to this species. These transmembrane proteins have unusually short (<50-aa) extracellular domains, and many are clustered within the genome, as though they arose through tandem gene duplication. The prototype members of this family, KIN-15 and KIN-16, are expressed in the hypodermal syncytium, which expands by cell fusion during larval development (10). Compared with wild-type worms, KIN-15 and KIN-16 deletion mutants produce fewer embryos and rarely develop into adults, but, when they do mature, they typically exhibit extrusion of the gonads through the vulva (11). Therefore, KIN-15/16 appear to be essential genes, yet may undergo variable compensation by 1 of the 14 other homologues. One of the KIN-15 clusters is interspersed with chitinase genes, which are known to function in cell wall morphogenesis during the molting process and in fungal resistance. Expansion of this region may have been necessary during evolution to facilitate this aspect of larval development. An alternative function for KIN-15-family kinases is suggested by the fact that overexpression TKR-1 (C08H9.5) causes a 40-100% extension of life expectancy in worms (12). Unlike other life extension (age) mutants, TKR-1 transgenics do not form dauers, and their longevity has been attributed to an increased resistance to ultraviolet and thermal stress. FER family. The worm genome contains 42 members (plus 2 additional partial catalytic domains) of the FER-family of single SH2-containing cytoplasmic protein-tyrosine kinases. Most of these genes are interspersed throughout the worm genome; however, nine members reside within a 1.1-megabase region on chromosome IV. Unfortunately, no literature is available on the function of any of these protein kinases in worms, but the two mammalian homologues, FER and FES, have been demonstrated to play a role in cell adhesion, to signal downstream of cytokine receptors, and to function as oncogenes (13). Conceivably, additional human representatives will be revealed on completion of the human genome sequence, possibly with restricted expression. Alternatively, their function may be replaced in humans by expansion and diversification of non-FER cytoplasmic protein-tyrosine kinases, of which worms have only 10 whereas humans have at least 34. Most evident is a dramatic expansion of SRC-family kinases and emergence of ZAP70 and JAK family kinases in higher eukaryotes that are not found in the worm genome.Conserved Metazoan Protein Kinase Signaling Transduction Pathways. Worms provide an elegant model system for studying signal transduction. This transparent animal is comprised of 959 somatic cells plus 131 cells destined for programmed cell death. The C. elegans hermaphrodite contains 302 neurons and 81 muscle cells and has a brain, reproductive system, and digestive tract (ref. 14; http://dauerdigs.biosci.missouri.edu/Dauer-World/Wormintro.html). It provides a complex yet tractable system for studying development, metabolism, aging, and behavioral responses to a number of stimuli. Regulation of many of these processes is carried out through signal transduction pathways that are also present in humans. Not surprisingly, all of the major protein kinase groups found in worms are also conserved in humans (15). The number of protein kinases classified into each major group from yeast and worms, along with a current estimate from humans, is provided in Table 1. These numbers represent a current analysis, but new protein kinases are being discovered every month as the worm genome sequencing project continues. Some of these entries may also represent pseudogenes containing frameshifts that result in incomplete translation into a full kinase catalytic domain.
AGC Group.
The AGC group of worm protein kinases contains representatives of
many of the known types of cyclic nucleotide-dependent, NDR or DBF2,
and ribosomal S6 kinase families. Worms also contain members of the
cGMP-dependent kinase (PKG), RSK, and G-protein coupled receptor kinase
families that are absent from budding yeast. Two of the S6 kinase
members have dual catalytic domains similar to vertebrate RSK enzymes,
where the N-terminal domain clusters into the AGC group and the
C-terminal kinase domain is most related to the CaMK group. Worms have
four members of the AKT family, two being close orthologues of
mammalian AKT1/PKB/RAC
, and two related to the AKT upstream
kinase, PDK1. AKT is a mammalian protooncoprotein regulated by
phosphatidylinositol 3-kinase (PI3-K), which appears to function as a
cell survival signal to protect cells from apoptosis (16).
Insulin receptor, RAS, PI3-K, and PDK1 all act as upstream activators
of AKT whereas the lipid phosphatase PTEN functions as a negative
regulator of the PI3-K/AKT pathway (17). Downstream targets for
AKT-mediated cell survival include the proapoptotic factors BAD
and Caspase9 and transcription factors in the forkhead family, such as
DAF-16 in the worm. AKT is also an essential mediator in insulin
signaling, in part because of its use of GSK-3 as another downstream
target. Each of these components of the AKT/PI3-K pathway is
conserved in worms, providing a powerful system for genetic dissection
of a major cell survival signal.
CaMK Group.
In the CaMK group, the most abundant representatives include
Ca2+/calmodulin-regulated and
AMP-dependent protein kinases and EMK-related kinases. Worms also
contain members of the death-associated protein kinase,
mitogen-activated protein kinase (MAPK)-associated protein kinase,
myosin light chain kinase, and phosphorylase kinase families that are
absent from budding yeast. All of these protein kinase families have
likely evolved as a result of the demands of multicellularity and the
emergence of complex organ systems. For example, even though yeast have
myosin homologues, they lack myosin light chain kinases. These protein
kinases have presumably evolved to regulate myosin during muscle
contraction. A worm contig still under construction appears to contain
a phosphorylase kinase catalytic
subunit orthologue, consistent
with the presence of two orthologues of the noncatalytic phosphorylase
kinase
subunits, which facilitate calmodulin-binding
and are required for activation of the mammalian holoenzyme.
CMGC Group. In the CMGC group of serine/threonine kinases, all of the main subfamilies are conserved between yeast, worms, and mammals, including cyclin-dependent kinase (CDK), MAPK, GSK-3, and CLK. An exception is the RCK family, which is absent from yeast but has two members in worms and at least seven in humans. The worm RCK kinases are most similar to mammalian MAK, or male germ cell-associated kinase, which has been implicated in spermatogenic meiosis and in signal transduction pathways for sight and smell. Worms have 14 CDKs (compared with 5 CDKs in yeast) including orthologues of CDC2, CDK3, CDK5, CDK7, and CDK8, and contain 34 cyclins, compared with 23 in budding yeast (Table 1), including one cyclin H orthologue, which we predict will interact with worm CDK-7 to generate a functional cyclin-activated kinase.
Worms have 14 MAPKs, compared with 6 in yeast and at least 14 in humans. The worm MAPKs include representatives of each of the major types of MAPKs: ERK/MAPK, JNK/SAPK1, p38/SAPK2, BMK/ERK5, and NEMO-like kinase (NLK) (21). In budding yeast, three protein kinase families (the prototypes being Ste20, Ste11, and Ste7) function upstream of the MAPKs to generate at least four distinct MAPK signaling pathways that mediate the response to pheromone, nutritional starvation, and cellular or osmotic stress. In multicellular organisms, these MAPK cascades have evolved to mediate responses to diverse signals including growth factors, mitogens, hormones, and cytokines, in addition to the more primitive stress responses to anoxia, heat shock, and osmotic stress.STE Group: MAPK Pathways. The STE family refers to the three classes of protein kinases that lie sequentially upstream of the MAPKs. In worms, this group includes 10 STE7 (MEK or MAPKK) kinases, 2 STE11 (MEKK or MAPKKK) kinases, and 12 STE20 (MEKKK) kinases. Based on the number of MAPK and STE-family kinases in C. elegans, we predict worms will contain at least 8-10 MAPK pathways. In humans, several protein kinase families that bear only distant homology with the STE11 family also operate at the level of MAPKKKs, including RAF, MLK, TAK1, and COT. Except for COT, worms also have orthologues of each of these kinases. Because crosstalk takes place between protein kinases functioning at different levels of the MAPK cascade, the large number of STE family kinases could translate into an enormous potential for upstream signal specificity and diversity.
Protein-Tyrosine Kinase Group: Receptor Protein-Tyrosine Kinases
(RTKs).
The largest group of protein kinases in worms are the
protein-tyrosine kinases (PTKs), with 92 members and 5 fragments. We predict this will also remain the largest group of protein kinases in
higher eukaryotes, including humans, where the current count is
100.
These numbers are impressive when one considers that this family is
absent from budding yeast. Yeast, however, do have a "budding"
tyrosine phosphorylation signaling system, with several dual-specificity kinases (CLK-like, Ste7/MEK family, Swe1,
Spk1/Rad53, Mps1), an atypical A6 PTK, 3 protein-tyrosine
phosphatases, 16 dual-specificity and low molecular weight
phosphatases, and 6 "infant" P.Tyr-binding proteins comprising an
apparently nonfunctional SH2 domain protein and 5 phosphatase-like STYX
domains. Budding yeast lack PTB domains, and none of the six potential
P.Tyr-binding domains have been functionally verified.
|
Protein-Tyrosine Kinase Group: CTKs. Most of the 52 CTKs in worms belong to the single SH2-containing FER family. Of the remaining 10 CTKs, there are 2 orthologues of the SH3-containing ACK, and 1 each of FYN (SRC family CTK), FRK, CSK, ABL, and FAK, plus 3 unclassified CTKs. In vertebrates, CSK negatively regulates FYN-family kinases by phosphorylation of a C-terminal tyrosine facilitating a conformational change through an intramolecular SH2-P.Tyr interaction (25). We predict a similar functional interaction between worm FYN and CSK. Co-evolution of this regulatory pair suggests even early metazoans required a means to dampen signaling through CTKs. Notably absent in worms are protein kinases related to the ZAP70 and JAK CTKs, whose primary role in mammals is in signaling through the T cell and cytokine receptors, both of which represent more specialized pathways not present in worms. Humans have eight SRC-family kinases whereas worms have only one. This redundancy has confounded efforts to dissect out the precise role of these CTKs in human biology, often requiring "triple knockouts" to demonstrate a deficiency. The simplicity of non-FER-like CTKs in worms may be helpful in placing these CTKs within specific signaling cascades.
Protein-Tyrosine Kinases: Adaptor and Docking Molecules.
Ligand activation of RTKs results in tyrosine phosphorylation of
both the receptor itself (autophosphorylation) and of downstream substrates. These phosphorylated tyrosines then function as attachment sites for proteins containing SH2 and other P.Tyr-binding domains. We
have identified 74 proteins containing a total of 77 SH2 domains in
worms. The majority of these SH2 domains are in CTKs, two are present
in a SHP2-related PTP, and the remainder are predicted to represent
orthologues of a variety of adaptor molecules, including phospholipase
C
, CBL, CIS4/SOCS5, CRK, NCK, SEM-5/GRB2, SHC, tensin, STAT, and
VAV. Worms also contain at least 16 PTB domains, which in some cases
have been found to interact specifically with tyrosine phosphorylated
proteins. Worm PTB-containing proteins include orthologues of SHC,
which also contains an SH2 domain, neuronal transmembrane protein X11,
and an insulin receptor substrate (IRS) family member. The mammalian
X11 PTB domain does not to bind to P.Tyr, so we anticipate only a few
of these worm domains will function as P.Tyr-binding domains.
Additional potential phosphoprotein-binding domains identified in worms
include three 14-3-3 domains, 22 WW domains, and 11 FHA domains.
Other Protein Kinases.
Approximately 15% of the worm protein kinases do not fall into
one of the six major groups but include smaller families with representatives in higher eukaryotes, including CHK1, DYRK, MLK, TAK1,
PIM, RAF, STKR, and the mitotic kinases (BUB1, AURORA, PLK, and
NIMA/NEK). Recent genetic and biochemical data place TAK1 (transforming growth factor
-associated kinase) on a MAPK-like pathway at the level of a MAPKKK acting upstream of the MAPK-family member NLK. The worm orthologues of TAK1 and NLK regulate Wnt-mediated cell polarization during embryogenesis (21). Biochemical data also
demonstrate that this MAPK-like pathway negatively regulates Wnt
signaling because NLK phosphorylates the TCF/LEF HMG transcription factors, thereby inhibiting Wnt-regulated binding of the
-catenin-TCF complex to DNA. Both of these pathways are conserved
between mammals and worms. The likely orthologous human/worm pairs on
the TAK1 MAPK-like pathway include TAK1/MOM-4, NLK/LIT-1, and
TCF4/POP-1. Upstream regulators may include TGF
1/DBL-1, TGF
type I receptor/SMA-6, TGF
type II receptor/DAF-4 (worms have
three receptor serine kinases). Additional components of the
Wnt-signaling pathway, such as cadherin, the adenomatous polyposis coli
tumor suppressor gene (APC), disheveled, and GSK-3 kinase are also
present in worms, suggesting that there may be a primordial connection
between polarized control of cell division/migration and cellular
transformation in vertebrates (26).
Microbial-Like Kinases: Origin of Protein Kinases? The availability of the sequence of the first complete metazoan genome, combined with the sequence of budding yeast and several prokaryotic and Archaea genomes, provides an excellent opportunity to reassess current theories on the evolutionary origin of protein kinases. Pkn1 is a bacterial protein kinase-like sequence first described in the Gram-negative bacteria Myxococcus xanthus, which functions in growth and differentiation and in the ability of this prokaryote to form a fruiting body in response to nutrient starvation. Pkn-related proteins are present in other prokaryotes, including Streptomyces, Bacillus, Mycobacterium, Pseudomonas, Chlamydia, and Synechocystis, where they are involved in virulence, secondary metabolism, sporulation, and complex growth cycles (27). However, there are no Pkn homologues in bacteria with less complex life cycles, such as Escherichia coli, and Haemophilus influenzae, or in any Archaea, suggesting they may have been acquired by horizontal transmission from an early eukaryote, and are unlikely to represent the ancestral founders to protein kinases.
In our kinase profile searches of the worm genome, we detected several entries with low profile scores, yet with significant (E value < 10
2) random expectation (E)
values. Most of these contained similarity to kinase subdomains I, II,
and VI, containing the consensus GxGxxGxV, VAVK, and HxDxxxxN motifs,
respectively. Upon further analysis, many of these entries could be
classified into distinct families designated ABC1, RI01, YGR262,
diacylglycerol kinase, choline/ethanolamine kinases,
and the YLK1-antibiotic resistance kinases. The first three families
are named after their prototypic members in S. cerevisiae
(27).
Worms contain three proteins related to the budding yeast ABC1.
The yeast protein is required for the assembly of the mitochondrial cytochrome c reductase complex, which functions as an
electron carrier during oxidative phosphorylation to generate ATP (28). ABC1 homologues are present in numerous prokaryotes, including Mycobacterium, Clostridium,
Rickettsia, Synechocystis, Azobacter, and Enterobacteriaceae such as E. coli and
Providencia stuartii, in addition to the Archaea,
Methanobacterium. ABC1-like proteins are also present in
eukaryotes, including fission yeast, Arabidopsis, worms, and
mammals. Although ABC1 homologues are absent from bacteria such as
Mycoplasma, Bacillus, Haemophilus,
Helicobacter, and spirochetes, their presence in other
prokaryotes, Archaea, and eukaryotes positions them as
likely representatives of the primordial protein kinase, which was the
progenitor of the eukaryotic protein kinase family. Based on their
recognized role in mitochondrial ATP production and because they
maintain many of the structurally important residues and motifs
involved in ATP binding, the ABC1-family proteins may either bind ATP
or act as phosphotransferases. Conceivably, the ABC1 proteins transfer
phosphate to proteins as part of a feedback loop to sense mitochondrial
ATP levels.
The RI01 family has three representatives in worms and is named
after one of the two homologues in budding yeast. There are also
representatives from several Archaea species, but none from bacteria, making them a less attractive candidate as a progenitor to
the protein kinase lineage.
Atypical Protein Kinases and Protein Kinase-Like Domains.
Worms contain 26 kinase-like domains present in receptor guanylyl
cyclases (there are 10 additional soluble guanylyl cyclases), and at
least 7 diacylglycerol kinases, 7 choline/ethanolamine kinases, and 30 YLK1-related antibiotic
resistance kinases. Each of these families contain short motifs that
were recognized by our profile searches with low scoring E-values, but
a priori would not be expected to function as protein
kinases. Instead, the similarity could simply reflect the modular
nature of protein evolution and the primal role of ATP binding in
diverse phosphotransfer enzymes. However, two recent papers on a
bacterial homologue of the YLK1 family suggests that the aminoglycoside
phosphotransferases (APHs) are structurally and functionally related to
protein kinases (28, 29). There are over 40 APHs identified from
bacteria that are resistant to aminoglycosides such as kanamycin,
gentamycin, or amikacin. The crystal structure of one well
characterized APH reveals that it shares >40% structural identity
with the two-lobed structure of the catalytic domain of cAMP-dependent
protein kinase (PKA), including an N-terminal lobe composed of a
five-stranded antiparallel
sheet and the core of the C-terminal
lobe, including several invariant segments found in all protein kinases
(29). APHs lack the GxGxxG normally present in the loop between
strands 1 and 2 but contain 7 of the 12 strictly conserved residues
present in most protein kinases, including the HGDxxxN signature
sequence in kinase subdomain VIB (29). Furthermore, Daigle et
al. (30) have demonstrated that this APH also exhibits
protein-serine/threonine kinase activity, suggesting that the worm
YLK1-related molecules may indeed be functional protein kinases.
defines a conserved ATP-binding core
that is strikingly similar to conventional protein kinases (31). Three
residues are conserved among all of these enzymes, including (relative
to the PKA sequence) Lys-72, which binds the
-phosphate of ATP,
Asp-166, which is part of the HRDLK motif, and Asp-184, from the
conserved Mg2+ or Mn2+
binding DFG motif (31). The worm genome contains 12 phosphatidylinositol kinases, including 3 PI3-kinases, 2 PI4-kinases, 3 PIP5-kinases, and 4 PI3-kinase-related kinases. The latter group has
four mammalian members (DNA-PK, FRAP/TOR, ATM, and ATR), which have
been shown to participate in the maintenance of genomic integrity in
response to DNA damage and exhibit true protein kinase activity,
raising the possibility that other PI-kinases may also act as protein kinases. Regardless of whether they have true protein kinase activity, PI3-kinases are tightly linked to protein kinase signaling, as evidenced by their involvement downstream of many growth factor receptors and as upstream activators of the cell survival response mediated by the AKT protein kinase.
There are several proteins with protein kinase activity that
appear structurally unrelated to the eukaryotic protein kinases. These
include Dictyostelium myosin heavy chain
kinase A, Physarum polycephalum actin-fragmin kinase, the
human A6 PTK, human BCR, mitochondrial pyruvate dehydrogenase and
branched chain fatty acid dehydrogenase kinase, and the prokaryotic
"histidine" protein kinase family. Worms lack representatives of
the actin-fragmin kinases, BCR, and bacterial histidine kinases yet do
contain a single representative of the other classes of atypical
kinases and two homologues of the A6-related PTKs. The single worm
orthologue of the Dictyostelium myosin heavy
chain kinase A (32) proves to be the worm eukaryotic elongation factor
2 kinase (33). The slime mold, worm, and human eukaryotic elongation
factor 2 kinase homologues have all been demonstrated to have protein
kinase activity, yet they bear little resemblance to conventional
protein kinases except for the presence of a putative GxGxxG
ATP-binding motif (33).
The so-called histidine kinases are abundant in prokaryotes, with
>20 representatives in E. coli, and have also been
identified in yeast, molds, and plants. In response to external
stimuli, these kinases act as part of two-component systems to regulate DNA replication, cell division, and differentiation through
phosphorylation of an aspartate in the target protein (34). To date, no
"histidine" kinases have been identified in metazoans, although
mitochondrial pyruvate dehydrogenase (PDK) and branched chain
-ketoacid dehydrogenase kinase are related in sequence. PDK and
branched chain
-ketoacid dehydrogenase kinase represent a unique
family of atypical protein kinases involved in regulation of
glycolysis, the citric acid cycle, and protein synthesis during protein
malnutrition. Structurally, they conserve only the C-terminal portion
of "histidine" kinases, including the G box regions. Branched chain
-ketoacid dehydrogenase kinase phosphorylates the E1
subunit of
the branched chain
-ketoacid dehydrogenase complex on Ser-293,
proving it to be a functional protein kinase (35). Although no bona
fide "histidine" kinase has yet been identified in worms or humans,
they do contain PDK homologues (one in worms and five in humans).
However, the paucity of PDKs in worms makes it unlikely that they fill
in for the absence of "histidine" kinases in metazoans. Instead,
these signaling cascades have more likely been replaced by pathways
initiated through RTKs.
Based on these examples of atypical protein kinases present in
the worm genome, we predict additional worm protein kinases will be
functionally identified that lack any of the obvious motifs conserved
in the conventional members. Indeed, various biochemical data point to
the existence of true histidine, lysine, and arginine kinases in
metazoans, yet their structural identity remains a mystery.
Protein Phosphatases. Because of their important role in signal transduction, it is not surprising that the activity of protein kinases must be tightly regulated. This is accomplished through autoinhibition, autophosphorylation, transphosphorylation, dimerization, and cellular localization. Equally important is the role of protein phosphatases, which act to remove these regulatory phosphates from the protein kinase and its substrates. Because our analysis reveals worms to have a mature P.Tyr-signaling network, especially when compared with the yeast genome, we surveyed the worm genome for protein-tyrosine phosphatases.
Our analysis reveals 83 conventional protein-tyrosine phosphatases (PTPs) plus 6 catalytic fragments and 12 additional fragments with high homology to the noncatalytic portion of other worm PTPs. In addition, there are 26 proteins classified as dual-specificity phosphatases related to MAPK phosphatases, CDC14, PRL, PIR1, CDC25, myotubularins, or PTEN lipid phosphatases. We also identify two SBF1- and one STYX-related proteins that are related to myotubularins and MAPK phosphatases yet lack the catalytic cysteine motif. These proteins are predicted to be catalytically inert yet may function as phosphoprotein-binding domains or anti-phosphatases (36). We also identify 11 inositol polyphosphate phosphatases and 65 serine-threonine phosphatases. Among the 83 PTPs, there are 57 cytoplasmic PTPs and 26 receptor-like PTPs, most of which are worm specific, lacking clear human orthologues. Exceptions include worm orthologues of the cytoplasmic PTPs; SHP2, MEG1, and MEG2, and the receptor PTPs; and PTP
, PTP
, PTPµ, PTP
and IA2 (catalytically inactive).
Overall, worms contain approximately the same number of tyrosine and
dual-specificity protein kinases as they do tyrosine and
dual-specificity protein phosphatases. This coordinate expansion in the
eukaryotic lineage of both protein-tyrosine kinases and phosphatases
emphasizes the biological need to maintain tight regulation of tyrosine
phosphorylation. Because of the large numbers of worm-specific PTKs
(FER and KIN-15 families) and worm-specific PTPs (89%, or 66 of 74),
it is tempting to speculate that these unique enzymes may regulate each
other's activity, or function in the same signaling pathways.
Precedence for such specificity comes from genetic data indicating that
the CLR-1 receptor PTP attenuates EGL-15, an FGFR orthologue, signaling
in worms (37).
Conclusions.
What does the worm genome sequence tell us about mammalian signal
transduction? First, it has provided an ideal model to highlight the
bioinformatics challenges that lie ahead with the human genome effort
and allows us to test our analysis tools and database organization. Second, it lets us refine our expectations as to the diversity and
absolute number of unique protein kinases that we can expect to find in
the human genome. Based on our count of 493 (411 conventional and 82 PK-like proteins) worm kinases, minus the
197 kinases that appear to
be worm-specific expansions of certain families such as the CK1, FER,
and KIN-15 families, multiplied by the
4-fold greater number of
genes in humans compared with worms, we predict the human genome to
contain
1,100 protein kinases (PTKs and serine/threonine kinases).
A similar extrapolation predicts
300 human protein phosphatases
(PTPs, dual-specificity phosphatases, and serine-threonine phosphatases). Because our current count of human protein kinases and
phosphatases stands at
600 and 130, respectively, we still have
about half the work ahead of us. However, recent claims predict the
human genome may contain as many as 140,000 genes, compared with
previous estimates of
80,000. Such calculations would result in a
significant increase in our predictions of the total number of human
protein kinases and phosphatases.
302 neurons, compared with one trillion in humans, it
would not be surprising to see this selectivity in cellular expression
corroborated on mining the human genome. Indeed, because many of these
novel protein kinases are likely to exhibit highly restricted
expression, they may prove to be excellent targets for drug discovery
in the battle against human disease.
The worm serves as a much simpler and tractable organism than
humans for deciphering signaling cascades. Although their
P.Tyr-signaling system is quite mature
based on the content of
protein-tyrosine kinases, phosphatases, and adaptor molecules
they
lack much of the molecular redundancy that exists in mammals, allowing
the geneticist, biochemist, and cell biologist to more readily generate an "outline" of the signaling pathways that are conserved between worms and humans. The availability of the complete worm genome provides
a unique opportunity to learn about human biology. Predicted orthologous pairs of human and worm genes can be targeted by using reverse genetic approaches to identify new signaling partners or
biologic functions that can then be biochemically and functionally verified in mammals.
Although worms and humans have much in common, they also have
obvious differences. Worms do not have limbs or bones, or a circulatory
or immune system, and they eat only bacteria. Not surprisingly, they
lack several protein kinases present in humans. Over the next 2 years,
we should be better able to define which protein kinases are required
for these specialized functions as the genome sequences of
Drosophila and humans are completed. Identification and
classification of the proteins present in each is just a first step
toward understanding the biological complexity of life.
| |
Acknowledgements |
|---|
We thank Sara Courtneidge for her input. T.H. serves on the Scientific Advisory Board of SUGEN. T.H. is a Frank and Else Schilling American Cancer Society Research Professor.
| |
Footnotes |
|---|
To whom reprint requests should be addressed.
E-mail: hunter{at}salk.edu.
Abbrevations: PKA, protein kinase A; MAPK, mitogen-activated protein kinase; CDK, cyclin-dependent kinase; PTK, protein-tyrosine kinase; RTK, receptor protein-tyrosine kinases; CTK, cytoplasmic protein-tyrosine kinases; STAT, signal transducer and activator of transcription; IRS, insulin receptor substrate; NLK, NEMO-like kinase; APH, aminoglycoside phosphotransferases; PTP, protein-tyrosine phosphatase.
This contribution is part of the special series of Inaugural Articles by members of the National Academy of Sciences elected in April 28, 1998.
| |
References |
|---|
|
|
|---|
| 1. |
The C. elegans Sequencing Consortium.
(1998)
Science
282,
2012-2018 |
| 2. |
Chervitz, S. A., Aravind, L., Sherlock, G., Ball, C. A., Koonin, E. V., Dwight, S. S., Harris, M. A., Dolinski, K., Mohr, S., Smith, T., et al.
(1998)
Science
282,
2022-2028 |
| 3. |
Clarke, N. D. & Berg, J. M.
(1998)
Science
282,
2018-2022 |
| 4. |
Ruvkun, G. & Hobert, O.
(1998)
Science
282,
2033-2041 |
| 5. | Bingham, J., Plowman, G. D. & Sudarsanam, S. (1999) J. Cell. Biochem. in press. |
| 6. |
Goffeau, A., Barrell, B. G., Bussey, H., Davis, R. W., Dujon, B., Feldmann, H., Galibert, F., Hoheisel, J. D., Jacq, C., Johnston, M., et al.
(1996)
Science
274,
546 |
| 7. | Hunter, T. & Plowman, G. D. (1997) Trends Biochem. Sci. 22, 18-22[ISI][Medline] . |
| 8. | Garret, J. M. (1997) Mol. Microbiol. 4, 8098-8120. |
| 9. | Knippschild, U., Milne, D. M., Campbell, L. E., DeMaggio, A. J., Christenson, E., Hoekstra, M. F. & Meek, D. W. (1997) Oncogene 15, 1727-1736[CrossRef][ISI][Medline] . |
| 10. |
Morgan, W. R. & Greenwald, I.
(1993)
Mol. Cell. Biol.
13,
7133-7143 |
| 11. | Morgan, W. R. (1996) Worm Breeder's Gazette 14, 27. |
| 12. | Murakami, S. & Johnson, T. E. (1998) Curr. Biol. 8, 1091-1094[CrossRef][ISI][Medline] . |
| 13. |
Rosato, R., Veltmaat, J. M., Groffen, J. & Heisterkamp, N.
(1998)
Mol. Cell. Biol.
18,
5762-5770 |
| 14. | Metzstein, M. M., Stanfield, G. M. & Horvitz, H. R. (1998) Trends Genet. 14, 410-416[CrossRef][ISI][Medline] . |
| 15. |
Hanks, S. K., Quinn, A. M. & Hunter, T.
(1988)
Science
241,
42-52 |
| 16. | Downward, J. (1998) Curr. Opin. Cell Biol. 10, 262-267[CrossRef][ISI][Medline] . |
| 17. | Maehama, T. & Dixon, J. E. (1999) Trends Cell Biol. 9, 125-128[CrossRef][ISI][Medline] . |
| 18. | Bellanger, J. M., Lazaro, J. B., Diriong, S., Fernandez, A., Lamb, N. & Debant, A. (1998) Oncogene 16, 147-152[CrossRef][ISI][Medline] . |
| 19. |
Sun, Z., Hsiao, J., Fay, D. S. & Stern, D. F.
(1998)
Science
281,
272-274 |
| 20. |
Matsuoka, S., Huang, M. & Elledge, S. J.
(1998)
Science
282,
1893-1897 |
| 21. | Meneghini, M. D., Ishitani, T., Carter, J. C., Hisamoto, N., Ninomiya-Tsuji, J., Thorpe, C. J., Hamill, D. R., Matsumoto, K. & Bowerman, B. (1999) Nature (London) 399, 793-797[CrossRef][Medline] . |
| 22. | Korpelainen, E. I., Kärkkäinen, M., Gunji, Y., Vikkula, M. & Alitalo, K. (1999) Oncogene 18, 1-8[CrossRef][ISI][Medline] . |
| 23. | Hou, X. S., Melnick, M. B. & Perrimon, N. (1996) Cell 84, 411-419[CrossRef][ISI][Medline] . |
| 24. | Kawata, T., Shevchenko, A., Fukuzawa, M., Jermyn, D. A., Totty, N. F., Zhukovskaya, N. V., Sterling, A. E., Mann, M. & Williams, J. G. (1997) Cell 89, 909-916[CrossRef][ISI][Medline] . |
| 25. | Williams, J. C., Wierenga, R. K. & Saraste, M. (1998) Trends Biochem. Sci. 23, 179-184[CrossRef][ISI][Medline] . |
| 26. |
Morin, P. J., Sparks, A. B., Korinek, V., Barker, N., Clevers, H., Vogelstein, B & Kinzler, K. W.
(1997)
Science
275,
1787-1790 |
| 27. |
Leonard, C. J., Aravind, L. & Koonin, E. V.
(1998)
Genome Res.
8,
1038-1047 |
| 28. | Brasseur, G., Tron, G., Dujardin, G., Slonimski, P. P. & Brivet-Chevillotte, P. (1997) Eur. J. Biochem. 246, 103-111[Medline] . |
| 29. | Hon, W.-C., McKay, G. A., Thompson, P. R., Sweet, R. M., Yang, D. S. C., Wright, G. D. & Berghuis, A. M. (1997) Cell 89, 887-895[CrossRef][Medline] . |
| 30. | Daigle, D. M., McKay, G. A., Thompson, P. R. & Wright, G. D. (1999) Chem. Biol. 6, 11-18[CrossRef][Medline] . |
| 31. | Rao, V. D., Misra, S., Boronenkov, I. V., Anderson, R. A. & Hurley, J. H. (1998) Cell 94, 829-839[CrossRef][ISI][Medline] . |
| 32. |
Cote, G. P., Luo, X., Murphy, M. B. & Egelhoff, T. T.
(1997)
J. Biol. Chem.
272,
6846-6849 |
| 33. |
Ryazanov, A. G., Ward, M. D., Mendola, C. E., Pavur, K. S., Dorovkov, M. V., Wiedmann, M., Erdjument-Bromage, H., Tempst, P., Parmer, T. G., Prostko, C. R., et al.
(1997)
Proc. Natl. Acad. Sci. USA
94,
4884-4889 |
| 34. | Stock, J. (1999) Curr. Biol. 9, R364-R367[CrossRef][ISI][Medline] . |
| 35. | Harris, R. A., Hawes, J. W., Popov, K. M., Zhao, Y., Shimomura, Y., Sato, J., Jaskiewicz, J. & Hurley, T. D. (1997) Adv. Enzyme Regul. 37, 271-293[CrossRef][ISI][Medline] . |
| 36. | Hunter, T. (1998) Nat. Genet. 18, 303-305[CrossRef][ISI][Medline] . |
| 37. |
Kokel, M., Borland, C. Z., DeLong, L., Horvitz, H. R. & Stern, M. J.
(1998)
Genes Dev.
12,
1425-1427 |
Copyright © 1999 by The National Academy of Sciences 0027-8424/99/9613603-8$2.00/0
![]()
CiteULike
Complore
Connotea
Del.icio.us
Digg What's this?
This article has been cited by other articles in HighWire Press-hosted journals:
![]() |
Y. Wang and D. E. Levy C. elegans STAT: evolution of a regulatory switch FASEB J, August 1, 2006; 20(10): 1641 - 1652. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Grassot, M. Gouy, G. Perriere, and G. Mouchiroud Origin and Molecular Evolution of Receptor Tyrosine Kinases with Immunoglobulin-Like Domains Mol. Biol. Evol., June 1, 2006; 23(6): 1232 - 1241. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Price CKI, there's more than one: casein kinase I family members in Wnt and Hedgehog signaling. Genes & Dev., February 15, 2006; 20(4): 399 - 410. [Abstract] [Full Text] [PDF] |
||||