Previous Article |
Table of Contents
| Next Article
BIOLOGICAL SCIENCES / BIOCHEMISTRY
Structure of aminopeptidase N from Escherichia coli suggests a compartmentalized, gated active site
Howard Hughes Medical Institute and Institute of Molecular Biology and Department of Physics, University of Oregon, Eugene, OR 97403-1229
Contributed by Brian W. Matthews, July 20, 2006
| Abstract |
|---|
|
|
|---|
Aminopeptidase N from Escherichia coli is a major metalloprotease that participates in the controlled hydrolysis of peptides in the proteolytic pathway. Determination of the 870-aa structure reveals that it has four domains similar to the tricorn-interacting factor F3. The thermolysin-like active site is enclosed within a large cavity with a volume of 2,200 Å3, which is inaccessible to substrates except for a small opening of approximately 810 Å. The substrate-based inhibitor bestatin binds to the protein with minimal changes, suggesting that this is the active form of the enzyme. The previously described structure of F3 had three distinct conformations that were described as "closed," "intermediate," and "open." The structure of aminopeptidase N from E. coli, however, is substantially more closed than any of these. Taken together, the results suggest that these proteases, which are involved in intracellular peptide degradation, prevent inadvertent hydrolysis of inappropriate substrates by enclosing the active site within a large cavity. There is also some evidence that the open form of the enzyme, which admits substrates, remains inactive until it adopts the closed form.
bestatin | closed active site | thermolysin-like protease
Aminopeptidase N (PepN) (3, 4, 6, 7) is a zinc-dependent M1-class metalloenzyme classified as a gluzincin based on the conserved active site sequence (HEXXH-X18-E) (3, 4, 8). PepN is the major aminopeptidase in bacteria and accounts for 99% of the hydrolysis of substrates with Ala at the N terminus (9). Although bacterial PepNs were initially identified as Ala aminopeptidases, subsequent studies demonstrated that Escherichia coli, Lactococcus lactis, and Streptococcus thermophilus PepNs have the highest specificities for the basic amino acids Arg and Lys, followed by hydrophobic/uncharged residues, like Ala, Leu, Met, and Phe (4, 10). There are conflicting reports as to whether PepN has endopeptidase as well as exopeptidase activity (3, 9, 11). Based on biochemical studies, Chandu et al. (3) have predicted two distinctive active sites, one for the well established exopeptidase activity and a second for the presumed endopeptidase activity of E. coli PepN (ePepN).
Although ePepN is the major aminopeptidase in E. coli, it is not essential (12, 13). The homologous human enzyme hPepN is anchored in the plasma membrane and serves as a receptor for coronavirus and helps in its internalization (14). Elevated levels of expression of this enzyme have been associated with several cancers (15).
We have undertaken a biochemical and structural investigation of PepN to elucidate its selectivity and catalytic activity. We present the structure of ePepN in the native and the bestatin-bound forms. Based on structural and sequence comparisons, together with model-building studies, we propose a mechanism that involves large domain movements between a "closed" active form and an "open" inactive form.
| Results and Discussion |
|---|
|
|
|---|
-sheet with two smaller
-sheets packed against it on the same face. Residues Pro-5Asp-17 at the N terminus are well ordered and extend to contact domains II and IV. Whereas the Ala-7Arg-10 region forms several hydrogen bonds with the region of Asp-367Glu-371 near the surface of domain II, Pro-5 forms a hydrophobic stacking interaction with the side chain of Tyr-862 in domain IV.
|
15
18 and helices
1 and
2, and the second subdomain is composed of helices
3
8. These two subdomains form a deep groove that extends across the entire domain and includes the active site (Fig. 1B). The irregular structure in thermolysin that accommodates three of the four calcium ions is replaced by the long
4 helix (Arg-337Ala-357) in ePepN. Because of this change, the active-site cavity of ePepN widens where the C-terminal region of the substrate is bound (Fig. 1B).
In common with domain I, domain III (residues 444545) also consists entirely of
-strands (
19
25), which form two parallel
-sheets that interact with domains II and IV.
Domain IV (residues 546870), the largest, is composed of eight pairs of
-helices (
9
24) arranged into two layers forming an overall cone-shaped superhelical structure. The inner and outer layers of the
-helices are arranged in an antiparallel manner. At the vertex of the cone there is an opening with a diameter of 810 Å. Domain IV also can be divided into two subdomains with an equal number of
-helical pairs centered at Leu-725. Together, domains I and IV seal off access to the active site. The only opening to the active site in the present crystal form is the pore on the top of the cone of domain IV (Fig. 1A).
Bestatin Binding.
ePepN, like thermolysin, belongs to the gluzincin family of proteins, which has a common HEXXHX18E zinc-binding motif in the active site. In ePepN, this motif is H297EYFHX17KE320. His-297, His-301, and Glu-320 along with a water molecule bind to the zinc ion with tetrahedral geometry. As anticipated for the binding of bestatin to a metalloprotease, the hydroxyl group (O2) and the carbonyl group (O3) are coordinated to the zinc ion. The
-amino group points into a depression formed by Glu-121, Glu-264, and Glu-320 and forms hydrogen bonds with all three acidic groups (Fig. 2). One side of the phenylalanyl side chain of the inhibitor stacks against the side chain of Tyr-376. On the other side, Met-260, Met-263, and the CB and CG atoms of Glu-121 form a hydrophobic pocket. Residues 261265, which include Met-263, form the so-called GAMEN substrate specificity motif common to many exopeptidases (17). The main-chain carbonyl group (O3) of bestatin, apart from interacting with the zinc ion, also forms a hydrogen bond (2.6 Å) with the hydroxyl of Tyr-381. The backbone carbonyl of Ala-262 and the side-chain carboxylate of Glu-298 interact with the N1 atom. Main-chain amides of Gly-261 and Ala-262 (from the G261AMEN motif) form hydrogen bonds with one of the carboxylate atoms (O4) of the C terminus of bestatin (Fig. 2). Two water molecules form hydrogen bonds with the other oxygen atom (O1) of the carboxylate. Arg-783 and Arg-825 from domain IV are pointed into the active site and interact with the C terminus of the inhibitor through these water molecules. The leucyl side chain of the inhibitor points into a rather open hydrophobic cleft lined by Val-294, Val-324, Tyr-381, and the lateral face of the guanidyl group of Arg-293. Binding of bestatin does not change the overall structure of ePepN, although several residues that line the binding pocket undergo changes in their side-chain conformations. Met-260, which makes a hydrophobic contact with the phenyl ring of the inhibitor, moves away from its position in the native structure to make way for the bestatin. Asn-373 undergoes a change in side-chain conformation that brings its side-chain carbonyl close to the phenyl group (3.4 Å). Tyr-381, which forms a short hydrogen bond, moves away by 0.4 Å from the active site. The movements of Met-260 and Asn-373 induce changes (
2.0 Å) in the side chains of the residues in domain I (Glu-107) and domain IV (Gln-821), which are in close proximity.
|
Closed Conformation.
Similar to thermolysin, the catalytic domain of ePepN is bilobal with a pronounced active site cleft at the junction of the two lobes (Fig. 1B). This cleft is wider and longer in ePepN because the calcium-binding loop is replaced with an
-helix as described above. Large regions of the surface of the catalytic domain interact with domains I and IV (1,966 Å2 and 2,112 Å2, respectively) (Fig. 3). The interactions between domains I and II are largely hydrophobic, whereas those between domains II and IV are hydrophilic. Domain I forms a wall-shaped surface that stacks against the active site on the left near the N terminus of the substrate-binding region to which it donates Glu-121. Domain IV adopts the shape of a cone, which covers the active site region in domain II.
|
9
16) of domain IV contacts the far right side of the active site, whereas the second subdomain contacts the left. The second subdomain of domain IV also interacts with domain I. The combined contacts of domains I and IV with the catalytic domain results in the complete occlusion of the active site. As a consequence of this occlusion, a chamber of
2,200 Å3 is created that is lined by residues from domains I, II, and IV. The only opening to the active site is at the vertex of the cone of domain IV, which has a cross-section of 8 x 10 Å (distances measured between
-carbon atoms) (Fig. 1A). Chandu et al. (3) have suggested that ePepN has endopeptidase activity at a site distinct from the exopeptidase site. Inspection of the ePepN structure suggests that it has only a single active site. To determine whether endopeptidase activity might be feasible at this site we used model building to place the extended thermolysin inhibitor carbobenzoxy-Gly-P-Leu-Leu (16) into the ePepN structure. In such a model-built complex, there are severe steric clashes with domain I. In principle these clashes might be alleviated by moving domain I away from domain II, although this would also pull Glu-121 out of the active site. Possibly, this movement might change the specificity of substrate binding to accept the neutral amide of an extended substrate rather than a positively charged N terminus. Although there is substantial evidence for domain movement in ePepN and its relatives (see below), there is no suggestion to date that domain I can move relative to domain II.
Relationship of ePepN to Other Proteases: Closed vs. Open Structures. A search of the Protein Data Bank with DALI (19) showed that the structure of ePepN is most similar to that of the tricorn-interacting factor F3 (20). ePepN is also related to the three-domain structure of leukotrine 4A hydrolase (21).
F3 has 21% sequence identity with ePepN and, in common with ePepN, has a four-domain structure. In the crystallographic analysis of F3, three independent copies of the molecule were observed (20). In each of these copies, domains IIII plus the first half of domain IV have essentially the same structure. The second half of domain IV, however, showed relative movements of up to 15 Å. Because of these movements the three versions of the structure were classified as closed, intermediate, and open. If, for example, the open form is compared with ePepN (Fig. 4), 437 C
atoms align within 1.8 Å. Comparing individual domains, domain I has 160 C
atoms that align within 1.3 Å, domain II has 220 C
atoms that align within 1.7 Å, domain III has 65 atoms that align within 2.2 Å, and domain IV has 188 atoms that align within 4.0 Å.
|
|
In going from the ePepN structure to the open form of F3, the last five helical pairs of domain IV move away from domain II (the catalytic domain) by 1221 Å (Fig. 4 and Table 1). This movement results in a wide opening near the surface of the protein that extends into the active site, which is at a depth of
28 Å. In the open form of F3, this opening measures
18 Å in one direction and 30 Å in the other. In going to the intermediate and closed forms, this opening is reduced in size to 15 x 30 Å and 13 x 30 Å, respectively. In the case of ePepN, the corresponding opening is, at maximum,
4 Å (all distances being measured between
-carbon atoms). Thus, in the ePepN structure, the active-site region is much more closed than in any of the F3 variants, including the closed form. The conformational changes that are observed for F3 indicate that the molecule is capable of substantial dynamic behavior. Extending the general argument made by Kyrieleis et al. (20), it is tempting to argue that ePepN also undergoes a conformational change between a closed and open form. The closed form corresponds to the crystal structure, and the open form could be similar to any of the forms of F3.
It was noted above that a loop including Glu-121 of ePepN extends from domain I into the domain II active site. In so doing, the Glu-121Ala-122 peptide bond adopts a cis conformation. Similarly, Glu-136 of leukotrine 4A hydrolase points into the active site and the adjacent peptide bond is cis (21). There is a Glu residue in the same region of the F3 structure, but it does not point into the active site and its main chain has a normal trans configuration (20, 23).
Is the Open Substrate-Binding Site Inactive?
The catalytic activity of proteases such as PepN needs to be carefully regulated. On one hand, the active site needs to be sufficiently (or transiently) open to admit appropriate substrates. On the other hand, an open active site might lead to hydrolysis of unintended substrates. It is possible that ePepN may open to admit substrates and to release products but that this open form is catalytically compromised. As discussed above, the crystal structure of ePepN corresponds to a closed form in which the active site appears to be largely if not entirely inaccessible to bulk solution. This form of the enzyme binds bestatin and combines all of the elements expected for a fully functional enzyme. In analogy with thermolysin (16), these elements include Glu-264, which helps to activate the catalytic water or hydroxide ion, and Tyr-381, which helps stabilize the tetrahedral transition state. In thermolysin, the analogous residues are Glu-240 and His-231, with the addition of Tyr-157. F3 has Tyr-351 at a site analogous to Tyr-381, as in ePepN. However, in all three structures of F3, the Tyr-351 side chain is rotated away from the zinc ion, and, in addition, the helix harboring Tyr-351 is shifted
2.7 Å away from the zinc ion (Fig. 5). This shift suggests that all three versions of the F3 crystal structure represent inactive conformation. Kyrieleis et al. (20) suggest that the active conformation is induced by substrate binding. We propose that it is not substrate binding, per se, that activates F3 but rather a further closing of the structure toward the conformation seen for ePepN. Comparison of the structure of F3 and ePepN suggests that this closing step would induce a bend of
30° in the central helix (
4) of the thermolysin-like domain. This bend would bring the C terminus of the
4 helix (residues Gly-319Ser-325 in F3) closer to the active site and, in concert, cause a 22° bend in helix
5, which would also bring its N terminus closer to the active site (residues Glu-348Gly-352) (Fig. 5). As a consequence of this bending, the backbone of Tyr-351 in F3 would be brought
2 Å closer to the active site and the side chain would be directed toward the zinc ion. Requiring that the active site region be fully closed to confer catalytic activity would avoid inappropriate cleavage of proteins or peptides.
|
810 Å, suggests that it might better allow smaller products to leave rather than longer substrates to enter. A "pore" with a maximal size of 10 Å has been proposed to serve as an entry point to the peptides in the dodecameric tetrahedral-shaped aminopeptidase (24). Such an opening into the active site chamber also has been observed in other multimeric protease structures (25). If the small opening acts as an entry point for peptides, the ePepN structure represents a class of protease that has a buried active site in a monomeric form. | Materials and Methods |
|---|
|
|
|---|
-D-thiogalactoside. Cells were harvested after 12 h by centrifuging at 8,000 x g for 20 min, followed by resuspension in +T/G buffer (50 mM Hepes, pH 8.0/0.5 M KCl/10% glycerol/0.1% Triton X-100/5 mM imidazole) and lysed by being passed through a French press. After centrifugation at 17,000 x g for 30 min, the supernatant was applied to a talon column (cobalt affinity resin; BD Biosciences, Franklin Lakes, NJ), which was preequilibrated with +T/G buffer. Continued application of +T/G buffer was used to wash the column until the absorption at A280 reached the baseline. The column was further washed with T/G buffer (50 mM Hepes, pH 8.0/0.5 M KCl/5 mM imidazole). Pure protein was eluted with 100 mM imidazole in T/G buffer at 15 mg/ml and dialyzed twice in 4 liters of 25 mM Hepes, pH 7.5/150 mM KCl. No further purification was done either for the enzyme activity assays or for crystallization. A similar method was used to purify the selenomethionine protein. We obtained an average of 300 mg of protein from 4 liters of culture for the native enzyme. For the selenomethionine protein, the yield was
100 mg. The terminal His-tag was left intact for structural studies. Protein concentration was adjusted to 11 mg/ml before setting up crystallization screens at 25°C. Initial crystals appeared within 12 h. After optimization, crystals in trigonal space group P3121 were obtained in hanging drops with 5 µl of reservoir solution (1.8 M sodium malonate, pH 7.5) (Table 2). Crystals with an average size of 150 µm3 were equilibrated for 5 min in 22% glycerol/2.0 M sodium malonate before being frozen in liquid nitrogen. Selenomethionine crystals were also grown and frozen in similar conditions. Crystals in complex with bestatin were obtained by cocrystallization under similar conditions in the presence of the ligand at 1 mM.
|
Structure Solution and Refinement. The SHELX (27) program suite was used to search for the positions of the selenium atoms. Of the expected 21 atoms, 20 were located in anomalous difference maps along with a peak for the active site zinc. Automated model building in the 1.65-Å resolution map was done by using ArpWarp (28). The resulting model was examined and completed in O and COOT (23, 29). Structure refinement to 1.65-Å resolution (Table 1) was done by using CNS and the CCP4 program suite (30, 31). Geometric parameters for bestatin were obtained from the HIC-UP server (32). The quality of the final models was judged with PROCHECK (33). The DALI server was used to search for the structural similarities (19). Structure figures were generated by using the program PYMOL (34).
Note. While this manuscript was in preparation, the crystallization of ePepN was reported (35), and the coordinates for PepN from Neisseria meningitides were deposited in the Protein Data Bank (ID code 2GTQ).
| Acknowledgements |
|---|
|
|
|---|
| Footnotes |
|---|
Abbreviations: PepN, aminopeptidase N; ePepN, Escherichia coli PepN.
*To whom correspondence should be addressed. E-mail: brian{at}uoxray.uoregon.edu
Author contributions: A.A. designed research; A.A. and L.G. performed research; and A.A. and B.W.M. wrote the paper.
Conflict of interest statement: No conflicts declared.
Data deposition: The atomic coordinates have been deposited in the Protein Data Bank, www.pdb.org (PDB ID codes 2HPT and 2HPO).
© 2006 by The National Academy of Sciences of the USA
| References |
|---|
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||