Previous Article |
Table of Contents
| Next Article
COMMENTARY
Seeing is believing: Structure of the catalytic domain of HIV-1 integrase in complex with human LEDGF/p75
Laboratory of Molecular Biology, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, 5 Center Drive, Bethesda, MD 20892
Retroviruses covalently insert their genome into the DNA of the host cell and subsequently coopt cellular machinery for DNA replication, transcription, and protein expression (1). These viruses also exploit cellular proteins to assist in this stable insertion of their genetic material into the host genome, a process called integration. In a recent issue of PNAS, Cherepanov et al. (2) reported the first structure of a retroviral integrase, the viral enzyme that catalyzes integration, in complex with a host protein. This definitive structural evidence for an interaction between HIV-1 integrase and any other protein adds to the significant evidence for the role of host proteins in integration that has been accumulated over the past decade (ref. 3 and references therein).
The infectious particles of retroviruses, called virions, contain two RNA copies of their genomes. After viral entry and a series of poorly understood uncoating steps, the RNA is released into the host cytoplasm. Here the viral enzyme reverse transcriptase synthesizes a double-stranded DNA copy of the genome by using the RNA as a template. The next step is the hallmark of the retroviral life cycle: the viral cDNA is transported to the nucleus and is inserted into the host genome (1). Integration is required for infection and ensures the stable association of the viral genome in the host cell for subsequent generations. From its new position, the viral genome is transcribed, leading to the synthesis of viral proteins and full-length transcripts of the genome and, ultimately, to new virus particles.
The chemistry of integration is catalyzed by the viral enzyme integrase, many copies of which are found in the virion (1). The integration reaction has been successfully recapitulated in vitro with recombinant integrase and short DNA oligonucleotides representing the viral DNA ends and the target DNA. With this approach, the biochemical mechanism of integration has been elucidated. First, a pair of dinucleotides at the 3' ends of the viral DNA are cleaved, exposing the conserved CA sequence that marks the boundary between the host DNA and the integrated viral cDNA. Next, during strand transfer, the 3' hydroxyl groups are joined to opposite strands of the host DNA at sites separated by 5 bp in the case of HIV-1. Cellular enzymes repair the resulting intermediate to complete integration.
Integration takes place in a more complex environment than these biochemical assays suggest. Before integration, the viral cDNA is associated with a number of viral and cellular proteins in a large nucleoprotein assembly called the preintegration complex (PIC). Many of the protein components of the PIC are derived from the core of the infecting virion, but some are acquired from the cytoplasm of the infected cell. The viral proteins associated with the HIV-1 PICs include integrase, reverse transcriptase, matrix, and Vpr (46). In addition, the list of host proteins that have been reported to associate with HIV-1 PICs or to interact directly with HIV-1 integrase continues to grow. These proteins include INI1, HMGA1, BAF, EED, p300, and LEDGF/p75, the topic of this Commentary (reviewed in ref. 3). The role of most of these cellular proteins in integration is largely unknown.
This definitive structural study adds to the significant evidence for the role of host proteins in integration.
Expression of HIV-1 integrase in human cells from a synthetic gene led to the discovery that integrase stably associates with LEDGF/p75 (lens epithelium-derived growth factor/transcription coactivator p75) and that this protein stabilizes the association of integrase with DNA (7). The interaction between integrase and LEDGF/p75 was confirmed with recombinant proteins by solution methods, such as pull-down assays (810), gel filtration chromatography, and analytical ultracentrifugation (2), eliminating the possibility that the interaction is weak, indirect, or mediated by DNA or another protein. The integrase-interacting domain of LEDGF/p75 was mapped independently by two groups using cell-based (11) and in vitro and bioinformatics (9) methods. Cherepanov et al. (2) report the crystal structure of this domain and the catalytic subdomain of HIV-1 integrase. Both HIV-1 and feline immunodeficiency virus PICs can be immunoprecipated with anti-LEDGF/p75 antibodies, suggesting that LEDGF/p75 is indeed a component of these PICs (12). But what is the function of this interaction? The answer to this question is not clear, but many groups are currently investigating its possible effect on integrase activity, PIC nuclear import and trafficking, and integration site target selection.
Does LEDGF/p75 play a role in the catalytic steps of breaking and joining DNA? HIV-1 integrase carries out these reaction steps in vitro without the assistance of other proteins, but the reactions lack the full fidelity of integration in vivo. In particular, many reaction products result from insertion of only one viral DNA end into one strand of target DNA rather than concerted integration of pairs of viral DNA ends. LEDGF/p75 has been reported to stimulate integration in in vitro assays that do not distinguish single-end integration from concerted integration (7). It would be interesting to determine whether LEDGF/p75 influences the fidelity of HIV-1 DNA integration. However, other retroviral integrases, such as RSV, do not interact with LEDGF (13), so a potential role in the catalytic steps of integration is not a universal paradigm.
Another possible role for auxiliary PIC proteins, such as LEDGF/p75, is in nuclear import and targeting. Retroviral PICs must enter the host nucleus to access their target, and, because lentiviruses like HIV-1 efficiently infect nondividing cells, these PICs must in theory traverse intact nuclear membranes. Transport of particles into and out of the nucleus is controlled by the pore complexes found in the nuclear envelope, which permit passive diffusion of particles <9 nm. The HIV-1 PIC, which is significantly larger, must therefore be actively transported into the nucleus and harbor a nuclear localization signal (NLS) (14, 15). Where is the NLS in the HIV-1 PIC? Attempts to define an NLS that is required for nuclear import of HIV-1 PICs have yielded inconclusive results. Although some individual components of the PIC, such as integrase, matrix, and Vpr, carry potential NLS signals, none of these has been shown to be essential for the nuclear import of HIV-1 PICs. LEDGF/p75 is an alternative candidate for directing nuclear import of HIV-1 PICs, although the data are far from unambiguous.
When HIV-1 integrase is expressed in the absence of other viral proteins, it resides in the nucleus (7), where it forms a complex with LEDGF/p75. The nuclear distribution pattern of integrase mutants during interphase and of the integrase mutants binding to condensed chromosomes in mitosis correlates with their colocalization with LEDGF/p75 (8). RNA interference knockdown of LEDGF/p75 leads to the diffuse redistribution of integrase in cells (8). These data suggest that LEDGF/p75 could be the factor responsible for the nuclear import of integrase. To further address this question, Maertens et al. (16) used deletion and missence mutagenesis to isolate the NLS in LEDGF/p75; they found a canonical simian virus 40-like NLS that transferred nuclear import activity to an otherwise diffuse
-galactosidase enzyme. A single amino acid change in this NLS was sufficient to exclude mutant LEDGF/p75 from the nucleus and to abolish nuclear import of HIV-1 integrase (16). However, Q168A, an integrase mutation that disrupts the interaction with LEDGD/p75, fails to abolish nuclear import of HIV-1 integrase, although it blocks replication of the virus at the integration step (17). Furthermore, depletion of LEDGF/p75 by small inhibitory RNA had no effect on viral replication or nuclear import of PICs, suggesting that if LEDGF/p75 is indeed involved in this process it is by a redundant mechanism (12).
The final role that PIC proteins can play in integration, target-site selection, has received considerable interest of late. Integration confers the potential for long persistence of the viral genome after infection. Whether the integrated provirus is a harmless or deadly addition to the genome depends on the site of integration and the potential for transcription. The mechanisms that regulate transcriptional activity of proviral DNA are not well understood but are likely to involve not only positional effects due to the chromatin environment at the site of integration but also epigenetic mechanisms. Recent findings that HIV-1, murine leukemia virus, and avian sarcoma virus differ in their preference for sites of integration (1822) have led to considerable interest in the underlying mechanisms of targeting and the possible roles of cellular proteins. LEDGF/p75 possesses an additional function that may be relevant to target selection: it tethers integrase to DNA and chromatin (7, 12, 17). In fact, the IN Q168A integrase mutation that disrupts the interaction with LEDGF/p75 and blocks viral replication at the integration step prevents chromosomal tethering of integrase (17). LEDGF/p75 does not interact with integrases from Moloney murine leukemia virus or RSV (13), a feature that may or may not be related to a potential role in target site selection. It would be interesting to know whether depletion of LEDGF/p75 leads to a change in the distribution of HIV-1 integration sites.
The work of Cherepanov et al. (2) represents a major advance in the field of retroviral integration because of the unambiguous nature of the interaction and the unprecedented crystal structure of a complex between integrase and any other protein. Like all breakthrough results, this structure and the other experimental work present many unresolved questions. Beyond the remaining functional questions outlined above, some structural issues need to be addressed. It would be interesting to know the structural details of the interaction between LEDGF/p75 and the N-terminal domain of integrase, particularly in light of the fact that the structure of full-length integrase has yet to be determined. The interaction of integrase and LEDGF/p75 may represent the best hope for obtaining structural information on full-length integrase.
Finally, beyond the intellectually compelling aspects of this problem is the fact that integrase, as one of three virally encoded enzymes, is an important target for developing antiretroviral compounds. Integrase inhibitors that specifically block strand transfer are in advanced stages of development (23). A new frontier for integrase-targeting drugs would be ones that inhibit specific interactions in the PIC necessary for integration in vivo. Mutations of the LEDGF/p75 binding site on integrase have been shown to block viral replication, suggesting that modulation of this binding site by a small molecule or peptide could have serious effects on infection, regardless of the function of LEDGF/p75.
Acknowledgements
This research was supported in part by the Intramural Research Program of the National Institutes of Health, National Institute of Diabetes and Digestive and Kidney Diseases. C.M.B. is a Paul Sigler/Argoron Fellow of the Helen Hay Whitney Foundation.
Footnotes
Author contributions: C.M.B. and R.C. wrote the paper.
Conflict of interest statement: No conflicts declared.
See companion article on page 17308 in issue 48 of volume 102.
* To whom correspondence should be addressed. E-mail: bobc{at}helix.nih.gov.
References
Companion article to this Commentary:
This article has been cited by other articles in HighWire Press-hosted journals:
![]() |
F. Turlure, G. Maertens, S. Rahman, P. Cherepanov, and A. Engelman A tripartite DNA-binding element, comprised of the nuclear localization signal and two AT-hook motifs, mediates the association of LEDGF/p75 with chromatin in vivo Nucleic Acids Res., March 20, 2006; 34(5): 1653 - 1665. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||