Previous Article |
Table of Contents
| Next Article
From the Cover
BIOLOGICAL SCIENCES / MEDICAL SCIENCES
Identification of noninvasive imaging surrogates for brain tumor gene-expression modules






*Department of Radiology and **Center for Translational Medical Systems, University of California San Diego Medical Center, San Diego, CA 92103;
Department of Radiation Oncology, Stanford University School of Medicine, Stanford, CA 94305;
Department of Neuropathology, University of Texas M. D. Anderson Cancer Center, Houston, TX 77030;
Department of Radiology, Brown University, Providence, RI 02912; ¶Department of Neurological Surgery, Brain Tumor Research Center, University of California San Francisco Medical Center, San Francisco, CA 94143; and ||Department of Radiology, University of California San Francisco Medical Center, San Francisco, CA 94143
Communicated by Helen M. Ranney, University of California at San Diego, La Jolla, CA, February 9, 2008 (received for review October 16, 2007)
| Abstract |
|---|
|
|
|---|
cancer | genomics | glioblastoma multiforme | radiogenomics
Magnetic resonance imaging (MRI) is routinely used in the diagnosis, characterization, and clinical management of GBM (8). It is a powerful and noninvasive diagnostic imaging tool that allows global assessment of GBMs and their interaction with their local environment. In its ability to extract structural, compositional, physiological, and functional information, MRI captures multidimensional, in vivo portraits of GBMs. Interestingly, histologically similar tumors often demonstrate highly distinct imaging profiles on MRI (9). Recently, several studies have attempted to correlate imaging findings with molecular markers, but no consistent associations have emerged. and many of the imaging features that characterize tumors currently lack biological or molecular correlates (10–15). Much of the information encoded within neuroimaging studies therefore remains unaccounted for and incompletely characterized at the molecular level. We reasoned that the phenotypic diversity of GBM captured by neuroimaging reflects underlying inter- and intratumoral gene-expression differences and that these relationships could be uncovered by combining genome-scale gene expression and MRI. Here, we show that this approach reveals previously unknown associations between molecular properties of tumors and their appearance by radiologic imaging. Furthermore, we identified an imaging phenotype that is associated with overall survival of GBM patients.
| Results |
|---|
|
|
|---|
A high-level view of the data revealed a dense and robust association between radiophenotypes and GBM gene expression. Expression of 1,089 of the 2,188 cDNA clones previously shown to be differentially expressed among these GBMs significantly correlated with at least 1 of the 10 imaging phenotypes. Furthermore, 5 of the 10 imaging phenotypes were significantly associated with at least one of the seven gene-expression modules (Fig. 1A). The link between radiophenotype and gene expression appeared relatively insensitive to gene-expression module size, because we identified significant associations with modules as small as 10 clones (EGFR) and as large as 224 clones (immune cell infiltration). Most of the gene-expression signatures were best captured by unique imaging traits, suggesting specificity in the association between radiophenotypes and the underlying gene-expression signatures. Moreover, when one imaging trait was associated with more than one gene module, the modules displayed very similar gene-expression patterns (e.g., association of SVZ involvement with neuronal and glial gene clusters.) Thus, a relatively large fraction of the molecular heterogeneity of GBM could be captured by using only a small number of MR imaging features.
|
Similarly, a strong association between the mass effect radiophenotype and the proliferation gene-expression signature (P = 0.0017) was uncovered. The proliferation cluster contains genes involved in proliferation and cell-cycle progression (e.g., TOP2A, CDC2, and BUB1B) (20). As such, it serves as a molecular marker of the proliferation rate of tumor cells. As seen in Fig. 1C, hierarchical clustering of tumors based solely on the expression of this module resulted in segregation of the majority of low- and high-mass-effect tumors to opposite branches of the dendrogram. Interestingly, tumor size alone did not significantly correlate with the proliferation cluster (data not shown). Mass effect may be a better surrogate for tumor proliferation rate than tumor size because it incorporates the response of surrounding brain tissue to a rapidly growing tumor. Thus, clinically and biologically relevant gene-expression signatures can be assessed noninvasively by using specific imaging phenotypes.
Analysis of Spatial Gene-Expression Differences Within Tumors. Although intertumoral gene-expression differences among GBMs are significantly greater than intratumoral differences, we aimed to test whether we could image spatial differences in gene expression within a tumor using MRI. We therefore evaluated the imaging and gene-expression patterns from paired stereotactic biopsy samples from several GBMs. Tissue sample selection was guided by imaging, with samples obtained from CE and non-CE regions within the same tumor (3). Consistent with our previous analyses, genes within the hypoxia cluster were differentially expressed within the same tumor according to the relative extent of CE within each region (Fig. 2). This indicates that spatial gene-expression differences within tumors can be visualized by radiologic imaging.
|
2) (Fig. 3). This finding confirms the link between the C:N imaging trait and EGFR expression, indicates that this imaging phenotype is a surrogate for EGFR overexpression, and suggests that it may be possible to develop imaging-based predictors of treatment response.
|
|
Given the potential clinical utility of this finding, we tested our prognostic imaging marker on an independent set of 110 GBMs. Survival analysis confirmed the association of the infiltrative phenotype with poor prognosis (P < 3.1 x 10–7, Fig. 4C), with a median survival of 216 days for patients with the infiltrative phenotype compared with 390 days for those with the edematous phenotype. Further, tumors with the infiltrative phenotype were more likely to have multiple tumor foci compared with their edematous counterparts (P = 0.009). Our findings suggest that overexpression of the infiltrative radiophenotype-associated genes increases tumor cell invasion into the surrounding brain parenchyma, which is reflected by an infiltrative pattern of T2 hyperintense signal on MR images.
| Discussion |
|---|
|
|
|---|
Over the past 20 years, there has been exponential growth in the power and clinical utility of imaging modalities such as MRI to noninvasively diagnose and characterize disease and to guide clinical management. In parallel, the development of functional genomic tools such as DNA microarrays has provided powerful methods for dissecting the molecular basis of disease on a genome-wide level. One of the current limitations of many microarray-based studies continues to be the difficulty of translating molecular findings into clinically useful assays or interventions. As shown in this study, the fusion of imaging and functional genomic datasets offers the potential for more rapid clinical translation. This was highlighted by the identification of the "infiltrative" imaging phenotype, whose prognostic value was validated in a large independent dataset where it was shown to be a strong predictor of survival. This noninvasive prognostic biomarker may be useful in clinical management, particularly if the infiltrative and edematous subgroups are shown to differ in their susceptibility to therapies. Although these findings will need to be further characterized and validated, they demonstrate the power of the combined radiologic and genomic approach and provide a paradigm for rapidly identifying testable hypotheses in the clinical setting.
Other investigators have attempted to identify MRI characteristics of GBMs that correlate with patient outcome (23–26). Although no consistent association between imaging traits and survival has been found, tumor properties such as the amount of necrosis, the intensity of contrast enhancement, and the extent of peritumoral edema correlate with outcome in various datasets. The imperfect agreement among these studies is likely due at least in part to the generally small number of tumors examined and the use of differing selection criteria. The infiltrative phenotype that we describe here has not been previously examined in the context of GBM, and we therefore validated its association with survival in a large independent dataset. One of the strengths of our approach is that we can examine the list of genes whose expression correlates with the infiltrative phenotype to identify possible molecular explanations for the apparently increased invasiveness of these tumors. This list includes genes involved in tumor cell migration such as brevican (BCAN), TRIO (27), PTP4A3 (28), and ERF (29). Interestingly, BCAN has been documented to specifically promote glioma cell aggressiveness and invasion in an animal model (30). Enhanced invasion of normal brain and associated disruption of neurological functions caused by the expression of a subset of the infiltrative radiophenotype-associated genes may therefore explain the inferior survival of patients with infiltrative tumors and may suggest possible therapeutic avenues.
The approach used in this study is easily generalizable, because it could be applied to any disease for which both imaging and functional genomic data are available. Indeed, we have previously demonstrated that global gene-expression patterns of hepatocellular carcinoma are correlated with contrast-enhanced computed tomography phenotypes using a module networks algorithm (31), indicating that this approach can yield useful information in multiple cancer types. To better approximate potential clinical applications, we applied a more simplified and targeted supervised approach in the current study, allowing the direct association of imaging phenotypes with predefined gene-expression signatures. This allowed us to directly address whether known gene-expression programs are reflected by an imaging phenotype. Extension of this method to other tumor types and diseases will likely uncover associations between molecular properties and imaging characteristics of human diseases. Of note, once a link between an imaging phenotype and a molecular signature has been uncovered, it should be possible to reexamine imaging studies of previously treated patients (such as those on clinical trials) to assess whether the new biomarker predicts outcome or treatment response.
Although our approach offers a number of inherent advantages over current diagnostic molecular and imaging approaches, there are a number of important caveats. First, the dimensionality of imaging, although rapidly increasing over time, is still orders of magnitude lower than that of whole genome gene-expression profiling. Thus, the molecular detail of this approach will likely never match that of conventional molecular profiling approaches. However, the ability of this technique to identify imaging biomarkers for molecular features of tumors should scale as a function of the resolving capability of imaging. Although we uncovered a number of molecular associations with conventional MR images, we expect that application of more advanced imaging techniques (including new contrast agents, pulse sequences, and molecular imaging tools) (8, 32) should facilitate the identification of additional clinically relevant radiophenotypic surrogates for molecular signatures within GBM. Furthermore, the application of more quantitative image analysis tools should also allow for richer image feature extraction and should facilitate the standardization and adoption of these types of imaging biomarkers by decreasing the potential for interobserver bias. A second caveat is that our study was retrospective in nature and relied on previously acquired gene-expression data in a relatively small patient cohort. Thus, our findings will need to be further characterized and validated in future studies, ideally with uniformly treated patients for which gene-expression data are being prospectively generated.
In conclusion, this study demonstrates a simple, widely applicable method for discovering imaging biomarkers that are associated with underlying gene-expression signatures. Our approach facilitates the association of complex molecular signatures with readily identifiable imaging characteristics. Given the noninvasive nature of medical imaging and its wide use in clinical practice, there are a wide range of human disease processes to which this method could be applied. It is likely that identification of imaging phenotypes tied to distinct molecular phenotypes will help to advance individualized patient care.
| Methods |
|---|
|
|
|---|
23,000 elements (representing
18,000 unique UniGene clusters) by using standard methods. Array elements that varied at least 2-fold from the median on at least five microarrays were included in Fig. 1 (2,188 cDNA elements representing
1,800 UniGene clusters). Gene-expression clusters were extracted from the hierarchical cluster as described (3). Annotated Gene Modules. Gene-expression modules were curated from the 2,188-gene dataset as described (3). In total, seven annotated gene-expression modules were used in this study: epidermal growth factor receptor (EGFR) overexpression, hypoxia, extracellular matrix (ECM), immune cells, proliferation, glial, and neuronal.
MRI Image Analysis. All MRI exams were performed on a 1.5 T Signa Echospeed scanner (GE Medical Systems). The following 10 binary imaging traits were scored for each MRI study after IRB: contrast enhancement, necrosis, mass effect, pattern of T2 edema (infiltrative/edematous), cortical involvement, SVZ involvement, C:N ratio, contrast/T2 ratio, degree of T2 edema, and T2 heterogeneity. The edematous T2 pattern was defined as classic vasogenic edema of the nonenhancing portion of the tumor where the T2 abnormality has a "pseudopod" appearance and a generally uniform T2 hyperintense signal. The infiltrative pattern was defined as a diffuse, expansile, T2 abnormality beyond the solid enhancing portion of the tumor that is slightly brighter than normal brain and does not have the classic appearance of vasogenic edema. Complete definitions for each of the traits can be found in SI Methods. All images were evaluated by consensus in a blinded fashion by two board-certified radiologists (S.C. and M.D.K.) across each of the 10 imaging traits.
Bioinformatic Analyses. To assess the association between the imaging traits and gene modules, we used a two-step algorithm. In the first step, we identified sets of cDNA elements (i.e., genes) that were statistically significantly correlated with each imaging trait (hereafter referred to as trait-associated genes.) We thus evaluated the correlation of the log2(Cy5/Cy3) expression ratio for every cDNA clone with each of the imaging traits using a Spearman rank-correlation coefficient. To assess statistical significance and to control for multiple hypothesis testing, we generated 1,000 random permutations of the imaging trait values and recalculated the correlation coefficient for each cDNA element. The distribution consisting of these values was used to determine the correlation coefficient corresponding to a P value of 0.05. All cDNA elements with absolute real correlation coefficients greater than this cut-off were included in the trait-associated gene sets. In the second step, we determined the overlap of these trait-associated gene sets with the seven previously identified gene-expression clusters (or modules) using the hypergeometric distribution. To evaluate for statistical significance and to control for multiple hypothesis testing, we used the 1,000 random permutations from the above and identified 1,000 "random" trait-associated gene sets for each trait and permutation using the correlation coefficient cut-off identified in the first step. We then generated a null distribution of hypergeometric scores that was used to determine the final P value for the association between each of the seven gene modules and the trait-associated gene sets (Dataset S1).
Gene Ontology Enrichment Analysis. Gene Ontology enrichment analysis was performed for each imaging trait-associated gene set by using GoMiner (33).
EGFR Immunohistochemistry Validation Analysis. Paraffin-embedded tissues and MRI datasets were from the M. D. Anderson Cancer Center, University of Texas and were obtained with informed consent after approval of the Human Research Committee. To be included, tumors had to have preoperative, diagnostic-quality MRI scans displaying appreciable contrast-enhancing and necrotic components on postcontrast enhanced T1-weighted sequences. Antigen retrieval and immunostaining of EGFR in paraffin samples were performed as described (34) by using the mouse monoclonal antibody clone 528 (1:50 dilution; Oncogene Science). Scoring was semiquantitative and was performed by a single neuropathologist (K.D.A.).
Infiltrative/Edematous Validation Analysis. The independent validation dataset consisted of 110 GBM patients from Stanford University and the M. D. Anderson Cancer Center with clinical follow-up and appropriate MRI studies. All data were obtained with approval of the relevant Institutional Review Boards. MR images were scored for the infiltrative/edematous imaging trait as described above. Kaplan–Meier survival analysis was carried out by using WINSTAT (www.winstat.com).
| Acknowledgments. |
|---|
|
|
|---|
| Footnotes |
|---|

To whom correspondence should be addressed. E-mail: mkuo{at}ucsd.eduFreely available online through the PNAS open access option.
Author contributions: M.D. and M.D.K. designed research; M.D., C.N., D.S.W., S.M., M.J., Y.L., K.A., S.C., and M.D.K. performed research; C.N. contributed new reagents/analytic tools; M.D., C.N., M.J., and M.D.K. analyzed data; and M.D. and M.D.K. wrote the paper.
Conflict of interest statement: M.D.K. has filed a patent application in an area related to this field of work.
This article contains supporting information online at www.pnas.org/cgi/data/0801279105/DCSupplemental.
© 2008 by The National Academy of Sciences of the USA
| References |
|---|
|
|
|---|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||