The generation and utilization of a cancer-oriented representation of the human transcriptome by using expressed sequence tags

  1. Helena Brentani (a),
  2. Otávia L. Caballero (a),
  3. Anamaria A. Camargo (a),
  4. Aline M. da Silva (b),
  5. Wilson Araújo da Silva, Jr. (c),
  6. Emmanuel Dias Neto (d),
  7. Marco Grivet (e),
  8. Arthur Gruber (f),
  9. Pedro Edson Moreira Guimaraes (d),
  10. Winston Hide (g),
  11. Christian Iseli (h),
  12. C. Victor Jongeneel (h),
  13. Janet Kelso (g),
  14. Maria Aparecida Nagai (i),
  15. Elida Paula Benquique Ojopi (d),
  16. Elisson C. Osorio (a),
  17. Eduardo M. R. Reis (b),
  18. Gregory J. Riggins (j),
  19. Andrew John George Simpson (a, k),
  20. Sandro de Souza (a),
  21. Brian J. Stevenson (h),
  22. Robert L. Strausberg (l),
  23. Eloiza H. Tajara (m),
  24. Sergio Verjovski-Almeida (b),
  25. The Human Cancer Genome Project/Cancer Genome Anatomy Project Annotation Consortium*, and
  26. The Human Cancer Genome Project Sequencing Consortium
  1. (i) Laboratorio de Genética Molecular do Cancer, Departamento de Radiologia, Universidade de São Paulo, Travessa da Rua Dr. Ovídeo Pires de Campos S/N, 4° andar, 05403-010, São Paulo, SP, Brazil; (b) Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, 05508-900, São Paulo, SP, Brazil; (m) Departamento de Biologia, Instituto de Biociências, Letras e Ciências Exatas, Universidade Estadual Paulista, 15054, São José do Rio Preto, SP, Brazil; (a) Ludwig Institute for Cancer Research, Rua Professor Antonio Prudente 109, 4° andar, 01509-010, São Paulo, SP, Brazil; (f) Departamento de Patologia, Faculdade de Medicina Veterinária e Zootecnia, Universidade de São Paulo, Avenida Professor Orlando Marques de Paiva 87, 05508-000, São Paulo, SP, Brazil; (c) Fundação Hemocentro de Ribeirão Preto, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Rua Tenente Catão Roxo 2501, 14051-140, Ribeirão Preto, SP, Brazil; (d) Laboratory of Neurosciences (LIM-27), Instituto de Psiquiatria, Faculdade de Medicina, Universidade de São Paulo, Rua Dr. Ovidio de Campos, S/N, 05403-010, São Paulo, SP, Brazil; (l) National Cancer Institute, Bethesda, MD 20892; (j) Duke University Medical Center, Durham, NC 27710; (g) South African National Bioinformatics Institute, University of the Western Cape, Private Bag X17, 7535 Bellville, South Africa; (h) Office of Information Technology, Ludwig Institute for Cancer Research, CH-1066 Épalinges, Switzerland; and (e) Centro de Estudo de Telecomunicações-PUC, Rua Marquês de São Vicente, 225, 22453-900, Rio de Janeiro, RJ, Brazil
  1. Contributed by Walter Bodmer, June 12, 2003

Abstract

Whereas genome sequencing defines the genetic potential of an organism, transcript sequencing defines the utilization of this potential and links the genome with most areas of biology. To exploit the information within the human genome in the fight against cancer, we have deposited some two million expressed sequence tags (ESTs) from human tumors and their corresponding normal tissues in the public databases. The data currently define ≈23,500 genes, of which only ≈1,250 are still represented only by ESTs. Examination of the EST coverage of known cancer-related (CR) genes reveals that <1% do not have corresponding ESTs, indicating that the representation of genes associated with commonly studied tumors is high. The careful recording of the origin of all ESTs we have produced has enabled detailed definition of where the genes they represent are expressed in the human body. More than 100,000 ESTs are available for seven tissues, indicating a surprising variability of gene usage that has led to the discovery of a significant number of genes with restricted expression, and that may thus be therapeutically useful. The ESTs also reveal novel nonsynonymous germline variants (although the one-pass nature of the data necessitates careful validation) and many alternatively spliced transcripts. Although widely exploited by the scientific community, vindicating our totally open source policy, the EST data generated still provide extensive information that remains to be systematically explored, and that may further facilitate progress toward both the understanding and treatment of human cancers.

Footnotes

  • k To whom correspondence should be addressed. E-mail: asimpson{at}licr.org.

  • * The Human Cancer Genome Project/Cancer Genome Anatomy Project Annotation Consortium: Marcio Luis Acencio (n), Mário Henrique Bengtson (o), Fabiana Bettoni (p), Walter F. Bodmer (q), Marcelo R. S. Briones (r), Luiz Paulo Camargo (s), Webster Cavenee (t), Janete M. Cerutti (u), Luís Eduardo Coelho Andrade (v), Paulo César Costa dos Santos (n), Maria Cristina Ramos Costa (w), Israel Tojal da Silva (w), Marcos Roberto H. Estécio (x), Karine Sa Ferreira (w), Frank B. Furnari (t), Milton Faria, Jr. (s), Pedro A. F. Galante (p), Gustavo S. Guimaraes (y), Adriano Jesus Holanda (w), Edna Teruko Kimura (z), Maarten R. Leerkes (p), Xin Lu (aa), Rui M. B. Maciel (u), Elizabeth A. L. Martins (bb), Katlin Brauer Massirer (o), Analy S. A. Melo (r), Carlos Alberto Mestriner (cc), Elisabete Cristina Miracca (n), Leandro Lorenco Miranda (s), Francisco G. Nobrega (dd), Paulo S. Oliveira (p), Apuã C. M. Paquola (ee), José Rodrigo C. Pandolfi (cc), Maria Inês de Moura Campos Pardini (ff), Fabio Passetti (p), John Quackenbush (gg), Beatriz Schnabel (r), Mari Cleide Sogayar (o), Jorge E. Souza (p), Sandro R. Valentini (cc), and Andre C. Zaiats (p).

  • The Human Cancer Genome Project Sequencing Consortium: Elisabete Jorge Amaral (x), Liliane A. T. Arnaldi (u), Amélia Goes de Araújo (w), Simone Aparecida de Bessa (n), David C. Bicknell (q), Maria Eugenia Ribeiro de Camaro (y), Dirce Maria Carraro (p), Helaine Carrer (hh), Alex F. Carvalho (p), Christian Colin (o), Fernando Costa (ii), Cyntia Curcio (z), Ismael Dale Cotrim Guerreiro da Silva (w), Neusa Pereira da Silva (v), Márcia Dellamano (p), Hamza El-Dorry (kk), Enilza Maria Espreafico (ll), Ari José Scattone Ferreira (kk), Cristiane Ayres Ferreira (w), Maria Angela H. Z. Fortes (mm), Angelita Habr Gama (nn), Daniel Giannella-Neto (mm), Maria Lúcia C. C. Giannella (mm), Ricardo R. Giorgi (mm), Gustavo Henrique Goldman (oo), Maria Helena S. Goldman (pp), Christine Hackel (y), Paulo Lee Ho (bb), Elza Myiuki Kimura (qq), Luiz Paulo Kowalski (rr), Jose E. Krieger (ss), Luciana C. C. Leite (bb), Ademar Lopes (rr), Ana Mercedes S. C. Luna (mm), Alan Mackay (tt), Suely Kazue Nagahashi Mari (n), Adriana Aparecida Marques (w), Waleska K. Martins (p), André Montagnini (rr), Mario Mourão Neto (rr), Ana Lucia T. O. Nascimento (bb), A. Munro Neville (uu), Marina P. Nobrega (dd), Mike J. O'Hare (tt), Audrey Yumi Otsuka (vv), Anna Izabel Ruas de Melo (p), Maria Luisa Paçó-Larson (ww), Gonçalo Guimarães Pereira (ii), Neusa Pereira da Silva (v), João Bosco Pesquero (jj), Juliana Gilbert Pessoa (jj), Paula Rahal (x), Claudia Aparecida Rainho (xx), Vanderlei Rodrigues (yy), Silvia Regina Rogatto (xx), Camila Malta Romano (zz), Janaína Gusmão Romeiro (x), Benedito Mauro Rossi (rr), Monica Rusticci (n), Renata Guerra de Sá (yy), Simone Cristina Sant' Anna (qq), Míriam L. Sarmazo (x), Teresa Cristina de Lima e Silva (y), Fernando Augusto Soares (rr), Maria de Fátima Sonati (qq), Josane de Freitas Sousa (ll), Diana Queiroz (y), Valéria Valente (ww), André Luiz Vettore (p), Fabiola Elizabeth Villanova (vv), Marco Antonio Zago (w), and Heloisa Zalcberg (p).

    (n) Laboratorio de Genética Molecular do Cancer, and (nn) Department of Gastroenterology, Faculdade de Medicina, Universidade de São Paulo, Travessa da Rua Dr. Ovídeo Pires de Campos S/N, 4° andar, 05403-010, São Paulo, SP, Brazil; (ee) Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, 05508-900, São Paulo, SP, Brazil; (bb) Instituto Butantan, Avenida Vital Brazil 1500, 05503-900, São Paulo, Brazil; (ww) Departamento de Biologia Celular e Molecular e de Bioagentes Patogênicos, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, 14049-900, Ribeirão Preto, SP, Brazil; (u) Laboratory of Molecular Endocrinology, Department of Medicine, Federal University of São Paulo, Rua Pedro de Toledo 781, 12th Floor, 04023-039, São Paulo, SP, Brazil; (o) Chemistry Institute, University of São Paulo, 05513-970, São Paulo, SP, Brazil; (yy) Departamento de Bioquimica e Imunologia, and (ll) Departamento de Biologia Celular e Molecular e Bioagentes Patogênicos, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, 14049-900, Ribeirão Preto, SP, Brazil; (jj) Department of Biophysics, and (r) Departamento de Microbiologia, Imunologia, e Parasitologia, Universidade Federal de São Paulo, Rua Botucatu, 862 3° andar, 04023-062, São Paulo, SP, Brazil; (ff) Laboratório de Biologia Molecular, Hemocentro, Faculdade de Medicina, Universidade Estadual Paulista, 18618-000, Botucatu, SP, Brazil; (s) Departamento de Bioinformática-UNAERP, Universidade de Ribeirao Preto, 14096-380, Ribeirão Preto, SP, Brazil; (z) Departamento de Histologia e Embriologia, Instituto de Ciencias Biomedicas, Universidade de São Paulo, Avenida Prof Lineu Prestes 1524, 05508-900, São Paulo, Brazil; (ss) Department of Medicine, Laboratory of Genetics and Molecular Cardiology, Heart Institute (InCor), Universidade de São Paulo, 05403-000, São Paulo, SP, Brazil; (x) Departamento de Biologia, Instituto de Biociências, Letras e Ciências Exatas, Universidade Estadual Paulista, 15054, São José do Rio Preto, SP, Brazil; (qq) Department of Clinical Pathology, School of Medical Sciences, State University of Campinas-UNICAMP, 13083-970 Campinas, SP, Brazil; (p) Ludwig Institute for Cancer Research, Rua Professor Antonio Prudente 109, 4° andar, 01509-010, São Paulo, SP, Brazil; (rr) Fundação Antonio Prudente, Hospital do Câncer, Rua Professor Antonio Prudente 211, 01509-900, São Paulo, SP, Brazil; (zz) Departamento de Patologia, Faculdade de Medicina Veterinária e Zootecnia, Universidade de São Paulo, Avenida Professor Orlando Marques de Paiva 87, 05508-000, São Paulo, SP, Brazil; (w) Fundação Hemocentro de Ribeirão Preto, Faculdade de Medicina de Ribeirão Preto, Universidade de São Paulo, Rua Tenente Catão Roxo 2501, 14051-140, Ribeirão Preto, SP, Brazil; (dd) Instituto de Pesquisa e Desenvolvimento, Universidade do Vale do Paraíba, 12244-000, São José dos Campos, SP, Brazil; (hh) Department of Biological Sciences, Escola Superior de Agricultura “Luiz de Queiroz,” Universidade de São Paulo, 13418-900, Piracicaba, SP, Brazil; (v) Departamento de Reumatologia, Universidade Federal de São Paulo, Rua Botucatu 740, 04023-062, São Paulo, SP, Brazil; (kk) Department of Biochemistry, Institute of Chemistry, University of Sao Paulo, Avenida Professor Lineu Prestes 748, 05508-900, São Paulo, SP, Brazil; (cc) Departamento de Ciências Biológicas, Faculdade de Ciências Farmacêuticas de Araraquara, Universidade Estadual Paulista, 14801-902, São Paulo, Brazil; (xx) Departamento de Genética, Instituto de Biociências, Universidade Estadual Paulista, 18618-970, Botucatu, SP, Brazil; (vv) Gynecology Department, Federal University of Sao Paulo, Rua Botucatu 740, 04023-062, São Paulo, SP, Brazil; (oo) Faculdade de Ciencias Farmaceuticas de Ribeirao Preto, Universidade de São Paulo, Avenida do Cafe S/N, 14040-903, Ribeirao Preto, SP, Brazil; (pp) Faculdade de Filosofia, Ciências e Letras, Universidade de São Paulo, Avenida Bandeirantes, 3900, 14040-901, Ribeirão Preto, SP, Brazil; (mm) Laboratory for Cellular and Molecular Endocrinology (LIM-25/HCFMUSP), School of Medicine, Universidade de São Paulo, Avenida Dr. Arnaldo, 455 no. 4305, 01246-903, São Paulo, SP, Brazil; (y) Departamento de Genética e Evolução, State University of Campinas-UNICAMP, 13083-970, Campinas, SP, Brazil; (ii) National Cancer Institute, Bethesda, MD 20892; (q) Cancer Research U.K. Cancer and Immunogenetics Laboratory, Institute of Molecular Medicine, University of Oxford, Oxford OX3 9DS, United Kingdom; (t) Ludwig Institute for Cancer Research, Department of Medicine, Center for Molecular Genetics, University of California at San Diego, La Jolla, CA 92093-0660; (aa) Ludwig Institute for Cancer Research, Imperial College School of Medicine, St. Mary's Campus, Norfolk Place, London W2 1PG, United Kingdom; (gg) The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850; (tt) Ludwig Institute for Cancer Research, Breast Cancer Laboratory, Department of Surgery, Royal Free and University College Medical School, 67-73 Riding House Street, London W1W 7EJ, United Kingdom; and (uu) Ludwig Institute for Cancer Research, Horatio House, 77-85 Fulham Palace Road, London W6 8JC, United Kingdom.

  • Abbreviations: EST, expressed sequence tag; CGAP, Cancer Genome Anatomy Project; HCGP, Human Cancer Genome Project; SNP, single-nucleotide polymorphism; CR, cancer-related.

  • The Human Cancer Genome Project/Cancer Genome Anatomy Project Annotation Consortium, Jamborestes, Aug. 20-25, 2001, São Paulo, Brazil.
