Skip to main content

Main menu

  • Home
  • Articles
    • Current
    • Special Feature Articles - Most Recent
    • Special Features
    • Colloquia
    • Collected Articles
    • PNAS Classics
    • List of Issues
  • Front Matter
    • Front Matter Portal
    • Journal Club
  • News
    • For the Press
    • This Week In PNAS
    • PNAS in the News
  • Podcasts
  • Authors
    • Information for Authors
    • Editorial and Journal Policies
    • Submission Procedures
    • Fees and Licenses
  • Submit
  • Submit
  • About
    • Editorial Board
    • PNAS Staff
    • FAQ
    • Accessibility Statement
    • Rights and Permissions
    • Site Map
  • Contact
  • Journal Club
  • Subscribe
    • Subscription Rates
    • Subscriptions FAQ
    • Open Access
    • Recommend PNAS to Your Librarian

User menu

  • Log in
  • My Cart

Search

  • Advanced search
Home
Home
  • Log in
  • My Cart

Advanced Search

  • Home
  • Articles
    • Current
    • Special Feature Articles - Most Recent
    • Special Features
    • Colloquia
    • Collected Articles
    • PNAS Classics
    • List of Issues
  • Front Matter
    • Front Matter Portal
    • Journal Club
  • News
    • For the Press
    • This Week In PNAS
    • PNAS in the News
  • Podcasts
  • Authors
    • Information for Authors
    • Editorial and Journal Policies
    • Submission Procedures
    • Fees and Licenses
  • Submit
Research Article

DeepTracer for fast de novo cryo-EM protein structure modeling and special studies on CoV-related complexes

Jonas Pfab, View ORCID ProfileNhut Minh Phan, and View ORCID ProfileDong Si
  1. aDivision of Computing and Software Systems, University of Washington Bothell, Bothell, WA 98011

See allHide authors and affiliations

PNAS January 12, 2021 118 (2) e2017525118; https://doi.org/10.1073/pnas.2017525118
Jonas Pfab
aDivision of Computing and Software Systems, University of Washington Bothell, Bothell, WA 98011
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Nhut Minh Phan
aDivision of Computing and Software Systems, University of Washington Bothell, Bothell, WA 98011
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Nhut Minh Phan
Dong Si
aDivision of Computing and Software Systems, University of Washington Bothell, Bothell, WA 98011
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Dong Si
  • For correspondence: dongsi@uw.edu
  1. Edited by Eva Nogales, University of California, Berkeley, CA, and approved December 1, 2020 (received for review August 18, 2020)

  • Article
  • Figures & SI
  • Info & Metrics
  • PDF
Loading

Article Figures & SI

Figures

  • Tables
  • Fig. 1.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Fig. 1.

    DeepTracer model determination pipeline. All-atom structure of multichain protein complexes is determined fully automatically solely from a cryo-EM map and amino acid sequence using the steps shown in the center of the figure. The structure shown on the right side is an actual model built by DeepTracer.

  • Fig. 2.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Fig. 2.

    Architecture of tailored convolutional neural network. Top shows overview of DeepTracer’s neural network architecture consisting of four parallel U-Nets. The gray boxes show the input and output maps, with their dimensions noted to the left and the number of channels marked below. Bottom dashed box shows the detailed architecture of each parallel U-Net. The blue boxes show the output maps of the different layers, where the dimensions of the maps are depicted on the left and the number of channels is depicted on top.

  • Fig. 3.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Fig. 3.

    Example masks from the training dataset based on the PDB ID code 6NQ1 deposited model structure. (A) Deposited model structure. (B) Backbone (Cα, C, and N atoms) in purple and side chains in green. (C) Atoms mask with labels for Cα, C, and N atoms. (D) Secondary structure mask with helices in turquoise, loops in pink, and sheets in orange. (E) Amino acid type mask with 20 different colors.

  • Fig. 4.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Fig. 4.

    Backbone confidence map of the EMD-0478 map with identified chains annotated in different colors.

  • Fig. 5.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Fig. 5.

    Traced backbone atoms. Predicted Cα atoms for the EMD-4054 map in blue before (Left) and after (Right) the backbone tracing step compared to the deposited model structure in pink.

  • Fig. 6.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Fig. 6.

    Protein sequence alignment algorithm. Interval of the predicted sequence is aligned with the target sequence using a custom dynamic algorithm. The amino acid confusion matrix depicts the relative frequency of pairs of predicted and true amino acid type and was calculated based on a set of test cryo-EM maps. The numbers shown in the score matrix are solely for illustrative purposes and do not reflect real data.

  • Fig. 7.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Fig. 7.

    Carbon, nitrogen, and oxygen determination. (A) Initial positioning of carbon (yellow) and nitrogen (blue) atoms in between the Cα atoms (gray) on Left and their initial refined positioning, which fits the U-Net prediction of carbon atoms (green volume) and nitrogen atoms (blue volume), on Right. (B) The positions of carbon and nitrogen atoms are refined further by forcing bond angles into their well-known values. The blue lines represent the bonds from the initial refinement. The red lines represent the bonds from the final refinement. (C) Position of oxygen atom in the carbonyl group by definition.

  • Fig. 8.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Fig. 8.

    Evaluation results for set of 476 experimental cryo-EM maps. Evaluation of determined models from DeepTracer (blue) and Phenix (red) for 476 cryo-EM maps. The dotted lines represent the trend for each method. DeepTracer outperformed Phenix in all four metrics. Precise data can be found in SI Appendix, Table S3.

  • Fig. 9.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Fig. 9.

    Results of EMD-6757 map. Models built by DeepTracer (blue) and Phenix (red) next to PDB ID code 5XS7 deposited model structure (yellow) for EMD-6757 map.

  • Fig. 10.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Fig. 10.

    Results of EMD-6272 map. Models built by DeepTracer (blue) and Phenix (red) compared to PDB ID code 3J9S deposited model structure (yellow) for EMD-6272 map. Top shows structures in ribbon view, and Bottom shows structures in all-atom view. Areas where DeepTracer correctly predicted amino acids that Phenix missed are highlighted by the four red circles.

  • Fig. 11.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Fig. 11.

    Results for coronavirus-related cryo-EM maps. Evaluation of models built by DeepTracer (blue) and Phenix (red) for 52 coronavirus-related high-resolution cryo-EM maps. The dotted lines represent the trend for each method. Computation times are shown on a logarithmic scale. Precise data can be found in SI Appendix, Table S2.

  • Fig. 12.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Fig. 12.

    Models built from SARS-CoV-2 cryo-EM maps, which do not have deposited model structures in the EMDR. DeepTracer model for the EMD-30044 map (Top) showing a human receptor ACE2 to which spike proteins of the SARS-CoV-2 virus bind and (Bottom) the EMD-21374 depicting a SARS-CoV-2 spike glycoprotein. No model structure has been deposited to the EMDataResource for the cryo-EM maps as of the date this paper is announced.

Tables

  • Figures
    • View popup
    Table 1.

    Comparison of DeepTracer (DT) and Phenix (P) for SARS-CoV-2 dataset

    Percent matchingrmsdPercent sequence IDGDC
    lr0.5em)4-5lr0.5em)6-7lr0.5em)8-9l)10-11 EMDBPDBResiduesDTPDTPDTPDTP
    213756vsb290584.9048.601.141.4045.9020.9017.885.39
    214526vxx291691.4053.800.961.1861.3040.00——
    300396m17307280.3053.101.721.7269.8054.6011.848.31
    301276m71107791.7054.201.021.2058.6016.6020.748.89
    301787btf122794.9081.000.831.0985.8051.8065.5723.06
    302097bv1110287.6067.000.841.2987.5030.5055.6218.15
    302107bv2100692.4078.200.781.0888.9053.7040.9013.32
    Average89.0362.271.041.2871.1138.3035.4212.85
    • GDC score could not be calculated for the EMD-21452 map, as the LGA web service could not process the modeled structures due to their size. EMDB, Electron Microscopy Data Bank.

Data supplements

  • Supporting Information

    • Download Appendix (PDF)
PreviousNext
Back to top
Article Alerts
Email Article

Thank you for your interest in spreading the word on PNAS.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
DeepTracer for fast de novo cryo-EM protein structure modeling and special studies on CoV-related complexes
(Your Name) has sent you a message from PNAS
(Your Name) thought you would like to see the PNAS web site.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Citation Tools
DeepTracer for fast de novo cryo-EM protein structure modeling and special studies on CoV-related complexes
Jonas Pfab, Nhut Minh Phan, Dong Si
Proceedings of the National Academy of Sciences Jan 2021, 118 (2) e2017525118; DOI: 10.1073/pnas.2017525118

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Request Permissions
Share
DeepTracer for fast de novo cryo-EM protein structure modeling and special studies on CoV-related complexes
Jonas Pfab, Nhut Minh Phan, Dong Si
Proceedings of the National Academy of Sciences Jan 2021, 118 (2) e2017525118; DOI: 10.1073/pnas.2017525118
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Mendeley logo Mendeley

Article Classifications

  • Biological Sciences
  • Biophysics and Computational Biology
  • Physical Sciences
  • Biophysics and Computational Biology
Proceedings of the National Academy of Sciences: 118 (2)
Table of Contents

Submit

Sign up for Article Alerts

Jump to section

  • Article
    • Abstract
    • Methods
    • Results
    • Discussion
    • Data Availability.
    • Acknowledgments
    • Footnotes
    • References
  • Figures & SI
  • Info & Metrics
  • PDF

You May Also be Interested in

Reflection of clouds in the still waters of Mono Lake in California.
Inner Workings: Making headway with the mysteries of life’s origins
Recent experiments and simulations are starting to answer some fundamental questions about how life came to be.
Image credit: Shutterstock/Radoslaw Lecyk.
Depiction of the sun's heliosphere with Voyager spacecraft at its edge.
News Feature: Voyager still breaking barriers decades after launch
Launched in 1977, Voyagers 1 and 2 are still helping to resolve past controversies even as they help spark a new one: the true shape of the heliosphere.
Image credit: NASA/JPL-Caltech.
Drop of water creates splash in a puddle.
Journal Club: Heavy water tastes sweeter
Heavy hydrogen makes heavy water more dense and raises its boiling point. It also appears to affect another characteristic long rumored: taste.
Image credit: Shutterstock/sl_photo.
Mouse fibroblast cells. Electron bifurcation reactions keep mammalian cells alive.
Exploring electron bifurcation
Jonathon Yuly, David Beratan, and Peng Zhang investigate how electron bifurcation reactions work.
Listen
Past PodcastsSubscribe
Panda bear hanging in a tree
How horse manure helps giant pandas tolerate cold
A study finds that giant pandas roll in horse manure to increase their cold tolerance.
Image credit: Fuwen Wei.

Similar Articles

Site Logo
Powered by HighWire
  • Submit Manuscript
  • Twitter
  • Facebook
  • RSS Feeds
  • Email Alerts

Articles

  • Current Issue
  • Special Feature Articles – Most Recent
  • List of Issues

PNAS Portals

  • Anthropology
  • Chemistry
  • Classics
  • Front Matter
  • Physics
  • Sustainability Science
  • Teaching Resources

Information

  • Authors
  • Editorial Board
  • Reviewers
  • Subscribers
  • Librarians
  • Press
  • Cozzarelli Prize
  • Site Map
  • PNAS Updates
  • FAQs
  • Accessibility Statement
  • Rights & Permissions
  • About
  • Contact

Feedback    Privacy/Legal

Copyright © 2021 National Academy of Sciences. Online ISSN 1091-6490