Structured sequences emerge from random pool when replicated by templated ligation
- aSystems Biophysics and Center for NanoScience, Ludwigs-Maximilian-Universität München, 80799 Munich, Germany;
- bCenter for Functional Nanomaterials, Brookhaven National Laboratory, Upton, NY 11973;
- cDepartment of Bioengineering, University of Illinois at Urbana–Champaign, Urbana, IL 61801;
- dCarl R. Woese Institute for Genomic Biology, University of Illinois at Urbana–Champaign, Urbana, IL 61801
See allHide authors and affiliations
Edited by Eugene V. Koonin, National Institutes of Health, Bethesda, MD, and approved January 20, 2021 (received for review September 7, 2020)

Significance
The structure of life emerged from randomness. This is attributed to selection by molecular Darwinian evolution. This study found that random templated ligation led to the simultaneous elongation and sequence selection of oligomers. Product strands showed highly structured sequence motifs which inhibited self-folding and built self-templating reaction networks. By the reduction of the sequence space, the kinetics of duplex formation increased and led to a faster replication through the ligation process. These findings imply that elementary binding properties of nucleotides can lead to an early selection of sequences even before the onset of Darwinian evolution. This suggests that such a simplification of sequence space could result in faster downstream selection for sequence-based function for the origin of life.
Abstract
The central question in the origin of life is to understand how structure can emerge from randomness. The Eigen theory of replication states, for sequences that are copied one base at a time, that the replication fidelity has to surpass an error threshold to avoid that replicated specific sequences become random because of the incorporated replication errors [M. Eigen, Naturwissenschaften 58 (10), 465–523 (1971)]. Here, we showed that linking short oligomers from a random sequence pool in a templated ligation reaction reduced the sequence space of product strands. We started from 12-mer oligonucleotides with two bases in all possible combinations and triggered enzymatic ligation under temperature cycles. Surprisingly, we found the robust creation of long, highly structured sequences with low entropy. At the ligation site, complementary and alternating sequence patterns developed. However, between the ligation sites, we found either an A-rich or a T-rich sequence within a single oligonucleotide. Our modeling suggests that avoidance of hairpins was the likely cause for these two complementary sequence pools. What emerged was a network of complementary sequences that acted both as templates and substrates of the reaction. This self-selecting ligation reaction could be restarted by only a few majority sequences. The findings showed that replication by random templated ligation from a random sequence input will lead to a highly structured, long, and nonrandom sequence pool. This is a favorable starting point for a subsequent Darwinian evolution searching for higher catalytic functions in an RNA world scenario.
Footnotes
- ↵1To whom correspondence may be addressed. Email: dieter.braun{at}lmu.de.
Author contributions: P.W.K., A.V.T., A.S., S.M., and D.B. designed research; P.W.K., A.V.T., A.S., S.M., and D.B. performed research; P.W.K., A.V.T., A.S., S.M., and D.B. contributed new reagents/analytic tools; P.W.K., A.V.T., A.S., S.M., and D.B. analyzed data; and P.W.K., A.V.T., S.M., and D.B. wrote the paper.
The authors declare no competing interest.
This article is a PNAS Direct Submission.
This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.2018830118/-/DCSupplemental.
Data Availability.
All used data was cited or is reproducible from the study. NGS data files are available upon request.
- Copyright © 2021 the Author(s). Published by PNAS.
This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND).
Citation Manager Formats
Article Classifications
- Physical Sciences
- Biophysics and Computational Biology
- Biological Sciences
- Evolution