# Size limits of self-assembled colloidal structures made using specific interactions

See allHide authors and affiliations

Edited by David A. Weitz, Harvard University, Cambridge, MA, and approved September 30, 2014 (received for review June 23, 2014)

## Significance

Nature uses hierarchical assembly to make complex structures such as biomolecules, virus shells, and microtubules with high fidelity. Today a key challenge is to translate this process to artificial systems, which hinges on understanding the fundamental questions of efficiency and scalability of self-assembly. Although self-assembly has been studied for decades, the principles behind it and its fundamental and practical limits are still largely unknown. In this paper we establish size limitations for assembling structures of controlled size and shape out of colloidal particles with specific interactions. Inspired by simulations of structures with highly variable shapes and sizes, we develop an understanding of yield through a general theory of excited states that compete with the desired structure in assembly.

## Abstract

We establish size limitations for assembling structures of controlled size and shape out of colloidal particles with short-ranged interactions. Through simulations we show that structures with highly variable shapes made out of dozens of particles can form with high yield, as long as each particle in the structure binds only to the particles in their local environment. To understand this, we identify the excited states that compete with the ground-state structure and demonstrate that these excited states have a completely topological characterization, valid when the interparticle interactions are short-ranged. This allows complete enumeration of the energy landscape and gives bounds on how large a colloidal structure can assemble with high yield. For large structures the yield can be significant, even with hundreds of particles.

Nature uses hierarchical assembly of complicated building blocks to make highly functional structures such as biomolecules, virus shells, and microtubules without any external influence and with high fidelity. Mimicking this would not only give more insight into biological mechanisms but would also help realize the dream of “bottom-up” assembly that has been a central theme of nanotechnology for many decades (1).

As in biology, the information needed for assembling arbitrary macroscopic structures can be stored in the building blocks through the design of their interactions and interaction rules. Over the years great advances have been made by synthesizing new building blocks differing in geometry, composition, and interactions (2⇓⇓⇓⇓⇓⇓⇓–10), allowing for study of more complex objects. However, basic rules necessary for robust and efficient assembly of a desired structure in a scalable fashion and reasonable time scales are still not understood. A number of schemes for approaching this “inverse” statistical mechanics problem have been proposed (11⇓–13), but a general framework and systematic studies are still missing. One of the essential underlying questions, having both practical and conceptual impact, is whether any desired macroscopic structure can be assembled with a high yield, out of a given set of building blocks. Or are there fundamental constraints limiting the structures that can be effectively built?

In this paper we address these general questions using the model system of DNA-coated particles, itself of considerable recent interest. We consider an isolated system of *N* spherical colloidal particles, each of which is isotropically coated with DNA strands to control interparticle interactions. At the colloidal scale, such interactions have a range that is much shorter than the size of the particles. The use of DNA labeling to control binding specificity was originally pioneered for assembling nanoparticles (14⇓⇓–17) into infinite crystals (18⇓⇓⇓⇓⇓–24), where recently it was demonstrated that with two species with differing particle radii and DNA linker length a zoo of different crystal morphologies can be created (25). Work at the colloidal scale has begun to bear fruit (13, 17, 26⇓⇓⇓–30). However, the set of possible structures that could be coded is far more general, including structures of any shape and size, both rigid and flexible. For example, the number of clusters that can be assembled out of spherical particles with fixed size increases dramatically with particle number *N*, so that with only 10 particles there are 223 topologically distinct structures with at least

Designing arbitrary complex structures requires using the specificity of interactions to make the desired target the energetic ground state. The most robust way of doing this is to make every particle in the target structure different, with interparticle interactions chosen to favor the desired local configuration in its target structure. The interactions between different particles are coded into an interaction matrix

We begin by asking how high the equilibrium yield can be when *N* particles, in contrast to a common definition of yield as the percentage of particles from the bulk that assemble into copies of a desired structure. Numerical simulations using dissipative particle dynamics demonstrate that there is a temperature regime where high-yield (>50%) assembly is possible for a range of complex structures consisting of dozens of particles. This is striking, because as the number of particles grows, the number of competing states grows rapidly with *N*; such a high yield implies that these states are less competitive than naively expected. To understand why this is the case, we study the yield of an entire family of structures, the set of rigid clusters with *N*. However, we use the obtained insights to develop a complete description of the low-energy excited states that compete with the ground state, valid for asymptotically large structures. This description explains the high observed yield for large structures and points to the limits of equilibrium self-assembly with colloidal particles with short-ranged interactions. Finally, we comment on the role and importance of kinetic effects.

## Results

### Designing Interactions.

We study the assembly yield of arbitrary structures by choosing the interaction energy so that the desired structure is the ground state. This can be done uniquely for an isolated system of *N* spherical particles with isotropic interactions as follows: Start with the adjacency matrix *i* and *j* are in contact and *ϵ* (unfavorable). Setting all favorable interactions to have the same strength and setting all unfavorable interactions to have the same strength has been shown to optimize the equilibrium yield (34). With this interaction matrix the different interactions between different particles are maximally specific. If some particles have identical sets of neighbors their interactions are indistinguishable, so these particles are effectively of the same type. When the interaction matrix is reduced to show the interactions between the different particle types, it is called an alphabet (34), with the maximally specific interactions defining the maximal alphabet.

When a structure has a unique adjacency matrix, this procedure guarantees that the desired structure has the maximal number of contacts and is therefore the unique ground state. However, if a structure has no mirror symmetries, then its “chiral partner,” obtained as the object’s mirror reflection through an arbitrary mirror plane, cannot be made to coincide with the original object through proper rotations or translations. The chiral partners are therefore distinct assemblies of particles, although each particle shares the same neighbors in both (and therefore the chiral partners have the same

Consequently, both chiral partners are ground states, and in this paper we identify both as being the desired structure. For equilibrium yield this difference is not consequential, but we will see at the end of the paper that the simultaneous assembly of both chiral partners can lead to kinetic effects relevant for the yield.

### Assembly of Large Structures.

To discover whether it is possible to assemble large structures with high yield we use dissipative particle dynamics (DPD) (35, 36) and measure the equilibrium yield as a function of temperature. Our simulation contains *N* colloidal spheres of diameter *D*, with an interaction range of *SI Text*.

The complex structures include (*i*) a bipyramid with 44 particles, (*ii*) a bipyramid of 19 particles, (*iii*) a 19-particle chiral chain structure, and (*iv*) a 69-particle replica of Big Ben, with a crystalline base and a pyramidal top. Fig. 1*A* shows snapshots in the time evolution of the Big Ben assembly, starting with randomly distributed and thermalized particles. Assembly into the desired structure occurs with high yield, and this result prevails in most of the complex structures we have studied. Fig. 1*B* plots yield as a function of temperature *T* for a fixed time *SI Text* for more details).

Our simulations exhibit several regimes as a function of temperature *T*, with a glassy regime at low *T* and an equilibrium regime at high *T* (*SI Text*). At the highest temperatures (at *B* is that the maximum yield is so high, despite the large number of particles in the desired state; this implies that the number of equilibria that are competing with the ground state is relatively small.

### Clusters.

To uncover the landscape of equilibria that compete with the ground state we examine a simpler problem, the assembly of small clusters of particles with at least *T* for a fixed time *SI Text*). Using the positions of particles we form an adjacency matrix and use its eigenvalues to uniquely identify the assembled structure at

The panels of Fig. 2 compare yield curves of given clusters for identical particles with those when interactions are determined by *B*, *F*, and *G*). The yield curves from the clusters simulation exhibit the same phenomenology as those in Fig. 1. By comparing time and ensemble averages we show that the equilibrium regime extends down to *SI Text* and Figs. S1 and S2 for more details.

Fig. 3*A* shows how the maximum equilibrium yield *SI Text*) depends on *N* for maximal alphabets of clusters with *A* also includes all nonmaximal alphabets (*SI Text*) for *N*. For each *N*, the yield is determined by the geometry of the clusters: Fig. 3*B* shows that the yield of clusters increases with decreasing second moment (i.e., with increasing symmetry).

### What Determines the Equilibrium Yield?

Consider *N* particles with a fixed alphabet that determines the ground-state cluster *C*. Vibrationally and rotationally excited states of *C* preserve the cluster’s structure without breaking interparticle bonds. This means that our simulations would identify these states as *C* too. Hence, the partition function that describes the ground state is *σ* is the symmetry number and *C*.

The states that compete for yield with the cluster *C* are the low-energy excited states. In particular, a local minimum (LM) state is a stable configuration of *N* particles and must have at least one particle bond less than *C*. Each LM is characterized by the number of broken bonds compared with *C*, *ϵ*. As an example, Fig. 4 shows the energy landscape with the two lowest-energy local minima that arise for the maximal alphabet of one of the

The partition function of the

With a complete enumeration of the set of local minima, the equilibrium yield of the ground-state cluster is given by *m* bonds. The partition function then becomes*σ* factors to 1 temporarily for simplicity of presentation. The maximum yield is determined by the balance between *m*. The dependence of *m* is a purely geometrical problem, because the landscape of local minima depends only on the geometry of the structure being assembled.

Note that when a designed cluster *C* has two chiralities we identify both as desired ground states, thereby doubling *m*).

#### Clusters.

For the clusters, we determine *SI Text* for the definition) and the number and type of local minima. Fig. 5*A* plots the maximum yield of all alphabets of all *B* also considers the LMs with one additional broken bond; the correlation improves only slightly, implying that the yield of these small clusters is determined by the competition between the ground state and the lowest-lying local minima.

In these plots we also show the prediction from Eq. **1**, with the symmetry number factors reinstated. We used the entropic free-energy loss *SI Text*) as the only free parameter, obtaining a good agreement with data. The value of

#### Large structures.

Superficially, the observations of high yields for large structures (Fig. 1) are even more curious in light of the cluster results, because the extrapolated yield from Fig. 3*A* would correspond to a negligible yield for Fig. 1*B*. This discrepancy suggests that the dominant low-energy local minima change as *N* increases. To understand this, we enumerate the local minima for large structures and determine which local minima are competing with the ground state. Within our prescription for specifying *B* (*SI Text*). Fig. 6 shows the rapid increase of number of LMs with growing number of broken bonds *m* in the large structures. We then compute the predicted yield from Eq. **1** by including only LMs with *M* is increased (we fix *SI Text*). This shows that yield is dominated by low-lying LMs, including some with more than the minimal number of broken bonds. A special case is the chiral chain, which only has

### Local Defects and Asymptotic Yield.

For each of the large-structure examples above the relevant low-lying LMs feature permutations of only nearest-neighbor particles. We note that in case of small clusters all LMs are obtained by permutations of two particles (*SI Text* and Fig. S6). Considering an arbitrary structure, the interactions given by *i* and *j* would break all their bonds with the rest of structure.

With these observations, we define a local defect as a permutation of two particles *i*, and *i* and *j*, including the bond between *i* and *j*.

When both *N* grows the number of such configurations is negligible compared with the number of configurations with well-separated defect locations. In this limit, the energy of a configuration of defects is just the sum of the defects’ individual energies.

We can now use these ideas to understand how the yield depends on the size of a structure. As illustrations we consider two particular examples, a cube structure and a chain structure.

#### Cube.

We consider a cube with face-centered-cubic arrangement of particles and with sides of **2**. There are **1**: For example, LMs with **1** of

As structure size increases the yield decreases. The same argument as above implies that if we consider a much larger cube, for example with

#### Linear chain.

For the example of a linear chain with length

#### Arbitrary structure.

Finally, we consider an arbitrary large structure that has large volume-to-surface, surface-to-edge, and edge-to-corner ratios and abundant contacts so that it is rigid. We call this a “bulky” 3D structure. Then, we can roughly estimate the input data and use the theory to qualitatively distinguish outcomes of considerable yield and negligible yield. Consider a structure of linear dimension *L* (measured in particles), which has point, line, surface, and bulk defects labeled by the corresponding dimensionality

Consider contributions to the yield from LMs consisting of *d*. For simplicity we do not consider LMs that contain different types of defects at the same time. First we demand that the yield stay considerable, that is, the LM contribution to denominator of Eq. **1** stay much smaller than unity, that is, *d*. The high cost of defects is consistent with our assumption of their diluteness in the dominant LMs.

If, however, the defects are energetically cheap compared with system size, that is, the condition in Eq. **3** is violated, the yield is significantly diminished owing to LM contributions. In this case of energetically cheap defects it can also happen that the defect number in relevant LMs becomes large, violating the diluteness assumption. (Notice that this regime can never happen with point defects

It is clear that this analysis can be applied to “nonbulky” structures too, because we focused on each defect type separately. For instance, the above example of a chain is representative of quasi-1D structures and is a special case of the arbitrary structure where surface and bulk defects are absent. Additionally, one can consider structures for which the scaling of defect numbers with structure dimension is not trivial; for example, in a planar fractal-like structure the edge length and the number of line defects would scale as a noninteger power of linear system size. Overall, our analysis gives a rough but general and simple understanding of equilibrium yield limitations based on local defects.

### Chirality and Kinetic Effects.

Our above theoretical estimates give yields in reasonable quantitative agreement with simulations, in which we indeed observe that LM configurations occur in the assembly process. However, simulations also reveal that some suppression of yield is due to chirality.

In accordance to our analysis of chirality, when separate parts of the assembling structure nucleate independently they randomly acquire one of the two chiralities, leading to an inherent kinetic effect: Formed pieces with opposite chirality can never properly join into the structure. Instead, the pieces can weakly connect, for example along one of their edges (Fig. S8). The detrimental contribution to yield from these effects could dominate the contribution from low-lying LMs, and further analysis of such kinetic effects should be valuable.

## Discussion

To summarize, we have demonstrated through numerical simulations that high-yield aggregates of coated colloidal spheres can be created with specific, short-ranged interactions. Strikingly, our simulations indicate that high-yield structures form with dozens of particles. We developed a theoretical framework for understanding this result, based on the fact that the low-energy local minima competing with the designed ground state consist of configurations in which particles in the ground state structure swap places. For example, in bulky (as defined above) structures of hundreds of particles it is the surface defects that are most detrimental to yield. The scalings implied by these calculations indicate that high-yield bulky structures can form from

Our focus on maximally specific interactions not only enables the local defect analysis but also prevents transitioning between different structures without breaking any bonds. Still, in nonrigid structures global floppy modes (which do not change the bond network) could influence the yield, and we leave this question for future study.

We note that there are technological challenges with implementing the high-yield DNA-coated colloid schemes outlined here: Our maximal interaction specificity construction requires a different DNA strand to mediate the interaction for every contact (e.g., because a particle in a bulk crystal has of order six nearest neighbors, that many different types of strands per particle are required). Although the practical limit of how many different types of strands per particle can be used is much higher than what we require (40), the density of strands is not high enough yet to avoid kinetic effects (40). Nonetheless, it is possible to implement the basic schemes outlined here with nonmaximal alphabets, in which the number of different strands on each particle is less than the number of contacts. If carefully chosen, a nonmaximal alphabet uniquely identifies a target structure—although having more low-energy excited states, leading to a smaller yield. We have included such nonmaximal alphabets in our simulations of clusters (Fig. 5), and the yields can still be significantly higher than with nonspecific interactions.

There are other opportunities to further increase yield by removing the assumption of equilibrium interactions, which was the basis of our analysis. For example, recent work (41) has shown that in a system with a fixed number of building blocks kinetic effects can be critical for achieving successful assembly. One example of nonequilibrium design that is natural for colloidal assembly is to allow some of the bonds to be irreversible. Any irreversible bond that does not limit pathways out of local minima will increase the yield of the ground state. The assembly of complex systems in biology suggests other ways of beating the equilibrium threshold, including (*i*) the possibility of using error correction, by allowing energy consuming reactions to bias toward the correctly formed structure, and (*ii*) including allosteric interactions, in which the binding energy of a particle depends on the set of particles that it binds to. Determining how best to implement these schemes with colloid-mediated DNA interactions is an important topic for future research.

## Materials and Methods

A detailed description of our simulations together with one simulation movie (Movie S1) is included in *Supporting Information*.

## Acknowledgments

The first observation of a local minimum in a colloidal system was made by Jesse Collins in an experimental study of octahedral clusters, and we thank him for sharing his results. We also thank Sahand Hormoz and Natalie Arkus for early discussions, Sahand Hormoz for enumerating the nonmaximal alphabets in Fig. 3, and Yu Qin for running some of the computations. This research was funded by the George F. Carrier Fellowship, the National Science Foundation through the Harvard Materials Research Science and Engineering Center (DMR-0820484), the Division of Mathematical Sciences (DMS-0907985), and by Grant RFP-12-04 from the Foundational Questions in Evolutionary Biology Fund. M.P.B. is an Investigator of the Simons Foundation. V.N.M. acknowledges support from an Alfred P. Sloan research fellowship.

## Footnotes

- ↵
^{1}To whom correspondence should be addressed. Email: zorana{at}seas.harvard.edu.

Author contributions: Z.Z., V.N.M., and M.P.B. designed research, performed research, analyzed data, and wrote the paper.

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1411765111/-/DCSupplemental.

## References

- ↵.
- Whitesides GM,
- Grzybowski B

- ↵.
- Whitesides GM,
- Mathias JP,
- Seto CT

- ↵
- ↵
- ↵
- ↵
- ↵
- ↵.
- Ke Y,
- Ong LL,
- Shih WM,
- Yin P

- ↵.
- Blaaderen Av

- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵.
- Valignat MP,
- Theodoly O,
- Crocker JC,
- Russel WB,
- Chaikin PM

- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵.
- Macfarlane RJ, et al.

- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵
- ↵.
- Hormoz S,
- Brenner MP

- ↵
- ↵
- ↵.
- Rogers WB,
- Crocker JC

- ↵.
- Meng G,
- Arkus N,
- Brenner MP,
- Manoharan VN

- ↵.
- Holmes-Cerfon M,
- Gortler SJ,
- Brenner MP

- ↵.
- Wu K-T, et al.

- ↵

## Citation Manager Formats

## Article Classifications

- Physical Sciences
- Physics