Sound–meaning association biases evidenced across thousands of languages
- (a) Department of Comparative Linguistics and Psycholinguistics Laboratory, University of Zürich, CH-8006 Zurich, Switzerland;
- (b) Department of Linguistic and Cultural Evolution, Max Planck Institute for the Science of Human History, 07745 Jena, Germany;
- (c) Discrete Biomathematics Group, Max Planck Institute for Mathematics in the Sciences, 04103 Leipzig, Germany;
- (d) University of Leiden, 2311 BV Leiden, The Netherlands;
- (e) Kazan Federal University, Kazan, Russia, 420000;
- (f) Interdisciplinary Center for Bioinformatics, Department of Computer Science, University of Leipzig, 04107 Leipzig, Germany;
- (g) Santa Fe Institute, Santa Fe, NM 87501;
- (h) Department of Psychology, Cornell University, Ithaca, NY 14853;
- (i) Interacting Minds Centre, Aarhus University, 8000 Aarhus C, Denmark
Edited by Anne Cutler, University of Western Sydney, Penrith South, NSW, Australia, and approved July 25, 2016 (received for review April 13, 2016)

Significance
The independence between sound and meaning is believed to be a crucial property of language: across languages, sequences of different sounds are used to express similar concepts (e.g., Russian “ptitsa,” Swahili “ndege,” and Japanese “tori” all mean “bird”). However, a careful statistical examination of words from nearly two-thirds of the world’s languages reveals that unrelated languages very often use (or avoid) the same sounds for specific referents. For instance, words for tongue tend to have l or u, “round” often appears with r, and “small” with i. These striking similarities call for a reexamination of the fundamental assumption of the arbitrariness of the sign.
Abstract
It is widely assumed that one of the fundamental properties of spoken language is the arbitrary relation between sound and meaning. Some exceptions in the form of nonarbitrary associations have been documented in linguistics, cognitive science, and anthropology, but these studies only involved small subsets of the 6,000+ languages spoken in the world today. By analyzing word lists covering nearly two-thirds of the world’s languages, we demonstrate that a considerable proportion of 100 basic vocabulary items carry strong associations with specific kinds of human speech sounds, occurring persistently across continents and linguistic lineages (linguistic families or isolates). Prominently among these relations, we find property words (“small” and i, “full” and p or b) and body part terms (“tongue” and l, “nose” and n). The areal and historical distribution of these associations suggests that they often emerge independently rather than being inherited or borrowed. Our results therefore have important implications for the language sciences, given that nonarbitrary associations have been proposed to play a critical role in the emergence of cross-modal mappings, the acquisition of language, and the evolution of our species’ unique communication system.
Footnotes
- (1) To whom correspondence should be addressed. Email: damianblasi@gmail.com.
Author contributions: D.E.B., S.W., H.H., and M.H.C. designed research; D.E.B. performed research; D.E.B. analyzed data; and D.E.B., S.W., H.H., P.F.S., and M.H.C. wrote the paper.
The authors declare no conflict of interest.
This article is a PNAS Direct Submission.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1605782113/-/DCSupplemental.
Article Classifications
- Social Sciences
- Anthropology
- Biological Sciences
- Psychological and Cognitive Sciences














