Skip to main content

Main menu

  • Home
  • Articles
    • Current
    • Special Feature Articles - Most Recent
    • Special Features
    • Colloquia
    • Collected Articles
    • PNAS Classics
    • List of Issues
  • Front Matter
    • Front Matter Portal
    • Journal Club
  • News
    • For the Press
    • This Week In PNAS
    • PNAS in the News
  • Podcasts
  • Authors
    • Information for Authors
    • Editorial and Journal Policies
    • Submission Procedures
    • Fees and Licenses
  • Submit
  • Submit
  • About
    • Editorial Board
    • PNAS Staff
    • FAQ
    • Accessibility Statement
    • Rights and Permissions
    • Site Map
  • Contact
  • Journal Club
  • Subscribe
    • Subscription Rates
    • Subscriptions FAQ
    • Open Access
    • Recommend PNAS to Your Librarian

User menu

  • Log in
  • My Cart

Search

  • Advanced search
Home
Home
  • Log in
  • My Cart

Advanced Search

  • Home
  • Articles
    • Current
    • Special Feature Articles - Most Recent
    • Special Features
    • Colloquia
    • Collected Articles
    • PNAS Classics
    • List of Issues
  • Front Matter
    • Front Matter Portal
    • Journal Club
  • News
    • For the Press
    • This Week In PNAS
    • PNAS in the News
  • Podcasts
  • Authors
    • Information for Authors
    • Editorial and Journal Policies
    • Submission Procedures
    • Fees and Licenses
  • Submit
Colloquium Paper

From likely to likable: The role of statistical typicality in human social assessment of faces

Chaitanya K. Ryali, Stanny Goffin, View ORCID ProfilePiotr Winkielman, and Angela J. Yu
  1. aDepartment of Computer Science and Engineering, University of California San Diego, La Jolla, CA 92093;
  2. bDepartment of Psychology, University of California San Diego, La Jolla, CA 92093;
  3. cDepartment of Cognitive Neuroscience, Maastricht University, 6229 ER Maastricht, The Netherlands;
  4. dFaculty of Psychology, SWPS University of Social Sciences and Humanities, 03-815 Warsaw, Poland;
  5. eDepartment of Cognitive Science, University of California San Diego, La Jolla, CA 92093;
  6. fHalicioğlu Data Science Institute, University of California San Diego, La Jolla, CA 92093

See allHide authors and affiliations

PNAS November 24, 2020 117 (47) 29371-29380; first published November 23, 2020; https://doi.org/10.1073/pnas.1912343117
Chaitanya K. Ryali
aDepartment of Computer Science and Engineering, University of California San Diego, La Jolla, CA 92093;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Stanny Goffin
bDepartment of Psychology, University of California San Diego, La Jolla, CA 92093;
cDepartment of Cognitive Neuroscience, Maastricht University, 6229 ER Maastricht, The Netherlands;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Piotr Winkielman
bDepartment of Psychology, University of California San Diego, La Jolla, CA 92093;
dFaculty of Psychology, SWPS University of Social Sciences and Humanities, 03-815 Warsaw, Poland;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Piotr Winkielman
Angela J. Yu
eDepartment of Cognitive Science, University of California San Diego, La Jolla, CA 92093;
eDepartment of Cognitive Science, University of California San Diego, La Jolla, CA 92093;
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: ajyu@ucsd.edu
  1. Edited by Wilson S. Geisler, The University of Texas at Austin, Austin, Texas, and approved July 29, 2020 (received for review January 10, 2020)

  • Article
  • Figures & SI
  • Info & Metrics
  • PDF
Loading

Abstract

Humans readily form social impressions, such as attractiveness and trustworthiness, from a stranger’s facial features. Understanding the provenance of these impressions has clear scientific importance and societal implications. Motivated by the efficient coding hypothesis of brain representation, as well as Claude Shannon’s theoretical result that maximally efficient representational systems assign shorter codes to statistically more typical data (quantified as log likelihood), we suggest that social “liking” of faces increases with statistical typicality. Combining human behavioral data and computational modeling, we show that perceived attractiveness, trustworthiness, dominance, and valence of a face image linearly increase with its statistical typicality (log likelihood). We also show that statistical typicality can at least partially explain the role of symmetry in attractiveness perception. Additionally, by assuming that the brain focuses on a task-relevant subset of facial features and assessing log likelihood of a face using those features, our model can explain the “ugliness-in-averageness” effect found in social psychology, whereby otherwise attractive, intercategory faces diminish in attractiveness during a categorization task.

  • face evaluation
  • attractiveness
  • statistical typicality
  • attention
  • efficient coding

Humans readily form social impressions, such as attractiveness and trustworthiness, from a brief glance of a stranger’s face (1⇓⇓–4). While the accuracy of these social judgments is actively debated (4), such social impressions clearly exert a powerful influence on daily human social life (4), whether choosing a life partner, assessing eye witness testimony, interviewing a job candidate, or choosing whom to befriend. One of the most robust and curious findings in the study of social judgment of faces is the so-called “beauty-in-averageness” (BiA) effect, whereby blends of two or more face images are generally perceived to be more attractive than the “parent” face images (5, 6). A number of qualitative hypotheses have been put forth to explain the provenance of BiA, such as a human preference for symmetry (7, 8) or lack of blemishes (9). However, symmetry and blemishes appear at most to provide an incomplete explanation, as controlling for these factors does not eliminate BiA (10). Alternatively, it has been suggested that humans have a preference for highly prototypical stimuli over more unusual stimuli (11), possibly as a cue to mate value or reproductive health (12, 13). However, humans exhibit BiA effects not only for strangers’ faces but also for a variety of natural and artificial object categories, such as dogs, birds, butterflies, fish, automobiles, watches, and even synthetic dot patterns (11, 14, 15). Beyond attractiveness, perceived trustworthiness also appears greater for more typical-looking faces (16). Other recent studies have shown that attractiveness and trustworthiness perception are culturally dependent (17) and can be rapidly modified via exposure to specific types of faces (18). Altogether, these results suggest that there may be a general human social preference for more typical-looking objects, and this effect depends on rather general cognitive mechanisms beyond those specific to beauty or mating. Here, we propose a statistically grounded theory of human “liking” of more typical-looking faces (and other objects) (16, 19, 20). We suggest that the perceived appeal of a face is, at least in part, monotonically driven by a measure of “typicality” of the face with respect to the face distribution, specifically in terms of the log likelihood (LL) of the face under the distribution. We refer to our measure as statistical typicality to contrast it from previous nonstatistical proposals of the link between typicality and attractiveness (16, 19, 20). Intuitively, as long as the face distribution is unimodal (Fig. 1A), the blend (average) of two parent faces tends to be closer to the mean, and thus have higher LL than the parent faces, resulting in BiA (5, 6, 16). Formally, we model faces as points in a vector space (“face space”) (21), defined by their latent feature values from a widely used computer vision model (22⇓–24), and estimate the face distribution using a demographically balanced, publicly available face dataset, Chicago Face Database (CFD) (25). As we will show, attractiveness ratings from the CFD correlate with LL of these faces, and simulations based on CFD faces exhibit expected BiA effects. Moreover, we report an experimental finding that model-estimated LL of face stimuli linearly predicts not only perceived attractiveness, but also trustworthiness, dominance, and valence—traits thought by social psychologists to be particularly fundamental in face-based social judgments (4). We also demonstrate that statistical typicality may even partially explain the role of symmetry in attractiveness perception (7, 8), as symmetry correlates with LL among CFD faces, and symmetrization manipulations increase LL.

Fig. 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 1.

BiA vs. UiA. (A) In a unimodal distribution, the blend (x¯) of two face stimuli tends to have higher statistical typicality than the parent faces (x1, x2). In a bimodal distribution, (B) blending two parents from two different modes tends to have lower statistical typicality than (C) when the parents are from the same mode.

If human liking of a face indeed relates to statistical typicality, then where the distribution is bimodal, we would expect a cross-modal blend to be less likable than the parent stimuli (Fig. 1B). Indeed, empirical studies show that cross-category blends [e.g., biracial (26), bigender (27) faces, and cross-category synthetic stimuli (28)] elicit lower attractiveness ratings than same-category blends, but only in the relevant categorization context (outside the relevant categorization context, these stimuli still elicit BiA). To explain ugliness in averageness (UiA), we make one additional assumption: attention restricts processing to a task-relevant subspace of facial features (e.g., gender-discriminating features in the gender categorization task) (29⇓⇓–32). As in BiA, attractiveness perception is linked to LL but now defined over this task-relevant subspace instead of the full feature space. Whereas same-category blends exhibit the standard BiA (Fig. 1C), cross-category blends tend to fall in between modes, resulting in low LL and attractiveness. We will show how UiA arises naturally from the decrease in LL of cross-category faces when the representation is restricted to the task-relevant subspace, and that model-estimated LL of individual face images within that subspace is predictive of attractiveness rating in the categorization context (27).

One may well question why the brain should have an affect signal related to statistical typicality defined in terms of LL. For theoretical motivation, we appeal to Claude Shannon’s classical result (33) that a maximally efficient coding system (minimizing the code needed to represent data) should assign to each item a code length that is exactly its negative LL (−LL; i.e., shorter codes for more frequently encountered data). For example, Morse code is a fairly efficient code, consisting of short dots and long dashes, that assigns a dot to the most frequently used English letter “E” and much longer codes to rare letters such a “Q” (dash–dash–dot–dash). Relatedly, the “efficient coding hypothesis” posits that information representation and processing in the brain are highly efficient (34⇓–36), such that less neuronal response is allocated to encode more probable stimuli (34, 37⇓⇓⇓–41). In support, empirical evidence indicates that face-responsive areas in the human brain indeed decrease their response to more typical faces (42, 43). To attain and maintain coding efficiency in the brain, inefficiently coded stimuli should be aversive, so as to encourage representational updating that minimizes long-term coding cost averaged across observed data (44, 45). Interestingly, reducing the average negative LL of observed data under the assumed model, known as cross-entropy minimization, is also a popular and effective tool for representation learning in modern machine learning and artificial intelligence (46). Separately, empirical evidence indicates that face stimuli whose representation in the brain requires greater neural activity are indeed aversive (47⇓–49).

In summary, diverse empirical and theoretical results support efficient face coding in the brain and theoretically motivate a positive influence of LL-based statistical typicality on social liking of faces. However, this paper primarily focuses on the relationship between LL and social liking, and thus, its findings stand independently of the explanatory role played by efficient coding. In the following, we demonstrate how our model assumptions and predictions are validated by a combination of face data, statistical modeling, human behavioral data, and model simulations. We begin with an analysis of BiA effects and then proceed to UiA.

Results

We hypothesize that BiA arises because the blend of two faces tends to be closer to the mean of the distribution and have higher LL than the parent faces (Fig. 1A). To examine the statistical distribution of faces (Materials and Methods and SI Appendix), we utilize a demographically balanced, publicly available dataset (CFD) of 597 face images (25) and retrieve their latent feature representation from a commonly used computer vision model, the active appearance model (AAM) (22⇓–24). The AAM has previously been used to model human face representation (20, 50), and AAM feature dimensions (axes) appear to be encoded linearly by face-selective neurons in the primate brain (51).

Simulation: BiA.

To check whether statistical typicality (LL) of CFD faces and their blends can reproduce experimentally observed BiA effects, we need a parametric density model of faces. Due to the relatively small number of CFD faces (597) compared with the large number of AAM features (90), we fit a multivariate normal distribution to the CFD data—although we find that LL is highly correlated whether we fit a single Gaussian or a mixture of Gaussians corresponding to demographic subgroups (e.g., genders) (SI Appendix)—and the two models correlate with CFD attractiveness ratings similarly well (SI Appendix). Taking 500 random one-dimensional (1D) projections of the CFD data in the AAM feature space (Fig. 2A shows an example projection), we find that normality cannot be rejected (Anderson Darling test, significance level α=0.05) in all but 38 dimensions (7.6%), not much higher than the 5% we would expect if every dimension was truly normal (in contrast, if we simulate random projections of data sampled from a distribution that was bimodal in every dimension, then 60% of them reject normality) (SI Appendix). In fact, we do expect a small number of dimensions to be multimodal due to natural demographic clustering of data (e.g., due to gender or race), an important point we will return to in the UiA analysis. In the following, we use the CFD-estimated multivariate Gaussian density for all of the BiA-related analyses.

Fig. 2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 2.

BiA and statistical typicality. (A) Empirical face distribution projected into a random axis is unimodal and bell shaped. (B) BiA: humans perceive more “evenly” blended faces as more attractive (data from ref. 26). (C) Simulated statistical typicality captures a similar trend as seen in data. Five hundred pairs of faces are randomly selected from CFD, each pair’s coordinates are averaged in varying proportions, and LLs for blends of the same proportion are averaged across pairs. Error bars indicate SEM.

Our simulations (SI Appendix) indicate that the blend of two randomly sampled CFD faces has higher LL (thus, presumably higher attractiveness) than parent face images (two-sample t test, P<0.0001), consistent with experimental findings (6). Moreover, as the number of parent faces increases in a blend, the blended face increases in LL (and thus, attractiveness) monotonically (ANOVA test, P<0.0001; t test comparing 2-face blend with 32-face blend, P<0.0001), also consistent with experimental findings (6). Relatedly, when two faces are blended in different proportions, from 100% of one face to 100% of the other face, LL has a characteristic inverted U shape, similar to experimental findings (26) (Fig. 2 B and C).

Experiment: Social Liking Varies Linearly with Statistical Typicality.

If statistical typicality modulates the affective experience of perceptual stimuli, then it should not only be restricted to the perception of attractiveness but also influence other desirable attributes such as trustworthiness (16). Here, we report results from an experiment (Materials and Methods) in which subjects rate face images (SI Appendix, Fig. S2 shows example stimuli) for attractiveness, trustworthiness, dominance, and valence (how “positive” a face appears) (Materials and Methods)—traits thought by social psychologists to be the most fundamental in face-based social trait perception (4). We find that face image ratings, averaged across subjects, of all four traits increase monotonically with LL of the face stimuli under the estimated normal density model (attractiveness: Pearson r=0.386,P<0.0001; trustworthiness: r=0.268,P<0.0001; dominance: r=0.196,P<0.0001; valence: r=0.171,P<0.0001). The effect is strongest for attractiveness followed by trustworthiness, dominance, and finally, valence. Fig. 3 shows that this relationship is highly linear for all four traits (binned data: Pearson r=0.96, 0.96, 0.98, and 0.98, respectively). At the individual level, we find that statistical typicality correlates with ratings for a significant fraction of participants for each trait (binomial test, P<0.0001 for each trait). Analogous to group-level analysis, more individuals show significant correlation (at α=0.05 significance level) for attractiveness (75% of participants) and trustworthiness (55%) than for dominance (40%) and valence (32.5%) (SI Appendix, Fig. S4 shows histograms of individually significant correlation coefficients).

Fig. 3.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 3.

Trait rating increases linearly and monotonically against statistical typicality (LL) for all four traits: attractiveness (A), trustworthiness (B), dominance (C), and valence (D). Data binning ensures an equal number of samples in each bin. Linear regression line (using binned data) is superimposed for visualization. The text has correlation coefficients for raw data. Error bars indicate SEM over samples in each bin.

One corollary of the statistical typicality account of attractiveness is that, if the distribution of faces is Gaussian over the underlying face feature space, then LL is a particular parameter-free quadratic function of the underlying face features (SI Appendix). This is consonant with previous work showing that a freely fitted quadratic model of human perception of attractiveness improves over a pure linear model (52, 53), except we make a more refined claim that the quadratic component is precisely LL and has no free parameters.

We hardly expect statistical typicality to be the sole contributor to attractiveness. Previous work suggested that human perception of attractiveness, as well as other traits such as trustworthiness and dominance, has both linear and squared dependence on the face feature space (52, 53). Here, we adapt this idea and jointly fit a multiple linear regression (MLR) model (“linear + LL” model) consisting of all of the linear terms (x1,…,xd), along with an LL term premultiplied by a coefficient parameterizing the importance of LL relative to the linear ones (Fig. 4A). For comparison, we also fit an MLR with only linear predictors (“linear model”) and another with all of the linear and squared terms (“quadratic” model) (52, 54). Because the three models have vastly different numbers of free parameters (linear: 90 features, one offset; linear + LL: 90 features, one LL term, one offset; quadratic: 90 features, 90 squared features, one offset), it is inappropriate to compare them directly on the training data. Instead, we do 10-fold cross-validation: we train the model on 90% of the dataset and then compute the correlation coefficient between the predicted trait rating of held-out (10%) test data and the human average ratings on those faces. Across the 10 folds, Fig. 4B shows that linear + LL significantly outperforms the quadratic model for each of attractiveness, trustworthiness, dominance, and valence (paired one-sided t test: P=0.0412,P=0.0003,P=0.089,P = 0.0003, respectively), as well as the linear model (paired one-sided t test: P<0.0001,P<0.0001,P<0.0001,P=0.0003, respectively). Alternatively, we find that linear + LL has the best (lowest) Bayesian information criterion scores compared with the linear and the quadratic models (SI Appendix, Fig. S3). These results indicate that by using a particular parameter-free quadratic form, provided directly by the probability density of the data, we can account for social perception across a number of traits better than a more complex model that freely fits all possible squared terms. For brevity, we refer to the vector of regression coefficients for the linear terms in linear + LL as the linear trait axis (LTA) for that trait in the remainder of the paper.

Fig. 4.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 4.

Model illustration and comparison. (A) We use a two-dimensional simplified illustration to visualize the linear + LL model (in actuality, there are 90 dimensions). Oval: equi-LL contour of an axis-aligned normal density function, with σ1 and σ2 being the SD along the two feature dimensions x1 and x2 (although estimated normal density may not be axis aligned). Linear + LL model of attractiveness is a linear combination of the quadratic component related to negative LL and a linear component indicated by the “attractiveness LTA” (blue); similarly, linear + LL models trustworthiness as a linear combination of the LL quadratic component and a linear “trustworthiness LTA” component (red). −LL of a face situated at (x1, x2) is proportional to the sum of the square of its distance along each coordinate divided by the SD of that coordinate—in particular, x2 is given a greater weight than x1 in this example because the data are more tightly distributed for x2 than for x1. The component of attractiveness LTA orthogonal to trustworthiness LTA (blue dashed line) is an axis along which attractiveness ratings should look linear, while trustworthiness ratings should look quadratic, as was found in ref. 16. However, the component of trustworthiness LTA orthogonal to attractiveness LTA (red dashed line) is an axis along which trustworthiness ratings should look linear, while attractiveness ratings should look quadratic. (B) Correlation coefficient between predicted and actual ratings on held-out data, averaged over 10 folds. Linear + LL consistently outperforms the quadratic and linear models. Error bars are SEM over folds. n.s., P > 0.05; *P < 0.05; ***P < 0.001.

The astute reader might have noticed that the quadratic component (LL) is shared by all social traits, while the linear component (LTA) is fit to each trait individually. Indeed, using 10-fold cross-validation, we find that the LTA for each trait makes better prediction about the same trait on held-out test data than the LTA for any other trait (paired t test, P<0.0001) (Table 1) except for the trait “trustworthiness,” which is slightly better predicted by the LTA for valence (r=0.307) than for trustworthiness (r=0.270), but this is not statistically significant (P=0.384). In particular, it is apparent that dominance actually has quite different linear explanatory factors than the other traits, but nevertheless, it shares the same quadratic, typicality-driven factor as the rest. These results also shed light on a recent finding that typicality affects trustworthiness perception and the suggestion that BiA may be an indirect phenomenon due to the high correlation between trustworthiness and attractiveness (16). What we find is that indeed trustworthiness and attractiveness are highly related: not only do they have similar linear components (their LTAs predict each other well, although not as well as each trait’s own LTA) (Table 1), but they also share a common typicality-based quadratic component (Fig. 3). However, trustworthiness and attractiveness do not seem to “inherit” their dependence on typicality from each other but are instead each driven directly by statistical typicality. For instance, statistical typicality (LL) correlates significantly more strongly with attractiveness (r=0.39) than trustworthiness (r=0.27). When we regress out trustworthiness ratings of faces, we find that residual attractiveness is still strongly correlated with statistical typicality (r=0.33,P<0.0001); likewise, residual trustworthiness, after having regressed out attractiveness, still strongly correlates with statistical typicality (r=0.18,P<0.0001). Given that the two traits have both a (shared) quadratic component and a (unique) linear component component (Fig. 4), not only should one be able to find an axis along which trustworthiness looks quadratic while attractiveness looks linear (as found in ref. 16), but it should also be possible to find an axis along which attractiveness looks quadratic while trustworthiness looks linear (Fig. 4A).

View this table:
  • View inline
  • View popup
Table 1.

Trait-specific LTAs

Simulation: Symmetry and Statistical Typicality.

Revisiting the role of symmetry in attractiveness perception (7, 8), we suggest the possibility that symmetry elevates attractiveness perception at least in part through statistical typicality. Fig. 5A shows that the statistical typicality (LL) of CFD face images is negatively correlated (Pearson r=−0.373, P<0.0001) with shape asymmetry (Euclidean distance between landmarks and their mirror image). As a whole, a large majority of CFD faces (98.3%, binomial test, P<0.0001) increase LL after shape symmetrization (Fig. 5B and SI Appendix), and more atypical faces benefit more in LL from shape symmetrization [regression coefficient of LL(symmetrized face) − LL(original face) against −LL(original face) is significantly greater than zero, P<0.0001; regression coefficient b=1.21, 95% CI [1.12,1.30]]. As a whole, shape symmetrization increases LL of CFD faces by 1.01 in units of SD of the original empirical LL distribution of CFD faces (paired t test, P<0.0001) (Fig. 5C). If texture features are left–right symmetrized in addition to shape features (not typically done in experiments), our model predicts an even bigger effect of symmetrization on LL (SI Appendix, Fig. S8). Moreover, similar to human data (10), our simulations show that blends of (shape) symmetrized faces still exhibit BiA (larger LL than their “parent faces”), although the difference is slightly smaller than for blends of original (nonsymmetrized) faces (SI Appendix, Fig. S5), indicating that symmetry has a significant but partial contribution to both experimentally observed attractiveness and model-predicted statistical typicality.

Fig. 5.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 5.

Symmetry and statistical typicality. (A) LL negatively correlates with shape asymmetry. (B and C) Shape symmetrization increases LL of most face images. (A) Data binning ensures an equal number of samples in each bin. Linear regression line (using binned data) is superimposed for visualization. The text has the correlation coefficient for raw data. Error bars indicate SEM over faces in each bin.

UiA.

We hypothesize that UiA arises because attentional mechanisms focus on the task-relevant feature subspace, such that statistical typicality and subjective liking are both assessed within this subspace instead of the full face space. If the data distribution projected into this subspace is bimodal, then statistical typicality and subjective liking of an average between two samples from two different modes should be lower than if the parent faces were from the same mode (Fig. 1B). When we project CFD faces into the 1D subspace that best discriminates race [Caucasian vs. Asian (26, 27), found by regularized linear discriminant analysis] (SI Appendix), we find the face distribution to be indeed bimodal (Fig. 6C). Likewise, projecting CFD faces into the gender-informative 1D subspace also exhibits clear bimodality (Fig. 7B and SI Appendix). It may seem odd that the face distribution is both approximately Gaussian and a mixture distribution. The reason is that the mixture components only appear as distinct components (multimodal) when viewed from a small number of very particular feature dimensions (e.g., important for discriminating race or gender), but still Gaussian in the great majority of dimensions.

Fig. 6.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 6.

Simulation: UiA due to bimodality in the race-informative subspace. (A) Data are adapted from ref. 26: single-race (SR; Asian–Asian, White–White) face blends are rated as more attractive than mixed-race (MR; Asian–White) face blends, when a race categorization task precedes attractiveness rating (two-sample t test, P<0.05). Error bars are 95% CIs. (B) Model: statistical typicality of MR blends is lower than SR blends in the 1D race-informative subspace. (C) Empirical distribution of White and Asian faces (25) projected into the race-informative subspace is bimodal. X, mean location of face images (60 total) for each percentage of blend (i.e., 1: 100% White and 0% Asian, …, 11: 0% White and 100% Asian. (D) Model-predicted statistical typicality (LL) for actual face images (binned by racial blend percentage) in the full face space. (E) Model-predicted LL for face images (same as in D) in the race-informative subspace. Error bars in B, D, and E indicate SEM over simulated face stimuli.

Fig. 7.
  • Download figure
  • Open in new tab
  • Download powerpoint
Fig. 7.

UiA of bigender blends induced by gender categorization. (A) AAM reconstruction of example stimuli used in ref. 27: blends of varying proportions of male and female parent faces. (B) The empirical distribution of male and female faces (25) projected into the gender-informative subspace is a mixture of two approximately normal distributions. X: mean location of actual experimental stimuli (27) for each percentage of blend. (C) Data: attractiveness rating in experimental condition minus control condition as a function of percentage blend (27). (D) Model-predicted statistical typicality assessed within the gender-discriminating subspace for the same faces as in C (27). (E) Model-predicted statistical typicality assessed in the full (original) space for the same faces as in C (27). Error bars in C–E are SEM over all stimuli used in the experiment for each percentage blend.

Simulation: UiA Reproduced by LL in Task-Relevant Subspace.

Human subjects have been found to rate single-race face blends more attractive than mixed-race face blends when they are required to categorize race before rating attractiveness (Fig. 6A) (26). We simulate the effect of this race categorization task on statistical typicality as follows. First, we randomly sample face images from CFD (25) to create 100 single-race (Asian–Asian, White–White) and 100 mixed-race (Asian–White) blends. Then, we project all face blends into the race-informative 1D subspace. The model-predicted statistical typicality for each face blend is its LL under the Bayesian a posteriori most probable race category (SI Appendix), consistent with a body of work showing that people often use category membership to predict features of, and reason about, members of a category (55⇓⇓–58). We find that model-predicted statistical typicality reproduces the empirically observed UiA effects (Fig. 6B).

We can make more refined predictions by smoothly varying the percentage of blending in each pair of faces. We first randomly draw 60 Asian and White face images (with replacement) from the face dataset (25) and then blend them at 10% increments, from 100% of the White face to 100% of the Asian face, thus producing 11 morphs of each pair (Fig. 6C). We find that average statistical typicality, as a function of percentage blend, has an inverted U shape (BiA) relative to percentage racial blend in the full space (Fig. 6D) or a random subspace (SI Appendix, Fig. S11), but a U shape (UiA) in the race-informative subspace (Fig. 6E). Analogous simulations corresponding to the case of gender categorization (27) exhibit a similar pattern (SI Appendix, Fig. S9).

Image-Level Comparison: Model vs. Data.

As a more stringent test of our model, we investigate the relationship between individual faces’ attractiveness ratings and model-predicted statistical typicality in the attended, task-relevant [linear discriminant analysis (LDA)] subspace. We reanalyze the stimuli and behavioral data from the gender categorization study (27), in which subjects rated the attractiveness of blends from male and female parent faces (Fig. 7A) in different proportions (10% increments), under either the control condition (no gender categorization) or the experimental condition (following gender categorization). It was found that attractiveness ratings in the experimental condition, after subtracting out their ratings in the control condition, exhibit UiA: more even gender blends are relatively more negatively affected by the gender categorization (Fig. 7C) (27). When we use our face model to assess statistical typicality in the gender-informative subspace, we also observe a similar UiA effect (Fig. 7D), in contrast to the BiA effect when statistical typicality is assessed in the full face space (Fig. 7E). More impressively, at the individual face level, we find that the statistical typicality of blended face images significantly correlates with (subtractively normalized) attractiveness rating in the experimental condition (r=0.28, P=0.0026) (SI Appendix, Fig. S6 shows a scatterplot). This relatively high correlation coefficient at the individual face level is impressive because there are clearly other determinants of facial attractiveness other than typicality, such as perceptual and conceptual priming, contrast, clarity, sexual dimorphism, etc. (11, 52, 59). Importantly, the entirety of this predictive power comes from theoretical considerations of the relationship between LL (based on a density model fit to a different dataset, CFD) (25) and liking—no free parameter was estimated using the ratings or stimuli from the experiment.

Discussion

In this paper, we proposed a statistically grounded account of human liking of high-dimensional objects, which in the case of faces, manifests itself as positive social evaluation across multiple traits. We showed that human perception of attractiveness and other positive traits of a face image depends on its statistical typicality, defined as its LL relative to an internal representation of the face distribution. This hypothesis is motivated by statistical and information-theoretic arguments that a good or efficient representation should maximize the average LL (or equivalently, minimize the average code lengths or cross-entropy) of observed data, and is related to the efficient coding hypothesis of neural representation (34). While our analysis is inherently correlational in nature, some existing findings can be reinterpreted to imply that statistical typicality has a causal effect on subjective liking. For example, in a neural phenomenon known as “repetition suppression,” repeated presentation of the same stimulus, such as a face image, has been shown to increase predicted likelihood of observing the stimulus (60) and leads to a robust decrease in evoked neural response (60⇓⇓–63). Relatedly, in the psychological “mere exposure” phenomenon, repeated exposure to a novel stimulus, such as a face image, leads subjects to report greater liking (64) and more positive perceived valence associated with the face (65). Together, it is clear that empirically increased frequency of a stimulus is sufficient to induce both lower neural representational cost and greater subjective liking. Whether the increase in liking is causally mediated by the decrease in neural coding cost is an important direction of future research.

Additionally, we demonstrated that categorization-induced UiA effects (26, 27) naturally arise when statistical typicality is dynamically redefined over the task-relevant featural subspace via attentional modulations (29⇓⇓–32, 66). For example, while categorizing gender, attention dynamically enhances the processing of the gender-discriminative features—formally, by restriction to the relevant featural subspace. The statistical distribution of faces is redefined within this subspace (appearing bimodal), thus leading to systematic reassignment of LL to each face. In particular, the faces that straddle the boundary between two categories (e.g., bigender blends) tend to have high LL in the full face space but low LL in the dynamically restricted representation—thus resulting in UiA. We showed that this theory can indeed quantitatively capture the categorization-induced changes in liking on an image by image basis. Critical for this theory is the assumption that the brain can dynamically alter its representation of faces in a task-dependent manner. Consistent with this, neural receptive fields for faces are known to be rapidly and dynamically modified by attention and task context (67). Notably, we expect rapid dynamic modulation of stimulus representation to be primarily applicable in the case of featural dimensions that are ecologically relevant (such as those discriminating race and gender). Such dynamic modulation may also be possible for arbitrary featural dimensions but, as logic would suggest and empirical evidence concurs (18, 28), would require extensive additional training. It would be interesting to test in future work whether UiA can also be causally induced by newly learned multimodal distributions (18, 28).

Supporting our suggestion that energy allocation in the brain should be efficient and thus proportional to negative LL of the face stimulus, human functional magnetic resonance imaging (fMRI) studies have shown that energy expenditure across multiple face-responsive areas in the brain, indexed by blood-oxygenation-level-dependent (BOLD) response, is indeed approximately quadratic (negative LL of a multivariate normal distribution is quadratic) (42, 43). While these empirical findings (42, 43, 68) were originally interpreted to implicate the relevant brain regions as encoding typicality or “distinctiveness” of faces, we suggest instead that any efficient face representation must respond in this quadratic manner given the approximate normal distribution of faces—this is the case whether or not there is an explicit encoding or decoding of typicality or distinctiveness in a brain area exhibiting a quadratic response to faces. We also note here the important distinction between statistical typicality and subjective typicality. While statistical typicality, of the kind of quadratic signal found in face areas (42, 43), might well contribute to subjective typicality, we have found (50) that human judgment of face typicality and distinctiveness (related to memorability) both have a strong linear component in the face space, just like the four social traits reported here, and thus cannot only be driven by statistical typicality or the quadratic signal found in face-responsive areas.

While a detailed neurocomputational theory is outside the scope of this paper, we briefly discuss one plausible, although obviously greatly simplified, neural implementation of our computational-level theory (69). Various studies have shown that familiar and unfamiliar faces are represented differently in the brain, both in terms of brain regions (62, 70, 71) and coding scheme (51, 72). In particular, familiar faces appear to involve dedicated feature detectors (72), while unfamiliar faces appear to be encoded via a dimensional scheme (51). Within our framework, one may well ask how the brain encodes a distribution over faces, which is necessary to represent the LL of a new face. One possibility, related to a sampling representation of distributions (73) and the notion of landmark points for manifold learning in machine learning (74), is for the brain to represent the face manifold (and distribution) using a sparsely sampled representation consisting of well-known faces. When a novel face is encountered, the brain could first identify the closest known face (55⇓⇓–58) and then use a dimensional coding scheme (51) to encode the discrepancy between the retrieved prototype and the novel face. This scheme builds on both prototype- and norm-based face representations (75) and is consistent with the predictive coding hypothesis (76, 77). Within this framework, a statistically atypical face incurs high coding cost because it tends to be far from the retrieved exemplar, and thus, the discrepancy will be large and expensive to represent in a dimensional coding scheme (51). In the UiA-inducing categorization setting, attention enhances the neural response to task-relevant features relative to task-irrelevant features. This has the effect of increasing the overall coding cost of a category-straddling stimulus, since the featural discrepancy (and thus, coding cost) between the stimulus and the closest retrieved prototype is highest in the task-relevant dimensions, which are enhanced by attentional modulation. Importantly, this example also illustrates that attentional modulation and statistical typicality alone are sufficient to explain UiA at the neural level, regardless of whether the relationship between statistical typicality and liking is due to efficient coding or not.

The above is but one plausible neural coding scheme, but it illustrates the important distinction between dynamic coding cost, which we hypothesize drives liking, and long-term structural cost (e.g., forming a new feature detector). Well-known faces, which are statistically more typical, are given greater structural representational resources in order to minimize their dynamic coding cost; conversely, statistically atypical faces incur high dynamic coding cost as a consequence of little dedicated structural representation (no dedicated feature detectors). While atypical faces in a stable, well-known environment tend to cancel each other out in terms of driving learning since they are inevitably and symmetrically distributed in the fringes of an approximately normal distribution, a sudden influx of atypical faces (such as that induced by immigration) could drive systematic representational plasticity so as to reduce average long-term coding cost.

Statistical atypicality (negative LL) is related to the theoretical notion of “unexpected uncertainty,” which we earlier proposed to reflect a confluence of unexpected deviations between expectation and observations and signal the need for representational learning (44). We proposed that unexpected uncertainty, signaled by the neuromodulator norepinephrine, acts in concert with expected uncertainty, signaled by the neuromodulator acetylcholine, to assist the neocortex in learning and maintaining appropriate representations of environmental statistics as well as selecting the appropriate behavioral responses (44). This theory has received considerable empirical support in the intervening years (78⇓⇓⇓⇓–83). In the original formulation of expected and unexpected uncertainty (44), we had in mind rather simple kinds of statistical inference and learning such as those related to associative learning; the current work suggests that similar mechanisms may also apply to highly complex stimuli such as faces (68). Among other implications, this leads to the interesting hypothesis that atypical faces might lead to an elevation of norepinephrine release. Consistent with this, there is evidence that an early and rapid pupil constriction predicts high attractiveness rating for faces, and experimental manipulation of pupil size causally affects perceived facial attractiveness in the expected direction (84). Combined with the findings that phasic increase in pupil size is associated with elevated norepinephrine release in the brain (83) and that phasic pupil dilation enhances learning (both correlationally and causally) in a manner consistent with unexpected uncertainty (79, 85), this suggests that facial atypicality may indeed modulate norepinephrine-mediated control over attractiveness perception and representational learning in a computationally principled manner.

It may puzzle some readers that we assert greater learning about atypical faces despite their associated negative affect, instead of less attention or less approach behavior. While negative affect can lead to physical avoidance of a face (although not always) (86), it is but one contributing factor to approach/avoidance behavior. In a classical study, it has actually been found that there is a strong negative correlation (r=−0.60) between liking and exploration of face images, such that those faces perceived as least likable are also those that induce the most exploration (87). Relatedly, novelty has been observed to drive learning and exploration in relation to faces and visual scenes (88, 89).

We do not claim statistical typicality to be the sole determiner of attractiveness or other social trait evaluations. For example, our model does not explain certain aspects of facial preferences (e.g., sexual dimorphism in face perception) (52, 90) or systematic differences in preference judgment for faces vs. other categories of objects (89). Even in our own analysis [consistent with prior findings (50, 54)], we find a separate linear component, apparently unique to each social trait. Relatedly, we recently found that the liking “function” over the stimulus space may be modified by positive/negative encounters with specific exemplars, which are then extrapolated to the rest of the space depending on the clustering (categorical) structure of the data (91). Nevertheless, statistical typicality already provides a parsimonious and normative account of several previously proposed causal factors of attractiveness. For example, we found that popular methods for symmetrizing face images also tend to increase statistical typicality. There is also recent work showing that coding cost at low-level image statistics (e.g., related to small image patches) also decreases attractiveness (20, 92), which is expected since low-level statistical typicality (in an efficient coding scheme) is a subcomponent of general statistical typicality. From this perspective, it also makes sense that blemishes (9) should decrease attractiveness, as they are statistically irregular and thus, expensive to encode.

Previously, it has been suggested that “averageness” contributes to facial attractiveness (93, 94), but the effect size was found to be quite modest (18, 20, 52). Because LL of a multivariate normal distribution peaks at the mean face and falls off monotonically along any particular dimension, it can also be thought of as a measure of averageness—our results indicate a larger effect of averageness on attractiveness perception. In previous studies, averageness was quantified as negative Euclidean distance (20) or negative standardized Euclidean distance (normalized by the SD in each dimension) (52). In contrast, LL is equivalent to negative Mahalanobis distance, which not only takes SD into account but also, correlation between features (as are present between shape and texture features). A larger quadratic effect might also be present in our study because the face stimuli were designed to vary systematically in LL (Materials and Methods and SI Appendix)—although Fig. 3 makes it clear that it is not only the most extreme stimuli that are driving the effects but the full range of faces. In contrast to previous studies (16, 18) that constrained the analysis to a single dimension, our model allows the possibility of using stimuli that vary along all (90) face feature dimensions and thus, arrive at more accurate and general conclusions about human face processing. For example, Euclidean distance, standardized Euclidean distance, and Mahalanobis distance (−LL) are all confounded on a single dimension but differ from one another nontrivially when a larger number of dimensions is considered. A greater differentiation between statistical typicality and averageness arises in the case of multimodal distributions, since LL and averageness are no longer well correlated. In this vein, our UiA analyses suggest that statistical typicality is a more precise and general concept than averageness for explaining facial liking.

Another prominent notion related to statistical typicality is prototypicality. Prototypicality is a measure of “closeness” of a stimulus to the “prototype” (11, 21), implying there are clear, fixed modes in the stimulus distribution. Our account differs in two ways: firstly, it does not assume the prototypes to be fixed and predefined but rather, task dependent (e.g., the average male face can be considered a prototype in a gender categorization task but not in a race categorization task); secondly, statistical typicality is well defined even for distributions that have no distinct modes, such as for the fairly flat (close to uniform) distribution of age, in which case LL is approximately a constant.

Our statistical typicality account is also related to a rather different explanation of BiA and UiA, known as the fluency account (11, 26, 27, 95). The fluency account hypothesizes that stimuli, such as category-specific prototypes, may be processed more “fluently” than other stimuli, and human liking of faces and other objects decreases in response to “disfluency” in processing. At a broad level, the fluency account shares with our statistical typicality hypothesis the concept of a human preference for efficiency. However, fluency is not mathematically defined but empirically measured as categorization response time (26, 27), which is undefined for the BiA setting where there is no categorization task (11). There is also a subtle but important distinction between statistical typicality and fluency (11, 26, 27, 95): the latter assumes that the disfluency (and thus, disliking) of intercategory faces is specifically related to the difficulty of categorizing faces that are close to the decision boundary. So far, all experimental results related to UiA have been obtained using categories that correspond to statistical modes in the data, as is the case with race (26) and gender (27). In such cases, the intercategory stimuli are both close to the decision boundary and have low LL. However, one can disentangle the two factors by using a categorization task in which the categories do not correspond to natural statistical clusters: for example, by asking subjects to decide whether a face is above or below a certain age threshold (say 40). As the statistical distribution along the age-relevant dimension is not obviously bimodal, statistical typicality would not predict a UiA effect in this case, while the fluency account still would. More broadly, processing fluency can be expected to be influenced not only by coding efficiency but also by, for example, computational complexity, motor delay or effort, attention, or motivational factors unrelated to coding efficiency. Other qualitative concepts related to fluency and statistical typicality are “simplicity” (96), in the sense that processing can become more efficient after a simple explanation/representation is learned, and effort-based decision making (97, 98), as processing cost at the biophysical level may contribute to cognitively perceived effort. More broadly, future empirical and theoretical work is needed to clarify the formal relationship among computational concepts such as statistical typicality, coding efficiency (34, 39), and processing efficiency, and more qualitative psychological concepts such as fluency (95), effort (97, 98), and simplicity (96).

In addition to providing a statistically grounded explanation of contextual dependence of human attractiveness judgment, our work also provides some general insight as to how high-dimensional data can be analyzed and stored efficiently in an intelligent system. All elements of the model presented here can be easily generalized to nonface stimuli, such as dogs, birds, butterflies, fish, automobiles, watches, and synthetic dot patterns (11, 14, 15). Minimizing the average coding cost of statistically distributed data is a desirable goal for any efficient representational system, whether natural or artificial. Another important computational insight is that a system can overcome limitations in its representational and computational capacity by dynamically shifting its featural representation to focus on task-relevant dimensions according to the behavioral context. As such, our work sheds light on one possible functional role played by attentional selection in the brain: it is one way to dynamically construct subspaces that emphasize feature dimensions that are most relevant or salient for performing the task at hand (29⇓⇓–32, 66). A productive direction of future research would be to see whether our hypothesized role of attention can help to shed light on the large and confusing attention literature.

Our line of reasoning sheds light on a broader computational understanding of how the brain dynamically encodes and processes complex, high-dimensional data. Faces provide excellent stimuli for investigating such processes, because they are informationally rich, ecologically important, and amenable to neurally plausible, quantitative modeling via methods such as AAM. We therefore used faces to implement and test concrete ideas about information representation and its contextual modulation in this work. The attractiveness literature also provided a convenient empirical test of our theory. However, we expect that the dynamic representational framework we hypothesize here also affects other cognitive processes, such as working memory, learning, decision making, and problem solving, in the sense that all these cognitive processes can benefit from attentional enhancement of task-relevant feature dimensions over irrelevant dimensions. For example, learning to memorize a set of items should be easier if one’s attention is focused on the features that make these items easier to organize. Another example is that two stimuli that differ along an attended featural dimension should be easier to discriminate and later recall, and those that differ along an unattended dimension (orthogonal to the attended dimension or dimensions) should be harder to discriminate and later recall. In general, the benefits of cognitive expediency and behavioral accuracy derived from focusing on task-relevant features are broad and multifaceted, pointing to a promising direction for future research.

Materials and Methods

Formal Model.

We assume humans have an internal d-dimensional representation of faces X (21, 52, 99), in which each face is represented by a vector x=(x1,…,xd) of d real-valued features. We also assume that this face space is endowed with a probability distribution pX(x), reflecting an internal statistical representation of faces (18). In the absence of a categorization task (e.g., by gender or race), statistical typicality is defined as the LL of the face under the full distribution logpX(x). When the observer performs a categorization task, such as gender categorization, we assume the brain projects each face x into the task-relevant subspace X∼⊆X to obtain x∼ (for example, if θ is a basis vector for the task-informative 1D subspace X∼, the dot product x⋅θ yields x∼). It then evaluates the statistical typicality of x∼ as its LL logpX∼(x∼|c) under the class conditional marginal distribution pX∼(⋅|c) that marginalizes over all of the dimensions orthogonal to the subspace X∼, where c is the Bayesian a posteriori most probable category (e.g., “male” or “female” for gender categorization). This formulation of attentional focusing corresponds to completely eliminating task-irrelevant feature information while sparing attended dimensions, which is likely an exaggeration of brain mechanisms (29), but a “softer” attentional mechanism that partially suppresses information transmission in the irrelevant dimensions relative to the relevant dimensions should produce similar results.

AAM.

We model faces using the AAM, due to its neural relevance (51), previous success in modeling human face space and predicting human responses (20, 50, 100), ability to output feature values for novel faces and generate realistic-looking synthetic faces for any feature setting, and high transparency and customizability in contrast to commercial software. We train AAM (SI Appendix) using CFD, a public dataset of 597 face images (25). We obtain for each face 30 shape features related to the geometric layout of invariant elements of faces (e.g., eyes, eye brows, nose, mouth, contour of the face) and 60 texture features related to pixel variations within and among these elements. (Results are insensitive to exact number of features; see SI Appendix.)

Experiment.

Participants.

Forty-one (mean age 20.6 y, 24 female) University of California (UC) San Diego undergraduate students participated in the study in exchange for course credit. Participants gave informed consent before taking part in the study. Approval for the study was given by the UC San Diego Human Research Protection Program.

Stimuli

AAM was used to generate 2,520 synthetic face images (SI Appendix, Fig. S2 shows example stimuli).

Procedure.

Participants rated faces “intuitively” on a Likert scale (one to five) for how “attractive,” “trustworthy,” “dominant,” and positive (valence) faces appeared. Each image received on average 2.35 ratings per trait.

Data Availability.

Anonymized data from the experiment where participants rated attractiveness, trustworthiness, dominance, and valence are available in the Open Science Framework (101).

Acknowledgments

We thank Jamin Halberstadt for sharing data from ref. 26. This project was partially funded by University of California San Diego Academic Senate grants (to P.W. and A.J.Y.).

Footnotes

  • ↵1To whom correspondence may be addressed. Email: ajyu{at}ucsd.edu.
  • Author contributions: C.K.R., S.G., P.W., and A.J.Y. designed research; C.K.R., S.G., and A.J.Y. performed research; C.K.R. and A.J.Y. contributed new reagents/analytic tools; C.K.R. and A.J.Y. analyzed data; and C.K.R., S.G., P.W., and A.J.Y. wrote the paper.

  • The authors declare no competing interest.

  • This paper results from the Arthur M. Sackler Colloquium of the National Academy of Sciences, “Brain Produces Mind by Modeling,” held May 1–3, 2019, at the Arnold and Mabel Beckman Center of the National Academies of Sciences and Engineering in Irvine, CA. NAS colloquia began in 1991 and have been published in PNAS since 1995. From February 2001 through May 2019, colloquia were supported by a generous gift from The Dame Jillian and Dr. Arthur M. Sackler Foundation for the Arts, Sciences, & Humanities, in memory of Dame Sackler’s husband, Arthur M. Sackler. The complete program and video recordings of most presentations are available on the NAS website at http://www.nasonline.org/brain-produces-mind-by.

  • This article is a PNAS Direct Submission.

  • This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.1912343117/-/DCSupplemental.

Published under the PNAS license.

References

  1. ↵
    1. M. Neta,
    2. M. Bar,
    3. H. Linz
    , Very first impressions. Emotion 6, 269–278 (2006).
    OpenUrlCrossRefPubMed
  2. ↵
    1. J. Willis,
    2. A. Todorov
    , First impressions: Making up your mind after a 100-ms exposure to a face. Psychol. Sci. 17, 592–598 (2006).
    OpenUrlCrossRefPubMed
  3. ↵
    1. I. Gauthier,
    2. M. Tarr,
    3. D. Bub
    , Perceptual Expertise: Bridging Brain and Behavior (Oxford University Press, 2010).
  4. ↵
    1. A. Todorov,
    2. C. Y. Olivola,
    3. R. Dotsch,
    4. P. Mende-Siedlecki
    , Social attributions from faces: Determinants, consequences, accuracy, and functional significance. Annu. Rev. Psychol. 66, 519–545 (2015).
    OpenUrlCrossRefPubMed
  5. ↵
    1. F. Galton
    , Composite portraits, made by combining those of many different persons into a single resultant figure. J. Anthropol. Inst. G. B. Ireland 8, 132–148 (1879).
    OpenUrl
  6. ↵
    1. J. H. Langlois,
    2. L. A. Roggman
    , Attractive faces are only average. Psychol. Sci. 1, 115–121 (1990).
    OpenUrlCrossRef
  7. ↵
    1. B. Jones et al.
    , Facial symmetry and judgements of apparent health. Evol. Hum. Behav. 22, 417–429 (2001).
    OpenUrlCrossRef
  8. ↵
    1. G. Rhodes et al.
    , Attractiveness of facial averageness and symmetry in non-Western cultures: In search of biologically based standards of beauty. Perception 30, 611–625 (2001).
    OpenUrlCrossRefPubMed
  9. ↵
    1. J. H. Langlois,
    2. L. A. Roggman,
    3. L. Musselman
    , What is average and what is not average about attractive faces? Psychol. Sci. 5, 214–220 (1994).
    OpenUrlCrossRef
  10. ↵
    1. B. C. Jones,
    2. L. M. DeBruine,
    3. A. C. Little
    , The role of symmetry in attraction to average faces. Percept. Psychophys. 69, 1273–1277 (2007).
    OpenUrlCrossRefPubMed
  11. ↵
    1. P. Winkielman,
    2. J. Halberstadt,
    3. T. A. Fazendeiro,
    4. S. Catty
    , Prototypes are attractive because they are easy on the mind. Psychol. Sci. 17, 799–806 (2006).
    OpenUrlCrossRefPubMed
  12. ↵
    1. D. Symons
    , The Evolution of Human Sexuality (Oxford University Press, 1979).
  13. ↵
    1. R. Thornhill,
    2. S. W. Gangestad
    , Human facial beauty: Averageness, symmetry, and parasite resistance. Hum. Nat. 4, 237–269 (1993).
    OpenUrlCrossRefPubMed
  14. ↵
    1. J. Halberstadt,
    2. G. Rhodes
    , The attractiveness of nonface averages: Implications for an evolutionary explanation of the attractiveness of average faces. Psychol. Sci. 11, 285–289 (2000).
    OpenUrlCrossRefPubMed
  15. ↵
    1. J. Halberstadt,
    2. G. Rhodes
    , It’s not just average faces that are attractive: Computer-manipulated averageness makes birds, fish, and automobiles attractive. Psychon. Bull. Rev. 10, 149–156 (2003).
    OpenUrlPubMed
  16. ↵
    1. C. Sofer,
    2. R. Dotsch,
    3. D. H. J. Wigboldus,
    4. A. Todorov
    , What is typical is good: The influence of face typicality on perceived trustworthiness. Psychol. Sci. 26, 39–47 (2015).
    OpenUrlCrossRefPubMed
  17. ↵
    1. C. Sofer et al.
    , For your local eyes only: Culture-specific face typicality influences perceptions of trustworthiness. Perception 46, 914–928 (2017).
    OpenUrl
  18. ↵
    1. R. Dotsch,
    2. A. Todorov,
    3. R. R. Hassin
    , Statistical learning shapes face evaluation. Nat. Hum. Behav. 1, 0001 (2016).
    OpenUrl
  19. ↵
    1. J. Halberstadt
    , The generality and ultimate origins of the attractiveness of prototypes. Pers. Soc. Psychol. Rev. 10, 166–183 (2006).
    OpenUrlCrossRefPubMed
  20. ↵
    1. I. J. Holzleitner et al.
    , Comparing theory-driven and data-driven attractiveness models using images of real women’s faces. J. Exp. Psychol. Hum. Percept. Perform. 45, 1589–1595 (2019).
    OpenUrl
  21. ↵
    1. T. Valentine
    , A unified account of the effects of distinctiveness, inversion, and race in face recognition. Q. J. Exp. Psychol. A 43, 161–204 (1991).
    OpenUrlCrossRefPubMed
  22. ↵
    1. H. Burkhardt,
    2. B. Neumann
    1. G. J. Edwards,
    2. T. F. Cootes,
    3. C. J. Taylor
    , “Face recognition using active appearance models” in European Conference on Computer Vision, H. Burkhardt, B. Neumann, Eds. (Springer-Verlag, Heidelberg, Germany, 1998), pp. 581–595.
  23. ↵
    1. T. F. Cootes,
    2. G. J. Edwards,
    3. C. J. Taylor
    , Active appearance models. IEEE Trans. Pattern Anal. Mach. Intell. 23, 681–685 (2001).
    OpenUrlCrossRef
  24. ↵
    1. G. Tzimiropoulos,
    2. M. Pantic
    , “Optimization problems for fast AAM fitting in-the-wild” in Proceedings of the IEEE International Conference on Computer Vision (IEEE, Los Alamitos, CA, 2013), pp. 593–600.
  25. ↵
    1. D. S. Ma,
    2. J. Correll,
    3. B. Wittenbrink
    , The Chicago face database: A free stimulus set of faces and norming data. Behav. Res. Methods 47, 1122–1135 (2015).
    OpenUrlCrossRefPubMed
  26. ↵
    1. J. Halberstadt,
    2. P. Winkielman
    , Easy on the eyes, or hard to categorize: Classification difficulty decreases the appeal of facial blends. J. Exp. Soc. Psychol. 50, 175–183 (2014).
    OpenUrlCrossRef
  27. ↵
    1. H. E. Owen,
    2. J. Halberstadt,
    3. E. W. Carr,
    4. P. Winkielman
    , Johnny Depp, reconsidered: How category-relative processing fluency determines the appeal of gender ambiguity. PloS One 11, e0146328 (2016).
    OpenUrl
  28. ↵
    1. T. Vogel,
    2. E. W. Carr,
    3. T. Davis,
    4. P. Winkielman
    , Category structure determines the relative attractiveness of global versus local averages. J. Exp. Psychol. Learn. Mem. Cogn. 44, 250–267 (2018).
    OpenUrl
  29. ↵
    1. L. K. Saul,
    2. Y. Weiss,
    3. L. Bottou
    1. A. J. Yu,
    2. P. Dayan
    , “Inference, attention, and decision in a Bayesian neural architecture” in Advances in Neural Information Processing Systems 17, L. K. Saul, Y. Weiss, L. Bottou, Eds. (MIT Press, Cambridge, MA, 2005), pp. 1577–1584.
  30. ↵
    1. Y. C. Chiu,
    2. M. Esterman,
    3. Y. S. Yantis,
    4. R. H. Han
    , Decoding task-based attentional modulation during face categorization. J. Cogn. Neurosci. 23, 1198–1204 (2011).
    OpenUrlCrossRefPubMed
  31. ↵
    1. C. Gratton,
    2. K. K. Sreenivasan,
    3. M. A. Silver,
    4. M. D’Esposito
    , Attention selectively modifies the representation of individual faces in the human brain. J. Neurosci. 33, 6979–6989 (2013).
    OpenUrlAbstract/FREE Full Text
  32. ↵
    1. N. Davidenko,
    2. C. Q. Vu,
    3. N. H. Heller,
    4. J. M. Collins
    , Attending to race (or gender) does not increase race (or gender) aftereffects. Front. Psychol. 7, 909 (2016).
    OpenUrl
  33. ↵
    1. C. E. Shannon
    , A mathematical theory of communication. Bell Labs Tech. J. 27, 379–423 (1948).
    OpenUrlCrossRef
  34. ↵
    1. W. A. Rosenblith
    1. H. B. Barlow
    , “Possible principles underlying the transformations of sensory messages” in Sensory Communication, W. A. Rosenblith, Ed. (MIT Press, 1961), pp. 217–234.
  35. ↵
    1. L. Luo
    , Principles of Neurobiology (Garland Science, 2015).
  36. ↵
    1. C. K. Machens,
    2. T. Golisch,
    3. O. Kolesnikova,
    4. A. V. Herz
    , Testing the efficiency of sensory coding with optimal stimulus ensembles. Neuron 47, 447–456 (2005).
    OpenUrlCrossRefPubMed
  37. ↵
    1. A. Barron,
    2. J. Rissanen,
    3. B. Yu
    , The minimum description length principle in coding and modeling. IEEE Trans. Inf. Theor. 44, 2743–2760 (1998).
    OpenUrlCrossRef
  38. ↵
    1. T. M. Cover,
    2. J. A. Thomas
    , Elements of Information Theory (Wiley-Interscience, 2006).
  39. ↵
    1. W. B. Levy,
    2. R. A. Baxter
    , Energy efficient neural codes. Neural Comput. 8, 531–543 (1996).
    OpenUrlCrossRefPubMed
  40. ↵
    1. Z. Wang,
    2. X.-X. Wei,
    3. A. A. Stocker,
    4. D. D. Lee
    , Efficient neural codes under metabolic constraints. Adv. Neural Inf. Process. Syst. 29, 4619–4627 (2016).
    OpenUrl
  41. ↵
    1. N. Qian,
    2. J. Zhang
    , Neuronal firing rate as code length: A hypothesis. Comput. Brain & Behav. 3, 34–53 (2020).
    OpenUrl
  42. ↵
    1. C. P. Said,
    2. R. Dotsch,
    3. A. Todorov
    , The amygdala and FFA track both social and non-social face dimensions. Neuropsychologia 48, 3596–3605 (2010).
    OpenUrlCrossRefPubMed
  43. ↵
    1. G. Mattavelli,
    2. T. J. Andrews,
    3. A. U. R. Asghar,
    4. J. R. Towler,
    5. A. W. Young
    , Response of face-selective brain regions to trustworthiness and gender of faces. Neuropsychologia 50, 2205–2211 (2012).
    OpenUrl
  44. ↵
    1. A. J. Yu,
    2. P. Dayan
    , Uncertainty, neuromodulation, and attention. Neuron 46, 681–692 (2005).
    OpenUrlCrossRefPubMed
  45. ↵
    1. K. Friston
    , A theory of cortical responses. Philos. Trans. R Soc. Lond. B Biol. Sci. 360, 815–836 (2005).
    OpenUrlCrossRefPubMed
  46. ↵
    1. W. Cohen,
    2. A. Moore
    1. R. Caruana,
    2. A. Niculescu-Mizil
    , “An empirical comparison of supervised learning algorithms” in Proceedings of the 23rd International Conference on Machine Learning, W. Cohen, A. Moore, Eds. (Omni Press, Madison, WI, 2006), pp. 161–168.
  47. ↵
    1. L. T. Trujillo,
    2. J. M. Jankowitsch,
    3. J. H. Langlois
    , Beauty is in the ease of the beholding: A neurophysiological test of the averageness theory of facial attractiveness. Cognit. Affect Behav. Neurosci. 14, 1061–1076 (2014).
    OpenUrl
  48. ↵
    1. S. Sun et al.
    , Decision ambiguity is mediated by a late positive potential originating from cingulate cortex. NeuroImage 157, 400–414 (2017).
    OpenUrlCrossRefPubMed
  49. ↵
    1. O. K. Kaminska et al.
    , Ambiguous at the second sight: Mixed facial expressions trigger late electrophysiological responses linked to lower social impressions. Cognit. Affect Behav. Neurosci. 20, 441–454 (2020).
    OpenUrl
  50. ↵
    1. J. Guan,
    2. C. Ryali,
    3. A. J. Yu
    , Computational modeling of social face perception in humans: Leveraging the active appearance model bioRxiv:10.1101/360776 (3 July 2018).
  51. ↵
    1. L. Chang,
    2. D. Y. Tsao
    , The code for facial identity in the primate brain. Cell 169, 1013–1028.e14 (2017).
    OpenUrlCrossRefPubMed
  52. ↵
    1. C. P. Said,
    2. A. Todorov
    , A statistical model of facial attractiveness. Psychol. Sci. 22, 1183–1190 (2011).
    OpenUrlCrossRefPubMed
  53. ↵
    1. A. Todorov,
    2. N. Oosterhof
    , Modeling social perception of faces [social sciences]. IEEE Signal Process. Mag. 28, 117–122 (2011).
    OpenUrl
  54. ↵
    1. A. Todorov,
    2. R. Dotsch,
    3. D. H. Wigboldus,
    4. C. P. Said
    , Data-driven methods for modeling social perception. Social Pers. Psychol. Compass 5, 775–791 (2011).
    OpenUrl
  55. ↵
    1. E. Rosch,
    2. C. B. Mervis
    , Family resemblance: Studies in the internal structure of categories. Cogn. Psychol. 7, 573–605 (1975).
    OpenUrlCrossRef
  56. ↵
    1. A. B. Markman,
    2. B. H. Ross
    , Category use and category learning. Psychol. Bull. 129, 592–613 (2003).
    OpenUrlCrossRefPubMed
  57. ↵
    1. B. C. Malt,
    2. B. H. Ross,
    3. G. L. Murphy
    , Predicting features for members of natural categories when categorization is uncertain. J. Exp. Psychol. Learn. Mem. Cognit. 21, 646–661 (1995).
    OpenUrlPubMed
  58. ↵
    1. S. Y. Chen,
    2. B. H. Ross,
    3. G. L. Murphy
    , Implicit and explicit processes in category-based induction: Is induction best when we don’t think? J. Exp. Psychol. Gen. 143, 227–246 (2014).
    OpenUrl
  59. ↵
    1. A. L. Jones,
    2. B. Jaeger
    , Biological bases of beauty revisited: The effect of symmetry, averageness, and sexual dimorphism on female facial attractiveness. Symmetry 11, 279 (2019).
    OpenUrl
  60. ↵
    1. C. Summerfield,
    2. E. H. Trittschuh,
    3. J. M. Monti,
    4. M. M. Mesulam,
    5. T. Egner
    , Neural repetition suppression reflects fulfilled perceptual expectations. Nat. Neurosci. 11, 1004–1006 (2008).
    OpenUrlCrossRefPubMed
  61. ↵
    1. R. Elliott,
    2. R. J. Dolan
    , Neural response during preference and memory judgments for subliminally presented stimuli: A functional neuroimaging study. J. Neurosci. 18, 4697–4704 (2013).
    OpenUrl
  62. ↵
    1. M. I. Gobbini,
    2. J. V. Haxby
    , Neural response to the visual familiarity of faces. Brain Res. Bull. 71, 76–82 (2006).
    OpenUrlCrossRefPubMed
  63. ↵
    1. W. Koustaal et al.
    , Perceptual specificity in visual object priming: Functional magnetic resonance imaging evidence for a laterality difference in fusiform cortex. Neuropsychologia 39, 184–199 (2013).
    OpenUrl
  64. ↵
    1. R. B. Zajonc
    , Attitudinal effects of mere exposure. J. Pers. Soc. Psychol. 9, 1–27 (1968).
    OpenUrlCrossRefPubMed
  65. ↵
    1. E. W. Carr,
    2. T. F. Brady,
    3. P. Winkielman
    , Are you smiling, or have i seen you before? Familiarity makes faces look happier. Psychol. Sci. 28, 1087–1102 (2017).
    OpenUrl
  66. ↵
    1. S. Kastner
    1. A. C. Nobre
    1. A. J. Yu
    , “Bayesian models of attention” in Handbook of Attention, S. Kastner, A. C. Nobre, Eds. (Oxford University Press, Oxford, United Kingdom, 2014), pp. 1159–1197.
  67. ↵
    1. K. Grill-Spector,
    2. K. S. Weiner,
    3. J. Gomez,
    4. A. Stigliani,
    5. V. S. Natu
    , The functional neuroanatomy of face perception: From brain measurements to deep neural networks. Interf. Focus 8, 20180013 (2018).
    OpenUrl
  68. ↵
    1. A. Todorov
    , The role of amygdala in face perception and evaluation. Motiv. Emot. 36, 16–26 (2012).
    OpenUrlPubMed
  69. ↵
    1. D. Marr,
    2. Vision
    , A Computational Investigation into the Human Representation and Processing of Visual Information (Henry Holt and Co., Inc., 1982).
  70. ↵
    1. S. M. Landi,
    2. W. A. Freiwald
    , Two areas for familiar face recognition in the primate brain. Science 357, 591–595 (2017).
    OpenUrlAbstract/FREE Full Text
  71. ↵
    1. V. Natu,
    2. A. J. O’Toole
    , The neural processing of familiar and unfamiliar faces: A review and synopsis. Br. J. Psychol. 102, 726–747 (2011).
    OpenUrlCrossRefPubMed
  72. ↵
    1. L. Reddy,
    2. R. Q. Quiroga,
    3. G. Kreiman,
    4. C. Koch,
    5. I. Fried
    , Invariant visual representation by single eneurons in the human brain. Nature 435, 1102–1107 (2005).
    OpenUrlCrossRefPubMed
  73. ↵
    1. E. Vul,
    2. N. Goodman,
    3. T. L. Grifiths,
    4. J. B. Tenenbaum
    , One and done? Optimal decisions from very few samples. Cogn. Sci. 38, 599–637 (2014).
    OpenUrlCrossRefPubMed
  74. ↵
    1. Y. Weiss,
    2. B. Schölkopf,
    3. J. C. Platt
    1. J. Silva,
    2. J. Marques,
    3. J. Lemos
    , “Selecting landmark points for sparse manifold learning” in Advances in Neural Information Processing Systems, Y. Weiss, B. Schölkopf, J. C. Platt, Eds. (MIT Press, Cambridge, MA, 2006), pp. 1241–1248.
  75. ↵
    1. D. A. Ross,
    2. M. Deroche,
    3. T. J. Palmeri
    , Not just the norm: Exemplar-based models also predict face aftereffects. Psychon. Bull. Rev. 21, 47–70 (2014).
    OpenUrl
  76. ↵
    1. R. P. Rao,
    2. D. H. Ballard
    , Predictive coding in the visual cortex: A functional interpretation of some extra-classical receptive-field effects. Nat. Neurosci. 2, 79–87 (1999).
    OpenUrlCrossRefPubMed
  77. ↵
    1. P. Zmarz,
    2. G. B. Keller
    , Mismatch receptive fields in mouse visual cortex. Neuron 92, 766–772 (2016).
    OpenUrl
  78. ↵
    1. F. Meyniel,
    2. D. Schlunegger,
    3. S. Dehaene
    , The sense of confidence during probabilistic learning: A normative account. PLoS Comput. Biol. 11, e1004305 (2015).
    OpenUrlCrossRefPubMed
  79. ↵
    1. M. R. Nassar,
    2. R. C. Wilson,
    3. B. Heasly,
    4. J. I. Gold
    , An approximately Bayesian delta-rule model explains the dynamics of belief updating in a changing environment. J. Neurosci. 30, 12366–12378 (2010).
    OpenUrlAbstract/FREE Full Text
  80. ↵
    1. F. Meyniel,
    2. S. Dehaene
    , Brain networks for confidence weighting and hierarchical inference during probabilistic learning. Proc. Natl. Acad. Sci. U.S.A. 114, E3859–E3868 (2017).
    OpenUrlAbstract/FREE Full Text
  81. ↵
    1. J. T. McGuire,
    2. M. R. Nassar,
    3. J. I. Gold,
    4. J. W. Kable
    , Functionally dissociable influences on learning rate in a dynamic environment. Neuron 84, 870–881 (2014).
    OpenUrlCrossRefPubMed
  82. ↵
    1. E. Payzan-LeNestour,
    2. S. Dunne,
    3. P. Bossaerts,
    4. J. P. O’Doherty
    , The neural representation of unexpected uncertainty during value-based decision making. Neuron 79, 191–201 (2013).
    OpenUrlCrossRefPubMed
  83. ↵
    1. J. Reimer et al.
    , Pupil fluctuations track rapid changes in adrenergic and cholinergic activity in cortex. Nat. Commun. 7, 13289 (2016).
    OpenUrlCrossRefPubMed
  84. ↵
    1. H.-I. Liao,
    2. M. Kashino,
    3. S. Shimojo
    , Transient pupil constriction reflects and affects facial attractiveness. bioRxiv:10.1101/2020.04.02.021436 (3 April 2020).
  85. ↵
    1. A. J. Yu
    , Change is in the eye of the beholder. Nat. Neurosci. 15, 933–935 (2012).
    OpenUrlCrossRefPubMed
  86. ↵
    1. M. Hendricks,
    2. R. Bootzin
    , Race and sex as stimuli for negative affect and physical avoidance. J. Soc. Psychol. 98, 111–120 (1976).
    OpenUrlCrossRef
  87. ↵
    1. A. A. Harrison
    , Response competition, frequency, exploratory behavior, and liking. J. Pers. Soc. Psychol. 9, 363–368 (1968).
    OpenUrlCrossRefPubMed
  88. ↵
    1. J. Gottlieb,
    2. P. Y. Oudeyer,
    3. M. Lopes,
    4. A. Baranes
    , Information-seeking, curiosity, and attention: Computational and neural mechanisms. Trends Cogn. Sci. 17, 585–593 (2013).
    OpenUrlCrossRefPubMed
  89. ↵
    1. J. Park,
    2. E. Shimojo,
    3. S. Shimojo
    , Roles of familiarity and novelty in visual preference judgments are segregated across object categories. Proc. Natl. Acad. Sci. U.S.A. 107, 14552–14555 (2010).
    OpenUrlAbstract/FREE Full Text
  90. ↵
    1. A. C. Little,
    2. L. M. DeBruine,
    3. B. C. Jones
    , Sex differences in attraction to familiar and unfamiliar opposite-sex faces: Men prefer novelty and women prefer familiarity. Arch. Sex. Behav. 43, 973–981 (2014).
    OpenUrlCrossRefPubMed
  91. ↵
    1. T. Vogel,
    2. M. N. Ingendahl,
    3. P. Winkielman
    , The architecture of prototype preferences: Typicality, fluency, and valence. J. Exp. Psychol. General, https://doi.org/10.1037/xge0000798 (2020).
  92. ↵
    1. J. P. Renoult,
    2. J. Bovet,
    3. M. Raymond
    , Beauty is in the efficient coding of the beholder. Royal Soc. Open Sci. 3, 160027 (2016).
    OpenUrl
  93. ↵
    1. M. Komori,
    2. S. Kawamura,
    3. S. Ishihara
    , Averageness or symmetry: Which is more important for facial attractiveness? Acta Psychol. 131, 136–142 (2009).
    OpenUrlCrossRefPubMed
  94. ↵
    1. A. J. O’Toole,
    2. T. Price,
    3. T. Vetter,
    4. J. C. Bartlett,
    5. V. Blanz
    , 3D shape and 2D surface textures of human faces: The role of “averages” in attractiveness and age. Image Vis. Comput. 18, 9–19 (1999).
    OpenUrl
  95. ↵
    1. R. Reber,
    2. N. Schwarz,
    3. P. Winkielman
    , Processing fluency and aesthetic pleasure: Is beauty in the perceiver’s processing experience? Pers. Soc. Psychol. Rev. 8, 364–382 (2004).
    OpenUrlCrossRefPubMed
  96. ↵
    1. N. Chater,
    2. P. Vitányi
    , Simplicity, A unifying principle in cognitive science? Trends Cognit. Sci. 7, 19–22 (2003).
    OpenUrlCrossRefPubMed
  97. ↵
    1. A. Westbrook,
    2. T. S. Braver
    , Cognitive effort: A neuroeconomic approach. Cogn. Affect Behav. Neurosci. 15, 395–415 (2015).
    OpenUrlCrossRefPubMed
  98. ↵
    1. M. Inzlicht,
    2. A. Shenhav,
    3. C. Y. Olivola
    , The effort paradox: Effort is both costly and valued. Trends Cogn. Sci. 22, 337–349 (2018).
    OpenUrlCrossRefPubMed
  99. ↵
    1. N. N. Oosterhof,
    2. A. Todorov
    , The functional basis of face evaluation. Proc. Natl. Acad. Sci. U.S.A. 105, 11087–11092 (2008).
    OpenUrlAbstract/FREE Full Text
  100. ↵
    1. S. Denison,
    2. M. Mack,
    3. Y. Xu,
    4. B. C. Armstrong
    1. C. K. Ryali,
    2. X. Wang,
    3. A. J. Yu
    , “Leveraging computer vision face representation to understand human face representation” in Proceedings of the 42nd Annual Meeting of the Cognitive Science Society, S. Denison, M. Mack, Y. Xu, B. C. Armstrong, Eds. (Cognitive Science Society, Austin, TX, 2020), pp. 1080–1086.
  101. ↵
    1. C. K. Ryali,
    2. S. Goffin,
    3. P. Winkielman,
    4. A. J. Yu
    , From likely to likable: The role of statistical typicality in human social assessment of faces. Open Science Framework (OSF). https://osf.io/h54eu/. Deposited 11 August 2020.
PreviousNext
Back to top
Article Alerts
Email Article

Thank you for your interest in spreading the word on PNAS.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
From likely to likable: The role of statistical typicality in human social assessment of faces
(Your Name) has sent you a message from PNAS
(Your Name) thought you would like to see the PNAS web site.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Citation Tools
From likely to likable: The role of statistical typicality in human social assessment of faces
Chaitanya K. Ryali, Stanny Goffin, Piotr Winkielman, Angela J. Yu
Proceedings of the National Academy of Sciences Nov 2020, 117 (47) 29371-29380; DOI: 10.1073/pnas.1912343117

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Request Permissions
Share
From likely to likable: The role of statistical typicality in human social assessment of faces
Chaitanya K. Ryali, Stanny Goffin, Piotr Winkielman, Angela J. Yu
Proceedings of the National Academy of Sciences Nov 2020, 117 (47) 29371-29380; DOI: 10.1073/pnas.1912343117
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Mendeley logo Mendeley

Article Classifications

  • Social Sciences
  • Psychological and Cognitive Sciences
  • Biological Sciences
  • Neuroscience
Proceedings of the National Academy of Sciences: 117 (47)
Table of Contents

Submit

Sign up for Article Alerts

Jump to section

  • Article
    • Abstract
    • Results
    • Discussion
    • Materials and Methods
    • Data Availability.
    • Acknowledgments
    • Footnotes
    • References
  • Figures & SI
  • Info & Metrics
  • PDF

You May Also be Interested in

Setting sun over a sun-baked dirt landscape
Core Concept: Popular integrated assessment climate policy models have key caveats
Better explicating the strengths and shortcomings of these models will help refine projections and improve transparency in the years ahead.
Image credit: Witsawat.S.
Model of the Amazon forest
News Feature: A sea in the Amazon
Did the Caribbean sweep into the western Amazon millions of years ago, shaping the region’s rich biodiversity?
Image credit: Tacio Cordeiro Bicudo (University of São Paulo, São Paulo, Brazil), Victor Sacek (University of São Paulo, São Paulo, Brazil), and Lucy Reading-Ikkanda (artist).
Syrian archaeological site
Journal Club: In Mesopotamia, early cities may have faltered before climate-driven collapse
Settlements 4,200 years ago may have suffered from overpopulation before drought and lower temperatures ultimately made them unsustainable.
Image credit: Andrea Ricci.
Steamboat Geyser eruption.
Eruption of Steamboat Geyser
Mara Reed and Michael Manga explore why Yellowstone's Steamboat Geyser resumed erupting in 2018.
Listen
Past PodcastsSubscribe
Birds nestling on tree branches
Parent–offspring conflict in songbird fledging
Some songbird parents might improve their own fitness by manipulating their offspring into leaving the nest early, at the cost of fledgling survival, a study finds.
Image credit: Gil Eckrich (photographer).

Similar Articles

Site Logo
Powered by HighWire
  • Submit Manuscript
  • Twitter
  • Facebook
  • RSS Feeds
  • Email Alerts

Articles

  • Current Issue
  • Special Feature Articles – Most Recent
  • List of Issues

PNAS Portals

  • Anthropology
  • Chemistry
  • Classics
  • Front Matter
  • Physics
  • Sustainability Science
  • Teaching Resources

Information

  • Authors
  • Editorial Board
  • Reviewers
  • Subscribers
  • Librarians
  • Press
  • Site Map
  • PNAS Updates
  • FAQs
  • Accessibility Statement
  • Rights & Permissions
  • About
  • Contact

Feedback    Privacy/Legal

Copyright © 2021 National Academy of Sciences. Online ISSN 1091-6490