New Research In
Physical Sciences
Social Sciences
Featured Portals
Articles by Topic
Biological Sciences
Featured Portals
Articles by Topic
- Agricultural Sciences
- Anthropology
- Applied Biological Sciences
- Biochemistry
- Biophysics and Computational Biology
- Cell Biology
- Developmental Biology
- Ecology
- Environmental Sciences
- Evolution
- Genetics
- Immunology and Inflammation
- Medical Sciences
- Microbiology
- Neuroscience
- Pharmacology
- Physiology
- Plant Biology
- Population Biology
- Psychological and Cognitive Sciences
- Sustainability Science
- Systems Biology
Local conformal autoencoder for standardized data coordinates
Contributed by Ronald R. Coifman, September 20, 2020 (sent for review July 14, 2020; reviewed by Richard Baraniuk and Guillermo Sapiro)

Significance
A fundamental issue in empirical science is the ability to calibrate between different types of measurements/observations of the same phenomenon. This naturally suggests the selection of canonical variables, in the spirit of principal components, to enable matching/calibration among different observation modalities/instruments. We develop a method for extracting standardized, nonlinear, intrinsic coordinates from measured data, leading to a generalized isometric embedding of the observations. This is achieved through a local burst data acquisition strategy that allows us to capture the local z-scored structure. We implement this method using a local conformal autoencoder architecture and illustrate it computationally. The proposed embedding is fast, parallelizable, easy to implement using existing open-source neural network implementations and exhibits surprising interpolation and extrapolation capabilities.
Abstract
We propose a local conformal autoencoder (LOCA) for standardized data coordinates. LOCA is a deep learning-based method for obtaining standardized data coordinates from scientific measurements. Data observations are modeled as samples from an unknown, nonlinear deformation of an underlying Riemannian manifold, which is parametrized by a few normalized, latent variables. We assume a repeated measurement sampling strategy, common in scientific measurements, and present a method for learning an embedding in
Footnotes
↵1E.P. and O.L. contributed equally to this work.
- ↵2To whom correspondence may be addressed. Email: coifman-ronald{at}yale.edu.
Author contributions: O.L., M.G., I.G.K., and R.R.C. designed research; E.P., O.L., F.D., and T.B. performed research; E.P., O.L., and F.D. contributed new reagents/analytic tools; E.P., O.L., F.D., and T.B. analyzed data; and E.P., O.L., M.G., I.G.K., and R.R.C. wrote the paper.
Reviewers: R.B., Rice University; and G.S., Duke University.
The authors declare no competing interest.
This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.2014627117/-/DCSupplemental.
Data Availability.
The code and data supplement are available online at the Stanford Digital Repository (https://purl.stanford.edu/zt044bg9296).
- Copyright © 2020 the Author(s). Published by PNAS.
This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND).
Citation Manager Formats
Sign up for Article Alerts
Article Classifications
- Physical Sciences
- Applied Mathematics