Essential auditory contrast-sharpening is preneuronal
- Institute of Neuroinformatics, University/Eidgenössische Technische Hochschule Zürich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
-
Edited by Charles S. Peskin, New York University, New York, NY (received for review December 18, 2003)
Abstract
Contrast-sharpening is a fundamental feature of mammalian sensory perception. Whereas visual contrast-sharpening has been fully understood in terms of the retinal neuronal wiring [DeVries, S. H. & Baylor, D. A. (1993) Cell 72, Suppl., 139–149], a corresponding explanation of auditory contrast-sharpening is still lacking. Here, we show that the essentials of auditory contrast-sharpening can be explained by using cochlear biophysics. This finding indicates that the phenomenon is basically of preneuronal origin.
Sound impinging on the mammalian ear is transduced into neuronal excitation patterns by means of the cochlea. In 1863, H. L. F. Helmholtz (1) proposed the tonotopic principle to explain the cochlea. According to this principle, there is a mapping between the frequency of an incoming tone and the location of maximal response within the cochlea (termed the characteristic place), analogous to the arrangement of strings on a piano. This linear theory, however, fails to explain important cochlear hearing phenomena, the most prominent being two-tone suppression (2), the principal mechanism underlying auditory contrast-sharpening (3). When two or more tones are presented simultaneously to the cochlea, attenuated responses of the individual tones are evoked, and only the main contributions survive. Understanding the mechanism of two-tone suppression is of both theoretical and practical interest, e.g., for the development of noncontextual measures of hearing loss and the optimization of hearing implants. Here we show that a detailed qualitative and quantitative description of two-tone suppression can be furnished by using a Hopf-type cochlea model.
It is generally assumed that two-tone suppression is based on active hearing processes, i.e., on the ability of the cochlea
to selectively amplify its response. Recently, we proposed a biomorphic model of the cochlea that uses subcritical Hopf amplifiers
(4) to implement the active processes. For a Hopf system, the steady-state response, R, to a periodic forcing of amplitude, F, and frequency, ω, is determined by a cubic equation in R
2 (5):
In Eq. 1, ω0 is the frequency at which the optimal response is measured, at a given place. This incorporates the relationship between
place and frequency, as given by the tonotopic principle. The control parameter μ measures the distance from the Hopf bifurcation
point μ = 0, where the system starts to generate self-oscillations. To describe the amplification of interacting tones, we
expanded the response around their noninteracting states and found that their respective active amplifications are also described
by Eq. 1. The presence of two tones of amplitudes, F
t and F
s, however, is now reflected in a pair of effective Hopf parameters:
These expressions display the mutual relationship between test Ft and suppressor F
s tones.
In our biomorphic cochlea model, all active contributions R
t,s are locally injected into the passive cochlea, whose behavior is governed by the laws of hydrodynamics. The resulting system
can be described by a differential equation for the energy density e(x, ω), related to the basilar membrane (BM) displacements by
where E(x) is the exponentially decaying BM stiffness (4). Realistic responses, at biophysical parameter values, are only obtained for μ < 0, i.e., if noncritical tuning is used.
A great advantage here is that a single free parameter in the cochlea differential equation suffices to determine the gain
and the width of the compressive nonlinear regime. As is shown in Fig. 1, responses obtained from our model closely match physiological measurements of two-tone suppression.
High-side suppression (ωs > ωt). (a) Model response at resonance. Suppressor intensities from 10 to 110 dB, in steps of 10 dB. The 10-, 20-, and 30-dB lines almost coincide. The dashed line separates the onset from the intermediate suppressor regime; the dashed–dotted line separates the intermediate from the compressive-nonlinear suppressor regime (ωt/2π = 0.9 kHz, ωs/2π = 1.0 kHz). (b) Laser velocimetry measurements of high-side suppression [chinchilla (3), ωt/2π = 8 kHz]. [b is reproduced with permission from ref. 3 (Copyright 1992, Am. Physiol. Soc.).]
Our approach provides an explanation for the mechanism underlying two-tone suppression. For instance, each curve shown in Fig. 1 indicates that the BM responds with the characteristic Hopf behavior described by Eq. 1, up to relatively high suppressor intensities. To make this observation more transparent, notice that the BM response A t,s corresponds to the Hopf response R t,s, whereas the stimulation intensity I t,s corresponds to the squared forcing F 2 t,s. Linear regimes of the test tone input–output function, obtained for weak stimulation, always end up in the compressive nonlinear behavior characteristic of mammalian hearing. This scenario ends in the passive hydrodynamic behavior obtained for very high stimulation intensities. The equivalent situation emerges when a fixed low-intensity test tone is combined with a suppressor of increased intensity, because of the duality between the test and the suppressor tone.
Our biomorphically motivated modeling also allows the quantitative extraction of the scaling in the different regimes of suppressor
efficacy, as well as the derivation of the slow onset of suppression. To show this, the correspondence between the cochlea
and the Hopf amplifier response needs to be made rigorous, starting from the cochlea differential equation. The detailed analytical
derivation of two-tone suppression scaling proceeds along the following lines: Our Hopf cochlea initial value problem has
the form (4)
where d is the dissipation rate, which is counteracted by the power a delivered by the active process. v
G is the group velocity. The origin of this equation is in the steady-state energy balance (6) between dissipation and active amplification. The initial energy density e(x = 0, ω) is derived from the steady-state conditions at the base of the cochlea. There, the rate at which the wave energy
exits from the base is equal to the average incoming acoustic power, P ∼ I (where I in W·m-2 is the sound intensity). From this we obtain
For a detailed biophysical derivation of the model, where it is also shown how second-order couplings (7) fine-tune the cochlear response shape, see ref. 4.
Because of the mutual role between the test and the suppressor tone, Eq. 2 provides the key mechanism underlying high-side as well as low-side cochlear suppression. This mechanism implies that the discussion of high-side suppression provided below can be applied analogously to low-side suppression. Only the phenomenon of phasic suppression (3, 8), which is of a different (temporal periodic modulation) nature, is not covered in our discussion.
For the derivation of the scaling laws of high-side suppression, we identify the BM-response, A
t,s, with the Hopf response, R
t,s, and the stimulation intensity, I
t,s, with the square of the forcing,
. In the following discussion, we focus on the linear test-tone regime. There,
always holds, and the identification A
t,s ∼ R
t,s can be made rigorous by referring back to the formal solution of the cochlea equation. For the test tone,
then corresponds to the response R
t. For weak stimulation F
t, we find
The suppression change, Δs, caused by increasing the stimulation from a value F
s(1) to F
s(2), is
Choosing F
s(1) = 0 (no suppressor stimulation), we obtain the onset behavior
For weak stimulations, F
s, we have
, where I
s is the intensity of suppressor stimulation. This explains the slow departure of the curves from zero suppression. As soon
as
(intermediate regime), we have
where the last approximation can be justified in the intermediate, suppressor-linear regime only. The result explains the
large constant distances between the response curves. When F
s is increased further, the suppressor enters its compressive nonlinear regime. This leads to
, resulting in
Thus, the distances between adjacent lines in the compressive nonlinear regime are reduced by a factor of three, compatible
with the physiological results. In Fig. 2, the obtained results have been collected, demonstrating the close agreement between suppression derived analytically, simulation
results, and physiological measurements.
Results from the nonphasic low-side suppression regime match equally well with the experimental (9) data and further corroborate our approach. Emergent suppression rates, which are larger than those obtained from high-side suppression, can also be fully explained by using arguments parallel to those above. To this end, note that the frequency response curve is asymmetrical in shape, with a noncompressive low-frequency tail. In low-side suppression experiments, the difference between test tone and suppressor frequency can be, and usually is, chosen much larger than for the high-side case (e.g., 10 kHz and 500 Hz). Therefore, at the characteristic place of the test tone, the suppressor response R s will be approximately linear in F s. The larger suppression growth rates observed in low-side suppression experiments are the immediate consequence of this absence of a compressive nonlinear regime.
We have qualitatively and quantitatively demonstrated that the essentials of two-tone suppression emerge from the interaction between the active elements of the cochlea. Two-tone suppression is considered the main underlying mechanism of auditory signal sharpening (2, 3, 10), although higher, neuronal processing levels also contribute (11). Therefore, our contribution implies that auditory contrast sharpening essentially emerges from cochlear, i.e., preneuronal, Hopf-amplifier biophysics.
The close correspondence between our biomorphically motivated model and the mammalian cochlea can also be used for the detailed simulation of hearing defects, providing a basis for quantitative noncontextual measures of cochlear hearing damage.
Acknowledgments
We thank M. Magnasco, T. Kohda, W. Schwarz, S. N. Fry, and S. Launer for helpful comments, and the Swiss Kommission für Technologie und Innovation and Phonak Hearing Systems for continued support.
Footnotes
-
↵ * To whom correspondence should be addressed. E-mail: ruedi{at}ini.phys.ethz.ch.
-
This paper was submitted directly (Track II) to the PNAS office.
-
Abbreviation: BM, basilar membrane.
- Copyright © 2004, The National Academy of Sciences







