Fact-checked by Grok 2 weeks ago

Auditory masking

Auditory masking is the psychoacoustic phenomenon in which the presence of one , known as the masker, elevates the detection of another , called the signal or , thereby impairing its . This effect arises from the auditory system's limited capacity to process multiple sounds simultaneously and is a cornerstone of understanding human hearing. Masking is broadly categorized into energetic and informational types. Energetic masking occurs due to the spectral and temporal overlap of the masker and signal energies, primarily at the peripheral level of the , such as in the . In contrast, informational masking results from central auditory processing challenges, including perceptual uncertainty, similarity between sounds, or difficulty in segregating the target from the masker in complex acoustic scenes. Additionally, masking can be simultaneous, when the signal and masker coincide in time, or temporal, encompassing forward masking (where the masker precedes the signal by a short ) and (where the masker follows the signal). Temporal masking effects typically decay within 100-200 milliseconds after the masker offset. Historically, auditory masking was first systematically studied in the early , with seminal experiments by Wegel and Lane demonstrating how one masks another based on proximity. These investigations revealed the frequency-selective nature of auditory filters, modeled as bandpass filters with bandwidths of about 10-15% of their , as measured via methods like psychophysical tuning curves and notched-noise masking. In practical applications, auditory masking principles underpin perceptual audio coding schemes, such as those in compression, where inaudible components masked by louder sounds are discarded to reduce file sizes without compromising perceived quality. Similarly, in hearing aids, techniques like auditory masked threshold noise suppression exploit masking to enhance speech clarity by attenuating background noise that would otherwise obscure target sounds.

Fundamentals

Definition and Types

Auditory masking refers to the phenomenon in which the presence of a masker elevates the detection of a target , rendering the target inaudible or more difficult to perceive. This occurs when the struggles to distinguish the target amid competing acoustic energy or perceptual complexity. The masked represents the minimum intensity level at which the target becomes detectable in the presence of the masker. The systematic study of auditory masking began in the early with the work of Wegel and , who in 1924 examined how one masks another and linked it to dynamics. This laid foundational insights into masking mechanisms. The concept was further formalized in by and Munson in 1937, who established key relationships between sound and masking effects through experimental measurements. Auditory masking is primarily divided into energetic masking and informational masking. Energetic masking involves peripheral interference within the auditory periphery, particularly overlapping excitation patterns along the basilar membrane in the cochlea, where the masker's energy suppresses neural responses to the target. A representative example is broadband noise masking a faint pure tone in an otherwise quiet environment, as the noise overwhelms cochlear detection of the tone's frequency-specific vibrations. Informational masking, by contrast, arises from central auditory processing limitations, where perceptual uncertainty or similarity between the target and maskers overloads higher-level interpretation, even without significant energy overlap. This type was first articulated by Pollack in 1975 to describe masking beyond peripheral energetics. An illustrative case is the scenario, where multiple simultaneous talkers in a crowded room hinder comprehension of a single speaker due to competing linguistic and attentional demands, despite individual words remaining acoustically detectable.

Masked Threshold

The masked threshold is defined as the minimum level at which a becomes detectable in the presence of a masking , representing the point where the is just audible despite the interfering stimulus. This concept, first quantitatively explored in early psychophysical studies, quantifies how the masker elevates the detection of the compared to quiet conditions. Masked thresholds are typically measured using psychophysical procedures that minimize bias and ensure reliable estimates of detectability. Common methods include adaptive tracking paradigms, such as the two-interval forced-choice (2IFC) task, where listeners identify which of two intervals contains the target amid the masker, with stimulus levels adjusted dynamically to converge on the 70.7% correct performance point. These techniques, refined over decades, allow for efficient threshold estimation by focusing trials near the detection boundary and accounting for listener response variability. The extent of masking is often expressed as the threshold shift, calculated as \text{threshold shift} = 10 \log_{10} \left( \frac{I_{\text{masked}}}{I_{\text{quiet}}} \right), where I_{\text{masked}} is the of the at threshold with the masker present, and I_{\text{quiet}} is the at the without masking; this yields the threshold shift in decibels (). For tonal maskers close in to the , typical shifts range from 10 to 20 , depending on the masker-target separation and overall stimulus levels. Several factors the masked , including the overall level of the masker, which generally produces greater as increases to enhanced peripheral overlap. Target duration also plays a role, with shorter signals yielding higher thresholds because of limited temporal in the . Additionally, listener variability contributes to differences in masked thresholds, arising from individual differences in cochlear mechanics and attentional factors. Both energetic masking, from overlapping patterns, and informational masking, from perceptual uncertainty, can contribute to this .

Simultaneous Masking

Critical Bandwidth

The critical bandwidth (CBW) represents the frequency range over which sounds interact most effectively in the , arising from the overlapping excitation patterns produced by those sounds on the basilar membrane. Within this bandwidth, a masker or elevates the detection of a target sound more substantially than when the sounds are separated by larger intervals, reflecting the limited of the cochlea's filtering. This underpins simultaneous masking, as from the masker spreads across the basilar membrane, interfering with the neural representation of nearby frequencies. In the , Eberhard Zwicker formalized the CBW in his model of auditory processing, defining it as the approximate width of an auditory filter centered at a given , which aligns closely with the equivalent rectangular (ERB) of these filters. Zwicker's framework subdivided the audible into 24 critical bands, each corresponding to regions of integrated auditory processing where summation and masking effects occur nonlinearly. This model built on earlier psychophysical observations but provided a quantitative basis for understanding how the ear groups frequencies into perceptually equivalent units. The ERB, as a measure of filter bandwidth, is approximated by the formula \text{ERB}(f) \approx 24.7 + 0.108f where f is the center frequency in Hz; for instance, at a center frequency of 1 kHz, the ERB is approximately 132.7 Hz, indicating the span over which masking interactions are prominent. This linear approximation captures the filter's broadening with increasing frequency, mimicking the anatomical gradient along the basilar membrane. More refined polynomial forms exist, but this equation suffices for estimating bandwidths in the typical speech range of 100 Hz to 8 kHz. CBW values are empirically derived from masking experiments, in which the threshold shift for a target tone is measured as a function of its separation from a masker; masking reaches its peak when the target and masker fall within the same CBW, beyond which the effect diminishes sharply. Zwicker employed techniques such as masking a noise with flanking tones to delineate these boundaries, confirming that CBW increases from about 100 Hz at low frequencies to over 3 kHz at high frequencies. These measurements validate the CBW as a functional equivalent to the auditory filter's , essential for modeling selectivity without direct physiological access.

Frequency and Intensity Effects

In simultaneous auditory masking, the strength of masking depends critically on the frequency relationship between the masker and the target signal. When the masker and target frequencies are similar—termed on-frequency masking—the masking is strongest because the masker's energy is concentrated within the same auditory filter centered on the target's . In contrast, masking weakens as the frequency separation increases, reflecting the auditory system's selectivity, where dissimilar frequencies engage different filters with less overlap. This is asymmetric: a masker at a lower than the target produces greater masking at higher frequencies—a phenomenon known as upward spread of masking—due to the broader spread of excitation toward higher frequencies along the basilar membrane. Conversely, downward spread from higher-frequency maskers to lower targets is more limited, resulting in weaker masking. The intensity of the masker also modulates masking strength in a non-linear fashion. Near the , the masked typically shifts by approximately 3 dB for every 10 dB increase in masker intensity, indicating compressive that tempers the of masking compared to a linear expectation. At low intensities close to , masking follows a more linear pattern, but as masker levels rise, cochlear reduces the relative impact, leading to shallower slopes in the growth-of-masking function—often around 0.3 to 0.5 for off-frequency maskers. This non-linearity arises from the active amplification and saturation mechanisms in the , which alter the effective excitation patterns at higher levels. Listeners can mitigate masking through off-frequency listening, where they attend to spectral sidebands of the target in adjacent auditory rather than the on-frequency component, thereby reducing the effective masking from the primary 's output. This strategy is particularly effective when the masker is off-frequency, as it exploits the asymmetric shape of auditory , which have steeper slopes on the high-frequency side. shapes are often modeled using the rounded exponential (roex) function, capturing the gradual and enabling quantitative predictions of masking reduction. Experimental studies, such as those employing notched-noise maskers, have generated masking curves as functions of frequency offset and intensity, revealing peak masking near zero offset that broadens and asymmetry increases with higher intensities—key data from Patterson and Moore's work on .

Temporal Masking

Forward Masking

Forward masking, also known as pre-masking, refers to the phenomenon where the detection threshold of a (probe) is elevated due to a preceding masker that ends shortly before the probe's onset, typically within a gap of 0 to 200 ms. This temporal separation distinguishes forward masking from simultaneous masking, where the masker and target overlap in time, highlighting sequential processing effects in the . The time course of forward masking exhibits an exponential decay following the masker's offset, with the masking effect diminishing as the interstimulus interval (time between masker offset and probe onset) increases. This recovery process has a time constant of approximately 50 ms at high frequencies, reflecting the auditory system's temporal resolution limits. The masking depth can be modeled as M(t) \approx A \cdot e^{-t / \tau}, where M(t) is the masking depth in dB, A is the initial masking depth, t is the interstimulus interval, and \tau is the time constant typically around 50-100 ms depending on frequency and intensity. Several factors influence the magnitude of forward masking, including masker intensity and proximity to the probe; masking is stronger for higher masker levels and when the masker and probe share similar (on-frequency conditions). Physiologically, forward masking is associated with cochlear adaptation, where the basilar membrane's response to the masker persists, and neural recovery processes in the auditory nerve, which temporarily reduce responsiveness to subsequent stimuli. In psychoacoustic research, forward masking serves as a tool to investigate the temporal properties of auditory filters, providing insights into how the integrates over time and aiding the development of models for auditory processing.

Backward Masking

Backward masking, also known as post-masking, occurs when the perception of a is impaired by a subsequent masker that begins shortly after the target's offset, typically within 0-50 , leading to elevated detection thresholds for the target. This phenomenon arises because the integrates sensory information over a brief temporal , allowing the masker to interfere with the processing of the preceding signal. Unlike simultaneous masking, backward masking highlights the limits of in the auditory pathway, where the masker disrupts the neural representation of the target before it is fully encoded. The time course of is relatively short, with maximal effects observed at zero delay between offset and masker onset, decaying rapidly to negligible levels by 20-50 post-target. For brief s like 10- tones, masking thresholds can increase by approximately 20-35 at onset, but this effect diminishes with longer delays or durations exceeding 20 . This brevity contrasts with forward masking, which persists longer due to slower neural recovery processes, illustrating an asymmetry in temporal masking where backward effects are weaker overall but peak more sharply. Mechanisms underlying backward masking involve both peripheral and central auditory processing, including temporal smearing of neural responses and interference within an integration window of approximately 200 ms for tonal signals. At the periphery, the masker may alter basilar vibrations lingering from the target, while centrally, it suppresses signal-evoked activity in the , reducing and firing rates. This disruption is more pronounced for short-duration targets, as the relies on to build a robust percept, which the masker interrupts. Experimental findings, such as those by Zwicker, demonstrate peak at zero delay, with overshoots up to 13 dB for a 2-ms, 5000-Hz masked by , decreasing with spectral similarity between target and masker. Data from Zwicker and Fastl further show that exhibits an asymmetric profile compared to forward masking, with effects confined to shorter intervals and lower magnitudes, emphasizing the auditory system's forward-biased temporal processing. Factors enhancing backward masking include greater intensity differences and frequency similarity between target and masker, though these influences are less potent than in forward masking. For instance, masker levels of 50-90 SPL can elevate thresholds by 4-8 more at close frequencies, but dissimilar spectra (e.g., vs. ) yield stronger initial overshoots. Brief target durations amplify the effect, underscoring the role of temporal integration limits.

Other Masking Phenomena

Informational Masking

Informational masking refers to the degradation in the detection, discrimination, or recognition of a target sound caused by competing sounds, where the interference arises not from the energetic overlap of their spectra or temporal envelopes at the auditory , but from higher-level perceptual, cognitive, or attentional processes that introduce and difficulty in segregating the target. This form of masking was first introduced by Pollack in 1975 as a distinct phenomenon beyond traditional energetic masking. Unlike energetic masking observed in simple tone-in-noise scenarios, informational masking involves complex auditory scenes where the maskers create confusion through similarity or perceptual grouping errors, diverting attention from the target. Key components of informational masking include the difficulty in auditory stream segregation when multiple similar sound sources are present, leading to confusion in identifying which sounds belong to the target versus the maskers, and failures in perceptual grouping that exacerbate uncertainty. For instance, in the cocktail party effect, listeners struggle to focus on one conversation amid multiple talkers due to overlapping acoustic cues and attentional demands, resulting in informational interference that hinders speech understanding. Similarity-based confusion is particularly pronounced when maskers share spectral or temporal characteristics with the target, such as in speech-on-speech masking where concurrent talkers with comparable voices or prosody increase the required for segregation. These elements highlight how informational masking operates at central auditory processing stages, relying on linguistic and contextual cues rather than mere energy summation. Informational masking is typically measured through psychophysical tasks involving detection thresholds or speech in multi-source environments, often isolating its contribution by subtracting energetic masking estimates, such as those derived from time-frequency (ITFS) models that compute the maximum possible signal glimpses available after removing masker energy overlap. In multi-talker scenarios, informational masking can elevate thresholds by 10-20 or more beyond energetic components alone, as demonstrated in experiments where speech maskers cause significantly greater impairment than of equivalent energy. For example, speech intelligibility can drop to near zero with just a few competing talkers, reflecting losses up to 30 compared to 4 for maskers at detection threshold. Theoretical frameworks for informational masking, such as that proposed by Kidd et al. in 2023, emphasize the role of in handling auditory complexity, integrating perceptual with attentional selection mechanisms to explain masking release through cues like spatial separation or glimpsing opportunities in the signal. Factors influencing its magnitude include the acoustic similarity between target and maskers, which amplifies confusion, and linguistic familiarity, where maskers in an unfamiliar reduce by minimizing semantic overlap. Informational masking is especially prominent in speech-on-speech masking, where it accounts for much of the difficulty in noisy social settings, but can be mitigated by spatial cues or brief temporal gaps that aid . Seminal work by Neff in 1991 further linked informational masking to auditory , showing how divided focus in multi-tone identification tasks increases masking independent of energy overlap. Recent advances as of 2025 include brain-inspired algorithms that improve listening by isolating talkers using narrow spatial tuning, demonstrating practical applications in enhancing speech separation in noisy environments.

Binaural Masking

masking refers to the alteration in the detection of a when the masker and exhibit differences in interaural time differences (ITD) or interaural level differences (ILD) across the two ears. This phenomenon arises because the processes cues to enhance signal detection in noisy environments, particularly when the and masker are spatially separated. Unlike masking, masking leverages interaural disparities to reduce the effective masking, improving the perceived by the listener. The masking level difference (MLD), first described by Hirsh in 1948, quantifies the binaural advantage as the improvement in detection threshold when the target signal is presented out-of-phase across the ears relative to the in-phase masker noise. Typically, MLD values range from 10 to 15 dB for normal-hearing listeners, representing a substantial reduction in the required signal intensity for detection. The MLD is calculated using the formula: \text{MLD} = 10 \log_{10} \left( \frac{I_{\text{monaural threshold}}}{I_{\text{binaural threshold}}} \right) where I denotes the signal intensity at threshold, with the monaural condition corresponding to the homophasic (S_0N_0) configuration and the binaural to the antiphasic (S_{\pi}N_0). This effect is maximal for low-frequency tones in correlated broadband noise, as the auditory system exploits phase differences to segregate the target. Binaural masking release (BMLR) describes the overall improvement in signal detection due to these spatial cues, enabling better performance in complex acoustic scenes. A related phenomenon, the squelch effect, provides an additional binaural benefit of approximately 5-10 dB in speech understanding for hearing-impaired individuals, particularly those with bilateral cochlear implants, by suppressing noise through perceived source separation. MLD and BMLR are frequency-dependent, with the strongest effects below 1.5 kHz where ITD cues dominate, and diminish at higher frequencies where ILD becomes more relevant. These mechanisms are particularly valuable in reverberant noise environments, such as classrooms or auditoriums, where spatial separation aids in overcoming masking from reflections. As of 2025, research has advanced in characterizing the maturation of the binaural intelligibility level difference (BILD) in school-age children, aiding clinical assessments of central auditory development.

Physiological Mechanisms

Peripheral Mechanisms

Auditory masking at the peripheral level primarily arises from processes in the , where the basilar membrane's frequency-specific tuning leads to interference between sounds. The basilar membrane decomposes incoming sounds into frequency components along its tonotopic organization, with each location responding maximally to a characteristic frequency. When a masker and target sound stimulate overlapping regions of the basilar membrane, the masker's excitation can saturate the mechanical response, reducing the basilar membrane's vibration amplitude to the target by approximately 1 dB per dB increase in masker level near the characteristic frequency. This saturation occurs due to compressive nonlinearity in the , limiting the and diminishing the target's neural representation. Suppression mechanisms further contribute to peripheral masking, particularly in two-tone interactions where one sound reduces the response to another. In two-tone suppression, a suppressor decreases the basilar membrane vibration and auditory nerve firing rate to a probe , with the effect most pronounced for off-frequency suppressors (e.g., an below the probe's characteristic frequency), shifting the rate-level and elevating thresholds by 5-26 without altering discharge rates. This suppression grows faster for lower-frequency tones and originates from nonlinear cochlear . plays a key role in forward masking, where preceding maskers lead to temporary reduction in sensitivity; while outer electromotility amplifies low-level sounds, in inner s and synaptic processes post-maskers contributes to elevated thresholds, with recovery tied to slower rather than direct outer involvement. At the auditory level, masking manifests as reduced spike rates and increased detection thresholds due to the masker's influence on cochlear output. In cat auditory fibers, simultaneous masking grows at approximately 2 per for masker tones near or slightly above (10%) the fiber's characteristic , reflecting hydromechanical spread of excitation that broadens the effective and overlaps with target responses. Post-masker thresholds rise due to this spread and , with maskers suppressing tone-evoked spikes by competing for neural resources in overlapping channels. Electrophysiological from animal models, such as , confirm these effects trace to early cochlear stages before central . Linear predictive models, such as the gammatone filterbank, simulate these peripheral processes by mimicking basilar membrane filtering with gamma-enveloped tones spaced on the equivalent rectangular bandwidth scale. These models capture masking by convolving inputs with filters that approximate cochlear frequency selectivity, predicting reduced target output in through linear superposition, and have been calibrated to psychoacoustic for tasks like tone-in-noise detection. Such simulations align with animal , reproducing suppression and spread without nonlinear extensions for basic energetic effects. The critical bandwidth, underlying simultaneous masking spread, emerges from the width of these peripheral filters, typically spanning 100-300 Hz at low frequencies. However, peripheral mechanisms primarily explain energetic masking, where physical overlap limits signal representation in the and auditory nerve, but fail to account for informational masking involving perceptual uncertainty at higher levels.

Central Mechanisms

Central mechanisms of auditory masking involve neural processing in the , , and , where incoming signals from the are integrated, modulated, and interpreted to influence in noisy environments. In the and , the serves as a critical hub for integrating temporal cues, with neurons showing sound-evoked inhibition that underlies forward masking by suppressing responses to subsequent targets. Efferent feedback from the medial olivocochlear system further modulates these effects by dynamically adjusting cochlear gain, as demonstrated in a 2024 subcortical that incorporates efferent control of cochlear gain to predict forward masking. This feedback mechanism helps mitigate masking by enhancing neural recovery times, particularly for brief temporal separations between masker and target. At the cortical level, the contributes to informational masking through attention-dependent processes and auditory scene analysis, where competing sounds interfere with target beyond simple energetic overlap. Functional MRI reveals reduced neural of target speech in masked conditions, with attentional focus modulating activity in primary and non-primary auditory areas to prioritize relevant streams while suppressing distractors. Subcortical delays in processing masked speech, observed via electrophysiological measures, further support central involvement, as responses become progressively smaller and delayed with increasing masking severity, reflecting integration lags that propagate to cortical stages. Recent advances highlight objective assessment and modeling of these central processes. Auditory steady-state response (ASSR) techniques enable non-invasive evaluation of masking effects, with a 2025 pilot study exploring the use of ASSR to assess auditory masking thresholds non-invasively, offering a tool to probe central temporal integration without behavioral reliance. Computational models incorporating efferent dynamics have improved predictions of forward masking, aligning simulated neural responses with human by simulating brainstem-to-cortex pathways. Central mechanisms also account for asymmetries in masking, such as the persistence of despite peripheral recovery, due to central overwriting of target traces in short-term rather than ongoing cochlear adaptation. In complex acoustic environments, these processes amplify peripheral excitation patterns through gain control and attentional biasing, enhancing target detection while exacerbating interference from informational maskers like fluctuating speech.

Applications

Audio Compression and Psychoacoustics

Auditory masking plays a central role in perceptual audio coding, where psychoacoustic models leverage masking thresholds to compress audio signals by quantizing components below human perception thresholds. In codecs such as (MPEG-1 Audio Layer III), developed in the , these models identify inaudible spectral components masked by dominant tones or noise, allowing for aggressive quantization without perceptible distortion. This approach exploits the ear's insensitivity to signals near masking sounds, enabling significant data reduction while preserving subjective audio quality. Simultaneous masking is particularly exploited in these systems through the concept of critical bands, which divide the into regions analogous to the basilar membrane's filtering. Bit allocation algorithms assign fewer bits to masked subbands, prioritizing audible elements and hiding quantization within the masking threshold. Temporal masking predictions further mitigate artifacts like pre-echoes, where a strong signal masks subsequent weaker ones, by adjusting block sizes in the (MDCT) to align with auditory integration times. ISO/MPEG standards incorporate spreading functions to model frequency masking, simulating how masking energy spreads across critical bands via excitation patterns derived from psychophysical data. For instance, these models enable encoding at bit rates as low as 128 kbps with minimal perceptible loss for stereo music, as validated in listening tests showing transparency thresholds comparable to uncompressed CD audio. Modern codecs, such as Advanced Audio Coding (AAC) and Opus, build on these principles with refined psychoacoustic models incorporating auditory masking to achieve transparent quality at even lower bit rates, often below 96 kbps. As of 2025, the LC3 codec in Bluetooth Low Energy (LE) Audio, standardized by the Bluetooth SIG in 2020 and widely implemented in devices by 2024, uses advanced masking thresholds for efficient wireless streaming, supporting features like Auracast for broadcast audio to multiple listeners, including hearing aids, with reduced latency and power consumption. In applications, masking principles inform the tuning of auditory filters for acoustics, where simulated environments replicate masking effects to enhance spatial audio rendering. Models are rigorously tested against detection thresholds in controlled experiments, refining parameters like the masking for better in immersive systems. However, a key limitation is the over-reliance on energetic masking, which often underperforms in complex musical passages where informational masking—arising from perceptual grouping—degrades efficiency and introduces audible artifacts.

Hearing Aids and Clinical Uses

In audiological diagnostics, clinical masking is employed during pure-tone audiometry to isolate thresholds in the test ear by introducing noise to the non-test ear, preventing cross-hearing via bone or air conduction pathways. Masking is required for air conduction when interaural differences exceed 40 dB (or 55 dB with insert earphones) and for bone conduction when air-bone gaps surpass 10 dB or bone conduction in one ear is better by 40 dB than the contralateral air conduction threshold. This technique ensures accurate differentiation between conductive and sensorineural hearing loss, with procedures involving incremental noise levels until a plateau threshold is achieved, typically using narrowband noise at 10-20 dB above the non-test ear's expected threshold. Cochlear dead regions, areas of dysfunctional inner hair cells, are identified through off-frequency listening deficits where signals are detected via adjacent viable regions, leading to elevated masked thresholds. The threshold equalizing noise (TEN) test diagnoses these by presenting pure tones in noise; a dead region is indicated if the masked threshold exceeds the TEN level by at least 10 per equivalent rectangular bandwidth (ERB) and 10 above the , often revealing perceptual distortions like noise-like tones in affected frequencies. Modern hearing aids incorporate noise reduction algorithms based on auditory masking thresholds to suppress background interference while preserving speech cues. These algorithms, such as the generalized minimum mean square error estimator adapted for masking (GMMSE-AMT[ERB]), estimate simultaneous and temporal masking using ERB-scaled filters tailored to hearing loss profiles, reducing noise gain in low-signal-to-noise ratio bands without distorting speech intelligibility. As of 2025, AI-enhanced systems in hearing aids further refine masking-based noise suppression by dynamically adjusting to acoustic scenes, improving speech-in-noise performance. Directional microphones in binaural hearing aids exploit spatial release from masking by enhancing frontal signals and attenuating off-axis noise, providing up to 5-10 dB signal-to-noise ratio improvements in reverberant environments through interaural time and level differences. Bluetooth LE Audio integration allows direct high-quality streaming to aids, leveraging masking models for efficient low-bitrate transmission in applications like Auracast for public announcements. In cochlear implants, channel allocation strategies using n-of-m approaches, such as SNR-based methods like the IdBM (intelligibility-based mask estimation), select the most prominent spectral envelopes from target-dominated channels ( ≥0 dB) over masker-dominated ones to minimize masking effects and provide clearer speech representation. This reduces energetic and informational masking by limiting channel interactions, with studies showing improved sentence recognition in noise by 10-20% compared to non-selective strategies like , particularly when avoiding overlap in competing talkers. Hearing-impaired individuals often face heightened informational masking challenges in noisy environments, where perceptual segregation of speech streams is impaired. Clinical phenomena such as , an exaggerated sound intolerance often co-occurring with hearing impairment, can be managed with low-level masking to desensitize the and reduce discomfort from everyday sounds. Impaired ears exhibit reduced masking release, with hearing-impaired listeners showing only 0-2 benefit in fluctuating versus 4-8 in normal-hearing individuals, due to diminished audibility during dips. Recent applications of auditory steady-state responses (ASSR) in 2025 pediatric assessments provide objective thresholds for infants, incorporating masking for high-intensity stimuli to prevent contralateral and aiding early identification of mild losses in non-responsive children. Sound for tinnitus relief leverages controlled masking with broadband noise generators, delivering partial masking at or slightly above hearing thresholds for 8-10 hours daily to promote and reduce perceived . Integrated into , this approach may provide subjective improvements in for 60-80% of patients over 12-24 months according to some studies, though systematic reviews indicate limited overall of ; often using behind-the-ear devices without reported adverse effects.

References

  1. [1]
    [PDF] 13. Psychoacoustics - UC Davis Math
    It is often demonstrated and measured by study- ing masking, which has been defined as 'The process by which the threshold of audibility for one sound is raised.
  2. [2]
    [PDF] Listening effort elicited by energetic versus informational masking
    Auditory masking typically is considered to comprise both energetic masking (EM, or the result of spectrotemporal overlap between target and masker energy) and.
  3. [3]
    [PDF] ELE 201, Spring 2014 Laboratory No. 5 MP3 Audio Compression 1 ...
    Psychoacoustics: Detect frequencies of interest in the audio and infer frequencies that are masked by others. Based on this, the psychoacoustic model returns a ...
  4. [4]
    [PDF] Evaluation of an auditory masked threshold noise suppression ...
    In this study, our goals were as follows: (1) evaluate the effectiveness of the auditory masked threshold noise suppression (AMT-NS) technique in improving the ...
  5. [5]
    Mechanisms and mechanics of auditory masking - PMC - NIH
    May 15, 2013 · Masking tends to occur when the tonotopic representation of one sound interferes with that of another. The 'classical' masking studies in both ...
  6. [6]
    The Auditory Masking of One Pure Tone by Another and its Probable ...
    The Auditory Masking of One Pure Tone by Another and its Probable Relation to the Dynamics of the Inner Ear. R. L. Wegel and C. E. Lane. Research Laboratories ...
  7. [7]
    Relation Between Loudness and Masking - AIP Publishing
    Harvey Fletcher, W. A. Munson; Relation Between Loudness and Masking, The Journal of the Acoustical Society of America, Volume 9, Issue 1, 1 July 1937, Pages 1–
  8. [8]
    Auditory informational masking - AIP Publishing
    Aug 11, 2005 · By contrast with “energetic” masking, informational masking is defined in terms of the threshold change in statistical structure resulting from ...
  9. [9]
    Auditory Masking - an overview | ScienceDirect Topics
    ... Auditory Masking, defined as the process by which the threshold of hearing one sound is raised by the presence of another (Moore, 2012). Consequently ...
  10. [10]
  11. [11]
    Factors affecting thresholds for sinusoidal signals in narrow‐band ...
    Jul 1, 1987 · When a signal is higher in frequency than a narrow‐band masker, thresholds are lower when the masker envelope fluctuates than when it is ...
  12. [12]
    Effects of masker frequency and duration in forward masking
    The effect of masker duration was found to depend more on signal level than on gap duration or masker level. Overall, the results support the idea that forward ...Missing: variability | Show results with:variability
  13. [13]
    [PDF] Central Masking: Fact or Artifact?
    As such, the current authors call into question previous methods of threshold measurement used in central masking ... Psychophysics, 11(3), 198–202. doi:10.3758/ ...
  14. [14]
    Subdivision of the Audible Frequency Range into Critical Bands ...
    E. Zwicker; Subdivision of the Audible Frequency Range into Critical Bands (Frequenzgruppen), The Journal of the Acoustical Society of America, Volume 33, ...
  15. [15]
    Critical Bandwidth - an overview | ScienceDirect Topics
    Critical bandwidth is defined as the bandwidth associated with a specific auditory filter on the basilar membrane, which increases with center frequency and ...
  16. [16]
    Auditory Masking: Using Sound to Control Sound - Ansys
    Aug 22, 2023 · Auditory masking is the reduction of perception of one sound by the presence of another sound. For example, in medical or legal offices, where ...
  17. [17]
    Forward Masking - an overview | ScienceDirect Topics
    Forward masking occurs when a sound (the signal) cannot be perceived due to the presence of a preceding sound (the masker).Missing: course equation
  18. [18]
    Forward masking in normal-hearing, hearing-impaired, and noise ...
    ... time constant (τ) was estimated from the best ... forward masking over a range of signal delays (0–40 ms). ... 67 ms, with a mean of 50 ms. Hearing ...<|control11|><|separator|>
  19. [19]
    Forward masking of frequency modulation - PMC - PubMed Central
    Recovery from FM forward masking is well described by an exponential function with a time constant between 72 and 167 ms, for the three modulation rates used in ...
  20. [20]
    Forward masking as a function of frequency, masker level ... - PubMed
    Forward masking in dB is proportional to both masker level and log signal delay at each frequency. More forward masking occurs at very low frequencies than at ...Missing: decay half- 50 ms
  21. [21]
    Forward masking in patients with cochlear implants
    An alternative interpretation of the present data is that there are two sources of adaptation that contribute to for- ward masking: cochlear and retrocochlear.<|control11|><|separator|>
  22. [22]
    Recovery from short-term adaptation in single neurons in the ...
    Recovery from short-term adaptation was measured in single neurons in the cochlear nucleus using a forward masking stimulus paradigm.
  23. [23]
  24. [24]
  25. [25]
  26. [26]
  27. [27]
    Informational masking and auditory attention - PubMed
    Informational masking is broadly defined as a degradation of auditory detection or discrimination of a signal embedded in a context of other similar sounds.
  28. [28]
    Binaural masking level difference for pure tone signals - PMC
    Jul 4, 2023 · The binaural masking level difference (BMLD) is a psychoacoustic method to determine binaural interaction and central auditory processes.Missing: MLD | Show results with:MLD
  29. [29]
    Masking Level Difference (MLD) - AC40 - Interacoustics
    Feb 10, 2022 · The masking level difference (MLD) is the improvement in detecting a tone or speech in noise when the phase of the tone or the noise is reversed by 180 degrees.Missing: psychoacoustics | Show results with:psychoacoustics
  30. [30]
    Masking level differences under two different measurement conditions
    Masking level difference (MLD) is a binaural interaction phenomenon that describes the improvement in the detection threshold due to phase differences of ...Missing: psychoacoustics | Show results with:psychoacoustics
  31. [31]
    [PDF] SPATIAL RELEASE FROM MASKING - Acoustics Today
    A simpler version of the BILD is the binaural masking level difference (BMLD), where a target ... al listening condition (known as the squelch effect).
  32. [32]
    [PDF] Masking Level Difference (MLD): Literature Review
    Mar 10, 2017 · Masking Level Difference composes a tests set for the central auditory processing behavioral evaluation and estimates decoding binaural.
  33. [33]
    Masking of sounds by a background noise – cochlear mechanical ...
    In the search for cochlear correlates of auditory masking by noise stimuli, we recorded basilar membrane (BM) vibrations evoked by either tone or click signals ...
  34. [34]
    The role of suppression in psychophysical tone-on-tone masking - NIH
    This study tested the hypothesis that suppression contributes to the difference between simultaneous masking (SM) and forward masking (FM).
  35. [35]
    Masking by Inaudible Sounds and the Linearity of Temporal ...
    Aug 23, 2006 · This phenomenon, known as “forward masking,” is usually quantified by measuring how much the just-detectable level (or threshold) of a target ...Missing: probe | Show results with:probe
  36. [36]
    Adaptation in auditory processing | Physiological Reviews
    Jan 3, 2023 · Release of acetylcholine from medial olivocochlear neurons inhibits the electromotility exhibited by outer hair cells, which is responsible for ...<|separator|>
  37. [37]
    Growth rate of simultaneous masking in cat auditory-nerve fibers ...
    The growth rate of maskings averaged approximately 2 dB/dB, was typically largest for tones about 10% above fiber CF, and decreased at higher and lower ...
  38. [38]
    [PDF] History and Future of Auditory Filter Models - Google Research
    Abstract—Auditory filter models have a history of over a hundred years, with explicit bio-mimetic inspiration at many stages along the way.
  39. [39]
    Determining the energetic and informational components of speech ...
    SNHL listeners demonstrated both greater energetic and informational masking as well as higher glimpsed thresholds.
  40. [40]
    Temporal Masking Reveals Properties of Sound-Evoked Inhibition in ...
    The inferior colliculus (IC) is the first place in the central auditory pathway where duration-selective neurons are found.
  41. [41]
    A Subcortical Model for Auditory Forward Masking with Efferent ...
    Sep 4, 2024 · A recent computational model of the auditory subcortex including medial olivocochlear efferents generates forward masking, an increase in detection threshold.Missing: seminal | Show results with:seminal
  42. [42]
    FMRI speech tracking in primary and non-primary auditory cortex ...
    Sep 30, 2024 · These results indicate that the fMRI signal tracks cortical responses and attention effects related to continuous speech.
  43. [43]
    The effect of speech masking on the human subcortical response to ...
    Dec 11, 2024 · We show that the subcortical response to masked speech becomes smaller and increasingly delayed as the masking becomes more severe.
  44. [44]
  45. [45]
    Backward Masking - an overview | ScienceDirect Topics
    This confirms that the critical aspect of backward masking is overwriting of the target's sensory memory by the mask, not the mere proximity of target and mask.
  46. [46]
    Hearing in Complex Environments: Auditory Gain Control, Attention ...
    Feb 10, 2022 · Thus, while peripheral mechanisms certainly contribute to perceptual alterations associated with cochlear degeneration, there is growing ...
  47. [47]
    Audiology Clinical Masking - StatPearls - NCBI Bookshelf - NIH
    Mar 31, 2022 · Therefore, clinical masking is required when the difference in hearing thresholds between the ears is greater than the interaural attenuation.
  48. [48]
    Dead Regions in the Cochlea: Diagnosis, Perceptual ...
    Off-frequency listening can occur even in normally hearing people, especially when a signal has to be detected in the presence of another sound. However, it ...
  49. [49]
    An Auditory-Masking-Threshold-Based Noise Suppression ...
    Nov 10, 2005 · This study describes a new noise suppression scheme for hearing aid applications based on the auditory masking threshold (AMT) in ...Missing: reduction | Show results with:reduction
  50. [50]
    Unilateral and bilateral hearing aids, spatial release from masking ...
    Jul 11, 2013 · Spatial release from masking (SRM) was tested within the first week of fitting and after 12 weeks hearing aid use for unilateral and ...
  51. [51]
    A new sound coding strategy for suppressing noise in cochlear ... - NIH
    The new criterion selects target-dominated (SNR⩾0 dB) channels and discards masker-dominated (SNR<0 dB) channels. Experiment 1 assessed cochlear implant users' ...Missing: allocation | Show results with:allocation
  52. [52]
    Speech Understanding With Various Maskers in Cochlear-Implant ...
    Jul 19, 2018 · For example, informational masking can occur when a competing talker interferes with a target talker due to difficulty perceptually segregating ...
  53. [53]
    Evaluation and Selection of Maskers and Other Devices Used in the ...
    We review not only the phenomenon of tinnitus masking, but also discuss the use of controlled levels of noise in the treatment of hyperacusis, an exaggerated or ...
  54. [54]
    Masking release for hearing-impaired listeners: The effect of ...
    The current study demonstrates that EEQ processing leads to a greater release of masking for HI listeners in certain types of fluctuating noise (in particular, ...Missing: hyperacusis | Show results with:hyperacusis
  55. [55]
    Auditory Steady-State Responses for Detecting Mild Hearing Loss in ...
    Jul 15, 2025 · Our conclusion is that ASSR assessments via bone are effective in the pediatric population, although in the adult population, they should be ...
  56. [56]
    Sound therapy (masking) in the management of tinnitus in adults
    Initial approaches to sound therapy involved 'complete masking' whereby the masking noise is raised in intensity until the tinnitus becomes inaudible (Coles ...