Fact-checked by Grok 2 weeks ago

Bark scale

The Bark scale is a psychoacoustical scale proposed by German acoustician Eberhard Zwicker in 1961, designed to model the nonlinear resolution of by the human through its critical bands. It divides the audible spectrum from approximately 20 Hz to 15.5 kHz into 24 critical bands, each exactly 1 Bark wide, where each band represents the frequency range over which the integrates sound energy similarly to a single filter. Named after Heinrich Barkhausen, who earlier contributed to subjective measurements, the scale approximates the tonotopic organization of the , with band widths increasing from about 100 Hz at low frequencies to around 3-4 kHz at higher frequencies. The Bark scale is empirically derived from psychoacoustic experiments on masking and , rather than a purely mathematical construct, and it closely aligns with the equivalent rectangular bandwidth (ERB) scale for many applications. A common approximation formula for converting frequency f in Hz to Bark units z is z = 13 \arctan(0.00076 f) + 3.5 \arctan\left( (f/7500)^2 \right), which provides a smooth mapping up to 24 Barks, though the original definition uses tabulated edges and centers based on experimental data. This nonlinear scaling ensures that equal distances on the Bark axis correspond to perceptually equal frequency differences, improving models of auditory processing over linear scales like Hertz. The Bark scale has become foundational in and , particularly for applications such as algorithms (e.g., ), where it helps identify masking thresholds to remove imperceptible spectral components without audible distortion. It is also employed in , speech analysis, and auditory modeling for hearing aids and virtual acoustics, enabling more accurate simulations of human hearing responses. Ongoing research continues to refine Bark-based transforms for real-time audio applications, such as filter banks in for sound classification.

Fundamentals

Definition and Purpose

The Bark scale is a psychoacoustic frequency scale that models the human auditory system's perception of pitch by dividing the audible frequency spectrum, approximately 20 Hz to 15.5 kHz, into 24 critical bands, with each band spanning one Bark unit to reflect perceptual rather than linear or logarithmic frequency intervals. This scale ensures that equal distances on the Bark axis correspond to perceptually equal intervals in frequency resolution, based on experimental measurements of and discrimination thresholds. The primary purpose of the Bark scale is to approximate the nonlinear frequency selectivity of the human ear, enabling more perceptually relevant analysis and processing of audio signals in fields such as acoustics, , and digital signal engineering. By aligning signal representations with how the resolves spectral components, it supports applications like perceptual audio coding, where it helps determine masking thresholds to reduce data rates without audible quality loss, as seen in standards such as MP3. A key characteristic of the Bark scale is its non-uniform band widths, which are narrower at low frequencies (around 100 Hz below 500 Hz) and widen progressively to about 3-4 kHz at higher frequencies, thereby mimicking the tuning properties of the basilar membrane in the . This design allows for a compact, human-centered depiction of spectral energy, where energy distributions across Barks provide an effective proxy for auditory excitation patterns.

Psychoacoustic Basis

The human processes sound through the , where the basilar membrane exhibits frequency-selective vibrations, with the apex responding to low and the base to high , creating a tonotopic map that underlies and . This mechanical filtering leads to the formation of critical bands, narrow frequency ranges in which the auditory system integrates acoustic energy, treating sounds within each band as contributing collectively to while limiting the ability to resolve individual pitches. Human perception of demonstrates a non-linear relationship to physical , such that equal intervals in perceived pitch—such as musical octaves—correspond to multiplicative rather than additive changes in , with larger absolute differences required at higher registers to achieve equivalent perceptual steps. This compressive non-linearity arises from the cochlea's varying and the neural encoding in the auditory pathway, necessitating psychoacoustic models that linear scales to better approximate auditory . Auditory masking illustrates the functional implications of critical bands, where a stronger sound elevates the detection threshold for a weaker one either simultaneously (when frequencies overlap within the same band) or temporally (when the target precedes or follows the masker by tens of milliseconds). In simultaneous masking, energy from the masker spreads across the critical band via the auditory filter's skirts, obscuring nearby tones; temporal masking, conversely, reflects recovery time in neural excitation, with forward masking persisting longer than backward due to post-stimulatory adaptation. These psychoacoustic phenomena are empirically grounded in psychophysical experiments measuring discrimination, which reveal that the (JND) in frequency is typically around 0.3% of the center frequency for mid-range frequencies, with the absolute JND increasing with frequency while the relative JND remains roughly constant above 500 Hz, and larger at lower frequencies reflecting denser neural innervation on the cochlear apex. Such variations in limits, observed under controlled levels from 200 Hz to 8000 Hz, confirm the auditory system's adaptive filtering.

Historical Development

Origins in Critical Band Theory

The foundations of critical band theory emerged from early 20th-century investigations into human auditory perception, particularly in the context of telephone speech quality and loudness measurement. In the 1920s, at Bell Laboratories conducted pioneering research on how frequency bands affect speech intelligibility and perceived in transmissions. His experiments involved filtering speech signals through low- and high-pass filters to identify bands where between components significantly degraded quality, revealing that certain ranges contributed disproportionately to overall auditory due to overlapping neural responses in the . These studies laid the groundwork for understanding how sounds within specific bands interact perceptually, influencing later models of auditory processing. The concept was formally introduced by in 1933, based on masking experiments that demonstrated how raises the threshold of a only within a limited range around the tone's . Fletcher's key 1940 experiment measured tone-in- masking thresholds, showing that as the bandwidth narrows around the tone, the masking effect remains constant once the bandwidth reaches a certain "critical" width—approximately the range over which the tone is effectively masked by the power. This critical bandwidth was estimated to correspond to the effective filtering action of the , where sounds outside this band do not significantly interfere. The concept was further refined in the 1950s and 1960s by Eberhard Zwicker through extensive masking studies, including those using narrow-band maskers and tonal signals, which confirmed the critical bandwidth's dependence: roughly 100 Hz at low frequencies (below 500 Hz) and expanding to 3-4 kHz at higher frequencies (above 3 kHz). Zwicker's experiments, such as those measuring thresholds for tones masked by bands of , provided empirical data showing that this bandwidth represents the resolution limit of auditory analysis, beyond which components are processed independently. Prior to the development of more perceptually uniform scales, early approximations of critical bands often relied on linear frequency divisions, assuming constant bandwidths in hertz across the audible . These linear models, derived from initial and masking data in Fletcher's work, proved limited in capturing the non-uniform nature of human audition, as they failed to account for the widening of effective bandwidths at higher , leading to inaccuracies in predicting perceptual interactions like masking spread and summation. Such approximations highlighted the need for frequency-warped representations that better aligned with cochlear and psychoacoustic uniformity.

Evolution and Standardization

The Bark scale was formalized by Eberhard Zwicker in as a psychoacoustical representation of the audible frequency range, dividing it into 24 critical bands based on empirical measurements of auditory critical bandwidths to model human auditory perception more accurately than linear frequency scales. Zwicker named the scale's unit "Bark" in honor of the German Heinrich Barkhausen, recognizing his foundational contributions to subjective scaling. This initial formulation provided tabular data for critical band boundaries, spanning from approximately 0 to 24 Bark to cover the human audible spectrum from 20 Hz to about 15.5 kHz, emphasizing the nonlinear spacing of auditory filters along the basilar membrane. In the 1970s, Ernst Terhardt extended Zwicker's framework through research on pitch perception, particularly virtual pitch and interactions, which highlighted the need for refined scale extensions to better account for perceptual phenomena across the full spectrum. Collaborating with Zwicker, Terhardt contributed to the 1980 publication of analytical expressions for critical-band rate and as functions of frequency, enabling more precise computational implementations and solidifying the 24-Bark division as a standard perceptual metric. These refinements shifted the scale from empirical tables toward mathematical models, facilitating broader application in auditory research. The Bark scale gained formal standardization in psychoacoustic standards, notably through ISO/R 532 (1975), which adopted Zwicker's calculation method incorporating the 24 critical bands for assessment. This was further refined in ISO 532-1:, maintaining the Bark scale as the basis for specific computations in /Bark units. In parallel, the scale was integrated into technologies during the 1980s and 1990s, such as the psychoacoustic model of the in ISO/IEC 11172-3 (1993), where it defined 25 critical bands (approximating 24 Bark) for masking threshold estimation and bit allocation. These adoptions marked key milestones, evolving the scale from analog psychoacoustic experiments to robust frameworks, with later updates in standards like ISO 226:2023 addressing individual variations in hearing sensitivity, such as age-related shifts, without altering the core 24-Bark structure.

Theoretical Framework

Critical Bands on the Bark Scale

The Bark scale partitions the audible frequency spectrum into 24 critical bands, extending from 0 to 24 Bark and covering frequencies approximately from 20 Hz to 15.5 kHz. These bands are defined such that the nth band is centered at frequencies where the aligns with the perceptual resolution of the human , as determined through psychoacoustic experiments on masking and loudness summation. The widths of the critical bands on the Bark scale increase progressively with frequency, beginning at roughly 100 Hz in the low-frequency region and expanding to about 3.5 kHz in the high-frequency region. This variation reflects the physiology of the , where low-frequency regions have finer frequency selectivity compared to higher ones. Specific boundary frequencies delineate each band; for instance, the first band spans 0–100 Hz, while the 24th band covers 12,000–15,500 Hz.
Band NumberLower Edge (Hz)Upper Edge (Hz)Bandwidth (Hz)
10100100
2100200100
3200300100
4300400100
5400510110
6510630120
7630770140
8770920150
99201,080160
101,0801,270190
111,2701,480210
121,4801,720240
131,7202,000280
142,0002,320320
152,3202,700380
162,7003,150450
173,1503,700550
183,7004,400700
194,4005,300900
205,3006,4001,100
216,4007,7001,300
227,7009,5001,800
239,50012,0002,500
2412,00015,5003,500
Each unit on the Bark scale corresponds to an equal perceptual distance in terms of rate, ensuring perceptual uniformity across the spectrum in contrast to the non-uniform of a linear Hertz scale. This of higher frequencies in the Bark representation visualizes the auditory system's reduced at elevated pitches, with bands becoming progressively wider to capture equivalent psychophysical increments.

Relationship to Human Audition

The Bark scale models the tonotopic organization of the human , where mechanical vibrations along the basilar create frequency-specific peaks that correspond to critical bands of hearing, with each Bark unit approximating the width of these bands (roughly 1.5 mm spacing on the ). This biomimetic design reflects how sound waves traveling through the cochlear fluid displace the basilar in a frequency-dependent manner, stimulating hair cells at precise locations to encode auditory information. The scale thus provides a physiological of how the processes spectral content, aligning linear frequency differences at low ranges with the 's stiffness and transitioning to logarithmic at higher frequencies to mimic neural firing patterns. In terms of perception, the Bark scale aligns closely with subjective pitch intervals, where equal steps on the scale correspond to perceptually equivalent differences, as demonstrated in psychoacoustic experiments involving adjustments and masking thresholds. For instance, bandwidths expressed in Barks yield consistent strength ratings across center frequencies in normal-hearing listeners, supporting the scale's utility in modeling how the integrates formants in speech or intervals in musical scales. This perceptual equivalence arises because critical bands on the Bark scale capture the ear's nonlinear resolution of frequency, making it more intuitive for human judgment than linear Hertz scales. Individual variations in auditory processing influence the applicability of the Bark scale, which serves as an averaged model derived from young, normal-hearing adults. widths show minimal change with age, with estimates only up to 50% wider than adults, indicating stable resolution from early development. However, hearing impairment can degrade band resolution, particularly through deficits in temporal cues, leading to reduced strength for narrow bandwidths (e.g., 5-135 Hz) even when is matched. Despite its strengths, the Bark scale has limitations as a universal model of audition, particularly for edge cases and diverse listeners. It approximates low-frequency behavior linearly below 500 Hz and becomes logarithmic above, but this does not perfectly capture all perceptual nuances in very low or high frequencies. The scale is defined only up to 15.5 kHz (24 Barks), requiring for higher ranges beyond typical hearing limits, and it overlooks inter-individual differences such as those from age-related or cochlear damage, where band broadening or asymmetric losses occur.

Mathematical Descriptions

Frequency-to-Bark Conversions

The frequency-to-Bark conversion maps physical frequencies in hertz (Hz) to the perceptual Bark scale, which approximates the nonlinear resolution of human hearing across the audible spectrum. The standard approximation, known as Zwicker's formula, is given by z = 13 \arctan(0.00076 f) + 3.5 \arctan\left( \left( \frac{f}{7500} \right)^2 \right), where z represents the critical band rate in Barks and f is the in Hz. This expression provides a smooth transition from near-linear spacing at low frequencies to logarithmic compression at higher frequencies, aligning with the structure of s derived from psychoacoustic measurements. This formula arises from empirical fits to data on thresholds, where the masking pattern's spread is analyzed to determine critical bandwidths as a function of . The first arctangent term captures the approximately linear increase in bandwidth below about 500 Hz, while the second term, with its squared argument, models the asymptotic approach to logarithmic scaling above 1 kHz, effectively combining these behaviors into a single analytical function. In practice, this conversion is employed to transform spectra into equally spaced perceptual bands on the Bark scale, facilitating the design of banks that mimic auditory . For instance, it enables the subdivision of the into 24 Bark-spaced channels corresponding to the primary critical bands of hearing. The approximation holds reliably over the typical audible range of 20 to 16,000 Hz, exhibiting an error of approximately 1% in predicting critical values compared to tabulated empirical data.

Bark-to-Frequency Transformations

The from Bark units (z) to physical frequency (f in Hz) is essential for applications requiring mapping back from the psychoacoustic scale to the linear . A widely adopted approximate formula for this transformation, proposed by Traunmüller (1990), is given by f = \frac{1960 (z + 0.53)}{26.28 - z} To account for corrections in the forward model, adjust z as follows before applying the formula: for z < 2 Bark, z' = (z + 0.3) / 0.85; for z > 20.1 Bark, z' = (z + 4.422) / 1.22; then compute f using z'. This ensures a closed-form solution that is computationally efficient and maintains invertibility and accuracy across the audible range up to approximately 15.5 kHz. Alternative formulations, such as the forward equation from Zwicker and Terhardt (1980), z = 13 \arctan(0.00076 f) + 3.5 \arctan\left(\left(\frac{f}{7500}\right)^2\right), lack a simple closed-form inverse and typically require numerical methods like root-finding algorithms (e.g., Newton-Raphson) for inversion in software implementations. Piecewise approximations enhance precision for specific ranges, with a linear segment for low Barks (z < 2, where f ≈ 100 z) capturing the near-constant bandwidth behavior below 500 Hz, and an asymptotic form for high Barks (z > 15) approximating the logarithmic compression at frequencies above 2 kHz. These segments ensure minimal error in round-trip conversions (Bark to frequency and back), with deviations typically under 1% across the scale. The Bark scale relates to the equivalent rectangular bandwidth (ERB) scale, which provides a refined estimate of auditory filter bandwidths as ERB(f) = 24.7 (4.37 f/1000 + 1) Hz. The two scales share a basis in auditory filter characteristics and are similar in their nonlinear mapping of frequency, with the ERB scale offering smoother bandwidth estimates at higher frequencies. In (DSP) software, such as MATLAB's Audio Toolbox, these transformations are implemented using the Traunmüller approximation for speed, often supplemented by precomputed lookup tables for discrete frequency bins (e.g., 24-32 bands spanning 0-24 ) to avoid floating-point errors in audio applications like perceptual and masking analysis. Numerical inversion or table is preferred when high is needed for non-standard formulas.

Applications and Comparisons

Use in Audio Processing

The Bark scale plays a central role in perceptual audio coding schemes, such as those employed in the (MPEG-1 Layer III) and (MPEG-2/4 Advanced Audio Coding) codecs, where it informs the design of filter banks that align with human critical bands to exploit psychoacoustic masking effects. In these systems, the audio spectrum is analyzed using a (DFT) grouped into Bark-scale bands, typically around 25 bands, to compute masking thresholds that determine which spectral components can be quantized more coarsely without perceptible distortion. This approach leverages simultaneous and temporal masking, where stronger signals obscure weaker ones within or adjacent to the same , allowing quantization noise to remain below the . As a result, these codecs achieve significant bitrate reductions—often 50-90% compared to uncompressed CD-quality audio (1.411 Mbps stereo)—while preserving perceived quality; for instance, typically operates at 128 kbps per channel, and at even lower rates like 64 kbps for similar fidelity. In noise reduction applications, particularly speech enhancement, the Bark scale enables multi-band processing that isolates noise from desired signals by aligning filters with auditory critical bands, improving the separation of speech in colored noise environments such as vehicular or cockpit settings. Techniques like spectral over-subtraction divide the spectrum into Bark-spaced subbands (e.g., with center frequencies spaced at 1/4 Bark intervals), apply gain modifications based on signal presence probability, and estimate noise variance using decision-directed methods, which outperform uniform-band approaches in segmental signal-to-noise ratio (SNR) by reducing musical noise artifacts. For example, Bark-scaled wavelet packet decomposition decomposes speech into 84 redundant subbands for soft-decision Wiener filtering, enhancing noisy speech from databases like Noisex-92 with minimal distortion in white Gaussian or car interior noise. Hearing aids incorporate the in adaptive filtering algorithms to tailor responses to individual profiles, warping the spectrum to match resolution and thereby improving speech intelligibility in . Binaural architectures use subband processing in arrays to apply suppression and source separation techniques, such as filtering or binary masking. This perceptual alignment ensures that amplification prioritizes bands where hearing deficits are most pronounced. Software libraries facilitate Bark scale implementations in audio processing workflows, enabling researchers and engineers to compute Bark spectrograms for and . In MATLAB's Audio Toolbox, the hz2bark function converts Hertz frequencies to Bark values using the formula z = 26.81 f / (1960 + f) - 0.53 (with boundary adjustments), supporting auditory design via designAuditoryFilterBank for warped-domain processing in tasks like feature extraction. Similarly, Python's Librosa , while natively supporting Mel-scale spectrograms, allows custom Bark transformations through its constant-Q or linear frequency resamplers, commonly used to generate Bark-aligned representations for machine learning-based audio tasks such as source separation. Recent advancements as of 2025 include the use of Bark-scale filterbanks in neural networks for acoustic cancellation and audio , such as heart sound classification.

Differences from Other Scales

The Bark scale differs from the primarily in its foundational basis and application focus. While the , derived from experiments on perceived pitch equality, emphasizes a nonlinear mapping that approximates logarithmic pitch perception—linear below 1000 Hz and logarithmic above—the Bark scale is grounded in theory, where each unit corresponds to one critical band of , resulting in wider bands at higher frequencies to model simultaneous masking effects more accurately. This makes the Bark scale particularly suitable for audio signals involving masking phenomena, such as in perceptual audio coding, whereas the excels in speech processing tasks like recognition, where pitch linearity aids feature extraction for . In contrast to the linear Hertz (Hz) scale, which treats all frequencies equally in terms of bandwidth, the Bark scale compresses the higher frequency range to achieve perceptual uniformity, aligning band widths with the nonlinear resolution of the human cochlea—narrower at lows (e.g., ~100 Hz per band below 500 Hz) and broader at highs (up to ~3 octaves per band above 8 kHz). This perceptual warping reduces the effective number of analysis bands from potentially hundreds or thousands in linear spectral representations (e.g., full FFT bins across 20 kHz) to just 24 Bark units, thereby lowering computational demands in implementations for audio analysis by focusing processing on psychoacoustically relevant resolutions. Compared to octave-based scales, which divide the frequency axis logarithmically into roughly 10 equal-ratio bands across the audible range (20 Hz to 20 kHz), the Bark scale provides finer granularity with its 24 fixed critical bands, offering greater precision in modeling psychoacoustic phenomena like masking and loudness summation without the coarser resolution of octaves. Octave scales, while simple and musically intuitive, overlook the varying bandwidths of auditory filters, making Bark preferable for detailed auditory simulations. Despite these strengths, the Bark scale introduces trade-offs relative to simpler alternatives: its nonlinear transformation requires more complex computations for frequency warping (e.g., via allpass filters or approximations) compared to the straightforward linear Hz or uniform logarithmic divisions, potentially increasing overhead in resource-constrained systems. To address such limitations in modern applications, extensions like multi-resolution variants adapt the Bark framework by incorporating variable bandwidths or decompositions for enhanced flexibility in tasks such as or source separation.

References

  1. [1]
    Subdivision of the Audible Frequency Range into Critical Bands ...
    February 01 1961 Subdivision of the Audible Frequency Range into Critical Bands (Frequenzgruppen) Available E. Zwicker
  2. [2]
    The Bark Frequency Scale - Stanford CCRMA
    — Click for http://en.wikipedia.org/wiki/Psychoacoustics. Search JOS ... Based on the results of many psychoacoustic experiments, the Bark scale is defined ...
  3. [3]
    [PDF] Improving Broadband Noise Filter for Audio Signals
    The Bark scale was proposed by Eberhard. Zwicker in 1961 and is named after ... Through his studies, Zwicker developed a scale that has 25 critical bands.Missing: paper | Show results with:paper
  4. [4]
    [PDF] 13. Psychoacoustics - UC Davis Math
    Psychoacoustics is concerned with the relationships between the physical characteristics of sounds and their perceptual attributes. This chapter describes:.
  5. [5]
    [PDF] Digital Watermarking of Audio Signals using a Psychoacoustic A
    A psychoacoustic auditory model is applied to shape and embed the watermark into the audio signal while retaining its perceptual quality for the average ...
  6. [6]
    Bark frequency transform using an arbitrary order allpass filter - PMC
    ... scale to the Bark scale. The use of a first ... Narayanan, Email: shri@sipi.usc.edu. References. 1. Zwicker E, Fastl H. Psychoacoustics, Facts and Models.
  7. [7]
    [PDF] Tutorial on Perceptual Audio Coding Algorithms
    The so called bark scale results if one puts together critical band widths such that their borders line up and in the literature about 25 bark bands can be ...
  8. [8]
    Psychoacoustic Models for Perceptual Audio Coding—A Tutorial ...
    Then, the signal energy is computed in frequency bands that are designed to resemble the Bark perceptual frequency scale [26] by appropriate grouping of the DFT ...
  9. [9]
    The Bark Frequency Scale | Spectral Audio Signal Processing
    Based on the results of many psychoacoustic experiments, the Bark scale is defined so that the critical bands of human hearing each have a width of one Bark. ...Missing: Zwicker | Show results with:Zwicker
  10. [10]
    Auditory Patterns | Rev. Mod. Phys. - Physical Review Link Manager
    Auditory Patterns, Harvey Fletcher, Harvey Fletcher Bell Telephone Laboratories, New York, New York, PDF Share, Rev. Mod. Phys. 12, 47 – Published 1 January, ...
  11. [11]
    Frequency discrimination as a function of frequency and sensation ...
    Jan 1, 1977 · Frequency discrimination was measured for frequencies from 200 to 8000 Hz and for sensation levels from 5 to 80 dB using pulsed sinusoids as stimuli.
  12. [12]
    [PDF] Harvey Fletcher's role in the creation of communication acousticsa)
    Harvey Fletcher made major contributions to communication acoustics, and was the first President of the Acoustical Society of America. His book was re-printed ...
  13. [13]
    Analytical expressions for critical‐band rate and critical bandwidth ...
    Nov 1, 1980 · The critical‐band rate as well as the critical bandwidth are functions of frequency. These dependencies have been given in table form.
  14. [14]
    LECTURE 04: HEARING PHYSIOLOGY
    BARK AND MEL SCALES. Critical Bandwidths: correspond to approximately 1.5 mm spacings along the basilar membrane, suggesting a set of 24 bandpass filters.
  15. [15]
    [PDF] Quantifying Pitch—Logarithms Just Noticeable Difference Direct ...
    A similar Bark Scale (Zwicker) ranges from 1 to 24 and corresponds to the first 24 critical bands. As frequency in Hertz increases, larger intervals are judged ...
  16. [16]
    On the Pitch Strength of Bandpass Noise in Normal-Hearing ... - NIH
    As the critical bandwidth increases with its center frequency (Zwicker, 1961; Zwicker ... (1980) Analytical expressions for critical-band rate and critical ...
  17. [17]
    [PDF] Measuring Speech. Fundamental frequency and pitch. - HAL
    Mar 3, 2022 · Several studies have suggested that the optimal scale for pitch intervals is intermediate between a linear scale and a logarithmic scale using ...
  18. [18]
    Size of Critical Band in Infants, Children, and Adults - ResearchGate
    These critical widths (estimates of critical band size) did not change substantially with age (critical widths for infants were no more than 50% larger than ...
  19. [19]
    bark2hz - Convert from Bark scale to hertz - MATLAB - MathWorks
    hz = bark2hz( bark ) converts values on the Bark frequency scale to values in hertz. example. Examples. collapse all. Convert Between Bark Scale and Hz.
  20. [20]
  21. [21]
    [PDF] Enhancement of Speech Using Bark-Scaled Wavelet Packet ...
    Abstract. In this paper, we propose a speech enhancement system, which integrates a bark-scaled wavelet packet decompo- sition (BS-WPD), a soft-decision ...
  22. [22]
    An algorithm to improve speech recognition in noise for hearing ...
    Oct 1, 2013 · To extract the 13-D RASTA-PLP feature, the power spectrum was first warped to the Bark scale, which was then log compressed. This auditory ...
  23. [23]
    hz2bark - Convert from hertz to Bark scale - MATLAB - MathWorks
    ... | double. Algorithms. The frequency conversion from Hz to the Bark scale uses the following formula: b a r k = ( 26.81 ) ( h z ) 1960 + h z − 0.53 i ...
  24. [24]
    [PDF] Intonational Equivalence: An Experimental Evaluation of Pitch Scales
    The Bark scale is derived from experiments which establish the maximum bandwidth within which masking noise will affect the perception of a tone. This ...
  25. [25]
    Giampiero Salvi :: Auditory Scales
    Comparison between ERB, Mel and Bark scale. There are three commonly used psychoacoustical scales that relate physical frequency (in Hz) to subjective ...
  26. [26]
    [PDF] Studies on inter-speaker variability in speech and its application in ...
    Although in speech recognition, the Mel scale is the most commonly used psychoacoustic scale, in many other areas of speech, the Bark scale or Equivalent ...
  27. [27]
    [PDF] Hearing Is Believing - CMU School of Computer Science
    Despite the differences in how they were developed, the Bark, Mel, and ERB scales describe a very similar dependence of auditory bandwidth on frequency ...
  28. [28]
    (PDF) A Multi-Resolution Approach for Audio Classification
    We describe a multi-resolution approach for audio classification and illustrate its application to the open data set for environmental sound classification.