Fact-checked by Grok 2 weeks ago

Cepstrum

The cepstrum (/ˈkɛpstrəm/) is a mathematical representation in defined as the inverse of the logarithm of the magnitude of a signal's , which transforms convolutions in the into additions in the quefrency domain, enabling the separation of source and filter components in composite signals. Introduced by B. P. Bogert, M. J. R. Healy, and J. W. Tukey in 1963, the concept originated from their analysis of data to detect echoes, such as in seismic recordings, where periodic ripples in the reveal hidden periodicities like delays. The term "cepstrum" is a deliberate reversal of "," with the transform domain called "quefrency" (reversing "frequency") and other related terms like "liftering" (filtering) coined in the same playful nomenclature to describe operations in this domain. Subsequent developments by and Ronald W. Schafer in 1969 formalized the complex cepstrum as the inverse of the complex logarithm of the , allowing reversible for signal decomposition, while the power cepstrum—the squared magnitude of the complex cepstrum or the inverse transform of the log power spectrum—focuses on amplitude-based periodicity detection without phase information. The real cepstrum, closely related to the power cepstrum, applies the inverse transform to the log of the magnitude spectrum alone, emphasizing symmetric properties for real-valued signals. These variants arose from efforts to address limitations in the original power-oriented approach, particularly in handling phase for applications requiring signal reconstruction. Cepstrum analysis has become foundational in diverse fields, including —where mel-frequency cepstral coefficients (MFCCs) model human auditory perception for recognition and synthesis tasks—and mechanical diagnostics, such as identifying gear faults or through harmonic family detection in spectra. In and acoustics, it aids removal and source separation, while extensions like the phase cepstrum enhance blind source separation in noisy environments. Despite computational challenges with logarithmic operations and unwrapping, its ability to reveal periodic structures invisible in traditional spectra ensures ongoing relevance in .

History and Origin

Invention in 1963

The cepstrum was introduced in 1963 by engineers B. P. Bogert and M. J. R. Healy of Bell Telephone Laboratories, along with statistician J. W. Tukey of , as a novel technique for analyzing data containing echoes. Their seminal work, titled "The Quefrency Analysis of Time Series for Echoes: Cepstrum, Pseudo-Autocovariance, Cross-Cepstrum, and Saphe Cracking," was presented at the Symposium on Time Series Analysis and published in the proceedings edited by M. Rosenblatt. This paper marked the first formal description of the cepstrum as the inverse of the logarithm of the power spectrum, designed to reveal hidden periodic structures in frequency-domain data. The primary motivation stemmed from challenges in , where direct time-domain access to signals was limited, but echoes from seismic events created detectable periodicities in the spectra. Bogert, Healy, and Tukey sought to address problems in non-stationary signals, such as recordings, by transforming spectral ripples—caused by echo delays—back into a domain resembling the for easier interpretation. This approach allowed for the identification of echo arrival times without assuming stationarity, providing a practical tool for geophysical where traditional methods fell short. To emphasize the inversion of time and frequency operations, the authors coined playful terminology as anagrams: "cepstrum" from "," "" from "frequency" to denote the transform domain, "liftering" for filtering in the quefrency domain, and "rahmonics" for harmonic-like components in quefrency. As Tukey later noted, these terms highlighted the : "In general, we find ourselves operating on the frequency side in ways customary on the time side and ." This inventive nomenclature not only facilitated discussion but also underscored the technique's foundational role in homomorphic . Subsequent adaptations extended the cepstrum to speech analysis for detection and separation.

Early Applications in Seismology

Following its introduction in , the cepstrum found immediate application in for analyzing records, particularly at Bell Laboratories where John W. Tukey and colleagues developed it as a tool for detection. The seminal work by Bogert, Healy, and Tukey demonstrated the cepstrum's utility in identifying hidden periodic es in time series data from earthquakes and explosions, enabling researchers to estimate echo delay times that correspond to wave travel paths. This was achieved by transforming the signal into the quefrency , where echoes manifest as distinct peaks, facilitating the separation of direct arrivals from reverberations. In the early , Bogert implemented Tukey's suggestion to compute the logarithm of the power spectrum on computer programs to process real seismic data, marking one of the first practical geophysical uses of the technique. A key advantage of the cepstrum over traditional autocorrelation methods in seismology was its ability to handle non-linear phase effects and provide logarithmic compression, which effectively separates multiplicative noise components that often obscure seismic signals. typically struggles with phase distortions in echoed waveforms, leading to smeared peaks, whereas the cepstrum's inverse of the log isolates periodicities more robustly, even in noisy environments. This made it particularly valuable for deconvolving complex seismic traces, where convolutional effects from and propagation must be unraveled to reveal underlying signal structure. For instance, in signal analysis, the cepstrum was applied to deconvolve waveforms and estimate characteristics, such as shapes and radiation patterns, by identifying rahmonics—periodic components indicative of the seismic . These early applications in the geophysical research underscored the cepstrum's potential beyond initial echo detection, influencing broader techniques for handling reverberant environments.

Mathematical Definition

General Formulation

The cepstrum of a time-domain f(t) is defined as the inverse of the logarithm of the magnitude of its , yielding a c(\tau) in the quefrency , where \tau represents quefrency. The primary equation is c(\tau) = \mathcal{F}^{-1} \left\{ \log \left| \mathcal{F} \left\{ f(t) \right\} \right| \right\}, where \mathcal{F} denotes the Fourier transform and \log is the natural logarithm. To compute the cepstrum, the process proceeds in three steps: first, obtain the Fourier transform F(\omega) = \mathcal{F}\{f(t)\} of the input signal; second, compute the logarithm of the magnitude \log |F(\omega)|; third, apply the inverse Fourier transform to yield c(\tau). This formulation assumes the signal f(t) is real-valued to ensure the cepstrum is an even function. The derivation stems from signal processing properties where a convolution in the time domain, f(t) = g(t) * h(t), transforms to multiplication in the frequency domain, |F(\omega)| = |G(\omega)| |H(\omega)|. Taking the logarithm converts this to addition, \log |F(\omega)| = \log |G(\omega)| + \log |H(\omega)|, and the inverse Fourier transform then yields additive components in the quefrency domain, c(\tau) = c_g(\tau) + c_h(\tau), facilitating analysis of periodic structures such as echoes by revealing peaks corresponding to delays. This cepstral approach, introduced by Bogert, Healy, and Tukey in 1963 using the logarithm of the power spectrum, provides a domain for analyzing periodic structures in spectra that are obscured in the original time or frequency representations.

Quefrency Domain

In the cepstral domain, the independent variable is known as the quefrency, denoted by τ, which serves as a transformed time scale representing the rate of change in the logarithmic . This domain arises from the inverse applied to the logarithm of the signal's , providing a framework where spectral periodicities are mapped onto a time-like axis. The term "quefrency" was coined by Bogert, Healy, and Tukey in their seminal work to evoke its to while emphasizing its role in analyzing spectral echoes and periodic structures. Quefrency shares the same units as the original of the signal, typically seconds or milliseconds, due to the dimensional properties of the pair involved in the cepstral computation. Specifically, since the cepstrum involves taking the logarithm of the frequency-domain representation (with units of frequency) and then applying an inverse transform, the resulting quefrency axis inherits time units, scaling directly with the sampling rate of the input signal. This equivalence to time units allows quefrency to intuitively correspond to physical or periods in the original signal, facilitating interpretations akin to time-domain analysis but focused on modulations. A key property of the quefrency domain is its ability to reveal periodicities inherent in the logarithm of the , where such periodicities often stem from , , or other convolutive effects in signal. For instance, ripples in the log- caused by an echo delay τ manifest as distinct peaks in the cepstrum at quefrency values equal to that delay, enabling clear identification of periodic components that might be obscured in the time or domains. Similarly, harmonic spacings, such as those from voiced speech or musical tones, produce peaks at quefrencies matching the fundamental period, highlighting the domain's sensitivity to these structures. Compared to the time and frequency domains, the quefrency domain leverages cepstral analysis, a nonlinear transformation that converts multiplicative relationships in the frequency domain—arising from convolutions in the time domain—into additive components. This allows for the isolation of convolved signal elements, such as excitation and filtering effects, through operations along the quefrency axis, a capability not directly available in linear time-frequency representations.

Types of Cepstra

Power Cepstrum

The power cepstrum of a signal f(t) is defined as the inverse Fourier transform of the natural logarithm of the squared magnitude of its Fourier transform, effectively analyzing the log-power spectrum while disregarding phase information. This formulation, originally introduced as the cepstrum by Bogert, Healy, and Tukey, focuses on amplitude-based periodicities in the frequency domain. Mathematically, it is expressed as C_p(\tau) = \mathcal{F}^{-1} \left\{ \log \left( |\mathcal{F} \{ f(t) \)|^2 \right) \right\}, where \mathcal{F} denotes the Fourier transform, \mathcal{F}^{-1} its inverse, and \tau is the quefrency variable. The output C_p(\tau) is typically real-valued and symmetric, highlighting echoes or periodic components in the log-magnitude spectrum. To compute the power cepstrum, the process involves three main steps: first, obtain the power spectral density by computing the squared magnitude of the Fourier transform of the input signal; second, apply the natural logarithm to this power spectrum; third, perform the inverse Fourier transform on the result, often yielding a real-valued sequence due to the even nature of the input. This magnitude-only approach simplifies analysis but requires careful handling of the logarithmic operation to avoid issues with zero or negative values, typically addressed via windowing or small constant additions. A key advantage of the power cepstrum is its robustness to phase distortions, as it operates solely on the magnitude spectrum, making it insensitive to variations in signal that might otherwise obscure periodic structures. It is particularly effective for detecting periodicities in the amplitude spectra of signals, such as echoes or patterns, even in the presence of additive . In contrast to the complex cepstrum, which incorporates for more complete signal representation, the power cepstrum prioritizes simplicity in magnitude-based diagnostics. However, this phase-ignoring nature constitutes a primary limitation, as the power cepstrum discards potentially valuable information essential for accurate signal reconstruction or tasks. Consequently, it is less suitable for applications requiring full spectral recovery, where the may be preferred.

Complex Cepstrum

The complex cepstrum of a signal f(t) is defined as the Fourier transform of the of its , thereby preserving both the and information of the . This formulation, introduced by in the context of homomorphic , contrasts with the power cepstrum by retaining the full spectral , enabling more complete signal analysis. Mathematically, the complex cepstrum \hat{c}(\tau) is given by \hat{c}(\tau) = \mathcal{F}^{-1} \left\{ \log \left( \mathcal{F} \{ f(t) \} \right) \right\}, where \mathcal{F} denotes the , and the \log(X(f)) = \log|X(f)| + j\phi(f) incorporates the real part corresponding to the log-magnitude (related to the power cepstrum) and the imaginary part encoding the unwrapped phase \phi(f). Computing the complex cepstrum presents challenges primarily due to the multi-valued nature of the complex logarithm, which requires selecting the principal value and addressing phase wrapping discontinuities that occur in jumps of $2\pi i. These issues necessitate phase unwrapping algorithms, such as those based on integration or adaptive thresholding, to obtain a continuous phase function before applying the logarithm; failure to unwrap properly can introduce artifacts in the cepstral domain. A key property of the cepstrum is its invertibility, allowing exact reconstruction of the original signal through of the cepstrum followed by an , provided the is accurately unwrapped. Additionally, it facilitates additive separation of signal components in the quefrency domain: for minimum-phase signals (with all poles and zeros inside the unit circle), the cepstrum is causal and zero for negative quefrencies, while for maximum-phase signals (poles and zeros outside the unit circle), it is anti-causal and zero for positive quefrencies, enabling their isolation via linear filtering. Historically, the complex cepstrum gained prominence in the post-1960s era, particularly following the 1965 development of the , as it became the preferred tool for techniques aimed at deconvolving convolved signals, such as separating source and channel effects in speech or seismic data. This approach, formalized by and Schafer, transformed multiplicative spectral relationships into additive ones in the cepstral domain for easier manipulation.

Properties and Analysis

Key Mathematical Properties

The power cepstrum of a real-valued signal exhibits even symmetry in the quefrency domain, meaning c(\tau) = c(-\tau) for all quefrencies \tau, due to the real and even nature of the power spectrum's logarithm. Similarly, the complex cepstrum of a real signal is real-valued, reflecting the conjugate-even symmetry of the complex logarithm of the spectrum. If the original signal is scaled by a positive constant k, the cepstrum experiences a shift solely at quefrency zero, where c(0) becomes c(0) + \log k, as the logarithm of the adds a constant that inverse-transforms to a delta function at the . This isolates changes without affecting other quefrency components. A fundamental convolution property holds for the complex cepstrum: the cepstrum of the of two signals equals the sum of their individual cepstra, \hat{c}_{x_1 * x_2}(n) = \hat{c}_{x_1}(n) + \hat{c}_{x_2}(n), enabling via subtraction in the cepstral domain after logarithmic transformation. For , the cepstrum of a stable signal—characterized by poles inside the unit —is bounded, with |\hat{c}(n)| < C \alpha^{|n|} where \alpha < 1 and C is a constant, ensuring under minimum-phase assumptions. Uniqueness is guaranteed for minimum-phase systems, where the cepstrum fully determines the original signal, as the causal cepstrum corresponds uniquely to the minimum-phase sequence sharing the same magnitude spectrum.

Interpretation in Signal Processing

In signal processing, peaks in the cepstrum, particularly ridges at a specific quefrency \tau, reveal periodic components in the spectrum, where \tau = 1/f corresponds to the inverse of a periodic frequency f, such as harmonics or modulation frequencies. This interpretation arises because the logarithm in the cepstral transform converts multiplicative spectral periodicities into additive structures in the quefrency domain, making them detectable as distinct peaks. For echo detection, the complex cepstrum exhibits delta-like peaks at quefrencies \tau equal to the echo delay times, directly indicating the temporal separation between the direct signal and its es without requiring prior knowledge of the signal shape. These peaks stem from the phase information preserved in the , allowing precise localization of delayed replicas in composite signals like seismic or acoustic recordings. The structure of a signal manifests in the cepstrum as rahmonics, a decaying sequence of peaks spaced at multiples of the fundamental quefrency, reflecting the of the series in the . Rahmonics decrease in with higher orders due to the 's smoothing effect, providing insight into the periodicity and energy distribution of the original signal. Liftering, a form of selective filtering in the quefrency domain, isolates components based on their physical origins: low-quefrency liftering extracts slowly varying spectral envelopes (e.g., formants in speech), while high-quefrency liftering targets rapid variations like harmonics. This operation exploits the additive separation in the cepstrum to enhance or suppress specific phenomena, such as removing echo-related rahmonics. In speech signals, a prominent quefrency at \tau = 1/T, where T is the , reveals the voicing structure by highlighting the quasi-periodic glottal pulses, distinguishing voiced from unvoiced segments. For instance, a at 12.5 ms quefrency corresponds to an 80 Hz , aiding in estimation and source-filter separation.

Applications

Echo Detection and Deconvolution

Homomorphic deconvolution employs the cepstral transform to decompose convolved signals into additive components in the quefrency domain, facilitating the detection and removal of echoes by isolating and subtracting their contributions. The procedure starts by calculating the cepstrum of the received signal, modeled as the convolution of the source signal with an echo impulse response, where echoes appear as prominent peaks at quefrencies equal to their time delays. These echo-related peaks are then identified—often through thresholding or visual inspection—and nulled or excised via liftering operations. The resulting modified cepstrum undergoes an inverse cepstral transformation, comprising an exponential operation followed by an inverse Fourier transform, to yield the reconstructed source signal. When the echo cepstrum is estimated or known, the deconvolved cepstrum is given by c_{\text{deconvolved}}(\tau) = c_{\text{received}}(\tau) - c_{\text{echo}}(\tau), where \tau denotes quefrency, illustrating the additive separation inherent to the domain. This approach excels in managing echoes without necessitating prior knowledge or detailed models of the echo path, as the quefrency peaks directly reveal delay structures regardless of amplitude or waveform specifics. Furthermore, the logarithmic preprocessing in cepstrum computation provides robustness to additive noise by compressing spectral magnitudes and mitigating the impact of low-amplitude components. In seismic , for example, homomorphic separates overlapping from subsurface reflections, enhancing of primary arrivals; similarly, in systems, it clarifies target returns amid . Despite these strengths, homomorphic assumes minimum-phase properties for reliable echo identification in the power cepstrum, and it remains sensitive to phase inaccuracies arising from unwrapping errors in the . The mitigates phase-related issues by retaining full information.

Speech and Audio Analysis

In speech and audio analysis, the cepstrum plays a crucial role in detection for voiced sounds, where the f_0 manifests as a prominent peak in the cepstrum at a quefrency of $1/f_0. This property arises because the cepstral transform separates the periodic excitation source from the spectral envelope, allowing reliable estimation of f_0 even under distortions or moderate noise. The technique, introduced in early vocal-pitch detection methods, processes short-time spectra to identify these rahmonic peaks, offering robustness compared to direct approaches. Formant analysis leverages the cepstrum's ability to decompose the speech signal into excitation and vocal tract components via . Low-quefrency liftering isolates the slowly varying spectral envelope, which encodes the frequencies representing vocal tract resonances, while high-quefrency components capture the harmonics. This separation facilitates accurate formant tracking for applications like and identification, with the complex cepstrum providing precise envelope estimation by mitigating effects in the logarithmic spectrum. Beyond core analysis, cepstral methods enable practical applications in speaker identification and . In speaker identification, cepstral coefficients capture speaker-specific vocal tract characteristics, achieving up to 70% accuracy in early experiments on short speech segments by distinguishing individual variations. For , cepstral detection extracts fundamental frequencies from harmonic-rich signals; for instance, a 441 Hz sampled at 44.1 kHz yields a clear cepstral at a 100-sample quefrency, aiding tasks like note onset detection and polyphonic transcription. Post-2000 advancements have integrated cepstrum into automatic speech recognition (ASR) systems for noise-robust feature extraction, such as to mitigate additive noise effects on log-spectral representations. In , cepstral analysis supports efficient encoding of speech features in distributed systems, compressing vectors of 13 cepstral coefficients plus log-energy for low-bitrate transmission while preserving recognition performance. Recent developments in the 2020s combine cepstral processing with for enhanced robustness in noisy environments, where homomorphic feeds and vocal tract estimates into .

Homomorphic Signal Processing

Homomorphic is a technique that employs nonlinear transformations to convert signals combined through or into additive components, facilitating linear filtering operations in a transformed . This approach generalizes linear by mapping signals into a where nonadditive interactions become separable via addition, allowing for easier manipulation and analysis. The framework evolved from the initial development of the cepstrum in the early 1960s, where Bogert, Healy, and Tukey introduced the concept for echo detection in seismic signals, and was independently advanced at MIT by Thomas Stockham, Alan V. Oppenheim, and Ronald W. Schafer for audio and speech applications. Alan V. Oppenheim extended this in his 1964 MIT dissertation by formalizing homomorphic systems as a class of nonlinear processors that enable superposition through generalized linear operations. In the 1970s, Oppenheim and Ronald W. Schafer further developed the theory into a comprehensive methodology, detailed in their book Digital Signal Processing (1975) and subsequent works like Rabiner and Schafer's Digital Processing of Speech Signals (1978), emphasizing applications in discrete-time signals and speech analysis. The cepstrum serves as the core domain in homomorphic processing, representing the inverse Fourier transform of the logarithm of a signal's , which transforms convolutional operations in the into additive terms. This additive structure in the cepstral domain allows for straightforward separation and filtering of signal components that are intertwined in the original domain. The typical homomorphic processing pipeline involves several stages to analyze and modify signals:
  1. Compute the of the input signal to obtain its .
  2. Apply the to the , converting multiplications to additions.
  3. Perform the inverse to yield the cepstrum.
  4. Apply linear filtering, such as low-pass or high-pass liftering, to isolate desired components in the cepstral domain.
  5. Apply the to reverse the logarithm.
  6. Compute the inverse to reconstruct the modified and time-domain signal.
One key benefit in is the separation of the excitation source, such as the glottal pulse, from the vocal tract response, enabling independent modeling for and tasks. This framework has been generalized beyond speech to image enhancement, where it compresses by filtering illumination and reflectance components, and to biomedical signals for improved feature extraction in noisy environments.

Mel-Frequency Cepstral Coefficients

Mel-Frequency Cepstral Coefficients (MFCCs), introduced by Steven B. Davis and Paul Mermelstein in 1980, represent a perceptual modification of cepstral coefficients, obtained by warping the frequency axis of the signal's spectrum to the before computing the cepstrum, thereby mimicking the nonlinear frequency resolution of the human . This warping aligns the feature extraction process with human auditory perception, where pitch differences are perceived logarithmically rather than linearly. The , introduced in 1937, maps linear frequency f to mel scale m via the approximate formula m = 2595 \log_{10}(1 + f/700), emphasizing lower frequencies where human hearing is more sensitive. The computation of MFCCs involves several steps tailored to short-term audio frames, typically 20-40 ms in length with 50% overlap. First, the power is derived from the framed signal using a . Next, a bank of triangular filters, spaced linearly on the (usually 20-40 filters), is applied to this spectrum to obtain mel-filterbank energies. The logarithm is then taken of these energies to compress the , followed by a (DCT) to decorrelate the coefficients and yield the MFCCs, with the lower-order coefficients (typically 13-20) capturing the most perceptually relevant spectral envelope information. Higher-order coefficients are often discarded or used selectively to reduce dimensionality. The k-th MFCC is computed as: \text{MFCC}_k = \sum_{m=1}^{M} \log(S_m) \cos\left( \frac{\pi k (m - 0.5)}{M} \right), \quad k = 1, \dots, K where S_m is the output of the m-th mel filter, M is the number of filters, and K is the number of desired coefficients (e.g., 13). This DCT step approximates the inverse used in the standard cepstrum, producing a compact representation of the signal's spectral shape. MFCCs offer advantages over linear-frequency cepstral coefficients by better capturing and perceptual nuances, leading to improved robustness in automatic speech recognition (ASR) tasks; for instance, in experiments on monosyllabic , MFCCs achieved higher accuracy than coefficients or filter-bank parameters. Their perceptual basis reduces sensitivity to irrelevant spectral details, making them effective for modeling vocal tract resonances in speech. In modern applications, MFCCs continue to serve as a foundational feature in ASR pipelines for voice assistants and systems, often as inputs to convolutional or recurrent neural networks; for example, studies in the demonstrate their efficacy in hybrid models for and speaker identification, with accuracies exceeding 90% on benchmark datasets. Extensions like Gammatone Frequency Cepstral Coefficients (GFCCs), introduced in the , enhance robustness in noisy environments by using gammatone filters that more closely model cochlear impulse responses, outperforming MFCCs in non-stationary noise scenarios by 10-20% in reduction. Despite their strengths, MFCCs have limitations in non-speech audio applications, such as or environmental , where their speech-tuned mel-scale warping may overlook or structures, leading to suboptimal performance compared to spectrogram-based features. In the , neural methods, such as self-supervised representations from models like wav2vec 2.0, address these gaps by directly processing raw waveforms to learn task-specific embeddings, bypassing handcrafted features like MFCCs and achieving state-of-the-art results on diverse audio tasks without perceptual warping assumptions.

References

  1. [1]
    Cepstral - an overview | ScienceDirect Topics
    2. The real cepstrum, as introduced by Bogert, Healy, and Tukey, is the inverse Fourier transform of the logarithm of the magnitude spectrum and is typically ...
  2. [2]
    From Frequency to Quefrency: A History of the Cepstrum
    To suggest what prompted the invention of the term cepstrum, this article narrates the historical and mathematical background that led to its discovery.
  3. [3]
    [PDF] Dsp history - From frequency to quefrency: a history of the cepstrum
    the original paper have faded into the background, the term cepstrum has survived and become part of the digital signal processing lexicon. In the early ...Missing: definition | Show results with:definition
  4. [4]
    [PDF] D Cepstrum
    Bogert, M. J. R. Healy, and J. W. Tukey. The quefrency alanysis of time series for echoes: Cepstrum, pseudo- autocovariance, cross-cepstrum and saphe cracking.
  5. [5]
    A history of cepstrum analysis and its application to mechanical ...
    The cepstrum is a “spectrum of a log spectrum” and separates input and transfer path. Families of harmonics and/or sidebands can be identified, assessed, and/ ...
  6. [6]
    [PDF] Cepstrum Analysis and Gearbox Fault Diagnosis
    Cepstrum Analysis is a tool for the detection of periodicity in a frequency spectrum, and seems so far to have been used mainly in speech analysis for voice ...
  7. [7]
    [PDF] Investigation of Cepstrum Analysis for Seismic/Acoustic Signal ...
    DEFINITION OF TERMS USED IN THF REPORT. Page 84. 73. DEFINITION OF TERMS USED IN THIS PAPER. In order to get a meaningful terminology for the cepstrum technique ...
  8. [8]
    [PDF] Power Cepstrum Technique With Application to Model Helicopter ...
    The concept of the cepstrum was first reported by Bogert, Healy, and Tukey. (ref. 1 ) as a technique for detecting echoed signals in measured time series.
  9. [9]
    [PDF] Application of Power Spectrum, Cepstrum, Higher Order Spectrum ...
    The algorithum of power cepstrum of a signal can be summarized as: signal → FFT → abs() → square → log → FFT → abs() → square → power cepstrum. 2.3 Higher order ...Missing: limitations | Show results with:limitations
  10. [10]
    Cepstrum Pitch Determination - AIP Publishing
    Cepstrum pitch determination uses the cepstrum, the power spectrum of the log of the power spectrum, to find peaks corresponding to pitch periods. Algorithms ...
  11. [11]
    [PDF] Chapter 5 - ISY
    This is an identification problem, based on the cepstrum. 5.1 Introduction. In 1963, Bogert, Healy and Tukey published a paper with the unusual title `The.
  12. [12]
    [PDF] Cepstrum Analysis
    The power cepstrum was first described and used by Bogert C'tal. [I] in. 1963 ... phase information is lost [2-5]. The complex cepstrum of a data ...
  13. [13]
    A history of cepstrum analysis and its application to mechanical problems
    ### Summary of Complex Cepstrum from https://www.sciencedirect.com/science/article/abs/pii/S0888327016305556
  14. [14]
    [PDF] Homomorphic Analysis of Speech
    For a minimum phase window, then, the complex cepstrum of pl(n) will be zero for n <O. Similarly, if w(n) is maximum phase, such that all poles and zeros of its ...
  15. [15]
  16. [16]
    [PDF] Cepstrum Analysis - Pearl HiFi
    After a discussion of the basic theory, this paper describes many of the applications of the cepstrum, including the study of signals containing echoes (land- ...
  17. [17]
    [PDF] ECE 8440 Unit 22 Cepstrum properties.pptx
    phase, we must perform "unwrapping” of to obtain the actual con&nuous phase ... Figure 13.2 Determine of the complex cepstrum for minumum-‐phase signals. 17 ...
  18. [18]
    [PDF] The Cepstrum and Homomorphic Speech Processing 倒谱与同态 ...
    1. complex cepstrum is non-zero and of infinite extent for both positive and negative n, even though x[n] may be causal, or ...
  19. [19]
    Minimum-Phase and Causal Cepstra - DSPRelated.com
    Minimum-Phase and Causal Cepstra. To show that a frequency response is minimum phase if and only if the corresponding cepstrum is causal, we may take the log ...
  20. [20]
    [PDF] Machine Diagnostics using Advanced Signal Processing
    Ref: Bogert, Healy and Tukey (yes the one of FFT fame, but two years earlier) – “The Quefrency Alanysis of Time Series For Echoes;. Cepstrum, Pseudo ...
  21. [21]
  22. [22]
    [PDF] Deconvolution of Seismic Data using Homomorphic Filtering - DTIC
    cepstrum was directed toward echo detection rath- er than a deconvolution retention of phase infor- mation was not important. Thus, it does not uti- lize ...Missing: seminal | Show results with:seminal
  23. [23]
    Homomorphic deconvolution in reverberant and distortional channels
    Homomorphic deconvolution is used to determine the difference in arrival times or time delays among the paths of a channel, to estimate both the impulse ...Missing: seminal papers
  24. [24]
    Short‐Time Spectrum and “Cepstrum” Techniques for Vocal‐Pitch ...
    Cepstral pitch detection has the important advantages that it is insensitive to phase distortion, and is also resistant to additive noise and amplitude ...Missing: seminal | Show results with:seminal
  25. [25]
  26. [26]
    [PDF] CEPSTRAL NOISE SUBTRACTION FOR ROBUST AUTOMATIC ...
    ABSTRACT. The robustness of speech recognizers towards noise can be increased by normalizing the statistical moments of the Mel-frequency cepstral.
  27. [27]
    [PDF] ETSI ES 201 108 V1.1.2 (2000-04)
    The feature vectors consist of 13 static cepstral coefficients and a log-energy coefficient. The feature extraction algorithm defined in this clause forms a ...Missing: post- | Show results with:post-<|control11|><|separator|>
  28. [28]
    An overview of logarithm-based image processing techniques for ...
    This paper presents an overview of the three logarithm-based image processing techniques: the multiplicative homomorphic system (MHS), the log-ratio (LR) and ...
  29. [29]
    Biologically Inspired Features for Non-Speech Audio Classification
    May 17, 2012 · Mel frequency cepstral coefficients (MFCC) have become a de facto standard for audio parameterization. Taking as a basis the MFCC computation ...Missing: limitations | Show results with:limitations