7.8 DFT NOISE SUBSTITUTION

Whereas DPAC exploits temporal correlation, a substitution technique that exploits decorrelation was devised for coding efficiently noise-like portions of the spectrum. In a noise substitution procedure [Schu96], Schulz parameterizes transform coefficients corresponding to noise-like portions of the spectrum in terms of average power, frequency range, and temporal evolution, resulting in an increased coding efficiency of 15% on average. A temporal envelope for each parametric noise band is required because transform block sizes for most codecs are much longer (e.g., 30 ms) than the human auditory system's temporal resolution (e.g., 2 ms). In this method, noise-like spectral regions are identified in the following way. First, least-mean-square (LMS) adaptive linear predictors (LP) are applied to the output channels of a multi-band QMF analysis filter bank that has as input the original audio, s(n). A predicted signal, image, is obtained by passing the LP output sequences through the QMF synthesis filter bank. Prediction is done in subbands rather than over the entire spectrum to prevent classification errors that could result if high-energy noise subbands are allowed to dominate predictor adaptation, resulting in misinterpretation of low-energy tonal subbands as noisy. Next, the DFT is used to obtain magnitude (S(k),) and phase components , of the input, s(

Get Audio Signal Processing and Coding now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.