WO2014096236A2 - Signal adaptive fir/iir predictors for minimizing entropy - Google Patents

Signal adaptive fir/iir predictors for minimizing entropy

Info

Publication number
WO2014096236A2
Authority
WO
WIPO (PCT)
Prior art keywords
filter
frame
fir
signal
coefficients
Prior art date
Application number
PCT/EP2013/077461
Other languages
French (fr)
Other versions
WO2014096236A3 (en
Inventor
Arijit Biswas
Original Assignee
Dolby International Ab
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dolby International Ab
Priority to US14/649,477 (patent US9548056B2)
Publication of WO2014096236A2
Publication of WO2014096236A3

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/0017 Lossless audio signal coding; Perfect reconstruction of coded audio signal by transmission of coding error
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques

Definitions

  • the present document relates to coding.
  • the present document relates to lossless coding using linear prediction, possibly in combination with entropy encoding.
  • Audio encoders and in particular lossless audio encoders typically employ a FIR (Finite Impulse Response) prediction filter to reduce the entropy of an audio signal.
  • FIR: Finite Impulse Response
  • IIR: Infinite Impulse Response
  • IIR prediction filters may e.g. be used in the so-called Dolby TrueHD lossless encoder.
  • unlike for FIR predictors, it is typically difficult to derive optimal IIR prediction coefficients on a frame-by-frame basis that guarantee the stability of the predictor system (for the encoder) and of its inverse system (for the decoder).
  • the present document addresses the above-mentioned technical problem.
  • the present document describes methods for determining the coefficients of IIR-based prediction filters which lead to improved prediction results (i.e. which lead to a reduction of the entropy of the prediction error signal).
  • the IIR prediction filters may be determined such that stability may be guaranteed.
  • the methods described in the present document enable the use of IIR-based prediction, thereby providing audio encoders (in particular lossless audio encoders) with improved coding gains.
  • the general prediction filter may be determined such that it is ensured that the determined general prediction filter is stable.
  • the frame of the input signal e.g. an audio signal such as a speech signal or a music signal, or an image signal, e.g. a line or a column of an image
  • the general prediction filter may comprise an infinite impulse response (IIR) prediction filter.
  • the general prediction filter may comprise an IIR prediction filter component and/or an FIR prediction filter component.
  • the z-transform of the general prediction filter may be represented as a ratio of an FIR filter in the numerator and an FIR filter in the denominator.
  • the z-transform of the general prediction filter (also referred to as the transfer function of the general prediction filter or the z-transform of the impulse response of the general prediction filter) may be presented in a form which comprises an approximation to the z-transform of a finite impulse response (FIR) filter with the z variable of the FIR prediction filter being replaced by the z-transform of an allpass filter.
  • FIR: finite impulse response
  • the general prediction filter may be presented in a form which comprises the z-transform of a FIR filter with the z variable of the FIR prediction filter being replaced by the z-transform of an allpass filter.
  • a general prediction filter which may comprise an IIR prediction filter
  • the allpass filter may exhibit a pole defined by an adjustable pole parameter λ.
  • the general prediction filter may be defined by the plurality of FIR coefficients and by the pole parameter λ.
  • the allpass filter exhibits a single pole defined by a single adjustable pole parameter.
  • the z-transform of the general prediction filter may be derived from (an approximation of) the z-transform of an FIR filter with the z variable of the FIR prediction filter being replaced by the z-transform of an allpass filter.
  • the general prediction filter may be determined by first determining an intermediate general prediction filter having a z-transform which (exactly) comprises the z-transform of an FIR filter with the z variable of the FIR prediction filter being replaced by the z-transform of an allpass filter.
  • the coefficients of the intermediate general prediction filter may then be approximated (e.g. the coefficients may be quantized), thereby yielding the coefficients of the general prediction filter.
  • the z-transform of the general prediction filter comprises an approximation of the z-transform of an FIR filter with the z variable of the FIR prediction filter being replaced by the z-transform of an allpass filter.
  • the approximation may be due to the quantization of filter coefficients and/or due to the transformation of the FIR filter coefficients and the pole parameter to an IIR filter representation (as described below in the context of the "mapping" feature).
  • the pole parameter λ may be used to adapt the general prediction filter between an FIR prediction filter and an IIR prediction filter.
  • the method may yield an adaptive general prediction filter which may adapt its filter structure (i.e.
  • a general prediction filter having an IIR structure typically also comprises an FIR filter component.
  • a general prediction filter having an FIR structure typically only comprises an FIR filter component.
  • the pole parameter may be unequal to zero, thereby providing a general prediction filter which exhibits an infinite impulse response.
  • when the pole parameter is zero, the general prediction filter typically corresponds to an FIR prediction filter. This means that for the particular frame of the input signal, entropy minimization may be achieved using an FIR prediction filter without the need to provide an IIR prediction filter.
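The effect of the pole parameter can be checked numerically. The following is a minimal Python sketch, assuming the allpass filter has the common first-order form D(z) = (z^-1 - lam) / (1 - lam z^-1). This form and the names `allpass`, `warped_fir_impulse_response` and `lam` are assumptions made for illustration and are not taken from the present document. The sketch replaces each unit delay of an FIR filter by the allpass filter and returns the resulting impulse response; the pre-filter and the overall delay are left out.

```python
def allpass(x, lam):
    """Filter x through the assumed allpass D(z) = (z^-1 - lam) / (1 - lam z^-1)."""
    y = []
    x_prev = 0.0
    y_prev = 0.0
    for xn in x:
        # Difference equation: y[n] = -lam * x[n] + x[n-1] + lam * y[n-1]
        yn = -lam * xn + x_prev + lam * y_prev
        y.append(yn)
        x_prev, y_prev = xn, yn
    return y


def warped_fir_impulse_response(coeffs, lam, length):
    """Impulse response of sum_k coeffs[k-1] * D(z)**k, truncated to `length` samples."""
    stage = [1.0] + [0.0] * (length - 1)  # unit impulse
    h = [0.0] * length
    for c in coeffs:
        stage = allpass(stage, lam)  # one more allpass section in the chain
        h = [hn + c * sn for hn, sn in zip(h, stage)]
    return h


fir_like = warped_fir_impulse_response([0.5, -0.25], 0.0, 6)
iir_like = warped_fir_impulse_response([1.0], 0.5, 6)
```

With `lam = 0.0` the allpass filter reduces to a unit delay and the response ends after the last coefficient. With a non-zero `lam` the response decays but does not terminate, which is the infinite impulse response case.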
  • the z-transform of the general prediction filter may comprise a pre-filter configured to whiten a spectrum of the prediction error signal. By whitening the spectrum of the prediction error signal, the entropy encoding of the prediction error signal may be performed with increased efficiency.
  • the z-transform of the general prediction filter may comprise an overall delay. By inserting an overall delay, it may be ensured that the general prediction may be performed in a causal manner.
  • the general prediction filter comprises an overall delay $z^{-1}$ and each filter component $H_k(z)$ comprises a pre-filter $\frac{\sqrt{1-\lambda^2}}{1-\lambda z^{-1}}$ for whitening purposes.
  • the method may comprise determining the pole parameter and the plurality of FIR coefficients, such that an entropy of a frame of a prediction error signal which is derived from the frame of the input signal using the general prediction filter defined by the pole parameter and the plurality of FIR coefficients is reduced (e.g. is minimized).
  • the general prediction filter may be used to determine a frame of an estimated signal (e.g. an estimated audio signal or an estimated image signal) from the frame of the input signal.
  • the difference between the frame of the estimated signal and the frame of the input signal may provide the frame of the prediction error signal.
  • the pole parameter and the plurality of FIR coefficients may specify the general prediction filter, and the general prediction filter may be adjusted such that the entropy of the frame of the prediction error signal is reduced (e.g. minimized).
  • the entropy of the frame of the prediction error signal may be estimated by determining a probability distribution of the values of samples of the frame of the prediction error signal.
  • the entropy may be estimated based on a weighted sum of the probability distribution.
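A minimal Python sketch of such an entropy estimate is given below. It assumes integer-valued prediction error samples and uses the empirical histogram of the frame as the probability distribution; the function name `estimate_entropy` is hypothetical.

```python
from collections import Counter
from math import log2


def estimate_entropy(samples):
    """First-order entropy estimate in bits per sample.

    The probability of each value is its relative frequency in the frame,
    and the entropy is the probability-weighted sum of -log2(probability).
    """
    n = len(samples)
    counts = Counter(samples)
    return -sum((c / n) * log2(c / n) for c in counts.values())


two_equally_likely_values = estimate_entropy([0, 0, 1, 1])
constant_frame = estimate_entropy([5] * 8)
```

A frame with two equally likely values gives one bit per sample, and a constant frame gives zero bits per sample.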
  • Determining the pole parameter and the plurality of FIR coefficients may comprise setting the adjustable pole parameter to a fixed first value and determining the plurality of FIR coefficients using the set pole parameter.
  • determining the plurality of FIR coefficients may comprise determining the plurality of FIR coefficients such that a mean squared power of the frame of the prediction error signal is reduced.
  • this target may be achieved by solving a set of normal equations (e.g. using a Levinson-Durbin algorithm).
  • determining the plurality of FIR coefficients may comprise determining a frame of a regressor signal based on the frame of the input signal for each tap of the general prediction filter (i.e. for each filter component $H_k(z)$), thereby yielding a plurality of regressor signal frames.
  • the plurality of regressor signal frames may be used to determine an autocorrelation matrix Q for the plurality of regressor signal frames.
  • the size of the autocorrelation matrix Q typically depends on the number K of FIR coefficients which are to be determined.
  • a cross-correlation vector P may be determined based on the plurality of regressor signal frames and the frame of the input signal.
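The normal equations built from Q and P can be sketched in Python as follows. The regressor signal frames are passed in as plain lists, one per tap. The system is solved here by Gaussian elimination with partial pivoting, which is swapped in for brevity and is not the Levinson-Durbin algorithm mentioned above. The function names are hypothetical.

```python
def correlation_system(regressor_frames, input_frame):
    """Build the autocorrelation matrix Q and the cross-correlation vector P."""
    K = len(regressor_frames)
    Q = [[sum(a * b for a, b in zip(regressor_frames[i], regressor_frames[j]))
          for j in range(K)] for i in range(K)]
    P = [sum(a * b for a, b in zip(regressor_frames[i], input_frame))
         for i in range(K)]
    return Q, P


def solve_normal_equations(Q, P):
    """Solve Q c = P for the K coefficients c (Gaussian elimination)."""
    K = len(P)
    A = [row[:] + [p] for row, p in zip(Q, P)]  # augmented matrix [Q | P]
    for col in range(K):
        pivot = max(range(col, K), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(col + 1, K):
            factor = A[r][col] / A[col][col]
            for c in range(col, K + 1):
                A[r][c] -= factor * A[col][c]
    coeffs = [0.0] * K
    for r in range(K - 1, -1, -1):  # back substitution
        rest = sum(A[r][c] * coeffs[c] for c in range(r + 1, K))
        coeffs[r] = (A[r][K] - rest) / A[r][r]
    return coeffs


# Example: a one-tap FIR predictor, whose single regressor is the input
# delayed by one sample.
signal = [0.9 ** n for n in range(50)]
regressor_frames = [[0.0] + signal[:-1]]
Q, P = correlation_system(regressor_frames, signal)
coefficients = solve_normal_equations(Q, P)  # close to [0.9] for this signal
```

The size of Q is K by K, so the cost of the solve grows with the number of coefficients rather than with the frame length.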
  • Determining the pole parameter and the plurality of FIR coefficients may comprise estimating the entropy of the frame of the prediction error signal obtained using the general prediction filter defined by the set pole parameter and the plurality of FIR coefficients.
  • the plurality of FIR coefficients have been determined based on the set pole parameter (e.g. using the above mentioned set of normal equations).
  • the steps of determining the plurality of FIR coefficients (for a set pole parameter) and of estimating the entropy may be repeated for a plurality of differently set pole parameters, thereby yielding a corresponding plurality of entropy values.
  • the pole parameter may be selected from the plurality of differently set pole parameters, which reduces the estimated entropy of the frame of the prediction error signal.
  • the pole parameter which yields the lowest entropy from the plurality of entropies may be selected.
  • the plurality of FIR coefficients which has been determined using the selected pole parameter may be selected.
  • the selected pole parameter and the selected plurality of FIR coefficients may be the pole parameter and the plurality of FIR coefficients, which reduce (e.g. minimize) the entropy of the frame of the prediction error signal.
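The search over differently set pole parameters can be sketched as below. This is a simplified Python illustration under stated assumptions, not the method of the present document: the regressors are the delayed input passed through a chain of first-order allpass sections (the whitening pre-filter is omitted), the candidate grid is arbitrary, the prediction is rounded so that the error samples are integers, and all function names are hypothetical.

```python
from collections import Counter
from math import log2, sin


def entropy(samples):
    n = len(samples)
    return -sum((c / n) * log2(c / n) for c in Counter(samples).values())


def allpass(x, lam):
    # Assumed allpass: y[n] = -lam * x[n] + x[n-1] + lam * y[n-1]
    y, x_prev, y_prev = [], 0.0, 0.0
    for xn in x:
        y_prev = -lam * xn + x_prev + lam * y_prev
        x_prev = xn
        y.append(y_prev)
    return y


def solve(Q, P):
    # Gaussian elimination with partial pivoting on the augmented matrix.
    K = len(P)
    A = [row[:] + [p] for row, p in zip(Q, P)]
    for col in range(K):
        piv = max(range(col, K), key=lambda r: abs(A[r][col]))
        A[col], A[piv] = A[piv], A[col]
        for r in range(col + 1, K):
            f = A[r][col] / A[col][col]
            for c in range(col, K + 1):
                A[r][c] -= f * A[col][c]
    out = [0.0] * K
    for r in range(K - 1, -1, -1):
        out[r] = (A[r][K] - sum(A[r][c] * out[c] for c in range(r + 1, K))) / A[r][r]
    return out


def fit_for_pole(frame, lam, num_taps):
    """Least-squares coefficients and error entropy for one fixed pole parameter."""
    regs = [[0.0] + [float(v) for v in frame[:-1]]]  # overall one-sample delay
    for _ in range(num_taps - 1):
        regs.append(allpass(regs[-1], lam))
    Q = [[sum(a * b for a, b in zip(ri, rj)) for rj in regs] for ri in regs]
    P = [sum(a * b for a, b in zip(ri, frame)) for ri in regs]
    coeffs = solve(Q, P)
    pred = [sum(c * r[n] for c, r in zip(coeffs, regs)) for n in range(len(frame))]
    error = [xn - round(p) for xn, p in zip(frame, pred)]
    return coeffs, entropy(error)


def select_pole(frame, candidates, num_taps):
    """Return (pole parameter, coefficients, entropy) with the lowest entropy."""
    best = None
    for lam in candidates:
        coeffs, h = fit_for_pole(frame, lam, num_taps)
        if best is None or h < best[2]:
            best = (lam, coeffs, h)
    return best


frame = [round(1000 * sin(0.05 * n) + 300 * sin(0.31 * n)) for n in range(256)]
candidates = [0.0, 0.3, 0.6, 0.9]
best_lam, best_coeffs, best_entropy = select_pole(frame, candidates, num_taps=2)
```

Because the candidate grid contains 0.0, the selected filter is never worse (in estimated entropy) than the plain FIR predictor of the same order on this frame.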
  • setting the pole parameter to a fixed first value may comprise estimating a frequency based on the frame of the input signal.
  • a dominant frequency of the frame of the input signal may be estimated.
  • Estimating a frequency based on the frame of the input signal may comprise determining a spectral envelope of a spectrum of the frame of the input signal, and estimating the frequency of the frame of the input signal based on the spectral envelope (e.g. based on a maximum of the spectral envelope).
  • the first value for the pole parameter may be determined based on the estimated frequency, e.g. using a pre-determined look-up table or a pre-determined function.
  • the pre-determined look-up table or function may provide a mapping between a plurality of frequency values and a corresponding plurality of pole parameter values.
  • the pre-determined look-up table or function may be determined experimentally, e.g. using a training set of input signals.
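As an illustration of this initialization, the Python sketch below estimates a dominant frequency and maps it to a first pole parameter value. Two simplifications are assumed: the peak of a plain DFT magnitude is used instead of a spectral envelope, and the table entries in `FREQ_EDGES_HZ` and `POLE_VALUES` are placeholder numbers chosen only to make the example run. They are not values from the present document or from any training set.

```python
import cmath
import math
from bisect import bisect_right


def dominant_frequency(frame, sample_rate):
    """Frequency (Hz) of the largest DFT magnitude below the Nyquist frequency."""
    n = len(frame)
    best_bin, best_mag = 0, -1.0
    for k in range(1, n // 2):
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        if abs(acc) > best_mag:
            best_bin, best_mag = k, abs(acc)
    return best_bin * sample_rate / n


# Hypothetical look-up table: band edges in Hz and one pole value per band.
FREQ_EDGES_HZ = [500.0, 2000.0, 8000.0]
POLE_VALUES = [0.9, 0.6, 0.3, 0.0]


def initial_pole(freq_hz):
    """Return the table entry of the band that contains freq_hz."""
    return POLE_VALUES[bisect_right(FREQ_EDGES_HZ, freq_hz)]


tone = [math.sin(2 * math.pi * 5 * t / 64) for t in range(64)]
freq = dominant_frequency(tone, 48000)
first_value = initial_pole(freq)
```

The value returned by `initial_pole` would only serve as the fixed first value; a subsequent search may still move the pole parameter away from it.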
  • the z-transform of the general prediction filter may be representable as a ratio of a first and a second FIR filter (e.g. the filters A and B as described in the present document) comprising first and second sets of coefficients, respectively.
  • the first and second FIR filters may be filters in accordance with the True HD coder.
  • the method may further comprise mapping the determined pole parameter and the determined plurality of FIR coefficients to the first and second sets of coefficients.
  • the mapping may make use of formulas (e.g. the formulas described in the present document) for determining the first and second sets of coefficients from the determined pole parameter and from the determined plurality of FIR coefficients.
  • the formulas may provide for an exact bi-directional transformation of the first and second sets of coefficients and of the determined pole parameter and the determined plurality of FIR coefficients.
  • the formulas may yield an approximation of the general prediction filter described by the determined pole parameter and the determined plurality of FIR coefficients.
  • the mapping may comprise quantizing of the first and second sets of coefficients.
  • the general prediction filter may be used in conjunction with incumbent IIR-based encoders such as the True HD coder, thereby allowing the reuse of an already existing installed base of decoders.
  • a method for encoding a frame of an input signal using a general prediction filter comprises determining the general prediction filter using the methods described in the present document. Furthermore, the method comprises determining an estimate of the frame of the input signal using the determined general prediction filter. A frame of a prediction error signal may be determined based on the estimated frame and the frame of the input signal (e.g. based on the difference). The method may comprise encoding information indicative of the determined general prediction filter; and encoding the frame of the prediction error signal (e.g. using an entropy encoder). The information indicative of the determined general prediction filter may comprise the pole parameter.
  • an encoded signal e.g.
  • the encoded signal comprises information indicative of a general prediction filter to be used by a decoder for decoding the encoded signal.
  • the z-transform of the general prediction filter may be representable by a filter comprising (or having) the z-transform of a FIR filter with the z variable of the FIR filter being replaced by the z-transform of an allpass filter or an approximation of the z-transform of a FIR filter with the z variable of the FIR filter being replaced by the z-transform of an allpass filter.
  • the FIR filter may comprise a plurality of FIR coefficients and the allpass filter may exhibit a pole defined by a pole parameter.
  • the information indicative of the general prediction filter may comprise information indicative of the pole parameter.
  • a method for determining a look-up table providing a mapping between an estimated frequency of a frame of an input signal and a pole parameter defining a pole of an allpass filter is described.
  • the allpass filter may be used to provide a general prediction filter based on an FIR filter.
  • the method may comprise providing a training set of different frames of input signals.
  • the training set of frames may be used to estimate a corresponding set of frequencies for the training set of frames.
  • a set of pole parameters may be determined which provide general prediction filters that reduce an entropy of frames of prediction error signals.
  • the set of pole parameters may be determined using the methods described in the present document.
  • the method may comprise determining the look-up table based on the set of frequencies and based on the corresponding set of pole parameters. In particular, clustering techniques may be used to determine the look-up table from the set of frequencies and the corresponding set of pole parameters.
  • a method for decoding an encoded signal is described. The encoded signal may have been encoded as described in the present document.
  • the method may comprise receiving information indicative of a pole parameter of an allpass filter.
  • the allpass filter may be used to provide a general prediction filter based on an FIR filter comprising a plurality of FIR coefficients.
  • the method may comprise receiving information indicative of the plurality of FIR coefficients.
  • the general prediction filter may be determined based on the received information indicative of the pole parameter and based on the received information indicative of the plurality of FIR coefficients.
  • the general prediction filter may be used to decode the encoded signal.
  • the method may comprise decoding a frame of a prediction error signal (comprised within the encoded signal).
  • a frame of an estimated input signal (also referred to as the estimated decoded signal) may be determined based on the decoded frame of the prediction error signal and based on the FIR prediction filter.
  • a decoded frame of the encoded signal may be determined based on the frame of the estimated input signal and based on the decoded frame of the prediction error signal.
  • an encoder e.g. an audio encoder or an image encoder configured to determine a general prediction filter for a frame of an input signal.
  • the z-transform of the general prediction filter may be indicative of (or may correspond to) the z-transform of a FIR filter with the z variable of the FIR filter being replaced by the z-transform of an allpass filter or of an approximation to the z-transform of a FIR filter with the z variable of the FIR filter being replaced by the z-transform of an allpass filter.
  • the FIR filter may comprise a plurality of FIR coefficients.
  • the allpass filter may exhibit a pole defined by an adjustable pole parameter.
  • the encoder may be configured to determine the pole parameter and the plurality of FIR coefficients, such that an entropy of a frame of a prediction error signal is reduced (e.g. minimized).
  • the frame of the prediction error signal is derived from the frame of the input signal using the general prediction filter, wherein the general prediction filter is defined by the pole parameter and the plurality of FIR coefficients.
  • a decoder e.g. an audio decoder or an image decoder for decoding an encoded signal (e.g. an encoded audio signal or an encoded image signal)
  • the decoder may be configured to extract information indicative of a pole parameter of an allpass filter from the encoded signal.
  • the allpass filter may be used to provide a general prediction filter based on an FIR filter comprising a plurality of FIR coefficients.
  • the decoder may be further configured to extract information indicative of the plurality of FIR coefficients from the encoded signal.
  • the decoder may be configured to determine the general prediction filter based on the extracted information indicative of the pole parameter and based on the extracted information indicative of the plurality of FIR coefficients.
  • the general prediction filter may be used by the decoder to decode the encoded signal.
  • a method for decoding a frame of an encoded signal using a general prediction filter is described.
  • the frame of the encoded signal may be indicative of coefficients of the general prediction filter.
  • the general prediction filter may comprise an IIR prediction filter.
  • the frame of the encoded signal may be indicative of a frame of a prediction error signal.
  • the method may comprise extracting (indications of) coefficients of the general prediction filter from the encoded signal.
  • the coefficients of the general prediction filter may have been determined using the methods described in the present document.
  • the method may comprise decoding the frame of the prediction error signal (e.g. using a de-quantizer).
  • the method may proceed in determining a frame of an estimated decoded signal based on the decoded frame of the prediction error signal and based on the general prediction filter. Furthermore, the method may comprise determining a decoded frame of the encoded signal based on the frame of the estimated decoded signal and based on the decoded frame of the prediction error signal. In particular, the decoded frame of the encoded signal may be determined by adding corresponding samples of the frame of the estimated decoded signal and of the decoded frame of the prediction error signal.
  • a decoder for decoding an encoded signal is described. The encoded signal may be indicative of coefficients of a general prediction filter and of samples of a prediction error signal.
  • the decoder may comprise means for extracting coefficients of the general prediction filter from the encoded signal.
  • the coefficients of the general prediction filter may have been determined using the methods described in the present document.
  • the coefficients may be associated with a frame of the encoded signal.
  • the decoder may comprise means for decoding a frame of the prediction error signal, e.g. using a de-quantizer.
  • the decoder may comprise means for determining a frame of an estimated decoded signal based on the decoded frame of the prediction error signal and based on the general prediction filter.
  • the decoder may comprise means for determining a decoded frame of the encoded signal based on the frame of the estimated decoded signal and based on the decoded frame of the prediction error signal.
  • a software program is described. The software program may be adapted for execution on a processor and for performing the method steps outlined in the present document when carried out on the processor.
  • a storage medium is described. The storage medium may comprise a software program adapted for execution on a processor and for performing the method steps outlined in the present document when carried out on the processor.
  • the computer program may comprise executable instructions for performing the method steps outlined in the present document when executed on a computer.
  • Fig. 1a shows an example short-term spectrum of an audio signal comprising an excerpt of music
  • Fig. 1b shows a block diagram of an example encoder and decoder using linear prediction
  • Fig. 2a illustrates example spectra of an audio signal and of prediction error signals determined using FIR prediction and IIR prediction, respectively;
  • Figs. 2b and 2c show block diagrams of example encoders and decoders using linear prediction
  • Figs. 2d and 2e show block diagrams of an example encoder and decoder using IIR based linear prediction, respectively;
  • Fig. 3a illustrates the concept of frequency warping
  • Fig. 3b illustrates block diagrams of an example encoder and decoder using warped linear prediction (WLP);
  • WLP: warped linear prediction
  • Fig. 3c shows example prediction results obtained using linear prediction and warped linear prediction
  • Fig. 4 shows block diagrams of an example encoder and decoder using linear prediction based on Laguerre filters
  • Fig. 5a illustrates an experimental relationship between example adjustable pole parameters and signal frequency such that entropy of the signal is minimized
  • Figs. 5b and 5c illustrate example entropy reductions which are achievable when using IIR based linear prediction
  • Fig. 6 shows a flow chart of an example method for determining the filter coefficients of an adjustable FIR/IIR based linear predictor.
  • aspects described in the context of an audio signal are also applicable to prediction-based encoding of other types of signals, e.g. of image signals such as lines or columns of an image frame.
  • aspects described in the present document are applicable to lossless audio coding, as well as to lossless image coding.
  • linear prediction is frequently used to reduce the entropy of an input audio signal, thereby yielding a prediction error signal having reduced entropy.
  • linear prediction is directed at removing redundancies from the input audio signal, thereby yielding a decorrelated prediction error signal. If the values of future audio samples of the input audio signal can be estimated, then only the rules of prediction need to be transmitted along with the difference between the estimated signal and the actual signal, i.e. along with the prediction error signal.
  • the prediction is typically performed by a so called decorrelator (so called because when optimally adapted there is no correlation between the currently transmitted sample of the prediction error signal and its previous samples).
  • Fig. 1a shows the short-term spectrum 100 of an example input audio signal (e.g. an excerpt of a music track). It can be seen that the spectrum 100 is not flat, and it is an objective of the decorrelator to flatten the spectrum 100 using prediction techniques, thereby yielding coding gains. Ideally the decorrelator yields a prediction error signal having a flat spectrum 101, i.e. a prediction error signal which essentially corresponds to white noise.
  • the Gerzon-Craven theorems show that the level of the optimally decorrelated prediction error signal is given by the average of the original signal spectrum when plotted as decibels versus linear frequency. As illustrated in Fig.
  • this decibel average can have significantly less power than the original audio signal, thereby leading to a reduction in data rate when encoding the prediction error signal compared to encoding the input audio signal.
  • the power reduction achieved by the (optimal) decorrelator represents the information content of the input audio signal as defined by Shannon.
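The decibel-average statement can be written out explicitly. The expression below is the standard Kolmogorov-Szegő form of this result, given as an assumed restatement rather than a formula quoted from the present document. The symbols $S(\omega)$ (power spectral density of the input signal, with $\omega = \pi$ at the Nyquist frequency) and $\sigma_e^2$ (power of the optimally decorrelated prediction error signal) are introduced here for illustration.

```latex
\sigma_e^2 = \exp\!\left( \frac{1}{\pi} \int_0^{\pi} \ln S(\omega)\, d\omega \right)
\quad\Longleftrightarrow\quad
10 \log_{10} \sigma_e^2 = \frac{1}{\pi} \int_0^{\pi} 10 \log_{10} S(\omega)\, d\omega
```

The right-hand form says that the error level in decibels equals the mean of the spectrum in decibels taken over a linear frequency axis. For a spectrum that is already flat the two sides coincide with the signal power, so no reduction is available.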
  • Fig. 1b illustrates the block diagram of an example FIR based decorrelator 110 (or encoder) and a corresponding example FIR based re-correlator 120 (or decoder).
  • a sample of the input audio signal $x$ 111 is predicted based on a plurality of previous samples of the input audio signal $x$ 111 using the plurality of filter coefficients $a_k$ 112, thereby yielding a sample of the predicted or estimated audio signal $\hat{x}$ 113.
  • the residual signal 114 typically exhibits reduced entropy compared to the input audio signal 111.
  • the residual signal 114 may be encoded using an appropriate entropy-coding scheme (e.g. using a Rice code, or Huffman coding, or Arithmetic coding), thereby providing a lossless audio coding scheme.
  • the plurality of filter coefficients $a_k$ 112 may be determined by the decorrelator 110 on a frame-by-frame basis using the samples of a frame of the input audio signal 111.
  • the plurality of filter coefficients $a_k$ 112 may be determined such that the mean squared energy of the prediction error signal 114 is reduced (e.g. minimized). This may be achieved in an efficient manner using the Levinson-Durbin algorithm.
  • a lossless audio coder may be provided by first removing the redundancy from the input audio signal 111 (e.g. using linear prediction techniques) and by then coding the resulting prediction error signal 114 with an efficient entropy-coding scheme.
  • the encoded signal comprises for each frame of the input audio signal 111 a representation of the plurality of filter coefficients $a_k$ 112 and the entropy-encoded samples of the frame of the prediction error signal 114.
  • the re-correlator 120 (also referred to as the decoder) performs corresponding steps to the decorrelator 110.
  • the re-correlator 120 uses the same FIR filter comprising the same plurality of filter coefficients $a_k$ 112 to reconstruct the input audio signal 111 from the residual audio signal $r$ 114.
  • the degree to which an input audio signal can be "whitened" depends on the content of the input audio signal 111 and on the complexity (e.g. the number K of coefficients and/or the structure) of the prediction filter. Infinite complexity (e.g. an infinite number K of filter coefficients) could theoretically achieve a prediction at the entropy level 101 shown in Fig. 1a. However, all the coefficients that define such a decorrelator 110 would then need to be transmitted to the decoder 120 (in addition to the prediction error signal 114) to reconstruct (recorrelate) the input audio signal 111. There is therefore a need to obtain a balance between predictor complexity (e.g. the number K of filter coefficients and/or the type of the prediction filter, e.g. FIR or IIR) and performance.
  • lossless audio coders make use of an FIR-based predictor or decorrelator 110.
  • IIR-based predictors or decorrelators 110 may be beneficial in situations where the control of peak data rates is important.
  • a further situation where IIR-based decorrelators 110 may be beneficial is where the spectrum 100 of the input audio signal 111 exhibits a relatively wide dynamic range. In such a situation, compression gains may be expected, in particular for relatively high sampling rates.
  • IIR-based predictors show an improvement over FIR-based predictors of approx.
  • IIR-based predictors are increasingly beneficial for encoding input audio signals 111 having an increasingly high ratio of sampling rate over signal bandwidth.
  • FIR8 8 coefficients
  • IIR4 IIR decorrelator
  • the spectrum of the input audio signal 111 rolls off at frequencies lower than the Nyquist frequency (which is at half of the sampling frequency). This implies that the spectrum of the prediction error signal created with an FIR filter will also roll off at significantly lower frequencies than the Nyquist frequency. On the other hand, the spectrum of the prediction error signal created with an IIR filter will typically be flat up to the Nyquist frequency.
  • Fig. 2b shows an example block diagram of a conventional prediction architecture, where in an encoder 210 a prediction filter is used to determine an estimated signal which is subtracted from the input signal, thereby yielding the prediction error signal. At the decoder 220, the same prediction filter may be used to reconstruct the input signal.
  • the prediction architecture of Fig. 2b may be used for lossy IIR-based prediction coders; however, the architecture of Fig. 2b typically cannot be used for lossless compression. This is due to the fact that the output of the prediction filter 220 in Fig. 2b typically has a longer word length than the input signal because of the multiplication by fractional coefficients. As the transmitted data rate depends on the total word length, extending the word size would be counterproductive.
  • This problem may be overcome by quantizing the output of the prediction filter at the encoder 210, i.e. by quantizing the estimated signal using a quantizer 216.
  • This is illustrated in Fig. 2c, where the output of the prediction filter at the encoder 210 is quantized so that the transmitted prediction error signal has the same word length as the input signal.
  • the decoder 220 can recover the original signal despite the fact that the side chain is nonlinear.
  • the use of a quantizer 216 assumes that the predictors of the decoder 220 and the encoder 210 produce outputs which, when requantized, correspond exactly.
  • A possible architecture for overcoming this technical problem is illustrated in Figs. 2d and 2e for the encoder 210 and the corresponding decoder 220, respectively.
  • the input and output signals are both quantized and as filters A 212 and B 213 are both FIR filters, the input to the quantizer Q 216 is a finite precision signal, and the quantization can therefore be specified precisely.
  • the total response of the predictor in the encoder 210 and in the decoder 220 is IIR.
  • the architecture of Figs. 2d and 2e illustrates an IIR predictor which is portable across different hardware platforms, even when used for lossless encoding.
  • the encoder 210 of Fig. 2d determines a prediction error signal 214 from the input signal 111.
  • the prediction error signal 214 typically has a reduced entropy compared to the input signal 111 and can be entropy encoded (e.g. using a Rice code, Huffman coding, or arithmetic coding).
  • optimal prediction coefficients can be obtained using the Levinson-Durbin algorithm.
  • for IIR-based predictors there is no such efficient algorithm for obtaining the optimal IIR prediction coefficients.
  • the present document addresses the technical problem of determining the coefficients of an IIR-based decorrelator in an efficient manner such that the entropy of the prediction error signal is reduced (e.g. minimized).
  • WLP Warped Linear Prediction
  • LLP Laguerre Linear Prediction
  • Frequency warped processing may be used to process audio signals according to the frequency resolution of the human auditory system.
  • the frequency range of an input signal may be mapped to a warped frequency range, thereby modeling the frequency resolution of the human auditory system.
  • Fig. 3a, where it is shown how an original frequency range 301 may be warped into a warped frequency range 302.
  • a Bark scale may be used to warp the original frequency range.
  • frequency warping may be implemented by replacing the delays 115 of an FIR prediction filter with an allpass filter
  • $A(z) = \frac{z^{-1} - \lambda}{1 - \lambda z^{-1}}, \quad |\lambda| < 1$,
  • Fig. 3b illustrates a modified encoder 310 and a modified decoder 320, where the delay units 115 have been replaced by allpass filters A(z) 315.
  • the optimal coefficients of the allpass filters A(z) 315 for a fixed pole parameter λ may be determined using the Levinson-Durbin algorithm.
  • Fig. 3c illustrates how a WLP based encoder 310 approximates an input signal 111.
  • WLP provides prediction error signals which are not whitened in the original frequency domain. This problem may be overcome by whitening the prediction error signal using a residual post-filter
  • optimal WLP coefficients can be obtained using a pre-filter
  • the pre-filter is typically not applied in the prediction filtering operation.
  • the pre-filter W(z) may be used when determining the optimal prediction coefficients k and the pole parameter λ.
  • the determined filter coefficients may be used without the additional pre-filter W(z).
  • a post-filter or a pre-filter whitens the prediction error signal, it is typically not possible to implement a synthesis filter at the decoder 320 because of delay-free loops.
  • This technical problem may be solved by adding an explicit delay unit 115 to the encoder and the decoder, thereby yielding a so called Laguerre Linear Prediction (LLP) scheme which is illustrated in Fig. 4 showing an encoder 410 and a corresponding decoder 420.
  • the pre-filtering using the whitening filter W(z) may also be absorbed into the prediction structure, thereby yielding so called Laguerre filters 411
  • the encoder and decoder structure of Fig. 4 correspond to an FIR-based linear predictor.
  • the encoder 410 receives an input signal 111 and determines an estimated signal 413 using the decorrelator comprising the delay unit 115, the Laguerre filters 411 and respective filter coefficients 412 (referred to as LLP coefficients).
  • the estimated signal 413 is subtracted from the input signal 111, thereby yielding the prediction error signal 414.
  • the corresponding decoder 420 performs the corresponding operations to reconstruct the input signal 111.
  • the decoder 420 receives the LLP coefficients 412 and uses a delay unit 115, the Laguerre filters 411 and the received LLP coefficients 412 to reconstruct the input audio signal 111 from the prediction error signal 414.
  • the estimated signal x 413 may be
  • the LLP coefficients 412 are usually optimized to minimize the mean squared energy of the prediction error signal r 414 (within the frame for which the LLP coefficients 412 are determined).
  • the regressor signals $y_k$ can be derived from the input signal
  • $Y_k(z) = z^{-1} H_k(z) \cdot X(z)$
  • $X(z)$ and $Y_k(z)$ are the z-transforms of $x$ and $y_k$, respectively, and where $H_k(z)$ are stable and causal IIR filters.
  • the encoder 410 and decoder 420 may be transformed in accordance to the encoder 210, 220 of Figs. 2d and 2e, respectively. This means that the encoder 410 and decoder 420 effectively provide an IIR based decorrelator when using a pole parameter λ which is different from zero.
  • Laguerre filters 411 for implementing a decorrelator has several advantages.
  • the encoder / decoder of Fig. 4 can be implemented using the predictor structure of Figs. 2d and 2e, wherein perfect reconstruction is guaranteed even in case of finite word length arithmetic. Furthermore, stability of the used synthesis filter is guaranteed for such Laguerre (and possibly Kautz) filters.
  • efficient autocorrelation based methods exist (similar to the ones used in linear prediction) for deriving optimal filter coefficients 412.
  • the prediction error signal 414 exhibits spectral flatness on the original frequency scale 301.
  • the pole parameter λ (which defines the pole of the allpass filter) provides an extra degree of freedom. It is proposed in the present document to use this extra degree of freedom to provide for an additional reduction (e.g. a minimization) of the entropy of the prediction error signal 414. By doing this, an optimal combination of FIR/IIR filters may be determined for each block or frame of the input audio signal 111.
  • the encoder 410 of Fig. 4 preserves desirable qualities of WLP.
  • the encoder 410 may provide improved perceptual noise shaping for cascaded lossy data compression with a lossless kernel.
  • the quantization noise is shaped according to prediction parameters which model the spectral envelope of the signal (similar to spectral noise shaping as used in linear prediction based speech codecs). Warped linear prediction typically models spectral envelopes perceptually better, and is therefore better suitable for lossy coding. As such, the encoder 410 (which preserves the properties of WLP) provides an improved perceptual noise shaping.
  • the methods described in the present document provide an FIR prediction filter.
  • the pole parameter λ may be used to reduce the entropy of the prediction error signal 414. This may be achieved e.g. by using a brute force approach.
  • the pole parameter λ (and the corresponding pole of the allpass filter A(z)) may be varied from -0.9 to +0.9, and the pole parameter λ which produces a prediction error signal 414 with the least entropy may be selected.
  • the pole parameter λ may be varied from -0.9 to 0.9 in steps of 0.1. For each pole parameter λ, the optimal LLP coefficients 412 are determined and the residual signal 414 and its entropy are determined.
  • the pole parameter λ for which the entropy of the residual signal 414 is reduced (e.g. is minimal) may be selected, and the (entropy encoded) residual signal 414 and the LLP coefficients 412 for the selected pole parameter λ may be transmitted to the decoder 420.
  • the determined LLP coefficients P k 412 may be transformed into filter coefficients for the filters A 212 and B 213 which are used by the encoder 210 and decoder 220 of Fig. 2d and 2e (used e.g. in the Dolby True HD coder). This is beneficial, as it allows the reuse of an existing encoder / decoder structure, as well as the reuse of quantization and coding routines.
  • the benefits of using an IIR-based decorrelator have been tested using a sine sweep ranging from 0 to 24 kHz, sampled with 16 bits/sample and with a sampling rate of 48 kHz.
  • the performance of FIR-based decorrelators using an FIR predictor of order 4 (FIR4) and an FIR predictor of order 8 (FIR8) was compared to the performance of an IIR-based decorrelator using an IIR predictor of order 4 (IIR4).
  • the tests were performed for different frame sizes of the input audio signal 111, i.e. for different predictor analysis frame sizes.
  • the example results are shown in Table 1.
  • the optimal pole parameter λ has an almost linear relationship to the frequency of the input audio signal 111.
  • the optimal pole parameter λ 501 i.e. the pole parameter which provides LLP coefficients 412 which minimize the entropy of the prediction error signal 41
  • the input audio signal 111 comprises a sine sweep; therefore the x-axis 502 can be thought of as the frequency increasing with time.
  • This observation can be used to provide efficient schemes for determining the pole parameter λ which provides (almost optimal) LLP coefficients 412. It should be noted that for a particular frequency (around frame number 150), the optimal pole parameter λ 501 is zero, thereby indicating that for an input signal 111 at this frequency, the use of an FIR predictor is optimal.
  • Fig. 5b illustrates the entropy reduction (measured in bits/sample) which is possible when using an IIR4 predictor compared to the situation when using a FIR4 predictor.
  • the input audio signal 111 comprises a sine sweep such that the x-axis 502 may be viewed as the frequency of the input audio signal 111.
  • the entropy reduction 503 which may be achieved when using an IIR predictor varies with the frequency of the input signal.
  • Fig. 5c where the entropy reduction (measured in bits/sample) which is possible when using an IIR4 predictor compared to the situation when using a FIR8 predictor is illustrated.
  • the FIR8 requires the determination and transmission of eight filter coefficients.
  • a look-up table may be determined offline, wherein the look-up table provides a mapping between an estimated frequency of a frame of the input audio signal 111 and a corresponding pole parameter λ which minimizes the entropy of the prediction error signal 414.
  • the look-up table may be determined based on a training set comprising a plurality of input audio signals 111.
  • the look-up table will typically have a mapping distribution similar to the one illustrated in Fig. 5a.
  • the encoder 410 may be configured to use the pre-determined look-up table to determine the pole parameter λ which is to be used to calculate the LLP coefficients 412 for a particular frame of an input audio signal 111.
  • the encoder 410 may employ a frequency estimation method, and estimate the (dominant) frequency content of the particular frame of the input signal 111.
  • the encoder may employ a low-order linear predictor and estimate the spectral envelope of the particular frame of the input audio signal 111.
  • the estimated (dominant) frequency may correspond to the peak of the spectral envelope. Once the dominant frequency is estimated, the encoder 410 may look up the corresponding optimal entropy minimizing pole parameter λ from the look-up table.
  • This entropy minimizing pole parameter λ may be used to determine optimal LLP coefficients 412 which minimize the power of the corresponding frame of the prediction error signal 414 (using a Levinson-Durbin type algorithm).
  • the determined LLP coefficients 412 may optionally be mapped to the prediction structure of Fig. 2d, in order to make use of existing Dolby TrueHD quantization and coding techniques.
  • various other methods may be used to determine the pole parameter λ.
  • a hybrid method for determining the optimal entropy minimizing pole parameter λ may make use of a combination of a look-up table and a brute force search. For instance, a look-up table may be used to determine a first estimate of the optimal pole parameter λ.
  • the value for λ may be chosen which minimizes entropy. For example, if the looked-up value of λ is 0.7, one could evaluate other values of λ in the range from 0.6 to 0.8 in addition to 0.7 (and possibly the value 0, in order to verify whether the FIR predictor provides a better solution than the IIR predictor).
  • Fig. 6 illustrates an example method 600 for determining an IIR prediction filter for performing decorrelation of an input signal.
  • a frame of samples of the input signal 111 is selected for determining an IIR prediction filter.
  • a dominant frequency of the selected frame is estimated.
  • the estimated frequency may be used to determine a pole parameter λ (step 603), e.g. using a pre-determined look-up table.
  • LLP prediction coefficients may be determined (step 604).
  • the LLP prediction coefficients may optionally be transformed into an explicit FIR and IIR filter representation comprising e.g. filter A 212 and filter B 213 (step 605).
  • the method may be implemented in an efficient manner and allows for the determination of IIR prediction filter coefficients which minimize the entropy of the prediction error signal. As such, the method enables the implementation of audio coding schemes having increased coding gains.
  • the IIR-based decorrelator may be used in conjunction with an entropy encoder of the prediction error signal to provide a lossless audio coder.
  • the method may be used to adaptively switch between FIR and IIR based linear prediction on a frame-by-frame basis, in order to minimize the entropy of the prediction error signal.
  • the IIR-based decorrelator is compliant with existing Dolby True HD coders, thereby enabling the reuse of already deployed Dolby True HD decoders.
  • the methods and systems described in the present document may be implemented as software, firmware and/or hardware. Certain components may e.g. be implemented as software running on a digital signal processor or microprocessor. Other components may e.g. be implemented as hardware and/or as application specific integrated circuits.
  • the signals encountered in the described methods and systems may be stored on media such as random access memory or optical storage media. They may be transferred via networks, such as radio networks, satellite networks, wireless networks or wireline networks, e.g. the Internet. Typical devices making use of the methods and systems described in the present document are portable electronic devices or other consumer equipment which are used to store and/or render audio signals.

Abstract

The present document relates to coding. In particular, the present document relates to coding using linear prediction in combination with entropy encoding. A method (600) for determining a general prediction filter for a frame of an input signal (111) is described. The z-transform of the general prediction filter comprises an approximation to the z-transform of a finite impulse response, referred to as FIR, filter with the z variable of the FIR filter being replaced by the z-transform of an allpass filter. The FIR filter comprises a plurality of FIR coefficients (412). The allpass filter exhibits a pole defined by an adjustable pole parameter. The method (600) comprises determining the pole parameter and the plurality of FIR coefficients, such that an entropy of a frame of a prediction error signal (414) which is derived from the frame of the input signal (111) using the general prediction filter defined by the pole parameter and the plurality of FIR coefficients (412) is reduced.

Description

Signal Adaptive FIR/IIR Predictors for Minimizing Entropy
TECHNICAL FIELD
The present document relates to coding. In particular, the present document relates to lossless coding using linear prediction, possibly in combination with entropy encoding.
BACKGROUND
Audio encoders, and in particular lossless audio encoders, typically employ an FIR (Finite Impulse Response) prediction filter to reduce the entropy of an audio signal. Employing an IIR (Infinite Impulse Response) prediction filter may lead to improved prediction results and to reduced entropy of the prediction error signal. IIR prediction filters may e.g. be used in the so-called Dolby TrueHD lossless encoder. However, unlike for FIR predictors, it is typically difficult to derive, on a frame-by-frame basis, optimal IIR prediction coefficients that guarantee the stability of the predictor system (for the encoder) and its inverse system (for the decoder).
The present document addresses the above-mentioned technical problem. In particular, the present document describes methods for determining the coefficients of IIR-based prediction filters which lead to improved prediction results (i.e. which lead to a reduction of the entropy of the prediction error signal). The IIR prediction filters may be determined such that stability may be guaranteed. As such, the methods described in the present document enable the use of IIR-based prediction, thereby providing audio encoders (in particular lossless audio encoders) with improved coding gains.
SUMMARY
According to an aspect a method for determining a general prediction filter for a frame of an input signal is described. The general prediction filter may be determined such that it is ensured that the determined general prediction filter is stable. Typically the frame of the input signal (e.g. an audio signal such as a speech signal or a music signal, or an image signal, e.g. a line or a column of an image) comprises a plurality of samples (e.g. 50 or more, or 100 or more samples). The general prediction filter may comprise an infinite impulse response (IIR) prediction filter. In general terms, the general prediction filter may comprise an IIR prediction filter component and/or an FIR prediction filter component. The z-transform of the general prediction filter may be represented as a ratio of an FIR filter in the numerator and an FIR filter in the denominator. In particular, the z-transform of the general prediction filter (also referred to as the transfer function of the general prediction filter or the z-transform of the impulse response of the general prediction filter) may be presented in a form which comprises an approximation to the z-transform of a finite impulse response (FIR) filter with the z variable of the FIR prediction filter being replaced by the z-transform of an allpass filter. By way of example, the general prediction filter may be presented in a form which comprises the z-transform of a FIR filter with the z variable of the FIR prediction filter being replaced by the z-transform of an allpass filter. In other words, it is proposed to make use of a general prediction filter (which may comprise an IIR prediction filter) which may be derived by replacing the delays of an FIR prediction filter with allpass filters. The FIR filter typically comprises a plurality (K with K>1, e.g. K= 4 or 8 or more) of FIR coefficients. The allpass filter may exhibit a pole defined by an adjustable pole parameter λ.
As such, the general prediction filter may be defined by the plurality of FIR coefficients and by the pole parameter λ. In an embodiment, the allpass filter exhibits a single pole defined by a single adjustable pole parameter.
As indicated above, the z-transform of the general prediction filter may be derived from (an approximation of) the z-transform of an FIR filter with the z variable of the FIR prediction filter being replaced by the z-transform of an allpass filter. In particular, the general prediction filter may be determined by first determining an intermediate general prediction filter having a z-transform which (exactly) comprises the z-transform of an FIR filter with the z variable of the FIR prediction filter being replaced by the z-transform of an allpass filter. The coefficients of the intermediate general prediction filter may then be approximated (e.g. the coefficients may be quantized), thereby yielding the coefficients of the general prediction filter. As a consequence of the approximation of the coefficients of the intermediate general prediction filter, the z-transform of the general prediction filter comprises an approximation of the z-transform of an FIR filter with the z variable of the FIR prediction filter being replaced by the z-transform of an allpass filter. The approximation may be due to the quantization of filter coefficients and/or due to the transformation of the FIR filter coefficients and the pole parameter to an IIR filter representation (as described below in the context of the "mapping" feature). The pole parameter λ may be used to adapt the general prediction filter between an FIR prediction filter and an IIR prediction filter. In other words, the method may yield an adaptive general prediction filter which may adapt its filter structure (i.e. IIR structure or FIR structure) to the frame of the input signal using one or more pole parameters λ. It should be noted that a general prediction filter having an IIR structure typically also comprises an FIR filter component. On the other hand, a general prediction filter having an FIR structure typically only comprises an FIR filter component.
By way of example, the z-transform of the allpass filter may comprise the z-transform of the following allpass filter

$$A(z) = \frac{z^{-1} - \lambda}{1 - \lambda z^{-1}}, \quad |\lambda| < 1,$$

with λ being the pole parameter adjustable between values ±1. In particular, the pole parameter may be unequal to zero, thereby providing a general prediction filter which exhibits an infinite impulse response. On the other hand, if the pole parameter is determined to be zero, the general prediction filter typically corresponds to an FIR prediction filter. This means that for the particular frame of the input signal, entropy minimization may be achieved using an FIR prediction filter without the need of providing an IIR prediction filter. Furthermore, the z-transform of the general prediction filter may comprise a pre-filter configured to whiten a spectrum of the prediction error signal. By whitening the spectrum of the prediction error signal, the entropy encoding of the prediction error signal may be performed with increased efficiency. In addition, the z-transform of the general prediction filter may comprise an overall delay. By inserting an overall delay, it may be ensured that the general prediction may be performed in a causal manner.
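The behaviour of such an allpass filter may be illustrated by a minimal Python sketch. It assumes the first-order form A(z) = (z^-1 - λ)/(1 - λ z^-1), which corresponds to the difference equation y[n] = -λ x[n] + x[n-1] + λ y[n-1]. The function name `allpass` and the zero initial state are assumptions made for illustration only.

```python
def allpass(x, lam):
    """Apply A(z) = (z^-1 - lam) / (1 - lam z^-1) to the sequence x.

    Illustrative sketch: the filter state is assumed to be zero at the
    start of the frame.
    """
    if not -1.0 < lam < 1.0:
        raise ValueError("the pole parameter must satisfy |lam| < 1")
    y = []
    x_prev = 0.0  # x[n-1]
    y_prev = 0.0  # y[n-1]
    for x_n in x:
        y_n = -lam * x_n + x_prev + lam * y_prev
        y.append(y_n)
        x_prev, y_prev = x_n, y_n
    return y


# With lam = 0 the allpass filter degenerates to a unit delay (FIR case).
delayed = allpass([1, 2, 3, 4], 0.0)

# With lam != 0 the impulse response is infinite but its energy stays at 1.
impulse_response = allpass([1.0] + [0.0] * 199, 0.5)
energy = sum(v * v for v in impulse_response)
```

In this sketch, a pole parameter of zero reproduces the plain delay of an FIR predictor, whereas a non-zero pole parameter yields an infinite impulse response with unit energy, as expected of an allpass filter.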
In a particular example, the z-transform of the general prediction filter may be representable as

$$\sum_{k=1}^{K} z^{-1} \beta_k H_k(z),$$

with $k = 1, \ldots, K$, with $K > 1$, with

$$H_k(z) = \frac{\sqrt{1-\lambda^2}}{1-\lambda z^{-1}} \left( \frac{z^{-1}-\lambda}{1-\lambda z^{-1}} \right)^{k-1},$$

and with $\beta_k$, $k = 1, \ldots, K$, being the plurality of FIR coefficients (412). It can be seen that the general prediction filter comprises an overall delay $z^{-1}$ and that each filter component $H_k(z)$ comprises a pre-filter $\frac{\sqrt{1-\lambda^2}}{1-\lambda z^{-1}}$ for whitening purposes.

The method may comprise determining the pole parameter and the plurality of FIR coefficients, such that an entropy of a frame of a prediction error signal which is derived from the frame of the input signal using the general prediction filter defined by the pole parameter and the plurality of FIR coefficients is reduced (e.g. is minimized). The general prediction filter may be used to determine a frame of an estimated signal (e.g. an estimated audio signal or an estimated image signal) from the frame of the input signal. The difference between the frame of the estimated signal and the frame of the input signal may provide the frame of the prediction error signal. The pole parameter and the plurality of FIR coefficients may specify the general prediction filter, and the general prediction filter may be adjusted such that the entropy of the frame of the prediction error signal is reduced (e.g. minimized).
The entropy of the frame of the prediction error signal may be estimated by determining a probability distribution of the values of samples of the frame of the prediction error signal. The entropy may be estimated based on a weighted sum of the probability distribution. The weighted sum of the probability distribution may be given by $\sum_i P_i \cdot \log_b(P_i)$ (the entropy being the negative of this sum), with $P_i$ being the probability of the value of a sample of the prediction error signal and with b being the base of the log function (e.g. b=2 or 10 or e, i.e. Euler's number). Determining the pole parameter and the plurality of FIR coefficients may comprise setting the adjustable pole parameter to a fixed first value and determining the plurality of FIR coefficients using the set pole parameter. For a fixed or set pole parameter, determining the plurality of FIR coefficients may comprise determining the plurality of FIR coefficients such that a mean squared power of the frame of the prediction error signal is reduced. In view of the fact that the general prediction filter is derived from an FIR filter, this target may be achieved by solving a set of normal equations (e.g. using a Levinson-Durbin algorithm). By way of example, for a fixed or a set pole parameter, determining the plurality of FIR coefficients may comprise determining a frame of a regressor signal based on the frame of the input signal for each tap of the general prediction filter (i.e. for each filter component $H_k(z)$), thereby yielding a plurality of regressor signal frames. The plurality of regressor signal frames may be used to determine an autocorrelation matrix Q for the plurality of regressor signal frames. The size of the autocorrelation matrix Q typically depends on the number K of FIR coefficients which are to be determined. Furthermore, a cross-correlation vector P may be determined based on the plurality of regressor signal frames and the frame of the input signal. An FIR coefficient vector β comprising the plurality of FIR coefficients may be determined by solving the normal equations $Q\beta = P$.
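The determination of the FIR coefficients for a set pole parameter may be sketched as follows. The sketch assumes the standard Laguerre form of the filter components, i.e. a pre-filter sqrt(1-λ²)/(1-λ z^-1) followed by k-1 first-order allpass sections, preceded by an overall delay. It solves the normal equations Qβ = P by plain Gaussian elimination rather than by a Levinson-Durbin type recursion. The function names `laguerre_regressors` and `solve_normal_equations` are hypothetical.

```python
import math


def laguerre_regressors(x, lam, K):
    """Return K regressor frames y_k = z^-1 H_k(z) x (assumed Laguerre form)."""
    gain = math.sqrt(1.0 - lam * lam)
    # First stage: pre-filter sqrt(1 - lam^2) / (1 - lam z^-1).
    stage, state = [], 0.0
    for x_n in x:
        state = gain * x_n + lam * state
        stage.append(state)
    frames = []
    for _ in range(K):
        frames.append([0.0] + stage[:-1])  # overall delay z^-1
        # Next stage: allpass (z^-1 - lam) / (1 - lam z^-1).
        nxt, u_prev, v_prev = [], 0.0, 0.0
        for u_n in stage:
            v_n = -lam * u_n + u_prev + lam * v_prev
            nxt.append(v_n)
            u_prev, v_prev = u_n, v_n
        stage = nxt
    return frames


def solve_normal_equations(frames, x):
    """Build Q and P from the regressor frames and solve Q beta = P."""
    K = len(frames)
    # Augmented matrix [Q | P].
    M = [[sum(a * b for a, b in zip(frames[i], frames[j])) for j in range(K)]
         + [sum(a * b for a, b in zip(frames[i], x))] for i in range(K)]
    for col in range(K):  # Gaussian elimination with partial pivoting
        pivot = max(range(col, K), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            raise ValueError("autocorrelation matrix is singular")
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, K):
            f = M[r][col] / M[col][col]
            for c in range(col, K + 1):
                M[r][c] -= f * M[col][c]
    beta = [0.0] * K
    for i in reversed(range(K)):  # back substitution
        s = M[i][K] - sum(M[i][j] * beta[j] for j in range(i + 1, K))
        beta[i] = s / M[i][i]
    return beta


# Example: a second-order recursion is recovered exactly when lam = 0 (FIR case).
x = [1.0, 1.5]
for n in range(2, 64):
    x.append(1.5 * x[n - 1] - 0.7 * x[n - 2])
beta_fir = solve_normal_equations(laguerre_regressors(x, 0.0, 2), x)
```

For a pole parameter of zero the regressor frames reduce to delayed copies of the input frame, so that the sketch coincides with conventional FIR linear prediction.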
Determining the pole parameter and the plurality of FIR coefficients may comprise estimating the entropy of the frame of the prediction error signal obtained using the general prediction filter defined by the set pole parameter and the plurality of FIR coefficients. The plurality of FIR coefficients have been determined based on the set pole parameter (e.g. using the above mentioned set of normal equations). The steps of determining the plurality of FIR coefficients (for a set pole parameter) and of estimating the entropy may be repeated for a plurality of differently set pole parameters, thereby yielding a corresponding plurality of entropy values. The pole parameter may be selected from the plurality of differently set pole parameters, which reduces the estimated entropy of the frame of the prediction error signal. In other words, the pole parameter which yields the lowest entropy from the plurality of entropies may be selected. Furthermore, the plurality of FIR coefficients which has been determined using the selected pole parameters may be selected. The selected pole parameter and the selected plurality of FIR coefficients may be the pole parameter and the plurality of FIR coefficients, which reduce (e.g. minimize) the entropy of the frame of the prediction error signal.
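The repeated evaluation of differently set pole parameters may be sketched as a simple search loop. To keep the sketch short, a one-tap predictor (K = 1) is used as a stand-in for the general prediction filter, which is a simplification and not the K > 1 case described above. The names `entropy_bits`, `one_tap_residual` and `search_pole`, as well as the candidate grid from -0.9 to 0.9 in steps of 0.1, are assumptions made for illustration.

```python
import math
from collections import Counter


def entropy_bits(samples):
    """Empirical entropy -sum(P_i * log2(P_i)) of the sample values."""
    n = len(samples)
    return -sum((c / n) * math.log2(c / n) for c in Counter(samples).values())


def one_tap_residual(frame, lam):
    """Toy K = 1 predictor: regressor is the delayed output of a one-pole pre-filter."""
    gain = math.sqrt(1.0 - lam * lam)
    state, regressor = 0.0, []
    for x_n in frame:
        regressor.append(state)           # value from the previous sample (delay)
        state = gain * x_n + lam * state  # pre-filter g / (1 - lam z^-1)
    num = sum(x * y for x, y in zip(frame, regressor))
    den = sum(y * y for y in regressor)
    beta = num / den if den > 0.0 else 0.0  # least-squares coefficient
    # The prediction is quantized so that the residual stays integer valued.
    residual = [x - round(beta * y) for x, y in zip(frame, regressor)]
    return residual, beta


def search_pole(frame, residual_fn=one_tap_residual, candidates=None):
    """Return (entropy, pole parameter, coefficients) with the lowest entropy."""
    if candidates is None:
        candidates = [i / 10 for i in range(-9, 10)]
    best = None
    for lam in candidates:
        residual, coeffs = residual_fn(frame, lam)
        h = entropy_bits(residual)
        if best is None or h < best[0]:
            best = (h, lam, coeffs)
    return best


frame = [round(1000 * math.sin(0.05 * n)) for n in range(256)]
best_entropy, best_lam, best_beta = search_pole(frame)
fir_entropy = search_pole(frame, candidates=[0.0])[0]
```

Since the value zero is part of the candidate grid, the selected pole parameter can never yield a higher estimated entropy than the FIR case, which reflects the adaptive FIR/IIR selection described above.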
Alternatively or in addition, setting the pole parameter to a fixed first value may comprise estimating a frequency based on the frame of the input signal. In particular, a dominant frequency of the frame of the input signal may be estimated. Estimating a frequency based on the frame of the input signal may comprise determining a spectral envelope of a spectrum of the frame of the input signal, and estimating the frequency of the frame of the input signal based on the spectral envelope (e.g. based on a maximum of the spectral envelope). The first value for the pole parameter may be determined based on the estimated frequency, e.g. using a pre-determined look-up table or a pre-determined function. The pre-determined look-up table or function may provide a mapping between a plurality of frequency values and a corresponding plurality of pole parameter values. The pre-determined look-up table or function may be determined experimentally, e.g. using a training set of input signals.
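The look-up based selection of the pole parameter may be sketched as follows. For brevity, the sketch estimates the dominant frequency from the peak of a naive DFT magnitude spectrum instead of from a spectral envelope obtained by low-order linear prediction. The table entries shown are purely hypothetical placeholders and are not values taken from the present document; the function names `dominant_frequency` and `lookup_pole` are likewise assumptions.

```python
import cmath
import math


def dominant_frequency(frame, sample_rate):
    """Return the frequency (in Hz) of the strongest DFT bin, DC excluded."""
    n = len(frame)
    best_bin, best_mag = 1, -1.0
    for k in range(1, n // 2 + 1):
        acc = sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        if abs(acc) > best_mag:
            best_bin, best_mag = k, abs(acc)
    return best_bin * sample_rate / n


def lookup_pole(freq_hz, table):
    """Return the pole parameter of the table entry closest in frequency.

    table is a list of (frequency_hz, pole_parameter) pairs.
    """
    return min(table, key=lambda entry: abs(entry[0] - freq_hz))[1]


# Hypothetical table; real entries would be determined from a training set.
example_table = [(1000.0, 0.8), (5000.0, 0.5), (12000.0, 0.0)]

frame = [math.sin(2 * math.pi * 6 * t / 64) for t in range(64)]
freq = dominant_frequency(frame, 48000)
lam = lookup_pole(freq, example_table)
```

The looked-up value may then serve as the first value of the pole parameter, optionally refined by evaluating a few neighbouring values.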
The z-transform of the general prediction filter may be representable as a ratio of a first and a second FIR filter (e.g. the filters A and B as described in the present document) comprising first and second sets of coefficients, respectively. The first and second FIR filters may be filters in accordance to the True HD coder. The method may further comprise mapping the determined pole parameter and the determined plurality of FIR coefficients to the first and second sets of coefficients. By way of example, the mapping may make use of formulas (e.g. the formulas described in the present document) for determining the first and second sets of coefficients from the determined pole parameter and from the determined plurality of FIR coefficients. The formulas may provide for an exact bi-directional transformation of the first and second sets of coefficients and of the determined pole parameter and the determined plurality of FIR coefficients. Alternatively the formulas may yield an approximation of the general prediction filter described by the determined pole parameter and the determined plurality of FIR coefficients. Alternatively or in addition, the mapping may comprise quantizing of the first and second sets of coefficients. As such, the general prediction filter may be used in conjunction with incumbent IIR-based encoders such as the True HD coder, thereby allowing the reuse of an already existing installed base of decoders.
According to a further aspect, a method for encoding a frame of an input signal using a general prediction filter is described. The method comprises determining the general prediction filter using the methods described in the present document. Furthermore, the method comprises determining an estimate of the frame of the input signal using the determined general prediction filter. A frame of a prediction error signal may be determined based on the estimated frame and the frame of the input signal (e.g. based on the difference). The method may comprise encoding information indicative of the determined general prediction filter; and encoding the frame of the prediction error signal (e.g. using an entropy encoder). The information indicative of the determined general prediction filter may comprise the pole parameter. According to a further aspect, an encoded signal (e.g. an encoded audio signal or an encoded image signal) is described. The encoded signal comprises information indicative of a general prediction filter to be used by a decoder for decoding the encoded signal. The z-transform of the general prediction filter may be representable by a filter comprising (or having) the z-transform of a FIR filter with the z variable of the FIR filter being replaced by the z-transform of an allpass filter or an approximation of the z-transform of a FIR filter with the z variable of the FIR filter being replaced by the z-transform of an allpass filter. The FIR filter may comprise a plurality of FIR coefficients and the allpass filter may exhibit a pole defined by a pole parameter. The information indicative of the general prediction filter may comprise information indicative of the pole parameter.
According to another aspect, a method for determining a look-up table providing a mapping between an estimated frequency of a frame of an input signal and a pole parameter defining a pole of an allpass filter is described. The allpass filter may be used to provide a general prediction filter based on an FIR filter. The method may comprise providing a training set of different frames of input signals. The training set of frames may be used to estimate a corresponding set of frequencies for the training set of frames. Furthermore, a set of pole parameters may be determined which provide general prediction filters that reduce an entropy of frames of prediction error signals. The set of pole parameters may be determined using the methods described in the present document. The method may comprise determining the look-up table based on the set of frequencies and based on the corresponding set of pole parameters. In particular, clustering techniques may be used to determine the look-up table from the set of frequencies and the corresponding set of pole parameters.
According to a further aspect, a method for decoding an encoded signal is described. The encoded signal may have been encoded as described in the present document. The method may comprise receiving information indicative of a pole parameter of an allpass filter. The allpass filter may be used to provide a general prediction filter based on an FIR filter comprising a plurality of FIR coefficients. The method may comprise receiving information indicative of the plurality of FIR coefficients. The general prediction filter may be determined based on the received information indicative of the pole parameter and based on the received information indicative of the plurality of FIR coefficients. The general prediction filter may be used to decode the encoded signal. In particular, the method may comprise decoding a frame of a prediction error signal (comprised within the encoded signal). A frame of an estimated input signal (also referred to as the estimated decoded signal) may be determined based on the decoded frame of the prediction error signal and based on the FIR prediction filter. A decoded frame of the encoded signal may be determined based on the frame of the estimated input signal and based on the decoded frame of the prediction error signal.
According to another aspect, an encoder (e.g. an audio encoder or an image encoder) configured to determine a general prediction filter for a frame of an input signal is described. The z-transform of the general prediction filter may be indicative of (or may correspond to) the z-transform of a FIR filter with the z variable of the FIR filter being replaced by the z-transform of an allpass filter or of an approximation to the z-transform of a FIR filter with the z variable of the FIR filter being replaced by the z-transform of an allpass filter. The FIR filter may comprise a plurality of FIR coefficients. The allpass filter may exhibit a pole defined by an adjustable pole parameter. The encoder may be configured to determine the pole parameter and the plurality of FIR coefficients, such that an entropy of a frame of a prediction error signal is reduced (e.g. minimized). The frame of the prediction error signal is derived from the frame of the input signal using the general prediction filter, wherein the general prediction filter is defined by the pole parameter and the plurality of FIR coefficients.
According to another aspect, a decoder (e.g. an audio decoder or an image decoder) for decoding an encoded signal (e.g. an encoded audio signal or an encoded image signal) is described. The decoder may be configured to extract information indicative of a pole parameter of an allpass filter from the encoded signal. The allpass filter may be used to provide a general prediction filter based on an FIR filter comprising a plurality of FIR coefficients. The decoder may be further configured to extract information indicative of the plurality of FIR coefficients from the encoded signal. In addition, the decoder may be configured to determine the general prediction filter based on the extracted information indicative of the pole parameter and based on the extracted information indicative of the plurality of FIR coefficients. The general prediction filter may be used by the decoder to decode the encoded signal. According to a further aspect, a method for decoding a frame of an encoded signal using a general prediction filter is described. The frame of the encoded signal may be indicative of coefficients of the general prediction filter. As outlined above, the general prediction filter may comprise an IIR prediction filter. Furthermore, the frame of the encoded signal may be indicative of a frame of a prediction error signal. The method may comprise extracting (indications of) coefficients of the general prediction filter from the encoded signal. The coefficients of the general prediction filter may have been determined using the methods described in the present document. Furthermore, the method may comprise decoding the frame of the prediction error signal (e.g. using a de-quantizer). The method may proceed in determining a frame of an estimated decoded signal based on the decoded frame of the prediction error signal and based on the general prediction filter. 
Furthermore, the method may comprise determining a decoded frame of the encoded signal based on the frame of the estimated decoded signal and based on the decoded frame of the prediction error signal. In particular, the decoded frame of the encoded signal may be determined by adding corresponding samples of the frame of the estimated decoded signal and of the decoded frame of the prediction error signal.
According to another aspect, a decoder for decoding an encoded signal is described. The encoded signal may be indicative of coefficients of a general prediction filter and of samples of a prediction error signal. The decoder may comprise means for extracting coefficients of the general prediction filter from the encoded signal. The coefficients of the general prediction filter may have been determined using the methods described in the present document. The coefficients may be associated with a frame of the encoded signal. Furthermore, the decoder may comprise means for decoding a frame of the prediction error signal, e.g. using a de-quantizer. In addition, the decoder may comprise means for determining a frame of an estimated decoded signal based on the decoded frame of the prediction error signal and based on the general prediction filter. Furthermore, the decoder may comprise means for determining a decoded frame of the encoded signal based on the frame of the estimated decoded signal and based on the decoded frame of the prediction error signal. According to a further aspect, a software program is described. The software program may be adapted for execution on a processor and for performing the method steps outlined in the present document when carried out on the processor. According to another aspect, a storage medium is described. The storage medium may comprise a software program adapted for execution on a processor and for performing the method steps outlined in the present document when carried out on the processor.
According to a further aspect, a computer program product is described. The computer program may comprise executable instructions for performing the method steps outlined in the present document when executed on a computer.
It should be noted that the methods and systems including their preferred embodiments as outlined in the present patent application may be used stand-alone or in combination with the other methods and systems disclosed in this document. Furthermore, all aspects of the methods and systems outlined in the present patent application may be arbitrarily combined. In particular, the features of the claims may be combined with one another in an arbitrary manner.
SHORT DESCRIPTION OF THE FIGURES
The invention is explained below in an exemplary manner with reference to the accompanying drawings, wherein
Fig. 1a shows an example short-term spectrum of an audio signal comprising an excerpt of music;
Fig. 1b shows a block diagram of an example encoder and decoder using linear prediction;
Fig. 2a illustrates example spectra of an audio signal and of prediction error signals determined using FIR prediction and IIR prediction, respectively;
Figs. 2b and 2c show block diagrams of example encoders and decoders using linear prediction;
Figs. 2d and 2e show block diagrams of an example encoder and decoder using IIR based linear prediction, respectively;
Fig. 3a illustrates the concept of frequency warping;
Fig. 3b illustrates block diagrams of an example encoder and decoder using warped linear prediction (WLP);
Fig. 3c shows example prediction results obtained using linear prediction and warped linear prediction;
Fig. 4 shows block diagrams of an example encoder and decoder using linear prediction based on Laguerre filters;
Fig. 5a illustrates an experimental relationship between example adjustable pole parameters and signal frequency such that entropy of the signal is minimized;
Figs. 5b and 5c illustrate example entropy reductions which are achievable when using IIR based linear prediction; and
Fig. 6 shows a flow chart of an example method for determining the filter coefficients of an adjustable FIR/IIR based linear predictor.
DETAILED DESCRIPTION
The following aspects are described in the context of an audio signal. It should be noted that the aspects described in the present document are also applicable to prediction-based encoding of other types of signals, e.g. of image signals such as lines or columns of an image frame. In particular, the aspects described in the present document are applicable to lossless audio coding, as well as to lossless image coding.
As outlined in the background section, linear prediction is frequently used to reduce the entropy of an input audio signal, thereby yielding a prediction error signal having reduced entropy. In other words, linear prediction is directed at removing redundancies from the input audio signal, thereby yielding a decorrelated prediction error signal. If the values of future audio samples of the input audio signal can be estimated, then only the rules of prediction need to be transmitted along with the difference between the estimated signal and the actual signal, i.e. along with the prediction error signal. The prediction is typically performed by a so called decorrelator (so called because when optimally adapted there is no correlation between the currently transmitted sample of the prediction error signal and its previous samples).
Fig. 1a shows the short-term spectrum 100 of an example input audio signal (e.g. an excerpt of a music track). It can be seen that the spectrum 100 is not flat, and it is an objective of the decorrelator to flatten the spectrum 100 using prediction techniques, thereby yielding coding gains. Ideally the decorrelator yields a prediction error signal having a flat spectrum 101, i.e. a prediction error signal which essentially corresponds to white noise. The Gerzon-Craven theorems show that the level of the optimally decorrelated prediction error signal is given by the average of the original signal spectrum when plotted as decibels versus linear frequency. As illustrated in Fig. 1a by the flat spectrum 101, this decibel average can have significantly less power than the original audio signal, thereby yielding a reduction in data rate when encoding the prediction error signal compared to encoding the input audio signal. The power reduction achieved by the (optimal) decorrelator represents the information content of the input audio signal as defined by Shannon.
Fig. 1b illustrates the block diagram of an example FIR based decorrelator 110 (or encoder) and a corresponding example FIR based re-correlator 120 (or decoder). The decorrelator 110 makes use of an FIR prediction filter comprising a plurality of delay units $z^{-1}$ 115 and a plurality of filter coefficients $a_k$ 112, with k = 1, ..., K, typically K>1. A sample of the input audio signal x 111 is predicted based on a plurality of previous samples of the input audio signal x 111 using the plurality of filter coefficients $a_k$ 112, thereby yielding a sample of the predicted or estimated audio signal $\hat{x}$ 113. The difference between the sample of the input audio signal 111 and the estimated audio signal 113 yields a corresponding sample of the prediction error signal r 114 (also referred to as the residual signal). The residual signal 114 typically exhibits reduced entropy compared to the input audio signal 111. The residual signal 114 may be encoded using an appropriate entropy-coding scheme (e.g. using a Rice code, Huffman coding, or arithmetic coding), thereby providing a lossless audio coding scheme.
The plurality of filter coefficients $a_k$ 112 may be determined by the decorrelator 110 on a frame-by-frame basis using the samples of a frame of the input audio signal 111. In particular, the plurality of filter coefficients $a_k$ 112 may be determined such that the mean squared energy of the prediction error signal 114 is reduced (e.g. minimized). This may be achieved in an efficient manner using the Levinson-Durbin algorithm. As such, a lossless audio coder may be provided by first removing the redundancy from the input audio signal 111 (e.g. using linear prediction techniques) and by then coding the resulting prediction error signal 114 with an efficient entropy-coding scheme. The encoded signal comprises for each frame of the input audio signal 111 a representation of the plurality of filter coefficients $a_k$ 112 and the entropy-encoded samples of the frame of the prediction error signal 114.
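By way of illustration, the determination of FIR prediction coefficients using the autocorrelation method and the Levinson-Durbin recursion may be sketched as follows. The sketch is not taken from the present document: the function names (autocorrelation, levinson_durbin, residual), the use of floating point arithmetic and the zero signal history assumed before the start of the frame are assumptions made for this example only.

```python
def autocorrelation(frame, max_lag):
    """r[m] = sum_n frame[n] * frame[n - m] for m = 0..max_lag."""
    n = len(frame)
    return [sum(frame[i] * frame[i - m] for i in range(m, n))
            for m in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the Toeplitz normal equations for a_1..a_K.

    The predictor is x_hat[n] = sum_k a[k-1] * x[n-k].
    Returns (a, err), where err is the remaining prediction error energy.
    """
    a = [0.0] * order
    err = r[0]
    for m in range(order):
        if err <= 0.0:
            break  # silent or perfectly predictable frame
        # reflection coefficient of the predictor of order m + 1
        acc = r[m + 1] - sum(a[j] * r[m - j] for j in range(m))
        k = acc / err
        prev = a[:]
        a[m] = k
        for j in range(m):
            a[j] = prev[j] - k * prev[m - 1 - j]
        err *= 1.0 - k * k
    return a, err


def residual(frame, a):
    """r[n] = x[n] - sum_k a_k * x[n-k], assuming zeros before the frame."""
    out = []
    for n, x in enumerate(frame):
        pred = sum(a[k] * frame[n - 1 - k]
                   for k in range(len(a)) if n - 1 - k >= 0)
        out.append(x - pred)
    return out
```

In a lossless coder the prediction would additionally be quantized to the word length of the input signal before the subtraction; this step is omitted here for brevity.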
The re-correlator 120 (also referred to as the decoder) performs corresponding steps to the decorrelator 110. In particular, the re-correlator 120 uses the same FIR filter comprising the same plurality of filter coefficients $a_k$ 112 to reconstruct the input audio signal 111 from the residual audio signal r 114.
The degree to which an input audio signal can be "whitened" depends on the content of the input audio signal 111 and on the complexity (e.g. the number K of coefficients and/or the structure) of the prediction filter. Infinite complexity (e.g. an infinite number K of filter coefficients) could theoretically achieve a prediction at the entropy level 101 shown in Fig. 1a. However, all the coefficients that define such a decorrelator 110 would then need to be transmitted to the decoder 120 (in addition to the prediction error signal 114) to reconstruct (recorrelate) the input audio signal 111. There is therefore a need to obtain a balance between predictor complexity (e.g. the number K of filter coefficients and/or the type of the prediction filter, e.g. FIR or IIR) and performance.
Typically, lossless audio coders (including the MPEG-4 ALS, Audio Lossless Coding, coder) make use of an FIR-based predictor or decorrelator 110. IIR-based predictors or decorrelators 110 may be beneficial in situations where the control of peak data rates is important. A further situation where IIR-based decorrelators 110 may be beneficial is where the spectrum 100 of the input audio signal 111 exhibits a relatively wide dynamic range. In such a situation, compression gains may be expected, in particular for relatively high sampling rates. By way of example, IIR-based predictors show an improvement over FIR-based predictors of approx. 0.2 bits/sample (for audio signals at a 44.1 kHz sampling rate) and an improvement of more than 1 bit/sample (for audio signals at a 96 kHz sampling rate, which are band-limited to 32 kHz). As such, it can be seen that IIR-based predictors are increasingly beneficial for encoding input audio signals 111 having an increasingly high ratio of sampling rate over signal bandwidth.
Fig. 2a illustrates the spectral level 201 of an example input audio signal 111, the spectral level 203 of an example prediction error signal 114 obtained using an FIR decorrelator comprising K=8 coefficients (FIR8) and the spectral level 202 of an example prediction error signal 114 obtained using an IIR decorrelator comprising a length of four (IIR4). From Fig. 2a, it can be observed that the IIR predictor is able to flatten the prediction error signal better and it can be expected that compared to an FIR predictor, the IIR predictor provides an "improved" spectral flattening at relatively high sampling rates. The degree of "improvement" may be proportional to the "input sampling frequency". The reason for this is that for higher sampling rates, the spectrum of the input audio signal 111 rolls off at frequencies lower than the Nyquist frequency (which is at half of the sampling frequency). This implies that the spectrum of the prediction error signal created with an FIR filter will also roll off at significantly lower frequencies than the Nyquist frequency. On the other hand, the spectrum of the prediction error signal created with an IIR filter will typically be flat up to the Nyquist frequency.
Fig. 2b shows an example block diagram of a conventional prediction architecture, where in an encoder 210 a prediction filter is used to determine an estimated signal which is subtracted from the input signal, thereby yielding the prediction error signal. At the decoder 220, the same prediction filter may be used to reconstruct the input signal. The prediction architecture of Fig. 2b may be used for lossy IIR-based prediction coders. However, the architecture of Fig. 2b typically cannot be used for lossless compression. This is due to the fact that the output of the prediction filter 220 in Fig. 2b typically has a longer word length than the input signal because of the multiplication by fractional coefficients. As the transmitted data rate depends on the total word length, extending the word size would be counterproductive. This problem may be overcome by quantizing the output of the prediction filter at the encoder 210, i.e. by quantizing the estimated signal using a quantizer 216. This is illustrated in Fig. 2c, where the output of the prediction filter at the encoder 210 is quantized so that the transmitted prediction error signal has the same word length as the input signal. By symmetry, the decoder 220 can recover the original signal despite the fact that the side chain is nonlinear. However, the use of a quantizer 216 assumes that the predictors of the decoder 220 and the encoder 210 produce outputs which, when requantized, correspond exactly. This may be achieved with an FIR filter, as the output of an FIR filter can be computed exactly using finite-word length arithmetic. In contrast, IIR filters with fractional coefficients cannot be exactly implemented since the representation of the recirculating signal requires an ever-increasing word length. The IIR output is thus dependent on the rounding behavior of the underlying arithmetic, and it may not be ensured that the rounding behavior does not affect the quantized output of the decorrelators.
Consequently, there is the risk that a decoder 220 implemented on hardware (such as a computer or a DSP chip) which is different from the hardware of the encoder 210 does not reproduce exactly the same bits and by consequence the compression scheme is not lossless. A possible architecture for overcoming this technical problem is illustrated in Figs. 2d and 2e for the encoder 210 and the corresponding decoder 220, respectively. As the input and output signals are both quantized and as filters A 212 and B 213 are both FIR filters, the input to the quantizer Q 216 is a finite precision signal, and the quantization can therefore be specified precisely. On the other hand, because of the recirculation through filter B 213, the total response of the predictor in the encoder 210 and in the decoder 220 is IIR. As such, the architecture of Figs. 2d and 2e illustrates an IIR predictor which is portable across different hardware platforms, even when used for lossless encoding.
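The bit-exact behavior of such a structure may be illustrated by the following sketch, in which two FIR filters with fixed-point coefficients feed a quantizer that uses integer operations only. The details are assumptions made for this example and are not specified in the present document: filter A is assumed to act on past input samples, filter B on past residual samples, the coefficients are assumed to be scaled by 2**SHIFT, and the quantizer is assumed to be a floor operation (arithmetic right shift).

```python
import math

SHIFT = 14  # coefficients are stored as integers scaled by 2**SHIFT (assumption)


def to_fixed(coeffs):
    """Round fractional coefficients to the fixed-point grid shared by both sides."""
    return [int(round(c * (1 << SHIFT))) for c in coeffs]


def predict(a_fix, b_fix, x_hist, e_hist):
    """Quantized prediction Q(A * past inputs + B * past residuals).

    x_hist[0] is the most recent input sample, e_hist[0] the most recent
    residual sample. Only integer operations are used.
    """
    acc = sum(a * x for a, x in zip(a_fix, x_hist))
    acc += sum(b * e for b, e in zip(b_fix, e_hist))
    return acc >> SHIFT  # quantizer Q: floor back to the input word length


def encode(samples, a_fix, b_fix):
    x_hist = [0] * len(a_fix)
    e_hist = [0] * len(b_fix)
    residual = []
    for x in samples:
        e = x - predict(a_fix, b_fix, x_hist, e_hist)
        residual.append(e)
        x_hist = [x] + x_hist[:-1]
        e_hist = [e] + e_hist[:-1]
    return residual


def decode(residual, a_fix, b_fix):
    x_hist = [0] * len(a_fix)
    e_hist = [0] * len(b_fix)
    samples = []
    for e in residual:
        x = e + predict(a_fix, b_fix, x_hist, e_hist)
        samples.append(x)
        x_hist = [x] + x_hist[:-1]
        e_hist = [e] + e_hist[:-1]
    return samples


# Example round trip with arbitrary (hypothetical) coefficients.
a_fix = to_fixed([1.5, -0.7])
b_fix = to_fixed([0.3])
pcm = [int(round(1000 * math.sin(0.05 * n))) for n in range(200)]
restored = decode(encode(pcm, a_fix, b_fix), a_fix, b_fix)
```

Because the encoder and the decoder evaluate the same integer expression on identical histories, the reconstruction is exact regardless of the coefficient values chosen.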
The encoder 210 of Fig. 2d determines a prediction error signal 214 from the input signal 111. The prediction error signal 214 typically has a reduced entropy compared to the input signal 111 and can be entropy encoded (e.g. using a Rice code, Huffman coding, or arithmetic coding).
For FIR-based predictors, optimal prediction coefficients can be obtained using the Levinson-Durbin algorithm. For IIR-based predictors, there is no such efficient algorithm for obtaining the optimal IIR prediction coefficients. The present document addresses the technical problem of determining the coefficients of an IIR-based decorrelator in an efficient manner such that the entropy of the prediction error signal is reduced (e.g. minimized).
It is proposed in the present document to make use of so called Warped Linear Prediction (WLP) and/or Laguerre Linear Prediction (LLP) as a preprocessor to determine the coefficients of IIR-based decorrelators. It is shown that prediction filters which have been determined using a WLP and/or LLP scheme can be transformed into filters A 212 and B 213 of an IIR-based decorrelator (as shown in Fig. 2d). Furthermore, an efficient scheme for determining the coefficients of the filters A 212 and B 213 is described.
Frequency warped processing may be used to process audio signals according to the frequency resolution of the human auditory system. For this purpose, the frequency range of an input signal may be mapped to a warped frequency range, thereby modeling the frequency resolution of the human auditory system. This is illustrated in Fig. 3a, where it is shown how an original frequency range 301 may be warped into a warped frequency range 302. By way of example, a Bark scale may be used to warp the original frequency range. In the context of FIR-based linear prediction, frequency warping may be implemented by replacing the delays 115 of an FIR prediction filter with an allpass filter
$$A(z) = \frac{z^{-1} - \lambda}{1 - \lambda z^{-1}}, \quad |\lambda| < 1,$$
wherein the parameter λ defines the pole of the allpass filter. In case of a pole parameter λ=0 conventional FIR-based linear prediction is implemented. For an input signal at a sampling rate of 44.1 kHz, a Bark scale mapping is obtained with a pole parameter λ=0.756. Fig. 3b illustrates a modified encoder 310 and a modified decoder 320, where the delay units 115 have been replaced by allpass filters A(z) 315. The optimal coefficients of the allpass filters A(z) 315 for a fixed pole parameter λ may be determined using the Levinson-Durbin algorithm. Fig. 3c illustrates how a WLP based encoder 310 approximates an input signal 111. Fig. 3c shows the spectrum 331 of an input signal 111, the spectrum 332 of an estimated signal determined using an FIR-based linear predictor and the spectrum 333 of an estimated signal determined using WLP. It can be seen that WLP exhibits improved modeling quality in the perceptually relevant low frequency range (using e.g. a pole parameter of λ=0.756 for an input audio signal 111 at a sampling rate of 44.1 kHz).
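The properties of a first-order allpass filter with a real pole λ may be illustrated by the following sketch. It assumes the form A(z) = (z^-1 - λ) / (1 - λ z^-1); the function names and the definition of the warped frequency as the negative phase of A(z) are choices made for this example.

```python
import cmath


def allpass_response(lam, omega):
    """Frequency response A(e^{j omega}) of the first-order allpass filter."""
    z_inv = cmath.exp(-1j * omega)
    return (z_inv - lam) / (1.0 - lam * z_inv)


def warped_frequency(lam, omega):
    """Negative phase of A(z) at omega; equals omega itself for lam = 0."""
    return -cmath.phase(allpass_response(lam, omega))


def allpass_filter(lam, x):
    """Time-domain form: y[n] = -lam * x[n] + x[n-1] + lam * y[n-1]."""
    y = []
    x_prev = 0.0
    y_prev = 0.0
    for s in x:
        y_prev = -lam * s + x_prev + lam * y_prev
        x_prev = s
        y.append(y_prev)
    return y
```

For a positive pole parameter such as λ=0.756, low frequencies are mapped to a wider portion of the warped frequency axis, while the magnitude response stays at unity for all frequencies.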
A problem of WLP is that WLP provides prediction error signals which are not whitened in the original frequency domain. This problem may be overcome by whitening the prediction error signal using a residual post-filter
or alternatively, optimal WLP coefficients can be obtained using a pre-filter
wherein the pre-filter is typically not applied in the prediction filtering operation. This means that the pre-filter W(z) may be used when determining the optimal prediction coefficients $a_k$ and the pole parameter λ. However, when performing linear prediction filtering as shown in Fig. 3b, the determined filter coefficients may be used without the additional pre-filter W(z). In other words, the residual signal r may be determined as shown in Fig. 3b without using the additional pre-filter W(z), but only using the coefficients $a_k$, k = 1, ..., K and the pole parameter λ, which have been determined using the additional pre-filter W(z).
While the use of a post-filter or a pre-filter whitens the prediction error signal, it is typically not possible to implement a synthesis filter at the decoder 320 because of delay-free loops. This technical problem may be solved by adding an explicit delay unit 115 to the encoder and the decoder, thereby yielding a so called Laguerre Linear Prediction (LLP) scheme which is illustrated in Fig. 4 showing an encoder 410 and a corresponding decoder 420. The pre-filtering using the whitening filter W(z) may also be absorbed into the prediction structure, thereby yielding so called Laguerre filters 411
$$H_k(z) = W(z)\,(A(z))^{k-1}, \quad k = 1, 2, \ldots, K,$$
wherein for a pole parameter λ=0, the encoder and decoder structure of Fig. 4 correspond to an FIR-based linear predictor. The encoder 410 receives an input signal 111 and determines an estimated signal 413 using the decorrelator comprising the delay unit 115, the Laguerre filters 411 and respective filter coefficients 412 (referred to as LLP coefficients). The estimated signal 413 is subtracted from the input signal 111, thereby yielding the prediction error signal 414. The corresponding decoder 420 performs the corresponding operations to reconstruct the input signal 111. In particular, the decoder 420 receives the LLP coefficients 412 and uses a delay unit 115, the Laguerre filters 411 and the received LLP coefficients 412 to reconstruct the input audio signal 111 from the prediction error signal 414. One method for determining optimal LLP coefficients $\beta_k$, with k = 1, 2, ..., K, is as follows:
· Consider the input signal x 111 and a set of K regressor signals $y_k$ (with k = 1, ..., K) at the output of the K Laguerre filters 411. The estimated signal $\hat{x}$ 413 may be determined from the regressor signals $y_k$ as $\hat{x} = \sum_{k=1}^{K} \beta_k \cdot y_k$, where $\beta_k$ are the LLP coefficients 412.
· The LLP coefficients 412 are usually optimized to minimize the mean squared energy of the prediction error signal r 414 (within the frame for which the LLP coefficients 412 are determined). The regressor signals $y_k$ can be derived from the input signal 111 by linear filtering, thus $Y_k(z) = z^{-1} H_k(z) \cdot X(z)$, where $X(z)$ and $Y_k(z)$ are the z-transforms of $x$ and $y_k$, respectively, and where $H_k(z)$ are stable and causal IIR filters.
· In matrix notation, the optimal LLP coefficients $\beta_k$ are given by the normal equations $Q\beta = P$, where $\beta$ is a vector comprising the optimal LLP coefficients $\beta_k$, and where the elements of the matrix $Q$ and the vector $P$ are given by $Q_{k,l} = \sum_n y_l(n)\, y_k(n)$ and $P_k = \sum_n x(n)\, y_k(n)$ (the sums being taken over the samples n of the frame), i.e. where the matrix $Q$ reflects the correlation between the different regressor signals $y_k$ and where the vector $P$ reflects the correlation between the input signal and the different regressor signals $y_k$.
Hence, the predictor coefficients $\beta_k$ 412 may be determined in an efficient manner under the assumption of a fixed pole parameter λ using e.g. a Levinson-Durbin algorithm. This is particularly true for a pole parameter λ=0, for which the Laguerre filters 411 (together with the delay unit 115) become delays, i.e. $z^{-1} H_k(z) = z^{-k}$, and for which the optimal LLP coefficients $\beta_k$, with k = 1, 2, ..., K, correspond to the coefficients of an FIR prediction filter.
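The steps above may be sketched as follows. The sketch solves the normal equations directly by Gaussian elimination instead of using a Levinson-Durbin type recursion, and it assumes the pre-filter W(z) = sqrt(1 - λ²) / (1 - λ z^-1) as commonly used for Laguerre filters; this expression, the function names and the test signal are assumptions made for this example.

```python
import math
import random


def laguerre_regressors(x, lam, order):
    """Regressors y_1..y_K with Y_k(z) = z^-1 W(z) A(z)^(k-1) X(z)."""
    gain = math.sqrt(1.0 - lam * lam)
    delayed = [0.0] + list(x[:-1])          # explicit delay unit z^-1
    stage, prev = [], 0.0
    for s in delayed:                       # W(z) = gain / (1 - lam z^-1)
        prev = gain * s + lam * prev
        stage.append(prev)
    regs = [stage]
    for _ in range(order - 1):              # cascade of allpass sections A(z)
        nxt, x_prev, y_prev = [], 0.0, 0.0
        for s in regs[-1]:
            y_prev = -lam * s + x_prev + lam * y_prev
            x_prev = s
            nxt.append(y_prev)
        regs.append(nxt)
    return regs


def solve(mat, rhs):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(mat)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[piv] = aug[piv], aug[col]
        for r in range(col + 1, n):
            f = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= f * aug[col][c]
    sol = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = aug[r][n] - sum(aug[r][c] * sol[c] for c in range(r + 1, n))
        sol[r] = s / aug[r][r]
    return sol


def llp_coefficients(x, lam, order):
    """Return (beta, regs) where beta solves Q beta = P for the given frame."""
    regs = laguerre_regressors(x, lam, order)
    q = [[sum(a * b for a, b in zip(regs[k], regs[l])) for l in range(order)]
         for k in range(order)]
    p = [sum(a * b for a, b in zip(x, regs[k])) for k in range(order)]
    return solve(q, p), regs


def prediction_residual(x, beta, regs):
    """r[n] = x[n] - sum_k beta_k * y_k[n]."""
    return [s - sum(b * y[n] for b, y in zip(beta, regs))
            for n, s in enumerate(x)]


# Hypothetical test frame: a sinusoid with a small amount of noise.
rng = random.Random(0)
frame = [math.sin(0.3 * n) + 0.1 * rng.uniform(-1.0, 1.0) for n in range(256)]
```

For λ=0 the regressors reduce to delayed copies of the input frame, so that the same routine yields the coefficients of a conventional FIR predictor.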
As will be shown below, the encoder 410 and decoder 420 may be transformed in accordance with the encoder 210 and decoder 220 of Figs. 2d and 2e, respectively. This means that the encoder 410 and decoder 420 effectively provide an IIR based decorrelator when using a pole parameter λ which is different from zero.
The use of Laguerre filters 411 for implementing a decorrelator has several advantages. The encoder / decoder of Fig. 4 can be implemented using the predictor structure of Figs. 2d and 2e, wherein perfect reconstruction is guaranteed even in case of finite word length arithmetic. Furthermore, stability of the used synthesis filter is guaranteed for such Laguerre (and possibly Kautz) filters. In addition, efficient autocorrelation based methods exist (similar to the ones used in linear prediction) for deriving optimal filter coefficients 412.
Furthermore, the prediction error signal 414 exhibits spectral flatness on the original frequency scale 301. In this context, the pole parameter λ (which defines the pole of the allpass filter) provides an extra degree of freedom. It is proposed in the present document to use this extra degree of freedom to provide for an additional reduction (e.g. a minimization) of the entropy of the prediction error signal 414. By doing this, an optimal combination of FIR/IIR filters may be determined for each block or frame of the input audio signal 111. As a further advantage it should be noted that the encoder 410 of Fig. 4 preserves desirable qualities of WLP. In particular, the encoder 410 may provide improved perceptual noise shaping for cascaded lossy data compression with a lossless kernel. This is particularly relevant when using a lossless codec, such as TrueHD, in a lossy mode of operation. In the lossy mode of operation, the quantization noise is shaped according to prediction parameters which model the spectral envelope of the signal (similar to spectral noise shaping as used in linear prediction based speech codecs). Warped linear prediction typically models spectral envelopes perceptually better, and is therefore better suited for lossy coding. As such, the encoder 410 (which preserves the properties of WLP) provides an improved perceptual noise shaping.
When using a pole parameter λ=0, the methods described in the present document provide an FIR prediction filter. As the pole parameter provides a further degree of freedom, it can be stipulated that, for an equal number of prediction coefficients, the IIR predictors which are determined using the methods described in the present document should provide an entropy reduction which is at least as good as the corresponding FIR predictor (with a pole parameter λ=0).
As indicated above, the pole parameter λ may be used to reduce the entropy of the prediction error signal 414. This may be achieved e.g. by using a brute force approach. By way of example, the pole parameter λ (and the corresponding pole of the allpass filter A(z)) may be varied from -0.9 to +0.9 and the pole parameter λ which produces a prediction error signal 414 with the least entropy may be selected. In an embodiment, for every analysis frame of the input audio signal 111, the pole parameter λ may be varied from -0.9 to 0.9 in steps of 0.1. For each pole parameter λ, the optimal LLP coefficients 412 are determined and the residual signal 414 and its entropy are determined. Then, the pole parameter λ for which the entropy of the residual signal 414 is reduced (e.g. is minimal) may be selected, and the (entropy encoded) residual signal 414 and the LLP coefficients 412 for the selected pole parameter λ may be transmitted to the decoder 420.
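The brute force search may be sketched as follows. To keep the example short, a Laguerre predictor of order K=1 is used, the entropy is estimated as the empirical entropy of the residual rounded to integers, and the pre-filter is assumed to be W(z) = sqrt(1 - λ²) / (1 - λ z^-1); these simplifications and all function names are assumptions made for this example, whereas the present document considers predictors of higher order.

```python
import math
from collections import Counter


def empirical_entropy(values):
    """Entropy in bits/sample of the values rounded to integers."""
    counts = Counter(int(round(v)) for v in values)
    total = len(values)
    return -sum(c / total * math.log2(c / total) for c in counts.values())


def first_order_llp_residual(x, lam):
    """Residual of x_hat = beta * y_1 with Y_1(z) = z^-1 W(z) X(z)."""
    gain = math.sqrt(1.0 - lam * lam)
    y, state, x_prev = [], 0.0, 0.0
    for s in x:
        state = gain * x_prev + lam * state
        y.append(state)
        x_prev = s
    den = sum(v * v for v in y)
    # least squares solution of the one-dimensional normal equation
    beta = sum(a * b for a, b in zip(x, y)) / den if den > 0.0 else 0.0
    return [a - beta * b for a, b in zip(x, y)], beta


def search_pole(x, candidates):
    """Return (lam, entropy, beta) for the candidate with the least entropy."""
    best = None
    for lam in candidates:
        res, beta = first_order_llp_residual(x, lam)
        h = empirical_entropy(res)
        if best is None or h < best[1]:
            best = (lam, h, beta)
    return best


# Pole parameters from -0.9 to 0.9 in steps of 0.1.
CANDIDATES = [i / 10 for i in range(-9, 10)]
```

Since λ=0 is among the candidates, the entropy found by the search cannot exceed the entropy obtained with the corresponding FIR predictor under the same entropy estimate.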
It should be noted that more efficient schemes than the above mentioned brute force approach for selecting a pole parameter λ which reduces (e.g. minimizes) the entropy of the prediction error signal 414 may be provided and are discussed below.
The determined LLP coefficients $\beta_k$ 412 may be transformed into filter coefficients for the filters A 212 and B 213 which are used by the encoder 210 and decoder 220 of Figs. 2d and 2e (used e.g. in the Dolby TrueHD coder). This is beneficial, as it allows the reuse of an existing encoder / decoder structure, as well as the reuse of quantization and coding routines. In case of filters A 212 and B 213 each comprising four filter coefficients and in case of K=4, the filters 212, 213 may be determined as
$$A = \begin{pmatrix} R\beta_1 - R\lambda\beta_2 + R\lambda^2\beta_3 - R\lambda^3\beta_4 \\ -R(3\lambda)\beta_1 + R(1 + 2\lambda^2)\beta_2 - R(2\lambda + \lambda^3)\beta_3 + R(3\lambda^2)\beta_4 \\ +R(3\lambda^2)\beta_1 - R(2\lambda + \lambda^3)\beta_2 + R(1 + 2\lambda^2)\beta_3 - R(3\lambda)\beta_4 \\ -R\lambda^3\beta_1 + R\lambda^2\beta_2 - R\lambda\beta_3 + R\beta_4 \end{pmatrix}, \quad \text{where } R = \sqrt{1 - \lambda^2}, \text{ and}$$
$$B = \begin{pmatrix} 4\lambda \\ -6\lambda^2 \\ 4\lambda^3 \\ -\lambda^4 \end{pmatrix}.$$
It should be noted that in case of a pole parameter λ=0, only the FIR filter A 212 is active. The transformation formulas for other values of K may be determined in an analogous manner.
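A possible way of obtaining the transformation for an arbitrary K is to expand the Laguerre predictor into a numerator and a denominator polynomial in z^-1, as sketched below. The sketch assumes W(z) = sqrt(1 - λ²) / (1 - λ z^-1) and A(z) = (z^-1 - λ) / (1 - λ z^-1), and it assumes that filter B holds the negated denominator coefficients (i.e. that it acts on past values of the estimated signal); the sign and feedback conventions of an actual implementation may differ. The function closed_form_k4 is a reading of the K=4 expressions with R = sqrt(1 - λ²) and is included for comparison only.

```python
import math


def poly_mul(p, q):
    """Multiply two polynomials in z^-1 (lowest power first)."""
    out = [0.0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


def poly_pow(p, n):
    out = [1.0]
    for _ in range(n):
        out = poly_mul(out, p)
    return out


def llp_to_ab(beta, lam):
    """Expand sum_k beta_k W(z) A(z)^(k-1) over the common denominator.

    Returns (a, b): a holds the K numerator coefficients, b the negated
    denominator coefficients of (1 - lam z^-1)^K without the leading 1.
    """
    order = len(beta)
    gain = math.sqrt(1.0 - lam * lam)
    a = [0.0] * order
    for k, coeff in enumerate(beta, start=1):
        term = poly_mul(poly_pow([-lam, 1.0], k - 1),
                        poly_pow([1.0, -lam], order - k))
        for i, c in enumerate(term):
            a[i] += gain * coeff * c
    den = poly_pow([1.0, -lam], order)
    return a, [-c for c in den[1:]]


def closed_form_k4(beta, lam):
    """Closed-form expressions for K = 4, used here as a cross-check."""
    b1, b2, b3, b4 = beta
    r = math.sqrt(1.0 - lam * lam)
    a = [
        r * (b1 - lam * b2 + lam**2 * b3 - lam**3 * b4),
        r * (-3 * lam * b1 + (1 + 2 * lam**2) * b2
             - (2 * lam + lam**3) * b3 + 3 * lam**2 * b4),
        r * (3 * lam**2 * b1 - (2 * lam + lam**3) * b2
             + (1 + 2 * lam**2) * b3 - 3 * lam * b4),
        r * (-lam**3 * b1 + lam**2 * b2 - lam * b3 + b4),
    ]
    b = [4 * lam, -6 * lam**2, 4 * lam**3, -lam**4]
    return a, b
```

For λ=0 the expansion returns the LLP coefficients unchanged as filter A and an all-zero filter B, in line with the observation that only the FIR filter A is active in that case.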
The benefits of using an IIR-based decorrelator have been tested using a sine sweep ranging from 0 to 24 kHz, sampled with 16 bits/sample and with a sampling rate of 48 kHz. The performance of FIR-based decorrelators using an FIR predictor of order 4 (FIR4) and an FIR predictor of order 8 (FIR8) was compared to the performance of an IIR-based decorrelator using an IIR predictor of order 4 (IIR4). The tests were performed for different frame sizes of the input audio signal 111, i.e. for different predictor analysis frame sizes. The example results are shown in Table 1.
Table 1
It can be seen that in most of the cases, a reduction of the entropy of the prediction error signal can be achieved when using an IIR predictor.
Furthermore, it has been observed using a sine sweep test that the optimal pole parameter λ has an almost linear relationship to the frequency of the input audio signal 1 11. This is illustrated in Fig. 5a, where it can be seen that the optimal pole parameter λ 501 (i.e. the pole parameter which provides LLP coefficients 412 which minimize the entropy of the prediction error signal 414) decreases (linearly) with increasing frequency 502 of the input audio signal 111. In the illustrated example, the input audio signal 1 1 1 comprises a sine sweep, therefore the x-axis 502 can be thought of as the frequency increasing with time. This observation can be used to provide efficient schemes for determining the pole parameter λ which provides (almost optimal) LLP coefficients 412. It should be noted that for a particular frequency (around frame number 150), the optimal pole parameter λ 501 is zero, thereby indicating that for an input signal 1 11 at this frequency, the use of an FIR predictor is optimal.
Fig. 5b illustrates the entropy reduction (measured in bits/sample) which is possible when using an IIR4 predictor compared to the situation when using an FIR4 predictor. The input audio signal 111 comprises a sine sweep such that the x-axis 502 may be viewed as the frequency of the input audio signal 111. It can be seen that the entropy reduction 503 which may be achieved when using an IIR predictor varies with the frequency of the input signal. A similar observation can be made in Fig. 5c, which illustrates the entropy reduction (measured in bits/sample) that is possible when using an IIR4 predictor compared to the situation when using an FIR8 predictor. It should be noted that in the case of Fig. 5b only four filter coefficients are determined and transmitted (for the FIR4 and for the IIR4 case), whereas in the case of Fig. 5c, the FIR8 requires the determination and transmission of eight filter coefficients.
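The entropy figures quoted in bits/sample can be estimated from the empirical distribution of the sample values of a prediction error frame. The following is a minimal sketch of such an estimate, assuming an integer-valued (quantized) prediction error frame; the function name `entropy_bits_per_sample` is hypothetical.

```python
import numpy as np

def entropy_bits_per_sample(residual):
    """Empirical first-order entropy of a frame, in bits per sample."""
    # Relative frequency of each distinct sample value in the frame.
    _, counts = np.unique(np.asarray(residual), return_counts=True)
    p = counts / counts.sum()
    # Weighted sum -sum_i p_i * log2(p_i); every p_i here is > 0.
    return float(-np.sum(p * np.log2(p)))

# A frame with two equally likely values carries 1 bit per sample.
print(entropy_bits_per_sample([0, 0, 1, 1]))
```

The entropy reduction between two predictors would then be the difference of this estimate for their respective prediction error frames.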
As indicated above, the observation of Fig. 5a suggests that the pole parameter λ may be determined in an efficient manner (compared to the brute force approach outlined above). In particular, a look-up table may be determined offline, wherein the look-up table provides a mapping between an estimated frequency of a frame of the input audio signal 111 and a corresponding pole parameter λ which minimizes the entropy of the prediction error signal 414. The look-up table may be determined based on a training set comprising a plurality of input audio signals 111. The look-up table will typically have a mapping distribution similar to the one illustrated in Fig. 5a. The encoder 410 may be configured to use the pre-determined look-up table to determine the pole parameter λ which is to be used to calculate the LLP coefficients 412 for a particular frame of an input audio signal 111. The encoder 410 may employ a frequency estimation method, and estimate the (dominant) frequency content of the particular frame of the input signal 111. By way of example, the encoder may employ a low-order linear predictor and estimate the spectral envelope of the particular frame of the input audio signal 111. The estimated (dominant) frequency may correspond to the peak of the spectral envelope. Once the dominant frequency is estimated, the encoder 410 may look up the corresponding optimal entropy minimizing pole parameter λ from the look-up table. This entropy minimizing pole parameter λ may be used to determine optimal LLP coefficients 412 which minimize the power of the corresponding frame of the prediction error signal 414 (using a Levinson-Durbin type algorithm). The determined LLP coefficients 412 may optionally be mapped to the prediction structure of Fig. 2d, in order to make use of existing Dolby TrueHD quantization and coding techniques. It should be noted that various other methods may be used to determine the pole parameter λ.
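The look-up approach may be sketched as follows. This is an illustration under stated assumptions rather than the encoder's actual procedure: the dominant frequency is taken as the peak of the spectral envelope of an order-2 linear predictor obtained from the autocorrelation normal equations, and the table entries in `TABLE_FREQ_HZ` and `TABLE_LAMBDA` are invented values that merely mimic a roughly linear, decreasing mapping; a real table would be trained offline. All names are hypothetical.

```python
import numpy as np

# Hypothetical table values, for illustration only.
TABLE_FREQ_HZ = np.array([0.0, 6000.0, 12000.0, 18000.0, 24000.0])
TABLE_LAMBDA = np.array([0.8, 0.4, 0.0, -0.4, -0.8])

def estimate_dominant_frequency(frame, fs, order=2, n_grid=512):
    """Peak of the low-order LPC spectral envelope, in Hz."""
    x = np.asarray(frame, dtype=float)
    # Autocorrelation at lags 0..order.
    r = np.array([np.dot(x[:len(x) - k], x[k:]) for k in range(order + 1)])
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    # Predictor coefficients from the normal equations R a = r[1:].
    a = np.linalg.lstsq(R, r[1:], rcond=None)[0]
    # Envelope is 1/|A(e^{jw})|, so its peak is the minimum of |A|.
    omega = np.linspace(0.0, np.pi, n_grid)
    lags = np.arange(1, order + 1)
    A = 1.0 - np.exp(-1j * np.outer(omega, lags)) @ a
    return omega[np.argmin(np.abs(A))] * fs / (2.0 * np.pi)

def lookup_pole_parameter(freq_hz):
    """Linear interpolation in the (hypothetical) frequency-to-lambda table."""
    return float(np.interp(freq_hz, TABLE_FREQ_HZ, TABLE_LAMBDA))

fs = 48000
n = np.arange(1024)
frame = np.sin(2.0 * np.pi * 6000.0 * n / fs)
f_est = estimate_dominant_frequency(frame, fs)
print(f_est, lookup_pole_parameter(f_est))
```

Interpolating between table entries is one possible design choice; a nearest-entry look-up would serve equally well when the table is dense.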
In particular, a hybrid method for determining the optimal entropy minimizing pole parameter λ may make use of a combination of a look-up table and a brute force search. For instance, a look-up table may be used to determine a first estimate of the optimal pole parameter λ. Furthermore, the looked-up value of λ may be refined by evaluating additional values surrounding the looked-up value of λ (and possibly λ = 0). Finally, the value of λ which minimizes the entropy may be chosen. For example, if the looked-up value of λ is 0.7, one could evaluate other values of λ in the range between 0.6 and 0.8 in addition to 0.7 (and possibly the value 0, in order to verify whether the FIR predictor provides a better solution than the IIR predictor).
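The hybrid refinement may be sketched as a small local search around the looked-up value. In this sketch the callable `entropy_for` is a hypothetical stand-in for the full chain (determine LLP coefficients for a candidate λ, compute the prediction error frame, estimate its entropy); the step size and search width are likewise assumptions.

```python
def refine_pole_parameter(lam_lookup, entropy_for, half_width=0.1,
                          step=0.05, include_fir=True):
    """Pick the candidate lambda with the lowest estimated entropy.

    lam_lookup  -- first estimate, e.g. from a look-up table
    entropy_for -- callable mapping a candidate lambda to an entropy value
    """
    n = int(round(half_width / step))
    candidates = [lam_lookup + i * step for i in range(-n, n + 1)]
    # Keep only candidates for which the allpass section is stable.
    candidates = [c for c in candidates if abs(c) < 1.0]
    if include_fir:
        # lambda = 0 corresponds to the plain FIR predictor.
        candidates.append(0.0)
    return min(candidates, key=entropy_for)

# Toy entropy curve with its minimum at 0.75, for demonstration only.
print(refine_pole_parameter(0.7, lambda lam: (lam - 0.75) ** 2))
```

With a looked-up value of 0.7 and the default settings, the candidates are 0.6, 0.65, 0.7, 0.75 and 0.8, plus 0 for the FIR case.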
Fig. 6 illustrates an example method 600 for determining an IIR prediction filter for performing decorrelation of an input signal. In step 601, a frame of samples of the input signal 111 is selected for determining an IIR prediction filter. In step 602, a dominant frequency of the selected frame is estimated. The estimated frequency may be used to determine a pole parameter λ (step 603), e.g. using a pre-determined look-up table. Using the pole parameter, LLP prediction coefficients may be determined (step 604). The LLP prediction coefficients may optionally be transformed into an explicit FIR and IIR filter representation comprising e.g. filter A 212 and filter B 213 (step 605).
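Step 604 may be sketched as a least-squares fit for a fixed pole parameter. The sketch below makes several assumptions that are labeled in the comments: the regressor signals are generated by a one-sample delay, a pre-filter of the form sqrt(1 - λ²)/(1 - λz^-1), and repeated application of the allpass (-λ + z^-1)/(1 - λz^-1); the normal equations are solved directly rather than with a Levinson-Durbin type recursion. The function names are hypothetical.

```python
import numpy as np

def first_order(b0, b1, lam, x):
    """y[n] = b0*x[n] + b1*x[n-1] + lam*y[n-1], zero initial state."""
    y = np.zeros(len(x))
    prev_x = prev_y = 0.0
    for n, xn in enumerate(x):
        prev_y = b0 * xn + b1 * prev_x + lam * prev_y
        y[n] = prev_y
        prev_x = xn
    return y

def regressor_frames(x, lam, K):
    """One regressor frame per tap (assumed Laguerre-type structure)."""
    delayed = np.concatenate(([0.0], x[:-1]))             # overall delay z^-1
    u = first_order(np.sqrt(1.0 - lam ** 2), 0.0, lam, delayed)  # pre-filter
    frames = [u]
    for _ in range(K - 1):
        u = first_order(-lam, 1.0, lam, u)                # allpass section
        frames.append(u)
    return np.stack(frames)                                # shape (K, N)

def llp_coefficients(x, lam, K):
    """Coefficients minimizing the power of the prediction error frame."""
    assert abs(lam) < 1.0, "pole must lie inside the unit circle"
    x = np.asarray(x, dtype=float)
    U = regressor_frames(x, lam, K)
    Q = U @ U.T                                            # autocorrelation matrix
    P = U @ x                                              # cross-correlation vector
    beta = np.linalg.lstsq(Q, P, rcond=None)[0]            # solves Q beta = P
    return beta, x - beta @ U                              # coefficients, error frame

n = np.arange(1024)
x = np.sin(2.0 * np.pi * 6000.0 * n / 48000.0)
beta, err = llp_coefficients(x, 0.0, 2)
print(beta, np.sum(err ** 2) / np.sum(x ** 2))
```

For λ = 0 the pre-filter reduces to unity and the allpass to a pure delay, so the sketch falls back to a conventional FIR predictor on x[n-1], x[n-2], and so on.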
In the present document, a method for determining an IIR-based decorrelator has been described. The method may be implemented in an efficient manner and allows for the determination of IIR prediction filter coefficients which minimize the entropy of the prediction error signal. As such, the method enables the implementation of audio coding schemes having increased coding gains. The IIR-based decorrelator may be used in conjunction with an entropy encoder of the prediction error signal to provide a lossless audio coder. Furthermore, the method may be used to adaptively switch between FIR and IIR based linear prediction on a frame-by-frame basis, in order to minimize the entropy of the prediction error signal. In addition, the IIR-based decorrelator is compliant with existing Dolby TrueHD coders, thereby enabling the reuse of already deployed Dolby TrueHD decoders.
The methods and systems described in the present document may be implemented as software, firmware and/or hardware. Certain components may e.g. be implemented as software running on a digital signal processor or microprocessor. Other components may e.g. be implemented as hardware and/or as application specific integrated circuits. The signals encountered in the described methods and systems may be stored on media such as random access memory or optical storage media. They may be transferred via networks, such as radio networks, satellite networks, wireless networks or wireline networks, e.g. the Internet. Typical devices making use of the methods and systems described in the present document are portable electronic devices or other consumer equipment which are used to store and/or render audio signals.

Claims

1) A method (600) for determining a stable general prediction filter for a frame of an input signal (111); wherein the z-transform of the general prediction filter comprises an approximation to the z-transform of a finite impulse response, referred to as FIR, filter with the z variable of the FIR filter being replaced by the z-transform of an allpass filter; wherein the FIR filter comprises a plurality of FIR coefficients (412); wherein the allpass filter exhibits a pole defined by an adjustable pole parameter; wherein the method (600) comprises
- determining the pole parameter and the plurality of FIR coefficients, such that an entropy of a frame of a prediction error signal (414) which is derived from the frame of the input signal (111) using the general prediction filter defined by the pole parameter and the plurality of FIR coefficients (412) is reduced.
2) The method (600) of claim 1, wherein determining the pole parameter and the plurality of FIR coefficients comprises
- setting the pole parameter to a fixed first value; and
- determining (604) the plurality of FIR coefficients (412) using the set pole parameter.
3) The method (600) of claim 2, wherein determining the pole parameter and the plurality of FIR coefficients (412) comprises
- estimating the entropy of the frame of the prediction error signal (414) obtained using the general prediction filter defined by the set pole parameter and the plurality of FIR coefficients (412) determined using the set pole parameter;
- repeating determining the plurality of FIR coefficients (412) and estimating the entropy for a plurality of differently set pole parameters; and
- selecting the pole parameter from the plurality of differently set pole parameters which reduces the estimated entropy of the frame of the prediction error signal (414).
4) The method (600) of any of claims 2 to 3, wherein estimating the entropy of the frame of the prediction error signal (414) comprises
- determining a probability distribution of the values of samples of the frame of the prediction error signal (414); and
- estimating the entropy based on a weighted sum of the probability distribution.
5) The method (600) of claim 4, wherein the weighted sum of the probability distribution is given by $-\sum_i p_i \log_b(p_i)$, with $p_i$ being the probability of the value i of a sample of the prediction error signal (414) and with b being the base of the log function.
6) The method (600) of claim 2, wherein setting the pole parameter to a fixed first value comprises
- estimating (602) a frequency based on the frame of the input signal (111); and
- determining (603) the first value based on the estimated frequency.
7) The method (600) of claim 6, wherein
- the first value is determined based on the estimated frequency using a predetermined look-up table; and
- the pre-determined look-up table provides a mapping between a plurality of frequency values and a corresponding plurality of pole parameter values.
8) The method of any of claims 6 to 7, wherein estimating a frequency based on the frame of the input signal (111) comprises
- determining a spectral envelope of a spectrum of the frame of the input signal (111); and
- estimating the frequency of the frame of the input signal (111) based on the spectral envelope.
9) The method (600) of any previous claim, wherein for a fixed pole parameter, determining the plurality of FIR coefficients comprises determining the plurality of FIR coefficients such that a mean squared power of the frame of the prediction error signal (414) is reduced.
10) The method (600) of any previous claim, wherein for a fixed pole parameter, determining the plurality of FIR coefficients (412) comprises
- for each tap of the general prediction filter, determining a frame of a regressor signal based on the frame of the input signal (111), thereby yielding a plurality of regressor signal frames;
- determining an autocorrelation matrix Q based on the plurality of regressor signal frames;
- determining a cross-correlation vector P based on the plurality of regressor signal frames and the frame of the input signal (111); and
- determining a FIR coefficient vector β comprising the plurality of FIR coefficients (412) based on the normal equations Qβ = P.
11) The method (600) of any previous claim, wherein the z-transform of the general prediction filter further comprises a pre-filter configured to whiten a spectrum of the prediction error signal (414).
12) The method (600) of claim 11, wherein the z-transform of the general prediction filter further comprises an overall delay.
13) The method (600) of any previous claim, wherein the allpass filter comprises the allpass filter
$$\frac{z^{-1} - \lambda}{1 - \lambda z^{-1}}, \quad |\lambda| < 1,$$
with λ being the pole parameter.
14) The method (600) of any previous claim, wherein the z-transform of the general prediction filter is representable as a ratio of a first and second FIR filter comprising first and second sets of coefficients, respectively; wherein the method further comprises
- mapping (605) the determined pole parameter and the determined plurality of FIR coefficients (412) to the first and second sets of coefficients.
15) The method (600) of any previous claim, wherein the z-transform of the prediction filter is representable as
Figure imgf000029_0001
with k = 1, ..., K, with K > 1, and β_k, k = 1, ..., K being the plurality of FIR coefficients (412).
16) A method for encoding a frame of an input signal (111) using a general prediction filter, the method comprising
- determining the general prediction filter using the method of any of claims 1 to 15;
- determining an estimate (413) of the frame of the input signal (111) using the determined general prediction filter;
- determining a frame of a prediction error signal (414) based on the estimated frame and the frame of the input signal (111);
- encoding information indicative of the determined general prediction filter; and
- encoding the frame of the prediction error signal (414).
17) The method of claim 16, wherein the frame of the prediction error signal (414) is encoded using an entropy encoder.
18) The method of any of claims 16 to 17, wherein the information indicative of the determined general prediction filter comprises the pole parameter.
19) An encoded signal, wherein the encoded signal comprises information indicative of a general prediction filter to be used by a decoder for decoding the encoded signal; wherein the z-transform of the general prediction filter is representable by an approximation to the z-transform of a FIR filter with the z variable of the FIR filter being replaced by the z-transform of an allpass filter; wherein the FIR filter comprises a plurality of FIR coefficients (412); wherein the allpass filter exhibits a pole defined by a pole parameter; wherein the information indicative of the general prediction filter comprises information indicative of the pole parameter.
20) A method for determining a look-up table providing a mapping between an estimated frequency of a frame of an input signal (111) and a pole parameter defining a pole of an allpass filter; wherein the allpass filter is used to provide a general prediction filter based on an FIR filter, the method comprising
- providing a training set of different frames of input signals (111);
- estimating a corresponding set of frequencies for the training set of frames;
- determining a set of pole parameters which provide general prediction filters that reduce an entropy of frames of prediction error signals; and
- determining the look-up table based on the set of frequencies and the corresponding set of pole parameters.
21) A method for decoding an encoded signal according to claim 19, wherein the method comprises
- receiving information indicative of a pole parameter of an allpass filter; wherein the allpass filter is used to provide a general prediction filter based on an FIR filter comprising a plurality of FIR coefficients (412);
- receiving information indicative of the plurality of FIR coefficients (412);
- determining the general prediction filter based on the received information indicative of the pole parameter and the received information indicative of the plurality of FIR coefficients (412);
- decoding a frame of a prediction error signal (414); and
- determining a frame of an estimated input signal (413) based on the decoded frame of the prediction error signal (414) and the FIR prediction filter; and
- determining a decoded frame of the encoded signal based on the frame of the estimated input signal (413) and the decoded frame of the prediction error signal (414).
22) An encoder (410) configured to determine a general prediction filter for a frame of an input signal (111); wherein the z-transform of the general prediction filter is indicative of an approximation to the z-transform of a FIR filter with the z variable of the FIR filter being replaced by the z-transform of an allpass filter; wherein the FIR filter comprises a plurality of FIR coefficients (412); wherein the allpass filter exhibits a pole defined by an adjustable pole parameter; wherein the encoder (410) is configured to
- determine the pole parameter and the plurality of FIR coefficients, such that an entropy of a frame of a prediction error signal (414) which is derived from the frame of the input signal (111) using the general prediction filter defined by the pole parameter and the plurality of FIR coefficients (412) is reduced.
23) A decoder (420) for decoding an encoded signal; wherein the decoder (420) is configured to
- extract information indicative of a pole parameter of an allpass filter from the encoded signal; wherein the allpass filter is used to provide a general prediction filter based on an FIR filter comprising a plurality of FIR coefficients (412);
- extract information indicative of the plurality of FIR coefficients (412) from the encoded signal;
- determine the general prediction filter based on the extracted information indicative of the pole parameter and the extracted information indicative of the plurality of FIR coefficients (412); wherein the general prediction filter is used to decode the encoded signal.
24) A method for decoding a frame of an encoded signal using a general prediction filter, the method comprising
- extracting coefficients of the general prediction filter from the encoded signal; wherein the coefficients of the general prediction filter were determined using the method of any of claims 1 to 15;
- decoding a frame of a prediction error signal (414);
- determining a frame of an estimated decoded signal (413) based on the decoded frame of the prediction error signal (414) and the general prediction filter; and
- determining a decoded frame of the encoded signal based on the frame of the estimated decoded signal (413) and the decoded frame of the prediction error signal (414).
25) A decoder for decoding an encoded signal; wherein the decoder (420) comprises
- means for extracting coefficients of the general prediction filter from the encoded signal; wherein the coefficients of the general prediction filter were determined using the method of any of claims 1 to 15;
- means for decoding a frame of a prediction error signal (414);
- means for determining a frame of an estimated decoded signal (413) based on the decoded frame of the prediction error signal (414) and the general prediction filter; and
- means for determining a decoded frame of the encoded signal based on the frame of the estimated decoded signal (413) and the decoded frame of the prediction error signal (414).
PCT/EP2013/077461 2012-12-19 2013-12-19 Signal adaptive fir/iir predictors for minimizing entropy WO2014096236A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US14/649,477 US9548056B2 (en) 2012-12-19 2013-12-19 Signal adaptive FIR/IIR predictors for minimizing entropy

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201261739379P 2012-12-19 2012-12-19
US61/739,379 2012-12-19

Publications (2)

Publication Number Publication Date
WO2014096236A2 true WO2014096236A2 (en) 2014-06-26
WO2014096236A3 WO2014096236A3 (en) 2014-08-28

Family

ID=49886907

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2013/077461 WO2014096236A2 (en) 2012-12-19 2013-12-19 Signal adaptive fir/iir predictors for minimizing entropy

Country Status (2)

Country Link
US (1) US9548056B2 (en)
WO (1) WO2014096236A2 (en)

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008000316A1 (en) * 2006-06-30 2008-01-03 Fraunhofer Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder, audio decoder and audio processor having a dynamically variable warping characteristic

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
GERALD D T SCHULLER ET AL: "Perceptual Audio Coding Using Adaptive Pre-and Post-Filters and Lossless Compression", IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, IEEE SERVICE CENTER, NEW YORK, NY, US, vol. 10, no. 6, 1 September 2002 (2002-09-01), XP011079662, ISSN: 1063-6676 *
V Voitishchuk ET AL: "Alternatives for Warped Linear Predictors", Proc. 12th ProRISC Workshop, 29 November 2001 (2001-11-29), pages 710-713, XP055104660, Retrieved from the Internet: URL:http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=F6C175EF024E23FB114F799ACB80C910?doi=10.1.1.4.9393&rep=rep1&type=pdf [retrieved on 2014-02-27] *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107924683A (en) * 2015-10-15 2018-04-17 华为技术有限公司 Sinusoidal coding and decoded method and apparatus
US10971165B2 (en) 2015-10-15 2021-04-06 Huawei Technologies Co., Ltd. Method and apparatus for sinusoidal encoding and decoding
GB2547877A (en) * 2015-12-21 2017-09-06 Graham Craven Peter Lossless bandsplitting and bandjoining using allpass filters
GB2547877B (en) * 2015-12-21 2019-08-14 Graham Craven Peter Lossless bandsplitting and bandjoining using allpass filters
US10896683B2 (en) 2015-12-21 2021-01-19 Mqa Limited Lossless bandsplitting and bandjoining using allpass filters
CN110850162A (en) * 2019-11-22 2020-02-28 西南交通大学 Frequency estimation method of three-phase power system based on error correlation entropy

Also Published As

Publication number Publication date
US9548056B2 (en) 2017-01-17
US20150317985A1 (en) 2015-11-05
WO2014096236A3 (en) 2014-08-28

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13814909

Country of ref document: EP

Kind code of ref document: A2

DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)
WWE Wipo information: entry into national phase

Ref document number: 14649477

Country of ref document: US

122 Ep: pct application non-entry in european phase

Ref document number: 13814909

Country of ref document: EP

Kind code of ref document: A2