CA2717584A1

CA2717584A1 - Method and apparatus for processing an audio signal

Info

Publication number: CA2717584A1
Application number: CA2717584A
Authority: CA
Inventors: Hyun Kook Lee; Sung Yong Yoon; Dong Soo Kim; Jae Hyun Lim
Original assignee: LG Electronics Inc
Current assignee: LG Electronics Inc
Priority date: 2008-03-04
Filing date: 2009-03-04
Publication date: 2009-09-11
Anticipated expiration: 2029-03-04
Also published as: EP2259254B1; CN102007534A; KR20100134623A; AU2009220341B2; JP2011514558A; EP2259254A4; ES2464722T3; US20100070272A1; JP5108960B2; CN102007534B; AU2009220341A1; EP2259254A2; RU2010140365A; RU2452042C1; CA2717584C; US8135585B2; WO2009110751A2; WO2009110751A3

Abstract

An apparatus for processing an encoded signal and method thereof are disclosed, by which an audio signal can be compressed and reconstructed in higher efficiency. An audio signal processing method includes the steps of identifying whether a coding type of the audio signal is a music signal coding type using first type information, if the coding type of the audio signal is not the music signal coding type, identifying whether the coding type of the audio signal is a speech signal coding type or a mixed signal coding type using second type information, if the coding type of the audio signal is the mixed signal coding type, extracting spectral data and a linear predictive coefficient from the audio signal, generating a residual signal for linear prediction by performing inverse frequency conversion on the spectral data, reconstructing the audio signal by performing linear prediction coding on the linear predictive coefficient and the residual signal, and reconstructing a high frequency region signal using an extension base signal corresponding to a partial region of the reconstructed audio signal and band extension information. Accordingly, various kinds of audio signals can be encoded/decoded in higher efficiency.

Claims

1. In an audio signal processing apparatus including an audio decoder, a method of processing an audio signal, comprising the steps of:

identifying whether a coding type of the audio signal is a music signal coding type using first type information;
if the coding type of the audio signal is not the music signal coding type, identifying whether the coding type of the audio signal is a speech signal coding type or a mixed signal coding type using second type information;

if the coding type of the audio signal is the mixed signal coding type, extracting spectral data and a linear predictive coefficient from the audio signal;

generating a residual signal for linear prediction by performing inverse frequency conversion on the spectral data;

reconstructing the audio signal by performing linear prediction coding on the linear predictive coefficient and the residual signal; and reconstructing a high frequency region signal using an extension base signal corresponding to a partial region of the reconstructed audio signal and band extension information.

2. The method of claim 1, wherein the audio signal includes a plurality of subframes and wherein the second type information exists by a unit of the subframe.

3. The method of claim 1, wherein a bandwidth of the high frequency region signal is not equal to that of the extension base signal.

4. The method of claim 1, wherein the band extension information includes at least one of a filter range applied to the reconstructed audio signal, a start frequency of the extension base signal and an end frequency of the extension base signal.

5. The method of claim 1, wherein if the coding type of the audio signal is the music signal coding type, the audio signal comprises a frequency-domain signal, wherein if the coding type of the audio signal is the speech signal coding type, the audio signal comprises a time-domain signal, and wherein if the coding type of the audio signal is the mixed signal coding type, the audio signal comprises an MDCT-domain signal.

6. The method of claim 1, the linear predictive coefficient extracting step comprises the steps of:

extracting a linear predictive coefficient mode; and extracting the linear predictive coefficient having a variable bit size corresponding to the extracted linear predictive coefficient mode.

7. An apparatus for processing an audio signal, comprising:

a demultiplexer extracting first type information and second type information from a bitstream;

a decoder determining unit identifying whether a coding type of the audio signal is a music signal coding type using first type information, the decoder, if the coding type of the audio signal is not the music signal coding type, identifying whether the coding type of the audio signal is a speech signal coding type or a mixed signal coding type using second type information, the decoder then determining a decoding scheme;

an information extracting unit, if the coding type of the audio signal is the mixed signal coding type, extracting spectral data and a linear predictive coefficient from the audio signal;

a frequency transforming unit generating a residual signal for linear prediction by performing inverse frequency conversion on the spectral data;

a linear prediction unit reconstructing the audio signal by performing linear prediction coding on the linear predictive coefficient and the residual signal; and a bandwidth extension decoding unit reconstructing a high frequency region signal using an extension base signal corresponding to a partial region of the reconstructed audio signal and band extension information.

8. The apparatus of claim 7, wherein the audio signal includes a plurality of subframes and wherein the second type information exists by a unit of the subframe.

9. The apparatus of claim 7, wherein a bandwidth of the high frequency region signal is not equal to that of the extension base signal.

10. The apparatus of claim 7, wherein the band extension information includes at least one of a filter range applied to the reconstructed audio signal, a start frequency of the extension base signal and an end frequency of the extension base signal.

11. The apparatus of claim 7, wherein if the coding type of the audio signal is the music signal coding type, the audio signal comprises a frequency-domain signal, wherein if the coding type of the audio signal is the speech signal coding type, the audio signal comprises a time-domain signal, and wherein if the coding type of the audio signal is the mixed signal coding type, the audio signal comprises an MDCT-domain signal.

12. The apparatus of claim 7, the linear predictive coefficient extracting comprising:

extracting a linear predictive coefficient mode; and extracting the linear predictive coefficient having a variable bit size corresponding to the extracted linear predictive coefficient mode.

13. In an audio signal processing apparatus including an audio coder for processing an audio signal, a method of processing the audio signal, comprising the steps of:

removing a high frequency band signal of the audio signal and generating band extension information for reconstructing the high frequency band signal;
determining a coding type of the audio signal;

if the audio signal is a music signal, generating first type information indicating that the audio signal is coded into a music signal coding type;

if the audio signal is not the music signal, generating second type information indicating that the audio signal is coded into either a speech signal coding type or a mixed signal coding type;

if the coding type of the audio signal is the mixed signal coding type, generating a linear predictive coefficient by performing linear prediction coding on the audio signal;

generating a residual signal for the linear prediction coding;

generating a spectral coefficient by frequency-transforming the residual signal; and generating an audio bitstream including the first type information, the second type information, the linear predictive coefficient and the residual signal.

14. An apparatus for processing an audio signal, comprising:

a bandwidth preprocessing unit removing a high frequency band signal of the audio signal, the bandwidth preprocessing unit generating band extension information for reconstructing the high frequency band signal;

a signal classifying unit determining a coding type of the audio signal, the signal classifying unit, if the audio signal is a music signal, generating first type information indicating that the audio signal is coded into a music signal coding type, the signal classifying unit, if the audio signal is not the music signal, generating second type information indicating that the audio signal is coded into either a speech signal coding type or a mixed signal coding type;

a linear prediction modeling unit, if the coding type of the audio signal is the mixed signal coding type, generating a linear predictive coefficient by performing linear prediction coding on the audio signal;

a residual signal extracting unit generating a residual signal for the linear prediction coding; and a frequency transforming unit generating a spectral coefficient by frequency-transforming the residual signal.

15. The apparatus of claim 14, wherein the audio signal includes a plurality of subframes and wherein the second type information is generated per the subframe.