CA2717584A1 - Method and apparatus for processing an audio signal

Publication number
CA2717584A1
Authority
CA
Canada
Prior art keywords
signal
audio signal
coding type
type
coding
Legal status
Granted
Application number
CA2717584A
Other languages
French (fr)
Other versions
CA2717584C (en)
Inventor
Hyun Kook Lee
Sung Yong Yoon
Dong Soo Kim
Jae Hyun Lim
Current Assignee
LG Electronics Inc
Original Assignee
LG Electronics Inc
Priority date
2008-03-04
Filing date
2009-03-04
Publication date
2009-09-11
Application filed by LG Electronics Inc
Publication of CA2717584A1
Application granted
Publication of CA2717584C
Legal status: Active

Classifications

    • G - PHYSICS
    • G11 - INFORMATION STORAGE
    • G11B - INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00 - Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/00007 - Time or data compression or expansion
    • G - PHYSICS
    • G10 - MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L - SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 - Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 - Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 - Vocoder architecture
    • G10L19/18 - Vocoders using multiple modes
    • G10L19/20 - Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • G - PHYSICS
    • G11 - INFORMATION STORAGE
    • G11B - INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00 - Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/00007 - Time or data compression or expansion
    • G11B2020/00014 - Time or data compression or expansion the compressed signal being an audio signal

Abstract

An apparatus for processing an encoded signal and a method thereof are disclosed, by which an audio signal can be compressed and reconstructed with higher efficiency. The audio signal processing method includes the steps of: identifying whether a coding type of the audio signal is a music signal coding type using first type information; if the coding type is not the music signal coding type, identifying whether it is a speech signal coding type or a mixed signal coding type using second type information; if the coding type is the mixed signal coding type, extracting spectral data and a linear predictive coefficient from the audio signal; generating a residual signal for linear prediction by performing inverse frequency conversion on the spectral data; reconstructing the audio signal by performing linear prediction coding on the linear predictive coefficient and the residual signal; and reconstructing a high frequency region signal using band extension information and an extension base signal corresponding to a partial region of the reconstructed audio signal. Accordingly, various kinds of audio signals can be encoded and decoded with higher efficiency.
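The decoding flow described in the abstract can be illustrated with a short Python sketch. It is not taken from the patent: the names `CodingType`, `identify_coding_type` and `lp_synthesis` are hypothetical, the bit values assigned to the first and second type information are assumptions, and the sign convention of the synthesis filter is only one common choice. The inverse frequency conversion and the band extension step are left out for brevity.

```python
from enum import Enum


class CodingType(Enum):
    MUSIC = "music"
    SPEECH = "speech"
    MIXED = "mixed"


def identify_coding_type(first_type_info, second_type_info=None):
    """Two-stage decision: first type info, then second type info.

    Assumed bit meanings (hypothetical, not stated in the source):
    first_type_info == 1 means music; otherwise second_type_info
    selects speech (0) or mixed (1).
    """
    if first_type_info == 1:
        return CodingType.MUSIC
    if second_type_info is None:
        raise ValueError("second type information is needed for non-music frames")
    return CodingType.SPEECH if second_type_info == 0 else CodingType.MIXED


def lp_synthesis(residual, lpc):
    """Rebuild a signal from a residual with an all-pole filter.

    Assumed convention: s[n] = e[n] + sum_k lpc[k] * s[n - 1 - k].
    """
    out = []
    for n, e in enumerate(residual):
        acc = e
        for k, a in enumerate(lpc):
            if n - 1 - k >= 0:
                acc += a * out[n - 1 - k]
        out.append(acc)
    return out
```

For instance, `lp_synthesis([1.0, 0.0, 0.0], [0.5])` returns `[1.0, 0.5, 0.25]`: a single residual pulse decays through the one-tap predictor. In the mixed coding type, the residual passed to such a filter would first be obtained from the spectral data by the inverse frequency conversion.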

Claims (15)

1. In an audio signal processing apparatus including an audio decoder, a method of processing an audio signal, comprising the steps of:

identifying whether a coding type of the audio signal is a music signal coding type using first type information;

if the coding type of the audio signal is not the music signal coding type, identifying whether the coding type of the audio signal is a speech signal coding type or a mixed signal coding type using second type information;

if the coding type of the audio signal is the mixed signal coding type, extracting spectral data and a linear predictive coefficient from the audio signal;

generating a residual signal for linear prediction by performing inverse frequency conversion on the spectral data;

reconstructing the audio signal by performing linear prediction coding on the linear predictive coefficient and the residual signal; and

reconstructing a high frequency region signal using an extension base signal corresponding to a partial region of the reconstructed audio signal and band extension information.
2. The method of claim 1, wherein the audio signal includes a plurality of subframes and wherein the second type information exists by a unit of the subframe.
3. The method of claim 1, wherein a bandwidth of the high frequency region signal is not equal to that of the extension base signal.
4. The method of claim 1, wherein the band extension information includes at least one of a filter range applied to the reconstructed audio signal, a start frequency of the extension base signal and an end frequency of the extension base signal.
5. The method of claim 1, wherein if the coding type of the audio signal is the music signal coding type, the audio signal comprises a frequency-domain signal, wherein if the coding type of the audio signal is the speech signal coding type, the audio signal comprises a time-domain signal, and wherein if the coding type of the audio signal is the mixed signal coding type, the audio signal comprises an MDCT-domain signal.
6. The method of claim 1, wherein the linear predictive coefficient extracting step comprises the steps of:

extracting a linear predictive coefficient mode; and

extracting the linear predictive coefficient having a variable bit size corresponding to the extracted linear predictive coefficient mode.
7. An apparatus for processing an audio signal, comprising:

a demultiplexer extracting first type information and second type information from a bitstream;

a decoder determining unit identifying whether a coding type of the audio signal is a music signal coding type using first type information, the decoder, if the coding type of the audio signal is not the music signal coding type, identifying whether the coding type of the audio signal is a speech signal coding type or a mixed signal coding type using second type information, the decoder then determining a decoding scheme;

an information extracting unit, if the coding type of the audio signal is the mixed signal coding type, extracting spectral data and a linear predictive coefficient from the audio signal;

a frequency transforming unit generating a residual signal for linear prediction by performing inverse frequency conversion on the spectral data;

a linear prediction unit reconstructing the audio signal by performing linear prediction coding on the linear predictive coefficient and the residual signal; and

a bandwidth extension decoding unit reconstructing a high frequency region signal using an extension base signal corresponding to a partial region of the reconstructed audio signal and band extension information.
8. The apparatus of claim 7, wherein the audio signal includes a plurality of subframes and wherein the second type information exists by a unit of the subframe.
9. The apparatus of claim 7, wherein a bandwidth of the high frequency region signal is not equal to that of the extension base signal.
10. The apparatus of claim 7, wherein the band extension information includes at least one of a filter range applied to the reconstructed audio signal, a start frequency of the extension base signal and an end frequency of the extension base signal.
11. The apparatus of claim 7, wherein if the coding type of the audio signal is the music signal coding type, the audio signal comprises a frequency-domain signal, wherein if the coding type of the audio signal is the speech signal coding type, the audio signal comprises a time-domain signal, and wherein if the coding type of the audio signal is the mixed signal coding type, the audio signal comprises an MDCT-domain signal.
12. The apparatus of claim 7, wherein the linear predictive coefficient extracting comprises:

extracting a linear predictive coefficient mode; and

extracting the linear predictive coefficient having a variable bit size corresponding to the extracted linear predictive coefficient mode.
13. In an audio signal processing apparatus including an audio coder for processing an audio signal, a method of processing the audio signal, comprising the steps of:

removing a high frequency band signal of the audio signal and generating band extension information for reconstructing the high frequency band signal;

determining a coding type of the audio signal;

if the audio signal is a music signal, generating first type information indicating that the audio signal is coded into a music signal coding type;

if the audio signal is not the music signal, generating second type information indicating that the audio signal is coded into either a speech signal coding type or a mixed signal coding type;

if the coding type of the audio signal is the mixed signal coding type, generating a linear predictive coefficient by performing linear prediction coding on the audio signal;

generating a residual signal for the linear prediction coding;

generating a spectral coefficient by frequency-transforming the residual signal; and

generating an audio bitstream including the first type information, the second type information, the linear predictive coefficient and the residual signal.
14. An apparatus for processing an audio signal, comprising:

a bandwidth preprocessing unit removing a high frequency band signal of the audio signal, the bandwidth preprocessing unit generating band extension information for reconstructing the high frequency band signal;

a signal classifying unit determining a coding type of the audio signal, the signal classifying unit, if the audio signal is a music signal, generating first type information indicating that the audio signal is coded into a music signal coding type, the signal classifying unit, if the audio signal is not the music signal, generating second type information indicating that the audio signal is coded into either a speech signal coding type or a mixed signal coding type;

a linear prediction modeling unit, if the coding type of the audio signal is the mixed signal coding type, generating a linear predictive coefficient by performing linear prediction coding on the audio signal;

a residual signal extracting unit generating a residual signal for the linear prediction coding; and

a frequency transforming unit generating a spectral coefficient by frequency-transforming the residual signal.
15. The apparatus of claim 14, wherein the audio signal includes a plurality of subframes and wherein the second type information is generated per the subframe.
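
The encoder-side steps of claims 13 and 14 (generating type information, deriving a linear predictive coefficient, and forming the residual) can be sketched in Python as follows. This is an illustration under stated assumptions, not the patented implementation: the function names, the bit values in `type_information`, and the choice of the autocorrelation method with the Levinson-Durbin recursion are all assumptions, since the claims do not name a specific estimation method. The frequency transform of the residual and the bitstream packing are omitted.

```python
def type_information(signal_class):
    """Return (first_type_info, second_type_info) for one subframe.

    Bit values are hypothetical: first = 1 marks music; otherwise
    second = 0 marks speech and second = 1 marks mixed.
    """
    if signal_class == "music":
        return (1, None)
    if signal_class == "speech":
        return (0, 0)
    if signal_class == "mixed":
        return (0, 1)
    raise ValueError("unknown signal class: " + str(signal_class))


def autocorrelation(x, max_lag):
    """Autocorrelation r[0..max_lag] of one frame."""
    return [sum(x[n] * x[n - lag] for n in range(lag, len(x)))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Predictor coefficients a[k] with s_hat[n] = sum_k a[k] * s[n - 1 - k].

    Expects len(r) > order.
    """
    a = [0.0] * order
    err = r[0]
    for i in range(order):
        if err == 0.0:
            break  # silent or perfectly predicted frame
        acc = r[i + 1] - sum(a[j] * r[i - j] for j in range(i))
        k = acc / err  # reflection coefficient
        new_a = a[:]
        new_a[i] = k
        for j in range(i):
            new_a[j] = a[j] - k * a[i - 1 - j]
        a = new_a
        err *= 1.0 - k * k
    return a


def lp_residual(signal, lpc):
    """Prediction error e[n] = s[n] - sum_k lpc[k] * s[n - 1 - k]."""
    res = []
    for n, s in enumerate(signal):
        pred = sum(a * signal[n - 1 - k]
                   for k, a in enumerate(lpc) if n - 1 - k >= 0)
        res.append(s - pred)
    return res
```

With the autocorrelation sequence `[1.0, 0.5, 0.25]`, `levinson_durbin` returns `[0.5, 0.0]`, and `lp_residual([1.0, 0.5, 0.25], [0.5])` returns `[1.0, 0.0, 0.0]`. Because claim 15 generates the second type information per subframe, `type_information` would be called once for each subframe of a frame.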

CA2717584A (priority date 2008-03-04, filing date 2009-03-04): Method and apparatus for processing an audio signal. Status: Active. Publication: CA2717584C (en).

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US3371508P 2008-03-04 2008-03-04
US61/033,715 2008-03-04
US7876208P 2008-07-07 2008-07-07
US61/078,762 2008-07-07
PCT/KR2009/001081 WO2009110751A2 (en) 2008-03-04 2009-03-04 Method and apparatus for processing an audio signal

Publications (2)

Publication Number Publication Date
CA2717584A1 (en) 2009-09-11
CA2717584C (en) 2015-05-12

Family

ID=41056476

Family Applications (1)

Application Number Status Publication Priority Date Filing Date Title
CA2717584A Active CA2717584C (en) 2008-03-04 2009-03-04 Method and apparatus for processing an audio signal

Country Status (10)

Country Link
US (1) US8135585B2 (en)
EP (1) EP2259254B1 (en)
JP (1) JP5108960B2 (en)
KR (1) KR20100134623A (en)
CN (1) CN102007534B (en)
AU (1) AU2009220341B2 (en)
CA (1) CA2717584C (en)
ES (1) ES2464722T3 (en)
RU (1) RU2452042C1 (en)
WO (1) WO2009110751A2 (en)

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2198426A4 (en) * 2007-10-15 2012-01-18 Lg Electronics Inc A method and an apparatus for processing a signal
ES2439549T3 (en) * 2008-07-11 2014-01-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. An apparatus and a method for decoding an encoded audio signal
JP5232121B2 (en) * 2009-10-02 2013-07-10 株式会社東芝 Signal processing device
US8447617B2 (en) * 2009-12-21 2013-05-21 Mindspeed Technologies, Inc. Method and system for speech bandwidth extension
KR101826331B1 (en) 2010-09-15 2018-03-22 삼성전자주식회사 Apparatus and method for encoding and decoding for high frequency bandwidth extension
EP3249647B1 (en) 2010-12-29 2023-10-18 Samsung Electronics Co., Ltd. Apparatus and method for encoding for high-frequency bandwidth extension
CN102610231B (en) * 2011-01-24 2013-10-09 华为技术有限公司 Method and device for expanding bandwidth
CN103918247B (en) 2011-09-23 2016-08-24 数字标记公司 Intelligent mobile phone sensor logic based on background environment
CN103035248B (en) 2011-10-08 2015-01-21 华为技术有限公司 Encoding method and device for audio signals
LT2774145T (en) * 2011-11-03 2020-09-25 Voiceage Evs Llc Improving non-speech content for low rate celp decoder
CN102446509B (en) * 2011-11-22 2014-04-09 中兴通讯股份有限公司 Audio coding and decoding method for enhancing anti-packet loss capability and system thereof
WO2013147668A1 (en) * 2012-03-29 2013-10-03 Telefonaktiebolaget Lm Ericsson (Publ) Bandwidth extension of harmonic audio signal
SG10201608613QA (en) * 2013-01-29 2016-12-29 Fraunhofer Ges Forschung Decoder For Generating A Frequency Enhanced Audio Signal, Method Of Decoding, Encoder For Generating An Encoded Signal And Method Of Encoding Using Compact Selection Side Information
EP2830051A3 (en) * 2013-07-22 2015-03-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder, audio decoder, methods and computer program using jointly encoded residual signals
CN103413553B (en) 2013-08-20 2016-03-09 腾讯科技(深圳)有限公司 Audio coding method, audio-frequency decoding method, coding side, decoding end and system
CN103500580B (en) * 2013-09-23 2017-04-12 广东威创视讯科技股份有限公司 Audio mixing processing method and system
EP2863386A1 (en) 2013-10-18 2015-04-22 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio decoder, apparatus for generating encoded audio output data and methods permitting initializing a decoder
US9311639B2 (en) 2014-02-11 2016-04-12 Digimarc Corporation Methods, apparatus and arrangements for device to device communication
CN106256001B (en) 2014-02-24 2020-01-21 三星电子株式会社 Signal classification method and apparatus and audio encoding method and apparatus using the same
CN107424621B (en) * 2014-06-24 2021-10-26 华为技术有限公司 Audio encoding method and apparatus
CN104269173B (en) * 2014-09-30 2018-03-13 武汉大学深圳研究院 The audio bandwidth expansion apparatus and method of switch mode
CN107077849B (en) * 2014-11-07 2020-09-08 三星电子株式会社 Method and apparatus for restoring audio signal
CN106075728B (en) * 2016-08-22 2018-09-28 卢超 Music applied to electronic acupuncture apparatus modulates pulse acquisition methods
US10074378B2 (en) * 2016-12-09 2018-09-11 Cirrus Logic, Inc. Data encoding detection
CN115334349B (en) * 2022-07-15 2024-01-02 北京达佳互联信息技术有限公司 Audio processing method, device, electronic equipment and storage medium

Family Cites Families (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5742735A (en) * 1987-10-06 1998-04-21 Fraunhofer Gesellschaft Zur Forderung Der Angewanten Forschung E.V. Digital adaptive transformation coding method
NL9000338A (en) * 1989-06-02 1991-01-02 Koninkl Philips Electronics Nv DIGITAL TRANSMISSION SYSTEM, TRANSMITTER AND RECEIVER FOR USE IN THE TRANSMISSION SYSTEM AND RECORD CARRIED OUT WITH THE TRANSMITTER IN THE FORM OF A RECORDING DEVICE.
JPH04150522A (en) * 1990-10-15 1992-05-25 Sony Corp Digital signal processor
US5680508A (en) * 1991-05-03 1997-10-21 Itt Corporation Enhancement of speech coding in background noise for low-rate speech coder
US5371853A (en) * 1991-10-28 1994-12-06 University Of Maryland At College Park Method and system for CELP speech coding and codebook for use therewith
DE4202140A1 (en) * 1992-01-27 1993-07-29 Thomson Brandt Gmbh Digital audio signal transmission using sub-band coding - inserting extra fault protection signal, or fault protection bit into data frame
US5285498A (en) * 1992-03-02 1994-02-08 At&T Bell Laboratories Method and apparatus for coding audio signals based on perceptual model
IT1257065B (en) * 1992-07-31 1996-01-05 Sip LOW DELAY CODER FOR AUDIO SIGNALS, USING SYNTHESIS ANALYSIS TECHNIQUES.
US5579404A (en) * 1993-02-16 1996-11-26 Dolby Laboratories Licensing Corporation Digital audio limiter
DE4405659C1 (en) * 1994-02-22 1995-04-06 Fraunhofer Ges Forschung Method for the cascaded coding and decoding of audio data
EP0720316B1 (en) * 1994-12-30 1999-12-08 Daewoo Electronics Co., Ltd Adaptive digital audio encoding apparatus and a bit allocation method thereof
IT1281001B1 (en) * 1995-10-27 1998-02-11 Cselt Centro Studi Lab Telecom PROCEDURE AND EQUIPMENT FOR CODING, HANDLING AND DECODING AUDIO SIGNALS.
US5778335A (en) * 1996-02-26 1998-07-07 The Regents Of The University Of California Method and apparatus for efficient multiband celp wideband speech and music coding and decoding
US6061793A (en) * 1996-08-30 2000-05-09 Regents Of The University Of Minnesota Method and apparatus for embedding data, including watermarks, in human perceptible sounds
KR100261254B1 (en) * 1997-04-02 2000-07-01 윤종용 Scalable audio data encoding/decoding method and apparatus
JP3185748B2 (en) * 1997-04-09 2001-07-11 日本電気株式会社 Signal encoding device
CA2233896C (en) * 1997-04-09 2002-11-19 Kazunori Ozawa Signal coding system
ES2247741T3 (en) * 1998-01-22 2006-03-01 Deutsche Telekom Ag SIGNAL CONTROLLED SWITCHING METHOD BETWEEN AUDIO CODING SCHEMES.
JP3199020B2 (en) * 1998-02-27 2001-08-13 日本電気株式会社 Audio music signal encoding device and decoding device
US6424938B1 (en) * 1998-11-23 2002-07-23 Telefonaktiebolaget L M Ericsson Complex signal activity detection for improved speech/noise classification of an audio signal
SG98418A1 (en) * 2000-07-10 2003-09-19 Cyberinc Pte Ltd A method, a device and a system for compressing a musical and voice signal
US6658383B2 (en) * 2001-06-26 2003-12-02 Microsoft Corporation Method for coding speech and music signals
SE521600C2 (en) * 2001-12-04 2003-11-18 Global Ip Sound Ab Low bit rate codec
JP2003257125A (en) * 2002-03-05 2003-09-12 Seiko Epson Corp Sound reproducing method and sound reproducing device
EP1497631B1 (en) * 2002-04-22 2007-12-12 Nokia Corporation Generating lsf vectors
US8359197B2 (en) * 2003-04-01 2013-01-22 Digital Voice Systems, Inc. Half-rate vocoder
CN1898724A (en) * 2003-12-26 2007-01-17 松下电器产业株式会社 Voice/musical sound encoding device and voice/musical sound encoding method
KR100854534B1 (en) 2004-05-19 2008-08-26 노키아 코포레이션 Supporting a switch between audio coder modes
US7596486B2 (en) * 2004-05-19 2009-09-29 Nokia Corporation Encoding an audio signal using different audio coder modes
KR101171098B1 (en) 2005-07-22 2012-08-20 삼성전자주식회사 Scalable speech coding/decoding methods and apparatus using mixed structure
DE602006013359D1 (en) * 2006-09-13 2010-05-12 Ericsson Telefon Ab L M ENDER AND RECEIVERS
CN101965612B (en) * 2008-03-03 2012-08-29 Lg电子株式会社 Method and apparatus for processing a signal

Also Published As

Publication number Publication date
EP2259254B1 (en) 2014-04-30
CN102007534A (en) 2011-04-06
KR20100134623A (en) 2010-12-23
AU2009220341B2 (en) 2011-09-22
JP2011514558A (en) 2011-05-06
EP2259254A4 (en) 2013-02-20
ES2464722T3 (en) 2014-06-03
US20100070272A1 (en) 2010-03-18
JP5108960B2 (en) 2012-12-26
CN102007534B (en) 2012-11-21
AU2009220341A1 (en) 2009-09-11
EP2259254A2 (en) 2010-12-08
RU2010140365A (en) 2012-04-10
RU2452042C1 (en) 2012-05-27
CA2717584C (en) 2015-05-12
US8135585B2 (en) 2012-03-13
WO2009110751A2 (en) 2009-09-11
WO2009110751A3 (en) 2009-10-29

Legal Events

Date Code Title Description
EEER Examination request