WO2002023533A3 - System for improved use of pitch enhancement with subcodebooks - Google Patents

System for improved use of pitch enhancement with subcodebooks Download PDF

Info

Publication number
WO2002023533A3
WO2002023533A3 PCT/IB2001/001735 IB0101735W WO0223533A3 WO 2002023533 A3 WO2002023533 A3 WO 2002023533A3 IB 0101735 W IB0101735 W IB 0101735W WO 0223533 A3 WO0223533 A3 WO 0223533A3
Authority
WO
WIPO (PCT)
Prior art keywords
speech
rate
codec
rate codec
subcodebooks
Prior art date
Application number
PCT/IB2001/001735
Other languages
French (fr)
Other versions
WO2002023533A2 (en
Inventor
Yang Gao
Original Assignee
Conexant Systems Inc
Yang Gao
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Conexant Systems Inc, Yang Gao filed Critical Conexant Systems Inc
Priority to AU2001287973A priority Critical patent/AU2001287973A1/en
Publication of WO2002023533A2 publication Critical patent/WO2002023533A2/en
Publication of WO2002023533A3 publication Critical patent/WO2002023533A3/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0004Design or structure of the codebook
    • G10L2019/0005Multi-stage vector quantisation

Abstract

A speech compression system capable of encoding a speech signal into a bitstream for subsequent decoding to generate synthesized speech is disclosed. The speech compression system optimizes the bandwidth consumed by the bitstream by balancing the desired average bit rate with the perceptual quality of the reconstructed speech. The speech compression system comprises a full-rate codec, a half-rate codec, a quarter-rate codec and an eighth-rate codec. The codecs are selectively activated based on a rate selection. In addition, the full and half-rate codec are selectively activated based on a type classification. Each codec is selectively activated to encode and decode the speech signals at different bit rates emphasizing different aspects of the speech signal to enhance overall quality of the synthesized speech. The overall quality of the system is strongly related to the excitation. In order to enhance the excitation, the system contains a fixed codebook comprising several subcodebooks. The invention reveals a way to apply a pitch enhancement efficiently and differently for different subcodebooks without using additional bits. The technique is particularly applicable to selectable mode vocoder (SMV) systems.
PCT/IB2001/001735 2000-09-15 2001-09-17 System for improved use of pitch enhancement with subcodebooks WO2002023533A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2001287973A AU2001287973A1 (en) 2000-09-15 2001-09-17 System for improved use of pitch enhancement with subcodebooks

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US23293800P 2000-09-15 2000-09-15
US60/232,938 2000-09-15

Publications (2)

Publication Number Publication Date
WO2002023533A2 WO2002023533A2 (en) 2002-03-21
WO2002023533A3 true WO2002023533A3 (en) 2002-08-15

Family

ID=22875191

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2001/001735 WO2002023533A2 (en) 2000-09-15 2001-09-17 System for improved use of pitch enhancement with subcodebooks

Country Status (2)

Country Link
AU (1) AU2001287973A1 (en)
WO (1) WO2002023533A2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FI119955B (en) * 2001-06-21 2009-05-15 Nokia Corp Method, encoder and apparatus for speech coding in an analysis-through-synthesis speech encoder

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2000011657A1 (en) * 1998-08-24 2000-03-02 Conexant Systems, Inc. Completed fixed codebook for speech encoder

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2000011657A1 (en) * 1998-08-24 2000-03-02 Conexant Systems, Inc. Completed fixed codebook for speech encoder
US6173257B1 (en) * 1998-08-24 2001-01-09 Conexant Systems, Inc Completed fixed codebook for speech encoder

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
MCCLELLAN S ET AL: "EFFICIENT PITCH FILTER ENCODING FOR VARIABLE RATE SPEECH PROCESSING", IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, IEEE INC. NEW YORK, US, vol. 7, no. 1, January 1999 (1999-01-01), pages 18 - 29, XP000890821, ISSN: 1063-6676 *

Also Published As

Publication number Publication date
AU2001287973A1 (en) 2002-03-26
WO2002023533A2 (en) 2002-03-21

Similar Documents

Publication Publication Date Title
US10438601B2 (en) Method and arrangement for controlling smoothing of stationary background noise
US8224657B2 (en) Method and device for efficient in-band dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for CDMA wireless systems
AU7486200A (en) Multimode speech encoder
US7203638B2 (en) Method for interoperation between adaptive multi-rate wideband (AMR-WB) and multi-mode variable bit-rate wideband (VMR-WB) codecs
US7778827B2 (en) Method and device for gain quantization in variable bit rate wideband speech coding
AU2001287969A1 (en) Codebook structure and search for speech coding
US8630864B2 (en) Method for switching rate and bandwidth scalable audio decoding rate
US7657427B2 (en) Methods and devices for source controlled variable bit-rate wideband speech coding
US7191125B2 (en) Method and apparatus for high performance low bit-rate coding of unvoiced speech
CN101322181B (en) Effective speech stream conversion method and device
CA2566489A1 (en) Supporting a switch between audio coder modes
JP2003525473A (en) Closed-loop multimode mixed-domain linear prediction speech coder
US6980948B2 (en) System of dynamic pulse position tracks for pulse-like excitation in speech coding
CN101622667B (en) Postfilter for layered codecs
Sinder et al. Recent speech coding technologies and standards
Krishnan et al. EVRC-Wideband: the new 3GPP2 wideband vocoder standard
WO2002023533A3 (en) System for improved use of pitch enhancement with subcodebooks
KR20010087393A (en) Closed-loop variable-rate multimode predictive speech coder
US7133823B2 (en) System for an adaptive excitation pattern for speech coding
EP1808852A1 (en) Method of interoperation between adaptive multi-rate wideband (AMR-WB) and multi-mode variable bit-rate wideband (VMR-WB) codecs
Gibson Speech coding for wireless communications
Wang et al. Transcoding Scheme between AMR-WB and VMR-WB
CA2491623C (en) Method and device for efficient in-band dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for cdma wireless systems
Erdmann et al. Embedded speech coding based on pyramid CELP
Woodard et al. A Range of Low and High Delay CELP Speech Codecs between 8 and 4 kbits/s

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PH PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP