WO2004066269A3 - Method and apparatus for speech reconstruction within a distributed speech recognition system - Google Patents

Method and apparatus for speech reconstruction within a distributed speech recognition system Download PDF

Info

Publication number
WO2004066269A3
WO2004066269A3 PCT/US2004/000871 US2004000871W WO2004066269A3 WO 2004066269 A3 WO2004066269 A3 WO 2004066269A3 US 2004000871 W US2004000871 W US 2004000871W WO 2004066269 A3 WO2004066269 A3 WO 2004066269A3
Authority
WO
WIPO (PCT)
Prior art keywords
speech
mfccs
recognition system
reconstruction
distributed
Prior art date
Application number
PCT/US2004/000871
Other languages
French (fr)
Other versions
WO2004066269A2 (en
Inventor
Tenkasi Ramabadran
Original Assignee
Motorola Inc
Tenkasi Ramabadran
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Motorola Inc, Tenkasi Ramabadran filed Critical Motorola Inc
Priority to KR1020057013048A priority Critical patent/KR101059640B1/en
Priority to EP04701832A priority patent/EP1588354B1/en
Priority to BRPI0406765-7A priority patent/BRPI0406765B1/en
Publication of WO2004066269A2 publication Critical patent/WO2004066269A2/en
Publication of WO2004066269A3 publication Critical patent/WO2004066269A3/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum

Abstract

A method and apparatus for speech reconstruction within a distributed speech recognition system is provided herein. Missing MFCCs are reconstructed (219) and utilized to generate speech (223) . Particularly, partial recovery of the missing MFCCs is achieved by exploiting the dependence of the missing MFCCs on the transmitted pitch period P (213) as well as on the transmitted MFCCs. Harmonic magnitudes are then obtained from the transmitted and reconstructed MFCCs, and the speech is reconstructed (223) utilizing these harmonic magnitudes.
PCT/US2004/000871 2003-01-14 2004-01-13 Method and apparatus for speech reconstruction within a distributed speech recognition system WO2004066269A2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
KR1020057013048A KR101059640B1 (en) 2003-01-14 2004-01-13 Method and apparatus for speech restoration in distributed speech recognition system
EP04701832A EP1588354B1 (en) 2003-01-14 2004-01-13 Method and apparatus for speech reconstruction
BRPI0406765-7A BRPI0406765B1 (en) 2003-01-14 2004-01-13 METHOD AND APPARATUS FOR SPEECH RECONSTRUCTION IN A DISTRIBUTED SPEECH RECOGNITION SYSTEM

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/341,726 2003-01-14
US10/341,726 US7027979B2 (en) 2003-01-14 2003-01-14 Method and apparatus for speech reconstruction within a distributed speech recognition system

Publications (2)

Publication Number Publication Date
WO2004066269A2 WO2004066269A2 (en) 2004-08-05
WO2004066269A3 true WO2004066269A3 (en) 2005-01-27

Family

ID=32711568

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2004/000871 WO2004066269A2 (en) 2003-01-14 2004-01-13 Method and apparatus for speech reconstruction within a distributed speech recognition system

Country Status (7)

Country Link
US (1) US7027979B2 (en)
EP (1) EP1588354B1 (en)
KR (1) KR101059640B1 (en)
CN (1) CN100371988C (en)
BR (1) BRPI0406765B1 (en)
RU (1) RU2366007C2 (en)
WO (1) WO2004066269A2 (en)

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7305339B2 (en) * 2003-04-01 2007-12-04 International Business Machines Corporation Restoration of high-order Mel Frequency Cepstral Coefficients
US8412526B2 (en) * 2003-04-01 2013-04-02 Nuance Communications, Inc. Restoration of high-order Mel frequency cepstral coefficients
US7386443B1 (en) 2004-01-09 2008-06-10 At&T Corp. System and method for mobile automatic speech recognition
JP2009501353A (en) * 2005-07-14 2009-01-15 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Audio signal synthesis
US20070191736A1 (en) * 2005-10-04 2007-08-16 Don Alden Method for loading penetrating members in a collection device
US7783488B2 (en) 2005-12-19 2010-08-24 Nuance Communications, Inc. Remote tracing and debugging of automatic speech recognition servers by speech reconstruction from cepstra and pitch information
KR100735343B1 (en) * 2006-04-11 2007-07-04 삼성전자주식회사 Apparatus and method for extracting pitch information of a speech signal
US8306817B2 (en) * 2008-01-08 2012-11-06 Microsoft Corporation Speech recognition with non-linear noise reduction on Mel-frequency cepstra
EP3273442B1 (en) * 2008-03-20 2021-10-20 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for synthesizing a parameterized representation of an audio signal
US9020816B2 (en) * 2008-08-14 2015-04-28 21Ct, Inc. Hidden markov model for speech processing with training method
US9767806B2 (en) * 2013-09-24 2017-09-19 Cirrus Logic International Semiconductor Ltd. Anti-spoofing
US20100174539A1 (en) * 2009-01-06 2010-07-08 Qualcomm Incorporated Method and apparatus for vector quantization codebook search
KR101712101B1 (en) * 2010-01-28 2017-03-03 삼성전자 주식회사 Signal processing method and apparatus
US8595005B2 (en) * 2010-05-31 2013-11-26 Simple Emotion, Inc. System and method for recognizing emotional state from a speech signal
CN104766608A (en) * 2014-01-07 2015-07-08 深圳市中兴微电子技术有限公司 Voice control method and voice control device
US9549068B2 (en) 2014-01-28 2017-01-17 Simple Emotion, Inc. Methods for adaptive voice interaction
RU2610285C1 (en) * 2016-02-15 2017-02-08 федеральное государственное казенное военное образовательное учреждение высшего образования "Военная академия связи имени Маршала Советского Союза С.М. Буденного" Министерства обороны Российской Федерации Method of detecting low-rate encoding protocols
CN106847280B (en) * 2017-02-23 2020-09-15 海信集团有限公司 Audio information processing method, intelligent terminal and voice control terminal
CN106856093A (en) * 2017-02-23 2017-06-16 海信集团有限公司 Audio-frequency information processing method, intelligent terminal and Voice command terminal
CN107527611A (en) * 2017-08-23 2017-12-29 武汉斗鱼网络科技有限公司 MFCC audio recognition methods, storage medium, electronic equipment and system
RU2667462C1 (en) * 2017-10-24 2018-09-19 федеральное государственное казенное военное образовательное учреждение высшего образования "Военная академия связи имени Маршала Советского Союза С.М. Буденного" Министерства обороны Российской Федерации Method of recognizing low-speed speech coding protocols
CN109616129B (en) * 2018-11-13 2021-07-30 南京南大电子智慧型服务机器人研究院有限公司 Mixed multi-description sinusoidal coder method for improving voice frame loss compensation performance
US11227579B2 (en) * 2019-08-08 2022-01-18 International Business Machines Corporation Data augmentation by frame insertion for speech data
CN111199747A (en) * 2020-03-05 2020-05-26 北京花兰德科技咨询服务有限公司 Artificial intelligence communication system and communication method
RU2748935C1 (en) * 2020-09-03 2021-06-01 федеральное государственное казенное военное образовательное учреждение высшего образования "Военная академия связи имени Маршала Советского Союза С.М. Буденного" Министерства обороны Российской Федерации Method of recognition of new low bit rate coding protocols

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002062120A2 (en) * 2001-02-02 2002-08-15 Motorola, Inc. Method and apparatus for speech reconstruction in a distributed speech recognition system

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5745874A (en) * 1996-03-04 1998-04-28 National Semiconductor Corporation Preprocessor for automatic speech recognition system
FR2766604B1 (en) * 1997-07-22 1999-10-01 France Telecom METHOD AND DEVICE FOR BLIND EQUALIZATION OF THE EFFECTS OF A TRANSMISSION CHANNEL ON A DIGITAL SPOKEN SIGNAL
US6076058A (en) * 1998-03-02 2000-06-13 Lucent Technologies Inc. Linear trajectory models incorporating preprocessing parameters for speech recognition
FI19992350A (en) * 1999-10-29 2001-04-30 Nokia Mobile Phones Ltd Improved voice recognition
GB2355834A (en) * 1999-10-29 2001-05-02 Nokia Mobile Phones Ltd Speech recognition

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002062120A2 (en) * 2001-02-02 2002-08-15 Motorola, Inc. Method and apparatus for speech reconstruction in a distributed speech recognition system

Also Published As

Publication number Publication date
CN1739143A (en) 2006-02-22
BRPI0406765B1 (en) 2018-08-07
WO2004066269A2 (en) 2004-08-05
EP1588354A2 (en) 2005-10-26
RU2366007C2 (en) 2009-08-27
EP1588354A4 (en) 2006-03-01
KR101059640B1 (en) 2011-08-25
US20040138888A1 (en) 2004-07-15
EP1588354B1 (en) 2011-08-24
US7027979B2 (en) 2006-04-11
CN100371988C (en) 2008-02-27
RU2005125737A (en) 2006-01-10
BRPI0406765A (en) 2005-12-20
KR20050092112A (en) 2005-09-20

Similar Documents

Publication Publication Date Title
WO2004066269A3 (en) Method and apparatus for speech reconstruction within a distributed speech recognition system
EP1924531B8 (en) Ammonium/ammonia removal from a stream
NO20050268L (en) Procedure for purifying Fischer-Tropsch-extracted water
MY144376A (en) Method for recovery of carbon dioxide from a gas
WO2001018789A8 (en) Formant tracking in speech signal with probability models
WO2004012055A3 (en) System and method for musical sonification of data
EP1376584A3 (en) System and method for automatically generating video cliplets from digital video
NO20050251L (en) Procedure for purifying Fischer-Tropsch-extracted water
MY145597A (en) Method and apparatus for representing image granularity by one or more parameters
WO2006111401A3 (en) A technique for platform-independent service modeling
ATE542283T1 (en) DEVICE FOR MECHANICAL ENERGY RECOVERY WITH VARIABLE STIFFNESS
HK1114901A1 (en) Systems, methods, and apparatus for highband excitation generation
UA94041C2 (en) Method and device for anti-sparseness filtering
EP0955628A3 (en) A method of and a device for speech recognition employing neural network and Markov model recognition techniques
WO2006096728A3 (en) System and method for ranging
DK1633779T3 (en) Oxidoreductase from Pichia capsulata
AU2003285633A1 (en) Generator for use in wind turbines or water-powered wheels
GB0308407D0 (en) Method of obtaining 68 GA
EP1338565A3 (en) Free Radical Generator and method for water treatment
WO2006056980A3 (en) Method and accessory for preparing a dental crown or bridge
WO2006105103A3 (en) An expandable gas or fluid distribution system
AU2003262278A1 (en) Method of generating hydrogen gas, hydrogen gas production apparatus and energy conversion system
FR2838844B1 (en) METHOD FOR GENERATING A PERFORMANCE MODEL FROM A FUNCTIONAL MODEL
WO2006018295A3 (en) Nanotransport system having a dendritic architecture
WO2004047425A3 (en) Apparatus and method for multiple description encoding

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 20048021854

Country of ref document: CN

Ref document number: 1020057013048

Country of ref document: KR

WWE Wipo information: entry into national phase

Ref document number: 2004701832

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2005125737

Country of ref document: RU

Kind code of ref document: A

WWP Wipo information: published in national office

Ref document number: 1020057013048

Country of ref document: KR

WWP Wipo information: published in national office

Ref document number: 2004701832

Country of ref document: EP

ENP Entry into the national phase

Ref document number: PI0406765

Country of ref document: BR