EP0874352A3 - Voice activity detection

Info

Publication number
EP0874352A3
EP0874352A3 EP98102842A EP98102842A EP0874352A3 EP 0874352 A3 EP0874352 A3 EP 0874352A3 EP 98102842 A EP98102842 A EP 98102842A EP 98102842 A EP98102842 A EP 98102842A EP 0874352 A3 EP0874352 A3 EP 0874352A3
Authority
EP
European Patent Office
Prior art keywords
speech
activity identification
voice activity
activity detection
controlling
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP98102842A
Other languages
German (de)
French (fr)
Other versions
EP0874352A2 (en)
EP0874352B1 (en)
Inventor
Joachim Dipl.-Ing. Stegmann
Gerhard Dipl.-Ing. Schröder
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Deutsche Telekom AG
Original Assignee
Deutsche Telekom AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Deutsche Telekom AG
Publication of EP0874352A2
Publication of EP0874352A3
Application granted
Publication of EP0874352B1
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique

Abstract

The voice activity detection method segments the speech signal into frames and calculates a wavelet transform for each frame, from which a set of parameters is extracted. A set of decision variables is provided to control a decision logic, which outputs a signal indicating whether or not speech is present. The method is used by a voice activity detection module (5) that controls a speech coder (7), a speech decoder (22), a background noise coder (10) and a background noise decoder (23).
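The processing chain described in the abstract (framing, per-frame wavelet transform, parameter extraction, decision variables, decision logic) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the Haar wavelet, per-band mean energies as the parameters, a single fixed threshold per band as the decision variables, a simple "any band active" rule as the decision logic, and the names `haar_dwt`, `frame_parameters`, `detect_speech` and `select_coder`.

```python
import math
from typing import List, Sequence


def haar_dwt(frame: Sequence[float], levels: int) -> List[List[float]]:
    """Multi-level Haar DWT; returns [detail_1, ..., detail_L, approx_L]."""
    approx = list(frame)
    scale = 1.0 / math.sqrt(2.0)
    bands = []
    for _ in range(levels):
        half = len(approx) // 2
        detail = [(approx[2 * i] - approx[2 * i + 1]) * scale for i in range(half)]
        approx = [(approx[2 * i] + approx[2 * i + 1]) * scale for i in range(half)]
        bands.append(detail)
    bands.append(approx)
    return bands


def frame_parameters(frame: Sequence[float], levels: int = 3) -> List[float]:
    """Assumed parameter set: mean energy of each wavelet band."""
    return [sum(c * c for c in band) / len(band) for band in haar_dwt(frame, levels)]


def detect_speech(signal: Sequence[float], frame_len: int = 64,
                  levels: int = 3, threshold: float = 1e-3) -> List[bool]:
    """Return one speech/no-speech flag per complete frame of the signal."""
    if frame_len % (2 ** levels) != 0:
        raise ValueError("frame_len must be divisible by 2**levels")
    flags = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        params = frame_parameters(signal[start:start + frame_len], levels)
        # Assumed decision variables: one binary flag per band.
        decision_vars = [p > threshold for p in params]
        # Assumed decision logic: speech if any band is active.
        flags.append(any(decision_vars))
    return flags


def select_coder(speech_present: bool) -> str:
    """Route a frame to the speech coder or the background noise coder."""
    return "speech_coder" if speech_present else "background_noise_coder"


if __name__ == "__main__":
    silence = [0.0] * 64
    tone = [0.5 * math.sin(2 * math.pi * 8 * n / 64) for n in range(64)]
    for flag in detect_speech(silence + tone + silence):
        print(flag, select_coder(flag))
```

A practical detector would adapt its thresholds to the background noise level and smooth the per-frame flags over time; the fixed threshold here only keeps the example short.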
EP98102842A (priority date 1997-04-22, filing date 1998-02-19): Voice activity detection, Expired - Lifetime, EP0874352B1 (en)

Applications Claiming Priority (2)

Application Number | Priority Date | Filing Date | Title
DE19716862A (DE19716862A1, en) | 1997-04-22 | 1997-04-22 | Voice activity detection
DE19716862 | 1997-04-22 | |

Publications (3)

Publication Number | Publication Date
EP0874352A2 (en) | 1998-10-28
EP0874352A3 (en) | 1999-06-02
EP0874352B1 (en) | 2003-10-15

Family

ID=7827317

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
EP98102842A (EP0874352B1, en; Expired - Lifetime) | Voice activity detection | 1997-04-22 | 1998-02-19

Country Status (4)

Country | Link
US (1) | US6374211B2 (en)
EP (1) | EP0874352B1 (en)
AT (1) | ATE252265T1 (en)
DE (2) | DE19716862A1 (en)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
DE10026904A1 (en) * | 2000-04-28 | 2002-01-03 | Deutsche Telekom Ag | Calculating gain for encoded speech transmission by dividing into signal sections and determining weighting factor from periodicity and stationarity
WO2001084536A1 (en) | 2000-04-28 | 2001-11-08 | Deutsche Telekom Ag | Method for detecting a voice activity decision (voice activity detector)
US7505594B2 (en) * | 2000-12-19 | 2009-03-17 | Qualcomm Incorporated | Discontinuous transmission (DTX) controller system and method
US6725191B2 (en) * | 2001-07-19 | 2004-04-20 | Vocaltec Communications Limited | Method and apparatus for transmitting voice over internet
US8315865B2 (en) * | 2004-05-04 | 2012-11-20 | Hewlett-Packard Development Company, L.P. | Method and apparatus for adaptive conversation detection employing minimal computation
US7574353B2 (en) * | 2004-11-18 | 2009-08-11 | Lsi Logic Corporation | Transmit/receive data paths for voice-over-internet (VoIP) communication systems
US7983922B2 (en) * | 2005-04-15 | 2011-07-19 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Apparatus and method for generating multi-channel synthesizer control signal and apparatus and method for multi-channel synthesizing
KR100655953B1 (en) * | 2006-02-06 | 2006-12-11 | 한양대학교 산학협력단 | Speech processing system and method using wavelet packet transform
US7680657B2 (en) * | 2006-08-15 | 2010-03-16 | Microsoft Corporation | Auto segmentation based partitioning and clustering approach to robust endpointing
KR100789084B1 (en) | 2006-11-21 | 2007-12-26 | 한양대학교 산학협력단 | Speech enhancement method by overweighting gain with nonlinear structure in wavelet packet transform
US9361883B2 (en) * | 2012-05-01 | 2016-06-07 | Microsoft Technology Licensing, Llc | Dictation with incremental recognition of speech
CN104019885A (en) | 2013-02-28 | 2014-09-03 | 杜比实验室特许公司 | Sound field analysis system
EP2974253B1 (en) | 2013-03-15 | 2019-05-08 | Dolby Laboratories Licensing Corporation | Normalization of soundfield orientations based on auditory scene analysis
US10917611B2 (en) | 2015-06-09 | 2021-02-09 | Avaya Inc. | Video adaptation in conferencing using power or view indications
WO2020252782A1 (en) * | 2019-06-21 | 2020-12-24 | 深圳市汇顶科技股份有限公司 | Voice detection method, voice detection device, voice processing chip and electronic apparatus

Citations (2)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US5459814A (en) * | 1993-03-26 | 1995-10-17 | Hughes Aircraft Company | Voice activity detector for speech signals in variable background noise
EP0751495A2 (en) * | 1995-06-30 | 1997-01-02 | Deutsche Telekom AG | Method and device for coding speech

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US5152007A (en) * | 1991-04-23 | 1992-09-29 | Motorola, Inc. | Method and apparatus for detecting speech
GB2272554A (en) * | 1992-11-13 | 1994-05-18 | Creative Tech Ltd | Recognizing speech by using wavelet transform and transient response therefrom
US5388182A (en) * | 1993-02-16 | 1995-02-07 | Prometheus, Inc. | Nonlinear method and apparatus for coding and decoding acoustic signals with data compression and noise suppression using cochlear filters, wavelet analysis, and irregular sampling reconstruction
JP3090842B2 (en) * | 1994-04-28 | 2000-09-25 | 沖電気工業株式会社 | Transmitter adapted to Viterbi decoding method
FR2727236B1 (en) * | 1994-11-22 | 1996-12-27 | Alcatel Mobile Comm France | DETECTION OF VOICE ACTIVITY
US5822726A (en) * | 1995-01-31 | 1998-10-13 | Motorola, Inc. | Speech presence detector based on sparse time-random signal samples
DE19538852A1 (en) * | 1995-06-30 | 1997-01-02 | Deutsche Telekom Ag | Method and arrangement for classifying speech signals
CA2188369C (en) * | 1995-10-19 | 2005-01-11 | Joachim Stegmann | Method and an arrangement for classifying speech signals

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
"Digital cellular telecommunications system; Discontinuous Transmission (DTX) for Enhanced Full Rate (EFR) speech traffic channels (GSM 06.81)", EUROPEAN TELECOMMUNICATION STANDARD, FINAL DRAFT PRETS 300 729, November 1996 (1996-11-01), European Telecommunications Standards Institute (ETSI), XP002098616 *
BENYASSINE A ET AL: "ITU-T RECOMMENDATION G.729 ANNEX B: A SILENCE COMPRESSION SCHEME FOR USE WITH G.729 OPTIMIZED FOR V.70 DIGITAL SIMULTANEOUS VOICE AND DATA APPLICATIONS", IEEE COMMUNICATIONS MAGAZINE, vol. 35, no. 9, September 1997 (1997-09-01), pages 64 - 73, XP000704425 *
STEGMANN J ET AL: "ROBUST VOICE-ACTIVITY DETECTION BASED ON THE WAVELET TRANSFORM", PROCEEDINGS OF THE IEEE WORKSHOP ON SPEECH CODING FOR TELECOMMUNICATIONS, 7 September 1997 (1997-09-07), pages 99 - 100, XP002073237 *

Also Published As

Publication number | Publication date
EP0874352A2 (en) | 1998-10-28
US20010014854A1 (en) | 2001-08-16
US6374211B2 (en) | 2002-04-16
ATE252265T1 (en) | 2003-11-15
DE59809897D1 (en) | 2003-11-20
DE19716862A1 (en) | 1998-10-29
EP0874352B1 (en) | 2003-10-15

Similar Documents

Publication Publication Date Title
EP0874352A3 (en) Voice activity detection
WO1995028824A3 (en) Method of encoding a signal containing speech
AU2001266278A1 (en) A speech communication system and method for handling lost frames
CA2228948A1 (en) Pattern recognition
EP1083542A3 (en) A method and apparatus for speech detection
CA2343661A1 (en) Method and apparatus for improving the intelligibility of digitally compressed speech
DE69926821D1 (en) Method for signal-controlled switching between different audio coding systems
EP0770989A3 (en) Speech encoding method and apparatus
CA2124643A1 (en) Method and Device for Speech Signal Pitch Period Estimation and Classification in Digital Speech Coders
MY124630A (en) Complex signal activity detection for improved speech/noise classification of an audio signal
CA2177422A1 (en) Voice/Unvoiced Classification of Speech for Use in Speech Decoding During Frame Erasures
WO1999016052A3 (en) Speech recognition system for recognizing continuous and isolated speech
CA2210490A1 (en) Spectral subtraction noise suppression method
CA2158849A1 (en) Speech Recognition with Pause Detection
HK40596A (en) Optimal method of data reduction in a speech recognition system
GB2308483A (en) Method and system for recognizing a boundary beween sounds in continuous speech
GB2307582A (en) System for recognizing spoken sounds from continuous speech and method of using same
CA2188369A1 (en) Method and an arrangement for classifying speech signals
EP0651521A3 (en) Discriminating signal noise from received signals
EP1093112A3 (en) A method for generating speech feature signals and an apparatus for carrying through this method
EP1145221A3 (en) A method and apparatus for determining speech coding parameters
EP0233718B1 (en) Speech processing apparatus and methods
GR3032375T3 (en) Speech recognition based on HMMs.
CA2315324A1 (en) Speech signal decoding method and apparatus
EP0817167A3 (en) Speech recognition method and device for carrying out the method

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

AX Request for extension of the european patent

Free format text: AL;LT;LV;MK;RO;SI

17P Request for examination filed

Effective date: 19991202

AKX Designation fees paid

Free format text: AT BE CH DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

RIC1 Information provided on ipc code assigned before grant

Ipc: 7G 10L 11/02 A

RIC1 Information provided on ipc code assigned before grant

Ipc: 7G 10L 11/02 A

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT;WARNING: LAPSES OF ITALIAN PATENTS WITH EFFECTIVE DATE BEFORE 2007 MAY HAVE OCCURRED AT ANY TIME BEFORE 2007. THE CORRECT EFFECTIVE DATE MAY BE DIFFERENT FROM THE ONE RECORDED.

Effective date: 20031015

Ref country code: IE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20031015

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20031015

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20031015

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

Free format text: NOT ENGLISH

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

Free format text: GERMAN

REF Corresponds to:

Ref document number: 59809897

Country of ref document: DE

Date of ref document: 20031120

Kind code of ref document: P

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040115

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040115

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20040115

GBT Gb: translation of ep patent filed (gb section 77(6)(a)/1977)

Effective date: 20040123

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040219

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040228

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040229

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040229

REG Reference to a national code

Ref country code: IE

Ref legal event code: FD4D

ET Fr: translation filed
PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed

Effective date: 20040716

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20040315

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 19

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20160218

Year of fee payment: 19

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20160222

Year of fee payment: 19

Ref country code: NL

Payment date: 20160222

Year of fee payment: 19

Ref country code: BE

Payment date: 20160222

Year of fee payment: 19

Ref country code: AT

Payment date: 20160218

Year of fee payment: 19

Ref country code: FR

Payment date: 20160222

Year of fee payment: 19

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170228

REG Reference to a national code

Ref country code: DE

Ref legal event code: R119

Ref document number: 59809897

Country of ref document: DE

REG Reference to a national code

Ref country code: NL

Ref legal event code: MM

Effective date: 20170301

REG Reference to a national code

Ref country code: AT

Ref legal event code: MM01

Ref document number: 252265

Country of ref document: AT

Kind code of ref document: T

Effective date: 20170219

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20170219

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: AT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170219

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: NL

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170301

REG Reference to a national code

Ref country code: FR

Ref legal event code: ST

Effective date: 20171031

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170901

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170228

REG Reference to a national code

Ref country code: BE

Ref legal event code: MM

Effective date: 20170228

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170219