WO2001048737A3 - Speech recognizer with a lexical tree based n-gram language model - Google Patents

Speech recognizer with a lexical tree based n-gram language model Download PDF

Info

Publication number
WO2001048737A3
WO2001048737A3 PCT/CN1999/000217 CN9900217W WO0148737A3 WO 2001048737 A3 WO2001048737 A3 WO 2001048737A3 CN 9900217 W CN9900217 W CN 9900217W WO 0148737 A3 WO0148737 A3 WO 0148737A3
Authority
WO
WIPO (PCT)
Prior art keywords
probabilities
lexical tree
estimated probabilities
stored
phonemes
Prior art date
Application number
PCT/CN1999/000217
Other languages
French (fr)
Other versions
WO2001048737A2 (en
Inventor
Zhiwei Lin
Yonghong Yan
Qingwei Zhao
Baosheng Yuan
Original Assignee
Intel Corp
Zhiwei Lin
Yonghong Yan
Qingwei Zhao
Baosheng Yuan
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Intel Corp, Zhiwei Lin, Yonghong Yan, Qingwei Zhao, Baosheng Yuan filed Critical Intel Corp
Priority to AU17676/00A priority Critical patent/AU1767600A/en
Priority to CN99817058.5A priority patent/CN1201286C/en
Priority to PCT/CN1999/000217 priority patent/WO2001048737A2/en
Publication of WO2001048737A2 publication Critical patent/WO2001048737A2/en
Publication of WO2001048737A3 publication Critical patent/WO2001048737A3/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/197Probabilistic grammars, e.g. word n-grams
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams

Abstract

In some embodiments, the invention includes a method comprising creating a lexical tree and identifying beginning phonemes in the lexical tree. The method of these embodiments further includes estimating probabilities of words in the lexical tree having particular ones of the beginning phonemes and storing at least some of the estimated probabilities, wherein backoff weights are not stored with the estimated probabilities. The estimated probabilities may be stored in a lookup table. In other embodiment, the invention includes a method of receiving phonemes and identifying them on a lexical tree. The method of these embodiments also includes estimating probabilities of words that include the phonemes through use of estimated probabilities retrieved from storage, wherein the retrieve probabilities do not include backoff weights stored with the estimated probabilities. Again, the estimated probabilities may be stored in a lookup table. The estimated probabilities may be used in establishing a pruning threshold. The methods may be implemented by instructions on a computer readable medium.
PCT/CN1999/000217 1999-12-23 1999-12-23 Speech recognizer with a lexical tree based n-gram language model WO2001048737A2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
AU17676/00A AU1767600A (en) 1999-12-23 1999-12-23 Speech recognizer with a lexical tree based n-gram language model
CN99817058.5A CN1201286C (en) 1999-12-23 1999-12-23 Speech recognizer with a lexial tree based N-gram language model
PCT/CN1999/000217 WO2001048737A2 (en) 1999-12-23 1999-12-23 Speech recognizer with a lexical tree based n-gram language model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN1999/000217 WO2001048737A2 (en) 1999-12-23 1999-12-23 Speech recognizer with a lexical tree based n-gram language model

Publications (2)

Publication Number Publication Date
WO2001048737A2 WO2001048737A2 (en) 2001-07-05
WO2001048737A3 true WO2001048737A3 (en) 2002-11-14

Family

ID=4575158

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN1999/000217 WO2001048737A2 (en) 1999-12-23 1999-12-23 Speech recognizer with a lexical tree based n-gram language model

Country Status (3)

Country Link
CN (1) CN1201286C (en)
AU (1) AU1767600A (en)
WO (1) WO2001048737A2 (en)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB0420464D0 (en) * 2004-09-14 2004-10-20 Zentian Ltd A speech recognition circuit and method
CN101271450B (en) * 2007-03-19 2010-09-29 株式会社东芝 Method and device for cutting language model
GB2453366B (en) 2007-10-04 2011-04-06 Toshiba Res Europ Ltd Automatic speech recognition method and apparatus
CN102439540B (en) * 2009-03-19 2015-04-08 谷歌股份有限公司 Input method editor
KR101522375B1 (en) 2009-03-19 2015-05-21 구글 인코포레이티드 Input method editor
US8655647B2 (en) 2010-03-11 2014-02-18 Microsoft Corporation N-gram selection for practical-sized language models
US8589164B1 (en) * 2012-10-18 2013-11-19 Google Inc. Methods and systems for speech recognition processing using search query information
CN111128172B (en) * 2019-12-31 2022-12-16 达闼机器人股份有限公司 Voice recognition method, electronic equipment and storage medium

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0473694A (en) * 1990-07-13 1992-03-09 Nippon Telegr & Teleph Corp <Ntt> Japanese language speech recognizing method
EP0533260A2 (en) * 1991-09-14 1993-03-24 Philips Patentverwaltung GmbH Method and apparatus for recognizing the uttered words in a speech signal
US5502791A (en) * 1992-09-29 1996-03-26 International Business Machines Corporation Speech recognition by concatenating fenonic allophone hidden Markov models in parallel among subwords
JPH08123479A (en) * 1994-10-26 1996-05-17 Atr Onsei Honyaku Tsushin Kenkyusho:Kk Continuous speech recognition device
JPH08221091A (en) * 1995-02-17 1996-08-30 Matsushita Electric Ind Co Ltd Voice recognition device
WO1996027872A1 (en) * 1995-03-07 1996-09-12 British Telecommunications Public Limited Company Speech recognition
US5621859A (en) * 1994-01-19 1997-04-15 Bbn Corporation Single tree method for grammar directed, very large vocabulary speech recognizer
EP0825586A2 (en) * 1996-08-22 1998-02-25 Dragon Systems Inc. Lexical tree pre-filtering in speech recognition
US5758024A (en) * 1996-06-25 1998-05-26 Microsoft Corporation Method and system for encoding pronunciation prefix trees
US5832428A (en) * 1995-10-04 1998-11-03 Apple Computer, Inc. Search engine for phrase recognition based on prefix/body/suffix architecture
CN1233803A (en) * 1998-04-29 1999-11-03 松下电器产业株式会社 Method and apparatus using decision trees to generate and score multiple pronunciations for spelled word
WO1999059141A1 (en) * 1998-05-11 1999-11-18 Siemens Aktiengesellschaft Method and array for introducing temporal correlation in hidden markov models for speech recognition
JPH11344991A (en) * 1998-05-30 1999-12-14 Brother Ind Ltd Voice recognition device and storage medium

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0473694A (en) * 1990-07-13 1992-03-09 Nippon Telegr & Teleph Corp <Ntt> Japanese language speech recognizing method
EP0533260A2 (en) * 1991-09-14 1993-03-24 Philips Patentverwaltung GmbH Method and apparatus for recognizing the uttered words in a speech signal
US5502791A (en) * 1992-09-29 1996-03-26 International Business Machines Corporation Speech recognition by concatenating fenonic allophone hidden Markov models in parallel among subwords
US5621859A (en) * 1994-01-19 1997-04-15 Bbn Corporation Single tree method for grammar directed, very large vocabulary speech recognizer
JPH08123479A (en) * 1994-10-26 1996-05-17 Atr Onsei Honyaku Tsushin Kenkyusho:Kk Continuous speech recognition device
JPH08221091A (en) * 1995-02-17 1996-08-30 Matsushita Electric Ind Co Ltd Voice recognition device
WO1996027872A1 (en) * 1995-03-07 1996-09-12 British Telecommunications Public Limited Company Speech recognition
US5832428A (en) * 1995-10-04 1998-11-03 Apple Computer, Inc. Search engine for phrase recognition based on prefix/body/suffix architecture
US5758024A (en) * 1996-06-25 1998-05-26 Microsoft Corporation Method and system for encoding pronunciation prefix trees
EP0825586A2 (en) * 1996-08-22 1998-02-25 Dragon Systems Inc. Lexical tree pre-filtering in speech recognition
CN1233803A (en) * 1998-04-29 1999-11-03 松下电器产业株式会社 Method and apparatus using decision trees to generate and score multiple pronunciations for spelled word
WO1999059141A1 (en) * 1998-05-11 1999-11-18 Siemens Aktiengesellschaft Method and array for introducing temporal correlation in hidden markov models for speech recognition
JPH11344991A (en) * 1998-05-30 1999-12-14 Brother Ind Ltd Voice recognition device and storage medium

Also Published As

Publication number Publication date
CN1406374A (en) 2003-03-26
WO2001048737A2 (en) 2001-07-05
CN1201286C (en) 2005-05-11
AU1767600A (en) 2001-07-09

Similar Documents

Publication Publication Date Title
AU2001274936A1 (en) Creating a unified task dependent language models with information retrieval techniques
US7711561B2 (en) Speech recognition system and technique
EP1128361A3 (en) Language models for speech recognition
WO2004110030A3 (en) Assistive call center interface
CA2508946A1 (en) Method and apparatus for natural language call routing using confidence scores
WO2008115285A3 (en) Content selection using speech recognition
EP1220197A3 (en) Speech recognition method and system
CA2321112A1 (en) Information retrieval and speech recognition based on language models
EP1538535A3 (en) Determination of meaning for text input in natural language understanding systems
EP2416262A3 (en) Information retrieval based on historical data
EP1083545A3 (en) Voice recognition of proper names in a navigation apparatus
CA2493640A1 (en) Improvements in or relating to information provision for call centres
WO2002046719A3 (en) Cryostorage method and device
WO2007035186A3 (en) A method and system for the automatic recognition of deceptive language
EP1653444A3 (en) System and method for converting text to speech
US20070033025A1 (en) Algorithm for n-best ASR result processing to improve accuracy
IT1279171B1 (en) CONTINUOUS SPEECH RECOGNITION SYSTEM
WO2001048737A3 (en) Speech recognizer with a lexical tree based n-gram language model
WO2001084357A3 (en) Cluster and pruning-based language model compression
EP1471501A3 (en) Speech recognition apparatus, speech recognition method, and recording medium on which speech recognition program is computer-readable recorded
Nocera et al. Phoneme lattice based A* search algorithm for speech recognition
EP0949606A3 (en) Method and system for speech recognition based on phonetic transcriptions
Wang et al. A multi-pass linear fold algorithm for sentence boundary detection using prosodic cues
WO2004072947A3 (en) Speech recognition with soft pruning
EP1321862A3 (en) Hash function based transcription database

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 998170585

Country of ref document: CN

WWE Wipo information: entry into national phase

Ref document number: 09979628

Country of ref document: US

AK Designated states

Kind code of ref document: A3

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase