WO2008064137A3 - Predictive speech-to-text input - Google Patents

Predictive speech-to-text input Download PDF

Info

Publication number
WO2008064137A3
WO2008064137A3 PCT/US2007/085031 US2007085031W WO2008064137A3 WO 2008064137 A3 WO2008064137 A3 WO 2008064137A3 US 2007085031 W US2007085031 W US 2007085031W WO 2008064137 A3 WO2008064137 A3 WO 2008064137A3
Authority
WO
WIPO (PCT)
Prior art keywords
text
spelling
text input
word
given
Prior art date
Application number
PCT/US2007/085031
Other languages
French (fr)
Other versions
WO2008064137A2 (en
Inventor
Ashwin P Rao
Original Assignee
Ashwin P Rao
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ashwin P Rao filed Critical Ashwin P Rao
Priority claimed from US11/941,910 external-priority patent/US7904298B2/en
Publication of WO2008064137A2 publication Critical patent/WO2008064137A2/en
Publication of WO2008064137A3 publication Critical patent/WO2008064137A3/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/228Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context

Abstract

This disclosure describes a practical system/method for predicting spoken text (105) given that text's partial spelling (example, initial characters forming the spelling of a word/sentence). The partial spelling may be given using 'Speech' (107) or may be inputted using the keyboard/keypad (101) or may be obtained using other input methods (102). The disclosed system is an alternative method for inputting text into devices; the method is faster (especially for long words or phrases) compared to existing predictive-text-input and/or word-completion methods.
PCT/US2007/085031 2006-11-17 2007-11-16 Predictive speech-to-text input WO2008064137A2 (en)

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US85958906P 2006-11-17 2006-11-17
US60/859,589 2006-11-17
US99956507P 2007-10-19 2007-10-19
US60/999,565 2007-10-19
US11/941,910 2007-11-16
US11/941,910 US7904298B2 (en) 2006-11-17 2007-11-16 Predictive speech-to-text input

Publications (2)

Publication Number Publication Date
WO2008064137A2 WO2008064137A2 (en) 2008-05-29
WO2008064137A3 true WO2008064137A3 (en) 2008-11-06

Family

ID=39430528

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2007/085031 WO2008064137A2 (en) 2006-11-17 2007-11-16 Predictive speech-to-text input

Country Status (1)

Country Link
WO (1) WO2008064137A2 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102009423B1 (en) * 2012-10-08 2019-08-09 삼성전자주식회사 Method and apparatus for action of preset performance mode using voice recognition

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5293452A (en) * 1991-07-01 1994-03-08 Texas Instruments Incorporated Voice log-in using spoken name input
US5956683A (en) * 1993-12-22 1999-09-21 Qualcomm Incorporated Distributed voice recognition system
US6223150B1 (en) * 1999-01-29 2001-04-24 Sony Corporation Method and apparatus for parsing in a spoken language translation system
US20040172258A1 (en) * 2002-12-10 2004-09-02 Dominach Richard F. Techniques for disambiguating speech input using multimodal interfaces

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5293452A (en) * 1991-07-01 1994-03-08 Texas Instruments Incorporated Voice log-in using spoken name input
US5956683A (en) * 1993-12-22 1999-09-21 Qualcomm Incorporated Distributed voice recognition system
US6223150B1 (en) * 1999-01-29 2001-04-24 Sony Corporation Method and apparatus for parsing in a spoken language translation system
US20040172258A1 (en) * 2002-12-10 2004-09-02 Dominach Richard F. Techniques for disambiguating speech input using multimodal interfaces

Also Published As

Publication number Publication date
WO2008064137A2 (en) 2008-05-29

Similar Documents

Publication Publication Date Title
Seyfarth Word informativity influences acoustic duration: Effects of contextual predictability on lexical representation
TW200638337A (en) Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system
WO2006086511A3 (en) Method and apparatus utilizing voice input to resolve ambiguous manually entered text input
Karpov et al. Large vocabulary Russian speech recognition using syntactico-statistical language modeling
WO2009026270A3 (en) Hmm-based bilingual (mandarin-english) tts techniques
WO2007118100A3 (en) Automatic language model update
TW200707404A (en) Speech recognition assisted autocompletion of composite characters
WO2007118020A3 (en) Method and system for managing pronunciation dictionaries in a speech application
WO2005077098A3 (en) Handwriting and voice input with automatic correction
WO2009025356A1 (en) Voice recognition device and voice recognition method
Yuan et al. Pauses and pause fillers in Mandarin monologue speech: The effects of sex and proficiency
Kipyatkova et al. Lexicon size and language model order optimization for Russian LVCSR
WO2008064137A3 (en) Predictive speech-to-text input
Wutiwiwatchai et al. Thai ASR development for network-based speech translation
Li et al. Language modeling for mixed language speech recognition using weighted phrase extraction.
Yang et al. Modeling pronunciation variations for non-native speech recognition of Korean produced by Chinese learners.
KR20050101695A (en) A system for statistical speech recognition using recognition results, and method thereof
Al-Haj et al. Pronunciation modeling for dialectal Arabic speech recognition
Lim et al. Towards an interactive voice agent for Singapore Hokkien
KR20050101694A (en) A system for statistical speech recognition with grammatical constraints, and method thereof
Nouza et al. Czech-to-slovak adapted broadcast news transcription system.
Masmoudi et al. Conditional Random Fields for the Tunisian Dialect Grapheme-to-Phoneme Conversion.
Chung A Study on the Rhythm of Korean English Learners' Interlanguage Talk
Streefkerk Prominence
Zahra et al. Building a pronunciation dictionary for Indonesian speech recognition system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07864569

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 07864569

Country of ref document: EP

Kind code of ref document: A2