WO2008064137A3 - Predictive speech-to-text input - Google Patents
Predictive speech-to-text input
- Publication number
- WO2008064137A3 (PCT/US2007/085031)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- text
- spelling
- text input
- word
- given
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
Abstract
This disclosure describes a practical system and method for predicting spoken text (105) from a partial spelling of that text (for example, the initial characters of a word or sentence). The partial spelling may be given using 'Speech' (107), entered on a keyboard or keypad (101), or obtained through other input methods (102). The disclosed system is an alternative way of entering text into devices; the method is described as faster than existing predictive-text-input and/or word-completion methods, especially for long words or phrases.
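The idea in the abstract can be illustrated with a minimal sketch. It is not the patented method: it simply assumes that a recognizer has already assigned a score to each vocabulary word for one spoken utterance, and that the partial spelling is used to discard candidates that do not start with the typed or spoken letters. The `Candidate` type, the function name `predict_from_partial_spelling`, and all scores are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    word: str
    acoustic_score: float  # hypothetical recognizer score; higher is better


def predict_from_partial_spelling(prefix, scored_vocabulary, top_n=3):
    """Return up to top_n words that start with prefix, best score first."""
    prefix = prefix.lower()
    # Keep only the candidates consistent with the partial spelling.
    matches = [c for c in scored_vocabulary if c.word.lower().startswith(prefix)]
    # Rank the survivors by score; break ties alphabetically for stable output.
    matches.sort(key=lambda c: (-c.acoustic_score, c.word))
    return [c.word for c in matches[:top_n]]


# Hypothetical scores a recognizer might assign to one spoken word.
scored_vocabulary = [
    Candidate("international", 0.62),
    Candidate("interesting", 0.21),
    Candidate("intersection", 0.09),
    Candidate("national", 0.55),
]

print(predict_from_partial_spelling("int", scored_vocabulary))
```

In this sketch the prefix "int" removes "national" even though its assumed score is high, which is the kind of constraint a partial spelling could place on the recognizer's candidates.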
Applications Claiming Priority (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US85958906P | 2006-11-17 | 2006-11-17 | |
US60/859,589 | 2006-11-17 | | |
US99956507P | 2007-10-19 | 2007-10-19 | |
US60/999,565 | 2007-10-19 | | |
US11/941,910 | 2007-11-16 | | |
US11/941,910 US7904298B2 (en) | 2006-11-17 | 2007-11-16 | Predictive speech-to-text input |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2008064137A2 (en) | 2008-05-29 |
WO2008064137A3 (en) | 2008-11-06 |
Family
ID=39430528
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2007/085031 WO2008064137A2 (en) | Predictive speech-to-text input | 2006-11-17 | 2007-11-16 |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2008064137A2 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102009423B1 (en) * | 2012-10-08 | 2019-08-09 | 삼성전자주식회사 | Method and apparatus for action of preset performance mode using voice recognition |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5293452A (en) * | 1991-07-01 | 1994-03-08 | Texas Instruments Incorporated | Voice log-in using spoken name input |
US5956683A (en) * | 1993-12-22 | 1999-09-21 | Qualcomm Incorporated | Distributed voice recognition system |
US6223150B1 (en) * | 1999-01-29 | 2001-04-24 | Sony Corporation | Method and apparatus for parsing in a spoken language translation system |
US20040172258A1 (en) * | 2002-12-10 | 2004-09-02 | Dominach Richard F. | Techniques for disambiguating speech input using multimodal interfaces |
- 2007-11-16: WO application PCT/US2007/085031 (WO2008064137A2), status: active, Application Filing
Also Published As
Publication number | Publication date |
---|---|
WO2008064137A2 (en) | 2008-05-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Seyfarth | Word informativity influences acoustic duration: Effects of contextual predictability on lexical representation | |
TW200638337A (en) | Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system | |
WO2006086511A3 (en) | Method and apparatus utilizing voice input to resolve ambiguous manually entered text input | |
Karpov et al. | Large vocabulary Russian speech recognition using syntactico-statistical language modeling | |
WO2009026270A3 (en) | Hmm-based bilingual (mandarin-english) tts techniques | |
WO2007118100A3 (en) | Automatic language model update | |
TW200707404A (en) | Speech recognition assisted autocompletion of composite characters | |
WO2007118020A3 (en) | Method and system for managing pronunciation dictionaries in a speech application | |
WO2005077098A3 (en) | Handwriting and voice input with automatic correction | |
WO2009025356A1 (en) | Voice recognition device and voice recognition method | |
Yuan et al. | Pauses and pause fillers in Mandarin monologue speech: The effects of sex and proficiency | |
Kipyatkova et al. | Lexicon size and language model order optimization for Russian LVCSR | |
WO2008064137A3 (en) | Predictive speech-to-text input | |
Wutiwiwatchai et al. | Thai ASR development for network-based speech translation | |
Li et al. | Language modeling for mixed language speech recognition using weighted phrase extraction. | |
Yang et al. | Modeling pronunciation variations for non-native speech recognition of Korean produced by Chinese learners. | |
KR20050101695A (en) | A system for statistical speech recognition using recognition results, and method thereof | |
Al-Haj et al. | Pronunciation modeling for dialectal Arabic speech recognition | |
Lim et al. | Towards an interactive voice agent for Singapore Hokkien | |
KR20050101694A (en) | A system for statistical speech recognition with grammatical constraints, and method thereof | |
Nouza et al. | Czech-to-slovak adapted broadcast news transcription system. | |
Masmoudi et al. | Conditional Random Fields for the Tunisian Dialect Grapheme-to-Phoneme Conversion. | |
Chung | A Study on the Rhythm of Korean English Learners' Interlanguage Talk | |
Streefkerk | Prominence | |
Zahra et al. | Building a pronunciation dictionary for Indonesian speech recognition system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 07864569; Country of ref document: EP; Kind code of ref document: A2 |
| NENP | Non-entry into the national phase | Ref country code: DE |
| 122 | Ep: pct application non-entry in european phase | Ref document number: 07864569; Country of ref document: EP; Kind code of ref document: A2 |