DE60012655D1 - Audiowiedergabe von einem geschriebenen Dokument aus mehreren Quellen - Google Patents

Audiowiedergabe von einem geschriebenen Dokument aus mehreren Quellen

Info

Publication number
DE60012655D1
DE60012655D1 DE60012655T DE60012655T DE60012655D1 DE 60012655 D1 DE60012655 D1 DE 60012655D1 DE 60012655 T DE60012655 T DE 60012655T DE 60012655 T DE60012655 T DE 60012655T DE 60012655 D1 DE60012655 D1 DE 60012655D1
Authority
DE
Germany
Prior art keywords
text
utility
audio data
playback
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60012655T
Other languages
English (en)
Other versions
DE60012655T2 (de
Inventor
Jeffrey C Reynar
Erik Rucker
Hwan Kim Paul Kyong
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Application granted granted Critical
Publication of DE60012655D1 publication Critical patent/DE60012655D1/de
Publication of DE60012655T2 publication Critical patent/DE60012655T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/221Announcement of recognition results
DE60012655T 1999-10-27 2000-10-17 Audiowiedergabe von einem geschriebenen Dokument aus mehreren Quellen Expired - Lifetime DE60012655T2 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US428259 1989-10-27
US09/428,259 US6446041B1 (en) 1999-10-27 1999-10-27 Method and system for providing audio playback of a multi-source document

Publications (2)

Publication Number Publication Date
DE60012655D1 true DE60012655D1 (de) 2004-09-09
DE60012655T2 DE60012655T2 (de) 2005-07-28

Family

ID=23698146

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60012655T Expired - Lifetime DE60012655T2 (de) 1999-10-27 2000-10-17 Audiowiedergabe von einem geschriebenen Dokument aus mehreren Quellen

Country Status (6)

Country Link
US (1) US6446041B1 (de)
EP (1) EP1096472B1 (de)
JP (1) JP2001188777A (de)
CN (1) CN1140871C (de)
AT (1) ATE272882T1 (de)
DE (1) DE60012655T2 (de)

Families Citing this family (68)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6611802B2 (en) 1999-06-11 2003-08-26 International Business Machines Corporation Method and system for proofreading and correcting dictated text
US6904405B2 (en) * 1999-07-17 2005-06-07 Edwin A. Suominen Message recognition using shared language model
US6687689B1 (en) 2000-06-16 2004-02-03 Nusuara Technologies Sdn. Bhd. System and methods for document retrieval using natural language-based queries
US7653748B2 (en) * 2000-08-10 2010-01-26 Simplexity, Llc Systems, methods and computer program products for integrating advertising within web content
US7383187B2 (en) * 2001-01-24 2008-06-03 Bevocal, Inc. System, method and computer program product for a distributed speech recognition tuning platform
EP1374226B1 (de) * 2001-03-16 2005-07-20 Koninklijke Philips Electronics N.V. Transkriptionsdienst mit abbruch der automatischen transkription
US6996531B2 (en) * 2001-03-30 2006-02-07 Comverse Ltd. Automated database assistance using a telephone for a speech based or text based multimedia communication mode
US7225126B2 (en) * 2001-06-12 2007-05-29 At&T Corp. System and method for processing speech files
US20030046071A1 (en) * 2001-09-06 2003-03-06 International Business Machines Corporation Voice recognition apparatus and method
US7272564B2 (en) * 2002-03-22 2007-09-18 Motorola, Inc. Method and apparatus for multimodal communication with user control of delivery modality
US9165478B2 (en) 2003-04-18 2015-10-20 International Business Machines Corporation System and method to enable blind people to have access to information printed on a physical document
JP4608650B2 (ja) * 2003-05-30 2011-01-12 独立行政法人産業技術総合研究所 既知音響信号除去方法及び装置
WO2004109659A1 (ja) * 2003-06-05 2004-12-16 Kabushiki Kaisha Kenwood 音声合成装置、音声合成方法及びプログラム
US7346506B2 (en) 2003-10-08 2008-03-18 Agfa Inc. System and method for synchronized text display and audio playback
US7424154B2 (en) * 2003-11-10 2008-09-09 Microsoft Corporation Boxed and lined input panel
CN1886726A (zh) * 2003-11-28 2006-12-27 皇家飞利浦电子股份有限公司 转录音频信号的方法和设备
WO2005093553A2 (en) * 2004-03-24 2005-10-06 Robert Harvey Rines Electronic & accoustic reading of printed material
US7412378B2 (en) * 2004-04-01 2008-08-12 International Business Machines Corporation Method and system of dynamically adjusting a speech output rate to match a speech input rate
US20080275700A1 (en) * 2004-05-27 2008-11-06 Koninklijke Philips Electronics, N.V. Method of and System for Modifying Messages
CN100547654C (zh) * 2004-07-21 2009-10-07 松下电器产业株式会社 语音合成装置
US7844464B2 (en) * 2005-07-22 2010-11-30 Multimodal Technologies, Inc. Content-based audio playback emphasis
EP2009637B1 (de) * 2004-09-08 2012-02-29 Panasonic Corporation Auf Trickplaykommando Empfang, Kontrol von Blu-Ray-Applet mit Trickplay-Status und Applet Management Information
US7395204B2 (en) * 2005-03-30 2008-07-01 Motorola, Inc. Methods and apparatus for providing push to talk text data
US7729478B1 (en) * 2005-04-12 2010-06-01 Avaya Inc. Change speed of voicemail playback depending on context
US8015009B2 (en) * 2005-05-04 2011-09-06 Joel Jay Harband Speech derived from text in computer presentation applications
DE102005021526A1 (de) * 2005-05-10 2006-11-23 Siemens Ag Verfahren und Vorrichtung zum Eingeben von Schriftzeichen in eine Datenverarbeitungsanlage
TWI270052B (en) * 2005-08-09 2007-01-01 Delta Electronics Inc System for selecting audio content by using speech recognition and method therefor
CN101110861B (zh) * 2006-07-18 2011-06-22 中兴通讯股份有限公司 一种在智能网中播放文本语音的系统和方法
JP4973664B2 (ja) * 2006-11-24 2012-07-11 富士通株式会社 文書読上げ装置、文書読上げ装置を制御する制御方法及び文書読上げ装置を制御する制御プログラム
US8831948B2 (en) 2008-06-06 2014-09-09 At&T Intellectual Property I, L.P. System and method for synthetically generated speech describing media content
US8121842B2 (en) * 2008-12-12 2012-02-21 Microsoft Corporation Audio output of a document from mobile device
US8498866B2 (en) * 2009-01-15 2013-07-30 K-Nfb Reading Technology, Inc. Systems and methods for multiple language document narration
US8681106B2 (en) 2009-06-07 2014-03-25 Apple Inc. Devices, methods, and graphical user interfaces for accessibility using a touch-sensitive surface
US11416214B2 (en) 2009-12-23 2022-08-16 Google Llc Multi-modal input on an electronic device
EP3091535B1 (de) 2009-12-23 2023-10-11 Google LLC Multimodale eingabe in eine elektronische vorrichtung
FR2956515A1 (fr) * 2010-02-15 2011-08-19 France Telecom Procede de navigation dans un contenu sonore
US8392186B2 (en) 2010-05-18 2013-03-05 K-Nfb Reading Technology, Inc. Audio synchronization for document narration with user-selected playback
US8707195B2 (en) 2010-06-07 2014-04-22 Apple Inc. Devices, methods, and graphical user interfaces for accessibility via a touch-sensitive surface
US20110313762A1 (en) * 2010-06-20 2011-12-22 International Business Machines Corporation Speech output with confidence indication
US9009592B2 (en) * 2010-06-22 2015-04-14 Microsoft Technology Licensing, Llc Population of lists and tasks from captured voice and audio content
US8452600B2 (en) * 2010-08-18 2013-05-28 Apple Inc. Assisted reader
US9953643B2 (en) * 2010-12-23 2018-04-24 Lenovo (Singapore) Pte. Ltd. Selective transmission of voice data
US8352245B1 (en) 2010-12-30 2013-01-08 Google Inc. Adjusting language models
US8296142B2 (en) 2011-01-21 2012-10-23 Google Inc. Speech recognition using dock context
JP5105126B2 (ja) * 2011-04-14 2012-12-19 シャープ株式会社 情報処理装置及び情報処理方法
US8751971B2 (en) 2011-06-05 2014-06-10 Apple Inc. Devices, methods, and graphical user interfaces for providing accessibility using a touch-sensitive surface
WO2013046055A1 (en) * 2011-09-30 2013-04-04 Audionamix Extraction of single-channel time domain component from mixture of coherent information
US10192176B2 (en) 2011-10-11 2019-01-29 Microsoft Technology Licensing, Llc Motivation of task completion and personalization of tasks and lists
CN102945074B (zh) * 2011-10-12 2016-04-27 微软技术许可有限责任公司 根据所捕捉的语音和音频内容来填充列表和任务
CN108014002A (zh) 2011-11-04 2018-05-11 马萨诸塞眼科耳科诊所 自适应视觉辅助装置
US8881269B2 (en) 2012-03-31 2014-11-04 Apple Inc. Device, method, and graphical user interface for integrating recognition of handwriting gestures with a screen reader
JP6045175B2 (ja) * 2012-04-05 2016-12-14 任天堂株式会社 情報処理プログラム、情報処理装置、情報処理方法及び情報処理システム
US9135911B2 (en) * 2014-02-07 2015-09-15 NexGen Flight LLC Automated generation of phonemic lexicon for voice activated cockpit management systems
US9842592B2 (en) 2014-02-12 2017-12-12 Google Inc. Language models using non-linguistic context
US9412365B2 (en) 2014-03-24 2016-08-09 Google Inc. Enhanced maximum entropy models
US10134394B2 (en) 2015-03-20 2018-11-20 Google Llc Speech recognition using log-linear model
GB201516553D0 (en) 2015-09-18 2015-11-04 Microsoft Technology Licensing Llc Inertia audio scrolling
GB201516552D0 (en) * 2015-09-18 2015-11-04 Microsoft Technology Licensing Llc Keyword zoom
US9886433B2 (en) * 2015-10-13 2018-02-06 Lenovo (Singapore) Pte. Ltd. Detecting logograms using multiple inputs
US9978367B2 (en) 2016-03-16 2018-05-22 Google Llc Determining dialog states for language models
US10832664B2 (en) 2016-08-19 2020-11-10 Google Llc Automated speech recognition using language models that selectively use domain-specific model components
US10140973B1 (en) * 2016-09-15 2018-11-27 Amazon Technologies, Inc. Text-to-speech processing using previously speech processed data
US10311860B2 (en) 2017-02-14 2019-06-04 Google Llc Language model biasing system
US10592203B2 (en) 2017-12-18 2020-03-17 Mitel Networks Corporation Device including a digital assistant for personalized speech playback and method of using same
US10671251B2 (en) 2017-12-22 2020-06-02 Arbordale Publishing, LLC Interactive eReader interface generation based on synchronization of textual and audial descriptors
US11443646B2 (en) 2017-12-22 2022-09-13 Fathom Technologies, LLC E-Reader interface system with audio and highlighting synchronization for digital books
DE102018213602B3 (de) 2018-08-13 2019-10-31 Audi Ag Verfahren zum Erzeugen einer Sprachansage als Rückmeldung zu einer handschriftlichen Nutzereingabe sowie entsprechende Bedienvorrichtung und Kraftfahrzeug
US11423073B2 (en) * 2018-11-16 2022-08-23 Microsoft Technology Licensing, Llc System and management of semantic indicators during document presentations

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5664060A (en) * 1994-01-25 1997-09-02 Information Storage Devices Message management methods and apparatus
US5960447A (en) * 1995-11-13 1999-09-28 Holt; Douglas Word tagging and editing system for speech recognition
GB2303955B (en) * 1996-09-24 1997-05-14 Allvoice Computing Plc Data processing method and apparatus
US6023678A (en) * 1998-03-27 2000-02-08 International Business Machines Corporation Using TTS to fill in for missing dictation audio
US6078885A (en) * 1998-05-08 2000-06-20 At&T Corp Verbal, fully automatic dictionary updates by end-users of speech synthesis and recognition systems
US6199042B1 (en) * 1998-06-19 2001-03-06 L&H Applications Usa, Inc. Reading system
US6151576A (en) * 1998-08-11 2000-11-21 Adobe Systems Incorporated Mixing digitized speech and text using reliability indices
US6064965A (en) * 1998-09-02 2000-05-16 International Business Machines Corporation Combined audio playback in speech recognition proofreader

Also Published As

Publication number Publication date
DE60012655T2 (de) 2005-07-28
CN1140871C (zh) 2004-03-03
EP1096472A2 (de) 2001-05-02
EP1096472A3 (de) 2001-09-12
JP2001188777A (ja) 2001-07-10
EP1096472B1 (de) 2004-08-04
ATE272882T1 (de) 2004-08-15
US6446041B1 (en) 2002-09-03
CN1303047A (zh) 2001-07-11

Similar Documents

Publication Publication Date Title
DE60012655D1 (de) Audiowiedergabe von einem geschriebenen Dokument aus mehreren Quellen
JP2003241644A (ja) 外国語会話学習法及び外国語会話学習装置
Stopar Mamma Mia, A Singable Translation!
Brainerd The contractions of not: A historical note
Erçin From-ness: The identity of the practitioner in the laboratory
CN101556796A (zh) 汉字发音资料库生成系统及其方法
Siska THE FIGURATIVE LANGUAGE IN DARK HORSE SONG LYRICS BY KATY PARRY
Wallace Improvisation and Literature: a brief guide
Wheeler Undead Eliot: How" The Waste Land" Sounds Now
HALLIDAY CHAPTER TWELVE EDGAR ALLAN POE’S WORDS AS MUSICAL INSPIRATION IAIN HALLIDAY AND MARIATERESA FRANZA
Smith et al. The Acoustic World of Early Modern England: Attending to the O-Factor
Erwan The Term Umarmaye/Base Lampaq for the Obstacle of Sasak Dialect Standardization
Walters Performing Peribáñez y el Comendador de Ocaña: Music as an Agent of Harmony
Hough Why Musicians Need Silence in an Always‐Connected World
Cho Four new song settings of the poems by Dongjoo Yoon
Bell The Wonder of Christmas
Komara Washington Phillips and his Manzarene Dreams
Wilson Pronunciation Issues Within Twentieth Century French Music
Butzmann That's Comish Music! Mutant Sounds
RAWSON GIOVANNI BONONCINI (1670–1747) SAN NICOLA DI BARI Lavinia Bertotti (soprano), Elena Cecchi Fedi (soprano), Gabriella Martellacci (alto), Furio Zanasi (bass)/Les Muffatti/Peter Van Heyghen Ramée RAM 0806, 2008; one disc, 82 minutes
Chisholm Singing Shakespeare's Words
Heathers The Worst of All Possible Worlds: Schopenhauer Meets Sabbath
Nelson Book Review: Bruce R. Smith. The Acoustic World of Early Modern England: Attending to the O-Factor. Chicago: University of Chicago Press, 1999.
McClure Observations of European Orchestras
Bronson “That Wicked Arsonist”: Writing the musical interlude

Legal Events

Date Code Title Description
8364 No opposition during term of opposition