WO2004070560A3 - Reduced unit database generation based on cost information - Google Patents
Reduced unit database generation based on cost information
- Publication number: WO2004070560A3
- Application number: PCT/US2004/002784
- Authority: WO, WIPO (PCT)
- Prior art keywords: unit database, reduced unit, database, cost information, generation based
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
Abstract
An arrangement is provided for generating a reduced unit database of a desired size to be used in text to speech operations. A reduced unit database with a desired size is generated based on a full unit database. The reduction is carried out with respect to a text database with a plurality of sentences. Units from the full database are pruned to minimize an overall cost associated with using alternative units other than the units in the reduced unit database.
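The abstract does not say how units are chosen for pruning, so the following is only a minimal sketch of one way the idea could be realized: a greedy backward elimination that repeatedly drops the unit whose removal raises the overall substitution cost over the text database the least. The `Unit` type, the single numeric `feature`, the `substitution_cost` function, and the rule that the last unit of a label is never removed are all illustrative assumptions, not details taken from the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Unit:
    uid: str        # hypothetical unit identifier
    label: str      # unit type, e.g. a phoneme label (assumption)
    feature: float  # stand-in for real acoustic/prosodic features (assumption)


def substitution_cost(wanted: Unit, used: Unit) -> float:
    """Hypothetical cost of playing `used` where `wanted` was selected."""
    return abs(wanted.feature - used.feature)


def overall_cost(sentences, kept):
    """Sum, over every unit occurrence in the text database, the cheapest
    substitution available among the kept units with the same label."""
    total = 0.0
    for sentence in sentences:
        for wanted in sentence:
            total += min(
                substitution_cost(wanted, u)
                for u in kept
                if u.label == wanted.label
            )
    return total


def reduce_database(full_units, sentences, desired_size):
    """Greedily prune units until `desired_size` remain (a heuristic,
    not guaranteed to find the globally cheapest reduced database)."""
    kept = list(full_units)
    while len(kept) > desired_size:
        best = None  # (cost after removal, unit to remove)
        for cand in kept:
            # Assumption: never drop the last unit carrying a given label,
            # otherwise some text could no longer be synthesized at all.
            if sum(u.label == cand.label for u in kept) == 1:
                continue
            remaining = [u for u in kept if u is not cand]
            cost = overall_cost(sentences, remaining)
            if best is None or cost < best[0]:
                best = (cost, cand)
        if best is None:  # nothing can be removed safely
            break
        kept.remove(best[1])
    return kept
```

Each sentence here is simply the list of units the full database would have supplied for it. Recomputing the overall cost for every candidate is quadratic in the database size, which keeps the sketch short but would be too slow for a real unit inventory.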
Applications Claiming Priority (2)
Application Number | Publication Number | Priority Date | Filing Date | Title |
---|---|---|---|---|
US10/355,143 | US6988069B2 (en) | 2003-01-31 | 2003-01-31 | Reduced unit database generation based on cost information |
US10/355,143 | | 2003-01-31 | | |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2004070560A2 (en) | 2004-08-19 |
WO2004070560A3 (en) | 2004-12-16 |
Family
ID=32770475
Family Applications (1)
Application Number | Publication Number | Priority Date | Filing Date | Title |
---|---|---|---|---|
PCT/US2004/002784 | WO2004070560A2 (en) | 2003-01-31 | 2004-01-30 | Reduced unit database generation based on cost information |
Country Status (2)
Country | Link |
---|---|
US (1) | US6988069B2 (en) |
WO (1) | WO2004070560A2 (en) |
Families Citing this family (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7082396B1 (en) * | 1999-04-30 | 2006-07-25 | At&T Corp | Methods and apparatus for rapid acoustic unit selection from a large speech corpus |
US7369994B1 (en) | 1999-04-30 | 2008-05-06 | At&T Corp. | Methods and apparatus for rapid acoustic unit selection from a large speech corpus |
US20070121939A1 (en) * | 2004-01-13 | 2007-05-31 | Interdigital Technology Corporation | Watermarks for wireless communications |
US7869999B2 (en) * | 2004-08-11 | 2011-01-11 | Nuance Communications, Inc. | Systems and methods for selecting from multiple phonectic transcriptions for text-to-speech synthesis |
WO2006050238A1 (en) * | 2004-10-28 | 2006-05-11 | Voice Signal Technologies, Inc. | Codec-dependent unit selection for mobile devices |
US7904723B2 (en) * | 2005-01-12 | 2011-03-08 | Interdigital Technology Corporation | Method and apparatus for enhancing security of wireless communications |
JP4586615B2 (en) * | 2005-04-11 | 2010-11-24 | 沖電気工業株式会社 | Speech synthesis apparatus, speech synthesis method, and computer program |
US7693716B1 (en) * | 2005-09-27 | 2010-04-06 | At&T Intellectual Property Ii, L.P. | System and method of developing a TTS voice |
US7630898B1 (en) | 2005-09-27 | 2009-12-08 | At&T Intellectual Property Ii, L.P. | System and method for preparing a pronunciation dictionary for a text-to-speech voice |
US7711562B1 (en) * | 2005-09-27 | 2010-05-04 | At&T Intellectual Property Ii, L.P. | System and method for testing a TTS voice |
US7742921B1 (en) | 2005-09-27 | 2010-06-22 | At&T Intellectual Property Ii, L.P. | System and method for correcting errors when generating a TTS voice |
US7742919B1 (en) | 2005-09-27 | 2010-06-22 | At&T Intellectual Property Ii, L.P. | System and method for repairing a TTS voice database |
US20080183474A1 (en) * | 2007-01-30 | 2008-07-31 | Damion Alexander Bethune | Process for creating and administrating tests made from zero or more picture files, sound bites on handheld device |
US8027835B2 (en) * | 2007-07-11 | 2011-09-27 | Canon Kabushiki Kaisha | Speech processing apparatus having a speech synthesis unit that performs speech synthesis while selectively changing recorded-speech-playback and text-to-speech and method |
JP5238205B2 (en) * | 2007-09-07 | 2013-07-17 | ニュアンス コミュニケーションズ,インコーポレイテッド | Speech synthesis system, program and method |
JP5446873B2 (en) * | 2007-11-28 | 2014-03-19 | 日本電気株式会社 | Speech synthesis apparatus, speech synthesis method, and speech synthesis program |
US8160919B2 (en) * | 2008-03-21 | 2012-04-17 | Unwired Nation | System and method of distributing audio content |
US8536976B2 (en) | 2008-06-11 | 2013-09-17 | Veritrix, Inc. | Single-channel multi-factor authentication |
US8166297B2 (en) | 2008-07-02 | 2012-04-24 | Veritrix, Inc. | Systems and methods for controlling access to encrypted data stored on a mobile device |
WO2010051342A1 (en) * | 2008-11-03 | 2010-05-06 | Veritrix, Inc. | User authentication for social networks |
US8798998B2 (en) * | 2010-04-05 | 2014-08-05 | Microsoft Corporation | Pre-saved data compression for TTS concatenation cost |
US8731931B2 (en) * | 2010-06-18 | 2014-05-20 | At&T Intellectual Property I, L.P. | System and method for unit selection text-to-speech using a modified Viterbi approach |
US8751236B1 (en) | 2013-10-23 | 2014-06-10 | Google Inc. | Devices and methods for speech unit reduction in text-to-speech synthesis systems |
US9520123B2 (en) * | 2015-03-19 | 2016-12-13 | Nuance Communications, Inc. | System and method for pruning redundant units in a speech synthesis process |
US10353863B1 (en) | 2018-04-11 | 2019-07-16 | Capital One Services, Llc | Utilizing machine learning to determine data storage pruning parameters |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020143543A1 (en) * | 2001-03-30 | 2002-10-03 | Sudheer Sirivara | Compressing & using a concatenative speech database in text-to-speech systems |
US20030212555A1 (en) * | 2002-05-09 | 2003-11-13 | Oregon Health & Science | System and method for compressing concatenative acoustic inventories for speech synthesis |
US20030229494A1 (en) * | 2002-04-17 | 2003-12-11 | Peter Rutten | Method and apparatus for sculpting synthesized speech |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6366883B1 (en) | 1996-05-15 | 2002-04-02 | Atr Interpreting Telecommunications | Concatenation of speech segments by use of a speech synthesizer |
US6173263B1 (en) | 1998-08-31 | 2001-01-09 | At&T Corp. | Method and system for performing concatenative speech synthesis using half-phonemes |
EP1138038B1 (en) | 1998-11-13 | 2005-06-22 | Lernout & Hauspie Speech Products N.V. | Speech synthesis using concatenation of speech waveforms |
US6260016B1 (en) | 1998-11-25 | 2001-07-10 | Matsushita Electric Industrial Co., Ltd. | Speech synthesis employing prosody templates |
- 2003-01-31: US, US10/355,143, patent US6988069B2 (en), not active, Expired - Lifetime
- 2004-01-30: WO, PCT/US2004/002784, patent WO2004070560A2 (en), active, Application Filing
Non-Patent Citations (4)
Title |
---|
CONKIE A. ET AL: "Preselection of Candidate Units in a Unit Selection-Based Text-To-Speech Synthesis System", SIXTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING (ICSLP 2000), vol. 3, October 2000 (2000-10-01), pages 314 - 317, XP002971946 * |
DONOVAN R.E.: "Segment pre-selection in decision-tree based speech synthesis systems", 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, vol. 2, June 2000 (2000-06-01), pages 937 - 940, XP010504878 * |
HON ET AL: "Automatic generation of synthesis units for trainable text-to-speech systems", PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING. ICASSP '98, May 1998 (1998-05-01), pages 293 - 296, XP010279159 * |
YI ET AL: "Information-Theoretic Criteria for Unit Selection Synthesis", PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, 2002, pages 2617 - 2620, XP002982190 * |
Also Published As
Publication number | Publication date |
---|---|
US20040153324A1 (en) | 2004-08-05 |
US6988069B2 (en) | 2006-01-17 |
WO2004070560A2 (en) | 2004-08-19 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
WO2004070560A3 (en) | Reduced unit database generation based on cost information | |
WO2004070701A3 (en) | Linguistic prosodic model-based text to speech | |
WO2005074630A3 (en) | Multilingual text-to-speech system with limited resources | |
ATE374991T1 (en) | METHOD AND SYSTEM FOR TEXT-TO-SPEECH CONVERSION | |
JP2004287444A5 (en) | | |
WO2007044568A3 (en) | Generating words and names using n-grams of phonemes | |
EP1544746A3 (en) | Creation of normalized summaries using common domain models for input text analysis and output text generation | |
WO2004003688A3 (en) | A method for comparing a transcribed text file with a previously created file | |
ATE484029T1 (en) | TRANSLATION PROCEDURE FOR HIGHLIGHTED WORDS | |
WO2007027410A3 (en) | Information synthesis engine | |
WO2004097791A3 (en) | Methods and systems for creating a second generation session file | |
MXPA05007544A (en) | Device and method for voicing phonemes, and keyboard for use in such a device. | |
GB2451371A (en) | Method and systems for correcting transcribed audio files | |
ATE404967T1 (en) | TEXT-TO-SPEECH SYSTEM AND METHOD, COMPUTER PROGRAM THEREOF | |
WO2001001373A3 (en) | Electronic book with voice synthesis and recognition | |
WO2003098486A3 (en) | Methods and systems for providing supplemental contextual content | |
WO2003071393A3 (en) | Linguistic support for a regognizer of mathematical expressions | |
WO2008142836A1 (en) | Voice tone converting device and voice tone converting method | |
WO2007005884A3 (en) | Generating chinese language couplets | |
WO2006107586A3 (en) | Method and system for interpreting verbal inputs in a multimodal dialog system | |
WO2001033409A3 (en) | Computer generated poetry system | |
ATE537499T1 (en) | MULTIMEDIA CONSOLE WITH ALPHANUMERIC KEYBOARD AND MUSIC KEYBED | |
WO2007002652A3 (en) | Translating expressions in a computing environment | |
CA2694317A1 (en) | Apparatus, systems and methods for language instruction | |
TW200707239A (en) | E-mail assisted and text-to-sound system | |
Legal Events
Code | Title | Description |
---|---|---|
AK | Designated states | Kind code of ref document: A2. Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
AL | Designated countries for regional patents | Kind code of ref document: A2. Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
121 | EP: the EPO has been informed by WIPO that EP was designated in this application | |
DPEN | Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed from 20040101) | |
122 | EP: PCT application non-entry in European phase | |