WO2004070560A3 - Reduced unit database generation based on cost information - Google Patents

Reduced unit database generation based on cost information

Info

Publication number
WO2004070560A3
Authority
WO
WIPO (PCT)
Prior art keywords
unit database
reduced unit
database
cost information
generation based
Prior art date
Application number
PCT/US2004/002784
Other languages
French (fr)
Other versions
WO2004070560A2 (en)
Inventor
Michael Stuart Phillips
Original Assignee
Scansoft Inc
Michael Stuart Phillips
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2003-01-31
Filing date
2004-01-30
Publication date
2004-12-16
Application filed by Scansoft Inc, Michael Stuart Phillips
Publication of WO2004070560A2
Publication of WO2004070560A3

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04 Details of speech synthesis systems, e.g. synthesiser structure or memory management

Abstract

An arrangement is provided for generating a reduced unit database of a desired size to be used in text to speech operations. A reduced unit database with a desired size is generated based on a full unit database. The reduction is carried out with respect to a text database with a plurality of sentences. Units from the full database are pruned to minimize an overall cost associated with using alternative units other than the units in the reduced unit database.
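The abstract does not say how the pruning is carried out, so the following is only a minimal sketch of one way to read it, not the patented method. It assumes a greedy strategy: repeatedly drop the unit whose removal raises the overall substitution cost over the text database the least, until the desired size is reached. The names `substitution_cost`, `overall_cost`, and `prune_units`, the representation of a unit as a (label, pitch) pair, and the absolute pitch difference used as the cost are all hypothetical choices made for illustration.

```python
import math
from typing import Dict, List, Set, Tuple

# Hypothetical unit representation: (phonetic label, scalar pitch feature).
Unit = Tuple[str, float]


def substitution_cost(wanted: Unit, alternative: Unit) -> float:
    """Assumed cost of using `alternative` in place of `wanted`."""
    if wanted[0] != alternative[0]:
        return math.inf  # units with different labels are not interchangeable
    return abs(wanted[1] - alternative[1])


def overall_cost(sentences: List[List[str]], units: Dict[str, Unit],
                 kept: Set[str]) -> float:
    """Sum, over the text database, of the cheapest substitution for each
    unit occurrence that is no longer in the reduced database."""
    total = 0.0
    for sentence in sentences:
        for unit_id in sentence:
            if unit_id in kept:
                continue  # unit still available, no substitution needed
            total += min(
                (substitution_cost(units[unit_id], units[k]) for k in kept),
                default=math.inf,
            )
    return total


def prune_units(units: Dict[str, Unit], sentences: List[List[str]],
                desired_size: int) -> Set[str]:
    """Greedily remove units until only `desired_size` remain."""
    kept = set(units)
    while len(kept) > desired_size:
        # sorted() makes tie-breaking deterministic.
        victim = min(sorted(kept),
                     key=lambda u: overall_cost(sentences, units, kept - {u}))
        kept.remove(victim)
    return kept


# Toy full unit database and text database (unit ids chosen per sentence).
full_units = {
    "a1": ("a", 100.0), "a2": ("a", 104.0), "a3": ("a", 150.0),
    "t1": ("t", 50.0), "t2": ("t", 52.0),
}
text_db = [["t1", "a1"], ["t2", "a2"], ["t1", "a3"], ["t1", "a1"]]

reduced = prune_units(full_units, text_db, desired_size=3)
print(sorted(reduced), overall_cost(text_db, full_units, reduced))
```

On this toy data the sketch keeps `a1`, `a3`, and `t1`: the two removed units each have a close same-label neighbour, so substituting for them is cheap. A greedy pass is not guaranteed to find the globally minimal cost; it is used here only because it is short and easy to follow.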
PCT/US2004/002784 2003-01-31 2004-01-30 Reduced unit database generation based on cost information WO2004070560A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/355,143 US6988069B2 (en) 2003-01-31 2003-01-31 Reduced unit database generation based on cost information
US10/355,143 2003-01-31

Publications (2)

Publication Number Publication Date
WO2004070560A2 (en) 2004-08-19
WO2004070560A3 (en) 2004-12-16

Family

ID=32770475

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2004/002784 WO2004070560A2 (en) 2003-01-31 2004-01-30 Reduced unit database generation based on cost information

Country Status (2)

Country Link
US (1) US6988069B2 (en)
WO (1) WO2004070560A2 (en)

Families Citing this family (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7082396B1 (en) * 1999-04-30 2006-07-25 At&T Corp Methods and apparatus for rapid acoustic unit selection from a large speech corpus
US7369994B1 (en) 1999-04-30 2008-05-06 At&T Corp. Methods and apparatus for rapid acoustic unit selection from a large speech corpus
US20070121939A1 (en) * 2004-01-13 2007-05-31 Interdigital Technology Corporation Watermarks for wireless communications
US7869999B2 (en) * 2004-08-11 2011-01-11 Nuance Communications, Inc. Systems and methods for selecting from multiple phonectic transcriptions for text-to-speech synthesis
WO2006050238A1 (en) * 2004-10-28 2006-05-11 Voice Signal Technologies, Inc. Codec-dependent unit selection for mobile devices
US7904723B2 (en) * 2005-01-12 2011-03-08 Interdigital Technology Corporation Method and apparatus for enhancing security of wireless communications
JP4586615B2 (en) * 2005-04-11 2010-11-24 沖電気工業株式会社 Speech synthesis apparatus, speech synthesis method, and computer program
US7693716B1 (en) * 2005-09-27 2010-04-06 At&T Intellectual Property Ii, L.P. System and method of developing a TTS voice
US7630898B1 (en) 2005-09-27 2009-12-08 At&T Intellectual Property Ii, L.P. System and method for preparing a pronunciation dictionary for a text-to-speech voice
US7711562B1 (en) * 2005-09-27 2010-05-04 At&T Intellectual Property Ii, L.P. System and method for testing a TTS voice
US7742921B1 (en) 2005-09-27 2010-06-22 At&T Intellectual Property Ii, L.P. System and method for correcting errors when generating a TTS voice
US7742919B1 (en) 2005-09-27 2010-06-22 At&T Intellectual Property Ii, L.P. System and method for repairing a TTS voice database
US20080183474A1 (en) * 2007-01-30 2008-07-31 Damion Alexander Bethune Process for creating and administrating tests made from zero or more picture files, sound bites on handheld device
US8027835B2 (en) * 2007-07-11 2011-09-27 Canon Kabushiki Kaisha Speech processing apparatus having a speech synthesis unit that performs speech synthesis while selectively changing recorded-speech-playback and text-to-speech and method
JP5238205B2 (en) * 2007-09-07 2013-07-17 ニュアンス コミュニケーションズ,インコーポレイテッド Speech synthesis system, program and method
JP5446873B2 (en) * 2007-11-28 2014-03-19 日本電気株式会社 Speech synthesis apparatus, speech synthesis method, and speech synthesis program
US8160919B2 (en) * 2008-03-21 2012-04-17 Unwired Nation System and method of distributing audio content
US8536976B2 (en) 2008-06-11 2013-09-17 Veritrix, Inc. Single-channel multi-factor authentication
US8166297B2 (en) 2008-07-02 2012-04-24 Veritrix, Inc. Systems and methods for controlling access to encrypted data stored on a mobile device
WO2010051342A1 (en) * 2008-11-03 2010-05-06 Veritrix, Inc. User authentication for social networks
US8798998B2 (en) * 2010-04-05 2014-08-05 Microsoft Corporation Pre-saved data compression for TTS concatenation cost
US8731931B2 (en) * 2010-06-18 2014-05-20 At&T Intellectual Property I, L.P. System and method for unit selection text-to-speech using a modified Viterbi approach
US8751236B1 (en) 2013-10-23 2014-06-10 Google Inc. Devices and methods for speech unit reduction in text-to-speech synthesis systems
US9520123B2 (en) * 2015-03-19 2016-12-13 Nuance Communications, Inc. System and method for pruning redundant units in a speech synthesis process
US10353863B1 (en) 2018-04-11 2019-07-16 Capital One Services, Llc Utilizing machine learning to determine data storage pruning parameters

Citations (3)

Publication number Priority date Publication date Assignee Title
US20020143543A1 (en) * 2001-03-30 2002-10-03 Sudheer Sirivara Compressing & using a concatenative speech database in text-to-speech systems
US20030212555A1 (en) * 2002-05-09 2003-11-13 Oregon Health & Science System and method for compressing concatenative acoustic inventories for speech synthesis
US20030229494A1 (en) * 2002-04-17 2003-12-11 Peter Rutten Method and apparatus for sculpting synthesized speech

Family Cites Families (4)

Publication number Priority date Publication date Assignee Title
US6366883B1 (en) 1996-05-15 2002-04-02 Atr Interpreting Telecommunications Concatenation of speech segments by use of a speech synthesizer
US6173263B1 (en) 1998-08-31 2001-01-09 At&T Corp. Method and system for performing concatenative speech synthesis using half-phonemes
EP1138038B1 (en) 1998-11-13 2005-06-22 Lernout & Hauspie Speech Products N.V. Speech synthesis using concatenation of speech waveforms
US6260016B1 (en) 1998-11-25 2001-07-10 Matsushita Electric Industrial Co., Ltd. Speech synthesis employing prosody templates

Patent Citations (3)

Publication number Priority date Publication date Assignee Title
US20020143543A1 (en) * 2001-03-30 2002-10-03 Sudheer Sirivara Compressing & using a concatenative speech database in text-to-speech systems
US20030229494A1 (en) * 2002-04-17 2003-12-11 Peter Rutten Method and apparatus for sculpting synthesized speech
US20030212555A1 (en) * 2002-05-09 2003-11-13 Oregon Health & Science System and method for compressing concatenative acoustic inventories for speech synthesis

Non-Patent Citations (4)

Title
CONKIE A. ET AL: "Preselection of Candidate Units in a Unit Selection-Based Text-To-Speech Synthesis System", SIXTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING (ICSLP 2000), vol. 3, October 2000 (2000-10-01), pages 314 - 317, XP002971946 *
DONOVAN R.E.: "Segment pre-selection in decision-tree based speech synthesis systems", 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, vol. 2, June 2000 (2000-06-01), pages 937 - 940, XP010504878 *
HON ET AL: "Automatic generation of synthesis units for trainable text-to-speech systems", PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING. ICASSP '98, May 1998 (1998-05-01), pages 293 - 296, XP010279159 *
YI ET AL: "Information-Theoretic Criteria for Unit Selection Synthesis", PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, 2002, pages 2617 - 2620, XP002982190 *

Also Published As

Publication number Publication date
US20040153324A1 (en) 2004-08-05
US6988069B2 (en) 2006-01-17
WO2004070560A2 (en) 2004-08-19

Similar Documents

Publication Publication Date Title
WO2004070560A3 (en) Reduced unit database generation based on cost information
WO2004070701A3 (en) Linguistic prosodic model-based text to speech
WO2005074630A3 (en) Multilingual text-to-speech system with limited resources
ATE374991T1 (en) METHOD AND SYSTEM FOR TEXT-TO-SPEECH CONVERSION
JP2004287444A5 (en)
WO2007044568A3 (en) Generating words and names using n-grams of phonemes
EP1544746A3 (en) Creation of normalized summaries using common domain models for input text analysis and output text generation
WO2004003688A3 (en) A method for comparing a transcribed text file with a previously created file
ATE484029T1 (en) TRANSLATION PROCEDURE FOR HIGHLIGHTED WORDS
WO2007027410A3 (en) Information synthesis engine
WO2004097791A3 (en) Methods and systems for creating a second generation session file
MXPA05007544A (en) Device and method for voicing phonemes, and keyboard for use in such a device.
GB2451371A (en) Method and systems for correcting transcribed audio files
ATE404967T1 (en) TEXT-TO-SPEECH SYSTEM AND METHOD, COMPUTER PROGRAM THEREOF
WO2001001373A3 (en) Electronic book with voice synthesis and recognition
WO2003098486A3 (en) Methods and systems for providing supplemental contextual content
WO2003071393A3 (en) Linguistic support for a regognizer of mathematical expressions
WO2008142836A1 (en) Voice tone converting device and voice tone converting method
WO2007005884A3 (en) Generating chinese language couplets
WO2006107586A3 (en) Method and system for interpreting verbal inputs in a multimodal dialog system
WO2001033409A3 (en) Computer generated poetry system
ATE537499T1 (en) MULTIMEDIA CONSOLE WITH ALPHANUMERIC KEYBOARD AND MUSIC KEYBED
WO2007002652A3 (en) Translating expressions in a computing environment
CA2694317A1 (en) Apparatus, systems and methods for language instruction
TW200707239A (en) E-mail assisted and text-to-sound system

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 EP: the EPO has been informed by WIPO that EP was designated in this application
DPEN Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed from 2004-01-01)
122 EP: PCT application non-entry in European phase