WO2003052624A1 - A real time translator and method of performing real time translation of a plurality of spoken word languages - Google Patents
A real time translator and method of performing real time translation of a plurality of spoken word languages Download PDFInfo
- Publication number
- WO2003052624A1 WO2003052624A1 PCT/AU2002/001706 AU0201706W WO03052624A1 WO 2003052624 A1 WO2003052624 A1 WO 2003052624A1 AU 0201706 W AU0201706 W AU 0201706W WO 03052624 A1 WO03052624 A1 WO 03052624A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- voice
- text
- language
- translator
- real time
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/58—Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
Definitions
- This invention relates to a real time translator for providing multi language "spoken word" communication, conversation, and/or dialogue, conferencing and public address system. It is particularly related to a multilanguage conversation translator for the tourist, business or professional translation but is not limited to such use.
- the greatest ability the human race possesses is that of communication via sophisticated languages that have evolved over time. However, it is also the biggest barrier currently facing humankind. Even as the word “globalisation” is frequently used these days in the field of trade and business as well as many other areas of interaction between the different peoples of the world, the main "obstacle” to achieving true globalisation are language barriers. This limits the ability to communicate & converse one-on-one between people who converse through one of the many different languages.
- Translations are required in a number of situations including: • The tourist in a foreign country where he does not speak the language struggles to make himself understood for the most basic of requirements like asking for directions or making a purchase.
- Translators though must be created with regard to the basic architecture of a typical spoken language translation or natural language processing system processes sounds produced by a speaker by converting them into digital form using an analogue-to-digital converter. This signal is processed to extract various features, such as the intensity of sound at different frequencies and the change in intensity over time. These features serve as the input to a speech recognition system, which generally uses Hidden Markov Model (HMM) techniques to identify the most likely sequence of words that could have produced the speech signal.
- HMM Hidden Markov Model
- the speech recogniser outputs the most likely sequence of words to serve as input to a natural language processing system.
- the natural language processing system needs to generate an utterance, it passes a sentence to a module that translates the words into phonemic sequence and determines an intonational contour, and passes this information on to a speech synthesis system, which produces the spoken output.
- a natural language processing system uses considerable knowledge about the structure of the language, including what the words are, how words combine to form sentences, what the words mean, and how word meanings contribute to sentence meanings.
- linguistic behaviour cannot be completely accounted for without also taking into account another aspect of what makes humans intelligent-their general world knowledge and their reasoning abilities. For example, to answer questions or to participate in a conversation, a person not only must have knowledge about the structure of the language being used, but also must know about the world in general and the conversational setting.
- the different forms of knowledge relevant for natural language processing comprise phonetic and phonological knowledge, morphological knowledge, syntactic knowledge, semantic knowledge, and pragmatic knowledge.
- Phonetic and phonological knowledge concerns how words are related to the sounds that realize them. Such knowledge is crucial for speech-based systems.
- Morphological knowledge concerns how words are constructed from basic units called morphemes. A morpheme is the primitive unit in a language; for example, the word friendly is derivable from the meaning of the noun friend and the suffix "-ly", which transforms a noun into an adjective.
- Syntactic knowledge concerns how words can be put together to form correct sentences and determines what structural role each word plays in the sentence and what phrases are subparts of what other phrases.
- Typical syntactic representations of language are based on the notion of context-free grammars, which represent sentence structure in terms of what phrases are subparts of other phrases. This syntactic information is often presented in a tree form.
- Semantic knowledge concerns what words mean and how these meanings combine in sentences to form sentence meanings. This is the study of context-independent meaning- -the meaning a sentence has regardless of the context in which it is used.
- the representation of the context-independent meaning of a sentence is called its logical form.
- the logical form encodes possible word senses and identifies the semantic relationships between the words and phrases.
- Natural language processing systems further comprise interpretation processes that map from one representation to the other.
- the process that maps a sentence to its syntactic structure and logical form is called parsing, and it is performed by a component called a parser.
- the parser uses knowledge about word and word meaning, the lexicon, and a set of rules defining the legal structures, the grammar, in order to assign a syntactic structure and a logical form to an input sentence.
- a context-free grammar of a language is a quadruple comprising non-terminal vocabularies, terminal vocabularies, a finite set of production rules, and a starting symbol for all productions.
- the non-terminal and terminal vocabularies are disjoint.
- the set of terminal symbols is called the vocabulary of the language.
- Pragmatic knowledge concerns how sentences are used in different situations and how use affects the interpretation of the sentence.
- a natural language processor receives an input sentence, lexically separates the words in the sentence, syntactically determines the types of words, semantically understands the words, pragmatically determines the type of response to generate, and generates the response.
- the natural language processor employs many types of knowledge and stores different types of knowledge in different knowledge structures that separate the knowledge into organized types.
- a typical natural language processor also uses very complex capabilities. The knowledge and capabilities of the typical natural language processor must be reduced in complexity and refined to make the natural language processor manageable and useful because a natural language processor must have more than a reasonably correct response to an input sentence.
- Identified problems with previous approaches to natural language processing are numerous and involve many components of the typical speech translation system.
- one previous approach combines the syntactic rules for analysis together with the transfer patterns or transfer rules.
- the syntactic rules and the transfer rules become inter-dependent, and the system becomes less modular and difficult to extend in coverage or apply to a new translation domain.
- US Patent No 6,278,968 also describes a detailed large computer translator.
- the described invention relates to translating from one language to another. More particularly, the described invention relates to providing translation between languages based, at least in part, on a user selecting a particular topic that the translation focuses on. In this way, the translator is limited and not able to provide a true conversation translator.
- the aim of the invention is to provide an electronic solution to the language barrier between languages for the spoken word.
- the invention provides a multilanguage conversation translator having dual voice paths operated by one or more sound cards and software so that conversation from one person in one spoken word language is translated and received by a second person in a second spoken word language at the same time or substantially at the same time as conversation from the second person in the second spoken word language is translated and received by the first person whereby the two persons can undertake a normal conversation in normal time but in different spoken word languages.
- the translator can be portable or hand-held with inbuilt or attached headset or the like.
- a real time translator comprising:
- a real time translator comprising:
- the real time translator could include two sound paths formed by two separate electronic sound manipulators with associated software such that the sound of the first voice in first language being received can be converted to text while the translated text into the second selected language is being converted to voice by the second separate electronic sound manipulator with associated software.
- the separate electronic sound manipulators may be two personal computer sound cards or the like, or two separate left and right channels of a single personal computer sound card or the like with separate software control.
- a portable real time translator comprising (a) first and second voice receivers for receiving first and second selected voice languages;
- first and second voice to text converters at least one text to text spoken language converter for receiving a first selected language text and translating to a second selected language text and/or for receiving the second selected language text and translating to the first selected language text;
- response time in the processing of conversion of first and second voice conversions to or from text and/or with text to text voice language translation such that the lag time between receiving voice and emitting translated voice is within a reasonable conversation period. Such period can be less than one second to a maximum of two seconds.
- voice translation and emission is in voice phrases substantially corresponding with voice phrasing of input voice such that a continual flow of spaced voice phrases simulates conversations.
- voice phrases are a sentence or part of a sentence.
- This can be by separate processing paths including the separate personal computer sound cards or the like or separate channels on a sound card or the like or by a switching system for switching between two processing paths at a rate to maintain reasonable real time processing of both paths simultaneously.
- the invention also provides a method of providing real time translation of voices.
- the method includes:
- the parallel processing can be by central processing unit (cpu), parallel processing techniques but primarily by parallel processing via software controlled switching techniques. Therefore both paths are always operating bi-directional both ways to provide conversation.
- the software has to overcome the difficulty that another later installed sound cards will generally override a single sound card-operating environment in normal uses.
- the software overcomes this predetermined intent and the unusual parallel operation of two sound cards in a parallel operation of software controlled switching between the speed of a voice phrase of between less than one second to a maximum 2 seconds to the megahertz speed of the central processing unit (cpu).
- This invention provides a practical solution to enable:
- a conversation and/or dialogue (which is relatively immediate, instant and on-the- spot) between two persons or groups wishing to communicate by conversing in two different languages either face-to-face or over a telephone line (or similar); and (2) a speaker to communicate by addressing an audience in a language that is different to that of the audience (3) the audience to respond with comments and questions to the speaker.
- the main applications that can use the disclosed translator are the three scenarios of:
- Person-to-person conversation and/or dialogue in two different languages at any one instance enabling a face-to-face conversation or dialogue (type method of communication) between speakers of two different languages.
- the invention provides an innovative and practical solution to the above scenarios providing the ability to communicate (speak) in language-A and be understood (heard) in language-B - immediately, instantly and "on the spot". With the ability in reverse to communicate (reply back) in language-B and be understood (heard) in language-A. As in the first two scenarios the ability to have a real-time conversation / dialogue in two different languages. In the third scenario the ability to communicate by "addressing" or "to inform” in one language but be understood (heard) in a different language and to receive response from the audience in the form of comments or questions.
- the system is also particularly useful as an educational tool because it is able to provide variable inputs and real time translations.
- a keyboard entry can provide a real time verbal translation.
- Figure 1 is a flow chart of a real time translator in accordance with a first embodiment of the invention
- Figure 2 is a diagrammatic representation of a real time translator of Figure 1
- Figure 3 is a diagrammatic representation of a first use of a real time translator in accordance with the invention
- Figure 4 is a diagrammatic representation of a second use of a real time translator in accordance with the invention.
- Figure 4A is a diagrammatic representation of a further use of a real time translator in accordance with the invention as used on a server of a telephone company or telecommunication service provider;
- Figure 5 is a diagrammatic representation of a third use of a real time translator in accordance with the invention.
- a real time translator (101) having a voice receiver or microphone (101), a voice to text converter (102), a text-to-text spoken language translator (103) for receiving a first language and translating to a second selected language, a text to speech converter (105) for converting the translated second selected language to a voice output and a voice emitter or speaker (211) for emitting the voice output.
- the real time translator (101) having a second voice receiver or microphone (201), a voice to text converter (202), a text-to-text spoken language translator (203) for receiving a second language and translating to the first selected language, a text to speech converter (105) for converting the translated first selected language to a voice output and a voice emitter or speaker (111) for emitting the voice output.
- Processing of the voice to text conversion and/or text translation and/or the text to voice conversion is by a central processing unit (cpu) or the like with software control of the sound card/s (151,152).
- the parallel processing can be by central processing unit (cpu) parallel processing techniques or by software controlled switching techniques.
- the real time translator (101) includes two sound paths formed by two separate electronic sound manipulators with associated software such that the sound of the first voice in first language being received can be converted to text while the translated text into the second selected language is being converted to voice by the second separate electronic sound manipulator with associated software. This is provided by the separate electronic sound manipulators of the two personal computer sound cards (151,152) or the like, or two separately operated left and right channels (151 A, 15 IB) of a single personal computer sound card (151) or the like with separate software control.
- response time in the processing of conversion of first and second voice conversions to or from text and/or with text to text voice language translation such that the lag time between receiving voice and emitting translated voice is within a reasonable conversation period. Such period can be less than one second to a maximum of two seconds.
- voice translation and emission is in voice phrases substantially corresponding with voice phrasing of input voice such that a continual flow of spaced voice phrases simulates conversations.
- voice phrases are a sentence or part of a sentence.
- the essence of the invention is to enable a conversation / dialogue between two different languages and as such the invention remains unchanged irrespective of the languages in which the conversation or dialogue is conducted in.
- Conversation between the following languages will include English, Korean, French, Simplified Chinese, Traditional Chinese, Italian, German, Spanish, and Japanese.
- the technical methodology behind the invention includes three (3) basic steps:
- Step - 1 Receive spoken word or sentence via an input source
- Words spoken in reply or any words spoken in language-B is received via microphone (201) and converted to text. Words of language-B (in text format) are translated within real time translator (150) to language-A (also in text format). Real time translator (150) switches focus to speaker (111) and, the text from the words of language-A is converted to speech and "spoken out” through speaker (111). All of the above happens instantly, immediately and "on-the-spot" enabling a real-time conversation/dialogue between two different languages.
- Real time translator software (160) is invoked based on input from one of the two voice input sources (101,201) and will receive the input-source of the "spoken word” and/or "sentence” via a channel of input such as a microphone or via a telephone line, spoken by person- 1 in language A.
- the invention works based on software-controlled operation of two sound cards or through software, that utilises the operating system aspects of the "left & right" channel (151A, 151B) capability of a single sound card (151).
- the preferred embodiment has the two sound cards plus software method.
- the invention of real time translator ( 150) is based on receiving spoken words from voice input devices such as.
- Step - 2 Translate the text
- the input source of words/sentence that was received and converted to text from step-1 is translated from one language to another.
- the software package used for this purpose was IBM's software package of "Language
- Step - 3 Speak out the converted text
- the final step is - text-to-speech.
- the last step is to convert back to speech and "speak out" the text in words of translated language.
- the software package used for this purpose was the TTS Software PackageTM by the Microsoft Corporation. This software package is specifically marketed and sold by MicrosoftTM for the development of text-to-speech applications. However, any similar text-to-speech software can be used of which there are several on the market or similar software can be written. However, either way the overall real time translator (150) invention behind the entire process of real time translator software (160) remains unchanged.
- Real time translator hardware (151 , 152, 153) (Portable Hardware configured for real time translator software (160)) - running real time translator software (160). Attached with microphone/speaker (via headset or other) to sound card-1. Also attached to sound card-2 is another microphone/speaker (either free-standing or also via a headset). Sound card-1 and the corresponding microphone& speaker are used by person-
- Person-1 speaks into microphone attached to sound card-1 - those words (sentence) spoken in language-A, are received by the real time translator software (160) controlling input microphone (101), plus the conversion to text.
- Real time translator software switches control internally within real time translator (150) to sound card-2
- Real time translator software controls input from microphone (201).
- Real time translator software 160 and Software controlled by it translates the language-B text to language-A text.
- Real time translator software switches control internally within real time translator (150) to sound card-1
- Real time translator hardware 151,152, 153 (Portable personal computer configured for real time translator software (160)) - running real time translator software (160). Attached with Microphone/speaker (via headset or other) to sound card-1. Sound card-2 is attached to the normal, industry standard Noice Modem and the output from the Noice Modem is connected to a normal, standard telephone socket. No special connection is required at Person-2's location and s represented by a normal telephone acting as another Microphone/speaker. Therefore sound card-1 and the corresponding microphone& speaker are used by Person-1 and sound card-2 and the corresponding microphone& speaker (via telephone) are used by Person-2.
- Real time translator software controls input from microphone (101).
- Real time franslator software 160
- Software controlled by it translates the language-A text to language-B text.
- Real time translator software switches control internally within real time translator (150) to sound card-2.
- Real time translator software (160) switches control internally within real time translator (150) to sound card-1, • The translated words by real time translator (150) of language-A are switched to sound card-1, converted to speech and "spoken” and heard by Person-1 via the speaker (headset or other) attached to sound card-1.
- person-2 at the other end does not require real time translator (150) or any special device, as real time translator (150) of person-1 performs all the work.
- Telephone or similar telecommunication method as provided by the telephone company or the telecommunication service provider.:
- Real time translator hardware ( 151 , 152, 153) (Portable personal computer configured for real time translator software (160)) - running real time translator software (160). Attached with the handset of a telephone of the caller (Person 1) or a
- Microphone/speaker via headset or other to sound card-1 on the telephone company or service provider's server.
- Sound card-2 is also attached to the telephone company or service provider's server and connects out to the out-going telephone network that would when a call is placed eventually connecting to the telephone being called (Person 2) I this Person-to-Person
- Dialling of the telephone number is done by person-1 through a special number provided by the telephone company or service provider for this special service and is then connected to the server (where the real time translator software (160) resides).
- Person- 1 then follows some voice prompts as instructed by the telephone company or service provider's server and then dials the receiver's telephone number.
- the receiver is then connected to the same server where the real time translator software (160) resides and to soundcard-2.
- Person-1 speaks into microphone attached to sound card-1 on the telephone company or service provider's server - and those words of language-A is received by the real time translator software (160) controlling input microphone/telephone (101), plus the conversion to text.
- Real time translator software (160) on the telephone company or service provider's server switches control internally within real time translator (150) to sound card-2.
- Real time translator software and Software controlled by it translates the language-B text to language-A text.
- Real time translator software switches control internally within real time translator (150) to sound card-1,
- person-1 talks to many persons (represented by person-2) • Real time translator hardware ( 151 , 152, 153) (portable personal computer configured for real time translator software (160)) - running real time translator software (160). Attach Microphone/speaker (via headset or stand alone) to sound card-1.
- Real time translator software controls input from microphone (101).
- Real time translator software 160 and Software controlled by it translates the language-A text to language-B text.
- Real time translator software switches control internally within real time translator (150) to sound card-2
- the invention including the real time translator software (160) and hardware provides for an easy two-way conversation/ dialogue between two (2) different languages at a single instance.
- the special configuration requirement of the real time translator (150) is to add two sound cards.
- the same effect can also be obtained by coding to utilise the "left & right" channel invention of the single sound card but for the prototype the two sound card, approach was taken.
- Real time translator software effectively breaks down the barriers of language. Whether it be English to Chinese or German to Japanese the difference of language and the inability to speak and establish a dialogue with someone unable to understand your own and only speaking a different language is changed forever by real time translator (150).
- Real time translator (150) is a companion and friend for the traveller and the tourist means and provides complete freedom. User can travel freely and easily from country to country and make themselves understood as well as to understand the spoken language - instantly and "on the spot", without requiring to study or know any language at all
- the real time translator (150) for the businessperson provides an effective means of communication.
- the invention also provides a commercial tool that provides for easy communication over the phone without the expensive and wasteful exercise of wasting time and money. No language barrier & the accompanying problems/frustrations, talk directly to clients, suppliers, and potential business contacts.
- Real time translator provides for an effective tool in mass communications, and education presentations, when communication is required in a different language, as well as for government organizations requiring dealing with people speaking different languages.
- the invention also provides two versions of the software.
- a first version there is the following set-up: ⁇
- the real time translator software will be installed on a PC and will be represented by application screens to guide the users.
- a microphone will be controlled by the software and receive input as spoken by the user via the microphone or by a keyboard entry.
- the real time translator software will convert and translate the software from language A to language B and speak it back instantly through the PC speaker in real time and substantially instantly. It can therefore be seen that the software is also useable as a learning tool/aid for the purpose of earning to speak a foreign language.
- the software further enables the user to hear the words back in language B enabling the user to learn the equivalent words in language and also the correct pronunciation and proper method of speaking.
- This is a distinct advantage over any other similar tool that only permits pre-recorded and pre-inputted phrases and words. This enables the user to learn by speaking and hearing back in "free format" speech of their choice. Thus the learning process will be significantly easier and of more practical usage.
- a parallel application screen using the same functionality of the real time translator. This enables the user to practice pronunciation and speaking in language B and have it translated back into language A in real time and substantially instantly. The user thereby can learn pronunciation accurately as the translation back to language A will only speak back the original words if the pronunciation is substantially correct.
Abstract
Description
Claims
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2002351866A AU2002351866A1 (en) | 2001-12-17 | 2002-12-17 | A real time translator and method of performing real time translation of a plurality of spoken word languages |
JP2003553443A JP2005513619A (en) | 2001-12-17 | 2002-12-17 | Real-time translator and method for real-time translation of multiple spoken languages |
EP02787195A EP1468376A1 (en) | 2001-12-17 | 2002-12-17 | A real time translator and method of performing real time translation of a plurality of spoken word languages |
CA002510663A CA2510663A1 (en) | 2001-12-17 | 2002-12-17 | A real time translator and method of performing real time translation of a plurality of spoken word languages |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AUPR9569A AUPR956901A0 (en) | 2001-12-17 | 2001-12-17 | Real time translator |
AUPR9569 | 2001-12-17 | ||
US10/081,773 | 2002-02-20 | ||
US10/081,773 US20030115059A1 (en) | 2001-12-17 | 2002-02-20 | Real time translator and method of performing real time translation of a plurality of spoken languages |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2003052624A1 true WO2003052624A1 (en) | 2003-06-26 |
Family
ID=25646860
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/AU2002/001706 WO2003052624A1 (en) | 2001-12-17 | 2002-12-17 | A real time translator and method of performing real time translation of a plurality of spoken word languages |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP1468376A1 (en) |
JP (1) | JP2005513619A (en) |
CN (1) | CN1602483A (en) |
CA (1) | CA2510663A1 (en) |
WO (1) | WO2003052624A1 (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2923928A1 (en) * | 2007-11-19 | 2009-05-22 | Bonneton William | AUTOMATIC SIMULTANEOUS INTERPRETATION SYSTEM. |
WO2009080908A1 (en) | 2007-09-28 | 2009-07-02 | Pedre Joel | Method and device for translation as well as a headset implemented by said device |
WO2012038612A1 (en) | 2010-09-21 | 2012-03-29 | Pedre Joel | Built-in verbal translator having built-in speaker recognition |
US9818397B2 (en) | 2013-08-26 | 2017-11-14 | Google Technology Holdings LLC | Method and system for translating speech |
DE102019208742A1 (en) * | 2019-06-17 | 2020-12-17 | Audi Ag | Speech translation system for providing a translation of a speech input signal from a speaker into a speech output signal in a different language for a listener, as well as a translation method for such a speech translation system |
CN112435690A (en) * | 2019-08-08 | 2021-03-02 | 百度在线网络技术(北京)有限公司 | Duplex Bluetooth translation processing method and device, computer equipment and storage medium |
Families Citing this family (29)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8239185B2 (en) * | 2008-01-17 | 2012-08-07 | Geacom, Inc. | Method and system for situational language translation |
CN101739867B (en) * | 2008-11-19 | 2012-03-28 | 中国科学院自动化研究所 | Method for scoring interpretation quality by using computer |
JP5750380B2 (en) * | 2012-02-10 | 2015-07-22 | 株式会社東芝 | Speech translation apparatus, speech translation method, and speech translation program |
CN103838714A (en) * | 2012-11-22 | 2014-06-04 | 北大方正集团有限公司 | Method and device for converting voice information |
US9646626B2 (en) | 2013-11-22 | 2017-05-09 | At&T Intellectual Property I, L.P. | System and method for network bandwidth management for adjusting audio quality |
TW201608414A (en) * | 2014-08-18 | 2016-03-01 | Richplay Information Co Ltd | Speech assistance system in combination with mobile device |
US9542929B2 (en) * | 2014-09-26 | 2017-01-10 | Intel Corporation | Systems and methods for providing non-lexical cues in synthesized speech |
CN104409077A (en) * | 2014-10-17 | 2015-03-11 | 广州三星通信技术研究有限公司 | Method and device for obtaining texts corresponding to conversation voices in terminal |
CN105632498A (en) * | 2014-10-31 | 2016-06-01 | 株式会社东芝 | Method, device and system for generating conference record |
CN104580779B (en) * | 2015-01-19 | 2018-03-02 | 刘建芳 | A kind of remote speech translates terminal |
CN106156009A (en) * | 2015-04-13 | 2016-11-23 | 中兴通讯股份有限公司 | Voice translation method and device |
KR20170044849A (en) * | 2015-10-16 | 2017-04-26 | 삼성전자주식회사 | Electronic device and method for transforming text to speech utilizing common acoustic data set for multi-lingual/speaker |
CN107230477A (en) * | 2016-03-23 | 2017-10-03 | 刘国 | Automatic translation global communications systems |
CN105979421A (en) * | 2016-06-24 | 2016-09-28 | 陈灿伟 | Bluetooth headset based on simultaneous interpretation and simultaneous interpretation system using the same |
CN107798386B (en) * | 2016-09-01 | 2022-02-15 | 微软技术许可有限责任公司 | Multi-process collaborative training based on unlabeled data |
CN107979686A (en) * | 2016-10-25 | 2018-05-01 | 北京分音塔科技有限公司 | A kind of system for realizing real-time voice intertranslation |
CN107993646A (en) * | 2016-10-25 | 2018-05-04 | 北京分音塔科技有限公司 | A kind of method for realizing real-time voice intertranslation |
CN106935240A (en) * | 2017-03-24 | 2017-07-07 | 百度在线网络技术(北京)有限公司 | Voice translation method, device, terminal device and cloud server based on artificial intelligence |
JP2018173910A (en) * | 2017-03-31 | 2018-11-08 | 株式会社リクルートライフスタイル | Voice translation system and voice translation program |
CN107708006B (en) * | 2017-08-23 | 2020-08-28 | 广东思派康电子科技有限公司 | Computer-readable storage medium, real-time translation system |
CN107659881A (en) * | 2017-09-30 | 2018-02-02 | 夏敬懿 | One kind orientation sound collector and orientation collection sound translator |
WO2019084962A1 (en) * | 2017-11-06 | 2019-05-09 | 深圳市沃特沃德股份有限公司 | Speech-to-speech translation method and apparatus, and translating machine |
CN107885731A (en) * | 2017-11-06 | 2018-04-06 | 深圳市沃特沃德股份有限公司 | Voice translation method and device |
US20210232776A1 (en) * | 2018-04-27 | 2021-07-29 | Llsollu Co., Ltd. | Method for recording and outputting conversion between multiple parties using speech recognition technology, and device therefor |
CN109614628A (en) * | 2018-11-16 | 2019-04-12 | 广州市讯飞樽鸿信息技术有限公司 | A kind of interpretation method and translation system based on Intelligent hardware |
US20220027579A1 (en) * | 2018-11-30 | 2022-01-27 | Panasonic Intellectual Property Management Co., Ltd. | Translation device and translation method |
CN110232919A (en) * | 2019-06-19 | 2019-09-13 | 北京智合大方科技有限公司 | Real-time voice stream extracts and speech recognition system and method |
WO2021253402A1 (en) * | 2020-06-19 | 2021-12-23 | 深圳市台电实业有限公司 | Portable remote simultaneous interpretation and translation platform |
CN112201224A (en) * | 2020-10-09 | 2021-01-08 | 北京分音塔科技有限公司 | Method, equipment and system for simultaneous translation of instant call |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1995016968A1 (en) * | 1993-12-15 | 1995-06-22 | Jean Gachot | Method and device for converting a first voice message in a first language into a second message in a predetermined second language |
DE19508017A1 (en) * | 1995-03-07 | 1996-09-12 | Siemens Ag | Portable communication device with translation capability |
DE20100313U1 (en) * | 2001-01-09 | 2001-03-29 | Lich Willi | Mobile system for voice output |
US6266642B1 (en) * | 1999-01-29 | 2001-07-24 | Sony Corporation | Method and portable apparatus for performing spoken language translation |
US6292769B1 (en) * | 1995-02-14 | 2001-09-18 | America Online, Inc. | System for automated translation of speech |
-
2002
- 2002-12-17 CA CA002510663A patent/CA2510663A1/en not_active Abandoned
- 2002-12-17 EP EP02787195A patent/EP1468376A1/en not_active Withdrawn
- 2002-12-17 JP JP2003553443A patent/JP2005513619A/en active Pending
- 2002-12-17 CN CNA028248325A patent/CN1602483A/en active Pending
- 2002-12-17 WO PCT/AU2002/001706 patent/WO2003052624A1/en not_active Application Discontinuation
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1995016968A1 (en) * | 1993-12-15 | 1995-06-22 | Jean Gachot | Method and device for converting a first voice message in a first language into a second message in a predetermined second language |
US6292769B1 (en) * | 1995-02-14 | 2001-09-18 | America Online, Inc. | System for automated translation of speech |
DE19508017A1 (en) * | 1995-03-07 | 1996-09-12 | Siemens Ag | Portable communication device with translation capability |
US6266642B1 (en) * | 1999-01-29 | 2001-07-24 | Sony Corporation | Method and portable apparatus for performing spoken language translation |
DE20100313U1 (en) * | 2001-01-09 | 2001-03-29 | Lich Willi | Mobile system for voice output |
Non-Patent Citations (3)
Title |
---|
KARAORMAN M. ET AL.: "An experimental Japanese/English interpreting video phone system", PROCEEDINGS OF ICSLP'96, 1997 * |
LAVIE A. ET AL.: "JANUS III: speech-to-speech translation in multiple languages", PROCEEDINGS OF ICASSP'97, 1997 * |
WAIBEL A.: "Interactive translation of conversational speech", COMPUTER, vol. 29, no. 7, July 1996 (1996-07-01) * |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2009080908A1 (en) | 2007-09-28 | 2009-07-02 | Pedre Joel | Method and device for translation as well as a headset implemented by said device |
US8311798B2 (en) | 2007-09-28 | 2012-11-13 | Joel Pedre | Translation method and a device, and a headset forming part of said device |
FR2923928A1 (en) * | 2007-11-19 | 2009-05-22 | Bonneton William | AUTOMATIC SIMULTANEOUS INTERPRETATION SYSTEM. |
WO2009071795A1 (en) * | 2007-11-19 | 2009-06-11 | Bonneton, William | Automatic simultaneous interpretation system |
US8606560B2 (en) | 2007-11-19 | 2013-12-10 | William BONNETON | Automatic simultaneous interpertation system |
WO2012038612A1 (en) | 2010-09-21 | 2012-03-29 | Pedre Joel | Built-in verbal translator having built-in speaker recognition |
US9818397B2 (en) | 2013-08-26 | 2017-11-14 | Google Technology Holdings LLC | Method and system for translating speech |
DE102019208742A1 (en) * | 2019-06-17 | 2020-12-17 | Audi Ag | Speech translation system for providing a translation of a speech input signal from a speaker into a speech output signal in a different language for a listener, as well as a translation method for such a speech translation system |
DE102019208742B4 (en) * | 2019-06-17 | 2021-01-14 | Audi Ag | Speech translation system for providing a translation of a speech input signal from a speaker into a speech output signal in a different language for a listener, as well as a translation method for such a speech translation system |
CN112435690A (en) * | 2019-08-08 | 2021-03-02 | 百度在线网络技术(北京)有限公司 | Duplex Bluetooth translation processing method and device, computer equipment and storage medium |
Also Published As
Publication number | Publication date |
---|---|
EP1468376A1 (en) | 2004-10-20 |
CA2510663A1 (en) | 2003-06-26 |
CN1602483A (en) | 2005-03-30 |
JP2005513619A (en) | 2005-05-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20030115059A1 (en) | Real time translator and method of performing real time translation of a plurality of spoken languages | |
EP1468376A1 (en) | A real time translator and method of performing real time translation of a plurality of spoken word languages | |
Seeber | Community sex work: Simultaneous interpreting | |
CN111128126A (en) | Multi-language intelligent voice conversation method and system | |
US20090144048A1 (en) | Method and device for instant translation | |
Takezawa et al. | Multilingual spoken language corpus development for communication research | |
US20020198716A1 (en) | System and method of improved communication | |
JPH07129594A (en) | Automatic interpretation system | |
KR101959439B1 (en) | Method for interpreting | |
Cooper et al. | Reading aids for the blind: A special case of machine-to-man communication | |
JP2009122989A (en) | Translation apparatus | |
US11790913B2 (en) | Information providing method, apparatus, and storage medium, that transmit related information to a remote terminal based on identification information received from the remote terminal | |
JP2021027430A (en) | Multilingual conference system | |
TWM556360U (en) | Video-based synchronous translation system | |
AU2002351866A1 (en) | A real time translator and method of performing real time translation of a plurality of spoken word languages | |
KR101233655B1 (en) | Apparatus and method of interpreting an international conference based speech recognition | |
KR20200081925A (en) | System for voice recognition of interactive robot and the method therof | |
KR20020054192A (en) | A system and method for interpreting automatically a telephony guidance for a foreigner | |
JP7267346B2 (en) | Translation method and translation device | |
Farangiz | Characteristics of Simultaneous Interpretation Activity and Its Importance in the Modern World | |
Wang et al. | Real-Time Voice-Call Language Translation | |
Delic et al. | Speech-Enabled Computers as a Tool for Serbian-Speaking Blind Persons | |
ElAarag et al. | A speech recognition and synthesis tool | |
Thomas | Audibly: Speech to American Sign Language converter | |
SCHINDLER | Web-Based Education: A Speech Recognition And Synthesis Tool |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ OM PH PL PT RO RU SC SD SE SG SK SL TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR IE IT LU MC NL PT SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2002351866 Country of ref document: AU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 20028248325 Country of ref document: CN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2003553443 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2002787195 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 2002787195 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2510663 Country of ref document: CA |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: 2002787195 Country of ref document: EP |