DE60221408D1 - Verfahren zur bild- und tonbearbeitung unter verwendung von spracherkennung - Google Patents
Verfahren zur bild- und tonbearbeitung unter verwendung von spracherkennungInfo
- Publication number
- DE60221408D1 DE60221408D1 DE60221408T DE60221408T DE60221408D1 DE 60221408 D1 DE60221408 D1 DE 60221408D1 DE 60221408 T DE60221408 T DE 60221408T DE 60221408 T DE60221408 T DE 60221408T DE 60221408 D1 DE60221408 D1 DE 60221408D1
- Authority
- DE
- Germany
- Prior art keywords
- picture
- processing method
- voice recognition
- sound processing
- user interface
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000003672 processing method Methods 0.000 title 1
- 238000000034 method Methods 0.000 abstract 1
- 230000001360 synchronised effect Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/34—Indicating arrangements
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/02—Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
- G11B27/031—Electronic editing of digitised analogue information signals, e.g. audio or video signals
- G11B27/034—Electronic editing of digitised analogue information signals, e.g. audio or video signals on discs
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Studio Circuits (AREA)
- Television Signal Processing For Recording (AREA)
- Image Processing (AREA)
- Management Or Editing Of Information On Record Carriers (AREA)
- Indexing, Searching, Synchronizing, And The Amount Of Synchronization Travel Of Record Carriers (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/067,131 US7343082B2 (en) | 2001-09-12 | 2001-09-12 | Universal guide track |
PCT/CA2002/001386 WO2003023765A1 (en) | 2001-09-12 | 2002-09-12 | Method and device for processing audiovisual data using speech recognition |
Publications (1)
Publication Number | Publication Date |
---|---|
DE60221408D1 true DE60221408D1 (de) | 2007-09-06 |
Family
ID=22073905
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE60221408T Expired - Lifetime DE60221408D1 (de) | 2001-09-12 | 2002-09-12 | Verfahren zur bild- und tonbearbeitung unter verwendung von spracherkennung |
Country Status (6)
Country | Link |
---|---|
US (2) | US7343082B2 (de) |
EP (1) | EP1425736B1 (de) |
AT (1) | ATE368277T1 (de) |
CA (1) | CA2538981C (de) |
DE (1) | DE60221408D1 (de) |
WO (1) | WO2003023765A1 (de) |
Families Citing this family (49)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9286941B2 (en) | 2001-05-04 | 2016-03-15 | Legend3D, Inc. | Image sequence enhancement and motion picture project management system |
US8897596B1 (en) | 2001-05-04 | 2014-11-25 | Legend3D, Inc. | System and method for rapid image sequence depth enhancement with translucent elements |
US8401336B2 (en) | 2001-05-04 | 2013-03-19 | Legend3D, Inc. | System and method for rapid image sequence depth enhancement with augmented computer-generated elements |
US7343082B2 (en) * | 2001-09-12 | 2008-03-11 | Ryshco Media Inc. | Universal guide track |
US7587318B2 (en) * | 2002-09-12 | 2009-09-08 | Broadcom Corporation | Correlating video images of lip movements with audio signals to improve speech recognition |
US8009966B2 (en) | 2002-11-01 | 2011-08-30 | Synchro Arts Limited | Methods and apparatus for use in sound replacement with automatic synchronization to images |
KR20050085344A (ko) * | 2002-12-04 | 2005-08-29 | 코닌클리즈케 필립스 일렉트로닉스 엔.브이. | 신호 동기화 방법 및 시스템 |
US7142250B1 (en) * | 2003-04-05 | 2006-11-28 | Apple Computer, Inc. | Method and apparatus for synchronizing audio and video streams |
WO2004093059A1 (en) * | 2003-04-18 | 2004-10-28 | Unisay Sdn. Bhd. | Phoneme extraction system |
WO2004100128A1 (en) * | 2003-04-18 | 2004-11-18 | Unisay Sdn. Bhd. | System for generating a timed phomeme and visem list |
JP3945778B2 (ja) * | 2004-03-12 | 2007-07-18 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 設定装置、プログラム、記録媒体、及び設定方法 |
GB2424534B (en) * | 2005-03-24 | 2007-09-05 | Zootech Ltd | Authoring audiovisual content |
US20070011012A1 (en) * | 2005-07-11 | 2007-01-11 | Steve Yurick | Method, system, and apparatus for facilitating captioning of multi-media content |
US8060591B1 (en) | 2005-09-01 | 2011-11-15 | Sprint Spectrum L.P. | Automatic delivery of alerts including static and dynamic portions |
US7653418B1 (en) * | 2005-09-28 | 2010-01-26 | Sprint Spectrum L.P. | Automatic rotation through play out of audio-clips in response to detected alert events |
ATE440334T1 (de) * | 2006-02-10 | 2009-09-15 | Harman Becker Automotive Sys | System für sprachgesteuerte auswahl einer audiodatei und verfahren dafür |
US8713191B1 (en) | 2006-11-20 | 2014-04-29 | Sprint Spectrum L.P. | Method and apparatus for establishing a media clip |
US7747290B1 (en) | 2007-01-22 | 2010-06-29 | Sprint Spectrum L.P. | Method and system for demarcating a portion of a media file as a ringtone |
US8179475B2 (en) * | 2007-03-09 | 2012-05-15 | Legend3D, Inc. | Apparatus and method for synchronizing a secondary audio track to the audio track of a video source |
US20080256136A1 (en) * | 2007-04-14 | 2008-10-16 | Jerremy Holland | Techniques and tools for managing attributes of media content |
US20080263433A1 (en) * | 2007-04-14 | 2008-10-23 | Aaron Eppolito | Multiple version merge for media production |
US8751022B2 (en) * | 2007-04-14 | 2014-06-10 | Apple Inc. | Multi-take compositing of digital media assets |
US20080295040A1 (en) * | 2007-05-24 | 2008-11-27 | Microsoft Corporation | Closed captions for real time communication |
TWI341956B (en) * | 2007-05-30 | 2011-05-11 | Delta Electronics Inc | Projection apparatus with function of speech indication and control method thereof for use in the apparatus |
US9390169B2 (en) * | 2008-06-28 | 2016-07-12 | Apple Inc. | Annotation of movies |
US8265450B2 (en) * | 2009-01-16 | 2012-09-11 | Apple Inc. | Capturing and inserting closed captioning data in digital video |
FR2955183B3 (fr) * | 2010-01-11 | 2012-01-13 | Didier Calle | Procede de traitement automatique de donnees numeriques destinees a des doublages ou a des post-synchronisations de videos |
US8572488B2 (en) * | 2010-03-29 | 2013-10-29 | Avid Technology, Inc. | Spot dialog editor |
US8744239B2 (en) | 2010-08-06 | 2014-06-03 | Apple Inc. | Teleprompter tool for voice-over tool |
US8730232B2 (en) | 2011-02-01 | 2014-05-20 | Legend3D, Inc. | Director-style based 2D to 3D movie conversion system and method |
US8621355B2 (en) | 2011-02-02 | 2013-12-31 | Apple Inc. | Automatic synchronization of media clips |
US9241147B2 (en) | 2013-05-01 | 2016-01-19 | Legend3D, Inc. | External depth map transformation method for conversion of two-dimensional images to stereoscopic images |
US9407904B2 (en) | 2013-05-01 | 2016-08-02 | Legend3D, Inc. | Method for creating 3D virtual reality from 2D images |
US9288476B2 (en) | 2011-02-17 | 2016-03-15 | Legend3D, Inc. | System and method for real-time depth modification of stereo images of a virtual reality environment |
US9282321B2 (en) | 2011-02-17 | 2016-03-08 | Legend3D, Inc. | 3D model multi-reviewer system |
US9280905B2 (en) * | 2011-12-12 | 2016-03-08 | Inkling Systems, Inc. | Media outline |
WO2014018652A2 (en) | 2012-07-24 | 2014-01-30 | Adam Polak | Media synchronization |
US9007365B2 (en) | 2012-11-27 | 2015-04-14 | Legend3D, Inc. | Line depth augmentation system and method for conversion of 2D images to 3D images |
US9547937B2 (en) | 2012-11-30 | 2017-01-17 | Legend3D, Inc. | Three-dimensional annotation system and method |
US9007404B2 (en) | 2013-03-15 | 2015-04-14 | Legend3D, Inc. | Tilt-based look around effect image enhancement method |
US9438878B2 (en) | 2013-05-01 | 2016-09-06 | Legend3D, Inc. | Method of converting 2D video to 3D video using 3D object models |
US8719032B1 (en) | 2013-12-11 | 2014-05-06 | Jefferson Audio Video Systems, Inc. | Methods for presenting speech blocks from a plurality of audio input data streams to a user in an interface |
US20160042766A1 (en) * | 2014-08-06 | 2016-02-11 | Echostar Technologies L.L.C. | Custom video content |
GB2553960A (en) | 2015-03-13 | 2018-03-21 | Trint Ltd | Media generating and editing system |
US9609307B1 (en) | 2015-09-17 | 2017-03-28 | Legend3D, Inc. | Method of converting 2D video to 3D video using machine learning |
US10387543B2 (en) * | 2015-10-15 | 2019-08-20 | Vkidz, Inc. | Phoneme-to-grapheme mapping systems and methods |
GB201715753D0 (en) * | 2017-09-28 | 2017-11-15 | Royal Nat Theatre | Caption delivery system |
CN112653916B (zh) * | 2019-10-10 | 2023-08-29 | 腾讯科技(深圳)有限公司 | 一种音视频同步优化的方法及设备 |
US11545134B1 (en) * | 2019-12-10 | 2023-01-03 | Amazon Technologies, Inc. | Multilingual speech translation with adaptive speech synthesis and adaptive physiognomy |
Family Cites Families (33)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE3170907D1 (en) | 1981-01-19 | 1985-07-18 | Richard Welcher Bloomstein | Apparatus and method for creating visual images of lip movements |
GB2101795B (en) | 1981-07-07 | 1985-09-25 | Cross John Lyndon | Dubbing translations of sound tracks on films |
CA1270063A (en) | 1985-05-14 | 1990-06-05 | Kouji Miyao | Translating apparatus |
US5155805A (en) | 1989-05-08 | 1992-10-13 | Apple Computer, Inc. | Method and apparatus for moving control points in displaying digital typeface on raster output devices |
US5159668A (en) | 1989-05-08 | 1992-10-27 | Apple Computer, Inc. | Method and apparatus for manipulating outlines in improving digital typeface on raster output devices |
EP0526064B1 (de) | 1991-08-02 | 1997-09-10 | The Grass Valley Group, Inc. | Bedienerschnittstelle für Videoschnittsystem zur Anzeige und interaktive Steuerung von Videomaterial |
US5434678A (en) | 1993-01-11 | 1995-07-18 | Abecassis; Max | Seamless transmission of non-sequential video segments |
US5481296A (en) | 1993-08-06 | 1996-01-02 | International Business Machines Corporation | Apparatus and method for selectively viewing video information |
JP3356536B2 (ja) | 1994-04-13 | 2002-12-16 | 松下電器産業株式会社 | 機械翻訳装置 |
US5717468A (en) | 1994-12-02 | 1998-02-10 | International Business Machines Corporation | System and method for dynamically recording and displaying comments for a video movie |
JP4078677B2 (ja) | 1995-10-08 | 2008-04-23 | イーサム リサーチ デヴェロップメント カンパニー オブ ザ ヘブライ ユニヴァーシティ オブ エルサレム | 映画のコンピュータ化された自動オーディオビジュアルダビングのための方法 |
JP3454396B2 (ja) | 1995-10-11 | 2003-10-06 | 株式会社日立製作所 | 動画像の変化点検出制御方法とそれに基づく再生停止制御方法およびそれらを用いた動画像の編集システム |
US5732184A (en) | 1995-10-20 | 1998-03-24 | Digital Processing Systems, Inc. | Video and audio cursor video editing system |
US5880788A (en) | 1996-03-25 | 1999-03-09 | Interval Research Corporation | Automated synchronization of video image sequences to new soundtracks |
US6154601A (en) | 1996-04-12 | 2000-11-28 | Hitachi Denshi Kabushiki Kaisha | Method for editing image information with aid of computer and editing system |
US5832171A (en) | 1996-06-05 | 1998-11-03 | Juritech, Inc. | System for creating video of an event with a synchronized transcript |
JPH1074204A (ja) | 1996-06-28 | 1998-03-17 | Toshiba Corp | 機械翻訳方法及び原文・訳文表示方法 |
EP0848850A1 (de) | 1996-07-08 | 1998-06-24 | Régis Dubos | Bild- und tongestützte verfahren und vorrichtung zur synchronisation eines filmes |
US5969716A (en) | 1996-08-06 | 1999-10-19 | Interval Research Corporation | Time-based media processing system |
AU6313498A (en) | 1997-02-26 | 1998-09-18 | Tall Poppy Records Limited | Sound synchronizing |
US6134378A (en) | 1997-04-06 | 2000-10-17 | Sony Corporation | Video signal processing device that facilitates editing by producing control information from detected video signal information |
FR2765354B1 (fr) | 1997-06-25 | 1999-07-30 | Gregoire Parcollet | Systeme de synchronisation du doublage de films |
EP0899737A3 (de) | 1997-08-18 | 1999-08-25 | Tektronix, Inc. | Drehbucherkennung durch Spracherkennung |
DE19740119A1 (de) * | 1997-09-12 | 1999-03-18 | Philips Patentverwaltung | System zum Schneiden digitaler Video- und Audioinformationen |
US6174170B1 (en) * | 1997-10-21 | 2001-01-16 | Sony Corporation | Display of text symbols associated with audio data reproducible from a recording disc |
JPH11162152A (ja) | 1997-11-26 | 1999-06-18 | Victor Co Of Japan Ltd | 歌詞表示制御情報編集装置 |
JPH11289512A (ja) * | 1998-04-03 | 1999-10-19 | Sony Corp | 編集リスト作成装置 |
US6490563B2 (en) * | 1998-08-17 | 2002-12-03 | Microsoft Corporation | Proofreading with text to speech feedback |
IT1314671B1 (it) * | 1998-10-07 | 2002-12-31 | Cselt Centro Studi Lab Telecom | Procedimento e apparecchiatura per l'animazione di un modellosintetizzato di volto umano pilotata da un segnale audio. |
US20010044719A1 (en) * | 1999-07-02 | 2001-11-22 | Mitsubishi Electric Research Laboratories, Inc. | Method and system for recognizing, indexing, and searching acoustic signals |
US7047191B2 (en) * | 2000-03-06 | 2006-05-16 | Rochester Institute Of Technology | Method and system for providing automated captioning for AV signals |
US7085842B2 (en) * | 2001-02-12 | 2006-08-01 | Open Text Corporation | Line navigation conferencing system |
US7343082B2 (en) * | 2001-09-12 | 2008-03-11 | Ryshco Media Inc. | Universal guide track |
-
2001
- 2001-09-12 US US10/067,131 patent/US7343082B2/en not_active Expired - Fee Related
-
2002
- 2002-09-12 DE DE60221408T patent/DE60221408D1/de not_active Expired - Lifetime
- 2002-09-12 EP EP02759989A patent/EP1425736B1/de not_active Expired - Lifetime
- 2002-09-12 CA CA2538981A patent/CA2538981C/en not_active Expired - Fee Related
- 2002-09-12 AT AT02759989T patent/ATE368277T1/de not_active IP Right Cessation
- 2002-09-12 WO PCT/CA2002/001386 patent/WO2003023765A1/en active IP Right Grant
-
2004
- 2004-03-11 US US10/797,576 patent/US20040234250A1/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
ATE368277T1 (de) | 2007-08-15 |
EP1425736B1 (de) | 2007-07-25 |
EP1425736A1 (de) | 2004-06-09 |
CA2538981A1 (en) | 2003-03-20 |
US7343082B2 (en) | 2008-03-11 |
WO2003023765A1 (en) | 2003-03-20 |
US20030049015A1 (en) | 2003-03-13 |
CA2538981C (en) | 2011-07-26 |
US20040234250A1 (en) | 2004-11-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE60221408D1 (de) | Verfahren zur bild- und tonbearbeitung unter verwendung von spracherkennung | |
MX2009009651A (es) | Un metodo y un aparato para procesar una señal de audio. | |
DE69926481D1 (de) | Vorrichtung und verfahren für aufnahme, entwurf und wiedergabe synchronisierter audio- und videodaten unter verwendung von spracherkennung und drehbüchern | |
KR20150057591A (ko) | 동영상파일에 대한 자막데이터 생성방법 및 장치 | |
DE69915455D1 (de) | Verfahren und vorrichtung, um gewünschte video- und audioszenen durch spracherkennung wiederzufinden | |
GB2429889A (en) | Method, system, and program product for measuring audio video synchronization | |
CN107112026A (zh) | 用于智能语音识别和处理的系统、方法和装置 | |
CN105975569A (zh) | 一种语音处理的方法及终端 | |
ATE428221T1 (de) | Verfahren zur automatischen verstarkungseinstellung in einem hírhilfegerat sowie hírhilfegerat | |
EP1596389A3 (de) | System und Verfahren zur hochqualitativen Wiedergabe mit variabeler Geschwindigkeit | |
ATE322065T1 (de) | Verfahren zur verarbeitung von audiodateien und erfassungsvorrichtung zur anwendung davon | |
ATE179827T1 (de) | Verfahren zur veränderung eines sprachsignales mittels grundfrequenzmanipulation | |
DE60128270D1 (de) | Verfahren und System zur Erzeugung von Sprechererkennungsdaten, und Verfahren und System zur Sprechererkennung | |
EP1422668A3 (de) | Gerät und Verfahren zur Kurzfilmgenerierung und -reproduktion | |
WO2004040576A8 (en) | Methods and apparatus for use in sound replacement with automatic synchronization to images | |
WO1998034216A3 (en) | System and method for detecting a recorded voice | |
GB2440384A (en) | Method,system and program product for measuring audio video synchronization using lip and teeth characteristics | |
WO2004095419A3 (en) | System and method for text-to-speech processing in a portable device | |
AU2002238961A1 (en) | Information processing apparatus and method, and program | |
EP1119194A3 (de) | Audio- und Video-Wiedergabeanlage, und Audio- und Video-Wiedergabeverfahren | |
CN104469487B (zh) | 一种场景切换点的检测方法及装置 | |
ATE407411T1 (de) | Verfahren zum bereitstellen von kontoinformation und system zum aufschreiben von diktiertem text | |
EP1441330A3 (de) | Encodier- und/oder Decodierverfahren für digitale Audiosignale, basierend auf Zeit-Frequenzkorrelation und Vorrichtung hierzu | |
DE60318282D1 (de) | Methoden und Vorrichtung zur Verarbeitung von Ausführungsdaten und zur Synthetisierung von Tonsignalen | |
AU2003237231A1 (en) | Method and apparatus for differential compression of speaker models for speaker recognition |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8332 | No legal effect for de |