DE60221408D1 - Verfahren zur bild- und tonbearbeitung unter verwendung von spracherkennung - Google Patents

Verfahren zur bild- und tonbearbeitung unter verwendung von spracherkennung

Info

Publication number
DE60221408D1
DE60221408D1 DE60221408T DE60221408T DE60221408D1 DE 60221408 D1 DE60221408 D1 DE 60221408D1 DE 60221408 T DE60221408 T DE 60221408T DE 60221408 T DE60221408 T DE 60221408T DE 60221408 D1 DE60221408 D1 DE 60221408D1
Authority
DE
Germany
Prior art keywords
picture
processing method
voice recognition
sound processing
user interface
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60221408T
Other languages
English (en)
Inventor
Jocelyne Cote
Howard Ryshpan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ryshco Media Inc
Original Assignee
Ryshco Media Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ryshco Media Inc filed Critical Ryshco Media Inc
Application granted granted Critical
Publication of DE60221408D1 publication Critical patent/DE60221408D1/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/34Indicating arrangements 
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/02Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
    • G11B27/031Electronic editing of digitised analogue information signals, e.g. audio or video signals
    • G11B27/034Electronic editing of digitised analogue information signals, e.g. audio or video signals on discs
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Studio Circuits (AREA)
  • Television Signal Processing For Recording (AREA)
  • Image Processing (AREA)
  • Management Or Editing Of Information On Record Carriers (AREA)
  • Indexing, Searching, Synchronizing, And The Amount Of Synchronization Travel Of Record Carriers (AREA)
DE60221408T 2001-09-12 2002-09-12 Verfahren zur bild- und tonbearbeitung unter verwendung von spracherkennung Expired - Lifetime DE60221408D1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/067,131 US7343082B2 (en) 2001-09-12 2001-09-12 Universal guide track
PCT/CA2002/001386 WO2003023765A1 (en) 2001-09-12 2002-09-12 Method and device for processing audiovisual data using speech recognition

Publications (1)

Publication Number Publication Date
DE60221408D1 true DE60221408D1 (de) 2007-09-06

Family

ID=22073905

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60221408T Expired - Lifetime DE60221408D1 (de) 2001-09-12 2002-09-12 Verfahren zur bild- und tonbearbeitung unter verwendung von spracherkennung

Country Status (6)

Country Link
US (2) US7343082B2 (de)
EP (1) EP1425736B1 (de)
AT (1) ATE368277T1 (de)
CA (1) CA2538981C (de)
DE (1) DE60221408D1 (de)
WO (1) WO2003023765A1 (de)

Families Citing this family (49)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9286941B2 (en) 2001-05-04 2016-03-15 Legend3D, Inc. Image sequence enhancement and motion picture project management system
US8897596B1 (en) 2001-05-04 2014-11-25 Legend3D, Inc. System and method for rapid image sequence depth enhancement with translucent elements
US8401336B2 (en) 2001-05-04 2013-03-19 Legend3D, Inc. System and method for rapid image sequence depth enhancement with augmented computer-generated elements
US7343082B2 (en) * 2001-09-12 2008-03-11 Ryshco Media Inc. Universal guide track
US7587318B2 (en) * 2002-09-12 2009-09-08 Broadcom Corporation Correlating video images of lip movements with audio signals to improve speech recognition
US8009966B2 (en) 2002-11-01 2011-08-30 Synchro Arts Limited Methods and apparatus for use in sound replacement with automatic synchronization to images
KR20050085344A (ko) * 2002-12-04 2005-08-29 코닌클리즈케 필립스 일렉트로닉스 엔.브이. 신호 동기화 방법 및 시스템
US7142250B1 (en) * 2003-04-05 2006-11-28 Apple Computer, Inc. Method and apparatus for synchronizing audio and video streams
WO2004093059A1 (en) * 2003-04-18 2004-10-28 Unisay Sdn. Bhd. Phoneme extraction system
WO2004100128A1 (en) * 2003-04-18 2004-11-18 Unisay Sdn. Bhd. System for generating a timed phomeme and visem list
JP3945778B2 (ja) * 2004-03-12 2007-07-18 インターナショナル・ビジネス・マシーンズ・コーポレーション 設定装置、プログラム、記録媒体、及び設定方法
GB2424534B (en) * 2005-03-24 2007-09-05 Zootech Ltd Authoring audiovisual content
US20070011012A1 (en) * 2005-07-11 2007-01-11 Steve Yurick Method, system, and apparatus for facilitating captioning of multi-media content
US8060591B1 (en) 2005-09-01 2011-11-15 Sprint Spectrum L.P. Automatic delivery of alerts including static and dynamic portions
US7653418B1 (en) * 2005-09-28 2010-01-26 Sprint Spectrum L.P. Automatic rotation through play out of audio-clips in response to detected alert events
ATE440334T1 (de) * 2006-02-10 2009-09-15 Harman Becker Automotive Sys System für sprachgesteuerte auswahl einer audiodatei und verfahren dafür
US8713191B1 (en) 2006-11-20 2014-04-29 Sprint Spectrum L.P. Method and apparatus for establishing a media clip
US7747290B1 (en) 2007-01-22 2010-06-29 Sprint Spectrum L.P. Method and system for demarcating a portion of a media file as a ringtone
US8179475B2 (en) * 2007-03-09 2012-05-15 Legend3D, Inc. Apparatus and method for synchronizing a secondary audio track to the audio track of a video source
US20080256136A1 (en) * 2007-04-14 2008-10-16 Jerremy Holland Techniques and tools for managing attributes of media content
US20080263433A1 (en) * 2007-04-14 2008-10-23 Aaron Eppolito Multiple version merge for media production
US8751022B2 (en) * 2007-04-14 2014-06-10 Apple Inc. Multi-take compositing of digital media assets
US20080295040A1 (en) * 2007-05-24 2008-11-27 Microsoft Corporation Closed captions for real time communication
TWI341956B (en) * 2007-05-30 2011-05-11 Delta Electronics Inc Projection apparatus with function of speech indication and control method thereof for use in the apparatus
US9390169B2 (en) * 2008-06-28 2016-07-12 Apple Inc. Annotation of movies
US8265450B2 (en) * 2009-01-16 2012-09-11 Apple Inc. Capturing and inserting closed captioning data in digital video
FR2955183B3 (fr) * 2010-01-11 2012-01-13 Didier Calle Procede de traitement automatique de donnees numeriques destinees a des doublages ou a des post-synchronisations de videos
US8572488B2 (en) * 2010-03-29 2013-10-29 Avid Technology, Inc. Spot dialog editor
US8744239B2 (en) 2010-08-06 2014-06-03 Apple Inc. Teleprompter tool for voice-over tool
US8730232B2 (en) 2011-02-01 2014-05-20 Legend3D, Inc. Director-style based 2D to 3D movie conversion system and method
US8621355B2 (en) 2011-02-02 2013-12-31 Apple Inc. Automatic synchronization of media clips
US9241147B2 (en) 2013-05-01 2016-01-19 Legend3D, Inc. External depth map transformation method for conversion of two-dimensional images to stereoscopic images
US9407904B2 (en) 2013-05-01 2016-08-02 Legend3D, Inc. Method for creating 3D virtual reality from 2D images
US9288476B2 (en) 2011-02-17 2016-03-15 Legend3D, Inc. System and method for real-time depth modification of stereo images of a virtual reality environment
US9282321B2 (en) 2011-02-17 2016-03-08 Legend3D, Inc. 3D model multi-reviewer system
US9280905B2 (en) * 2011-12-12 2016-03-08 Inkling Systems, Inc. Media outline
WO2014018652A2 (en) 2012-07-24 2014-01-30 Adam Polak Media synchronization
US9007365B2 (en) 2012-11-27 2015-04-14 Legend3D, Inc. Line depth augmentation system and method for conversion of 2D images to 3D images
US9547937B2 (en) 2012-11-30 2017-01-17 Legend3D, Inc. Three-dimensional annotation system and method
US9007404B2 (en) 2013-03-15 2015-04-14 Legend3D, Inc. Tilt-based look around effect image enhancement method
US9438878B2 (en) 2013-05-01 2016-09-06 Legend3D, Inc. Method of converting 2D video to 3D video using 3D object models
US8719032B1 (en) 2013-12-11 2014-05-06 Jefferson Audio Video Systems, Inc. Methods for presenting speech blocks from a plurality of audio input data streams to a user in an interface
US20160042766A1 (en) * 2014-08-06 2016-02-11 Echostar Technologies L.L.C. Custom video content
GB2553960A (en) 2015-03-13 2018-03-21 Trint Ltd Media generating and editing system
US9609307B1 (en) 2015-09-17 2017-03-28 Legend3D, Inc. Method of converting 2D video to 3D video using machine learning
US10387543B2 (en) * 2015-10-15 2019-08-20 Vkidz, Inc. Phoneme-to-grapheme mapping systems and methods
GB201715753D0 (en) * 2017-09-28 2017-11-15 Royal Nat Theatre Caption delivery system
CN112653916B (zh) * 2019-10-10 2023-08-29 腾讯科技(深圳)有限公司 一种音视频同步优化的方法及设备
US11545134B1 (en) * 2019-12-10 2023-01-03 Amazon Technologies, Inc. Multilingual speech translation with adaptive speech synthesis and adaptive physiognomy

Family Cites Families (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE3170907D1 (en) 1981-01-19 1985-07-18 Richard Welcher Bloomstein Apparatus and method for creating visual images of lip movements
GB2101795B (en) 1981-07-07 1985-09-25 Cross John Lyndon Dubbing translations of sound tracks on films
CA1270063A (en) 1985-05-14 1990-06-05 Kouji Miyao Translating apparatus
US5155805A (en) 1989-05-08 1992-10-13 Apple Computer, Inc. Method and apparatus for moving control points in displaying digital typeface on raster output devices
US5159668A (en) 1989-05-08 1992-10-27 Apple Computer, Inc. Method and apparatus for manipulating outlines in improving digital typeface on raster output devices
EP0526064B1 (de) 1991-08-02 1997-09-10 The Grass Valley Group, Inc. Bedienerschnittstelle für Videoschnittsystem zur Anzeige und interaktive Steuerung von Videomaterial
US5434678A (en) 1993-01-11 1995-07-18 Abecassis; Max Seamless transmission of non-sequential video segments
US5481296A (en) 1993-08-06 1996-01-02 International Business Machines Corporation Apparatus and method for selectively viewing video information
JP3356536B2 (ja) 1994-04-13 2002-12-16 松下電器産業株式会社 機械翻訳装置
US5717468A (en) 1994-12-02 1998-02-10 International Business Machines Corporation System and method for dynamically recording and displaying comments for a video movie
JP4078677B2 (ja) 1995-10-08 2008-04-23 イーサム リサーチ デヴェロップメント カンパニー オブ ザ ヘブライ ユニヴァーシティ オブ エルサレム 映画のコンピュータ化された自動オーディオビジュアルダビングのための方法
JP3454396B2 (ja) 1995-10-11 2003-10-06 株式会社日立製作所 動画像の変化点検出制御方法とそれに基づく再生停止制御方法およびそれらを用いた動画像の編集システム
US5732184A (en) 1995-10-20 1998-03-24 Digital Processing Systems, Inc. Video and audio cursor video editing system
US5880788A (en) 1996-03-25 1999-03-09 Interval Research Corporation Automated synchronization of video image sequences to new soundtracks
US6154601A (en) 1996-04-12 2000-11-28 Hitachi Denshi Kabushiki Kaisha Method for editing image information with aid of computer and editing system
US5832171A (en) 1996-06-05 1998-11-03 Juritech, Inc. System for creating video of an event with a synchronized transcript
JPH1074204A (ja) 1996-06-28 1998-03-17 Toshiba Corp 機械翻訳方法及び原文・訳文表示方法
EP0848850A1 (de) 1996-07-08 1998-06-24 Régis Dubos Bild- und tongestützte verfahren und vorrichtung zur synchronisation eines filmes
US5969716A (en) 1996-08-06 1999-10-19 Interval Research Corporation Time-based media processing system
AU6313498A (en) 1997-02-26 1998-09-18 Tall Poppy Records Limited Sound synchronizing
US6134378A (en) 1997-04-06 2000-10-17 Sony Corporation Video signal processing device that facilitates editing by producing control information from detected video signal information
FR2765354B1 (fr) 1997-06-25 1999-07-30 Gregoire Parcollet Systeme de synchronisation du doublage de films
EP0899737A3 (de) 1997-08-18 1999-08-25 Tektronix, Inc. Drehbucherkennung durch Spracherkennung
DE19740119A1 (de) * 1997-09-12 1999-03-18 Philips Patentverwaltung System zum Schneiden digitaler Video- und Audioinformationen
US6174170B1 (en) * 1997-10-21 2001-01-16 Sony Corporation Display of text symbols associated with audio data reproducible from a recording disc
JPH11162152A (ja) 1997-11-26 1999-06-18 Victor Co Of Japan Ltd 歌詞表示制御情報編集装置
JPH11289512A (ja) * 1998-04-03 1999-10-19 Sony Corp 編集リスト作成装置
US6490563B2 (en) * 1998-08-17 2002-12-03 Microsoft Corporation Proofreading with text to speech feedback
IT1314671B1 (it) * 1998-10-07 2002-12-31 Cselt Centro Studi Lab Telecom Procedimento e apparecchiatura per l'animazione di un modellosintetizzato di volto umano pilotata da un segnale audio.
US20010044719A1 (en) * 1999-07-02 2001-11-22 Mitsubishi Electric Research Laboratories, Inc. Method and system for recognizing, indexing, and searching acoustic signals
US7047191B2 (en) * 2000-03-06 2006-05-16 Rochester Institute Of Technology Method and system for providing automated captioning for AV signals
US7085842B2 (en) * 2001-02-12 2006-08-01 Open Text Corporation Line navigation conferencing system
US7343082B2 (en) * 2001-09-12 2008-03-11 Ryshco Media Inc. Universal guide track

Also Published As

Publication number Publication date
ATE368277T1 (de) 2007-08-15
EP1425736B1 (de) 2007-07-25
EP1425736A1 (de) 2004-06-09
CA2538981A1 (en) 2003-03-20
US7343082B2 (en) 2008-03-11
WO2003023765A1 (en) 2003-03-20
US20030049015A1 (en) 2003-03-13
CA2538981C (en) 2011-07-26
US20040234250A1 (en) 2004-11-25

Similar Documents

Publication Publication Date Title
DE60221408D1 (de) Verfahren zur bild- und tonbearbeitung unter verwendung von spracherkennung
MX2009009651A (es) Un metodo y un aparato para procesar una señal de audio.
DE69926481D1 (de) Vorrichtung und verfahren für aufnahme, entwurf und wiedergabe synchronisierter audio- und videodaten unter verwendung von spracherkennung und drehbüchern
KR20150057591A (ko) 동영상파일에 대한 자막데이터 생성방법 및 장치
DE69915455D1 (de) Verfahren und vorrichtung, um gewünschte video- und audioszenen durch spracherkennung wiederzufinden
GB2429889A (en) Method, system, and program product for measuring audio video synchronization
CN107112026A (zh) 用于智能语音识别和处理的系统、方法和装置
CN105975569A (zh) 一种语音处理的方法及终端
ATE428221T1 (de) Verfahren zur automatischen verstarkungseinstellung in einem hírhilfegerat sowie hírhilfegerat
EP1596389A3 (de) System und Verfahren zur hochqualitativen Wiedergabe mit variabeler Geschwindigkeit
ATE322065T1 (de) Verfahren zur verarbeitung von audiodateien und erfassungsvorrichtung zur anwendung davon
ATE179827T1 (de) Verfahren zur veränderung eines sprachsignales mittels grundfrequenzmanipulation
DE60128270D1 (de) Verfahren und System zur Erzeugung von Sprechererkennungsdaten, und Verfahren und System zur Sprechererkennung
EP1422668A3 (de) Gerät und Verfahren zur Kurzfilmgenerierung und -reproduktion
WO2004040576A8 (en) Methods and apparatus for use in sound replacement with automatic synchronization to images
WO1998034216A3 (en) System and method for detecting a recorded voice
GB2440384A (en) Method,system and program product for measuring audio video synchronization using lip and teeth characteristics
WO2004095419A3 (en) System and method for text-to-speech processing in a portable device
AU2002238961A1 (en) Information processing apparatus and method, and program
EP1119194A3 (de) Audio- und Video-Wiedergabeanlage, und Audio- und Video-Wiedergabeverfahren
CN104469487B (zh) 一种场景切换点的检测方法及装置
ATE407411T1 (de) Verfahren zum bereitstellen von kontoinformation und system zum aufschreiben von diktiertem text
EP1441330A3 (de) Encodier- und/oder Decodierverfahren für digitale Audiosignale, basierend auf Zeit-Frequenzkorrelation und Vorrichtung hierzu
DE60318282D1 (de) Methoden und Vorrichtung zur Verarbeitung von Ausführungsdaten und zur Synthetisierung von Tonsignalen
AU2003237231A1 (en) Method and apparatus for differential compression of speaker models for speaker recognition

Legal Events

Date Code Title Description
8332 No legal effect for de