WO2006113409A3 - Method, system, and program product for measuring audio video synchronization using lip and teeth charateristics - Google Patents
Method, system, and program product for measuring audio video synchronization using lip and teeth charateristics Download PDFInfo
- Publication number
- WO2006113409A3 WO2006113409A3 PCT/US2006/014023 US2006014023W WO2006113409A3 WO 2006113409 A3 WO2006113409 A3 WO 2006113409A3 US 2006014023 W US2006014023 W US 2006014023W WO 2006113409 A3 WO2006113409 A3 WO 2006113409A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- audio
- video
- information
- program product
- video synchronization
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/04—Synchronising
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/60—Analysis of geometric attributes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/233—Processing of audio elementary streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
- H04N21/23418—Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/242—Synchronization processes, e.g. processing of PCR [Program Clock References]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transforming into visible information
- G10L2021/105—Synthesis of the lips movements from speech, e.g. for talking heads
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/44—Receiver circuitry for the reception of television signals according to analogue transmission standards
- H04N5/60—Receiver circuitry for the reception of television signals according to analogue transmission standards for the sound signals
- H04N5/602—Receiver circuitry for the reception of television signals according to analogue transmission standards for the sound signals for digital sound signals
Abstract
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2006235990A AU2006235990A1 (en) | 2005-04-13 | 2006-04-13 | Method, system, and program product for measuring audio video synchronization using lip and teeth charateristics |
CA002566844A CA2566844A1 (en) | 2005-04-13 | 2006-04-13 | Method, system, and program product for measuring audio video synchronization using lip and teeth charateristics |
GB0622592A GB2440384B (en) | 2005-04-13 | 2006-04-13 | Method,system and program product for measuring audio video synchronization using lip and teeth characteristics |
EP06750137A EP1969858A2 (en) | 2004-05-14 | 2006-04-13 | Method, system, and program product for measuring audio video synchronization using lip and teeth charateristics |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2005/012588 WO2005115014A2 (en) | 2004-05-14 | 2005-04-13 | Method, system, and program product for measuring audio video synchronization |
USPCT/US05/12588 | 2005-04-13 | ||
PCT/US2005/041623 WO2007035183A2 (en) | 2005-04-13 | 2005-11-16 | Method, system, and program product for measuring audio video synchronization independent of speaker characteristics |
USPCT/US05/41623 | 2005-11-16 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2006113409A2 WO2006113409A2 (en) | 2006-10-26 |
WO2006113409A3 true WO2006113409A3 (en) | 2007-06-07 |
Family
ID=37115719
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2006/014023 WO2006113409A2 (en) | 2004-05-14 | 2006-04-13 | Method, system, and program product for measuring audio video synchronization using lip and teeth charateristics |
Country Status (3)
Country | Link |
---|---|
CA (1) | CA2566844A1 (en) |
GB (1) | GB2438691A (en) |
WO (1) | WO2006113409A2 (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE102007039603A1 (en) * | 2007-08-22 | 2009-02-26 | Siemens Ag | Method for synchronizing media data streams |
FR3014675A1 (en) * | 2013-12-12 | 2015-06-19 | Oreal | METHOD FOR EVALUATING AT LEAST ONE CLINICAL FACE SIGN |
CN110750152B (en) * | 2019-09-11 | 2023-08-29 | 云知声智能科技股份有限公司 | Man-machine interaction method and system based on lip actions |
CN111081270B (en) * | 2019-12-19 | 2021-06-01 | 大连即时智能科技有限公司 | Real-time audio-driven virtual character mouth shape synchronous control method |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4313135A (en) * | 1980-07-28 | 1982-01-26 | Cooper J Carl | Method and apparatus for preserving or restoring audio to video synchronization |
US4769845A (en) * | 1986-04-10 | 1988-09-06 | Kabushiki Kaisha Carrylab | Method of recognizing speech using a lip image |
US5387943A (en) * | 1992-12-21 | 1995-02-07 | Tektronix, Inc. | Semiautomatic lip sync recovery system |
US5572261A (en) * | 1995-06-07 | 1996-11-05 | Cooper; J. Carl | Automatic audio to video timing measurement device and method |
US5880788A (en) * | 1996-03-25 | 1999-03-09 | Interval Research Corporation | Automated synchronization of video image sequences to new soundtracks |
US5920842A (en) * | 1994-10-12 | 1999-07-06 | Pixel Instruments | Signal synchronization |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4975960A (en) * | 1985-06-03 | 1990-12-04 | Petajan Eric D | Electronic facial tracking and detection system and method and apparatus for automated speech recognition |
US6829018B2 (en) * | 2001-09-17 | 2004-12-07 | Koninklijke Philips Electronics N.V. | Three-dimensional sound creation assisted by visual information |
-
2005
- 2005-11-16 GB GB0622589A patent/GB2438691A/en not_active Withdrawn
-
2006
- 2006-04-13 WO PCT/US2006/014023 patent/WO2006113409A2/en active Application Filing
- 2006-04-13 CA CA002566844A patent/CA2566844A1/en not_active Abandoned
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4313135A (en) * | 1980-07-28 | 1982-01-26 | Cooper J Carl | Method and apparatus for preserving or restoring audio to video synchronization |
US4313135B1 (en) * | 1980-07-28 | 1996-01-02 | J Carl Cooper | Method and apparatus for preserving or restoring audio to video |
US4769845A (en) * | 1986-04-10 | 1988-09-06 | Kabushiki Kaisha Carrylab | Method of recognizing speech using a lip image |
US5387943A (en) * | 1992-12-21 | 1995-02-07 | Tektronix, Inc. | Semiautomatic lip sync recovery system |
US5920842A (en) * | 1994-10-12 | 1999-07-06 | Pixel Instruments | Signal synchronization |
US5572261A (en) * | 1995-06-07 | 1996-11-05 | Cooper; J. Carl | Automatic audio to video timing measurement device and method |
US5880788A (en) * | 1996-03-25 | 1999-03-09 | Interval Research Corporation | Automated synchronization of video image sequences to new soundtracks |
Also Published As
Publication number | Publication date |
---|---|
CA2566844A1 (en) | 2006-10-26 |
GB2438691A (en) | 2007-12-05 |
WO2006113409A2 (en) | 2006-10-26 |
GB0622589D0 (en) | 2007-02-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
GB2440384A (en) | Method,system and program product for measuring audio video synchronization using lip and teeth characteristics | |
GB2429889A (en) | Method, system, and program product for measuring audio video synchronization | |
JP4600828B2 (en) | Document association apparatus and document association method | |
US20190370283A1 (en) | Systems and methods for consolidating recorded content | |
US20140095165A1 (en) | System and method for synchronizing sound and manually transcribed text | |
Vlasenko et al. | Combining frame and turn-level information for robust recognition of emotions within speech | |
Yegnanarayana et al. | Epoch-based analysis of speech signals | |
EP1657721A3 (en) | Music content reproduction apparatus, method thereof and recording apparatus | |
AU2003222001A1 (en) | Method and system for generating a likelihood of cardiovascular disease from analyzing cardiovascular sound signals. | |
AU2003225928A1 (en) | Method for robust voice recognition by analyzing redundant features of source signal | |
MX2021014721A (en) | Systems and methods for machine learning of voice attributes. | |
EP1345210A3 (en) | Speech recognition system, speech recognition method, speech synthesis system, speech synthesis method, and program product | |
TW200721109A (en) | Pronunciation diagnosis device, pronunciation diagnosis method, recording medium, and pronunciation diagnosis program | |
EP1329877A3 (en) | Speech synthesis and decoding | |
WO2010024426A1 (en) | Sound recording device | |
WO2006082868A3 (en) | Method and system for identifying speech sound and non-speech sound in an environment | |
KR101616112B1 (en) | Speaker separation system and method using voice feature vectors | |
JP2007233239A (en) | Method, system, and program for utterance event separation | |
US9240190B2 (en) | Formant based speech reconstruction from noisy signals | |
US20120078625A1 (en) | Waveform analysis of speech | |
WO2006113409A3 (en) | Method, system, and program product for measuring audio video synchronization using lip and teeth charateristics | |
JPH04158397A (en) | Voice quality converting system | |
Sztahó et al. | Automatic classification of emotions in spontaneous speech | |
WO2002079744A3 (en) | Sound characterisation and/or identification based on prosodic listening | |
WO2007095413A3 (en) | Method and apparatus for detecting affects in speech |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 200680021184.3 Country of ref document: CN |
|
ENP | Entry into the national phase |
Ref document number: 0622592 Country of ref document: GB Kind code of ref document: A Free format text: PCT FILING DATE = 20060413 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006235990 Country of ref document: AU Ref document number: 0622592.4 Country of ref document: GB |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2566844 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1432/MUMNP/2006 Country of ref document: IN |
|
WWP | Wipo information: published in national office |
Ref document number: 2006235990 Country of ref document: AU |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
NENP | Non-entry into the national phase |
Ref country code: DE |
|
NENP | Non-entry into the national phase |
Ref country code: RU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006750137 Country of ref document: EP |