WO2015023463A1 - Clock synchronization for multi-processor/multi-chipset solution


Info

Publication number
WO2015023463A1
Authority
WO
WIPO (PCT)
Prior art keywords
clock rate
processing unit
reference time
processor
capture
Application number
PCT/US2014/049593
Other languages
French (fr)
Inventor
Min Wang
Tien-Hsin LEE
Shankar Ganesh LAKSHMANASWAMY
Andy Yu
Vikram Singh
Sandeep PADUBIDRI RAMAMURTHY
Yau Mo CHAN
Vasudev Sujir NAYAK
Srinivasan Balasubramanian
Original Assignee
Qualcomm Incorporated
Application filed by Qualcomm Incorporated
Publication of WO2015023463A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60 Network streaming of media packets
    • H04L65/75 Media network packet handling
    • H04L65/762 Media network packet handling at the source
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60 Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client
    • H04N21/63 Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STBs; Communication protocols; Addressing
    • H04N21/643 Communication protocols
    • H04N21/6437 Real-time Transport Protocol [RTP]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04J MULTIPLEX COMMUNICATION
    • H04J3/00 Time-division multiplex systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60 Network streaming of media packets
    • H04L65/65 Network streaming protocols, e.g. real-time transport protocol [RTP] or real-time control protocol [RTCP]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60 Network streaming of media packets
    • H04L65/70 Media network packetisation
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00 Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/28 Timers or timing mechanisms used in protocols
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80 Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85 Assembly of content; Generation of multimedia applications
    • H04N21/854 Content authoring
    • H04N21/8547 Content authoring involving timestamps for synchronizing content
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/04 Synchronising

Definitions

  • Wireless communications systems are widely deployed to provide various types of communication content such as voice, video, video telephony, packet data, messaging, broadcast, and so on.
  • These systems may be multiple-access systems capable of supporting communication with multiple users by sharing the available system resources (e.g., time, frequency, and power).
  • Examples of such multiple-access systems include code-division multiple access (CDMA) systems, time- division multiple access (TDMA) systems, frequency-division multiple access (FDMA) systems, and orthogonal frequency-division multiple access (OFDMA) systems.
  • Media synchronization, including audiovisual (also referred to as audio-video or AV) synchronization, is central to a positive user experience for services that have both audio and video components (e.g., video telephony (VT)).
  • AV synchronization, which is also known as "lip sync," is generally defined as the process of ensuring that the relative delay between audio and video stream captures is maintained when the streams are viewed at an AV receiver. The relative delay is the time difference between an audio frame captured at a microphone and a video frame captured at a camera.
  • Audio and video components are often transmitted in independent Real-time Transport Protocol (RTP) streams. These independent RTP streams may lead to poor synchronization of audio and video packets.
  • Low production-quality movies provide limited AV synchronization. For example, the words of the soundtrack may not match the actor's apparent speech patterns; in some cases, it may appear as though the speaker is not actually saying the words attributed to her. Similarly, poor AV synchronization may occur during real-time communication, such as VT phone calls.
  • The described features generally relate to one or more improved systems, methods, and/or apparatuses for audiovisual synchronization. The described features may include tools and techniques to maintain synchronization of separate media RTP streams. For example, media synchronization may be achieved by utilizing timing information extracted from received RTP or RTP Control Protocol (RTCP) media streams.
  • Timing information of media streams may be captured at the source, e.g., at or near a camera or microphone.
  • Initial RTP timestamps in each media stream are randomly set by the sender in order to fend off attacks on encryption. For example, initial RTP timestamps may be set according to the parameters set forth in IETF Network Working Group Request for Comments (RFC) 3550.
  • An RTCP sender report (SR) may provide for mapping the RTP timestamps of the individual RTP streams to a common "wall clock." In such cases, the RTP timestamps may be used to determine a relative time separation between consecutive frames.
  • Some AV synchronization examples require the RTP timestamps of the received audio and video RTP packets and/or the RTP timestamps and a Network Time Protocol (NTP) timestamp of incoming audio and video RTCP SRs, where the NTP timestamp is the common "wall clock" time at which an RTCP SR is sent. A brief sketch of this timestamp mapping follows.
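
As a concrete illustration of the SR-based mapping, the sketch below converts an RTP timestamp to wall-clock time using the NTP/RTP timestamp pair carried in the most recent RTCP SR. This is a minimal sketch, not part of the disclosure; the function name and the 90 kHz video clock rate are illustrative assumptions.

```python
def rtp_to_wall_clock(rtp_ts: int, sr_rtp_ts: int, sr_ntp_seconds: float,
                      clock_rate_hz: int = 90_000) -> float:
    """Map an RTP timestamp to wall-clock time using an RTCP SR pair.

    rtp_ts         -- RTP timestamp of the media packet (32-bit counter)
    sr_rtp_ts      -- RTP timestamp reported in the last RTCP sender report
    sr_ntp_seconds -- NTP wall-clock time (seconds) from the same SR
    clock_rate_hz  -- media clock rate (90 kHz is typical for video)
    """
    # Elapsed media-clock ticks since the SR, honoring 32-bit wraparound.
    delta_ticks = (rtp_ts - sr_rtp_ts) & 0xFFFFFFFF
    return sr_ntp_seconds + delta_ticks / clock_rate_hz


# A frame timestamped 3000 ticks after the SR maps to 3000/90000 ≈ 33 ms
# after the SR's wall-clock time.
print(rtp_to_wall_clock(rtp_ts=123_456, sr_rtp_ts=120_456, sr_ntp_seconds=1000.0))
# ≈ 1000.0333
```
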
  • In a multi-processor implementation, an audio processing path may be different from a video processing path. For instance, the audio processing path may include audio compression and an audio RTP stack, while the video processing path includes separate video compression and a separate video RTP stack.
  • Such multi-processor implementations may involve separate clocks associated with audio and video, respectively, and the separate clocks may tend to drift from one another. That is, the respective audio and video clock rates may tend to become further out of sync as time passes.
  • Because the clock of one processor may be unable to read the clock of another processor, it may be impossible to sync the clocks to each other directly.
  • Synchronization may instead be performed before displaying, for example, audio and video for a user.
  • A method of media synchronization includes selecting a reference time at a first clock rate, mapping the reference time from the first clock rate to a second clock rate, recording a first timestamp of a first frame capture at the second clock rate, and determining a first capture time of the first frame capture based at least in part on the reference time and the first timestamp.
  • A system for media synchronization includes means for selecting a reference time at a first clock rate, means for mapping the reference time from the first clock rate to a second clock rate, means for recording a first timestamp of a first frame capture at the second clock rate, and means for determining a first capture time of the first frame capture based at least in part on the reference time and the first timestamp.
  • An apparatus for media synchronization includes a processor and memory in electronic communication with the processor. The memory may embody instructions executable by the processor to select a reference time at a first clock rate, map the reference time from the first clock rate to a second clock rate, record a first timestamp of a first frame capture at the second clock rate, and determine a first capture time of the first frame capture based at least in part on the reference time and the first timestamp.
  • A computer-program product for media synchronization includes a non-transitory computer-readable medium storing instructions executable by a processor to select a reference time at a first clock rate, map the reference time from the first clock rate to a second clock rate, record a first timestamp of a first frame capture at the second clock rate, and determine a first capture time of the first frame capture based at least in part on the reference time and the first timestamp. (The four-step flow is sketched in code below.)
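
The four claimed steps can be read as a single arithmetic pipeline. Below is a minimal sketch under an assumed linear clock model; the Clock class, its fields, and the ideal-time parameters are hypothetical scaffolding, not elements of the disclosure.

```python
from dataclasses import dataclass


@dataclass
class Clock:
    """Hypothetical free-running clock of one processing unit."""
    offset_s: float  # offset relative to an ideal shared timeline

    def now(self, ideal_t: float) -> float:
        return ideal_t + self.offset_s


def determine_capture_time(ref_clock: Clock, capture_clock: Clock,
                           ideal_t_ref: float, ideal_t_capture: float) -> float:
    # 1) Select a reference time at the first clock rate.
    reference_time = ref_clock.now(ideal_t_ref)
    # 2) Map the reference time to the second clock rate: note what the
    #    capture clock reads at the instant the reference is taken.
    mapped_reference = capture_clock.now(ideal_t_ref)
    # 3) Record a timestamp of the frame capture at the second clock rate.
    frame_timestamp = capture_clock.now(ideal_t_capture)
    # 4) Express the capture time on the reference clock: reference time
    #    plus the elapsed time observed on the capture clock.
    return reference_time + (frame_timestamp - mapped_reference)


audio = Clock(offset_s=0.0)   # reference (first clock rate)
video = Clock(offset_s=2.5)   # capture clock runs 2.5 s ahead
print(determine_capture_time(audio, video, ideal_t_ref=10.0, ideal_t_capture=10.1))
# ≈ 10.1 -- the video frame's capture time expressed on the audio clock
```
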
  • Various embodiments of the method, system, apparatus, and/or computer-program product described above may also include the features of, steps for, means for, and/or processor-executable instructions for adjusting the first capture time for a first propagation delay for mapping the reference time from the first clock rate to the second clock rate.
  • In some embodiments, the first clock rate and the second clock rate are functions of a common processing unit. Alternatively, the first clock rate may be a function of a first processing unit and the second clock rate may be a function of a second processing unit. In some cases, the first processing unit and the second processing unit are each one of an audio processor or a video processor.
  • Various embodiments of the method, system, apparatus, and/or computer-program product described above may also include the features of, steps for, means for, and/or processor-executable instructions for mapping the reference time from the first clock rate to a third clock rate, recording a second timestamp of a second frame capture at the third clock rate, and determining a second capture time of the second frame capture based at least in part on the reference time and the second timestamp. Additionally or alternatively, embodiments may include features of, steps for, means for, and/or processor-executable instructions for adjusting the second capture time for a second propagation delay for mapping the reference time from the first clock rate to the third clock rate.
  • In some embodiments, the first clock rate, the second clock rate, and the third clock rate are functions of a common processing unit. In other embodiments, the first clock rate is a function of a first processing unit, the second clock rate is a function of a second processing unit, and the third clock rate is a function of a third processing unit. For example, the first processing unit may be a modem, and the second processing unit and the third processing unit may each be one of an audio processor or a video processor.
  • Various embodiments of the method, system, apparatus, and/or computer-program product described above may also include the features of, steps for, means for, and/or processor-executable instructions for updating the reference time periodically. Additionally or alternatively, embodiments may include the features of, steps for, means for, and/or processor-executable instructions for re-mapping the updated reference time after each reference time update.
  • Various embodiments of the method, system, apparatus, and/or computer-program product described above may also include the features of, steps for, means for, and/or processor-executable instructions for updating the reference time non-periodically.
  • Additionally or alternatively, embodiments may include the features of, steps for, means for, and/or processor-executable instructions for re-mapping the updated reference time after each reference time update.
  • Some embodiments of the method, system, apparatus, and/or computer-program product described above may also include the features of, steps for, means for, and/or processor-executable instructions for transmitting the first capture time via a packet switched network. Additionally or alternatively, embodiments may include the features of, steps for, means for, and/or processor-executable instructions for utilizing the first capture time to perform audiovisual synchronization of a video display and an audio speaker.
  • FIGS. 1A and 1B show diagrams of a wireless communications system or systems in accordance with some embodiments.
  • FIGS. 2A and 2B show block diagrams of an example device or devices in accordance with some embodiments.
  • FIG. 3 shows a block diagram of an example communications system in accordance with some embodiments.
  • FIG. 4 shows a block diagram of an example communications system in accordance with some embodiments.
  • FIG. 5 is a flowchart of a method of audiovisual synchronization in accordance with some embodiments.
  • FIG. 6 is a flowchart of a method of audiovisual synchronization in accordance with some embodiments.
  • FIG. 7 is a flowchart of a method of audiovisual synchronization in accordance with some embodiments.
  • FIG. 8 is a flowchart of a method of audiovisual synchronization in accordance with some embodiments.
  • Multi-stream media processes may include media streams captured with respect to different clock rates.
  • For example, multi-processor AV implementations may include different audio and video processing paths: the audio processing path may include audio compression and an audio RTP stack, while the video processing path includes separate video compression and a separate video RTP stack.
  • Such multi-processor implementations may involve separate clocks associated with audio and video, respectively. The separate clocks may tend to drift from one another, becoming further out of sync as time passes. Selecting a reference time of one of the processors to function as a "wall clock," recording frame capture times with respect to the reference time, and transmitting frame capture times in terms of the reference time may aid in AV synchronization at a device where audio and video streams are received.
  • Referring first to FIG. 1A, a diagram illustrates an example of a wireless communications system 100.
  • The system 100 includes base stations (or cells) 105, communication devices 115 and 120, and a core network 130.
  • The base stations 105 may communicate with the communication devices 115 under the control of a base station controller (not shown), which may be part of the core network 130 or the base stations 105 in various embodiments.
  • Base stations 105 may communicate control information and/or user data with the core network 130 through backhaul links 132. The base stations 105 may also communicate, either directly or indirectly, with each other over backhaul links 134, which may be wired or wireless communication links.
  • The system 100 may support operation on multiple carriers (waveform signals of different frequencies). Multi-carrier transmitters can transmit modulated signals simultaneously on the multiple carriers. Each communication link 125 may be a multi-carrier signal modulated according to the various radio technologies described above. Each modulated signal may be sent on a different carrier and may carry control information (e.g., reference signals, control channels, etc.), overhead information, data, etc.
  • The base stations 105 may wirelessly communicate with the devices 115 via one or more base station antennas. Each of the base station 105 sites may provide communication coverage for a respective geographic area 110.
  • In some embodiments, base stations 105 may be referred to as a base transceiver station, a radio base station, an access point, a radio transceiver, a basic service set (BSS), an extended service set (ESS), a NodeB, an eNodeB (eNB), a Home NodeB, a Home eNodeB, or some other suitable terminology.
  • The coverage area 110 for a base station may be divided into sectors making up only a portion of the coverage area (not shown).
  • The system 100 may include base stations 105 of different types (e.g., macro, micro, and/or pico base stations). There may be overlapping coverage areas for different technologies.
  • In some embodiments, the communications device 120 may be a WLAN router, such as a Wi-Fi router, and the device 120 may be in communication with a device 115.
  • In some embodiments, the system 100 is an LTE/LTE-A network. In LTE/LTE-A networks, the terms evolved NodeB (eNB) and user equipment (UE) may be generally used to describe the base stations 105 and devices 115, respectively.
  • The system 100 may be a heterogeneous LTE/LTE-A network in which different types of eNBs provide coverage for various geographical regions. Each eNB 105, for example, may provide communication coverage for a macro cell, a pico cell, a femto cell, and/or other types of cell(s).
  • The core network 130 may communicate with the eNBs 105 via a backhaul 132 (e.g., S1, etc.). The eNBs 105 may also communicate with one another, e.g., directly or indirectly via backhaul links 134 (e.g., X2, etc.) and/or via backhaul links 132 (e.g., through core network 130).
  • The wireless network 100 may support synchronous or asynchronous operation. For synchronous operation, the eNBs may have similar frame timing, and transmissions from different eNBs may be approximately aligned in time. For asynchronous operation, the eNBs may have different frame timing, and transmissions from different eNBs may not be aligned in time. The techniques described herein may be used for either synchronous or asynchronous operations.
  • The UEs 115 are dispersed throughout the wireless network 100, and each UE may be stationary or mobile. Some of the UEs 115 may be connected with media-capture devices, such as cameras and microphones.
  • A UE 115 may also be referred to by those skilled in the art as a mobile station, a subscriber station, a mobile unit, a subscriber unit, a wireless unit, a remote unit, a mobile device, a wireless device, a wireless communications device, a remote device, a mobile subscriber station, an access terminal, a mobile terminal, a wireless terminal, a remote terminal, a handset, a user agent, a mobile client, a client, or some other suitable terminology. A UE 115 may be a cellular phone, a personal digital assistant (PDA), a wireless modem, a wireless communication device, a handheld device, a tablet computer, a laptop computer, a cordless phone, a wireless local loop (WLL) station, or the like, and it may be able to communicate with macro eNBs, pico eNBs, femto eNBs, relays, and the like.
  • The transmission links 125 shown in network 100 may include uplink (UL) transmissions from a mobile device 115 to a base station 105, and/or downlink (DL) transmissions from a base station 105 to a mobile device 115.
  • The downlink transmissions may also be called forward link transmissions, while the uplink transmissions may also be called reverse link transmissions.
  • The transmission links 125 may provide for VT calls or video streaming, or both, between two or more UEs 115.
  • For example, one UE 115 may have a camera and a microphone respectively operating at different clock rates or operated by separate audio and video processors or processing units. In some cases, a transmitting UE 115 may separately transmit audio and video streams, using RTP, to a receiving UE 115, and the receiving UE 115 may need to sync the AV streams.
  • Turning to FIG. 1B, the system 100-a may be an example of one or more aspects of the system 100 of FIG. 1A. The system 100-a may include UEs 115-a and 115-b, which may be examples of UEs 115 of FIG. 1A, and a network 130-a, which may be an example of the network 130 of FIG. 1A.
  • The system 100-a includes media-capture, or AV, devices 150. For example, a camera 150-a and a microphone 150-b may be connected to the UE 115-a. The connection may be wired or wireless.
  • In some embodiments, the AV devices 150 are integral to a UE; for example, the UE 115-b is equipped with an onboard camera and microphone.
  • The UEs 115 may communicate with each other via the network 130-a by way of one or a number of the wireless technologies discussed above. For example, the UE 115-a may communicate via a Wi-Fi connection to the network 130-a, which may communicate with the UE 115-b via LTE.
  • The AV devices 150 may each include separate processing units. For example, a microphone 150-b may include an audio processing unit with an encoder, an RTP stack, and a clock; and a camera 150-a may include a video processing unit with a separate encoder, RTP stack, and clock. Each AV device 150 may capture, respectively, audio and video frames.
  • A reference time may be selected at a first clock rate, and the reference time may be mapped from the first clock rate to a second clock rate. The first and second clock rates may be functions of a common processor, such as a processor associated with a UE 115; or the first and second clock rates may be functions of separate processing units.
  • In some embodiments, a reference time is selected from the clock or clock rate of the audio processing unit at AV device 150-b. The reference time may be mapped to a second clock rate, which may be associated with a video processing unit at AV device 150-a. This mapping may be by way of the UE 115-a, which may be a computer.
  • A video frame capture of the camera 150-a may be timestamped based on the clock of the video processing unit. A video frame capture time may be determined based on the video timestamp and the reference time, which, as described more fully below, may be understood with respect to the audio processing unit. Because mapping the reference time between processing units takes a non-zero propagation time, a frame capture time may be adjusted to account for that propagation time.
  • FIG. 2A shows a block diagram of a device 200 that may be configured for media synchronization in accordance with various embodiments. The device 200 may be an AV device 150-a, which may be an example of an AV device 150 described with reference to FIG. 1B. The device 200 may, additionally or alternatively, be an aspect of a UE 115 described with reference to FIGS. 1A and 1B.
  • The device 200 may include a receiver module 205, a processor module 210, and/or a transmitter module 215. Each of the modules may be in communication with one another. The processor module 210 may be an AV device processing unit, such as an audio or video processing unit. The device 200 may be an aspect of a camera or a microphone, and it may be configured to capture video or audio frames. In some embodiments, the device 200 is itself a processor, and the modules 205, 210, and 215 are aspects, or submodules, of the processor.
  • In some embodiments, a reference time is selected at a separate AV device (or AV processing unit) and mapped to the device 200. The mapped reference time may be received by the receiver module 205.
  • The processor module 210 may be configured to capture an audio or video frame, and it may record a timestamp of the frame capture. The timestamp may be a reading of a clock located at or within the processor module 210.
  • The processor module 210 may be configured to determine a capture time of the frame capture in terms of the received reference time. For example, the capture time may be based on the reference time and the recorded timestamp. The device 200 may be configured to transmit the captured frame and the determined capture time to a user via the transmitter module 215. In some cases, e.g., to account for the delay of mapping the reference time, the determined capture time may be adjusted accordingly.
  • In some embodiments, the device 200 is a video processing unit associated with a camera, and the reference time is mapped from a separate audio processing unit associated with a microphone. In other embodiments, the device 200 is an audio processing unit associated with a microphone.
  • In some embodiments, a reference time is updated and re-mapped from one device to another. For example, a reference time from an AV device may be periodically updated and re-mapped to the device 200 after each update. Alternatively, a reference time from another AV device may be updated in a non-periodic fashion, e.g., each time the device 200 is powered on, or at the direction of a user. In either case, updating and re-mapping the reference time may prevent a clock associated with the processor module 210 from drifting with respect to a clock associated with the reference time. (A drift-correction sketch follows below.)
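
To make the drift-correction idea concrete, here is a minimal sketch; the class and method names are hypothetical, and the 20 ms period mentioned later in this disclosure is used purely as an illustration.

```python
class MappedReference:
    """Track a remote reference clock from the local clock domain."""

    def __init__(self, reference_time: float, local_time_at_mapping: float):
        self.remap(reference_time, local_time_at_mapping)

    def remap(self, reference_time: float, local_time_at_mapping: float) -> None:
        # Re-anchor the mapping. Drift accumulated since the previous
        # update is discarded instead of growing without bound.
        self.reference_time = reference_time
        self.local_time_at_mapping = local_time_at_mapping

    def capture_time(self, local_timestamp: float) -> float:
        # Express a local frame timestamp on the reference clock.
        return self.reference_time + (local_timestamp - self.local_time_at_mapping)
```

Between updates, the mapping error grows roughly as (local drift rate) × (time since the last update), so re-mapping every 20 ms bounds the error to 20 ms worth of drift.
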
  • In some embodiments, the components of the device 200 are, individually or collectively, implemented with one or more application-specific integrated circuits (ASICs) adapted to perform some or all of the applicable functions in hardware. Alternatively, the functions may be performed by one or more processing units (or cores), on one or more integrated circuits. In other embodiments, other types of integrated circuits may be used (e.g., Structured/Platform ASICs, field-programmable gate arrays (FPGAs), and other semi-custom integrated circuits (ICs)). The functions of each unit also may be wholly or partially implemented with instructions embodied in a memory, formatted to be executed by one or more general or application-specific processors.
  • Turning to FIG. 2B, the device 200-a may be an AV device 150-b, which may be an example of the AV devices 150 described with reference to FIGS. 1B and 2A. The device 200-a may also be an example of aspects of a UE 115 described with reference to FIGS. 1A and 1B.
  • The device 200-a may include a receiver module 205-a, a processor module 210-a, and/or a transmitter module 215-a. These modules may be substantially the same as the corresponding modules of FIG. 2A. The processor module 210-a may include an intelligent hardware device, such as a central processing unit (CPU). In some embodiments, the processor module 210-a is an AV device processing unit; for example, it may be an audio processing unit or a video processing unit. Each of the modules, or at least some of them, may be in communication with one another. In some embodiments, the device 200-a is itself a processor, and the modules 205-a, 210-a, and/or 215-a are aspects, or submodules, of the processor.
  • The processor module 210-a may include a frame capture module 230 or a timer module 235, or both. The frame capture module may be an audio frame capture module associated with a microphone, and it may capture frames of audio; or it may be a video frame capture module associated with a camera, and it may capture frames of video. In some embodiments, the timer module 235 is a clock. The frame capture module 230 may timestamp a captured frame with the time of capture by reference to the timer module 235.
  • In some embodiments, a reference time is selected at a first clock rate and mapped to the device 200-a. The first clock rate may be a function of a first processing unit, such as an AV processor, which is external to the device 200-a; or the first clock rate may be a function of an additional timer (not shown) within the device 200-a.
  • The first clock rate may be mapped to a second clock rate, which may be the clock rate of the timer module 235. The mapping may be by way of the receiver module 205-a. The frame capture module 230 may capture a frame and, with reference to the timer module 235, record a timestamp of the frame capture at the second clock rate. The frame capture module 230 may then determine a frame capture time based on the reference time and the timestamp. The determined frame capture time may then be transmitted via the transmitter module 215-a.
  • As with the device 200, the components of the device 200-a may be, individually or collectively, implemented with one or more application-specific integrated circuits (ASICs) adapted to perform some or all of the applicable functions in hardware. Alternatively, the functions may be performed by one or more processing units (or cores), on one or more integrated circuits; other types of integrated circuits may also be used (e.g., Structured/Platform ASICs, FPGAs, and other semi-custom ICs). The functions of each unit also may be wholly or partially implemented with instructions embodied in a memory, formatted to be executed by one or more general or application-specific processors.
  • In some embodiments, the device 200-a includes a memory module 240. The memory module 240 may include random access memory (RAM) and/or read-only memory (ROM). The memory module 240 may also store computer-readable, computer-executable software (SW) code 245 containing instructions configured to, when executed, cause the processor module 210-a to perform various functions related to AV synchronization. Alternatively, the SW code 245 may not be directly executable by the processor module 210-a, but it may be configured to cause a computer, for example, when compiled and executed, to perform the functions described herein.
  • Turning to FIG. 3, a block diagram illustrates a system 300 configured for AV synchronization. The system 300 may be an example of aspects of the systems 100 and 100-a of FIGS. 1A and 1B, and it may include aspects of the devices 200 and 200-a of FIGS. 2A and 2B. The system 300 may include a UE 115-c, a network 130-b, a video processing unit 150-c, and/or an audio processing unit 150-d.
  • The UE 115-c may be an example of the UEs 115 described with reference to FIGS. 1A, 1B, 2A, and/or 2B. The network 130-b may be an example of the network 130 described with reference to FIGS. 1A and 1B. The video processing unit 150-c and the audio processing unit 150-d may be examples of the AV devices 150 described with reference to FIGS. 1B, 2A, and/or 2B.
  • The video processing unit 150-c may include a video frame capture module 230-a and/or a video timer module 235-a. The audio processing unit 150-d may include an audio frame capture module 230-b and/or an audio timer module 235-b. The audio timer module 235-b may be in communication with the video timer module 235-a.
  • The UE 115-c may include a video display 320, a receive modem 325, an audio speaker 330, and/or an AV synchronization module 335. The UE 115-c may receive audio and video streams (e.g., RTP streams) at the receive modem 325 from the audio processing unit 150-d and the video processing unit 150-c via the network 130-b.
  • The UE 115-c may be configured for AV synchronization such that the received audio and video streams are synchronized. This AV synchronization may occur at or within the AV synchronization module 335, which may feed synchronized audio and video streams to the audio speaker 330 and the video display 320, respectively.
  • In some embodiments, a reference time at a clock rate of the audio timer module 235-b is selected. The reference time may then be mapped to a clock rate at the video processing unit 150-c (e.g., to the clock rate of the video timer module 235-a and/or to the video frame capture module 230-a).
  • The video frame capture module 230-a may capture a video frame from the video input 310, which may be associated with a camera. The video processing unit 150-c, through the video frame capture module 230-a, may record a first timestamp of the video frame capture by reference to the clock rate of the video timer module 235-a. The video processing unit 150-c may determine a video capture time based on the reference time and the recorded first timestamp. The determined video capture time may thus be understood by reference to the clock rate of the audio timer module 235-b (e.g., the clock associated with the audio capture). The video processing unit 150-c may then transmit the captured video frame and the video capture time to the UE 115-c via the network 130-b, which, in some embodiments, is a packet switched network.
  • The audio frame capture module 230-b may capture an audio frame from the audio input 305, which may be associated with a microphone. In some cases, the captured audio frame relates to the captured video frame, such that the video and audio frames occurred at the same point in real time; for example, a camera associated with the video input 310 and a microphone associated with the audio input 305 respectively gathered visual and audio data associated with the same event. The audio processing unit 150-d may record a timestamp of the audio frame capture by reference to the audio timer module 235-b (e.g., the first clock rate). The audio processing unit 150-d may then transmit the captured audio frame and the audio frame timestamp to the UE 115-c via the network 130-b.
  • The video display 320 may display video based on the captured video frame, and the audio speaker 330 may play audio based on the captured audio frame. The UE 115-c may perform AV synchronization, e.g., with the AV synchronization module 335, by utilizing the determined video capture time and the audio frame timestamp, both of which are measurements with respect to the first clock rate (e.g., the clock of the audio processing unit 150-d, the audio timer module 235-b). A receiver-side sketch of this synchronization step follows below.
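
As an illustration of what the AV synchronization module might do with these two values, the sketch below delays whichever stream leads so that the captured relative delay is preserved at playout; the function names and the fixed playout delay are assumptions, not taken from the disclosure.

```python
def schedule_playout(video_capture_time: float, audio_capture_time: float,
                     playout_delay_s: float = 0.1) -> tuple[float, float]:
    """Return (audio_delay, video_delay) playout offsets, in seconds,
    that preserve the relative delay measured on the common clock."""
    offset = video_capture_time - audio_capture_time
    if offset >= 0:
        # Video was captured later: render the video frame that much later.
        return playout_delay_s, playout_delay_s + offset
    # Audio was captured later: hold the audio back instead.
    return playout_delay_s - offset, playout_delay_s


# A video frame captured 40 ms after its audio frame is rendered 40 ms later.
print(schedule_playout(10.040, 10.000))  # ≈ (0.1, 0.14)
```
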
  • Clock rates may be functions of a processing unit or units (or a processor or processors) to the extent that the clock rates may be based on clock speeds, timers, processing speeds, processes, frequencies, oscillators, crystals, sampling rates, and the like, of a given processor or processing unit. While clock rates are generally discussed with reference to multiple processing units, those skilled in the art will recognize that the first and second clock rates could be functions of a common processing unit. For example, the video processing unit 150-c and the audio processing unit 150-d may be aspects of a common processing unit. Alternatively, each processing unit may be discrete, and the video processing unit 150-c and the audio processing unit 150-d may be a video processor and an audio processor, respectively.
  • The selected reference time may be updated, either periodically or non-periodically, and re-mapped from the audio processing unit 150-d to the video processing unit 150-c. In some cases, the periodic update occurs every twenty (20) milliseconds. Updating and re-mapping the reference time may prevent the video timer module 235-a (e.g., the clock of the video processing unit 150-c) from drifting with respect to the audio timer module 235-b (e.g., the clock of the audio processing unit 150-d).
  • In some embodiments, a determined capture time T_mc (e.g., the determined capture time of a video frame) may be represented as:

        T_mc = T_m + (t_ac - t_a) + D_d,    (1)

    where T_m represents the reference time (e.g., a time of the audio processing unit 150-d clock); t_a represents the mapped reference time (e.g., the time of the audio processing unit 150-d clock mapped to the video processing unit 150-c); t_ac represents the recorded timestamp of a frame capture (e.g., a video frame capture at the video frame capture module 230-a); and D_d represents the propagation delay associated with mapping the reference time from one processing unit to the other.
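
The following is a direct transcription of equation (1) into code; the numeric values are invented for the example.

```python
def capture_time(T_m: float, t_a: float, t_ac: float, D_d: float) -> float:
    """Equation (1): a frame's capture time expressed on the reference clock.

    T_m  -- reference time (reading of the reference clock, e.g., audio unit)
    t_a  -- the reference time as mapped onto the capturing unit's clock
    t_ac -- the capturing unit's timestamp of the frame capture
    D_d  -- propagation delay of mapping the reference between the units
    """
    return T_m + (t_ac - t_a) + D_d


# The reference reads 100.000 s and arrives at the video unit (whose clock
# reads 250.000 s) after a 2 ms mapping delay; a frame is captured when the
# video clock reads 250.050 s.
print(capture_time(T_m=100.000, t_a=250.000, t_ac=250.050, D_d=0.002))
# ≈ 100.052 -- the frame's capture time on the audio unit's clock
```
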
  • Turning to FIG. 4, a block diagram illustrates a system 400 configured for AV synchronization. The system 400 may be an example of aspects of the systems 100, 100-a, and/or 300 of FIGS. 1A, 1B, and 3, and it may include aspects of the devices 200 and 200-a of FIGS. 2A and 2B. The system 400 may include a UE 115-d, a network 130-c, a video processing unit 150-e, an audio processing unit 150-f, and/or a network communications processing unit 405.
  • The UE 115-d may be an example of the UEs 115 described with reference to FIGS. 1A, 1B, 2A, and/or 2B. The network 130-c may be an example of the network 130 described with reference to FIGS. 1A and 1B. The video processing unit 150-e and the audio processing unit 150-f may be examples of the AV devices 150 described with reference to FIGS. 1B, 2A, and/or 2B.
  • The video processing unit 150-e may include a video frame capture module 230-c and/or a video timer module 235-c. The audio processing unit 150-f may include an audio frame capture module 230-d and/or an audio timer module 235-d. The audio timer module 235-d may be in communication with the video timer module 235-c.
  • The UE 115-d may include a video display 420, a receive modem 425, an audio speaker 430, and/or an AV synchronization module 435. The UE 115-d may receive audio and video streams (e.g., RTP streams) at the receive modem 425 from the audio processing unit 150-f and the video processing unit 150-e via the network 130-c.
  • The UE 115-d may be configured for AV synchronization such that the received audio and video streams are synchronized. This AV synchronization may occur at or within the AV synchronization module 435, which may feed synchronized audio and video streams to the audio speaker 430 and the video display 420, respectively.
  • The network communications processing unit 405 may include a modem module 410 and/or a modem timer module 415. In some embodiments, a reference time at a clock rate of the modem timer module 415 is selected. The reference time may then be mapped to a clock rate at the video processing unit 150-e (e.g., to a clock rate of the video timer module 235-c and/or to the video frame capture module 230-c).
  • The video frame capture module 230-c may capture a video frame from the video input 440, which may be associated with a camera. The video processing unit 150-e, through the video frame capture module 230-c, may record a video timestamp of the video frame capture by reference to the clock rate of the video timer module 235-c. The video processing unit 150-e may determine a video capture time based on the reference time and the recorded video timestamp. The determined video capture time may thus be understood by reference to the clock rate of the modem timer module 415 (e.g., the clock associated with the modem module 410). The video processing unit 150-e may then transmit the captured video frame and the video capture time to the UE 115-d via the network 130-c by way of the modem module 410.
  • Likewise, the reference time may be mapped to the clock rate at the audio processing unit 150-f (e.g., to the clock rate of the audio timer module 235-d and/or to the audio frame capture module 230-d). The audio frame capture module 230-d may capture an audio frame from an audio input 445, which may be associated with a microphone.
  • The audio processing unit 150-f, through the audio frame capture module 230-d, may record an audio timestamp of the audio frame capture by reference to the clock rate of the audio timer module 235-d. Then, the audio processing unit 150-f, through the audio frame capture module 230-d, may determine an audio capture time based on the reference time and the recorded audio timestamp. The determined audio capture time likewise may be understood by reference to the clock rate of the modem timer module 415 (e.g., the clock associated with the modem module 410). The audio processing unit 150-f may then transmit the captured audio frame and the audio capture time to the UE 115-d via the network 130-c by way of the modem module 410. (A sketch of this three-clock arrangement follows below.)
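
A minimal sketch of the three-clock arrangement, in which the modem clock supplies the reference that is mapped to both media units; the class and its fields are hypothetical, and each unit applies equation (1) independently.

```python
from dataclasses import dataclass


@dataclass
class MediaUnit:
    """A capturing unit (audio or video) with the modem reference mapped in."""
    name: str
    mapped_modem_time: float  # modem reference as read on this unit's clock
    mapping_delay_s: float    # propagation delay of the mapping (D_d)

    def capture_time(self, modem_reference: float, local_timestamp: float) -> float:
        # Equation (1) with the modem clock supplying T_m.
        return (modem_reference
                + (local_timestamp - self.mapped_modem_time)
                + self.mapping_delay_s)


modem_reference = 500.000  # selected at the modem timer module
video = MediaUnit("video", mapped_modem_time=80.000, mapping_delay_s=0.002)
audio = MediaUnit("audio", mapped_modem_time=30.000, mapping_delay_s=0.001)

# Both capture times land on the modem clock, so the receiver can compare them.
v = video.capture_time(modem_reference, local_timestamp=80.040)
a = audio.capture_time(modem_reference, local_timestamp=30.041)
print(abs(v - a))  # ≈ 0.0 -- the frames were captured at (nearly) the same instant
```
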
  • Again, while clock rates are generally discussed with reference to multiple processing units, those skilled in the art will recognize that the clock rates could be functions of a common processing unit. For example, the video processing unit 150-e and the audio processing unit 150-f may be aspects of a common processing unit. Alternatively, each processing unit may be discrete, and the video processing unit 150-e and the audio processing unit 150-f may be a video processor and an audio processor, respectively.
  • The video display 420 may display video based on the captured video frame, and the audio speaker 430 may play audio based on the captured audio frame. The UE 115-d may perform AV synchronization, e.g., with the AV synchronization module 435, by utilizing the determined video capture time and the determined audio capture time, both of which are measurements with respect to the modem timer module 415 (e.g., the clock of the network communications processing unit 405).
  • FIG. 5 shows a flowchart of a method 500 of media synchronization in accordance with some embodiments. The method may be implemented using one or more aspects of the systems 100, 100-a, 300, and 400 of FIGS. 1A, 1B, 3, and 4 and the devices 200 and 200-a of FIGS. 2A and 2B.
  • At block 505, the method may include selecting a reference time at a first clock rate. The operations at block 505 may be performed by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, and/or the modem timer module 415 of FIG. 4. The first and second clock rates may be functions of a common processing unit; alternatively, the first and second clock rates may be functions of a first and a second processing unit, respectively. In some cases, the first and second processing units are each an audio or a video processor.
  • At block 510, the method may involve mapping the reference time from the first clock rate to a second clock rate. The operations at block 510 may be implemented by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, the video timer module 235-c of FIG. 4, the audio timer module 235-d of FIG. 4, and/or the modem timer module 415 of FIG. 4.
  • At block 515, the method may involve recording a first timestamp of a first frame capture at the second clock rate. The operations at block 515 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
  • At block 520, the method may include determining a first capture time based at least in part on the reference time and the first timestamp. The operations at block 520 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
  • FIG. 6 depicts a flowchart of a method 600 of media synchronization in accordance with some embodiments. The method may be implemented using one or more aspects of the systems 100, 100-a, 300, and 400 of FIGS. 1A, 1B, 3, and 4 and the devices 200 and 200-a of FIGS. 2A and 2B.
  • At block 605, the method may include selecting a reference time at a first clock rate. The operations at block 605 may be performed by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, and/or the modem timer module 415 of FIG. 4.
  • At block 610, the method may involve mapping the reference time from the first clock rate to a second clock rate. The operations at block 610 may be implemented by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, the video timer module 235-c of FIG. 4, the audio timer module 235-d of FIG. 4, and/or the modem timer module 415 of FIG. 4.
  • At block 615, the method may involve recording a first timestamp of a first frame capture at the second clock rate. The operations at block 615 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
  • At block 620, the method may include determining a first capture time based at least in part on the reference time and the first timestamp. The operations at block 620 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
  • At block 625, the method may include adjusting the first capture time for a first propagation delay for mapping the reference time from the first clock rate to the second clock rate. The operations at block 625 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
  • At block 630, the method may include mapping the reference time from the first clock rate to a third clock rate. The operations at block 630 may be implemented by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, the video timer module 235-c of FIG. 4, the audio timer module 235-d of FIG. 4, and/or the modem timer module 415 of FIG. 4.
  • At block 635, the method may involve recording a second timestamp of a second frame capture at the third clock rate. The operations at block 635 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
  • At block 640, the method may include determining a second capture time of the second frame capture based at least in part on the reference time and the second timestamp. The operations at block 640 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
  • At block 645, the method may involve adjusting the second capture time for a second propagation delay for mapping the reference time from the first clock rate to the third clock rate. The operations at block 645 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
  • The first, second, and third clock rates may be functions of a common processor. Alternatively, the first, second, and/or third clock rates may be functions of a first, second, and/or third processing unit, respectively. In some cases, the first processing unit is a modem and the second and third processing units are audio or video processors.
  • FIG. 7 depicts a flowchart of a method 700 of media synchronization in accordance with some embodiments. The method may be implemented using one or more aspects of the systems 100, 100-a, 300, and 400 of FIGS. 1A, 1B, 3, and 4 and the devices 200 and 200-a of FIGS. 2A and 2B.
  • At block 705, the method may include selecting a reference time at a first clock rate. The operations at block 705 may be performed by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, and/or the modem timer module 415 of FIG. 4. The first and second clock rates may be functions of a common processing unit; alternatively, the first and second clock rates may be functions of a first and a second processing unit, respectively.
  • At block 710, the method may involve mapping the reference time from the first clock rate to a second clock rate. The operations at block 710 may be implemented by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, the video timer module 235-c of FIG. 4, the audio timer module 235-d of FIG. 4, and/or the modem timer module 415 of FIG. 4.
  • At block 715, the method may involve recording a first timestamp of a first frame capture at the second clock rate. The operations at block 715 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
  • At block 720, the method may include determining a first capture time based at least in part on the reference time and the first timestamp. The operations at block 720 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
  • At block 725, the method may include updating the reference time periodically or non-periodically. The operations at block 725 may be performed by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, and/or the modem timer module 415 of FIG. 4. The first and second clock rates may be functions of a common processing unit; alternatively, they may be functions of a first and a second processing unit, respectively.
  • At block 730, the method may involve re-mapping the updated reference time after each reference time update. The operations at block 730 may be implemented by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, the video timer module 235-c of FIG. 4, the audio timer module 235-d of FIG. 4, and/or the modem timer module 415 of FIG. 4.
  • FIG. 8 depicts a flowchart of a method 800 of media synchronization in accordance with some embodiments. The method may be implemented using one or more aspects of the systems 100, 100-a, 300, and 400 of FIGS. 1A, 1B, 3, and 4 and the devices 200 and 200-a of FIGS. 2A and 2B.
  • At block 805, the method may include selecting a reference time at a first clock rate. The operations at block 805 may be performed by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, and/or the modem timer module 415 of FIG. 4. The first and second clock rates may be functions of a common processing unit; alternatively, the first and second clock rates may be functions of a first and a second processing unit, respectively.
  • At block 810, the method may involve mapping the reference time from the first clock rate to a second clock rate. The operations at block 810 may be implemented by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, the video timer module 235-c of FIG. 4, the audio timer module 235-d of FIG. 4, and/or the modem timer module 415 of FIG. 4.
  • At block 815, the method may involve recording a first timestamp of a first frame capture at the second clock rate. The operations at block 815 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
  • At block 820, the method may include determining a first capture time based at least in part on the reference time and the first timestamp. The operations at block 820 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
  • At block 825, the method may include transmitting the first capture time via a packet switched network. The operations at block 825 may be performed by the transmitter modules 215 of FIGS. 2A and 2B, the AV processing units 150 of FIG. 3, and/or the modem module 410 of FIG. 4.
  • At block 830, the method may include utilizing the first capture time to perform audiovisual synchronization of a video display and an audio speaker. The operations at block 830 may be performed by the AV synchronization module 335 of FIG. 3 and/or the AV synchronization module 435 of FIG. 4.
  • A CDMA system may implement a radio technology such as CDMA2000, Universal Terrestrial Radio Access (UTRA), etc. CDMA2000 covers the IS-2000, IS-95, and IS-856 standards. IS-2000 Releases 0 and A are commonly referred to as CDMA2000 1X, 1X, etc. IS-856 (TIA-856) is commonly referred to as CDMA2000 1xEV-DO, High Rate Packet Data (HRPD), etc.
  • UTRA includes Wideband CDMA (WCDMA) and other variants of CDMA.
  • A TDMA system may implement a radio technology such as Global System for Mobile Communications (GSM).
  • An OFDMA system may implement a radio technology such as Ultra Mobile Broadband (UMB), Evolved UTRA (E-UTRA), IEEE 802.11 (Wi-Fi), IEEE 802.16 (WiMAX), IEEE 802.20, Flash-OFDM, etc.
  • UTRA, E-UTRA, UMTS (Universal Mobile Telecommunication System), LTE (Long Term Evolution), LTE-A (LTE-Advanced), and GSM are described in documents from the 3rd Generation Partnership Project (3GPP). CDMA2000 and UMB are described in documents from the 3rd Generation Partnership Project 2 (3GPP2).
  • The techniques described herein may be used for the systems and radio technologies mentioned above, as well as for other systems and radio technologies.
  • Information and signals may be represented using any of a variety of different technologies and techniques. For example, data, instructions, commands, information, signals, bits, symbols, and chips that may be referenced throughout the above description may be represented by voltages, currents, electromagnetic waves, magnetic fields or particles, optical fields or particles, or any combination thereof.
  • The various illustrative blocks and modules described herein may be implemented or performed with a general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), or other programmable logic. A general-purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, multiple microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration.
  • The functions described herein may be implemented in hardware, software executed by a processor, firmware, or any combination thereof. If implemented in software executed by a processor, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium. Other examples and implementations are within the scope and spirit of the disclosure and appended claims. For example, due to the nature of software, functions described above can be implemented using software executed by a processor, hardware, firmware, hardwiring, or combinations of any of these. Features implementing functions may also be physically located at various positions, including being distributed such that portions of functions are implemented at different physical locations.
  • Computer-readable media includes both computer storage media and communication media, including any medium that facilitates transfer of a computer program from one place to another. A storage medium may be any available medium that can be accessed by a general-purpose or special-purpose computer. By way of example, and not limitation, computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code means in the form of instructions or data structures and that can be accessed by a general-purpose or special-purpose computer, or a general-purpose or special-purpose processor. Also, any connection is properly termed a computer-readable medium.
  • Disk and disc include compact disc (CD), laser disc, optical disc, digital versatile disc (DVD), floppy disk and blu-ray disc where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above are also included within the scope of computer-readable media.

Abstract

Methods, systems, and devices are described for media synchronization. Multi-stream media processes may include media streams captured with respect to different clock rates. Multi-processor implementations may involve separate clocks associated with different media streams, such as audio and video, respectively. The separate clocks may tend to drift from one another, becoming further out of sync as time passes. Selecting a reference time of one of the processors to function as a "wall clock," recording frame capture times with respect to the reference time, accounting for propagation delays, and transmitting frame capture times in terms of the reference time may aid in AV synchronization at a device where audio and video streams are received.

Description

CLOCK SYNCHRONIZATION FOR MULTI-PROCESSOR/MULTI-CHIPSET
SOLUTION
CROSS REFERENCES
[0001] The present Application for Patent claims priority to U.S. Patent Application No. 14/262,994 by Wang et al., "Clock Synchronization for Multi-Processor/Multi-Chipset Solution," filed April 28, 2014; and U.S. Provisional Patent Application No. 61/866,924 by Wang et al., entitled "Audio/Video Clock Synchronization for Multi-Processor/Multichipset Solution," filed August 16, 2013; each of which is assigned to the assignee hereof.
BACKGROUND
[0002] The following relates generally to media synchronization, and more specifically to audiovisual synchronization in wireless communications systems. Wireless communications systems are widely deployed to provide various types of communication content such as voice, video, video telephony, packet data, messaging, broadcast, and so on. These systems may be multiple-access systems capable of supporting communication with multiple users by sharing the available system resources (e.g., time, frequency, and power). Examples of such multiple-access systems include code-division multiple access (CDMA) systems, time-division multiple access (TDMA) systems, frequency-division multiple access (FDMA) systems, and orthogonal frequency-division multiple access (OFDMA) systems.
[0003] Media synchronization, including audiovisual (also referred to as audio-video or AV) synchronization, is central to a positive user experience for services that have both audio and video components (e.g., video telephony (VT)). AV synchronization, which is also known as "lip sync," is generally defined as the process of ensuring that the relative delay between audio and video stream captures is maintained when they are viewed at some AV receiver. The relative delay is the time difference between an audio frame captured at a microphone and a video frame captured at a camera.
[0004] Audio and video components are often transmitted in independent Real-time Transport Protocol (RTP) streams. These independent RTP streams may lead to poor synchronization of audio and video packets. Low-production-quality movies illustrate the effect of limited AV synchronization. For example, the words of the soundtrack may not match the actor's apparent speech patterns. In some cases, it may appear as though the speaker is not actually saying the words attributed to her. Similarly poor AV synchronization may occur during real-time communication, such as VT phone calls.
SUMMARY
[0005] The described features generally relate to one or more improved systems, methods, and/or apparatuses for audiovisual synchronization. The described features may include tools and techniques to maintain synchronization of separate media RTP streams. At the receiver, media synchronization may be achieved by utilizing timing information extracted from the received RTP or RTP Control Protocol (RTCP) media streams. For example, timing information of media streams may be captured at the source, e.g., at or near a camera or microphone.
[0006] In some cases, initial RTP timestamps in each media stream (e.g., audio and video streams) are randomly set by a sender in order to fend off attacks on encryption. For example, initial RTP timestamps may be set according to the parameters set forth in the IETF Network Working Group Request for Comments (RFC) 3550. An RTCP sender report (SR) may provide for mapping of RTP timestamps of the individual RTP streams according to a common "wall clock." In such cases, the RTP timestamps may be used to determine a relative time separation between consecutive frames. Some AV synchronization examples require the RTP timestamps of the received audio and video RTP packets and/or the RTP timestamps and a Network Time Protocol (NTP) timestamp of incoming audio and video RTCP SRs, where the NTP timestamp is the common "wall clock" time when an RTCP SR is sent.
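Concretely, the SR carries a paired (NTP, RTP) timestamp per stream, which lets a receiver place any RTP timestamp on the common "wall clock." The following is a minimal sketch of that arithmetic, not code from this disclosure; the function name, the wrap handling, and the sample numbers are illustrative assumptions:

```python
def rtp_to_wallclock(rtp_ts, sr_ntp_seconds, sr_rtp_ts, clock_rate_hz):
    """Wall-clock time (seconds) of an RTP timestamp, given the last SR."""
    # RTP timestamps are 32-bit; mask so a forward difference survives wrap.
    elapsed_ticks = (rtp_ts - sr_rtp_ts) & 0xFFFFFFFF
    return sr_ntp_seconds + elapsed_ticks / clock_rate_hz

# Example: a video packet stamped 3000 ticks after the SR, at a 90 kHz media
# clock, maps to about 33.3 ms after the SR's NTP time.
print(rtp_to_wallclock(123_456 + 3_000, 100.0, 123_456, 90_000))  # 100.0333...
```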
[0007] Moreover, in many cases, including multi-processor AV implementations, an audio processing path may be different from a video processing path. For example, the audio processing path may include audio compression and an audio RTP stack, while the video processing path includes separate video compression and a separate video RTP stack. Such multi-processor implementations may involve separate clocks associated with audio and video, respectively; and the separate clocks may tend to drift from one another. That is, respective audio and video clock rates may tend to become further out of sync as time passes. In some cases, because the clock of one processor may be unable to read the clock of another processor, it may be impossible to sync the clocks to each other. It is therefore beneficial to select a reference time of one of the processors or clock rates to function as a "wall clock," and to transmit frame capture times (e.g., timestamps for each audio and video frame capture) of each processor with respect to the clock of one of the processors. Then media synchronization may be performed before displaying, for example, audio and video for a user.
[0008] In some embodiments, a method of media synchronization includes selecting a reference time at a first clock rate, mapping the reference time from the first clock rate to a second clock rate, recording a first timestamp of a first frame capture at the second clock rate, and determining a first capture time of the first frame capture based at least in part on the reference time and the first timestamp.
[0009] In some embodiments, a system for media synchronization includes means for selecting a reference time at a first clock rate, means for mapping the reference time from the first clock rate to a second clock rate, means for recording a first timestamp of a first frame capture at the second clock rate, and means for determining a first capture time of the first frame capture based at least in part on the reference time and the first timestamp.
[0010] In some embodiments, an apparatus for media synchronization includes a processor and memory in electronic communication with the processor. The memory may embody instructions executable by the processor to select a reference time at a first clock rate, map the reference time from the first clock rate to a second clock rate, record a first timestamp of a first frame capture at the second clock rate, and determine a first capture time of the first frame capture based at least in part on the reference time and the first timestamp.
[0011] In some embodiments, a computer-program product for media synchronization includes a non-transitory computer-readable medium storing instructions executable by a processor to select a reference time at a first clock rate, map the reference time from the first clock rate to a second clock rate, record a first timestamp of a first frame capture at the second clock rate, and determine a first capture time of the first frame capture based at least in part on the reference time and the first timestamp.
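The method, system, apparatus, and computer-program product above all share the same four steps. A worked numeric sketch may make them concrete; the clock rates and tick values below are purely illustrative assumptions, not values from this disclosure:

```python
# Hypothetical two-clock example: select, map, timestamp, determine.
FIRST_RATE_HZ = 32_768     # first clock rate (e.g., a slow reference timer)
SECOND_RATE_HZ = 90_000    # second clock rate (e.g., a media timer)

reference_ticks = 1_000_000        # step 1: reference time at the first clock
mapped_second_ticks = 4_500_000    # step 2: second-clock reading when the
                                   # reference is mapped across

frame_ticks = 4_545_000            # step 3: first timestamp of the first
                                   # frame capture, at the second clock

# Step 4: capture time, based on the reference time and the timestamp.
elapsed_s = (frame_ticks - mapped_second_ticks) / SECOND_RATE_HZ   # 0.5 s
capture_ticks = reference_ticks + elapsed_s * FIRST_RATE_HZ
print(capture_ticks)  # 1016384.0 ticks of the first (reference) clock
```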
[0012] Various embodiments of the method, system, apparatus, and/or computer-program product described above may also include the features of, steps for, means for, and/or processor-executable instructions for adjusting the first capture time for a first propagation delay for mapping the reference time from the first clock rate to the second clock rate. In some embodiments, the first clock rate and the second clock rate are functions of a common processing unit. Alternatively, the first clock rate may be a function of a first processing unit and the second clock rate may be a function of a second processing unit. In some
embodiments, the first processing unit and the second processing unit are each one of an audio processor or a video processor.
[0013] Various embodiments of the method, system, apparatus, and/or computer-program product described above may also include the features of, steps for, means for, and/or processor-executable instructions for mapping the reference time from the first clock rate to a third clock rate, recording a second timestamp of a second frame capture at the third clock rate, and determining a second capture time of the second frame capture based at least in part on the reference time and the second timestamp. Additionally or alternatively, embodiments may include features of, steps for, means for, and/or processor-executable instructions for adjusting the second capture time for a second propagation delay for mapping the reference time from the first clock rate to the third clock rate. The first clock rate, the second clock rate, and the third clock rate may be functions of a common processing unit. Alternatively, in some embodiments, the first clock rate is a function of a first processing unit, the second clock rate is a function of a second processing unit, and the third clock rate is a function of a third processing unit. For example, the first processing unit may be a modem and the second processing unit and the third processing unit may each be one of an audio processor or a video processor.
[0014] Various embodiments of the method, system, apparatus, and/or computer-program product described above may also include the features of, steps for, means for, and/or processor-executable instructions for updating the reference time periodically. Additionally or alternatively, embodiments may include the features of, steps for, means for, and/or processor-executable instructions for re-mapping the updated reference time after each reference time update.
[0015] Various embodiments of the method, system, apparatus, and/or computer-program product described above may also include the features of, steps for, means for, and/or processor-executable instructions for updating the reference time non-periodically.
Additionally or alternatively, embodiments may include the features of, steps for, means for, and/or processor-executable instructions for re-mapping the updated reference time after each reference time update.
[0016] Some embodiments of the method, system, apparatus, and/or computer-program product described above may also include the features of, steps for, means for, and/or processor-executable instructions for transmitting the first capture time via a packet switched network. Additionally or alternatively, embodiments may include the features of, steps for, means for, and/or processor-executable instructions for utilizing the first capture time to perform audiovisual synchronization of a video display and an audio speaker.
[0017] Further scope of the applicability of the described methods and apparatuses will become apparent from the following detailed description, claims, and drawings. The detailed description and specific examples are given by way of illustration only, since various changes and modifications within the spirit and scope of the description will become apparent to those skilled in the art.
BRIEF DESCRIPTION OF THE DRAWINGS
[0018] A further understanding of the nature and advantages of the present invention may be realized by reference to the following drawings. In the appended figures, similar components or features may have the same reference label. Further, various components of the same type may be distinguished by following the reference label by a dash and a second label that distinguishes among the similar components. If only the first reference label is used in the specification, the description is applicable to any one of the similar components having the same first reference label irrespective of the second reference label.
[0019] FIGS. 1A and 1B show diagrams of a wireless communications system or systems in accordance with some embodiments;
[0020] FIGS. 2A and 2B show block diagrams of an example device or devices in accordance with some embodiments;
[0021] FIG. 3 shows a block diagram of an example communications system in accordance with some embodiments;
[0022] FIG. 4 shows a block diagram of an example communications system in accordance with some embodiments; [0023] FIG. 5 is a flowchart of a method of audiovisual synchronization in accordance with some embodiments;
[0024] FIG. 6 is a flowchart of a method of audiovisual synchronization in accordance with some embodiments; [0025] FIG. 7 is a flowchart of a method of audiovisual synchronization in accordance with some embodiments; and
[0026] FIG. 8 is a flowchart of a method of audiovisual synchronization in accordance with some embodiments.
DETAILED DESCRIPTION
[0027] Multi-stream media processes may include media streams captured with respect to different clock rates. For instance, multi-processor AV implementations may include different audio and video processing paths. The audio processing path may include audio compression and an audio RTP stack, while the video processing path includes separate video compression and a separate video RTP stack. Such multi-processor implementations may involve separate clocks associated with audio and video, respectively. The separate clocks may tend to drift from one another, becoming further out of sync as time passes. Selecting a reference time of one of the processors to function as a "wall clock," recording frame capture times with respect to the reference time, and transmitting frame capture times in terms of the reference time may aid in AV synchronization at a device where audio and video streams are received.
[0028] Video telephony (VT) calls and streaming media are often made over and/or associated with wireless communication systems. Referring first to FIG. 1A, a diagram illustrates an example of a wireless communications system 100. The system 100 includes base stations (or cells) 105, communication devices 115 and 120, and a core network 130. The base stations 105 may communicate with the communication devices 115 under the control of a base station controller (not shown), which may be part of the core network 130 or the base stations 105 in various embodiments. Base stations 105 may communicate control information and/or user data with the core network 130 through backhaul links 132. In some embodiments, the base stations 105 communicate, either directly or indirectly, with each other over backhaul links 134, which may be wired or wireless communication links. The system 100 may support operation on multiple carriers (waveform signals of different frequencies). Multi-carrier transmitters can transmit modulated signals simultaneously on the multiple carriers. Each communication link 125 may be a multi-carrier signal modulated according to the various radio technologies described above. Each modulated signal may be sent on a different carrier and may carry control information (e.g., reference signals, control channels, etc.), overhead information, data, etc.
[0029] In some cases, the base stations 105 wirelessly communicate with the devices 115 via one or more base station antennas. Each of the base station 105 sites may provide communication coverage for a respective geographic area 110. In some embodiments, base stations 105 are referred to as a base transceiver station, a radio base station, an access point, a radio transceiver, a basic service set (BSS), an extended service set (ESS), a NodeB, eNodeB (eNB), Home NodeB, a Home eNodeB, or some other suitable terminology. The coverage area 110 for a base station may be divided into sectors making up only a portion of the coverage area (not shown). The system 100 may include base stations 105 of different types (e.g., macro, micro, and/or pico base stations). There may be overlapping coverage areas for different technologies. For example, the communications device 120 may be a WLAN router, such as a Wi-Fi router. The device 120 may be in communication with a device 115.
[0030] In some embodiments, the system 100 is an LTE/LTE-A network. In LTE/LTE-A networks, the terms evolved Node B (eNB) and user equipment (UE) may be generally used to describe the base stations 105 and devices 115, respectively. The system 100 may be a Heterogeneous LTE/LTE-A network in which different types of eNBs provide coverage for various geographical regions. Each eNB 105, for example, may provide communication coverage for a macro cell, a pico cell, a femto cell, and/or other types of cell(s). [0031] The core network 130 may communicate with the eNBs 105 via a backhaul 132 (e.g., S1, etc.). The eNBs 105 may also communicate with one another, e.g., directly or indirectly via backhaul links 134 (e.g., X2, etc.) and/or via backhaul links 132 (e.g., through core network 130). The wireless network 100 may support synchronous or asynchronous operation. For synchronous operation, the eNBs may have similar frame timing, and transmissions from different eNBs may be approximately aligned in time. For asynchronous operation, the eNBs may have different frame timing, and transmissions from different eNBs may not be aligned in time. The techniques described herein may be used for either synchronous or asynchronous operations.
[0032] The UEs 115 are dispersed throughout the wireless network 100, and each UE may be stationary or mobile. Some of the UEs 115 may be connected with media-capture devices, such as cameras and microphones. A UE 115 may also be referred to by those skilled in the art as a mobile station, a subscriber station, a mobile unit, a subscriber unit, a wireless unit, a remote unit, a mobile device, a wireless device, a wireless communications device, a remote device, a mobile subscriber station, an access terminal, a mobile terminal, a wireless terminal, a remote terminal, a handset, a user agent, a mobile client, a client, or some other suitable terminology. A UE 115 may be a cellular phone, a personal digital assistant (PDA), a wireless modem, a wireless communication device, a handheld device, a tablet computer, a laptop computer, a cordless phone, a wireless local loop (WLL) station, or the like. A UE 115 may be able to communicate with macro eNBs, pico eNBs, femto eNBs, relays and the like. [0033] The transmission links 125 shown in network 100 may include uplink (UL) transmissions from a mobile device 115 to a base station 105, and/or downlink (DL) transmissions from a base station 105 to a mobile device 115. The downlink transmissions may also be called forward link transmissions while the uplink transmissions may also be called reverse link transmissions. [0034] In some embodiments, the transmission links 125 may provide for VT calls or video streaming, or both, between two or more UEs 115. For example, one UE 115 may have a camera and a microphone respectively operating at different clock rates or operated by separate audio and video processors or processing units. During a VT call, a transmitting UE 115 may separately transmit audio and video streams, using RTP, to a receiving UE 115. The receiving UE 115 may need to sync the AV streams.
[0035] Turning next to FIG. IB, shown is a system 100-a, which may be an example of one or more aspects of the system 100 of FIG. 1A. The system 100-a may include UEs 115-a and 115-b, which may be examples of UEs 115 of FIG. 1A. The system 100-a may also include a network 130-a, which may be an example of the network 130 of FIG. 1A. In some cases, the system 100-a includes media-capture, or AV, devices 150. For example, a camera 150-a and a microphone 150-b may be connected to the UE 115-a. The connection may be wired or wireless. In some cases, the AV devices 150 are integral to the UE 115-a.
Additionally, in some embodiments, the UE 115-b is equipped with an onboard camera and microphone.
[0036] The UEs 115 may communicate with each other via the network 130-a by way of one or a number of wireless technologies discussed above. For example, the UE 115-a may communicate via a Wi-Fi connection to the network 130-a, which may communicate to the UE 115-b via LTE.
[0037] The AV devices 150 may each include separate processing units. For example, a microphone 150-b may include an audio processing unit with an encoder, an RTP stack, and a clock; and a camera 150-a may include a video processing unit with a separate encoder, RTP stack, and clock. Each AV device 150 may capture, respectively, audio and video frames. A reference time may be selected at a first clock rate and the reference time may be mapped from the first clock rate to a second clock rate. The first and second clock rates may be functions of a common processor, such as a processor associated with a UE 115; or the first and second clock rates may be functions of separate processing units. Thus, in some embodiments, a reference time is selected from the clock or clock rate of the audio processing unit at AV device 150-b. The reference time may be mapped to a second clock rate, which may be associated with a video processing unit at AV device 150-a. This mapping may be by way of the UE 115-a, which may be a computer. [0038] A video frame capture of the camera 150-a may be timestamped based on the clock of the video processing unit. A video frame capture time may be determined based on the video timestamp and the reference time, which, as described more fully below, may be understood with respect to the audio processing unit. In some cases, there may be a propagation delay associated with mapping the reference time from one AV device 150 processing unit to another. A frame capture time may be adjusted to account for this propagation delay.
[0039] Next, FIG. 2A shows a block diagram of a device 200 that may be configured for media synchronization in accordance with various embodiments. The device 200 may be an AV device 150-a, which may be an example of an AV device 150 described with reference to FIG. 1B. The device 200 may, additionally or alternatively, be an aspect of a UE 115 described with reference to FIGS. 1A and 1B. The device 200 may include a receiver module 205, a processor module 210, and/or a transmitter module 215. Each of the modules may be in communication with one another. The processor module 210 may be an AV device processing unit, such as an audio or video processing unit. For example, the device 200 may be an aspect of a camera or a microphone, and it may be configured to capture video or audio frames. In some embodiments, the device 200 is itself a processor, and the modules 205, 210, and 215 are aspects, or submodules, of the processor.
[0040] In some cases, a reference time is selected at a separate AV device (or AV processing unit) and mapped to the device 200. The mapped reference time may be received by the receiver module 205. The processor module 210 may be configured to capture an audio or video frame and it may record a timestamp of the frame capture. The timestamp may be a reading of a clock located at or within the processor module 210. Then, the processor module 210 may be configured to determine a capture time of the frame capture, in terms of the received reference time. For example, the capture time may be based on the reference time and recorded timestamp. The device 200 may be configured to transmit the captured frame and the determined capture time to a user via the transmitter module 215. If the process of mapping the reference time to the device 200 is associated with a propagation delay, the determined capture time may be adjusted accordingly. In some embodiments, the device 200 is a video processing unit associated with a camera; and the reference time is mapped from a separate audio processing unit associated with a microphone. But in other embodiments, the device 200 is an audio processing unit associated with a microphone.
[0041] In some cases, a reference time is updated and re-mapped from one device to another. By way of example, a reference time from an AV device (not shown) is periodically updated and re-mapped to the device 200 after each update. Alternatively, a reference time from another AV device (not shown) is updated in a non-periodic fashion, e.g., the reference time may be updated each time the device 200 is powered on, or at the direction of a user. In either case, updating and re-mapping the reference time may prevent a clock associated with the processor module 210 from drifting with respect to a clock associated with the reference time.
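As a rough sketch of this update-and-re-map behavior, the following hypothetical Python code periodically re-samples a reference clock and re-associates it with a local clock; the class and thread structure are assumptions for illustration, and the two Python clock sources merely stand in for separate hardware timers:

```python
import threading
import time

class ReferenceMapping:
    """Latest association between a reference clock and a local clock."""
    def __init__(self, read_reference, read_local):
        self._read_reference = read_reference
        self._read_local = read_local
        self._lock = threading.Lock()
        self.remap()

    def remap(self):
        """Re-sample both clocks together so drift cannot accumulate
        between one mapping and the next."""
        with self._lock:
            self.reference = self._read_reference()
            self.local = self._read_local()

def periodic_remap(mapping, stop, period_s):
    """Update the reference periodically, re-mapping after each update."""
    while not stop.is_set():
        mapping.remap()
        stop.wait(period_s)

# Illustrative stand-ins for two independent timers.
mapping = ReferenceMapping(time.monotonic, time.perf_counter)
stop = threading.Event()
threading.Thread(target=periodic_remap, args=(mapping, stop, 0.020),
                 daemon=True).start()
```

A non-periodic update (e.g., at power-on or at a user's direction) would simply call mapping.remap() on that event instead of from the timer thread.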
[0042] In some embodiments, the components of the device 200 are, individually or collectively, implemented with one or more application-specific integrated circuits (ASICs) adapted to perform some or all of the applicable functions in hardware. Alternatively, the functions may be performed by one or more processing units (or cores), on one or more integrated circuits. In other embodiments, other types of integrated circuits are used (e.g., Structured/Platform ASICs, field-programmable gate arrays (FPGAs), and other semi-custom integrated circuits (ICs)), which may be programmed in any manner known in the art. The functions of each unit also may be wholly or partially implemented with instructions embodied in a memory, formatted to be executed by one or more general or application-specific processors.
[0043] Turning next to FIG. 2B, a block diagram illustrates an example device configured for AV synchronization in accordance with various embodiments. The device 200-a may be an AV device 150-b, which may be an example of the AV devices 150 described with reference to FIGS. 1B and 2A. The device 200-a may also be an example of aspects of a UE 115 described with reference to FIGS. 1A and 1B. The device 200-a may include a receiver module 205-a, a processor module 210-a, and/or a transmitter module 215-a. These modules may be substantially the same as the corresponding modules of FIG. 2A. The processor module 210-a may include an intelligent hardware device, such as a central processing unit (CPU). In some cases, the processor module 210-a is an AV device processing unit. For example, the processor module 210-a may be an audio processing unit or a video processing unit. Each of the modules may be in communication with one another. In some
embodiments, the device 200-a is itself a processor and the modules 205-a, 210-a, and/or 215-a are aspects, or submodules, of the processor.
[0044] The processor module 210-a may include a frame capture module 230 or a timer module 235, or both. For example, the frame capture module may be an audio frame capture module associated with a microphone, and it may capture frames of audio. Or the frame capture module may be a video frame capture module associated with a camera, and it may capture frames of video. In some embodiments, the timer module 235 is a clock. The frame capture module 230 may timestamp a captured frame with the time of capture by reference to the timer module 235.
[0045] According to some embodiments, a reference time is selected at a first clock rate and mapped to the device 200-a. The first clock rate may be a function of a first processing unit, such as an AV processor, which is external to the device 200-a. Or the first clock rate may be a function of an additional timer (not shown) within the device 200-a. The first clock rate may be mapped to a second clock rate, which may be the clock rate of the timer module 235. The mapping may be by way of the receiver module 205-a. The frame capture module 230 may capture a frame and, with reference to the timer module 235, record a timestamp of the frame capture at the second clock rate. The frame capture module 230 may then determine a frame capture time based on the reference time and the timestamp. The determined frame capture time may then be transmitted via the transmitter module 215-a.
[0046] In some embodiments, the components of the device 200-a are, individually or collectively, implemented with one or more application-specific integrated circuits (ASICs) adapted to perform some or all of the applicable functions in hardware. Alternatively, the functions may be performed by one or more processing units (or cores), on one or more integrated circuits. In other embodiments, other types of integrated circuits are used (e.g., Structured/Platform ASICs, field-programmable gate arrays (FPGAs), and other semi-custom integrated circuits (ICs)), which may be programmed in any manner known in the art. The functions of each unit also may be wholly or partially implemented with instructions embodied in a memory, formatted to be executed by one or more general or application-specific processors.
[0047] In some cases, the device 200-a includes a memory module 240. The memory module 240 may include random access memory (RAM) and/or read-only memory (ROM). In some embodiments, the memory module 240 also stores computer-readable, computer-executable software (SW) code 245 containing instructions configured to, when executed, cause the processor module 210-a to perform various functions related to AV
synchronization, as described herein. In other embodiments, the SW code 245 may not be directly executable by the processor module 210-a; but it may be configured to cause a computer, for example, when compiled and executed, to perform the functions described herein.
[0048] Now, referring to FIG. 3, a block diagram illustrates a system 300 configured for AV synchronization. The system 300 may be an example of aspects of the systems 100 and 100-a of FIGS. 1A and 1B. The system 300 may include aspects of the devices 200 and 200-a of FIGS. 2A and 2B. The system 300 may include a UE 115-c, a network 130-b, a video processing unit 150-c, and/or an audio processing unit 150-d. For example, the UE 115-c may be an example of the UEs 115 described with reference to FIGS. 1A, 1B, 2A, and/or 2B. The network 130-b may be an example of the network 130 described with reference to FIGS. 1A and 1B. The video processing unit 150-c and the audio processing unit 150-d may be examples of the AV devices 150 described with reference to FIGS. 1B, 2A, and/or 2B.
[0049] The video processing unit 150-c may include a video frame capture module 230-a and/or a video timer module 235-a. The audio processing unit 150-d may include an audio frame capture module 230-b and/or an audio timer module 235-b. The audio timer module 235-b may be in communication with the video timer module 235-a.
[0050] The UE 115-c may include a video display 320, a receive modem 325, an audio speaker 330, and/or an AV synchronization module 335. The UE 115-c may receive audio and video streams (e.g., RTP streams) at the receive modem 325 from the audio processing unit 150-d and the video processing unit 150-c via the network 130-b. In some embodiments, UE 115-c is configured for AV synchronization such that the received audio and video streams are synchronized. This AV synchronization may occur at or within the AV synchronization module 335, which may feed synchronized audio and video streams to the audio speaker 330 and the video display 320, respectively.
[0051] By way of example, a reference time at a clock rate of the audio timer module 235-b is selected. The reference time may then be mapped to a clock rate at the video processing unit 150-c (e.g., to the clock rate of the video timer module 235-a and/or to the video frame capture module 230-a). The video frame capture module 230-a may capture a video frame from the video input 310. The video input 310 may be associated with a camera. The video processing unit 150-c, through the video frame capture module 230-a, may record a first timestamp of the video frame capture by reference to the clock rate of the video timer module 235-a. Then, the video processing unit 150-c, through the video frame capture module 230-a, may determine a video capture time based on the reference time and the recorded first timestamp. The determined video capture time may thus be understood by reference to the clock rate of the audio timer module 235-b (e.g., the clock associated with the audio capture). The video processing unit 150-c may then transmit the captured video frame and the video capture time to the UE 115-c via the network 130-b, which, in some embodiments, is a packet switched network. [0052] Concurrently, the audio frame capture module 230-b may capture an audio frame from the audio input 305. The audio input 305 may be associated with a microphone. In some cases, the captured audio frame relates to the captured video frame, such that the video and audio frames occurred at the same point in real time. In other words, a camera associated with the video input 310 and a microphone associated with the audio input 305 respectively gathered visual and audio data associated with the same event. The audio processing unit 150-d, through the audio frame capture module 230-b, may record a timestamp of the audio frame capture by reference to the audio timer module 235-b (e.g., the first clock rate). The audio processing unit 150-d may then transmit the captured audio frame and the audio frame timestamp to the UE 115-c via the network 130-b.
[0053] In accordance with some embodiments, the video display module 320 may display video based on the captured video frame; and the audio speaker 330 may play audio based on the captured audio frame. The UE 115-c may perform AV synchronization, e.g., with the AV synchronization module 335, by utilizing the determined video capture time and the audio frame timestamp, both of which are measurements with respect to the first clock rate (e.g., the clock of the audio processing unit 150-d, the audio timer module 235-b). [0054] As used herein, clock rates may be functions of a processing unit or units (or a processor or processors) to the extent that the clock rates may be based on clock speeds, timers, processing speeds, processes, frequencies, oscillators, crystals, sampling rates, and the like, of a given processor or processing unit. While the discussion of clock rates is generally described with reference to multiple processing units, those skilled in the art will recognize the first and second clock rates could be functions of a common processing unit. For example, the video processing unit 150-c and the audio processing unit 150-d may be aspects of a common processing unit. Or, each processing unit may be discrete, and the video processing unit 150-c and the audio processing unit 150-d may be a video processor and an audio processor, respectively. [0055] In some cases, the selected reference time may be updated, either periodically or non-periodically, and re-mapped from the audio processing unit 150-d to the video processing unit 150-c. For example, the selected reference time may be updated periodically. In some cases, this periodic update occurs every twenty (20) milliseconds. In this way, the video timer module 235-a (e.g., the clock of the video processing unit 150-c) does not drift with respect to the audio timer module 235-b (e.g., the clock of the audio processing unit 150-d). [0056] According to some embodiments, a determined capture time Tmc (e.g., the determined capture time of a video frame) may be represented as:
Tmc = Tm + (tac - ta) + Dd, (1) where Tm represents a reference time (e.g., a time of the audio processing unit 150-d clock), ta represents a mapped reference time (e.g., the time of the audio processing unit 150-d clock mapped to the video processing unit 150-c), tac represents a recorded timestamp of a frame capture (e.g., a video frame capture at the video frame capture module 230-a), and Dd represents a propagation delay associated with mapping the reference time from one processing unit to the other.
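For illustration, Equation 1 can be evaluated directly once all quantities are expressed in a common unit. The sketch below uses milliseconds; the numbers are assumptions rather than values from the disclosure:

```python
def capture_time(T_m, t_a, t_ac, D_d):
    """Equation 1: Tmc = Tm + (tac - ta) + Dd, everything in milliseconds.

    T_m  -- reference time at the reference (e.g., audio) clock
    t_a  -- reference time as mapped onto the capturing (e.g., video) clock
    t_ac -- recorded timestamp of the frame capture at the capturing clock
    D_d  -- propagation delay of the mapping between the processing units
    If the two clocks tick at different rates, (t_ac - t_a) must first be
    converted into the reference clock's time unit.
    """
    return T_m + (t_ac - t_a) + D_d

# The frame was captured 40 ms after the mapped reference arrived, and the
# mapping took 2 ms to propagate between the units.
print(capture_time(T_m=10_000.0, t_a=7_500.0, t_ac=7_540.0, D_d=2.0))  # 10042.0
```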
[0057] Those skilled in the art will recognize that the clock, or clock rate, of either processing unit may be selected as a reference time. Thus, in the described example, the respective roles of the audio and video processing units may be reversed. Furthermore, the skilled artisan will note that the described tools and techniques may apply equally to timestamps associated with RTP and RTCP. Thus, Equation 1 may be applied to either RTP or RTCP, or both. [0058] Next, referring to FIG. 4, a block diagram illustrates a system 400 configured for AV synchronization. The system 400 may be an example of aspects of the systems 100, 100-a, and/or 300 of FIGS. 1A, 1B, and 3. The system 400 may include aspects of the devices 200 and 200-a of FIGS. 2A and 2B. The system 400 may include a UE 115-d, a network 130-c, a video processing unit 150-e, an audio processing unit 150-f, and/or a network communications processing unit 405. For example, the UE 115-d may be an example of the UEs 115 described with reference to FIGS. 1A, 1B, 2A, and/or 2B. The network 130-c may be an example of the network 130 described with reference to FIGS. 1A and 1B. The video processing unit 150-e and the audio processing unit 150-f may be examples of the AV devices 150 described with reference to FIGS. 1B, 2A, and/or 2B. [0059] The video processing unit 150-e may include a video frame capture module 230-c and/or a video timer module 235-c. The audio processing unit 150-f may include an audio frame capture module 230-d and/or an audio timer module 235-d. The audio timer module 235-d may be in communication with the video timer module 235-c. [0060] The UE 115-d may include a video display 420, a receive modem 425, an audio speaker 430, and/or an AV synchronization module 435. The UE 115-d may receive audio and video streams (e.g., RTP streams) at the receive modem 425 from the audio processing unit 150-f and the video processing unit 150-e via the network 130-c. In some embodiments, UE 115-d is configured for AV synchronization such that the received audio and video streams are synchronized. In some embodiments, AV synchronization occurs at or within the AV synchronization module 435, which may feed synchronized audio and video streams to the audio speaker 430 and the video display 420, respectively.
[0061] The network communications processing unit 405 may include a modem module 410 and/or a modem timer module 415.
[0062] By way of example, a reference time at a clock rate of the modem timer module 415 is selected. The reference time may then be mapped to a clock rate at the video processing unit 150-e (e.g., a clock rate of the video timer module 235-c and/or to the video frame capture module 230-c). The video frame capture module 230-c may capture a video frame from the video input 440. The video input 440 may be associated with a camera. The video processing unit 150-e, through the video frame capture module 230-c, may record a video timestamp of the video frame capture by reference to the clock rate of the video timer module 235-c. Then, the video processing unit 150-e, through the video frame capture module 230-c, may determine a video capture time based on the reference time and the recorded video timestamp. The determined video capture time may thus be understood by reference to the clock rate of the modem timer module 415 (e.g., the clock associated with the modem module 410). The video processing unit 150-e may then transmit the captured video frame and the video capture time to the UE 115-d via the network 130-c by way of the modem module 410. [0063] Concurrently, the reference time may be mapped to the clock rate at the audio processing unit 150-f (e.g., the clock rate of the audio timer module 235-d and/or to the audio frame capture module 230-d). The audio frame capture module 230-d may capture an audio frame from an audio input 445. The audio input 445 may be associated with a microphone. The audio processing unit 150-f, through the audio frame capture module 230-d, may record an audio timestamp of the audio frame capture by reference to the clock rate of the audio timer module 235-d. Then, the audio processing unit 150-f, through the audio frame capture module 230-d, may determine an audio capture time based on the reference time and the recorded audio timestamp. The determined audio capture time likewise may be understood by reference to the clock rate of the modem timer module 415 (e.g., the clock associated with the modem module 410). The audio processing unit 150-f may then transmit the captured audio frame and the audio capture time to the UE 115-d via the network 130-c by way of the modem module 410.
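To make this FIG. 4 flow concrete: a single modem-clock reference is mapped to both media paths, so each determined capture time lands on the modem's timeline and the two streams become directly comparable. The sketch below is hypothetical, reusing Equation 1 once per path with illustrative millisecond values:

```python
def capture_time(T_m, t_mapped, t_frame, D_d):
    """Equation 1 applied per media path; all values in milliseconds."""
    return T_m + (t_frame - t_mapped) + D_d

T_m = 50_000.0  # reference time read from the modem timer module

# Video path: reference mapped to the video timer; frame captured 33 ms later.
video_ct = capture_time(T_m, t_mapped=9_100.0, t_frame=9_133.0, D_d=1.5)

# Audio path: the same reference mapped to the audio timer; frame 33 ms later.
audio_ct = capture_time(T_m, t_mapped=4_200.0, t_frame=4_233.0, D_d=0.5)

# Both capture times are on the modem clock, so the receiving UE can compare
# them directly when synchronizing playout.
print(video_ct, audio_ct)  # 50034.5 50033.5
```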
[0064] Again, with reference to FIG. 4, although the discussion of clock rates generally refers to multiple processing units, those skilled in the art will recognize that the first and second clock rates could be functions of a common processing unit. For example, the video processing unit 150-e and the audio processing unit 150-f may be aspects of a common processing unit. Or, each processing unit may be discrete, and the video processing unit 150-e and the audio processing unit 150-f may be a video processor and an audio processor, respectively.
[0065] In accordance with some embodiments, the video display module 420 may display video based on the captured video frame; and the audio speaker 430 may play audio based on the captured audio frame. The UE 115-d may perform AV synchronization, e.g., with the AV synchronization module 435, by utilizing the determined video capture time and the determined audio capture time, both of which are measurements with respect to the modem timer module 415 (e.g., the clock of the network communications processing unit 405). [0066] Next, FIG. 5 shows a flowchart of a method 500 of media synchronization in accordance with some embodiments. The method may be implemented using one or more aspects of the systems 100, 100-a, 300, and 400 of FIGS. 1A, 1B, 3, and 4 and the devices 200 and 200-a of FIGS. 2A and 2B.
[0067] At block 505, the method may include selecting a reference time at a first clock rate. The operations at block 505 may be performed by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, and/or the modem timer module 415 of FIG. 4. The first and second clock rates may be functions of a common processing unit. Alternatively, the first and second clock rates may be functions of a first and second processing unit, respectively. In some cases, the first and second processing units are each an audio or video processor. [0068] At block 510, the method may involve mapping the reference time from the first clock rate to a second clock rate. The operations at block 510 may be implemented by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, the video timer module 235-c of FIG. 4, the audio timer module 235-d of FIG. 4, and/or the modem timer module 415 of FIG. 4.
[0069] At block 515, the method may involve recording a first timestamp of a first frame capture at the second clock rate. The operations at block 515 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
[0070] At block 520, the method may include determining a first capture time based at least in part on the reference time and the first timestamp. The operations at block 520 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
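Blocks 505 through 520 can also be strung together as running code. In the hypothetical sketch below, both "clocks" are derived from time.monotonic purely as stand-ins for separate hardware timers, so it illustrates the data flow rather than real inter-processor mapping:

```python
import time

FIRST_RATE_HZ = 32_768    # first clock rate (reference)
SECOND_RATE_HZ = 90_000   # second clock rate (capture side)

def read_first():
    return time.monotonic() * FIRST_RATE_HZ

def read_second():
    return time.monotonic() * SECOND_RATE_HZ

reference_ticks = read_first()        # block 505: select a reference time
mapped_second_ticks = read_second()   # block 510: map it to the second clock

time.sleep(0.033)                     # stand-in for waiting on a frame
frame_ticks = read_second()           # block 515: timestamp the frame capture

# Block 520: capture time from the reference time and the first timestamp.
elapsed_s = (frame_ticks - mapped_second_ticks) / SECOND_RATE_HZ
capture_ticks = reference_ticks + elapsed_s * FIRST_RATE_HZ
print(capture_ticks)  # capture time expressed at the first clock rate
```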
[0071] FIG. 6 depicts a flowchart of a method 600 of media synchronization in accordance with some embodiments. The method may be implemented using one or more aspects of the systems 100, 100-a, 300, and 400 of FIGS. 1A, 1B, 3, and 4 and the devices 200 and 200-a of FIGS. 2A and 2B.
[0072] At block 605, the method may include selecting a reference time at a first clock rate. The operations at block 605 may be performed by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, and/or the modem timer module 415 of FIG. 4. [0073] At block 610, the method may involve mapping the reference time from the first clock rate to a second clock rate. The operations at block 610 may be implemented by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, the video timer module 235-c of FIG. 4, the audio timer module 235-d of FIG. 4, and/or the modem timer module 415 of FIG. 4. [0074] At block 615, the method may involve recording a first timestamp of a first frame capture at the second clock rate. The operations at block 615 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
[0075] At block 620, the method may include determining a first capture time based at least in part on the reference time and the first timestamp. The operations at block 620 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
[0076] At block 625, the method may include adjusting the first capture time for a first propagation delay for mapping the reference time from the first clock rate to the second clock rate. The operations at block 625 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
[0077] At block 630, the method may include mapping the reference time from the first clock rate to a third clock rate. The operations at block 630 may be implemented by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, the video timer module 235-c of FIG. 4, the audio timer module 235-d of FIG. 4, and/or the modem timer module 415 of FIG. 4.
[0078] At block 635, the method may involve recording a second timestamp of a second frame capture at the third clock rate. The operations at block 635 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
[0079] At block 640, the method may include determining a second capture time of a second frame capture based at least in part on the reference time and the second timestamp. The operations at block 640 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
[0080] At block 645, the method may involve adjusting the second capture time for a second propagation delay for mapping the reference time from the first clock rate to the third clock rate. The operations at block 645 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4. [0081] In the method 600, the first, second, and third clock rates may be functions of a common processor. Alternatively, the first, second, and/or third clock rates may be functions of a first, second, and/or third processing unit, respectively. In some embodiments, the first processing unit is a modem and the second and third processing units are audio or video processors. [0082] Now, referring to FIG. 7, a flowchart depicts a method 700 of media
synchronization in accordance with some embodiments. The method may be implemented using one or more aspects of the systems 100, 100-a, 300, and 400 of FIGS. 1A, 1B, 3, and 4 and the devices 200 and 200-a of FIGS. 2A and 2B.
[0083] At block 705, the method may include selecting a reference time at a first clock rate. The operations at block 705 may be performed by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, and/or the modem timer module 415 of FIG. 4. The first and second clock rates may be functions of a common processing unit. Alternatively, the first and second clock rates may be functions of a first and second processing unit, respectively. [0084] At block 710, the method may involve mapping the reference time from the first clock rate to a second clock rate. The operations at block 710 may be implemented by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, the video timer module 235-c of FIG. 4, the audio timer module 235-d of FIG. 4, and/or the modem timer module 415 of FIG. 4. [0085] At block 715, the method may involve recording a first timestamp of a first frame capture at the second clock rate. The operations at block 715 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
[0086] At block 720, the method may include determining a first capture time based at least in part on the reference time and the first timestamp. The operations at block 720 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
[0087] At block 725, the method may include updating the reference time periodically or non-periodically. The operations at block 725 may be performed by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, and/or the modem timer module 415 of FIG. 4. The first and second clock rates may be functions of a common processing unit. Alternatively, the first and second clock rates may be functions of a first and second processing unit, respectively.
[0088] At block 730, the method may involve re-mapping the updated reference time after each reference time update. The operations at block 730 may be implemented by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, the video timer module 235-c of FIG. 4, the audio timer module 235-d of FIG. 4, and/or the modem timer module 415 of FIG. 4.
[0089] FIG. 8 depicts a flowchart of a method 800 of media synchronization in accordance with some embodiments. The method may be implemented using one or more aspects of the systems 100, 100-a, 300, and 400 of FIGS. 1A, 1B, 3, and 4 and the devices 200 and 200-a of FIGS. 2A and 2B.
[0090] At block 805, the method may include selecting a reference time at a first clock rate. The operations at block 805 may be performed by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, and/or the modem timer module 415 of FIG. 4. The first and second clock rates may be functions of a common processing unit. Alternatively, the first and second clock rates may be functions of a first and second processing unit, respectively.
[0091] At block 810, the method may involve mapping the reference time from the first clock rate to a second clock rate. The operations at block 810 may be implemented by the timer module 235 of FIG. 2B, the video timer module 235-a of FIG. 3, the audio timer module 235-b of FIG. 3, the video timer module 235-c of FIG. 4, the audio timer module 235-d of FIG. 4, and/or the modem timer module 415 of FIG. 4.
[0092] At block 815, the method may involve recording a first timestamp of a first frame capture at the second clock rate. The operations at block 815 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
[0093] At block 820, the method may include determining a first capture time based at least in part on the reference time and the first timestamp. The operations at block 820 may be implemented by the frame capture module 230 of FIG. 2B, the video frame capture module 230-a of FIG. 3, the audio frame capture module 230-b of FIG. 3, the video frame capture module 230-c of FIG. 4, and/or the audio frame capture module 230-d of FIG. 4.
[0094] At block 825, the method may include transmitting the first capture time via a packet switched network. The operations of block 825 may be performed by the transmitter modules 215 of FIGS. 2A and 2B, the AV processing units 150 of FIG. 3, and/or the modem module 410 of FIG. 4.
[0095] At block 830, the method may include utilizing the first capture time to perform audiovisual synchronization of a video display and an audio speaker. The operations of block 830 may be performed by the AV synchronization module 335 of FIG. 3 and/or the AV synchronization module 435 of FIG. 4.
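As a final hypothetical sketch of block 830: because both capture times arrive on the same reference "wall clock," a receiver can compute each path's end-to-end latency and delay the earlier stream before playout. The buffering policy, names, and numbers below are illustrative assumptions, not the disclosed implementation:

```python
def av_sync_delays(video_capture_ms, audio_capture_ms,
                   video_arrival_ms, audio_arrival_ms):
    """Return (video_delay_ms, audio_delay_ms) to insert before playout.

    Capture times share one reference clock; arrival times are local.
    """
    video_latency = video_arrival_ms - video_capture_ms
    audio_latency = audio_arrival_ms - audio_capture_ms
    skew = video_latency - audio_latency  # positive if video lags audio
    if skew > 0:
        return 0.0, skew   # hold audio until the matching video is due
    return -skew, 0.0      # hold video until the matching audio is due

# Video took 120 ms end to end, audio 60 ms, so audio waits 60 ms for lip sync.
print(av_sync_delays(10_000.0, 10_000.0, 10_120.0, 10_060.0))  # (0.0, 60.0)
```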
[0096] Techniques described herein may be used for various wireless communications systems such as CDMA, TDMA, FDMA, OFDMA, SC-FDMA, and other systems. The terms "system" and "network" are often used interchangeably. A CDMA system may implement a radio technology such as CDMA2000, Universal Terrestrial Radio Access (UTRA), etc. CDMA2000 covers IS-2000, IS-95, and IS-856 standards. IS-2000 Releases 0 and A are commonly referred to as CDMA2000 1X, 1X, etc. IS-856 (TIA-856) is commonly referred to as CDMA2000 1xEV-DO, High Rate Packet Data (HRPD), etc. UTRA includes Wideband CDMA (WCDMA) and other variants of CDMA. A TDMA system may implement a radio technology such as Global System for Mobile Communications (GSM). An OFDMA system may implement a radio technology such as Ultra Mobile Broadband (UMB), Evolved UTRA (E-UTRA), IEEE 802.11 (Wi-Fi), IEEE 802.16 (WiMAX), IEEE 802.20, Flash-OFDM, etc. UTRA and E-UTRA are part of Universal Mobile Telecommunication System (UMTS). 3GPP Long Term Evolution (LTE) and LTE-Advanced (LTE-A) are new releases of UMTS that use E-UTRA. UTRA, E-UTRA, UMTS, LTE, LTE-A, and GSM are described in documents from an organization named "3rd Generation Partnership Project" (3GPP). CDMA2000 and UMB are described in documents from an organization named "3rd Generation Partnership Project 2" (3GPP2). The techniques described herein may be used for the systems and radio technologies mentioned above as well as other systems and radio technologies. The description above, however, describes an LTE system for purposes of example, and LTE terminology is used in much of the description below, although the techniques are applicable beyond LTE applications.
[0097] Thus, the description set forth above provides examples, and is not limiting of the scope, applicability, or configuration set forth in the claims. Changes may be made in the function and arrangement of elements discussed without departing from the spirit and scope of the disclosure. Various embodiments may omit, substitute, or add various procedures or components as appropriate. For instance, the methods described may be performed in an order different from that described, and various steps may be added, omitted, or combined. Also, features described with respect to certain embodiments may be combined in other embodiments.
[0098] The detailed description set forth above in connection with the appended drawings describes exemplary embodiments and does not represent the only embodiments that may be implemented or that are within the scope of the claims. The detailed description includes specific details for the purpose of providing an understanding of the described techniques. These techniques, however, may be practiced without these specific details. In some instances, well-known structures and devices are shown in block diagram form in order to avoid obscuring the concepts of the described embodiments.
[0099] Information and signals may be represented using any of a variety of different technologies and techniques. For example, data, instructions, commands, information, signals, bits, symbols, and chips that may be referenced throughout the above description may be represented by voltages, currents, electromagnetic waves, magnetic fields or particles, optical fields or particles, or any combination thereof.
[0100] The various illustrative blocks and modules described in connection with the disclosure herein may be implemented or performed with a general-purpose processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. A general-purpose processor may be a
microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, multiple microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration.
[0101] The functions described herein may be implemented in hardware, software executed by a processor, firmware, or any combination thereof. If implemented in software executed by a processor, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium. Other examples and implementations are within the scope and spirit of the disclosure and appended claims. For example, due to the nature of software, functions described above can be implemented using software executed by a processor, hardware, firmware, hardwiring, or combinations of any of these. Features implementing functions may also be physically located at various positions, including being distributed such that portions of functions are implemented at different physical locations. Also, as used herein, including in the claims, "or" as used in a list of items prefaced by "at least one of" indicates a disjunctive list such that, for example, a list of "at least one of A, B, or C" means A or B or C or AB or AC or BC or ABC (i.e., A and B and C).
[0102] Computer-readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another. A storage medium may be any available medium that can be accessed by a general purpose or special purpose computer. By way of example, and not limitation, computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code means in the form of instructions or data structures and that can be accessed by a general-purpose or special-purpose computer, or a general-purpose or special-purpose processor. Also, any connection is properly termed a computer-readable medium. For example, if the software is transmitted from a website, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, digital subscriber line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of medium. Disk and disc, as used herein, include compact disc (CD), laser disc, optical disc, digital versatile disc (DVD), floppy disk, and Blu-ray disc, where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above are also included within the scope of computer-readable media.
[0103] The previous description of the disclosure is provided to enable a person skilled in the art to make or use the disclosure. Various modifications to the disclosure will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other variations without departing from the spirit or scope of the disclosure. Throughout this disclosure the term "example" or "exemplary" indicates an example or instance and does not imply or require any preference for the noted example. Thus, the disclosure is not to be limited to the examples and designs described herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.
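Before turning to the claims, one further hedged illustration: the periodic reference time update and re-mapping recited below (e.g., claims 11, 12, 43, and 44) might take the form of a simple loop that re-latches the reference time and the dependent-domain tick count on each update. The clock-read stand-ins and all identifiers here are hypothetical, not part of the disclosed design.

#include <stdint.h>
#include <stdio.h>

/* Latest reference time (first-clock domain) paired with the dependent
 * domain's tick count latched at the moment of re-mapping. */
typedef struct {
    uint64_t ref_usec;
    uint64_t ticks_at_ref;
} mapping_t;

/* Stand-ins for platform clock reads (assumed, not from this disclosure):
 * each call advances by 1 s of the first clock / 48000 dependent ticks. */
static uint64_t read_first_clock_usec(void)   { static uint64_t t; return t += 1000000; }
static uint64_t read_second_clock_ticks(void) { static uint64_t t; return t += 48000; }

/* Re-map: select a fresh reference time and latch the dependent ticks. */
static void remap(mapping_t *m)
{
    m->ref_usec     = read_first_clock_usec();
    m->ticks_at_ref = read_second_clock_ticks();
}

int main(void)
{
    mapping_t m = {0};
    for (int update = 0; update < 3; ++update) {
        remap(&m);  /* re-map after each reference time update */
        printf("update %d: ref=%llu us, ticks=%llu\n", update,
               (unsigned long long)m.ref_usec,
               (unsigned long long)m.ticks_at_ref);
    }
    return 0;
}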

Claims

What is claimed is:
1. A method of media synchronization, comprising:
selecting a reference time at a first clock rate;
mapping the reference time from the first clock rate to a second clock rate;
recording a first timestamp of a first frame capture at the second clock rate; and
determining a first capture time of the first frame capture based at least in part on the reference time and the first timestamp.
2. The method of claim 1, further comprising:
adjusting the first capture time for a first propagation delay for mapping the reference time from the first clock rate to the second clock rate.
3. The method of claim 1, wherein the first clock rate and the second clock rate are functions of a common processing unit.
4. The method of claim 1, wherein the first clock rate is a function of a first processing unit and the second clock rate is a function of a second processing unit.
5. The method of claim 4, wherein the first processing unit and the second processing unit each comprise one of an audio processor or a video processor.
6. The method of claim 1, further comprising:
mapping the reference time from the first clock rate to a third clock rate;
recording a second timestamp of a second frame capture at the third clock rate; and
determining a second capture time of the second frame capture based at least in part on the reference time and the second timestamp.
7. The method of claim 6, further comprising:
adjusting the second capture time for a second propagation delay for mapping the reference time from the first clock rate to the third clock rate.
8. The method of claim 6, wherein the first clock rate, the second clock rate, and the third clock rate are functions of a common processing unit.
9. The method of claim 6, wherein the first clock rate is a function of a first processing unit, the second clock rate is a function of a second processing unit, and the third clock rate is a function of a third processing unit.
10. The method of claim 9, wherein the first processing unit comprises a modem and the second processing unit and the third processing unit each comprise one of an audio processor or a video processor.
11. The method of claim 1, further comprising:
updating the reference time periodically.
12. The method of claim 11, further comprising:
re-mapping the updated reference time after each reference time update.
13. The method of claim 1, further comprising:
updating the reference time non-periodically.
14. The method of claim 13, further comprising:
re-mapping the updated reference time after each reference time update.
15. The method of claim 1, further comprising:
transmitting the first capture time via a packet switched network.
16. The method of claim 1, further comprising:
utilizing the first capture time to perform audiovisual synchronization of a video display and an audio speaker.
17. A system for media synchronization, comprising:
means for selecting a reference time at a first clock rate;
means for mapping the reference time from the first clock rate to a second clock rate;
means for recording a first timestamp of a first frame capture at the second clock rate; and
means for determining a first capture time of the first frame capture based at least in part on the reference time and the first timestamp.
18. The system of claim 17, further comprising:
means for adjusting the first capture time for a first propagation delay for mapping the reference time from the first clock rate to the second clock rate.
19. The system of claim 17, wherein the first clock rate and the second clock rate are functions of a common processing unit.
20. The system of claim 17, wherein the first clock rate is a function of a first processing unit and the second clock rate is a function of a second processing unit.
21. The system of claim 20, wherein the first processing unit and the second processing unit each comprise one of an audio processor or a video processor.
22. The system of claim 17, further comprising:
means for mapping the reference time from the first clock rate to a third clock rate;
means for recording a second timestamp of a second frame capture at the third clock rate; and
means for determining a second capture time of the second frame capture based at least in part on the reference time and the second timestamp.
23. The system of claim 22, further comprising:
means for adjusting the second capture time for a second propagation delay for mapping the reference time from the first clock rate to the third clock rate.
24. The system of claim 22, wherein the first clock rate, the second clock rate, and the third clock rate are functions of a common processing unit.
25. The system of claim 22, wherein the first clock rate is a function of a first processing unit, the second clock rate is a function of a second processing unit, and the third clock rate is a function of a third processing unit.
26. The system of claim 25, wherein the first processing unit comprises a modem and the second processing unit and the third processing unit each comprise one of an audio processor or a video processor.
27. The system of claim 17, further comprising:
means for updating the reference time periodically.
28. The system of claim 27, further comprising:
means for re-mapping the updated reference time after each reference time update.
29. The system of claim 17, further comprising:
means for updating the reference time non-periodically.
30. The system of claim 29, further comprising:
means for re-mapping the updated reference time after each reference time update.
31. The system of claim 17, further comprising:
means for transmitting the first capture time via a packet switched network.
32. The system of claim 17, further comprising:
means for utilizing the first capture time to perform audiovisual synchronization of a video display and an audio speaker.
33. An apparatus for media synchronization, comprising:
a processor;
memory in electronic communication with the processor, the memory embodying instructions, the instructions being executable by the processor to:
select a reference time at a first clock rate;
map the reference time from the first clock rate to a second clock rate;
record a first timestamp of a first frame capture at the second clock rate; and
determine a first capture time of the first frame capture based at least in part on the reference time and the first timestamp.
34. The apparatus of claim 33, wherein the instructions are executable by the processor to:
adjust the first capture time for a first propagation delay for mapping the reference time from the first clock rate to the second clock rate.
35. The apparatus of claim 33, wherein the first clock rate and the second clock rate are functions of a common processing unit.
36. The apparatus of claim 33, wherein the first clock rate is a function of a first processing unit and the second clock rate is a function of a second processing unit.
37. The apparatus of claim 36, wherein the first processing unit and the second processing unit each comprise one of an audio processor or a video processor.
38. The apparatus of claim 33, wherein the instructions are executable by the processor to:
map the reference time from the first clock rate to a third clock rate;
record a second timestamp of a second frame capture at the third clock rate; and
determine a second capture time of the second frame capture based at least in part on the reference time and the second timestamp.
39. The apparatus of claim 38, wherein the instructions are executable by the processor to:
adjust the second capture time for a second propagation delay for mapping the reference time from the first clock rate to the third clock rate.
40. The apparatus of claim 38, wherein the first clock rate, the second clock rate, and the third clock rate are functions of a common processing unit.
41. The apparatus of claim 38, wherein the first clock rate is a function of a first processing unit, the second clock rate is a function of a second processing unit, and the third clock rate is a function of a third processing unit.
42. The apparatus of claim 41, wherein the first processing unit comprises a modem and the second processing unit and the third processing unit each comprise one of an audio processor or a video processor.
43. The apparatus of claim 33, wherein the instructions are executable by the processor to:
update the reference time periodically.
44. The apparatus of claim 43, wherein the instructions are executable by the processor to:
re-map the updated reference time after each reference time update.
45. The apparatus of claim 33, wherein the instructions are executable by the processor to:
update the reference time non-periodically.
46. The apparatus of claim 45, wherein the instructions are executable by the processor to:
re-map the updated reference time after each reference time update.
47. The apparatus of claim 33, wherein the instructions are executable by the processor to:
transmit the first capture time via a packet switched network.
48. The apparatus of claim 33, wherein the instructions are executable by the processor to:
utilize the first capture time to perform audiovisual synchronization of a video display and an audio speaker.
49. A computer-program product for media synchronization, comprising a non-transitory computer-readable medium storing instructions executable by a processor to:
select a reference time at a first clock rate;
map the reference time from the first clock rate to a second clock rate;
record a first timestamp of a first frame capture at the second clock rate; and
determine a first capture time of the first frame capture based at least in part on the reference time and the first timestamp.
50. The computer-program product of claim 49, wherein the instructions are executable by the processor to:
adjust the first capture time for a first propagation delay for mapping the reference time from the first clock rate to the second clock rate.
51. The computer-program product of claim 49, wherein the first clock rate and the second clock rate are functions of a common processing unit.
52. The computer-program product of claim 49, wherein the first clock rate is a function of a first processing unit and the second clock rate is a function of a second processing unit.
53. The computer-program product of claim 52, wherein the first processing unit and the second processing unit each comprise one of an audio processor or a video processor.
54. The computer-program product of claim 49, wherein the instructions are executable by the processor to:
map the reference time from the first clock rate to a third clock rate;
record a second timestamp of a second frame capture at the third clock rate; and
determine a second capture time of the second frame capture based at least in part on the reference time and the second timestamp.
55. The computer-program product of claim 54, wherein the instructions are executable by the processor to:
adjust the second capture time for a second propagation delay for mapping the reference time from the first clock rate to the third clock rate.
56. The computer-program product of claim 54, wherein the first clock rate, the second clock rate, and the third clock rate are functions of a common processing unit.
57. The computer-program product of claim 54, wherein the first clock rate is a function of a first processing unit, the second clock rate is a function of a second processing unit, and the third clock rate is a function of a third processing unit.
58. The computer-program product of claim 57, wherein the first processing unit comprises a modem and the second processing unit and the third processing unit each comprise one of an audio processor or a video processor.
59. The computer-program product of claim 49, wherein the instructions are executable by the processor to:
update the reference time periodically.
60. The computer-program product of claim 59, wherein the instructions are executable by the processor to:
re-map the updated reference time after each reference time update.
61. The computer-program product of claim 49, wherein the instructions are executable by the processor to:
update the reference time non-periodically.
62. The computer-program product of claim 61, wherein the instructions are executable by the processor to:
re-map the updated reference time after each reference time update.
63. The computer-program product of claim 49, wherein the instructions are executable by the processor to:
transmit the first capture time via a packet switched network.
64. The computer-program product of claim 49, wherein the instructions are executable by the processor to:
utilize the first capture time to perform audiovisual synchronization of a video display and an audio speaker.
PCT/US2014/049593 2013-08-16 2014-08-04 Clock synchronization for multi-processor/multi-chipset solution WO2015023463A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US201361866924P 2013-08-16 2013-08-16
US61/866,924 2013-08-16
US14/262,994 US9300713B2 (en) 2013-08-16 2014-04-28 Clock synchronization for multi-processor/multi-chipset solution
US14/262,994 2014-04-28

Publications (1)

Publication Number Publication Date
WO2015023463A1 (en) 2015-02-19

Family

ID=52466587

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2014/049593 WO2015023463A1 (en) 2013-08-16 2014-08-04 Clock synchronization for multi-processor/multi-chipset solution

Country Status (2)

Country Link
US (1) US9300713B2 (en)
WO (1) WO2015023463A1 (en)

Families Citing this family (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8290603B1 (en) 2004-06-05 2012-10-16 Sonos, Inc. User interfaces for controlling and manipulating groupings in a multi-zone media system
US8086752B2 (en) 2006-11-22 2011-12-27 Sonos, Inc. Systems and methods for synchronizing operations among a plurality of independently clocked digital data processing devices that independently source digital data
US11294618B2 (en) 2003-07-28 2022-04-05 Sonos, Inc. Media player system
US9207905B2 (en) 2003-07-28 2015-12-08 Sonos, Inc. Method and apparatus for providing synchrony group status information
US11106424B2 (en) 2003-07-28 2021-08-31 Sonos, Inc. Synchronizing operations among a plurality of independently clocked digital data processing devices
US11106425B2 (en) 2003-07-28 2021-08-31 Sonos, Inc. Synchronizing operations among a plurality of independently clocked digital data processing devices
US8234395B2 (en) 2003-07-28 2012-07-31 Sonos, Inc. System and method for synchronizing operations among a plurality of independently clocked digital data processing devices
US11650784B2 (en) 2003-07-28 2023-05-16 Sonos, Inc. Adjusting volume levels
US9977561B2 (en) 2004-04-01 2018-05-22 Sonos, Inc. Systems, methods, apparatus, and articles of manufacture to provide guest access
US9374607B2 (en) 2012-06-26 2016-06-21 Sonos, Inc. Media playback system with guest access
US8868698B2 (en) 2004-06-05 2014-10-21 Sonos, Inc. Establishing a secure wireless network with minimum human intervention
US8326951B1 (en) 2004-06-05 2012-12-04 Sonos, Inc. Establishing a secure wireless network with minimum human intervention
US8788080B1 (en) 2006-09-12 2014-07-22 Sonos, Inc. Multi-channel pairing in a media system
US9202509B2 (en) 2006-09-12 2015-12-01 Sonos, Inc. Controlling and grouping in a multi-zone media system
US8483853B1 (en) 2006-09-12 2013-07-09 Sonos, Inc. Controlling and manipulating groupings in a multi-zone media system
US11429343B2 (en) 2011-01-25 2022-08-30 Sonos, Inc. Stereo playback configuration and control
US11265652B2 (en) 2011-01-25 2022-03-01 Sonos, Inc. Playback device pairing
US9729115B2 (en) 2012-04-27 2017-08-08 Sonos, Inc. Intelligently increasing the sound level of player
US9008330B2 (en) 2012-09-28 2015-04-14 Sonos, Inc. Crossover frequency adjustments for audio speakers
US9516440B2 2012-10-01 2016-12-06 Sonos, Inc. Providing a multi-channel and a multi-zone audio environment
KR20150026069A (en) * 2013-08-30 2015-03-11 Samsung Electronics Co., Ltd. Method for playing contents and an electronic device thereof
US9226073B2 (en) 2014-02-06 2015-12-29 Sonos, Inc. Audio output balancing during synchronized playback
US9226087B2 (en) 2014-02-06 2015-12-29 Sonos, Inc. Audio output balancing during synchronized playback
JP6449568B2 * 2014-06-24 2019-01-09 Kyocera Corporation Mobile terminal and control method
US9665341B2 (en) 2015-02-09 2017-05-30 Sonos, Inc. Synchronized audio mixing
US10248376B2 (en) 2015-06-11 2019-04-02 Sonos, Inc. Multiple groupings in a playback system
US9548048B1 (en) * 2015-06-19 2017-01-17 Amazon Technologies, Inc. On-the-fly speech learning and computer model generation using audio-visual synchronization
WO2017083829A1 (en) * 2015-11-13 2017-05-18 Pyrotechnics Management, Inc. Time code controlled logic device
US10021438B2 (en) 2015-12-09 2018-07-10 Comcast Cable Communications, Llc Synchronizing playback of segmented video content across multiple video playback devices
US10678294B2 (en) * 2016-10-14 2020-06-09 Sound Devices, LLC Clock for recording devices
US10712997B2 (en) 2016-10-17 2020-07-14 Sonos, Inc. Room association based on name
CN109348247B * 2018-11-23 2021-03-30 Guangzhou Kugou Computer Technology Co., Ltd. Method and device for determining audio and video playing time stamp and storage medium
EP3726842A1 (en) * 2019-04-16 2020-10-21 Nokia Technologies Oy Selecting a type of synchronization
US11924317B1 (en) * 2020-09-25 2024-03-05 Apple Inc. Method and system for time-aligned media playback
CN115776683A * 2021-09-08 2023-03-10 Huawei Technologies Co., Ltd. Medium synchronous time delay timer setting method and related device
CN114025116B * 2021-11-25 2023-08-04 Beijing ByteDance Network Technology Co., Ltd. Video generation method, device, readable medium and electronic equipment
US20240054688A1 (en) * 2022-08-11 2024-02-15 Qualcomm Incorporated Enhanced Dual Video Call with Augmented Reality Stream

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1610558A2 (en) * 2004-06-22 2005-12-28 LG Electronics Inc. Synchronizing video/audio of mobile communication terminal
WO2006126826A1 (en) * 2005-05-24 2006-11-30 Pixtree Technologis, Inc. A hardware apparatus having video/audio encoding function and multiplexing function, and method thereof
US20080168294A1 (en) * 2007-01-08 2008-07-10 Apple Computer, Inc. Time synchronization of multiple time-based data streams with independent clocks
US20080304573A1 (en) * 2007-06-10 2008-12-11 Moss Nicolas Capturing media in synchronized fashion
US7664057B1 (en) * 2004-07-13 2010-02-16 Cisco Technology, Inc. Audio-to-video synchronization system and method for packet-based network video conferencing
EP2448265A1 (en) * 2010-10-26 2012-05-02 Google, Inc. Lip synchronization in a video conference

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6975363B1 (en) * 2000-08-31 2005-12-13 Microsoft Corporation Methods and systems for independently controlling the presentation speed of digital video frames and digital audio samples
US7710965B2 (en) * 2004-11-23 2010-05-04 Broadlogic Network Technologies Inc. Method and system for multi-program clock recovery and timestamp correction
US20060123063A1 (en) * 2004-12-08 2006-06-08 Ryan William J Audio and video data processing in portable multimedia devices
US8321593B2 (en) * 2007-01-08 2012-11-27 Apple Inc. Time synchronization of media playback in multiple processes
EP2141689A1 (en) 2008-07-04 2010-01-06 Koninklijke KPN N.V. Generating a stream comprising interactive content
FR2939592B1 2008-12-10 2011-04-08 Alcatel Lucent METHOD AND DEVICE FOR VIDEOCONFERENCE COMMUNICATION.
US8605783B2 (en) * 2009-05-22 2013-12-10 Microsoft Corporation Composite video generation
WO2011018368A1 (en) 2009-08-12 2011-02-17 Koninklijke Kpn N.V. Dynamic rtcp relay
EP3627798A1 (en) 2010-01-27 2020-03-25 Koninklijke KPN N.V. Method, system and device for synchronization of media streams

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1610558A2 (en) * 2004-06-22 2005-12-28 LG Electronics Inc. Synchronizing video/audio of mobile communication terminal
US7664057B1 (en) * 2004-07-13 2010-02-16 Cisco Technology, Inc. Audio-to-video synchronization system and method for packet-based network video conferencing
WO2006126826A1 (en) * 2005-05-24 2006-11-30 Pixtree Technologis, Inc. A hardware apparatus having video/audio encoding function and multiplexing function, and method thereof
US20080168294A1 (en) * 2007-01-08 2008-07-10 Apple Computer, Inc. Time synchronization of multiple time-based data streams with independent clocks
US20080304573A1 (en) * 2007-06-10 2008-12-11 Moss Nicolas Capturing media in synchronized fashion
EP2448265A1 (en) * 2010-10-26 2012-05-02 Google, Inc. Lip synchronization in a video conference

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
KURT ROTHERMEL ET AL: "Clock Hierarchies: An Abstraction for Grouping and Controlling Media Streams", IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, IEEE SERVICE CENTER, PISCATAWAY, US, vol. 14, no. 1, 1 January 1996 (1996-01-01), XP011054430, ISSN: 0733-8716 *

Also Published As

Publication number Publication date
US20150049248A1 (en) 2015-02-19
US9300713B2 (en) 2016-03-29

Similar Documents

Publication Publication Date Title
US9300713B2 (en) Clock synchronization for multi-processor/multi-chipset solution
US11310841B2 (en) Long-term evolution assisted new radio initial access and mobility for 5G or other next generation networks
US20220038978A1 (en) Avoiding out of order uplink data reception upon data radio bearer release, handover to another data radio bearer, or quality of service flow addition
US11770729B2 (en) Distributed scheduling algorithm for CPRI over ethernet
US20210250884A1 (en) Synchronization in multi-hop nr iab deployment
KR102237971B1 (en) Harq design for lte in unlicensed spectrum utilizing individual ack/nack
US9698962B2 (en) Timing advance techniques for large cells
WO2018177406A1 (en) Method for signal transmission and device
US9179358B2 (en) Techniques for reducing network congestion in a wireless communications system
US20150365830A1 (en) Techniques for enhancing frame structure and listen before talk procedure (lbt) for transmissions using an unlicensed radio frequency spectrum band
WO2019243669A1 (en) Time-synchronized radio bearer for supporting precision timing protocol (ptp) based time sensitive network (tsn) applications
US10978096B2 (en) Optimized uplink operation for voice over long-term evolution (VoLte) and voice over new radio (VoNR) listen or silent periods
US20160056907A1 (en) DUAL THRESHOLD BASED CELL CLUSTERING INTERFERENCE MITIGATION FOR eIMTA
US11576133B2 (en) Timing synchronization of 5G V2X sidelink transmissions
US20230171720A1 (en) End-to-end time synchronization for cloud-based services
US20240056889A1 (en) Communication processing method for data transmission and related device
US20180103471A1 (en) Split dwell time scan optimization for delay sensitive traffic
US11889447B2 (en) Supporting inter-media synchronization in wireless communications
US20240107476A1 (en) Supporting Inter-Media Synchronization In Wireless Communications
US20240089210A1 (en) Dejitter target delay value based on silence descriptors
US20230217358A1 (en) Staggering transmission to wireless clients based on wireless network information
TW201724887A (en) Frame configuration of dynamic uplink/downlink switch

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14761435

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14761435

Country of ref document: EP

Kind code of ref document: A1