WO2016138493A1 - Systems and methods for frame duplication and frame extension in live video encoding and streaming - Google Patents

Systems and methods for frame duplication and frame extension in live video encoding and streaming Download PDF

Info

Publication number
WO2016138493A1
WO2016138493A1 PCT/US2016/019955 US2016019955W WO2016138493A1 WO 2016138493 A1 WO2016138493 A1 WO 2016138493A1 US 2016019955 W US2016019955 W US 2016019955W WO 2016138493 A1 WO2016138493 A1 WO 2016138493A1
Authority
WO
WIPO (PCT)
Prior art keywords
live
encoding
frame
input stream
encoding system
Prior art date
Application number
PCT/US2016/019955
Other languages
French (fr)
Inventor
Yuri BULAVA
Pavel Potapov
Original Assignee
Sonic Ip, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sonic Ip, Inc. filed Critical Sonic Ip, Inc.
Priority to SG11201706160UA priority Critical patent/SG11201706160UA/en
Priority to KR1020177023590A priority patent/KR101897959B1/en
Priority to CN201680012053.2A priority patent/CN107251008B/en
Priority to EP19206431.9A priority patent/EP3627337A1/en
Priority to ES16756526T priority patent/ES2768979T3/en
Priority to JP2017544732A priority patent/JP6588987B2/en
Priority to EP16756526.6A priority patent/EP3262523B1/en
Publication of WO2016138493A1 publication Critical patent/WO2016138493A1/en
Priority to HK18105666.2A priority patent/HK1246423A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234381Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by altering the temporal resolution, e.g. decreasing the frame rate by frame skipping
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/70Media network packetisation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/75Media network packet handling
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/75Media network packet handling
    • H04L65/752Media network packet handling adapting media to network capabilities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/60Network streaming of media packets
    • H04L65/75Media network packet handling
    • H04L65/765Media network packet handling intermediate
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21Server components or server architectures
    • H04N21/218Source of audio or video content, e.g. local disk arrays
    • H04N21/2187Live feed
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/24Monitoring of processes or resources, e.g. monitoring of server load, available bandwidth, upstream requests
    • H04N21/2405Monitoring of the internal components or processes of the server, e.g. server load
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/02Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]

Definitions

  • the present invention generally relates to the field of live encoding of adaptive bitrate streams from live input streams. Specifically, the present invention relates to several techniques for optimizing and improving the live encoding of adaptive bitrate streams from live input streams.
  • MPEG-DASH (ISO/IEC 23009-1 ) is a standard for streaming multimedia content over the internet.
  • MPEG-DASH was developed by the Moving Picture Expert Group (MPEG).
  • MPEG has been responsible for developing previous multimedia standards, including MPEG-2, MPEG-4, MPEG-7, MPEG-21 and others.
  • MPEG-DASH is an adaptive bitrate streaming technique that enables high quality streaming of media content over the Internet delivered from conventional HTTP web servers.
  • MPEG-DASH uses sequences of small files that each contain a segment of video that are retrieved via Hypertext Transfer Protocol (HTTP), each segment containing a short interval of playback time of a presentation.
  • Presentations can be can live events and/or have specified durations.
  • the adaptive bitrate streams can be made available at a variety of different bit rates, such as 300kb/s, 500kb/s, and 3MB/s. Live encoding and/or transcoding of source streams into multiple adaptive bitrate streams can require substantial computing resources and live encoding hardware is fairly expensive.
  • FIG. 1 is a network diagram illustrating a live encoding system in accordance with an embodiment of the invention.
  • FIG. 2 is a flow chart illustrating a high level process performed by a live encoding system in accordance with an embodiment of the invention.
  • FIG. 3 conceptually illustrates an example a live encoding system extending frames to compensate for missing input frames in accordance with an embodiment of the invention.
  • FIG. 4 conceptually illustrates an alternative example a live encoding system extending frames to compensate for missing input frames in accordance with an embodiment of the invention.
  • FIG. 5 conceptually illustrates an example of a live encoding system extending frames to compensate for delayed input frames in accordance with an embodiment of the invention.
  • FIG. 6 conceptually illustrates an alternative example of a live encoding system extending frames to compensate for delayed input frames in accordance with an embodiment of the invention.
  • FIG. 7 conceptually illustrates an example of a live encoding system replicating frames to compensate for system load in accordance with an embodiment of the invention.
  • FIG. 8 is a data flow diagram for a live encoding system and a streaming in accordance with an embodiment of the invention.
  • FIG. 9 is an example of a Media Presentation Description (MPD) data model for MPEG-DASH that can be utilized by embodiments of the invention.
  • MPD Media Presentation Description
  • FIG. 10 conceptually illustrates an architecture of a live encoding server in accordance with an embodiment of the invention.
  • live encoding systems in accordance with embodiments of the invention are illustrated.
  • the live encoding systems receive live media feeds such as (but not limited to) sporting events, live news coverage, web live streams, and/or singular or multiplexed streams of media.
  • Streams of media contain multimedia that is constantly received by and presented to a client while being delivered by a provider.
  • Streaming refers to the process of delivering media via streams.
  • Live encoding systems can provide streams of media to clients encoded from a live input stream.
  • live encoding systems can encode received live media feeds into several different adaptive bitrate streams having different maximum bitrates.
  • the live encoding systems can further transmit the encoded adaptive bitrate streams in live media presentations to streaming clients via protocols including (but not limited to) HTTP requests and/or provide the encoded adaptive bitrate streams to servers for distribution to client devices.
  • Encoding and transmission of live media presentations can be taxing on the hardware used to perform these operations.
  • Embodiments of the invention provide for several techniques to reduce the load on hardware performing live encoding and transmission operations. For instance, live encoding systems in accordance with many embodiments of the invention can assess network and/or server load levels according to several measures. Load is often measured as an amount of work (e.g., computations, encoding operations, memory operations, etc.) a live encoding system is performing.
  • the live encoding systems can adjust how frames of video from live media feeds are being encoded. For instance, some embodiments of the live encoding systems replicate a current encoded frame instead of re-encoding said current frame, and then adjust the replicated frame to different bitrates, resolutions, and/or contexts as necessary for the several different adaptive bitrate streams. In addition, various embodiments of the live encoding systems can extend a duration of a current frame being repackaged and/or re- encoded. Utilizing these and other techniques, live encoding systems in accordance with embodiments of the invention can more efficiently handle gaps in received data, slower feeding of data, and/or heavy loads on server hardware.
  • Network transmission levels can affect live encoding processes. For instance, when a live media feed suffers interruptions in network transmission levels from the live input stream to the live encoding system, the live encoding system may encounter a gap in incoming data. Gaps in incoming data can produce gaps in output data and/or result in the live encoding system failing to deliver output frames when requested. Live encoding systems in accordance with some embodiments of the invention can assess incoming media feeds to determine when gaps have occurred. These assessments can be based on several measures including (but not limited to) incoming frame rate, incoming bit rates, time between arrived frames, and/or network bandwidth measurements.
  • Live encoding systems in accordance with many embodiments of the invention can compensate for detected gaps in data by replicating frames and/or extending frames during repackaging of incoming media streams into several adaptive bitrate streams.
  • the live encoding systems can allow network conditions a chance to stabilize without jeopardizing the availability of frames at the requested time that clients depend on.
  • the live encoding system can fall behind the live edge of live streamed media. Clients typically request frames from a live stream at the live edge of the presentation.
  • live edge refers to the most recently encoded segments of the live stream that clients can request without the risk of requesting segments that are not yet available. Requesting not yet available segments result numerous streaming errors such as (but not limited) delays, HTTP not found errors, and can result in bandwidth-clogging repeated requests.
  • Server load levels can affect live encoding processes as well.
  • a live encoding system is implemented as a live encoding server
  • the server hardware can become overwhelmed by encoding processes.
  • a live encoding server falls behind the live edge, the several adaptive bitrate streams can fail as the clients rely on requests being made at the live edge.
  • live streaming clients can request segments of video based on an assumption that live encoding systems generate the segments not slower than real time.
  • Live encoding systems in accordance with many embodiments of the invention can compensate for server load by extending current frames and adjusting timestamps of output frames. The extended frames can produce minor and/or difficult to perceive visual errors but will preserve the request and receive HTTP cycle clients depend on for live streaming.
  • live encoding systems in accordance with embodiments of the invention can also compensate for server load by replicated current frames and adjusting their frame contexts as necessary for the output streams.
  • System 100 includes live encoding servers and supporting hardware 102 that includes application servers, database servers, and/or databases as necessary to support live encoding.
  • the live encoding servers and supporting hardware 102 can receive live media content and/or non-live content from content sources 1 14.
  • Content sources 1 14 can include hardware use to provide media to live encoding servers and supporting hardware 102.
  • the media received from content sources 1 14 can include (but is not limited to) web streams, live media broadcasts, television broadcasts, live event coverage, video feeds from live cameras, previously stored media, raw media feeds, encoded media feeds, and/or static files received from local and/or remote storages.
  • the live encoding servers and supporting hardware 102 can communicate over network 104 with several groups of devices in order to provide streams of content.
  • the groups of devices include (but are not limited to) web, file, and/or Media Servers 106, computing devices 108, and/or mobile devices 1 12. Users of the devices from these groups of devices can view provided streaming content utilizing local streaming clients.
  • a web server from web, file, and/or Media Servers 106 can also serve as hosts for additional downstream viewers and/or clients of the provided streaming content.
  • live encoding servers and supporting hardware 102 includes application servers, database servers, and databases.
  • live encoding servers and supporting hardware 102 can include varying numbers and types of devices.
  • live encoding servers and supporting hardware 102 can be implemented as a single computing device where the single computing device has sufficient storage, networking, and/or computing power.
  • live encoding servers and supporting hardware 102 may also be implemented using multiple computing devices of various types and multiple locations.
  • live encoding servers and supporting hardware 102 may be implement as a live encoding server for encoding live media and an HTTP server for responding to HTTP requests for segments encoded by the live encoding server.
  • live encoding servers and supporting hardware 102 is shown including application servers, database servers, and databases, a person skilled in the art will recognize that the invention is not limited to the devices shown in FIG. 1 and can include additional types of computing devices (e.g., web servers, and/or cloud storage systems).
  • network 104 is the Internet.
  • Live encoding servers and supporting hardware 102 can receive requests and transmit media segments to and from mobile devices 1 12 through network 104 and over a wireless connection 1 10.
  • Wireless connection 1 10 can be (but is not limited to) a 4G connection, a cellular network, a Wi-Fi network, and/or any other wireless data communication link appropriate to the requirements of specific applications.
  • Live encoding servers and supporting hardware 102 can communicate directly with computing devices 108 and web, file, and/or Media Servers 106 through network 104.
  • Other embodiments may use other networks, such as Ethernet or virtual networks, to communicate between devices.
  • a person skilled in the art will recognize that the invention is not limited to the network types shown in FIG. 1 and can include additional types of networks (e.g., intranets, virtual networks, mobile networks, and/or other networks appropriate to the requirements of specific applications).
  • FIG. 1 Although a specific architecture is shown in FIG. 1 , different architectures involving electronic devices and network communications can be utilized to implement live encoding systems to perform operations and provide functionalities in accordance with embodiments of the invention.
  • Live encoding systems in accordance with numerous embodiments of the invention can use real time analysis of incoming media and/or encoding system loads to mitigate losses and interruptions in live encoding through techniques discussed below.
  • FIG. 2 conceptually illustrates a process 200 that can be performed by live encoding systems in accordance with embodiments of the invention in receiving media, generating streams, and providing the generated streams to live streaming clients.
  • the process 200 is performed by a live encoding server in accordance with the embodiment described above in connection with FIG. 1.
  • process 200 can be performed by an MPEG-DASH live encoding server during continuous live encoding and live streaming of media.
  • Media can be received (210).
  • media can encompass numerous different types, formats, standards, and/or presentations.
  • the received media is a live feed of already encoded media.
  • the received media can include (but not limited to) input streams, live media feeds, television feeds, satellite feeds, web streams, and/or static files received from local and/or remote storages.
  • Streams can be generated (220) from the received media.
  • the generated streams can be of many possible formats, such as (but not limited to) MPEG-DASH, H.264/AVC, HTTP Live Streaming, Smooth Streaming, and/or any other adaptive bitrate format.
  • the generated streams can then be provided to streaming clients over a network connection.
  • the generated streams will be of different maximum bitrates and be encoded according to varying encoding parameters.
  • streams are generated utilizing a repackaging application of a live encoding server.
  • the repackaging application repackages received media into output streams. Thereby, the repackaging application can utilize utilizing various encoders and decoders as necessary to generate as necessary to generate the streams.
  • the generation of streams can be a continuous process that is performed as live media is received.
  • load levels on the live encoding system, load levels in a communication network, gaps in receipt of media, and/or gaps in generation of streams can be assessed (230).
  • different embodiments may assess other aspects of live encoding server operations. Performing said assessments can include several sub- operations.
  • the live encoding system can check incoming data rates and/or frame rates of the received media. The incoming data rates and/or frame rates of the received media can be compared to frame times determined according to internal logic of the live encoding system.
  • the internal logic can include several sources of determining a reliable time, such as (but not limited to) time stamps of the received media, clock implementations on the live encoding system, and/or the declared frame rate of the received media.
  • the live encoding systems can measure differences in times between incoming frames in order to calculate an overall incoming data rate. The live encoding systems can then monitor the calculated overall incoming data rate to identify gaps in incoming data or potential surges that may overwhelm the processing power of the live encoding system.
  • One or more of these assessments can indicate that the live encoding system has not received a frame at a proper time and/or will fail to encode a frame in time to meet the live edge requirement for live encoding systems.
  • frames of received media can optionally be duplicated and/or replicated (240).
  • the duplicated frames can be modified to account for new frame contexts associated with the various generated streams.
  • Different frame contexts can include (but are not limited to) different resolutions, different frames types (such as I- frames, B-frames, and/or P-frames), different maximum bitrates.
  • Generation of streams from received media often involves re-encoding the received media to a different format where the received media includes encoded frames. Re-encoding of the received media can be among the more resource intensive operations performed by live encoding systems.
  • the duplicated frames can then be utilized in the generated streams without a relatively costly re-encoding operation.
  • the duplicated frames can also be duplicated from raw frames from the received media in addition to encoded frames from the received media.
  • replicating encoded frames instead of re-encoding the frames as a part of a live encoding process can result in the output streams violating certain requirements of the hypothetical reference decoder (HRD) in H.264/AVC.
  • HRD hypothetical reference decoder
  • the HRD shall not overflow nor underflow when its input is a compliant stream.
  • Replicating a large encoded frame and utilizing the replicated stream in a low maximum bitrate stream risks causing a buffer overflow that would fail the HRD requirements.
  • software decoder clients can compensate for this without a problem due to their more flexible buffers. The software decoder clients will can require additional CPU cycles to process the replicated frames.
  • Hardware decoder clients will encounter errors due to possible buffer overflows when replicated frames are used in lower maximum bitrate streams.
  • Some embodiments of the invention provide for reducing the bit values of replicated frames for lower maximum bitrate output streams in order to mitigate against the risk of buffer overflows in hardware decoders.
  • duplicated frames are only used for their own specific maximum bitrate output streams; thereby preventing high bit value frames from being utilized low maximum bitrate streams. This can be accomplished by including separate encoding processes for each output stream.
  • frames can be replicated and/or duplicated from input streams where the input stream and the output stream share same formats, maximum bitrates, and/or resolutions. This can occur where the desired output stream is the same as the input stream. Where this occurs, re-encoding can be skipped and several embodiments can simply replicated the instantaneous decoding refreshes (IDR) frames from the input streams. As discussed above, the resulting output stream can be non-HRD compliant in said several embodiments.
  • IDR instantaneous decoding refreshes
  • frames of received media can optionally be extended (250).
  • Extending frames can include packaging a given frame into an output stream at times different than the given frame's assigned time stamp.
  • different extensions of frames may occur.
  • a current frame may be extended in generation of the output streams.
  • the repackaging application can perform the extension during repackaging of frames into output streams.
  • the repackaging application can spread several smaller frame extensions over multiple frames in order to compensate for the gap in multiple steps. The smaller extensions can serve to conceal the extensions from streaming client viewers.
  • the generated output streams can be provided (260) to streaming clients.
  • the generated output streams can be at different maximum bitrate yet each represent a single media presentation.
  • a given media presentation can be provided to streaming clients in several streams having different maximum bitrates.
  • the provision of generated output streams can be accomplished via HTTP requests for segments from the generated output streams.
  • process 200 While the operations presented in process 200 are presented in a linear order, various embodiments can perform said operations in varying orders. For instance, the generation and provision of streams to clients can be performed continuously as live media is received. Thus, the order of operations presented in process 200 is merely demonstrative and can be performed continuously as a part of a cyclical process for live generation of streams from frames of received media. Having discussed an overview of processes performed by live encoding systems of some embodiments, the following discussion will provide several examples of frame extension and frame replication that can performed as a part of said processes.
  • live encoding systems in accordance with embodiments of the invention can extend frames and/or replicate frames in response to assessed network and/or server conditions.
  • Frame extensions and/or frame replications can compensate for dropped input frames, delayed input frames, and/or encoding system load.
  • FIG. 3, FIG. 4, FIG. 5, FIG. 6, and FIG. 7 conceptually illustrate several examples of frame extension and frame duplication in accordance with embodiments of the invention.
  • the examples presented in the aforementioned figures are abstractions of the live encoding process illustrated to show the effects of frame replications and/or frame extensions.
  • Live encoding systems in accordance with embodiments of the invention will include additional details, components, and/or functionalities not illustrated in the examples in FIG. 3, FIG. 4, FIG. 5, FIG. 6, and FIG. 7.
  • time stamps, frame numbers, and/or frame durations are presented for demonstrative purposes. Embodiments of the invention are not limited to the specific values presented in FIG. 3, FIG. 4, FIG. 5, FIG. 6, and FIG. 7 and can incorporate wide ranges of possible time stamps, frame numbers, and/or frame durations as required for live encoding operations. Moreover, while only a single output stream is shown in the following figures, embodiments of the invention typically generate multiple output streams at varying maximum bitrates with varying encoding parameters.
  • FIG. 3 conceptually illustrates an example a live encoding system extending frames to compensate for missing input frames in accordance with an embodiment of the invention.
  • live encoding system 300 in receiving an input stream 310 and generating an output stream 360.
  • the live encoding processes of live encoding system 300 are performed during continuous receipt of input stream 310 and generation of output stream 360.
  • Input stream 310 can be any of the input streams and/or media discussed above.
  • Live encoding system 360 can provide the generated output stream 360 to streaming clients (not shown) via any of the techniques discussed above. Techniques such as receiving HTTP requests and transmitting segments from the output stream.
  • input stream input stream 310 includes several frames with identified time stamps and durations.
  • the frames can include portions of media, such as frames video.
  • Time stamps are indicated by the abbreviation "TS”.
  • Durations are indicated by the abbreviation "D”.
  • TS Time stamps
  • D Durations
  • FIG. 3 the values shown in FIG. 3 are demonstrative. Embodiments of the invention can receive and process various different time stamp and duration values as necessary to support live encoding.
  • Frame 5 320 has a time stamp value equal to 5 and a duration value equal to 1 .
  • Live encoding system 300 expects to receive frames from input stream 310 at specified times. When frames are not received at the specified times, live encoding system 300 may not be able to generate the output stream 360 in time for the live edge expected by live streaming clients. Live encoding system 300 can assess whether frames are missing from the input stream 310 using a variety of measures as discussed above. Such as comparing internal clocks maintained by the live encoding system 300 to the time stamps of the received frames of the live input stream 310. Live encoding system 310 can also include thresholds for missing frames that must be met before extending frames. Live encoding system 310 includes a threshold of two missing frames before electing to extending frames to compensate for the at least two frame gap.
  • Different embodiments may include different thresholds that can be based on a different number of frames and/or a different threshold measurement, such as missing frames over a segment of time instead of missing frames in sequence.
  • Live encoding of video is inherently a resource intensive process, thus various embodiments can utilize a variety of thresholds in connection with assessing encoding conditions, such encoding system loads, client stuttering, network bandwidth stability, video quality, and other metrics and/or conditions that can affect live encoding of video.
  • specific counts of frames and their delivery can be calculated and compared to different thresholds of frame counts and times in different embodiments of the invention.
  • different embodiments can use different metrics for assessing such streaming conditions, processing cycle counts, time benchmarks for encoding of sets of frames, network transfer rates, delivered and displayed framerates, and various measurements of visual quality/fidelity. While specific values are not provided herein, different specific values (such as dips below 24 frames per second, visual errors causing display failures in excess of certain gamma values, frames encoded per second, etc.) can be utilized as necessary to implement the invention without departing from the spirit of the invention.
  • Input frames can go missing under a variety of different circumstances, such (but not limited to) when there is a failure in the network connection between the provider of the input stream and the live encoding system, when there is fault in the input stream, and/or internal errors of the live encoding system.
  • input stream 310 is missing frames 330 and frames 340.
  • Live encoding system 300 can detect this gap by comparing the time stamp of frame 8 350 to the time stamp of frame 5 320 and an internal clock maintained by live encoding system 300. Once the missing frame threshold is met, live encoding system 300 can extend frames to compensate for the gap in frames.
  • Various embodiments can use different thresholding schemes, including any of those discussed above.
  • live encoding system 300 extends frame 5 320 from the input stream 310 in generating output stream 360.
  • Extended frame 370 is extended to have a duration value equal to 3 in order to cover the missing frames 330 and 340.
  • Extended frame 370 will be available when requested by live streaming clients and preserves the live edge required to support uninterrupted live streaming. However, extending frame durations can result in visual artifacts if used excessively.
  • FIG. 4 conceptually illustrates an alternate method of extending frame durations that helps to conceal the effects of frame extensions.
  • live encoding system 400 is generating an output stream 460 from an input stream 410.
  • Input stream 410 is missing frames 430 and 440.
  • live encoding system 400 can extend the durations of frame 5 420 and frame 8 450, and also adjust the time stamp value of frame 8 450.
  • extended frame 5 470 has been extended to have a duration value of 2
  • extended frame 8 480 has been extended to have a duration value of 2 as well.
  • the time stamp for extended frame 8 470 has been adjusted to be at 7 such that extended frame 8 480 will be available immediately after extended frame 5 470.
  • live encoding system 400 can conceal some of visual artifacts cause by frame duration extensions.
  • FIG. 5 conceptually illustrates an example a live encoding system extending frames to compensate for delayed input frames in accordance with an embodiment of the invention.
  • live encoding system 500 is generating output stream 560 from input stream 510.
  • frame delays 530 and 540 result in frame 6 550 arriving late.
  • Live encoding system 500 can detect the frame delay and use frame duration extension to compensate.
  • Live encoding system 500 generates output stream 560 which include extended frame 5 with a duration extended to 3 and frame 6 580 with a time stamp value adjusted to 8.
  • Extended frame 570 will be available when requested by live streaming clients and preserves the live edge required to support uninterrupted live streaming.
  • extending frame durations can result in visual artifacts if used excessively.
  • FIG. 6 conceptually illustrates an alternate method of extending frame durations to compensate for frame delays that helps to conceal the effects of frame extensions.
  • live encoding system 600 is generating an output stream 660 from an input stream 610.
  • frame delays occur at 630 and 640.
  • live encoding system 600 can extend the durations of frame 5 620 and frame 6 650, and also adjust the time stamp value of frame 6 650.
  • extended frame 8 670 has been extended to have a duration value of 2
  • extended frame 8 has been extended to have a duration value of 2 as well.
  • time stamp for extended frame 8 670 has been adjusted to be at 7 such that extended frame 8 670 will be available immediately after extended frame 5 670.
  • live encoding system 400 can conceal some of visual artifacts cause by frame duration extensions.
  • Embodiments of the invention are not limited to the frame extensions techniques discussed above with respect to FIG. 3, FIG. 4, FIG. 5, and FIG. 6.
  • Various embodiments can utilize sequential extensions of frame durations as shown in FIG. 3 and FIG. 5 and/or interspersed extensions of frame durations as shown in FIG. 4 and FIG. 5 in different circumstances.
  • extending frame durations is not limited to being performed due to missing and/or delayed frames.
  • FIG. 7 conceptually illustrates an example a live encoding system extending frames to compensate for server load in accordance with an embodiment of the invention.
  • live encoding system 700 in receiving an input stream 710 and generating an output stream 760.
  • the live encoding processes of live encoding system 700 are performed during continuous receipt of input stream 710 and generation of output stream 760.
  • Live encoding system 700 is shown under load 740. In order to compensate for this load, live encoding system 700 can replicate frames from encoded input stream in the encoded domain.
  • live encoding system 700 receives encoded frame 4 720 and encoded frame 5 730.
  • Live encoding system 700 replicates these frames in generating encoded output stream 750.
  • Frame fields for replicated frame 4 760 and replicated frame 5 770 may have to be adjusted in order to account for the new frame context. However, these adjustments can require significantly less processing resources as compared to re-encoding operations.
  • Replicated frame 4 760 and replicated frame 5 770 have the same duration values and time stamp values as encoded frame 4 720 and encoded frame 5 730.
  • Embodiments of the invention are not limited to the specific frame replication techniques discussed above in the example conceptually illustrated in FIG. 7.
  • Various embodiments can utilize frame replication and/or duplication with various formats of input streams, such as raw, un-encoded input streams.
  • embodiments of the invention are not limited to performing frame replication and/or frame duplication only during times of server load.
  • some embodiments of the invention can perform encoded frame replication as a part of a continuous encoding process to maintain efficient live encoding without waiting until server load reaches critical levels. Said some embodiments could be utilized on lower powered live encoding servers.
  • MPEG-DASH (ISO/IEC 23009-1 ) is a standard for streaming multimedia content over the internet.
  • MPEG-DASH was developed by the Moving Picture Expert Group (MPEG).
  • MPEG has been responsible for developing previous multimedia standards, including MPEG-2, MPEG-4, MPEG-7, MPEG-21 and others.
  • MPEG-DASH provides for adaptive segmented media delivery using HTTP.
  • the MPEG-DASH specification only defines the MPD and the segment formats. Of note, the delivery of the MPD and the media-encoding formats containing the segments, as well as the client behavior for fetching, adaptation heuristics, and playing content, are undefined within the MPEG-DASH standard.
  • FIG. 8 conceptually illustrates an example data flow diagram for a live encoding system utilizing MPEG-DASH in accordance with an embodiment of the invention.
  • FIG. 8 includes a media feed data 810, a live encoding system 820, a HTTP requests 830, requested stream segments 840, a streaming client 850, and media presentation description 860.
  • media feed data 810, HTTP requests 830, requested stream segments 840, and media presentation description 860 can be transmitted over a communication network.
  • the communication network can include (but is not limited to) the internet.
  • live encoding system 820 is receiving media feed data 810.
  • Media feed data 810 can include at least the types of received media discussed above.
  • Live encoding system 820 can generate output streams from the received media feed data 810.
  • live encoding system 820 can replicate frames from the media feed data 810 and/or extend frames from the media feed data 810 based on assessments of the rate of receipt of media feed data 810, load levels on the live encoding system 820, load levels in the communication network supporting the transmission of media feed data 810, gaps in the media feed data 810, and/or gaps in generation of streams by the live encoding system 820.
  • Live encoding system 820 also receives HTTP requests 830. In response to the HTTP requests, live encoding system 820 provides requested stream segments 840. HTTP requests 830 can include byte range requests for a specific segment from one of the generated output streams. Live encoding system 820 can include multiple components, including separate live encoding servers and HTTP servers.
  • the HTTP servers can support the HTTP communication of media segments and requests with clients. Moreover, the HTTP servers can utilize HTTP-based Content Distribution Networks (CDNs) to assist in delivery of media segments to streaming client 850.
  • CDNs Content Distribution Networks
  • MPEG-DASH uses a Media Presentation Description (MPD) to provide clients with a well structured XML manifest describing several adaptive bitrate streams that can be accessed via HTTP requests for stream segments.
  • MPD corresponds to a single media presentation that can be viewed via the several described adaptive bitrate streams.
  • the MPD describes accessible media segments and corresponding timings for the accessible media segments.
  • the MPD is a hierarchical data model including (descending from the top of the hierarchy) a media presentation, periods, adaptation sets, representations, and segments.
  • a media presentation can include to a live broadcast, a live stream, a live event, and/or a pre-recorded media presentation.
  • a media presentation can be spliced and/or include several periods.
  • Periods can include several adaptation sets. Adaptation sets can include different perspectives on the same presentation, such as different cameras from a live sporting event. In addition, different adaptation sets can include different formats, such as audio adaptation sets and video adaptation sets. Within each adaptation set, several representations may be included. Representations support the selection of different bandwidth and/or maximum bitrate levels form the same presentation. Thus, clients of MPEG-DASH can use adaptive bitrate streaming by switching to different representations as bandwidth and/or client loading allows. Each representation includes segments of media that can be requested via HTTP. The HTTP requests are received on pre-formatted URLs associated with each segment.
  • FIG. 9 conceptually illustrates an example Media Presentation Description MPD data model from MPEG-DASH.
  • media presentation 910 includes several periods 915-925.
  • the periods 915-925 each include different period start times.
  • Period 920 at start time 100 seconds is expanded to show several included adaptation sets 925-930.
  • Adaptation set 1 925 includes video from camera 1 of media presentation 910.
  • Adaptation set 2 930 includes audio for media presentation 910.
  • Adaptation set 3 935 includes video from camera 2 of media presentation 910.
  • Adaptation set 1 925 has been expanded to show representation 1 940 and representation 2 945.
  • Representation 1 940 is a 500kb/s representation for adaptation set 1 925 whereas representation 2 945 is a 250kb/s representation for adaptation set 1 925.
  • Within representation 1 940 are initialization segment 100 and media segments 955-965. These segments are requested by streaming clients via HTTP to receive the media contained within them.
  • the live encoding server 1000 includes a processor 1010 in communication with non-volatile memory 1030, volatile memory 1020, and a network interface 1040.
  • the nonvolatile memory includes input data handling application 1050, demuxer application 1055, repackager application 1060, MPD combination application 1065, MPD generation application 1070, HTTP request application 1075, audio decoder application 1080, audio encoder application 1085, video decoder application 1090, and video encoder application 1095.
  • the live encoding server 1000 is an mpeg- dash format live encoding server that prepares MPD files for streams and provides segments of output streams to streaming clients through HTTP requests.
  • Other embodiments may utilize different formats and include different applications as necessary to support said different formats.
  • the input data handling application 1050 receives input streams from the network interface 1040.
  • the input streams can include (but are not limited to) live streams of video content, media presentations, video only files, audio only files, sporting events, web streams, and/or mpeg-dash standard streams.
  • the input data handling application 1050 can perform additional functions including identification of the input streams. Identification can be performed using metadata included with the input streams and/or assessing of characteristics and parameters of the input streams.
  • the demuxer application 1055 demultiplexes individual elementary streams from an input stream. For instance, the demuxer application 1055 can break out the audio, video, and/or subtitle streams within an input stream. The demultiplexed streams can be analyzed, decoded, and reencoded in subsequent operations performed by other applications.
  • the repackager application 1060 can perform the re-encoding, duplication, and frame extension operations as a part of the overall live encoding server operations.
  • the repackager application 1060 can receive input streams from the input data handling application 1050, the demuxer application 1055, the network interface 1040, and/or any other component of the live encoding server 1000 as necessary to repackage streams.
  • the repackager application 1060 can re-encode incoming live frames of received media into several output streams utilizing the video decoder application 1090 and the video encoder application 1095 as necessary.
  • the repackager application 1060 can assess network and/or server load levels of the live encoding server 1000 according to several measures.
  • the repackager application 1060 can duplicate incoming frames to reduce server load levels and/or extend certain frames to compensate for anticipated drops in incoming network bandwidth.
  • the repackager application 1060 can extend frames by manipulating time codes and/or time stamps of frames to increase their duration in output streams.
  • the repackager application 1060 can provide the repackaged, re-encoded, duplicated, and/or extended frames of output streams to the MPD combination application 1065 and/or the MPD generation application 1070 for preparation for later streaming to clients utilizing the HTTP request application 1075.
  • the MPD combination application 1065 combines multiple output streams generated by the repackager application 1060 into a single presentation.
  • the MPD combination application 1070 can generate an MPD file for a combined presentation.
  • the MPD file can describe the periods, adaptation sets, representations, and segments of a media presentation.
  • the MPD combination application 1070 generates MPD's according to characteristics of the generated output streams. These characteristics will vary according to the operations performed by the repackager application 1060.
  • the MPD file is typically the initially requested and provided to streaming clients in order to initiate an mpeg-dash streaming session.
  • the HTTP request application 1075 handles HTTP requests and server media segments according to said HTTP requests.
  • the HTTP request application 1075 may communicate to streaming clients through the network interface 1040.
  • the HTTP request application 1075 is hosted in a separate HTTP server from the live encoding server.
  • the non-volatile memory includes audio decoder application 1080, audio encoder application 1085, video decoder application 1090, and video encoder application 1095. While non-volatile memory 1030 only includes a single video decoder application 1090 and a single video encoder application 1095, other embodiments may include multiple video encoder and video decoder applications. Moreover, some embodiments may utilize sets of applications for each output stream in order to have separate repackager, decorder, and encoder applications to generate each different output stream.
  • the network interface 1040 may be in communication with the processor 1010, the volatile memory 1020, and/or the nonvolatile memory 1030.
  • the above discussion of the applications stored in the nonvolatile memory 1030 of the live encoding server 1000 discusses one exemplary set of applications to support the live encoding server 1000.
  • Other embodiments of the invention may utilize multiple servers with the functions discussed below distributed across multiple servers and/or locations as necessary to implement the invention.
  • the applications discussed below could be combined into one or more applications and implemented as software modules as necessary to implement the invention.
  • the applications discussed below could alternatively be implemented as modules of a single application residing on live encoding server 1000.
  • other embodiments may utilize multiple applications dedicated to similar functions.

Abstract

Embodiments of the invention provide for live encoding systems that can replicate a current encoded frame instead of re-encoding said current frame, and then adjust the replicated frame to different bitrates, resolutions, and/or contexts as necessary for the several different adaptive bitrate streams. In addition, various embodiments of the invention can extend a duration of a current frame being repackaged and/or re-encoded. Utilizing these and other techniques, live encoding systems in accordance with embodiments of the invention can more efficiently handle gaps in received data, slower feeding of data, and/or heavy loads on server hardware.

Description

Systems and Methods for Frame Duplication and Frame Extension in Live Video
Encoding and Streaming
FIELD OF THE INVENTION
[0001] The present invention generally relates to the field of live encoding of adaptive bitrate streams from live input streams. Specifically, the present invention relates to several techniques for optimizing and improving the live encoding of adaptive bitrate streams from live input streams.
BACKGROUND OF THE INVENTION
[0002] Streaming technology has advanced to the point of supporting live over the top streaming. Live events can now be viewed from adaptive bitrate streams generated by live encoding servers. Often, live encoding servers utilize the MPEG-DASH format (i.e., Dynamic Adaptive Streaming over HTTP). MPEG-DASH (ISO/IEC 23009-1 ) is a standard for streaming multimedia content over the internet. MPEG-DASH was developed by the Moving Picture Expert Group (MPEG). MPEG has been responsible for developing previous multimedia standards, including MPEG-2, MPEG-4, MPEG-7, MPEG-21 and others. MPEG-DASH, is an adaptive bitrate streaming technique that enables high quality streaming of media content over the Internet delivered from conventional HTTP web servers. Typically, MPEG-DASH uses sequences of small files that each contain a segment of video that are retrieved via Hypertext Transfer Protocol (HTTP), each segment containing a short interval of playback time of a presentation. Presentations can be can live events and/or have specified durations. The adaptive bitrate streams can be made available at a variety of different bit rates, such as 300kb/s, 500kb/s, and 3MB/s. Live encoding and/or transcoding of source streams into multiple adaptive bitrate streams can require substantial computing resources and live encoding hardware is fairly expensive.
BRIEF DESCRIPTION OF THE DRAWINGS
[0003] FIG. 1 is a network diagram illustrating a live encoding system in accordance with an embodiment of the invention. [0004] FIG. 2 is a flow chart illustrating a high level process performed by a live encoding system in accordance with an embodiment of the invention.
[0005] FIG. 3 conceptually illustrates an example a live encoding system extending frames to compensate for missing input frames in accordance with an embodiment of the invention.
[0006] FIG. 4 conceptually illustrates an alternative example a live encoding system extending frames to compensate for missing input frames in accordance with an embodiment of the invention.
[0007] FIG. 5 conceptually illustrates an example of a live encoding system extending frames to compensate for delayed input frames in accordance with an embodiment of the invention.
[0008] FIG. 6 conceptually illustrates an alternative example of a live encoding system extending frames to compensate for delayed input frames in accordance with an embodiment of the invention.
[0009] FIG. 7 conceptually illustrates an example of a live encoding system replicating frames to compensate for system load in accordance with an embodiment of the invention.
[0010] FIG. 8 is a data flow diagram for a live encoding system and a streaming in accordance with an embodiment of the invention.
[0011] FIG. 9 is an example of a Media Presentation Description (MPD) data model for MPEG-DASH that can be utilized by embodiments of the invention.
[0012] FIG. 10 conceptually illustrates an architecture of a live encoding server in accordance with an embodiment of the invention.
DETAILED DISCLOSURE OF THE INVENTION
[0013] Turning now the drawings, live encoding systems in accordance with embodiments of the invention are illustrated. In several embodiments, the live encoding systems receive live media feeds such as (but not limited to) sporting events, live news coverage, web live streams, and/or singular or multiplexed streams of media. Streams of media contain multimedia that is constantly received by and presented to a client while being delivered by a provider. Streaming refers to the process of delivering media via streams. Live encoding systems can provide streams of media to clients encoded from a live input stream. Moreover, live encoding systems can encode received live media feeds into several different adaptive bitrate streams having different maximum bitrates. The live encoding systems can further transmit the encoded adaptive bitrate streams in live media presentations to streaming clients via protocols including (but not limited to) HTTP requests and/or provide the encoded adaptive bitrate streams to servers for distribution to client devices. Encoding and transmission of live media presentations can be taxing on the hardware used to perform these operations. Embodiments of the invention provide for several techniques to reduce the load on hardware performing live encoding and transmission operations. For instance, live encoding systems in accordance with many embodiments of the invention can assess network and/or server load levels according to several measures. Load is often measured as an amount of work (e.g., computations, encoding operations, memory operations, etc.) a live encoding system is performing. Based on the assessments, the live encoding systems can adjust how frames of video from live media feeds are being encoded. For instance, some embodiments of the live encoding systems replicate a current encoded frame instead of re-encoding said current frame, and then adjust the replicated frame to different bitrates, resolutions, and/or contexts as necessary for the several different adaptive bitrate streams. In addition, various embodiments of the live encoding systems can extend a duration of a current frame being repackaged and/or re- encoded. Utilizing these and other techniques, live encoding systems in accordance with embodiments of the invention can more efficiently handle gaps in received data, slower feeding of data, and/or heavy loads on server hardware.
[0014] Network transmission levels can affect live encoding processes. For instance, when a live media feed suffers interruptions in network transmission levels from the live input stream to the live encoding system, the live encoding system may encounter a gap in incoming data. Gaps in incoming data can produce gaps in output data and/or result in the live encoding system failing to deliver output frames when requested. Live encoding systems in accordance with some embodiments of the invention can assess incoming media feeds to determine when gaps have occurred. These assessments can be based on several measures including (but not limited to) incoming frame rate, incoming bit rates, time between arrived frames, and/or network bandwidth measurements. Live encoding systems in accordance with many embodiments of the invention can compensate for detected gaps in data by replicating frames and/or extending frames during repackaging of incoming media streams into several adaptive bitrate streams. By replicating frames and/or extending frames, the live encoding systems can allow network conditions a chance to stabilize without jeopardizing the availability of frames at the requested time that clients depend on. Specifically, the live encoding system can fall behind the live edge of live streamed media. Clients typically request frames from a live stream at the live edge of the presentation. When used herein, the term "live edge" refers to the most recently encoded segments of the live stream that clients can request without the risk of requesting segments that are not yet available. Requesting not yet available segments result numerous streaming errors such as (but not limited) delays, HTTP not found errors, and can result in bandwidth-clogging repeated requests.
[0015] Server load levels can affect live encoding processes as well. Where a live encoding system is implemented as a live encoding server, the server hardware can become overwhelmed by encoding processes. Where a live encoding server falls behind the live edge, the several adaptive bitrate streams can fail as the clients rely on requests being made at the live edge. Specifically, live streaming clients can request segments of video based on an assumption that live encoding systems generate the segments not slower than real time. Live encoding systems in accordance with many embodiments of the invention can compensate for server load by extending current frames and adjusting timestamps of output frames. The extended frames can produce minor and/or difficult to perceive visual errors but will preserve the request and receive HTTP cycle clients depend on for live streaming. Moreover, live encoding systems in accordance with embodiments of the invention can also compensate for server load by replicated current frames and adjusting their frame contexts as necessary for the output streams. [0016] Having discussed a brief overview of the operations and functionalities live encoding systems in accordance with many embodiments of the invention, a more detailed discussion of systems, servers, and methods for live encoding systems in accordance with embodiments of the invention follows below.
NETWORK ARCHITECTURES FOR LIVE ENCODING SYSTEMS
[0017] A network architecture for a live encoding system in accordance with an embodiment of the invention is illustrated in FIG. 1 . System 100 includes live encoding servers and supporting hardware 102 that includes application servers, database servers, and/or databases as necessary to support live encoding. The live encoding servers and supporting hardware 102 can receive live media content and/or non-live content from content sources 1 14. Content sources 1 14 can include hardware use to provide media to live encoding servers and supporting hardware 102. The media received from content sources 1 14 can include (but is not limited to) web streams, live media broadcasts, television broadcasts, live event coverage, video feeds from live cameras, previously stored media, raw media feeds, encoded media feeds, and/or static files received from local and/or remote storages.
[0018] The live encoding servers and supporting hardware 102 can communicate over network 104 with several groups of devices in order to provide streams of content. The groups of devices include (but are not limited to) web, file, and/or Media Servers 106, computing devices 108, and/or mobile devices 1 12. Users of the devices from these groups of devices can view provided streaming content utilizing local streaming clients. In addition, a web server from web, file, and/or Media Servers 106 can also serve as hosts for additional downstream viewers and/or clients of the provided streaming content.
[0019] As illustrated in FIG. 1 , live encoding servers and supporting hardware 102 includes application servers, database servers, and databases. In various embodiments, live encoding servers and supporting hardware 102 can include varying numbers and types of devices. For instance, live encoding servers and supporting hardware 102 can be implemented as a single computing device where the single computing device has sufficient storage, networking, and/or computing power. However, live encoding servers and supporting hardware 102 may also be implemented using multiple computing devices of various types and multiple locations. For instance, live encoding servers and supporting hardware 102 may be implement as a live encoding server for encoding live media and an HTTP server for responding to HTTP requests for segments encoded by the live encoding server. While live encoding servers and supporting hardware 102 is shown including application servers, database servers, and databases, a person skilled in the art will recognize that the invention is not limited to the devices shown in FIG. 1 and can include additional types of computing devices (e.g., web servers, and/or cloud storage systems).
[0020] In the embodiment illustrated in FIG. 1 , network 104 is the Internet. Live encoding servers and supporting hardware 102 can receive requests and transmit media segments to and from mobile devices 1 12 through network 104 and over a wireless connection 1 10. Wireless connection 1 10 can be (but is not limited to) a 4G connection, a cellular network, a Wi-Fi network, and/or any other wireless data communication link appropriate to the requirements of specific applications. Live encoding servers and supporting hardware 102 can communicate directly with computing devices 108 and web, file, and/or Media Servers 106 through network 104. Other embodiments may use other networks, such as Ethernet or virtual networks, to communicate between devices. A person skilled in the art will recognize that the invention is not limited to the network types shown in FIG. 1 and can include additional types of networks (e.g., intranets, virtual networks, mobile networks, and/or other networks appropriate to the requirements of specific applications).
[0021] Although a specific architecture is shown in FIG. 1 , different architectures involving electronic devices and network communications can be utilized to implement live encoding systems to perform operations and provide functionalities in accordance with embodiments of the invention.
SYSTEMS AND PROCESSES FOR LIVE ENCODING SERVERS
[0022] In live encoding systems, clients often rely on being able to request and receive frames at the live encoding edge. Any interruptions in encoding and/or transmission can result in clients failing to received needed frames, failed HTTP requests, image stuttering, and general frustration by the viewers. Live encoding systems in accordance with numerous embodiments of the invention can use real time analysis of incoming media and/or encoding system loads to mitigate losses and interruptions in live encoding through techniques discussed below.
[0023] FIG. 2 conceptually illustrates a process 200 that can be performed by live encoding systems in accordance with embodiments of the invention in receiving media, generating streams, and providing the generated streams to live streaming clients. In a number of embodiments, the process 200 is performed by a live encoding server in accordance with the embodiment described above in connection with FIG. 1. In particular, process 200 can be performed by an MPEG-DASH live encoding server during continuous live encoding and live streaming of media.
[0024] Media can be received (210). As mentioned above, media can encompass numerous different types, formats, standards, and/or presentations. Often, the received media is a live feed of already encoded media. The received media can include (but not limited to) input streams, live media feeds, television feeds, satellite feeds, web streams, and/or static files received from local and/or remote storages.
[0025] Streams can be generated (220) from the received media. The generated streams can be of many possible formats, such as (but not limited to) MPEG-DASH, H.264/AVC, HTTP Live Streaming, Smooth Streaming, and/or any other adaptive bitrate format. The generated streams can then be provided to streaming clients over a network connection. Typically, the generated streams will be of different maximum bitrates and be encoded according to varying encoding parameters. In some embodiments, streams are generated utilizing a repackaging application of a live encoding server. The repackaging application repackages received media into output streams. Thereby, the repackaging application can utilize utilizing various encoders and decoders as necessary to generate as necessary to generate the streams.
[0026] The generation of streams can be a continuous process that is performed as live media is received. During continuous generation of streams in response to receipt of live media, load levels on the live encoding system, load levels in a communication network, gaps in receipt of media, and/or gaps in generation of streams can be assessed (230). Moreover, different embodiments may assess other aspects of live encoding server operations. Performing said assessments can include several sub- operations. For instance, the live encoding system can check incoming data rates and/or frame rates of the received media. The incoming data rates and/or frame rates of the received media can be compared to frame times determined according to internal logic of the live encoding system. The internal logic can include several sources of determining a reliable time, such as (but not limited to) time stamps of the received media, clock implementations on the live encoding system, and/or the declared frame rate of the received media. In some embodiments, the live encoding systems can measure differences in times between incoming frames in order to calculate an overall incoming data rate. The live encoding systems can then monitor the calculated overall incoming data rate to identify gaps in incoming data or potential surges that may overwhelm the processing power of the live encoding system. One or more of these assessments can indicate that the live encoding system has not received a frame at a proper time and/or will fail to encode a frame in time to meet the live edge requirement for live encoding systems.
[0027] In order to mitigate the risk of failing to generate frames in time for the live edge, frames of received media can optionally be duplicated and/or replicated (240). In some embodiments, the duplicated frames can be modified to account for new frame contexts associated with the various generated streams. Different frame contexts can include (but are not limited to) different resolutions, different frames types (such as I- frames, B-frames, and/or P-frames), different maximum bitrates. Generation of streams from received media often involves re-encoding the received media to a different format where the received media includes encoded frames. Re-encoding of the received media can be among the more resource intensive operations performed by live encoding systems. The duplicated frames can then be utilized in the generated streams without a relatively costly re-encoding operation. Moreover, the duplicated frames can also be duplicated from raw frames from the received media in addition to encoded frames from the received media.
[0028] However, replicating encoded frames instead of re-encoding the frames as a part of a live encoding process can result in the output streams violating certain requirements of the hypothetical reference decoder (HRD) in H.264/AVC. By definition, the HRD shall not overflow nor underflow when its input is a compliant stream. Replicating a large encoded frame and utilizing the replicated stream in a low maximum bitrate stream risks causing a buffer overflow that would fail the HRD requirements. However, software decoder clients can compensate for this without a problem due to their more flexible buffers. The software decoder clients will can require additional CPU cycles to process the replicated frames. Hardware decoder clients will encounter errors due to possible buffer overflows when replicated frames are used in lower maximum bitrate streams. Some embodiments of the invention provide for reducing the bit values of replicated frames for lower maximum bitrate output streams in order to mitigate against the risk of buffer overflows in hardware decoders. In yet other embodiments, duplicated frames are only used for their own specific maximum bitrate output streams; thereby preventing high bit value frames from being utilized low maximum bitrate streams. This can be accomplished by including separate encoding processes for each output stream.
[0029] Moreover, in some embodiments, frames can be replicated and/or duplicated from input streams where the input stream and the output stream share same formats, maximum bitrates, and/or resolutions. This can occur where the desired output stream is the same as the input stream. Where this occurs, re-encoding can be skipped and several embodiments can simply replicated the instantaneous decoding refreshes (IDR) frames from the input streams. As discussed above, the resulting output stream can be non-HRD compliant in said several embodiments.
[0030] In a further technique to mitigate the risk of failing to generate frames in time for the live edge, frames of received media can optionally be extended (250). Extending frames can include packaging a given frame into an output stream at times different than the given frame's assigned time stamp. Depending on previous assessments, different extensions of frames may occur. Where a gap is detected in feeding and/or receiving of media, a current frame may be extended in generation of the output streams. In embodiments utilizing a repackaging application as a part of a live encoding server, the repackaging application can perform the extension during repackaging of frames into output streams. In order to reduce visual artifacts and/or perceptual stalls in video, the repackaging application can spread several smaller frame extensions over multiple frames in order to compensate for the gap in multiple steps. The smaller extensions can serve to conceal the extensions from streaming client viewers.
[0031] The generated output streams can be provided (260) to streaming clients. The generated output streams can be at different maximum bitrate yet each represent a single media presentation. Thus, a given media presentation can be provided to streaming clients in several streams having different maximum bitrates. The provision of generated output streams can be accomplished via HTTP requests for segments from the generated output streams.
[0032] While the operations presented in process 200 are presented in a linear order, various embodiments can perform said operations in varying orders. For instance, the generation and provision of streams to clients can be performed continuously as live media is received. Thus, the order of operations presented in process 200 is merely demonstrative and can be performed continuously as a part of a cyclical process for live generation of streams from frames of received media. Having discussed an overview of processes performed by live encoding systems of some embodiments, the following discussion will provide several examples of frame extension and frame replication that can performed as a part of said processes.
EXAMPLES OF FRAME EXTENSION AND FRAME REPLICATION
[0033] As discussed above, live encoding systems in accordance with embodiments of the invention can extend frames and/or replicate frames in response to assessed network and/or server conditions. Frame extensions and/or frame replications can compensate for dropped input frames, delayed input frames, and/or encoding system load. FIG. 3, FIG. 4, FIG. 5, FIG. 6, and FIG. 7 conceptually illustrate several examples of frame extension and frame duplication in accordance with embodiments of the invention. The examples presented in the aforementioned figures are abstractions of the live encoding process illustrated to show the effects of frame replications and/or frame extensions. Live encoding systems in accordance with embodiments of the invention will include additional details, components, and/or functionalities not illustrated in the examples in FIG. 3, FIG. 4, FIG. 5, FIG. 6, and FIG. 7. The specific numbers for time stamps, frame numbers, and/or frame durations are presented for demonstrative purposes. Embodiments of the invention are not limited to the specific values presented in FIG. 3, FIG. 4, FIG. 5, FIG. 6, and FIG. 7 and can incorporate wide ranges of possible time stamps, frame numbers, and/or frame durations as required for live encoding operations. Moreover, while only a single output stream is shown in the following figures, embodiments of the invention typically generate multiple output streams at varying maximum bitrates with varying encoding parameters.
[0034] FIG. 3 conceptually illustrates an example a live encoding system extending frames to compensate for missing input frames in accordance with an embodiment of the invention. As shown, live encoding system 300 in receiving an input stream 310 and generating an output stream 360. In the example illustrated in FIG. 3, the live encoding processes of live encoding system 300 are performed during continuous receipt of input stream 310 and generation of output stream 360. Input stream 310 can be any of the input streams and/or media discussed above. Live encoding system 360 can provide the generated output stream 360 to streaming clients (not shown) via any of the techniques discussed above. Techniques such as receiving HTTP requests and transmitting segments from the output stream.
[0035] As shown, input stream input stream 310 includes several frames with identified time stamps and durations. The frames can include portions of media, such as frames video. Time stamps are indicated by the abbreviation "TS". Durations are indicated by the abbreviation "D". As mentioned previously, the values shown in FIG. 3 are demonstrative. Embodiments of the invention can receive and process various different time stamp and duration values as necessary to support live encoding. Frame 5 320 has a time stamp value equal to 5 and a duration value equal to 1 .
[0036] Live encoding system 300 expects to receive frames from input stream 310 at specified times. When frames are not received at the specified times, live encoding system 300 may not be able to generate the output stream 360 in time for the live edge expected by live streaming clients. Live encoding system 300 can assess whether frames are missing from the input stream 310 using a variety of measures as discussed above. Such as comparing internal clocks maintained by the live encoding system 300 to the time stamps of the received frames of the live input stream 310. Live encoding system 310 can also include thresholds for missing frames that must be met before extending frames. Live encoding system 310 includes a threshold of two missing frames before electing to extending frames to compensate for the at least two frame gap. Different embodiments may include different thresholds that can be based on a different number of frames and/or a different threshold measurement, such as missing frames over a segment of time instead of missing frames in sequence. Live encoding of video is inherently a resource intensive process, thus various embodiments can utilize a variety of thresholds in connection with assessing encoding conditions, such encoding system loads, client stuttering, network bandwidth stability, video quality, and other metrics and/or conditions that can affect live encoding of video. As discussed above, specific counts of frames and their delivery can be calculated and compared to different thresholds of frame counts and times in different embodiments of the invention. Furthermore, different embodiments can use different metrics for assessing such streaming conditions, processing cycle counts, time benchmarks for encoding of sets of frames, network transfer rates, delivered and displayed framerates, and various measurements of visual quality/fidelity. While specific values are not provided herein, different specific values (such as dips below 24 frames per second, visual errors causing display failures in excess of certain gamma values, frames encoded per second, etc.) can be utilized as necessary to implement the invention without departing from the spirit of the invention.
[0037] Input frames can go missing under a variety of different circumstances, such (but not limited to) when there is a failure in the network connection between the provider of the input stream and the live encoding system, when there is fault in the input stream, and/or internal errors of the live encoding system. As shown, input stream 310 is missing frames 330 and frames 340. Live encoding system 300 can detect this gap by comparing the time stamp of frame 8 350 to the time stamp of frame 5 320 and an internal clock maintained by live encoding system 300. Once the missing frame threshold is met, live encoding system 300 can extend frames to compensate for the gap in frames. Various embodiments can use different thresholding schemes, including any of those discussed above.
[0038] As shown, live encoding system 300 extends frame 5 320 from the input stream 310 in generating output stream 360. Extended frame 370 is extended to have a duration value equal to 3 in order to cover the missing frames 330 and 340. Extended frame 370 will be available when requested by live streaming clients and preserves the live edge required to support uninterrupted live streaming. However, extending frame durations can result in visual artifacts if used excessively.
[0039] FIG. 4 conceptually illustrates an alternate method of extending frame durations that helps to conceal the effects of frame extensions. As shown, live encoding system 400 is generating an output stream 460 from an input stream 410. Input stream 410 is missing frames 430 and 440. In order to compensate for this gap, live encoding system 400 can extend the durations of frame 5 420 and frame 8 450, and also adjust the time stamp value of frame 8 450. As shown in output stream 460, extended frame 5 470 has been extended to have a duration value of 2 and extended frame 8 480 has been extended to have a duration value of 2 as well. However, the time stamp for extended frame 8 470 has been adjusted to be at 7 such that extended frame 8 480 will be available immediately after extended frame 5 470. By distributing extensions around missing frames, live encoding system 400 can conceal some of visual artifacts cause by frame duration extensions.
[0040] FIG. 5 conceptually illustrates an example a live encoding system extending frames to compensate for delayed input frames in accordance with an embodiment of the invention. As shown, live encoding system 500 is generating output stream 560 from input stream 510. However, frame delays 530 and 540 result in frame 6 550 arriving late. Live encoding system 500 can detect the frame delay and use frame duration extension to compensate. Unlike previous examples, there will be no lost frames. Live encoding system 500 generates output stream 560 which include extended frame 5 with a duration extended to 3 and frame 6 580 with a time stamp value adjusted to 8. Extended frame 570 will be available when requested by live streaming clients and preserves the live edge required to support uninterrupted live streaming. Similarly to the examples discussed above, extending frame durations can result in visual artifacts if used excessively.
[0041] FIG. 6 conceptually illustrates an alternate method of extending frame durations to compensate for frame delays that helps to conceal the effects of frame extensions. As shown, live encoding system 600 is generating an output stream 660 from an input stream 610. As above, frame delays occur at 630 and 640. In order to compensate for this delay, live encoding system 600 can extend the durations of frame 5 620 and frame 6 650, and also adjust the time stamp value of frame 6 650. As shown in output stream 660, extended frame 8 670 has been extended to have a duration value of 2 and extended frame 8 has been extended to have a duration value of 2 as well. However, the time stamp for extended frame 8 670 has been adjusted to be at 7 such that extended frame 8 670 will be available immediately after extended frame 5 670. By distributing extensions around delayed frames, live encoding system 400 can conceal some of visual artifacts cause by frame duration extensions.
[0042] Embodiments of the invention are not limited to the frame extensions techniques discussed above with respect to FIG. 3, FIG. 4, FIG. 5, and FIG. 6. Various embodiments can utilize sequential extensions of frame durations as shown in FIG. 3 and FIG. 5 and/or interspersed extensions of frame durations as shown in FIG. 4 and FIG. 5 in different circumstances. Furthermore, extending frame durations is not limited to being performed due to missing and/or delayed frames.
[0043] Live encoding servers typically are very powerful and expensive machines that need significant computing power to encoding live streams that meet the live edge requirement. However, even powerful servers can become overloaded and lesser servers even more so. In particular, re-encoding encoded frames can be a serious drain on server resources. FIG. 7 conceptually illustrates an example a live encoding system extending frames to compensate for server load in accordance with an embodiment of the invention. As shown, live encoding system 700 in receiving an input stream 710 and generating an output stream 760. In the example illustrated in FIG. 7, the live encoding processes of live encoding system 700 are performed during continuous receipt of input stream 710 and generation of output stream 760. Live encoding system 700 is shown under load 740. In order to compensate for this load, live encoding system 700 can replicate frames from encoded input stream in the encoded domain.
[0044] As shown, live encoding system 700 receives encoded frame 4 720 and encoded frame 5 730. Live encoding system 700 replicates these frames in generating encoded output stream 750. Frame fields for replicated frame 4 760 and replicated frame 5 770 may have to be adjusted in order to account for the new frame context. However, these adjustments can require significantly less processing resources as compared to re-encoding operations. Replicated frame 4 760 and replicated frame 5 770 have the same duration values and time stamp values as encoded frame 4 720 and encoded frame 5 730.
[0045] Embodiments of the invention are not limited to the specific frame replication techniques discussed above in the example conceptually illustrated in FIG. 7. Various embodiments can utilize frame replication and/or duplication with various formats of input streams, such as raw, un-encoded input streams. Moreover, embodiments of the invention are not limited to performing frame replication and/or frame duplication only during times of server load. For instance, some embodiments of the invention can perform encoded frame replication as a part of a continuous encoding process to maintain efficient live encoding without waiting until server load reaches critical levels. Said some embodiments could be utilized on lower powered live encoding servers.
MPEG-DASH LIVE ENCODING
[0046] MPEG-DASH (ISO/IEC 23009-1 ) is a standard for streaming multimedia content over the internet. MPEG-DASH was developed by the Moving Picture Expert Group (MPEG). MPEG has been responsible for developing previous multimedia standards, including MPEG-2, MPEG-4, MPEG-7, MPEG-21 and others. MPEG-DASH provides for adaptive segmented media delivery using HTTP. The MPEG-DASH specification only defines the MPD and the segment formats. Of note, the delivery of the MPD and the media-encoding formats containing the segments, as well as the client behavior for fetching, adaptation heuristics, and playing content, are undefined within the MPEG-DASH standard.
[0047] FIG. 8 conceptually illustrates an example data flow diagram for a live encoding system utilizing MPEG-DASH in accordance with an embodiment of the invention. FIG. 8 includes a media feed data 810, a live encoding system 820, a HTTP requests 830, requested stream segments 840, a streaming client 850, and media presentation description 860. Though not shown, media feed data 810, HTTP requests 830, requested stream segments 840, and media presentation description 860 can be transmitted over a communication network. The communication network can include (but is not limited to) the internet.
[0048] As shown, live encoding system 820 is receiving media feed data 810. Media feed data 810 can include at least the types of received media discussed above. Live encoding system 820 can generate output streams from the received media feed data 810. During generation of the output streams from the received media feed data 810, live encoding system 820 can replicate frames from the media feed data 810 and/or extend frames from the media feed data 810 based on assessments of the rate of receipt of media feed data 810, load levels on the live encoding system 820, load levels in the communication network supporting the transmission of media feed data 810, gaps in the media feed data 810, and/or gaps in generation of streams by the live encoding system 820.
[0049] Live encoding system 820 also receives HTTP requests 830. In response to the HTTP requests, live encoding system 820 provides requested stream segments 840. HTTP requests 830 can include byte range requests for a specific segment from one of the generated output streams. Live encoding system 820 can include multiple components, including separate live encoding servers and HTTP servers. The HTTP servers can support the HTTP communication of media segments and requests with clients. Moreover, the HTTP servers can utilize HTTP-based Content Distribution Networks (CDNs) to assist in delivery of media segments to streaming client 850.
[0050] MPEG-DASH uses a Media Presentation Description (MPD) to provide clients with a well structured XML manifest describing several adaptive bitrate streams that can be accessed via HTTP requests for stream segments. Each MPD corresponds to a single media presentation that can be viewed via the several described adaptive bitrate streams. The MPD describes accessible media segments and corresponding timings for the accessible media segments. The MPD is a hierarchical data model including (descending from the top of the hierarchy) a media presentation, periods, adaptation sets, representations, and segments. A media presentation can include to a live broadcast, a live stream, a live event, and/or a pre-recorded media presentation. A media presentation can be spliced and/or include several periods. The periods are by default unlinked and can have advertising periods spliced between them without any loss of functionality. Periods can include several adaptation sets. Adaptation sets can include different perspectives on the same presentation, such as different cameras from a live sporting event. In addition, different adaptation sets can include different formats, such as audio adaptation sets and video adaptation sets. Within each adaptation set, several representations may be included. Representations support the selection of different bandwidth and/or maximum bitrate levels form the same presentation. Thus, clients of MPEG-DASH can use adaptive bitrate streaming by switching to different representations as bandwidth and/or client loading allows. Each representation includes segments of media that can be requested via HTTP. The HTTP requests are received on pre-formatted URLs associated with each segment.
[0051] FIG. 9 conceptually illustrates an example Media Presentation Description MPD data model from MPEG-DASH. As shown, media presentation 910 includes several periods 915-925. The periods 915-925 each include different period start times. Period 920 at start time 100 seconds is expanded to show several included adaptation sets 925-930. Adaptation set 1 925 includes video from camera 1 of media presentation 910. Adaptation set 2 930 includes audio for media presentation 910. Adaptation set 3 935 includes video from camera 2 of media presentation 910. Adaptation set 1 925 has been expanded to show representation 1 940 and representation 2 945. Representation 1 940 is a 500kb/s representation for adaptation set 1 925 whereas representation 2 945 is a 250kb/s representation for adaptation set 1 925. Within representation 1 940 are initialization segment 100 and media segments 955-965. These segments are requested by streaming clients via HTTP to receive the media contained within them.
[0052] Of note, instances of ellipses illustrated in FIG. 9 indicate the possibility of additional periods, adaptation sets, presentations, and segments. The example MPD presented in FIG. 9 is merely one possible example from any variety of configurations supported by various embodiments of the invention. For instance, different embodiments of the invention can support many other maximum bitrates than those provided for demonstrative purposes in the embodiment illustrated in FIG. 9. LIVE ENCODING SERVER ARCHITECTURE
[0053] An architecture of a live encoding server 1000 in accordance with an embodiment of the invention is illustrated in fig. 10. The live encoding server 1000 includes a processor 1010 in communication with non-volatile memory 1030, volatile memory 1020, and a network interface 1040. In the illustrated embodiment, the nonvolatile memory includes input data handling application 1050, demuxer application 1055, repackager application 1060, MPD combination application 1065, MPD generation application 1070, HTTP request application 1075, audio decoder application 1080, audio encoder application 1085, video decoder application 1090, and video encoder application 1095. Of note, the live encoding server 1000 is an mpeg- dash format live encoding server that prepares MPD files for streams and provides segments of output streams to streaming clients through HTTP requests. Other embodiments may utilize different formats and include different applications as necessary to support said different formats.
[0054] The input data handling application 1050 receives input streams from the network interface 1040. The input streams can include (but are not limited to) live streams of video content, media presentations, video only files, audio only files, sporting events, web streams, and/or mpeg-dash standard streams. The input data handling application 1050 can perform additional functions including identification of the input streams. Identification can be performed using metadata included with the input streams and/or assessing of characteristics and parameters of the input streams.
[0055] The demuxer application 1055 demultiplexes individual elementary streams from an input stream. For instance, the demuxer application 1055 can break out the audio, video, and/or subtitle streams within an input stream. The demultiplexed streams can be analyzed, decoded, and reencoded in subsequent operations performed by other applications.
[0056] The repackager application 1060 can perform the re-encoding, duplication, and frame extension operations as a part of the overall live encoding server operations. The repackager application 1060 can receive input streams from the input data handling application 1050, the demuxer application 1055, the network interface 1040, and/or any other component of the live encoding server 1000 as necessary to repackage streams. The repackager application 1060 can re-encode incoming live frames of received media into several output streams utilizing the video decoder application 1090 and the video encoder application 1095 as necessary. During re-encoding operations, the repackager application 1060 can assess network and/or server load levels of the live encoding server 1000 according to several measures. Based on these assessments, the repackager application 1060 can duplicate incoming frames to reduce server load levels and/or extend certain frames to compensate for anticipated drops in incoming network bandwidth. The repackager application 1060 can extend frames by manipulating time codes and/or time stamps of frames to increase their duration in output streams. The repackager application 1060 can provide the repackaged, re-encoded, duplicated, and/or extended frames of output streams to the MPD combination application 1065 and/or the MPD generation application 1070 for preparation for later streaming to clients utilizing the HTTP request application 1075.
[0057] The MPD combination application 1065 combines multiple output streams generated by the repackager application 1060 into a single presentation. The MPD combination application 1070 can generate an MPD file for a combined presentation. As discussed above, the MPD file can describe the periods, adaptation sets, representations, and segments of a media presentation. The MPD combination application 1070 generates MPD's according to characteristics of the generated output streams. These characteristics will vary according to the operations performed by the repackager application 1060. The MPD file is typically the initially requested and provided to streaming clients in order to initiate an mpeg-dash streaming session.
[0058] The HTTP request application 1075 handles HTTP requests and server media segments according to said HTTP requests. The HTTP request application 1075 may communicate to streaming clients through the network interface 1040. In some embodiments, the HTTP request application 1075 is hosted in a separate HTTP server from the live encoding server.
[0059] The non-volatile memory includes audio decoder application 1080, audio encoder application 1085, video decoder application 1090, and video encoder application 1095. While non-volatile memory 1030 only includes a single video decoder application 1090 and a single video encoder application 1095, other embodiments may include multiple video encoder and video decoder applications. Moreover, some embodiments may utilize sets of applications for each output stream in order to have separate repackager, decorder, and encoder applications to generate each different output stream.
[0060] In several embodiments, the network interface 1040 may be in communication with the processor 1010, the volatile memory 1020, and/or the nonvolatile memory 1030. The above discussion of the applications stored in the nonvolatile memory 1030 of the live encoding server 1000 discusses one exemplary set of applications to support the live encoding server 1000. Other embodiments of the invention may utilize multiple servers with the functions discussed below distributed across multiple servers and/or locations as necessary to implement the invention. Furthermore, the applications discussed below could be combined into one or more applications and implemented as software modules as necessary to implement the invention. For instance, the applications discussed below could alternatively be implemented as modules of a single application residing on live encoding server 1000. Moreover, where a single application is shown, other embodiments may utilize multiple applications dedicated to similar functions.
[0061] The various processes discussed above can be implemented on singular, discrete servers. Alternatively, they can each be implemented as shared and/or discrete servers on any number of physical, virtual, or cloud computing devices. Specifically, live encoding systems in accordance with some embodiments of the invention could include separate encoding server(s) and HTTP server(s). Persons of ordinary skill in the art will recognize that various implementations methods may be used to implement the process servers of embodiments of the invention.
[0062] While the above description contains many specific embodiments of the invention, these should not be construed as limitations on the scope of the invention, but rather as an example of one embodiment thereof. Accordingly, the scope of the invention should be determined not by the embodiments illustrated, but by the appended claims and their equivalents.

Claims

1 . A method of encoding an input stream into a plurality of adaptive bitrate streams using a live encoding system, the method comprising:
receiving an input stream using a live encoding system;
assessing encoding conditions using the live encoding system;
encoding a given segment of the input stream into a plurality of adaptive bitrate segments using the live encoding system, wherein encoding the given segment of the input stream into the plurality of adaptive bitrate segments comprises:
extending at least one frame from the given segment of the input stream and using the extended at least one frame from the segment of the input stream in at least one of the plurality of adaptive bitrate segments when the assessed encoding conditions satisfy a first threshold;
replicating at least one frame from the given segment of the input stream and using the replicated at least one frame from the segment of the input stream in at least one of the plurality of adaptive bitrate segments when the assessed encoding conditions satisfy a second threshold; and
encoding frames of the given segment of the input stream into the plurality of adaptive bitrate segments using the live encoding system when the assessed encoding conditions do not satisfy either the first threshold or the second threshold.
2. The method of claim 1 further comprising streaming the encoded plurality of adaptive bitrate segments to a plurality of streaming clients over at least one network.
3. The method of claim 1 , wherein assessing encoding conditions using the live encoding system further comprises determining an arrival time between a previous segment and the given segment, and wherein the first threshold is a particular amount of time between the arrival of different segments.
4. The method of claim 3, wherein determining the arrival time between the previous segment and the given segment further comprises comparing time stamp differences between the previous segment and the given segment to a clock implementation on the live encoding system.
5. The method of claim 1 , wherein assessing encoding conditions using the live encoding system further comprises calculating an amount of data being received from the input stream using the live encoding system, and wherein the first threshold is a particular quantity of data received over a specified amount of time.
6. The method of claim 1 , wherein the plurality of adaptive bitrate segments are encoded into different maximum bitrates.
7. The method of claim 1 , wherein the input stream is a live stream.
8. The method of claim 7, wherein the live stream comprises media selected from the group of audio tracks, video tracks, subtitle tracks, and multimedia tracks.
9. The method of claim 1 , wherein assessing encoding conditions using the live encoding system further comprises calculating an amount of load on the live encoding system using the live encoding system, and wherein the second threshold is a particular amount of computational work that the live encoding system is performing.
10. The method of claim 1 , wherein the input stream has a particular encoding, and wherein the replicated at least one frame from the segment of the input stream has the same particular encoding.
1 1 . A live encoding system, the live encoding system comprising:
at least one processing unit;
a memory storing a live encoding application comprising computer instructions, wherein the live encoding application instructs the at least one processing unit to:
receive an input stream;
assess encoding conditions ;
encode a given segment of the input stream into a plurality of adaptive bitrate segments, wherein the instructions to encode the given segment of the input stream into the plurality of adaptive bitrate segments further comprise instructions to:
extend at least one frame from the given segment of the input stream and using the extended at least one frame from the segment of the input stream in at least one of the plurality of adaptive bitrate segments when the assessed encoding conditions satisfy a first threshold;
replicate at least one frame from the given segment of the input stream and using the replicated at least one frame from the segment of the input stream in at least one of the plurality of adaptive bitrate segments when the assessed encoding conditions satisfy a second threshold; and
encode frames of the given segment of the input stream into the plurality of adaptive bitrate segments when the assessed encoding conditions do not satisfy either the first threshold or the second threshold.
12. The live encoding system of claim 1 1 , wherein the live encoding application further includes instructions to stream the encoded plurality of adaptive bitrate segments to a plurality of streaming clients over at least one network.
13. The live encoding system of claim 1 1 , wherein the instructions to assess encoding conditions further comprises instructions to determine an arrival time between a previous segment and the given segment, and wherein the first threshold is a particular amount of time between the arrival of different segments.
14. The live encoding system of claim 13, wherein the instructions to determine the arrival time between the previous segment and the given segment further comprises instructions to compare time stamp differences between the previous segment and the given segment to a clock implementation on the live encoding system.
15. The live encoding system of claim 1 1 , wherein the instructions to assess encoding conditions further comprises instructions to calculating an amount of data being received from the input stream, and wherein the first threshold is a particular quantity of data received over a specified amount of time.
16. The live encoding system of claim 1 1 , wherein the plurality of adaptive bitrate segments are encoded into different maximum bitrates.
17. The live encoding system of claim 1 1 , wherein the input stream is a live stream.
18. The live encoding system of claim 17, wherein the live stream comprises media selected from the group of audio tracks, video tracks, subtitle tracks, and multimedia tracks.
19. The live encoding system of claim 1 1 , wherein the instructions to assess encoding conditions further comprises instructions to calculating an amount of load on the live encoding system, and wherein the second threshold is a particular amount of computational work that the live encoding system is performing.
20. The live encoding system of claim 1 1 , wherein the input stream has a particular encoding, and wherein the replicated at least one frame from the segment of the input stream has the same particular encoding.
PCT/US2016/019955 2015-02-27 2016-02-26 Systems and methods for frame duplication and frame extension in live video encoding and streaming WO2016138493A1 (en)

Priority Applications (8)

Application Number Priority Date Filing Date Title
SG11201706160UA SG11201706160UA (en) 2015-02-27 2016-02-26 Systems and methods for frame duplication and frame extension in live video encoding and streaming
KR1020177023590A KR101897959B1 (en) 2015-02-27 2016-02-26 System and method for frame replication and frame extension in live video encoding and streaming
CN201680012053.2A CN107251008B (en) 2015-02-27 2016-02-26 System and method for frame replication and frame expansion in live video encoding and streaming
EP19206431.9A EP3627337A1 (en) 2015-02-27 2016-02-26 Systems and methods for frame duplication and frame extension in live video encoding and streaming
ES16756526T ES2768979T3 (en) 2015-02-27 2016-02-26 System and method for frame duplication and frame magnification in streaming and encoding of live video
JP2017544732A JP6588987B2 (en) 2015-02-27 2016-02-26 System and method for frame copying and frame expansion in live video encoding and streaming
EP16756526.6A EP3262523B1 (en) 2015-02-27 2016-02-26 System and method for frame duplication and frame extension in live video encoding and streaming
HK18105666.2A HK1246423A1 (en) 2015-02-27 2018-05-02 Systems and methods for frame duplication and frame extension in live video encoding and streaming

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201562126393P 2015-02-27 2015-02-27
US62/126,393 2015-02-27

Publications (1)

Publication Number Publication Date
WO2016138493A1 true WO2016138493A1 (en) 2016-09-01

Family

ID=56789873

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2016/019955 WO2016138493A1 (en) 2015-02-27 2016-02-26 Systems and methods for frame duplication and frame extension in live video encoding and streaming

Country Status (9)

Country Link
US (3) US10715574B2 (en)
EP (2) EP3627337A1 (en)
JP (2) JP6588987B2 (en)
KR (1) KR101897959B1 (en)
CN (1) CN107251008B (en)
ES (1) ES2768979T3 (en)
HK (1) HK1246423A1 (en)
SG (1) SG11201706160UA (en)
WO (1) WO2016138493A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10715574B2 (en) 2015-02-27 2020-07-14 Divx, Llc Systems and methods for frame duplication and frame extension in live video encoding and streaming

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10291681B2 (en) * 2015-06-18 2019-05-14 Ericsson Ab Directory limit based system and method for storing media segments
JP2018041340A (en) * 2016-09-08 2018-03-15 富士通株式会社 Information processing system
US10652294B2 (en) * 2016-10-31 2020-05-12 Google Llc Anchors for live streams
JP7077396B2 (en) * 2017-04-21 2022-05-30 ゼニマックス メディア インク. Systems and methods for postponed post-process of video encoding
US11461070B2 (en) * 2017-05-15 2022-10-04 MIXHalo Corp. Systems and methods for providing real-time audio and data
US10652166B2 (en) * 2017-06-27 2020-05-12 Cisco Technology, Inc. Non-real time adaptive bitrate recording scheduler
EP3616407A1 (en) * 2017-12-08 2020-03-04 Google LLC Modifying digital video content
WO2019210152A1 (en) * 2018-04-26 2019-10-31 Phenix Real Time Solutions, Inc. Adaptive bit-rate methods for live broadcasting
US11146852B2 (en) * 2018-05-11 2021-10-12 Qualcomm Incorporated Signaling missing sections of media data for network streaming in a segment
CN110798739B (en) * 2019-11-11 2021-10-08 四川东方网力科技有限公司 HTML 5-based video real-time target attribute superposition display method, device and equipment
CN111083162B (en) * 2019-12-30 2022-08-23 广州酷狗计算机科技有限公司 Multimedia stream pause detection method and device
US11425182B1 (en) * 2020-12-30 2022-08-23 Meta Platforms, Inc. Systems and methods for dynamically encoding media streams
KR102568415B1 (en) * 2021-04-28 2023-08-21 (주)이머시브캐스트 HMD-based PC game expansion system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7339993B1 (en) * 1999-10-01 2008-03-04 Vidiator Enterprises Inc. Methods for transforming streaming video data
US20120307886A1 (en) * 2011-05-31 2012-12-06 Broadcom Corporation Adaptive Video Encoding Based on Predicted Wireless Channel Conditions
US20130114744A1 (en) * 2011-11-06 2013-05-09 Akamai Technologies Inc. Segmented parallel encoding with frame-aware, variable-size chunking
US20140019593A1 (en) * 2012-07-10 2014-01-16 Vid Scale, Inc. Quality-driven streaming

Family Cites Families (234)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5400401A (en) 1992-10-30 1995-03-21 Scientific Atlanta, Inc. System and method for transmitting a plurality of digital services
JP2707950B2 (en) 1993-07-30 1998-02-04 ソニー株式会社 Digital image information processing device
US5596564A (en) 1993-10-08 1997-01-21 Matsushita Electric Industrial Co., Ltd. Information recording medium and apparatus and method for recording and reproducing information
US6157391A (en) * 1994-01-10 2000-12-05 International Business Machines Corporation Method and apparatus for providing slow motion video at normal motion play speed
JPH07327029A (en) 1994-05-31 1995-12-12 Fujitsu Ltd Ciphering communication system
JP2842796B2 (en) 1994-12-06 1999-01-06 富士通株式会社 Moving image encryption processing method and apparatus, and encrypted moving image decryption processing method and apparatus
US6002834A (en) 1995-02-24 1999-12-14 Hitachi, Ltd. Optical disk having table relating sector address and time and optical disk reproducing apparatus
US6009237A (en) 1995-02-24 1999-12-28 Hitachi Ltd. Optical disk and optical disk reproduction apparatus
US5813010A (en) 1995-04-14 1998-09-22 Kabushiki Kaisha Toshiba Information storage and information transmission media with parental control
TW430785B (en) 1995-04-14 2001-04-21 Toshiba Corp Interactively broadcasting recording-media and its regeneration system
CN100351911C (en) 1995-08-21 2007-11-28 松下电器产业株式会社 Multimedia optical disk reproducing device for realizing unexpected scene unfold according to interactive control
TW436777B (en) 1995-09-29 2001-05-28 Matsushita Electric Ind Co Ltd A method and an apparatus for reproducing bitstream having non-sequential system clock data seamlessly therebetween
TW385431B (en) 1995-09-29 2000-03-21 Matsushita Electric Ind Co Ltd A method and an apparatus for encoding a bitstream with plural possible searching reproduction paths information useful in multimedia optical disk
CA2237293A1 (en) 1995-12-29 1997-07-10 Scientific-Atlanta, Inc. Method and apparatus for providing conditional access in connection-oriented, interactive networks with a multiplicity of service providers
US6957350B1 (en) 1996-01-30 2005-10-18 Dolby Laboratories Licensing Corporation Encrypted and watermarked temporal and resolution layering in advanced television
US6065050A (en) 1996-06-05 2000-05-16 Sun Microsystems, Inc. System and method for indexing between trick play and normal play video streams in a video delivery system
US5805700A (en) 1996-10-15 1998-09-08 Intel Corporation Policy based selective encryption of compressed video data
FI106071B (en) 1997-03-13 2000-11-15 Nokia Mobile Phones Ltd Adaptive filter
EP0866461A3 (en) 1997-03-19 2001-11-07 Sony Corporation Video data decoding and video signal reproduction
US6654933B1 (en) 1999-09-21 2003-11-25 Kasenna, Inc. System and method for media stream indexing
US6970564B1 (en) 1998-07-13 2005-11-29 Sony Corporation Data multiplexing device, program distribution system, program transmission system, pay broadcast system, program transmission method, conditional access system, and data reception device
FI103003B (en) 1997-06-13 1999-03-31 Nokia Corp Filtering procedure, filter and mobile terminal
KR100242448B1 (en) 1997-06-28 2000-02-01 윤종용 Apparatus and method for controlling high speed reproducing for digital video disk
US6594699B1 (en) 1997-10-10 2003-07-15 Kasenna, Inc. System for capability based multimedia streaming over a network
US6057832A (en) 1997-12-02 2000-05-02 V Soft Ltd. Method and apparatus for video-on-demand with fast play capability
JP4186242B2 (en) 1997-12-26 2008-11-26 ソニー株式会社 Image signal processing apparatus and image signal processing method
US6751623B1 (en) 1998-01-26 2004-06-15 At&T Corp. Flexible interchange of coded multimedia facilitating access and streaming
US7809138B2 (en) 1999-03-16 2010-10-05 Intertrust Technologies Corporation Methods and apparatus for persistent control and protection of content
US6320905B1 (en) 1998-07-08 2001-11-20 Stream Machine Company Postprocessing system for removing blocking artifacts in block-based codecs
US7457415B2 (en) 1998-08-20 2008-11-25 Akikaze Technologies, Llc Secure information distribution system utilizing information segment scrambling
US6931531B1 (en) 1998-09-02 2005-08-16 Matsushita Electric Industrial Co., Ltd. Image object recording, compression, and encryption method and system
US6351538B1 (en) 1998-10-06 2002-02-26 Lsi Logic Corporation Conditional access and copy protection scheme for MPEG encoded video data
WO2000022623A1 (en) 1998-10-12 2000-04-20 Matsushita Electric Industrial Co., Ltd. Information recording medium, apparatus and method for recording or reproducing data thereof
US6850965B2 (en) * 1998-11-17 2005-02-01 Arthur Douglas Allen Method for connection acceptance and rapid determination of optimal multi-media content delivery over network
CA2289958C (en) 1998-11-19 2003-01-21 Tomoyuki Okada Information recording medium, apparatus and method for recording or reproducing data thereof
US6236764B1 (en) 1998-11-30 2001-05-22 Equator Technologies, Inc. Image processing circuit and method for reducing a difference between pixel values across an image boundary
EP1021048A3 (en) 1999-01-14 2002-10-02 Kabushiki Kaisha Toshiba Digital video recording system and its recording medium
AU2515800A (en) 1999-01-26 2000-08-07 Infolio, Inc. Universal mobile id system and method for digital rights management
JP3433125B2 (en) 1999-01-27 2003-08-04 三洋電機株式会社 Video playback device
JP3715533B2 (en) 1999-02-05 2005-11-09 株式会社東芝 Information storage medium for stream information, recording method, reproducing method, recording apparatus, and reproducing apparatus
DE19906449C1 (en) 1999-02-16 2000-08-10 Fraunhofer Ges Forschung Multimedia data stream encryption method provides first section of useful data block which is not encrypted for providing preview or prelist function
DE19906450C1 (en) 1999-02-16 2000-08-17 Fraunhofer Ges Forschung Generating encoded useful data flow involves producing encoded version of useful data key using asymmetrical encoding and entering in useful data stream header block
JP3805985B2 (en) 1999-02-18 2006-08-09 株式会社東芝 Stream data information storage medium, recording method, reproducing method, recording apparatus, and reproducing apparatus
US6415031B1 (en) 1999-03-12 2002-07-02 Diva Systems Corporation Selective and renewable encryption for secure distribution of video on-demand
WO2000055854A1 (en) 1999-03-17 2000-09-21 Kabushiki Kaisha Toshiba Method for recording stream data and its data structure
EP1039468A3 (en) 1999-03-19 2000-10-04 Deutsche Thomson-Brandt Gmbh Method for implementing trickplay modes in a data stream recorder
US6912513B1 (en) 1999-10-29 2005-06-28 Sony Corporation Copy-protecting management using a user scrambling key
US7151832B1 (en) 1999-11-18 2006-12-19 International Business Machines Corporation Dynamic encryption and decryption of a stream of data
CN1779689A (en) 2000-01-21 2006-05-31 索尼公司 Data processing apparatus and data processing method
JP2001209583A (en) 2000-01-26 2001-08-03 Sony Corp Recorded data regenerator and method for saved data processing and program distribution media
JP4599740B2 (en) 2000-04-21 2010-12-15 ソニー株式会社 Information processing apparatus and method, recording medium, program, and recording medium
GB2362532B (en) 2000-05-15 2004-05-05 Nokia Mobile Phones Ltd Video coding
KR100448452B1 (en) 2000-06-09 2004-09-13 엘지전자 주식회사 Method for supporting menu of a high-density recording medium
US6871006B1 (en) 2000-06-30 2005-03-22 Emc Corporation Processing of MPEG encoded video for trick mode operation
US7373422B1 (en) 2000-08-04 2008-05-13 Oracle International Corporation Techniques for supporting multiple devices in mobile applications
US6704024B2 (en) 2000-08-07 2004-03-09 Zframe, Inc. Visual content browsing using rasterized representations
US20020164024A1 (en) 2000-08-25 2002-11-07 Hiroshi Arakawa Data transmission method and data relay method
US6453115B1 (en) 2000-08-31 2002-09-17 Keen Personal Media, Inc. Digital video recording system which generates an index data structure for displaying a video stream in trickplay mode
US7242772B1 (en) 2000-09-07 2007-07-10 Eastman Kodak Company Encryption apparatus and method for synchronizing multiple encryption keys with a data stream
US7212726B2 (en) 2000-09-15 2007-05-01 International Business Machines Corporation System and method of processing MPEG streams for file index insertion
US7031393B2 (en) 2000-10-20 2006-04-18 Matsushita Electric Industrial Co., Ltd. Block distortion detection method, block distortion detection apparatus, block distortion removal method, and block distortion removal apparatus
US7450641B2 (en) 2001-09-14 2008-11-11 Sharp Laboratories Of America, Inc. Adaptive filtering based upon boundary strength
US7110664B2 (en) 2001-04-20 2006-09-19 Front Porch Digital, Inc. Methods and apparatus for indexing and archiving encoded audio-video data
US7065213B2 (en) 2001-06-29 2006-06-20 Scientific-Atlanta, Inc. In a subscriber network receiving digital packets and transmitting digital packets below a predetermined maximum bit rate
US6928603B1 (en) * 2001-07-19 2005-08-09 Adaptix, Inc. System and method for interference mitigation using adaptive forward error correction in a wireless RF data transmission system
WO2003010766A1 (en) 2001-07-23 2003-02-06 Matsushita Electric Industrial Co., Ltd. Information recording medium, and apparatus and method for recording information on information recording medium
JP4145586B2 (en) 2001-07-24 2008-09-03 セイコーエプソン株式会社 Image processing apparatus, image processing program, and image processing method
US7426315B2 (en) 2001-09-05 2008-09-16 Zoran Microelectronics Ltd. Method for reducing blocking artifacts
KR100424762B1 (en) 2001-09-06 2004-03-30 삼성전자주식회사 Image data providing system and method thereof
US6983079B2 (en) 2001-09-20 2006-01-03 Seiko Epson Corporation Reducing blocking and ringing artifacts in low-bit-rate coding
US20030077071A1 (en) 2001-10-23 2003-04-24 Shu Lin Fast forward trick mode and reverse trick mode using an information file
JP2003152698A (en) 2001-11-15 2003-05-23 Nippon Hoso Kyokai <Nhk> Contents utilization control transmitting method, contents utilization control receiving method, contents utilization control transmitting device, contents utilization control receiving device, contents utilization control transmitting program and contents utilization control receiving program
PT1978747E (en) 2001-11-29 2014-07-24 Panasonic Ip Corp America Coding distortion removal method
DE60230666D1 (en) 2001-11-29 2009-02-12 Panasonic Corp PROCESS FOR ELIMINATING CODING FORCED AND METHOD FOR VIDEO CODING AND DECODING
US20040037421A1 (en) 2001-12-17 2004-02-26 Truman Michael Mead Parital encryption of assembled bitstreams
US8027470B2 (en) 2002-01-02 2011-09-27 Sony Corporation Video slice and active region based multiple partial encryption
US7065651B2 (en) 2002-01-16 2006-06-20 Microsoft Corporation Secure video card methods and systems
JP2003230089A (en) 2002-01-31 2003-08-15 Toshiba Corp Information storage medium and information recording apparatus
US7174021B2 (en) 2002-06-28 2007-02-06 Microsoft Corporation Systems and methods for providing secure server key operations
US20040022391A1 (en) 2002-07-30 2004-02-05 O'brien Royal Digital content security system and method
WO2004012378A2 (en) 2002-07-30 2004-02-05 Digital Interactive Streams, Inc. Digital content security system and method
US7167560B2 (en) 2002-08-08 2007-01-23 Matsushita Electric Industrial Co., Ltd. Partial encryption of stream-formatted media
EP1550122A4 (en) 2002-09-05 2009-12-02 Lg Electronics Inc Recording medium having data structure for managing reproduction of still images recorded thereon and recording and reproducing methods and apparatuses
CN100495558C (en) 2002-09-06 2009-06-03 Lg电子株式会社 Methdo and device for recording and reproducing data structure for managing still images
US20050144468A1 (en) 2003-01-13 2005-06-30 Northcutt J. D. Method and apparatus for content protection in a personal digital network environment
US7020287B2 (en) 2002-09-30 2006-03-28 Sony Corporation Method and system for key insertion for stored encrypted content
US7295673B2 (en) 2002-10-23 2007-11-13 Divx, Inc. Method and system for securing compressed digital video
KR20040039852A (en) 2002-11-05 2004-05-12 주식회사 디지털앤디지털 Trick play embodiment method using frame index
US8572408B2 (en) 2002-11-05 2013-10-29 Sony Corporation Digital rights management of a digital device
JP2006506772A (en) 2002-11-20 2006-02-23 エルジー エレクトロニクス インコーポレーテッド Recording medium having data structure for managing reproduction of recorded data, and recording and reproduction method and apparatus using the same
US7227901B2 (en) 2002-11-21 2007-06-05 Ub Video Inc. Low-complexity deblocking filter
US9352222B2 (en) * 2002-12-10 2016-05-31 Sony Interactive Entertainment America Llc System and method for capturing text for an online application
EP1602239A1 (en) 2003-03-03 2005-12-07 Koninklijke Philips Electronics N.V. Video encoding
US7007170B2 (en) 2003-03-18 2006-02-28 Widevine Technologies, Inc. System, method, and apparatus for securely providing content viewable on a secure device
KR101030176B1 (en) 2003-04-10 2011-04-18 파나소닉 주식회사 Information recording medium, device and method for recording information in information recording medium
KR20040096718A (en) 2003-05-10 2004-11-17 삼성전자주식회사 Multimedia data decoding apparatus, audio data receiving method and audio data structure therein
KR100492567B1 (en) 2003-05-13 2005-06-03 엘지전자 주식회사 Http-based video streaming apparatus and method for a mobile communication system
US7424501B2 (en) 2003-06-30 2008-09-09 Intel Corporation Nonlinear filtering and deblocking applications utilizing SIMD sign and absolute value operations
JP4411879B2 (en) 2003-07-01 2010-02-10 株式会社ニコン Signal processing apparatus, signal processing program, and electronic camera
US8055910B2 (en) 2003-07-07 2011-11-08 Rovi Solutions Corporation Reprogrammable security for controlling piracy and enabling interactive content
US20050013494A1 (en) 2003-07-18 2005-01-20 Microsoft Corporation In-loop deblocking filter
US7382879B1 (en) 2003-07-23 2008-06-03 Sprint Communications Company, L.P. Digital rights management negotiation for streaming media over a network
JP2005057435A (en) 2003-08-01 2005-03-03 Sony Corp Client equipment, content processing method for client equipment, and contents providing system
DE602004031625D1 (en) 2003-08-07 2011-04-14 Pervenio Ltd SERVER FOR DETERMINING AND SAVING MOBILE DEVICE PERFORMANCE FEATURES
US7853980B2 (en) 2003-10-31 2010-12-14 Sony Corporation Bi-directional indices for trick mode video-on-demand
JP4537083B2 (en) 2004-01-28 2010-09-01 キヤノン株式会社 Data processing apparatus and control method thereof
US9094699B2 (en) 2004-02-05 2015-07-28 Broadcom Corporation System and method for security key transmission with strong pairing to destination client
US7546641B2 (en) 2004-02-13 2009-06-09 Microsoft Corporation Conditional access to digital rights management conversion
US9094615B2 (en) * 2004-04-16 2015-07-28 Intheplay, Inc. Automatic event videoing, tracking and content generation
US7539248B2 (en) 2004-04-29 2009-05-26 Mediatek Incorporation Adaptive de-blocking filtering apparatus and method for MPEG video decoder
US7400679B2 (en) 2004-04-29 2008-07-15 Mediatek Incorporation Adaptive de-blocking filtering apparatus and method for MPEG video decoder
US7397853B2 (en) 2004-04-29 2008-07-08 Mediatek Incorporation Adaptive de-blocking filtering apparatus and method for MPEG video decoder
US7477749B2 (en) 2004-05-12 2009-01-13 Nokia Corporation Integrity protection of streamed content
US8600217B2 (en) 2004-07-14 2013-12-03 Arturo A. Rodriguez System and method for improving quality of displayed picture during trick modes
US7571246B2 (en) 2004-07-29 2009-08-04 Microsoft Corporation Media transrating over a bandwidth-limited network
US7930184B2 (en) * 2004-08-04 2011-04-19 Dts, Inc. Multi-channel audio coding/decoding of random access points and transients
KR100825548B1 (en) 2004-08-17 2008-04-28 마쯔시다덴기산교 가부시키가이샤 Information recording medium, data sorting device, and data reproducing device
WO2006077850A1 (en) 2005-01-18 2006-07-27 Matsushita Electric Industrial Co., Ltd. Data storing method, data reproducing method, data recording device, data reproducing device, and recording medium
US7349886B2 (en) 2005-03-25 2008-03-25 Widevine Technologies, Inc. Securely relaying content using key chains
KR100677026B1 (en) 2005-04-07 2007-02-01 (주)아이티너스 코리아 System for Receiving Or Transmitting Video Contents And Information Security Module Used In the Same
US8683066B2 (en) 2007-08-06 2014-03-25 DISH Digital L.L.C. Apparatus, system, and method for multi-bitrate content streaming
JP4321550B2 (en) 2005-08-31 2009-08-26 ソニー株式会社 Information processing apparatus, information recording medium manufacturing apparatus, information recording medium and method, and computer program
US20070067472A1 (en) 2005-09-20 2007-03-22 Lsi Logic Corporation Accurate and error resilient time stamping method and/or apparatus for the audio-video interleaved (AVI) format
US20070083467A1 (en) 2005-10-10 2007-04-12 Apple Computer, Inc. Partial encryption techniques for media data
CN100425078C (en) * 2005-11-08 2008-10-08 上海广电(集团)有限公司中央研究院 Self-adaptive associated controlling method for coding speed and video quality in bit rate switchover
US7991891B2 (en) 2006-02-02 2011-08-02 Microsoft Corporation Version-specific content searching
EP2043293A4 (en) 2006-07-19 2011-01-19 Panasonic Corp Medium data processing device and medium data processing method
US7743161B2 (en) 2006-10-10 2010-06-22 Ortiva Wireless, Inc. Digital content buffer for adaptive streaming
TW200820783A (en) 2006-10-25 2008-05-01 Realtek Semiconductor Corp Apparatus and method for deblock filtering
US8243924B2 (en) 2007-06-29 2012-08-14 Google Inc. Progressive download or streaming of digital media securely through a localized container and communication protocol proxy
WO2009018171A1 (en) * 2007-07-27 2009-02-05 Synergy Sports Technology, Llc Systems and methods for generating bookmark video fingerprints
US10277956B2 (en) 2007-10-01 2019-04-30 Cabot Communications Method and apparatus for streaming digital media content and a communication system
US20090106082A1 (en) 2007-10-23 2009-04-23 Senti Thad E System and method to facilitate targeted advertising
WO2009065137A1 (en) 2007-11-16 2009-05-22 Divx, Inc. Hierarchical and reduced index structures for multimedia files
US8169916B1 (en) 2007-11-23 2012-05-01 Media Melon, Inc. Multi-platform video delivery configuration
US8131875B1 (en) 2007-11-26 2012-03-06 Adobe Systems Incorporated Device profile assignment based on device capabilities
WO2009109976A2 (en) 2008-03-05 2009-09-11 Webtview Ltd. Distributing internet channels to a media viewer
CN101540645A (en) 2008-03-21 2009-09-23 华为技术有限公司 Method and device for distributing dynamic content
US20090249081A1 (en) 2008-03-31 2009-10-01 Kabushiki Kaisha Toshiba-1 Shibaura 1-Chomominatoku Storage device encryption and method
US8379851B2 (en) 2008-05-12 2013-02-19 Microsoft Corporation Optimized client side rate control and indexed file layout for streaming media
ES2426714T3 (en) 2008-06-06 2013-10-24 Deluxe Digital Studios, Inc. Playing supplementary content of variable length on a media player of fixed length content
WO2010000910A1 (en) * 2008-06-30 2010-01-07 Nokia Corporation Transmission capacity probing using adaptive redundancy adjustment
CN101426179A (en) 2008-09-22 2009-05-06 深圳华为通信技术有限公司 Service activation method, service providing method, terminal equipment and server
EP2180664A1 (en) 2008-10-22 2010-04-28 Vivendi Mobile Entertainment System and method for accessing multi-media content via a mobile terminal
US9210431B2 (en) * 2008-11-13 2015-12-08 Thomson Licensing Multiple thread video encoding using GOP merging and bit allocation
CA2755774C (en) * 2009-03-19 2015-01-06 Azuki Systems, Inc. Method for scalable live streaming delivery for mobile audiences
US9380091B2 (en) * 2012-06-12 2016-06-28 Wi-Lan Labs, Inc. Systems and methods for using client-side video buffer occupancy for enhanced quality of experience in a communication network
US20130290492A1 (en) * 2009-06-12 2013-10-31 Cygnus Broadband, Inc. State management for video streaming quality of experience degradation control and recovery using a video quality metric
JP5826747B2 (en) * 2009-06-26 2015-12-02 トムソン ライセンシングThomson Licensing Video encoding and decoding method and apparatus using adaptive geometric partitioning
CN102792291B (en) 2009-08-17 2015-11-25 阿卡麦科技公司 Based on the method and system of the stream distribution of HTTP
US9203816B2 (en) 2009-09-04 2015-12-01 Echostar Technologies L.L.C. Controlling access to copies of media content by a client device
US8473558B2 (en) 2009-09-22 2013-06-25 Thwapr, Inc. Progressive registration for mobile media sharing
US20110096828A1 (en) 2009-09-22 2011-04-28 Qualcomm Incorporated Enhanced block-request streaming using scalable encoding
US10264029B2 (en) 2009-10-30 2019-04-16 Time Warner Cable Enterprises Llc Methods and apparatus for packetized content delivery over a content delivery network
US8930991B2 (en) 2009-11-19 2015-01-06 Gregory Philpott System and method for delivering content to mobile devices
EP2507995A4 (en) 2009-12-04 2014-07-09 Sonic Ip Inc Elementary bitstream cryptographic material transport systems and methods
US20110296048A1 (en) 2009-12-28 2011-12-01 Akamai Technologies, Inc. Method and system for stream handling using an intermediate format
US9038116B1 (en) 2009-12-28 2015-05-19 Akamai Technologies, Inc. Method and system for recording streams
CN102714662B (en) 2010-01-18 2017-06-09 瑞典爱立信有限公司 For the method and apparatus of HTTP media stream distribution
US20110179185A1 (en) 2010-01-20 2011-07-21 Futurewei Technologies, Inc. System and Method for Adaptive Differentiated Streaming
GB2477515B (en) * 2010-02-03 2012-09-26 Orbital Multi Media Holdings Corp Data flow control method and apparatus
US9532113B2 (en) 2010-02-17 2016-12-27 Verimatrix, Inc. Systems and methods for securing content delivered using a playlist
EP2362651A1 (en) 2010-02-19 2011-08-31 Thomson Licensing Multipath delivery for adaptive streaming
EP2360923A1 (en) 2010-02-24 2011-08-24 Thomson Licensing Method for selectively requesting adaptive streaming content and a device implementing the method
US8819116B1 (en) 2010-03-08 2014-08-26 Amazon Technologies, Inc. Providing services using a device capabilities service
EP2375680A1 (en) 2010-04-01 2011-10-12 Thomson Licensing A method for recovering content streamed into chunk
US8402155B2 (en) * 2010-04-01 2013-03-19 Xcira, Inc. Real-time media delivery with automatic catch-up
US9043484B2 (en) 2010-04-02 2015-05-26 Disney Enterprises, Inc. Streaming playback and dynamic ad insertion
US9137278B2 (en) 2010-04-08 2015-09-15 Vasona Networks Inc. Managing streaming bandwidth for multiple clients
AU2010202741B1 (en) 2010-06-30 2010-12-23 Brightcove Inc. Dynamic chunking for media streaming
US20130080267A1 (en) 2011-09-26 2013-03-28 Unicorn Media, Inc. Single-url content delivery
US8824560B2 (en) * 2010-07-07 2014-09-02 Netzyn, Inc. Virtual frame buffer system and method
US8782268B2 (en) 2010-07-20 2014-07-15 Microsoft Corporation Dynamic composition of media
US9456015B2 (en) 2010-08-10 2016-09-27 Qualcomm Incorporated Representation groups for network streaming of coded multimedia data
US8677428B2 (en) 2010-08-20 2014-03-18 Disney Enterprises, Inc. System and method for rule based dynamic server side streaming manifest files
JP5961174B2 (en) 2010-11-02 2016-08-02 テレフオンアクチーボラゲット エルエム エリクソン(パブル) Method and device for media description delivery
US9001886B2 (en) * 2010-11-22 2015-04-07 Cisco Technology, Inc. Dynamic time synchronization
US8532171B1 (en) * 2010-12-23 2013-09-10 Juniper Networks, Inc. Multiple stream adaptive bit rate system
US9264750B2 (en) 2010-12-23 2016-02-16 Verizon Patent And Licensing Inc. Advertising insertion for playback of video streams on user devices
US8914534B2 (en) 2011-01-05 2014-12-16 Sonic Ip, Inc. Systems and methods for adaptive bitrate streaming of media stored in matroska container files using hypertext transfer protocol
MX2013008755A (en) * 2011-01-28 2014-01-31 Eye Io Llc Encoding of video stream based on scene type.
US9026671B2 (en) 2011-04-05 2015-05-05 Qualcomm Incorporated IP broadcast streaming services distribution using file delivery methods
EP2685742A4 (en) * 2011-04-07 2014-03-05 Huawei Tech Co Ltd Method, device and system for transmitting and processing media content
US8849950B2 (en) 2011-04-07 2014-09-30 Qualcomm Incorporated Network streaming of video data using byte range requests
US9646141B2 (en) 2011-06-22 2017-05-09 Netflix, Inc. Fast start of streaming digital media playback with deferred license retrieval
US9615126B2 (en) * 2011-06-24 2017-04-04 Google Technology Holdings LLC Intelligent buffering of media streams delivered over internet
CN103621151B (en) 2011-06-28 2017-09-15 瑞典爱立信有限公司 For managing the technology that stream broadcasts media business in network entity
EP2730072B1 (en) 2011-07-07 2016-09-07 Telefonaktiebolaget LM Ericsson (publ) Network-capacity optimized adaptive streaming
US9590814B2 (en) 2011-08-01 2017-03-07 Qualcomm Incorporated Method and apparatus for transport of dynamic adaptive streaming over HTTP (DASH) initialization segment description fragments as user service description fragments
US9887852B2 (en) 2011-08-11 2018-02-06 Intel Corporation Methods for switching between a MBMS download and an HTTP-based delivery of DASH formatted content over an IMS network
WO2013048484A1 (en) * 2011-09-30 2013-04-04 Intel Corporation Quality of experience enhancements over wireless networks
US9800945B2 (en) * 2012-04-03 2017-10-24 Activevideo Networks, Inc. Class-based intelligent multiplexing over unmanaged networks
US9246741B2 (en) * 2012-04-11 2016-01-26 Google Inc. Scalable, live transcoding with support for adaptive streaming and failover
JP2015518350A (en) 2012-04-24 2015-06-25 ヴィド スケール インコーポレイテッド Method and apparatus for smooth stream switching in MPEG / 3GPP-DASH
US20140201329A1 (en) * 2012-06-11 2014-07-17 Intel Corporation Distribution of layered multi-media streams over multiple radio links
US9635369B2 (en) * 2012-07-02 2017-04-25 Qualcomm Incorporated Video parameter set including HRD parameters
CN104247377B (en) * 2012-07-09 2018-07-27 松下知识产权经营株式会社 Communication device, communication means, program
KR101629748B1 (en) 2012-07-09 2016-06-13 후아웨이 테크놀러지 컴퍼니 리미티드 Dynamic adaptive streaming over http client behavior framework and implementation of session management
IN2015DN00468A (en) 2012-07-09 2015-06-26 Ericsson Telefon Ab L M
US10021394B2 (en) * 2012-09-24 2018-07-10 Qualcomm Incorporated Hypothetical reference decoder parameters in video coding
US9374585B2 (en) * 2012-12-19 2016-06-21 Qualcomm Incorporated Low-delay buffering model in video coding
US9426196B2 (en) * 2013-01-04 2016-08-23 Qualcomm Incorporated Live timing for dynamic adaptive streaming over HTTP (DASH)
US20140209493A1 (en) * 2013-01-28 2014-07-31 Susan Jean Hale Garment and Accessories Organizer and Shoulder Bag
US8752113B1 (en) 2013-03-15 2014-06-10 Wowza Media Systems, LLC Insertion of graphic overlays into a stream
US8869218B2 (en) * 2013-03-15 2014-10-21 Wowza Media Systems, LLC On the fly transcoding of video on demand content for adaptive streaming
US20140351871A1 (en) * 2013-05-22 2014-11-27 Microsoft Corporation Live media processing and streaming service
US9094737B2 (en) * 2013-05-30 2015-07-28 Sonic Ip, Inc. Network video streaming with trick play based on separate trick play files
DE102013211571B4 (en) * 2013-06-19 2016-02-11 Opticom Dipl.-Ing. Michael Keyhl Gmbh CONCEPT FOR DETERMINING THE QUALITY OF A MEDIA DATA FLOW WITH A VARIANT QUALITY-TO-BIT RATE
US9179159B2 (en) * 2013-06-20 2015-11-03 Wowza Media Systems, LLC Distributed encoding of a video stream
WO2014209493A1 (en) 2013-06-28 2014-12-31 Wi-Lan Labs, Inc. State management for video streaming quality of experience degradation control and recovery using a video quality metric
GB2520334B (en) * 2013-11-18 2015-11-25 Helen Bradley Lennon A video broadcast system and a method of disseminating video content
US9282133B2 (en) * 2013-12-12 2016-03-08 Ooma, Inc. Communicating control information within a real-time stream
US9253231B2 (en) * 2013-12-19 2016-02-02 Verizon Patent And Licensing Inc. Retrieving and caching adaptive bitrate stream segments based on network congestion
US11228764B2 (en) * 2014-01-15 2022-01-18 Avigilon Corporation Streaming multiple encodings encoded using different encoding parameters
KR20160110442A (en) * 2014-01-29 2016-09-21 코닌클리즈케 케이피엔 엔.브이. Establishing a streaming presentation of an event
US9106887B1 (en) * 2014-03-13 2015-08-11 Wowza Media Systems, LLC Adjusting encoding parameters at a mobile device based on a change in available network bandwidth
US9635077B2 (en) * 2014-03-14 2017-04-25 Adobe Systems Incorporated Low latency live video streaming
US20150264404A1 (en) * 2014-03-17 2015-09-17 Nokia Technologies Oy Method and apparatus for video coding and decoding
CN103944675B (en) * 2014-04-04 2017-08-04 浙江大学 It is adapted to the self adaptation stream waterline transmission method that physical layer is transmitted without rate coding
US8896765B1 (en) * 2014-05-16 2014-11-25 Shadowbox Media, Inc. Systems and methods for remote control of a television
US10110657B2 (en) * 2014-07-03 2018-10-23 Telefonaktiebolaget Lm Ericsson (Publ) System and method for pushing live media content in an adaptive streaming environment
US9426478B2 (en) * 2014-07-21 2016-08-23 Cisco Technology, Inc. Resolution robust video quality metric
US9756361B2 (en) * 2014-07-25 2017-09-05 Verizon Patent And Licensing Inc. On-demand load balancer and virtual live slicer server farm for program ingest
US9838455B2 (en) * 2014-09-19 2017-12-05 Mobitv, Inc. Fast encoding of live streaming media content
CN104318926B (en) * 2014-09-29 2018-08-31 四川九洲电器集团有限责任公司 Lossless audio coding method based on IntMDCT, coding/decoding method
CA2965484C (en) * 2014-10-22 2019-07-23 Arris Enterprises Llc Adaptive bitrate streaming latency reduction
CN107113462B (en) * 2014-11-20 2020-10-27 松下电器(美国)知识产权公司 Transmission method, reception method, transmission device, and reception device
WO2016098056A1 (en) * 2014-12-18 2016-06-23 Nokia Technologies Oy An apparatus, a method and a computer program for video coding and decoding
KR101942208B1 (en) * 2015-01-08 2019-01-24 애리스 엔터프라이지즈 엘엘씨 Server-side Adaptive Bitrate Control for DLNA HTTP Streaming Clients
GB2534136A (en) * 2015-01-12 2016-07-20 Nokia Technologies Oy An apparatus, a method and a computer program for video coding and decoding
US9781084B2 (en) * 2015-01-23 2017-10-03 Arris Enterprises Llc Reducing start-up delay in streaming media sessions
US10218981B2 (en) * 2015-02-11 2019-02-26 Wowza Media Systems, LLC Clip generation based on multiple encodings of a media stream
US9756106B2 (en) * 2015-02-13 2017-09-05 Citrix Systems, Inc. Methods and systems for estimating quality of experience (QoE) parameters of secured transactions
SG11201706160UA (en) 2015-02-27 2017-09-28 Sonic Ip Inc Systems and methods for frame duplication and frame extension in live video encoding and streaming

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7339993B1 (en) * 1999-10-01 2008-03-04 Vidiator Enterprises Inc. Methods for transforming streaming video data
US20120307886A1 (en) * 2011-05-31 2012-12-06 Broadcom Corporation Adaptive Video Encoding Based on Predicted Wireless Channel Conditions
US20130114744A1 (en) * 2011-11-06 2013-05-09 Akamai Technologies Inc. Segmented parallel encoding with frame-aware, variable-size chunking
US20140019593A1 (en) * 2012-07-10 2014-01-16 Vid Scale, Inc. Quality-driven streaming

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10715574B2 (en) 2015-02-27 2020-07-14 Divx, Llc Systems and methods for frame duplication and frame extension in live video encoding and streaming
US11134115B2 (en) 2015-02-27 2021-09-28 Divx, Llc Systems and methods for frame duplication and frame extension in live video encoding and streaming
US11824912B2 (en) 2015-02-27 2023-11-21 Divx, Llc Systems and methods for frame duplication and frame extension in live video encoding and streaming

Also Published As

Publication number Publication date
US20200344284A1 (en) 2020-10-29
JP2019193312A (en) 2019-10-31
JP6588987B2 (en) 2019-10-09
US20160255131A1 (en) 2016-09-01
HK1246423A1 (en) 2018-09-07
KR20170118759A (en) 2017-10-25
US11134115B2 (en) 2021-09-28
CN107251008B (en) 2020-11-13
CN107251008A (en) 2017-10-13
EP3262523A4 (en) 2018-07-25
KR101897959B1 (en) 2018-09-12
JP2018511978A (en) 2018-04-26
EP3262523A1 (en) 2018-01-03
US10715574B2 (en) 2020-07-14
US11824912B2 (en) 2023-11-21
EP3262523B1 (en) 2019-12-04
SG11201706160UA (en) 2017-09-28
JP6928038B2 (en) 2021-09-01
EP3627337A1 (en) 2020-03-25
US20220124137A1 (en) 2022-04-21
ES2768979T3 (en) 2020-06-24

Similar Documents

Publication Publication Date Title
US11134115B2 (en) Systems and methods for frame duplication and frame extension in live video encoding and streaming
EP3072301B1 (en) Transcoding media streams using subchunking
US20220360861A1 (en) Multimedia content delivery with reduced delay
US20160037176A1 (en) Automatic and adaptive selection of profiles for adaptive bit rate streaming
US11201903B1 (en) Time synchronization between live video streaming and live metadata
US10812550B1 (en) Bitrate allocation for a multichannel media stream
US9866459B1 (en) Origin failover for live streaming
WO2009103351A1 (en) Method and apparatus for obtaining media over a communications network
US10693642B1 (en) Output switching for encoded content streams
EP3891999B1 (en) Just after broadcast media content
EP3264709B1 (en) A method for computing, at a client for receiving multimedia content from a server using adaptive streaming, the perceived quality of a complete media session, and client
US11172244B2 (en) Process controller for creation of ABR VOD product manifests
US11917327B2 (en) Dynamic resolution switching in live streams based on video quality assessment
US11909795B1 (en) Input switching for streaming content

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16756526

Country of ref document: EP

Kind code of ref document: A1

REEP Request for entry into the european phase

Ref document number: 2016756526

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 11201706160U

Country of ref document: SG

ENP Entry into the national phase

Ref document number: 20177023590

Country of ref document: KR

Kind code of ref document: A

Ref document number: 2017544732

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE