US20080013838A1 - System and method for processing symbols embedded in digital video - Google Patents

System and method for processing symbols embedded in digital video Download PDF

Info

Publication number
US20080013838A1
US20080013838A1 US11/779,170 US77917007A US2008013838A1 US 20080013838 A1 US20080013838 A1 US 20080013838A1 US 77917007 A US77917007 A US 77917007A US 2008013838 A1 US2008013838 A1 US 2008013838A1
Authority
US
United States
Prior art keywords
symbol
stream
image information
correlated
representation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US11/779,170
Inventor
Richard Lane
Amnon Silberger
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Priority to US11/779,170 priority Critical patent/US20080013838A1/en
Assigned to QUALCOMM INCORPORATED reassignment QUALCOMM INCORPORATED ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: LANE, RICHARD D., SILBERGER, AMNON
Publication of US20080013838A1 publication Critical patent/US20080013838A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/24Systems for the transmission of television signals using pulse code modulation
    • H04N7/52Systems for transmission of a pulse code modulated video signal with one or more other pulse code modulated signals, e.g. an audio signal or a synchronizing signal
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/20Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/23412Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs for generating or manipulating the scene composition of objects, e.g. MPEG-4 objects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/23418Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/44012Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving rendering scenes according to scene graphs, e.g. MPEG-4 scene graphs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/462Content or additional data management, e.g. creating a master electronic program guide from data received from the Internet and a Head-end, controlling the complexity of a video stream by scaling the resolution or bit-rate based on the client capabilities
    • H04N21/4622Retrieving content or additional data from different sources, e.g. from a broadcast channel and the Internet

Definitions

  • the present invention relates generally to multimedia transmission.
  • Multimedia streams which typically include video frames, are compressed prior to transmission, to conserve bandwidth.
  • Compressing video generally means representing a relatively larger group of bits with a relatively smaller group of bits.
  • the bits representing the symbol are compressed along with the bits representing the image. That is, when video is overlaid with graphics or alpha numeric symbols, both the underlying image and the symbols are compressed together for transmission. Upon receipt, the stream is decompressed to render a “lossy”version of the original image.
  • the present invention understands that compressing and decompressing a multimedia stream, while critical to the efficient transmission of multimedia, often results in some lost data. As a result, some useful information in the original stream might not reappear in the decompressed stream.
  • the present invention further understands that in the context of video images, losing a small amount of information usually is of small consequence, since a viewer will be unable to detect the loss in the relatively large volume of data that makes up a video image.
  • the present invention critically recognizes, however, that symbols generally do not require much data to render. Consequently, losing a small amount of symbol information during the compression/decompression process can result in detectable distortions and/or artifacts in the decompressed symbol. Furthermore, having to compress and transmit symbols along with the images to which they pertain consumes multimedia transmission bandwidth.
  • the present invention understands that transmission of images over wireless links to portable devices often necessitates a significant reduction in resolution. Reduced resolution means there are fewer pixels available for representing a symbol.
  • the small display screens commonly used on mobile devices limits the ability of the human visual system, or HVS, to easily and accurately discern symbols.
  • the present invention understands that being able to enlarge or transform symbols in video that is displayed on small screens would improve the ability of the HVS to discern the symbols.
  • a system for processing an image such as a video image, that has a symbol, such as a graphics symbol or an alpha-numeric character, includes a processor that executes logic including scanning the image. Also, the processor uses character/symbol recognition techniques to create a representation of the symbol, and then compresses the image to render a compressed image.
  • the logic executed by the processor can include blanking out the symbol in the image prior to the act of compressing.
  • the symbol may be compressed along with the image.
  • the representation of the symbol may include not only a symbol type or identification, but also other descriptive parameters including but not limited to symbol identity, symbol position, symbol size, symbol font, and symbol color.
  • the system can further include a receive processor that decompresses the compressed image to render a decompressed image.
  • the receiver processor can combine the symbol with the decompressed image. For instance, the receive processor can use the position of the symbol to insert the symbol or a transformed version of the symbol into the decompressed image.
  • the representation of the symbol may be embedded in the compressed stream, or it may be transmitted separately therefrom.
  • a method for processing a multimedia stream that includes video image information and symbols in the video includes removing the symbols from the stream, and then compressing the video image information to render a compressed stream.
  • a receive processor decompresses a compressed video stream to render a decompressed image.
  • the receive processor uses a representation of a symbol that had been included in the original image, the receive processor combines the symbol with the decompressed image.
  • the preferred non limiting symbol representation could take on one of several forms. For example, a pixel-by-pixel bit map representation of the symbol can be sent either embedded in the compressed stream in, e.g., a header of the compressed stream, or the bitmap can be transmitted in a separate stream. Or, a code representing the symbol can likewise be transmitted in lieu of a bitmap.
  • a computer product for processing scanned video images having symbols.
  • the product includes means for recognizing symbols in the scanned images to render symbol information. Means then compress the images.
  • FIG. 1 is a functional block diagram of the present system
  • FIG. 2 is a flow chart of the compression and transmission logic
  • FIG. 3 is a flow chart of various receive and decompress methods that can be used.
  • a system for transmitting and receiving compressed multimedia including video streams, from a source 12 of multimedia. While the disclosure herein focuses on the video portion of multimedia streams, it is to be understood that the present principles apply to other multimedia including audio, slides, 2D graphics and 3D graphics, etc.
  • the system 10 includes a scanner 14 that scans the multimedia source images from the source 12 and sends the images and/or results of the scanning to a transmit processor 16 , which, among other things, functions as a symbol recognizer and video compressor in accordance with the logic discussed below.
  • symbol means both graphics symbols and alpha-numeric characters, and consequently “symbol recognition” encompasses both symbol and character recognition.
  • a symbol processor 17 can be provided as shown.
  • compressed video, along with symbol information derived from character/symbol recognition, are sent to a transmitter 18 for transmission thereof.
  • the non-limiting preferred embodiment shown in FIG. 1 shows a multimedia transmitter 18 that uses wireless means, and more particularly that uses code division multiple access (CDMA) principles.
  • the streams can be broadcast to plural receivers if desired, or transmitted using point-to point or multicast wireless transmission principles. It is to be understood that the present principles apply to other forms of wireless communication such as GSM, TDMA, S-TDMA, wideband CDMA, OFDM, etc. as well as transmission of multimedia over cable systems, the Internet, etc.
  • One or more receivers 20 can receive the transmitted video and symbol information and send it to one or more receive processors 22 .
  • the receiver processor 22 functions as a video decompresser. If desired, the receiver processor 22 can also decompress symbols, if they were compressed during transmission, or the symbols can be decompressed by a separate symbol processor 23 as shown.
  • a stream combiner 24 then combines the symbols with the image stream in accordance with the logic of FIG. 3 discussed below.
  • the decompressed multimedia with symbols can be displayed on an output device 25 , such as a video display, printer, speaker, etc.
  • video images in the stream are scanned by an appropriate scanning device 14 .
  • Any appropriate scanner 14 may be used.
  • the images are scanned at a relatively high sampling rate that is sufficient to reduce or eliminate distortion of symbols in the video images.
  • a digitized representation of the images with symbols is generated by the scanner 14 .
  • not all video frames in a stream need be scanned. For example, every tenth frame might be scanned until a symbol is detected. Then, every succeeding frame can be scanned for the symbol. When a frame is encountered having no symbols, the scanning frequency can revert to every n th frame. In this way, scanning speed is increased.
  • symbols in the image are recognized by character recognition software executed on the transmit processor 16 (or, if desired, the symbol processor 17 ). Any appropriate character recognition algorithms can be used.
  • the transmit processor 16 (or symbol processor 17 ) generates symbol information, i.e., binary representations of the symbols that indicate not only the identification (type) of symbol scanned, but also, preferably, the position of the symbol in the image, and if desired the color, font, size, etc. of the symbol.
  • symbol information and information/data “representation of a symbol” are distinct from the pixilated data that represents the color or gray shade of the non-symbol portions of the video images on a pixel-by-pixel basis. Rather, “symbol information” means the binary code that is output by a character recognition engine that can be correlated to a particular symbol (and its characteristics mentioned above). This symbol information is recorded at block 30 .
  • the video images are compressed at block 32 .
  • Any appropriate video compression scheme or algorithm may be used.
  • the compression ignores the symbols in the original image by, e.g., the processor 16 blanking out the symbols (e.g., by setting the pixels covered by the symbol area to a single predetermined binary value), such that the symbols are not compressed during image compression. This saves bandwidth.
  • Other means known in the art for ignoring portions of an image stream to be compressed may be used. For example, a symbol in an image can be replaced with pixel information that matches the pixel information surrounding the symbol, which when compressed would render a smaller stream than would be rendered otherwise.
  • the symbol can be replaced with ancillary data useful for control, conveying general information, or error correction.
  • the symbols in the image need not be blanked out, but may be compressed along with the image data, with the recognized symbol information nonetheless being used at the receiver to render non-distorted symbols after expansion.
  • the symbol information output by the symbol recognition algorithm can itself be compressed (by, e.g., the symbol processor 17 ), whether transmitted separately with the compressed video information or with it (e.g., in the video frame headers).
  • the compressed stream representing the video images is transmitted at block 34 .
  • the symbol information rendered by the character recognition logic of the transmit processor 16 (or symbol processor 17 ) is also transmitted.
  • This symbol information may be transmitted apart from the compressed video image stream, indeed, it may be transmitted on an entirely different channel than the video stream.
  • the symbol information may be embedded in the stream, e.g., the symbol information may be included in headers of video frames.
  • the symbols When the symbols are transmitted separately from the video stream, the symbols may be transmitted on a wireless communication path and the video stream transmitted on another wireless path or indeed a separate transmission system.
  • the symbols and video stream can be transmitted point-to-point to a receiver, or they can be multicast or broadcast to plural receivers.
  • the compressed video image stream is received. Also at block 38 , when the symbol information is transmitted along with the video stream in the same channel, it also is received.
  • the compressed video stream is decompressed, as is the symbol information, if it had been compressed and transmitted in, e.g., the headers of the video stream.
  • the symbol if it had not been compressed, it is sent from block 40 to block 41 , where the symbol can be processed and transformed as desired to improve visibility.
  • the symbol can be decoded, if an encoded representation of the symbol had been sent, and if desired enlarged or otherwise transformed (by, e.g., changing the shape, font, color, or other attribute of the symbol).
  • FIG. 3 also shows the alternative wherein the symbols are transmitted separately from the video stream, in which embodiment the symbols are received directly from their separate channels to block 41 for transformation and/or processing.
  • the position information and the other information in the symbol information can be used to re-insert the symbols into their corresponding positions in the video images, with the images with symbols being displayed at block 44 .
  • the symbols in a video stream may be handled separately from the video images themselves, allowing for greater flexibility.
  • the symbols can be transmitted separately from the video stream.
  • the symbols can be repositioned in the images if desired in time or space by the receiver processor 22 .
  • the symbols can be enlarged in the decompressed video images for enhanced visibility, or replaced altogether by icons to consume less space on the video images to improve picture quality and symbol recognition and readability.
  • the icons can be used as drop-down or pop-up menus which when a representative symbol is selected by a user could display the scanned symbol.
  • the symbols can be displayed separately from the video images if desired.

Abstract

Video images in, e.g., a multimedia stream are scanned prior to compression for transmission to recognize symbols, such as graphics symbols and alpha-numeric characters. The types, positions, sizes, etc. of the symbols are recorded to render symbol information, and then the images are compressed with or without compressing the symbols, which may be removed from the images prior to compression if desired. The compressed video and symbol information are sent to a receiver, which decompresses the video, optionally transforms the symbols, and then inserts the symbols where indicated by the symbol information.

Description

    CLAIM OF PRIORITY UNDER 35 U.S.C. § 120
  • The present Application for Patent is a continuation application of U.S. patent application Serial No. 10/293,593, entitled “System and Method for Processing Symbols Embedded in Digital Video,” filed Nov. 12, 2002,and assigned to the assignee hereof and hereby expressly incorporated by reference herein.
  • FIELD OF THE INVENTION
  • The present invention relates generally to multimedia transmission.
  • BACKGROUND
  • Multimedia streams, which typically include video frames, are compressed prior to transmission, to conserve bandwidth. Compressing video generally means representing a relatively larger group of bits with a relatively smaller group of bits.
  • When a video frame or frames contains a symbol, generally used herein to refer to both graphics symbols and alpha-numeric characters, the bits representing the symbol are compressed along with the bits representing the image. That is, when video is overlaid with graphics or alpha numeric symbols, both the underlying image and the symbols are compressed together for transmission. Upon receipt, the stream is decompressed to render a “lossy”version of the original image.
  • The present invention understands that compressing and decompressing a multimedia stream, while critical to the efficient transmission of multimedia, often results in some lost data. As a result, some useful information in the original stream might not reappear in the decompressed stream. The present invention further understands that in the context of video images, losing a small amount of information usually is of small consequence, since a viewer will be unable to detect the loss in the relatively large volume of data that makes up a video image.
  • The present invention critically recognizes, however, that symbols generally do not require much data to render. Consequently, losing a small amount of symbol information during the compression/decompression process can result in detectable distortions and/or artifacts in the decompressed symbol. Furthermore, having to compress and transmit symbols along with the images to which they pertain consumes multimedia transmission bandwidth.
  • In addition to the above recognition, the present invention understands that transmission of images over wireless links to portable devices often necessitates a significant reduction in resolution. Reduced resolution means there are fewer pixels available for representing a symbol. The small display screens commonly used on mobile devices limits the ability of the human visual system, or HVS, to easily and accurately discern symbols. The present invention understands that being able to enlarge or transform symbols in video that is displayed on small screens would improve the ability of the HVS to discern the symbols. With these observations in mind, the invention below is provided.
  • SUMMARY OF THE INVENTION
  • A system for processing an image, such as a video image, that has a symbol, such as a graphics symbol or an alpha-numeric character, includes a processor that executes logic including scanning the image. Also, the processor uses character/symbol recognition techniques to create a representation of the symbol, and then compresses the image to render a compressed image.
  • In a preferred embodiment, the logic executed by the processor can include blanking out the symbol in the image prior to the act of compressing. Alternatively, the symbol may be compressed along with the image. The representation of the symbol may include not only a symbol type or identification, but also other descriptive parameters including but not limited to symbol identity, symbol position, symbol size, symbol font, and symbol color.
  • When the system includes a transmitter processor and the compressed image and representation of the symbol are transmitted, the system can further include a receive processor that decompresses the compressed image to render a decompressed image. Using the representation of the symbol, the receiver processor can combine the symbol with the decompressed image. For instance, the receive processor can use the position of the symbol to insert the symbol or a transformed version of the symbol into the decompressed image. The representation of the symbol may be embedded in the compressed stream, or it may be transmitted separately therefrom.
  • In another aspect, a method for processing a multimedia stream that includes video image information and symbols in the video includes removing the symbols from the stream, and then compressing the video image information to render a compressed stream.
  • In still another aspect, a receive processor decompresses a compressed video stream to render a decompressed image. Using a representation of a symbol that had been included in the original image, the receive processor combines the symbol with the decompressed image. The preferred non limiting symbol representation could take on one of several forms. For example, a pixel-by-pixel bit map representation of the symbol can be sent either embedded in the compressed stream in, e.g., a header of the compressed stream, or the bitmap can be transmitted in a separate stream. Or, a code representing the symbol can likewise be transmitted in lieu of a bitmap.
  • In yet another aspect, a computer product is disclosed for processing scanned video images having symbols. The product includes means for recognizing symbols in the scanned images to render symbol information. Means then compress the images.
  • The details of the present invention, both as to its structure and operation, can best be understood in reference to the accompanying drawings, in which like reference numerals refer to like parts, and in which:
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a functional block diagram of the present system;
  • FIG. 2 is a flow chart of the compression and transmission logic; and
  • FIG. 3 is a flow chart of various receive and decompress methods that can be used.
  • DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT
  • Referring initially to FIG. 1, a system is shown, generally designated 10, for transmitting and receiving compressed multimedia including video streams, from a source 12 of multimedia. While the disclosure herein focuses on the video portion of multimedia streams, it is to be understood that the present principles apply to other multimedia including audio, slides, 2D graphics and 3D graphics, etc.
  • As shown, the system 10 includes a scanner 14 that scans the multimedia source images from the source 12 and sends the images and/or results of the scanning to a transmit processor 16, which, among other things, functions as a symbol recognizer and video compressor in accordance with the logic discussed below. As used herein, “symbol” means both graphics symbols and alpha-numeric characters, and consequently “symbol recognition” encompasses both symbol and character recognition.
  • While a single transmit processor 16 is shown for clarity of disclosure, plural processors can be used to execute the logic of FIG. 2. For instance, in embodiments wherein the symbols themselves are to be compressed apart from the stream, a symbol processor 17 can be provided as shown. As discussed below, compressed video, along with symbol information derived from character/symbol recognition, are sent to a transmitter 18 for transmission thereof.
  • The non-limiting preferred embodiment shown in FIG. 1 shows a multimedia transmitter 18 that uses wireless means, and more particularly that uses code division multiple access (CDMA) principles. The streams can be broadcast to plural receivers if desired, or transmitted using point-to point or multicast wireless transmission principles. It is to be understood that the present principles apply to other forms of wireless communication such as GSM, TDMA, S-TDMA, wideband CDMA, OFDM, etc. as well as transmission of multimedia over cable systems, the Internet, etc.
  • One or more receivers 20 can receive the transmitted video and symbol information and send it to one or more receive processors 22. The receiver processor 22 functions as a video decompresser. If desired, the receiver processor 22 can also decompress symbols, if they were compressed during transmission, or the symbols can be decompressed by a separate symbol processor 23 as shown. A stream combiner 24 then combines the symbols with the image stream in accordance with the logic of FIG. 3 discussed below. The decompressed multimedia with symbols can be displayed on an output device 25, such as a video display, printer, speaker, etc.
  • Now referring to FIG. 2, the compression and transmission logic can be seen. At block 26, video images in the stream are scanned by an appropriate scanning device 14. Any appropriate scanner 14 may be used. Preferably, the images are scanned at a relatively high sampling rate that is sufficient to reduce or eliminate distortion of symbols in the video images. A digitized representation of the images with symbols is generated by the scanner 14.
  • If desired, not all video frames in a stream need be scanned. For example, every tenth frame might be scanned until a symbol is detected. Then, every succeeding frame can be scanned for the symbol. When a frame is encountered having no symbols, the scanning frequency can revert to every nth frame. In this way, scanning speed is increased.
  • Proceeding to block 28, symbols in the image are recognized by character recognition software executed on the transmit processor 16 (or, if desired, the symbol processor 17). Any appropriate character recognition algorithms can be used. The transmit processor 16 (or symbol processor 17) generates symbol information, i.e., binary representations of the symbols that indicate not only the identification (type) of symbol scanned, but also, preferably, the position of the symbol in the image, and if desired the color, font, size, etc. of the symbol.
  • Accordingly, as used herein “symbol information” and information/data “representation of a symbol” are distinct from the pixilated data that represents the color or gray shade of the non-symbol portions of the video images on a pixel-by-pixel basis. Rather, “symbol information” means the binary code that is output by a character recognition engine that can be correlated to a particular symbol (and its characteristics mentioned above). This symbol information is recorded at block 30.
  • After scanning, the video images (at a minimum, the pixel information representing non-symbol image data) are compressed at block 32. Any appropriate video compression scheme or algorithm may be used. In one preferred embodiment, the compression ignores the symbols in the original image by, e.g., the processor 16 blanking out the symbols (e.g., by setting the pixels covered by the symbol area to a single predetermined binary value), such that the symbols are not compressed during image compression. This saves bandwidth. Other means known in the art for ignoring portions of an image stream to be compressed may be used. For example, a symbol in an image can be replaced with pixel information that matches the pixel information surrounding the symbol, which when compressed would render a smaller stream than would be rendered otherwise. Or, the symbol can be replaced with ancillary data useful for control, conveying general information, or error correction. However, if desired the symbols in the image need not be blanked out, but may be compressed along with the image data, with the recognized symbol information nonetheless being used at the receiver to render non-distorted symbols after expansion. In any case, as mentioned above the symbol information output by the symbol recognition algorithm can itself be compressed (by, e.g., the symbol processor 17), whether transmitted separately with the compressed video information or with it (e.g., in the video frame headers).
  • The compressed stream representing the video images is transmitted at block 34. At block 36, the symbol information rendered by the character recognition logic of the transmit processor 16 (or symbol processor 17) is also transmitted. This symbol information may be transmitted apart from the compressed video image stream, indeed, it may be transmitted on an entirely different channel than the video stream. Or, the symbol information may be embedded in the stream, e.g., the symbol information may be included in headers of video frames.
  • When the symbols are transmitted separately from the video stream, the symbols may be transmitted on a wireless communication path and the video stream transmitted on another wireless path or indeed a separate transmission system. The symbols and video stream can be transmitted point-to-point to a receiver, or they can be multicast or broadcast to plural receivers.
  • Now referring to FIG. 3, at block 38 the compressed video image stream is received. Also at block 38, when the symbol information is transmitted along with the video stream in the same channel, it also is received. Moving to block 40, the compressed video stream is decompressed, as is the symbol information, if it had been compressed and transmitted in, e.g., the headers of the video stream. On the other hand, if the symbol had not been compressed, it is sent from block 40 to block 41, where the symbol can be processed and transformed as desired to improve visibility. For example, at block 41 the symbol can be decoded, if an encoded representation of the symbol had been sent, and if desired enlarged or otherwise transformed (by, e.g., changing the shape, font, color, or other attribute of the symbol). FIG. 3 also shows the alternative wherein the symbols are transmitted separately from the video stream, in which embodiment the symbols are received directly from their separate channels to block 41 for transformation and/or processing.
  • At block 42, the position information and the other information in the symbol information can be used to re-insert the symbols into their corresponding positions in the video images, with the images with symbols being displayed at block 44.
  • With the above invention, it may be appreciated that the symbols in a video stream may be handled separately from the video images themselves, allowing for greater flexibility. For instance, the symbols can be transmitted separately from the video stream. Also, the symbols can be repositioned in the images if desired in time or space by the receiver processor 22. Moreover, the symbols can be enlarged in the decompressed video images for enhanced visibility, or replaced altogether by icons to consume less space on the video images to improve picture quality and symbol recognition and readability. Still further, the icons can be used as drop-down or pop-up menus which when a representative symbol is selected by a user could display the scanned symbol. Indeed, the symbols can be displayed separately from the video images if desired.
  • While the particular SYSTEM AND METHOD FOR PROCESSING SYMBOLS EMBEDDED 1N DIGITAL VIDEO as herein shown and described in detail is fully capable of attaining the above-described objects of the invention, it is to be understood that it is the presently preferred embodiment of the present invention and is thus representative of the subject matter which is broadly contemplated by the present invention, that the scope of the present invention fully encompasses other embodiments which may become obvious to those skilled in the art, and that the scope of the present invention is accordingly to be limited by nothing other than the appended claims, in which reference to an element in the singular is not intended to mean “one and only one” unless explicitly so stated, but rather “one or more.”All structural and functional equivalents to the elements of the above-described preferred embodiment that are known or later come to be known to those of ordinary skill in the art are expressly incorporated herein by reference and are intended to be encompassed by the present claims. Moreover, it is not necessary for a device or method to address each and every problem sought to be solved by the present invention, for it to be encompassed by the present claims. Furthermore, no element, component, or method step in the present disclosure is intended to be dedicated to the public regardless of whether the element, component, or method step is explicitly recited in the claims. No claim element herein is to be construed under the provisions of 35 U.S.C. § 112, sixth paragraph, unless the element is expressly recited using the phrase “means for” or, in the case of a method claim, the element is recited as a “step” instead of an “act.”

Claims (48)

1. A method for processing a multimedia stream including at least video image information, comprising:
scanning at least a portion of the multimedia stream to detect whether a symbol is present-within the video image information;
creating a correlated symbol representation of the detected symbol;
producing remaining video image information by removing the symbol from the video image information of the multimedia stream; and
compressing the remaining video image information of the multimedia stream to produce a compressed stream.
2. The method of claim 1, wherein the correlated symbol representation includes at least one symbol characteristic selected from a group consisting of a position of the symbol within the video image information, a color of the symbol, a font type of the symbol, a size of the symbol, a binary code representing the symbol, and combinations thereof.
3. The method of claim 1, wherein the video image information includes a plurality of video frames and scanning at least a portion of the multimedia stream further includes only scanning every n-th frame of the plurality of video frames until the symbol is detected.
4. The method of claim 1, wherein the image information represents at least one image frame of a plurality of image frames, and wherein scanning comprises scanning the at least one image frame prior to the act of compressing.
5. The method of claim 4, further comprising using character recognition to detect the symbol.
6. The method of claim 4, wherein the correlated symbol representation includes at least one symbol characteristic, which comprises a position of the symbol in the at least one image frame.
7. The method of claim 6, further comprising decompressing the compressed stream to render a decompressed stream.
8. The method of claim 7, further comprising using the position of the symbol to insert the symbol into at least one image represented by the decompressed stream.
9. The method of claim 1, further comprising transmitting the compressed stream apart from the correlated symbol representation.
10. The method of claim 1, further comprising transmitting the compressed stream along with the correlated symbol representation, which is embedded in the compressed stream.
11. The method of claim 1, wherein removing the symbol comprises blanking out the symbol prior to the act of compressing.
12. The method of claim 1, wherein removing the symbol comprises replacing pixel information representing the symbol with pixel information that matches pixels surrounding the symbol prior to compressing the remaining video image information.
13. The method of claim 1, further comprising transmitting the compressed stream and the correlated symbol representation over a wireless channel.
14. The method of claim 13, wherein the wireless channel is a broadcast channel.
15. The method of claim 13, wherein the compressed stream is transmitted on a first wireless channel and the correlated symbol representation is transmitted on a second wireless channel.
16. The method of claim 15, wherein at least one of the channels is a broadcast channel.
17. The method of claim 13, wherein the compressed stream is transmitted on a first transmission system and the correlated symbol representation is transmitted on a second transmission system.
18. A system for processing a multimedia stream comprising at least video image information, said system comprising a processor including logic for undertaking the acts of:
scanning at least a portion of the multimedia stream to locate a symbol within the video image information;
providing a correlated symbol representation of the symbol;
producing remaining video image information by removing the symbol from the video image information of the multimedia stream; and
compressing the remaining video image information to render a compressed stream.
19. The system of claim 18, wherein the act of producing undertaken by the processor includes at least one of:
blanking out the symbol in the video image information prior to the act of compressing;
replacing pixel information representing the symbol with pixel information that matches pixels surrounding the symbol, prior to compressing;
replacing pixel values representing the symbol with values that when compressed with the image produce a smaller compressed stream as compared to compression of the video image information with pixel values representing the symbol; and
replacing the symbol with data relating to one or more of control, conveying general information, and error correction.
20. The system of claim 18, wherein the acts undertaken by the processor include compressing the correlated symbol representation and the video image information.
21. The system of claim 18, wherein the video image information includes a plurality of video frames and scanning at least a portion of the multimedia stream includes only scanning every n-th frame of the plurality of video frames until the symbol is detected.
22. The system of claim 18, wherein the correlated symbol representation includes at least one symbol characteristic selected from a group consisting of a position of the symbol within the video image information, a binary code for the symbol, a color of the symbol, a size of the symbol, a font of the symbol, and combinations thereof.
23. The system of claim 18 wherein the processor is a transmitter processor, and the compressed stream and the correlated symbol representation are transmitted, and wherein the correlated symbol representation includes at least one symbol characteristic, which comprises a position of the symbol in the video image information.
24. The system of claim 23, wherein the system further includes a receive processor executing logic to undertake acts comprising:
decompressing the compressed stream to produce a decompressed stream;
using the correlated symbol representation to produce a symbol image; and
combining the symbol image with the decompressed stream.
25. The system of claim 24, wherein the acts undertaken by the receive processor further comprise using the position of the symbol to combine the symbol into the decompressed image.
26. The system of claim 23, wherein the acts undertaken by the receive processor further comprise using transformation parameters for the symbol to alter the correlated symbol representation before combining the correlated symbol representation with the decompressed stream.
27. The system of claim 18, wherein the compressed stream and the correlated symbol representation is transmitted over a wireless channel.
28. The system of claim 27, wherein the wireless channel is a broadcast channel.
29. The system of claim 18, wherein the compressed stream is transmitted on a first wireless channel and the correlated symbol representation is transmitted on a second wireless channel.
30. The system of claim 29, wherein at least one of the channels is a broadcast channel.
31. The system of claim 18, wherein the compressed stream is transmitted on a first transmission system and the correlated symbol representation is transmitted on a second transmission system.
32. An apparatus for processing a multimedia stream comprising at least video image information, comprising:
means for scanning at least a portion of the multimedia stream to detect whether a symbol is present within the video image information;
means for creating a correlated symbol representation of the detected symbol;
means for producing remaining video image information by removing the symbol from the video information of the multimedia stream; and
means for compressing the remaining video image information to produce a compressed stream.
33. The apparatus of claim 32, wherein the correlated symbol representation includes at least one symbol characteristic selected from a group consisting of a position of the symbol within the video image information, a color of the symbol, a font type of the symbol, a size of the symbol, a binary code representing the symbol, and combinations thereof.
34. The apparatus of claim 32, wherein the video image information includes a plurality of video frames and scanning at least a portion of the multimedia stream further includes only scanning every n-th frame of the plurality of video frames until the symbol is detected.
35. The apparatus of claim 32, wherein the image information represents at least one image frame of a plurality of image frames, and wherein scanning comprises scanning the at least one image frame prior to the act of compressing.
36. The apparatus of claim 35, wherein the correlated symbol representation includes at least one symbol characteristic, which comprises a position of the symbol in the at least one image frame.
37. The apparatus of claim 35, further comprising means for using character recognition to detect the symbol.
38. The apparatus of claim 37, further comprising means for decompressing the compressed stream to render a decompressed stream.
39. The apparatus of claim 38, further comprising means for using the position of the symbol to insert the symbol into at least one image represented by the decompressed stream.
40. The apparatus of claim 32, further comprising means for transmitting the compressed stream apart from the correlated symbol representation.
41. The apparatus of claim 32, further comprising means for transmitting the compressed stream along with the correlated symbol representation, which is embedded in the compressed stream.
42. The apparatus of claim 32, wherein removing the symbol comprises blanking out the symbol prior to the act of compressing.
43. The apparatus of claim 32, wherein removing the symbol comprises replacing pixel information representing the symbol with pixel information that matches pixels surrounding the symbol prior to compressing the remaining video image information.
44. The apparatus of claim 32, further comprising means for transmitting the compressed stream and the correlated symbol representation over a wireless channel.
45. The apparatus of claim 44, wherein the wireless channel is a broadcast channel.
46. The apparatus of claim 32, wherein the compressed stream is transmitted on a first wireless channel and the correlated symbol representation is transmitted on a second wireless channel.
47. The apparatus of claim 46, wherein at least one of the channels is a broadcast channel.
48. The apparatus of claim 32, wherein the compressed stream is transmitted on a first transmission system and the correlated symbol representation is transmitted on a second transmission system.
US11/779,170 2002-11-12 2007-07-17 System and method for processing symbols embedded in digital video Abandoned US20080013838A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US11/779,170 US20080013838A1 (en) 2002-11-12 2007-07-17 System and method for processing symbols embedded in digital video

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/293,593 US7274823B2 (en) 2002-11-12 2002-11-12 System and method for processing symbols embedded in digital video
US11/779,170 US20080013838A1 (en) 2002-11-12 2007-07-17 System and method for processing symbols embedded in digital video

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US10/293,593 Continuation US7274823B2 (en) 2002-11-12 2002-11-12 System and method for processing symbols embedded in digital video

Publications (1)

Publication Number Publication Date
US20080013838A1 true US20080013838A1 (en) 2008-01-17

Family

ID=32229681

Family Applications (2)

Application Number Title Priority Date Filing Date
US10/293,593 Expired - Fee Related US7274823B2 (en) 2002-11-12 2002-11-12 System and method for processing symbols embedded in digital video
US11/779,170 Abandoned US20080013838A1 (en) 2002-11-12 2007-07-17 System and method for processing symbols embedded in digital video

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US10/293,593 Expired - Fee Related US7274823B2 (en) 2002-11-12 2002-11-12 System and method for processing symbols embedded in digital video

Country Status (10)

Country Link
US (2) US7274823B2 (en)
EP (1) EP1568206A4 (en)
JP (1) JP2006517354A (en)
KR (1) KR100993068B1 (en)
CN (1) CN100520810C (en)
AU (1) AU2003291506A1 (en)
BR (1) BR0316185A (en)
MX (1) MXPA05005049A (en)
RU (1) RU2351090C2 (en)
WO (1) WO2004045199A2 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9065884B2 (en) 2002-10-28 2015-06-23 Qualcomm Incorporated Joint transmission of multiple multimedia streams
US9615582B2 (en) 2012-07-02 2017-04-11 Reckitt Benckiser Llc Pressurized, sprayable aqueous alcoholic microbicidal compositions comprising zinc ions
US10776585B2 (en) 2005-10-26 2020-09-15 Cortica, Ltd. System and method for recognizing characters in multimedia content

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2849563B1 (en) * 2002-12-31 2005-02-11 Medialive CUSTOMIZED MARKING FOR THE PROTECTION OF DIGITAL AUDIOVISUAL STREAMS
KR101176501B1 (en) * 2006-11-17 2012-08-22 엘지전자 주식회사 Broadcasting receiving device capable of displaying communication-related information using data service and control method thereof
RU2661760C1 (en) * 2017-08-25 2018-07-19 Общество с ограниченной ответственностью "Аби Продакшн" Multiple chamber using for implementation of optical character recognition

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5960081A (en) * 1997-06-05 1999-09-28 Cray Research, Inc. Embedding a digital signature in a video sequence
US6014466A (en) * 1997-07-10 2000-01-11 Hughes Electronics Corporation Object-based video coding of arbitrarily shaped objects using lapped orthogonal transforms (LOTs) defined on rectangular and L-shaped regions
US6289174B1 (en) * 1995-12-21 2001-09-11 Sony Corporation Composite video apparatus
US6381368B1 (en) * 1997-12-03 2002-04-30 Canon Kabushiki Kaisha Image processing apparatus and method
US6614930B1 (en) * 1999-01-28 2003-09-02 Koninklijke Philips Electronics N.V. Video stream classifiable symbol isolation method and system
US6714591B1 (en) * 1998-01-27 2004-03-30 Sharp Kabushiki Kaisha Video image coding device and video image decoding device
US6775412B1 (en) * 1997-10-10 2004-08-10 Telefonaktiebolaget Lm Ericsson (Publ) Lossless region of interest coding
US7020336B2 (en) * 2001-11-13 2006-03-28 Koninklijke Philips Electronics N.V. Identification and evaluation of audience exposure to logos in a broadcast event

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH04363717A (en) * 1991-01-14 1992-12-16 Toppan Printing Co Ltd Transferring method and device for stripping data for printing, transmitter and receiver therefor
JPH04287179A (en) * 1991-03-15 1992-10-12 Fuji Xerox Co Ltd Method and device for processing image information and image information transmitter
JPH0537700A (en) * 1991-07-26 1993-02-12 Ricoh Co Ltd Facsimile equipment
JPH05276396A (en) * 1992-03-25 1993-10-22 Toshiba Corp Picture information processor
US5710815A (en) * 1995-06-07 1998-01-20 Vtech Communications, Ltd. Encoder apparatus and decoder apparatus for a television signal having embedded viewer access control data
GB2307813A (en) * 1995-11-02 1997-06-04 Int Mobile Satellite Org Text/Image separation and compression encoding method
JPH11308463A (en) * 1998-04-17 1999-11-05 Sanyo Electric Co Ltd Image processor and image processing method
JP2002064709A (en) * 2000-06-06 2002-02-28 Canon Inc Image processing unit and its method, and its computer program and storage medium
JP2002142070A (en) * 2000-11-02 2002-05-17 Canon Inc Image transmission system and image transmitter and method for them
US7167519B2 (en) * 2001-12-20 2007-01-23 Siemens Corporate Research, Inc. Real-time video object generation for smart cameras

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6289174B1 (en) * 1995-12-21 2001-09-11 Sony Corporation Composite video apparatus
US5960081A (en) * 1997-06-05 1999-09-28 Cray Research, Inc. Embedding a digital signature in a video sequence
US6014466A (en) * 1997-07-10 2000-01-11 Hughes Electronics Corporation Object-based video coding of arbitrarily shaped objects using lapped orthogonal transforms (LOTs) defined on rectangular and L-shaped regions
US6775412B1 (en) * 1997-10-10 2004-08-10 Telefonaktiebolaget Lm Ericsson (Publ) Lossless region of interest coding
US6381368B1 (en) * 1997-12-03 2002-04-30 Canon Kabushiki Kaisha Image processing apparatus and method
US6714591B1 (en) * 1998-01-27 2004-03-30 Sharp Kabushiki Kaisha Video image coding device and video image decoding device
US6614930B1 (en) * 1999-01-28 2003-09-02 Koninklijke Philips Electronics N.V. Video stream classifiable symbol isolation method and system
US7020336B2 (en) * 2001-11-13 2006-03-28 Koninklijke Philips Electronics N.V. Identification and evaluation of audience exposure to logos in a broadcast event

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9065884B2 (en) 2002-10-28 2015-06-23 Qualcomm Incorporated Joint transmission of multiple multimedia streams
US10776585B2 (en) 2005-10-26 2020-09-15 Cortica, Ltd. System and method for recognizing characters in multimedia content
US9615582B2 (en) 2012-07-02 2017-04-11 Reckitt Benckiser Llc Pressurized, sprayable aqueous alcoholic microbicidal compositions comprising zinc ions

Also Published As

Publication number Publication date
EP1568206A2 (en) 2005-08-31
RU2005118092A (en) 2006-01-20
RU2351090C2 (en) 2009-03-27
MXPA05005049A (en) 2005-09-08
KR100993068B1 (en) 2010-11-08
US20040091161A1 (en) 2004-05-13
WO2004045199A2 (en) 2004-05-27
KR20050086512A (en) 2005-08-30
WO2004045199A3 (en) 2006-10-05
US7274823B2 (en) 2007-09-25
JP2006517354A (en) 2006-07-20
CN100520810C (en) 2009-07-29
CN1947127A (en) 2007-04-11
AU2003291506A1 (en) 2004-06-03
BR0316185A (en) 2005-09-27
EP1568206A4 (en) 2010-12-22

Similar Documents

Publication Publication Date Title
US6141693A (en) Method and apparatus for extracting digital data from a video stream and using the digital data to configure the video stream for display on a television set
US7155067B2 (en) Adaptive edge detection and enhancement for image processing
US20080013838A1 (en) System and method for processing symbols embedded in digital video
JP3703195B2 (en) Color image transmission device
JPWO2005081515A1 (en) Image compression method, image compression apparatus, image transmission system, data compression pre-processing apparatus, and computer program
EP1847956A1 (en) Image processing device
JP2001056673A (en) Technique for displaying complicated character
US20050253967A1 (en) Broadcast receiving apparatus and broadcast receiving method
US7613349B2 (en) Image processing apparatus and method
JP2004193818A (en) Wireless image transmission/reception system, wireless image transmission apparatus, and wireless image reception apparatus
JP2005020734A (en) Raster image path architecture
US8077987B2 (en) Methods and apparatus for processing of a digital image
US6611626B1 (en) Method of compressing JPEG files using a conditional transform
US7643182B2 (en) System and method for processing image data
JP3122481B2 (en) Image encoding apparatus and image encoding method
US6301384B1 (en) Low pass blurring image filter for filtering only green
JP2003348620A (en) Compression device for video signal and its method, and restoring device for video signal and its method
JP2004260415A (en) Method for encoding image and method for decoding image
JP2008113124A (en) Image processor of motion picture data
WO2002019724A1 (en) Improved color image data processing apparatus and method
JP2008113267A (en) Image data transmission system and method
Du et al. New algorithm for preserving edges and removing compression artifacts in low bit-rates wavelet based image
JP2009506661A (en) Graphical scaling method of LCD in mobile television device

Legal Events

Date Code Title Description
AS Assignment

Owner name: QUALCOMM INCORPORATED, CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:LANE, RICHARD D.;SILBERGER, AMNON;REEL/FRAME:019621/0823

Effective date: 20070724

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION