WO2005107234A1 - Method and apparatus to provide efficient multimedia content storage - Google Patents

Publication number
WO2005107234A1
WO2005107234A1 PCT/IB2005/001184 IB2005001184W WO2005107234A1 WO 2005107234 A1 WO2005107234 A1 WO 2005107234A1 IB 2005001184 W IB2005001184 W IB 2005001184W WO 2005107234 A1 WO2005107234 A1 WO 2005107234A1
Authority
WO
WIPO (PCT)
Prior art keywords
file
image data
image
base
contextually
Application number
PCT/IB2005/001184
Other languages
French (fr)
Other versions
WO2005107234B1 (en
Inventor
Tao Wu
Original Assignee
Nokia Corporation
Nokia, Inc.
Application filed by Nokia Corporation, Nokia, Inc.
Priority to EP05740451A (EP1745639A1)
Priority to CN2005800202407A (CN1973529B)
Publication of WO2005107234A1
Publication of WO2005107234B1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51 Motion estimation or motion compensation
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00 Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/41 Bandwidth or redundancy reduction
    • H04N1/411 Bandwidth or redundancy reduction for the transmission or storage or reproduction of two-tone pictures, e.g. black and white pictures
    • H04N1/413 Systems or arrangements allowing the picture to be reproduced without loss or modification of picture-information
    • H04N1/417 Systems or arrangements allowing the picture to be reproduced without loss or modification of picture-information using predictive or differential encoding
    • H04N1/4177 Systems or arrangements allowing the picture to be reproduced without loss or modification of picture-information using predictive or differential encoding encoding document change data, e.g. form drop out data
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/141 Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/147 Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals

Definitions

  • This invention relates generally to the efficient use of content storage in multimedia devices and, more specifically, relates to techniques for capturing and storing images in systems having limited memory storage capabilities such as digital cameras and devices containing digital cameras, including modern cellular telephones and personal communicators.
  • digital cameras and camera phones are typically multi-purpose devices where a number of applications can be required to share a single memory card.
  • one commercially-available digital camera is capable of recording video for up to three minutes, at a data rate of approximately 10 MB/minute.
  • the use of video recording substantially reduces the memory usable for photography.
  • as many camera phones are expected to offer video recording capability in the near future, the same problems will be experienced.
  • File system compression has been used in operating systems to reduce memory (disk) usage.
  • image files are typically compressed (using a lossy compression method such as JPEG compression) before being stored in the file system. As a result, they cannot be effectively further compressed using typical file compression tools.
  • Cache-based compaction uses the similarity of two web objects to reduce the amount of the network traffic.
  • Cache-based compaction works as follows: a web client requests a URL via a web proxy. The proxy fetches the URL on behalf of the client. The proxy then computes the difference of the requested web object with the most similar web object currently cached in the web client, and transmits only the difference. The client then restores the requested web object by combining the cached object and the difference.
  • Version control systems are also known in the prior art, and are widely used in the software industry to track changes in source code. Version control systems may use a technique known as delta compression to compactly store a later version of a file by storing only the difference of the later file version relative to an earlier version of the file.
  • a method and a device that includes a programmed data processor, to process image data.
  • the method includes, for a plurality n of files each containing image data representing one of n images, selecting one file as a base file; selecting as a target file an image data file that is contextually-related to the base file; comparing the target file and the base file to determine differences therebetween; and storing the target file as a reduced file that is a representation of differences between the image data of the target file and the image data of the base file.
  • An image data file is selected as being contextually-related to the base file based on at least an image capture location, and/or on an image capture time.
  • An image data file can also be selected as being contextually-related to the base file based on a user input, such as by the user manually selecting a target image file.
  • An image data file may also be selected as being contextually-related to the base file based on information received from an image capture device other than an image capture device that generated the image data file.
  • storing is performed in a memory device that is a part of a wireless communications device, such as a cellular telephone, or a personal communicator, or a personal digital assistant (PDA), or any other type of user device, equipment or terminal that includes a digital camera and some type of wireless (RF or optical) communications capability.
  • the method can further include transmitting the reduced file to a destination using a wireless link.
  • the wireless link may include a cellular communication channel, or a short range RF (e.g., Bluetooth) or IR communications link.
  • this invention pertains also to computer programs executable by digital data processors, such as general purpose data processors.
  • the comparing of the target and base files includes partitioning the target file into non-overlapping blocks of pixels; for each block in the target file, finding a best matching block in the base file; representing a block in the target file using a relative location of the best matching block in the base file and as a difference between the blocks; and encoding the difference between the blocks.
  • Fig. 1 is a simplified block diagram of a camera phone and related wireless system that is suitable for practicing this invention
  • Fig. 2 shows a mathematical equation that is referred to in a discussion of a motion-compensation prediction technique that may be used to compress image files in accordance with this invention
  • Fig. 3 is a logic flow diagram of an image compression technique.
  • This invention exploits the redundancy that can exist between some captured images to reduce the storage space for images.
  • two images (as a minimum) may be considered to be contextually-related if, as non-limiting examples, one or more of the following criteria are true:
  • a) two images are captured within some short interval of time; b) two images are captured at about the same location, with the image capture device being pointed at about the same azimuth and elevation; and c) a user declares or specifies that two images are contextually-related.
  • Before describing this invention in further detail, reference is first made to Fig. 1 for showing an embodiment of a wireless communications system 10 that includes a cellular telephone or mobile station 100 that includes a digital camera 125, also referred to herein as a camera phone.
  • teachings of this invention can be applied to any device that contains a digital camera, as well as to digital cameras per se.
  • Fig. 1 shows a simplified block diagram of an embodiment of the wireless communications system 10 that is suitable for practicing this invention.
  • the wireless communications system 10 includes at least one of the mobile stations (MSs) 100.
  • the mobile station 100 may be a handheld radiotelephone, such as a cellular telephone or a personal communicator.
  • the mobile station 100 could also be contained within a card or module that is connected during use to another device, or that is wearable by the user.
  • Fig. 1 also shows an exemplary network operator 20 having, for example, a node 30 for connecting to a telecommunications network, such as a Public Packet Data Network or PDN, at least one base station controller (BSC) 40 or equivalent apparatus, and a plurality of base transceiver stations (BTS) 50, also referred to as base stations (BSs), that transmit in a forward or downlink direction both physical and logical channels to the mobile station 100 in accordance with a predetermined air interface standard.
  • BSC base station controller
  • BTS base transceiver stations
  • BSs base stations
  • a reverse or uplink communication path also exists from the mobile station 100 to the network operator, which conveys mobile originated access requests and traffic.
  • a cell 3 is associated with each BTS 50, where one cell will at any given time be considered to be a serving cell, while an adjacent cell(s) will be considered to be a neighbor cell. Smaller cells (e.g., picocells) may also be available.
  • the air interface standard can conform to any suitable standard or protocol, and may enable both voice and data traffic, such as data traffic enabling Internet 70 access and web page downloads.
  • the air interface standard may be compatible with a code division multiple access (CDMA) air interface standard, such as one known as cdma2000, although this is not a limitation upon the practice of this invention.
  • CDMA code division multiple access
  • the mobile station 100 typically includes a control unit or control logic, such as a microcontrol unit (MCU) 120 having an output coupled to an input of a display 140 and an input coupled to an output of a keyboard or keypad 160.
  • MCU microcontrol unit
  • the MCU 120 is assumed to include or be coupled to some type of a memory 130, including a non-volatile memory for storing an operating program and other information, as well as a volatile memory for temporarily storing required data, scratchpad memory, received packet data, packet data to be transmitted, and the like.
  • the operating program is assumed, for the purposes of this invention, to enable the MCU 120 to execute the software routines, layers and protocols required to implement the methods in accordance with this invention, as well as to provide a suitable user interface (UI), via display 140 and keypad 160, with a user.
  • a microphone and speaker are typically provided for enabling the user to conduct voice calls in a conventional manner.
  • the mobile station 100 also contains a wireless section that includes a digital signal processor (DSP) 180, or equivalent high speed processor or logic, as well as a wireless transceiver that includes a transmitter 200 and a receiver 220, both of which are coupled to an antenna 240 for communication with the network operator.
  • DSP digital signal processor
  • Data such as digitized voice and packet data, is transmitted and received through the antenna 240.
  • the MS 100 includes the camera 125 having a fixed or moveable lens (e.g., a zoom lens) 128.
  • the camera 125 may also include a separate memory (MEM) 123 for the local storage of captured images, or the memory 130 may be used for this purpose.
  • the image memory 123 may be embodied as a modular and detachable device, enabling a full memory device to be removed and replaced with an empty memory device.
  • the MS 100 may also include a location determining device or sub-system, such as a Global Positioning System (GPS) receiver 150.
  • the MS 100 may also include short range communication capability, such as a low power RF (e.g., Bluetooth) interface and/or an optical (e.g., IR) interface, shown collectively as the interface (I/F) 155.
  • the I/F 155 may comprise a second wireless transceiver, either RF or IR, for enabling local communications capability for the MS 100.
  • the I/F 155 may comprise a wired transceiver interface, such as a high speed serial or parallel data link. It is noted that in some devices all wireless communication may take place through the interface 155 (e.g., all wireless communications may be Bluetooth-enabled), such as in non-camera phone devices.
  • regarding a camera phone 100 in which to practice this invention, it is noted that photographers typically take a number of pictures of the same object under different lighting, focus, composition and other conditions in order to obtain the best result. These pictures usually bear substantial similarity with each other. Photographers then typically select the best picture using a PC or other display-capable device after they return to the studio or home. As a result, the camera must store all of these pictures, even though the corresponding image files exhibit a significant amount of redundancy. For example, there may be five pictures of the same object taken from the same vantage point, but with five different illumination conditions. In addition, non-professional consumer photographers often take multiple pictures of a similar composition or background. For example, two friends may take turns taking pictures of each other with some landmark or landscape in the background. These pictures, and the corresponding image files, can also exhibit significant redundancy, i.e., they can be contextually-related.
  • the user may manually specify a set of image files having members that are similar and are thus suitable for being compressed using inter-image compression in accordance with an aspect of this invention. Although simple, this technique requires user input and a suitable user interface, and may be inconvenient and intrusive to implement.
  • This algorithmic technique, based on image processing software, although unobtrusive to the user, may be too computationally expensive to be achieved in a device having limited computational and power resources, such as a battery-powered digital camera or the camera phone 100.
  • Location information can be made available in the camera phone by means of the GPS receiver 150, and/or by cellular, wireless local area network (WLAN) and other locating technologies. To supplement the location information it may be desirable to also provide a system that outputs azimuth information, such as a digital compass, and possibly also the elevation of the line-of-sight (LOS) of the camera 125.
  • An accelerometer can be used for this purpose to give an indication of the inclination of the camera phone 100 relative to the local normal.
  • if the lens 128 can be pointed in different directions, then the current pointing direction can also be made available.
  • the azimuth and/or elevation information can be used to supplement the location information, and if either or both are available may be considered to form a part of the location information.
  • two image files that are captured at the same location, with the same camera pointing direction azimuth and elevation are more likely to be contextually related than two image files captured at the same location, but with the camera 125 pointed in two different directions (e.g., one picture taken pointing North and a second taken pointed East), or with the camera 125 pointed in different elevation directions (e.g., one picture taken with the lens 128 pointed down at 45 degrees from the local horizontal, while the second picture is taken with the lens 128 pointed up at 45 degrees).
  • Time of day (TOD) information is typically available from digital cameras and, if not, may be made available external to the camera 125, such as by using a TOD clock maintained by the MCU 120, or one that is maintained external to the camera phone 100.
  • supplemental information can also be used. For example, if a group is traveling together, and if group members have Bluetooth-enabled camera phones 100, i.e., camera phones having short-range RF (or IR) communication capabilities with one another, then such information can be recorded at the time of image capture and later used as guidance for image comparison. This type of information can be used as an aid in identifying the same or similar foreground object(s) (the figures or human faces) among different pictures.
  • the inter-image compression can be performed in various ways.
  • the compression may be lossless or lossy, and may operate in the time domain or in the transformation domain.
  • some concepts used in motion picture compression such as inter-frame compression and motion compensation, may be used to reduce the inter-image redundancy.
  • One difference between this invention and motion picture compression is that the invention does not require that two images have a temporal relationship to each other.
  • this invention may include a process of selecting which image to use as a base, which is not done in conventional motion picture compression.
  • "motion compensation" is used herein merely to encode the difference between images efficiently, as there may not in fact be any physical movement between the base image and an image being compared to the base image.
  • This invention differs from conventional file system compression that typically operates to compress a file by reducing the redundancy within the file.
  • image file compression of this invention reduces inter-file redundancy.
  • file system compression techniques are generally not effective for pre-compressed data, such as JPEG images.
  • This invention differs from conventional cache-based compaction in several significant ways as well.
  • this invention requires only one device, while two devices (the proxy and the client) are involved in cache-based compaction.
  • in cache-based compaction the only information used is the similarity in the URL, while this invention is capable of using a richer and more varied set of information, such as image capture location and/or time, among others.
  • the image file compression operation is performed by software executed by the MCU 120, or by the DSP 180, or by a processor that is internal to the camera 125.
  • the program may run periodically (e.g., every hour) or it may be event-triggered (e.g., when the image memory 123 is 80% full). Alternatively, a user may run the program manually.
  • This invention may also be implemented through the use of a computer program that is executed by at least one general purpose computer.
  • An embodiment of a technique is now described to compress one image (B), also referred to as a target file, based on another image (A), also referred to as a base file. Note that it is preferred to deal with blocks of pixels in each image. Suitable block sizes may be, but are not limited to, 4X4 pixels, 8X8 pixels and 16X16 pixels. Reference is also made to the logic flow diagram of Fig. 3, which may also be viewed as a block diagram of interconnected circuit elements and/or logical units for performing the compression task. Combinations of software elements, circuit elements and/or logical units may also be employed.
  • Step A Decompress (if already compressed) images A and B read out from the image memory 123.
  • Step B Partition B into non-overlapping blocks of pixels (e.g., into non-overlapping blocks of 16X16 pixels).
  • Step C For each block in image B, find the best matching block in image A. This procedure can proceed in a manner similar to a motion compensation technique that is described in: http://icsl.ee.washington.edu/~woobin/papers/General/node5.html.
  • motion-compensated prediction assumes that a current image may be locally modeled as a translation of the images of some previous time.
  • each image is divided into blocks of 16X16 pixels, referred to as a macroblock.
  • Each macroblock is predicted from the previous or future frame, by estimating the amount of motion in the macroblock during the frame time interval.
  • the MPEG syntax specifies how to represent the motion information for each macroblock. It does not, however, specify how such vectors are to be computed. Due to the block-based motion representation, many implementations use block-matching techniques, where the motion vector is obtained by minimizing a cost function measuring the mismatch between the reference and the current block.
  • AE absolute difference
  • f(i,j) represents a block of 16X16 pixels (a macroblock) from the current image
  • g(i,j) represents the same macroblock from a reference image.
  • the reference macroblock is displaced by a vector (d_x, d_y), representing the search location.
  • AE is calculated at several locations in the search range.
  • TSS Three-Step Search
  • Step D Represent the block in B using the relative location of the matching block in A and the difference between blocks B and A.
  • Motion compensation, a method used in MPEG to encode and decode the difference, is described below. It is noted, however, that other lossy and lossless encoding methods can be used to encode the difference, and are within the scope of this invention.
  • the difference between two images which may be referred to as a prediction error
  • a prediction error may be encoded in a manner similar to the JPEG technique (DCT, quantization, followed by entropy coding).
  • DCT JPEG Digital Video-Coding Standards
  • P-pictures interframe prediction
  • the previously coded I- or P-picture frame N-1 is stored in a frame store (FS) in both the encoder and the decoder.
  • Motion compensation is performed on a macroblock basis, and one motion vector is estimated between frame N and frame N-1 for a particular macroblock to be encoded. These motion vectors are coded and transmitted to the receiver.
  • the motion-compensated prediction error is calculated by subtracting each pel in a macroblock with its motion-shifted counterpart in the previous frame.
  • An 8 X 8 discrete cosine transform (DCT) is then applied to each of the 8 X 8 blocks contained in the macroblock, followed by quantization (Q) of the DCT coefficients with subsequent run-length coding and entropy coding (VLC).
  • DCT discrete cosine transform
  • Step E Store the encoded difference obtained from Step D in the image memory 123 as a compressed or reduced image data file.
  • the reduced image B not only requires less storage space in the image memory 123, but the transfer of the reduced image B over the wireless link (either the cellular link or a local link (e.g., a Bluetooth link)) requires less bandwidth and can be achieved in a more rapid manner than would be the case with the uncompressed, original target image B.
  • the decompression operation can take place at the destination device or system, assuming that the parameters necessary for decompressing the image are also transferred. It is assumed in this case that the receiving device has a copy of the base image in order to decode the target image.
  • the foregoing procedure is illustrative of a suitable technique for compressing images with slight movements, and is similar to MPEG.
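As a non-limiting illustration, Steps B through E above, together with the decoding at a destination that holds the base image, might be sketched as follows. The sketch makes several assumptions that are not taken from the text: images are equal-sized grayscale arrays whose dimensions are multiples of the block size, the search is exhaustive over a small window rather than a Three-Step Search, and the residual is kept losslessly instead of being DCT-coded. All function and parameter names are hypothetical.

```python
from typing import List, Tuple

Image = List[List[int]]  # grayscale image as rows of pixel values (assumption)
Record = Tuple[int, int, int, int, List[List[int]]]  # (ty, tx, dy, dx, residual)


def absolute_error(target: Image, base: Image, ty: int, tx: int,
                   by: int, bx: int, n: int) -> int:
    """Sum of absolute differences between an n x n target block and base block."""
    return sum(abs(target[ty + r][tx + c] - base[by + r][bx + c])
               for r in range(n) for c in range(n))


def encode(target: Image, base: Image, n: int = 4, search: int = 2) -> List[Record]:
    """Steps B-D: per target block, keep a displacement into the base and a residual."""
    height, width = len(target), len(target[0])
    records: List[Record] = []
    for ty in range(0, height, n):          # Step B: non-overlapping blocks
        for tx in range(0, width, n):
            best = None
            # Step C: exhaustive search within +/- `search` pixels.
            for dy in range(-search, search + 1):
                for dx in range(-search, search + 1):
                    by, bx = ty + dy, tx + dx
                    if 0 <= by <= height - n and 0 <= bx <= width - n:
                        cost = absolute_error(target, base, ty, tx, by, bx, n)
                        if best is None or cost < best[0]:
                            best = (cost, dy, dx)
            _, dy, dx = best  # displacement (0, 0) is always a valid candidate
            # Step D: relative location plus block difference.
            residual = [[target[ty + r][tx + c] - base[ty + dy + r][tx + dx + c]
                         for c in range(n)] for r in range(n)]
            records.append((ty, tx, dy, dx, residual))
    return records  # Step E would write these records to the image memory


def decode(records: List[Record], base: Image, height: int, width: int) -> Image:
    """Rebuild the target from the base image and the stored records."""
    out = [[0] * width for _ in range(height)]
    for ty, tx, dy, dx, residual in records:
        n = len(residual)
        for r in range(n):
            for c in range(n):
                out[ty + r][tx + c] = base[ty + dy + r][tx + dx + c] + residual[r][c]
    return out
```

Because the residual here is stored exactly, decoding reproduces the target image without loss; a lossy variant would instead transform and quantize each residual block before storage.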

Abstract

Disclosed is a method and a device, that includes a programmed data processor, to process image data. The method includes, for a plurality n of files each containing image data representing one of n images, selecting one file as a base file; selecting as a target file an image data file that is contextually-related to the base file; comparing the target file and the base file to determine differences therebetween; and storing the target file as a reduced file that is a representation of differences between the image data of the target file and the image data of the base file. An image data file is selected as being contextually-related to the base file based on at least an image capture location, and/or on an image capture time, or based on a user input. Storing can be performed in a memory device that is a part of a wireless communications device, such as a cellular telephone or a personal communicator that includes a digital camera, such as a camera phone.

Description

METHOD AND APPARATUS TO PROVIDE EFFICIENT MULTIMEDIA CONTENT STORAGE
TECHNICAL FIELD:
This invention relates generally to the efficient use of content storage in multimedia devices and, more specifically, relates to techniques for capturing and storing images in systems having limited memory storage capabilities such as digital cameras and devices containing digital cameras, including modern cellular telephones and personal communicators.
BACKGROUND:
The use of digital cameras is spreading rapidly, and the capabilities and performance of digital photography equipment (including digital cameras and so-called camera phones) are also increasing rapidly. The image resolution of camera phones is expected to follow an exponential growth curve, with 2-megapixel (two million picture elements) image resolution cameras now available in camera phones.
The increasing camera resolution imposes more stringent requirements on image storage subsystems. For example, a picture taken by a 4-megapixel camera can require up to 2 MB (two million bytes) of storage space. On the other hand, although the capacity of memory cards (the subsystem responsible for image storage) is also increasing, there are many occasions that the memory card becomes full before the stored images can be transferred to another device (e.g., to a personal computer or PC). In these situations, new pictures cannot be taken due to lack of storage. When the memory card is full, the typical consumer faces a difficult choice of purchasing at least one additional expensive memory card (which is not always feasible, depending on the user's location) or deleting one or more stored images.
Furthermore, digital cameras and camera phones are typically multi-purpose devices where a number of applications can be required to share a single memory card. For example, one commercially-available digital camera is capable of recording video for up to three minutes, at a data rate of approximately 10 MB/minute. Thus, the use of video recording substantially reduces the memory usable for photography. As it is expected that many camera phones will offer video recording capability in the near future, the same problems will be experienced.
File system compression has been used in operating systems to reduce memory (disk) usage. However, such systems do not address the problems considered herein, since image files are typically compressed (using a lossy compression method such as JPEG compression) before being stored in the file system. As a result, they cannot be effectively further compressed using typical file compression tools.
Another type of conventional compression, known as cache-based compaction, uses the similarity of two web objects to reduce the amount of the network traffic. Cache-based compaction works as follows: a web client requests a URL via a web proxy. The proxy fetches the URL on behalf of the client. The proxy then computes the difference of the requested web object with the most similar web object currently cached in the web client, and transmits only the difference. The client then restores the requested web object by combining the cached object and the difference.
Version control systems are also known in the prior art, and are widely used in the software industry to track changes in source code. Version control systems may use a technique known as delta compression to compactly store a later version of a file by storing only the difference of the later file version relative to an earlier version of the file.
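By way of a non-limiting illustration, delta compression of a later file version relative to an earlier one might be sketched as follows. This is only a minimal sketch built on the Python standard library's `difflib.SequenceMatcher`; it is not the method of any particular version control system, and the names `make_delta` and `apply_delta` are hypothetical.

```python
from difflib import SequenceMatcher
from typing import List, Tuple

Op = Tuple  # ("copy", start, end) or ("add", text)


def make_delta(old: str, new: str) -> List[Op]:
    """Describe `new` as copies of ranges of `old` plus literal additions."""
    ops: List[Op] = []
    matcher = SequenceMatcher(None, old, new, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            ops.append(("copy", i1, i2))     # reuse old[i1:i2]
        elif tag in ("replace", "insert"):
            ops.append(("add", new[j1:j2]))  # store only the new text
        # "delete": old[i1:i2] is simply not copied
    return ops


def apply_delta(old: str, ops: List[Op]) -> str:
    """Restore the later version from the earlier version and the delta."""
    parts = []
    for op in ops:
        parts.append(old[op[1]:op[2]] if op[0] == "copy" else op[1])
    return "".join(parts)
```

Only the "add" operations carry new data; everything else is a reference back into the earlier version, which is why the stored delta can be much smaller than the later file.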
For various reasons explained below, these conventional file size reduction techniques are not suitable for use with image files generated by an image capture device, such as a digital camera that is used alone or as part of another device, such as a camera phone.
SUMMARY OF THE PREFERRED EMBODIMENTS
The foregoing and other problems are overcome, and other advantages are realized, in accordance with the presently preferred embodiments of these teachings. Disclosed is a method and a device, that includes a programmed data processor, to process image data. The method includes, for a plurality n of files each containing image data representing one of n images, selecting one file as a base file; selecting as a target file an image data file that is contextually-related to the base file; comparing the target file and the base file to determine differences therebetween; and storing the target file as a reduced file that is a representation of differences between the image data of the target file and the image data of the base file. An image data file is selected as being contextually-related to the base file based on at least an image capture location, and/or on an image capture time. An image data file can also be selected as being contextually-related to the base file based on a user input, such as by the user manually selecting a target image file. An image data file may also be selected as being contextually-related to the base file based on information received from an image capture device other than an image capture device that generated the image data file.
In a presently preferred, but non-limiting embodiment of this invention storing is performed in a memory device that is a part of a wireless communications device, such as a cellular telephone, or a personal communicator, or a personal digital assistant (PDA), or any other type of user device, equipment or terminal that includes a digital camera and some type of wireless (RF or optical) communications capability. In this case the method can further include transmitting the reduced file to a destination using a wireless link. The wireless link may include a cellular communication channel, or a short range RF (e.g., Bluetooth) or IR communications link.
However, it will be made apparent below that devices and equipment having wired communications capabilities, such as PCs connected to a data communications network through a wire or cable, can also benefit from the use of this invention. In general, this invention pertains also to computer programs executable by digital data processors, such as general purpose data processors.
In a non-limiting embodiment the comparing of the target and base files includes partitioning the target file into non-overlapping blocks of pixels; for each block in the target file, finding a best matching block in the base file; representing a block in the target file using a relative location of the best matching block in the base file and as a difference between the blocks; and encoding the difference between the blocks.
BRIEF DESCRIPTION OF THE DRAWINGS
The foregoing and other aspects of these teachings are made more evident in the following Detailed Description of the Preferred Embodiments, when read in conjunction with the attached Drawing Figures, wherein:
Fig. 1 is a simplified block diagram of a camera phone and related wireless system that is suitable for practicing this invention;
Fig. 2 shows a mathematical equation that is referred to in a discussion of a motion-compensation prediction technique that may be used to compress image files in accordance with this invention; and
Fig. 3 is a logic flow diagram of an image compression technique.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
This invention exploits the redundancy that can exist between some captured images to reduce the storage space for images. In this regard two images (as a minimum) may be considered to be contextually-related if, as non-limiting examples, one or more of the following criteria are true:
a) two images are captured within some short interval of time;
b) two images are captured at about the same location, with the image capture device being pointed at about the same azimuth and elevation; and
c) a user declares or specifies that two images are contextually-related.
If any one of these several criteria is true then there can exist a significant degree of similarity or redundancy among image files stored in a memory, such as the memory of a digital camera or a digital camera phone. This invention exploits this potential for image similarity by using one image file as a "base" or "reference", and by compressing another image file by storing only the difference between the two image files. Two related aspects of this invention relate to identifying image files that are most similar to each other (those that are contextually-related); and performing inter-image compression.
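The idea of keeping one image as a base and storing another only as a difference can be sketched as follows. This is a minimal, lossless illustration and not the encoding described later in this document: the function names `encode_target` and `decode_target`, the use of `zlib` as the entropy coder, and the two synthetic 64x64 grayscale images are all assumptions made for the example.

```python
import zlib


def encode_target(base: bytes, target: bytes) -> bytes:
    """Return a reduced representation of `target` relative to `base`."""
    if len(base) != len(target):
        raise ValueError("this sketch assumes two images of identical size")
    # Per-pixel difference modulo 256: fits in one byte and is exactly reversible.
    residual = bytes((t - b) % 256 for t, b in zip(target, base))
    return zlib.compress(residual)


def decode_target(base: bytes, reduced: bytes) -> bytes:
    """Rebuild the target image from the base image and the reduced file."""
    residual = zlib.decompress(reduced)
    return bytes((b + r) % 256 for b, r in zip(base, residual))


# Two hypothetical 64x64 grayscale images that differ in one small region.
base_image = bytes((7 * x + 13 * y) % 256 for y in range(64) for x in range(64))
target_pixels = bytearray(base_image)
for i in range(100, 120):
    target_pixels[i] = (target_pixels[i] + 5) % 256
target_image = bytes(target_pixels)

reduced_file = encode_target(base_image, target_image)
restored = decode_target(base_image, reduced_file)
```

Because the two images are nearly identical, the residual is almost entirely zero and compresses far better than the target image compressed on its own. The base image must remain available for the target to be restored.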
Before describing this invention in further detail, reference is first made to Fig. 1 for showing an embodiment of a wireless communications system 10 that includes a cellular telephone or mobile station 100 that includes a digital camera 125, also referred to herein as a camera phone. However, while described in the context of this presently most-preferred embodiment, it should be appreciated that the teachings of this invention can be applied to any device that contains a digital camera, as well as to digital cameras per se.
Fig. 1 shows a simplified block diagram of an embodiment of the wireless communications system 10 that is suitable for practicing this invention. The wireless communications system 10 includes at least one of the mobile stations (MSs) 100. The mobile station 100 may be a handheld radiotelephone, such as a cellular telephone or a personal communicator. The mobile station 100 could also be contained within a card or module that is connected during use to another device, or that is wearable by the user.
Fig. 1 also shows an exemplary network operator 20 having, for example, a node 30 for connecting to a telecommunications network, such as a Public Packet Data Network or PDN, at least one base station controller (BSC) 40 or equivalent apparatus, and a plurality of base transceiver stations (BTS) 50, also referred to as base stations (BSs), that transmit in a forward or downlink direction both physical and logical channels to the mobile station 100 in accordance with a predetermined air interface standard. A reverse or uplink communication path also exists from the mobile station 100 to the network operator, which conveys mobile originated access requests and traffic. A cell 3 is associated with each BTS 50, where one cell will at any given time be considered to be a serving cell, while an adjacent cell(s) will be considered to be a neighbor cell. Smaller cells (e.g., picocells) may also be available.
The air interface standard can conform to any suitable standard or protocol, and may enable both voice and data traffic, such as data traffic enabling Internet 70 access and web page downloads. As an example, the air interface standard may be compatible with a code division multiple access (CDMA) air interface standard, such as one known as cdma2000, although this is not a limitation upon the practice of this invention.
The mobile station 100 typically includes a control unit or control logic, such as a microcontrol unit (MCU) 120 having an output coupled to an input of a display 140 and an input coupled to an output of a keyboard or keypad 160. The MCU 120 is assumed to include or be coupled to some type of a memory 130, including a non-volatile memory for storing an operating program and other information, as well as a volatile memory for temporarily storing required data, scratchpad memory, received packet data, packet data to be transmitted, and the like. The operating program is assumed, for the purposes of this invention, to enable the MCU 120 to execute the software routines, layers and protocols required to implement the methods in accordance with this invention, as well as to provide a suitable user interface (UI), via display 140 and keypad 160, with a user. Although not shown, a microphone and speaker are typically provided for enabling the user to conduct voice calls in a conventional manner.
The mobile station 100 also contains a wireless section that includes a digital signal processor (DSP) 180, or equivalent high speed processor or logic, as well as a wireless transceiver that includes a transmitter 200 and a receiver 220, both of which are coupled to an antenna 240 for communication with the network operator. At least one local oscillator, such as a frequency synthesizer (SYNTH) 260, is provided for tuning the transceiver. Data, such as digitized voice and packet data, is transmitted and received through the antenna 240.
In this invention the MS 100 includes the camera 125 having a fixed or moveable lens (e.g., a zoom lens) 128. The camera 125 may also include a separate memory (MEM) 123 for the local storage of captured images, or the memory 130 may be used for this purpose. The image memory 123 may be embodied as a modular and detachable device, enabling a full memory device to be removed and replaced with an empty memory device.
The MS 100 may also include a location determining device or sub-system, such as a global positioning system (GPS) receiver 150. The MS 100 may also include short range communication capability, such as a low power RF (e.g., a Bluetooth) interface and/or an optical (e.g., an IR) interface, shown collectively as the interface (I/F) 155. In general, the I/F 155 may comprise a second wireless transceiver, either RF or IR, for enabling local communications capability for the MS 100. Alternatively, the I/F 155 may comprise a wired transceiver interface, such as a high speed serial or parallel data link. It is noted that in some devices all wireless communication may take place through the interface 155 (e.g., all wireless communications may be Bluetooth-enabled), such as in non-camera phone devices.
Having thus described one suitable but non-limiting embodiment of a camera phone 100 in which to practice this invention, it is noted that photographers typically take a number of pictures of the same object under different lighting, focus, composition and other conditions in order to obtain the best result. These pictures usually bear substantial similarity with each other. Photographers then typically select the best picture using a PC or other display-capable device after they return to the studio or home. As a result, the camera must store all of these pictures, even though the corresponding image files exhibit a significant amount of redundancy. For example, there may be five pictures of the same object taken from the same vantage point, but with five different illumination conditions. In addition, non-professional consumer photographers often take multiple pictures of a similar composition or background. For example, two friends may take turns taking pictures of each other with some landmark or landscape in the background. These pictures, and the corresponding image files, can also exhibit significant redundancy, i.e., they can be contextually-related.
There are several possible techniques for identifying contextually-related image files. These include, but need not be limited to, the following exemplary techniques.
A) Manual selection by the user. The user may manually specify a set of image files having members that are similar and are thus suitable for being compressed using inter-image compression in accordance with an aspect of this invention. Although simple, this technique requires user input and a suitable user interface, and may be inconvenient and intrusive to implement.
B) Exhaustive automatic comparison between all image files. This algorithmic technique, based on image processing software, although unobtrusive to the user, may be too computationally expensive to be achieved in a device having limited computational and power resources, such as a battery-powered digital camera or the camera phone 100.
C) Employ additional or supplemental information to facilitate the process of identifying similar and contextually-related image files. Examples of supplemental information can include, but need not be limited to, the following.
C1) If the location of where each image file is captured is available, then two image files captured at the same location are more likely to be similar than two image files captured at different locations. Location information can be made available in the camera phone by means of the GPS receiver 150, and/or by cellular, wireless local area network (WLAN) and other locating technologies. To supplement the location information it may be desirable to also provide a system that outputs azimuth information, such as a digital compass, and possibly also the elevation of the line-of-sight (LOS) of the camera 125. An accelerometer can be used for this purpose to give an indication of the inclination of the camera phone 100 relative to the local normal. Alternatively, if the lens 128 can be pointed in different directions then the current pointing direction can also be made available. The azimuth and/or elevation information can be used to supplement the location information, and if either or both are available may be considered to form a part of the location information. In general, it is assumed that two image files that are captured at the same location, with the same camera pointing direction azimuth and elevation, are more likely to be contextually related than two image files captured at the same location, but with the camera 125 pointed in two different directions (e.g., one picture taken pointing North and a second taken pointing East), or with the camera 125 pointed in different elevation directions (e.g., one picture taken with the lens 128 pointed down at 45 degrees from the local horizontal, while the second picture is taken with the lens 128 pointed up at 45 degrees).
C2) If the time when an image file is created is available, then two image files captured within a short period of time are more likely to be contextually-related. Time of day (TOD) information is typically available from digital cameras and, if not, may be made available external to the camera 125, such as by using a TOD clock maintained by the MCU 120, or one that is maintained external to the camera phone 100.
C3) Other types of supplemental information can also be used. For example, if a group is traveling together, and if group members have Bluetooth-enabled camera phones 100, i.e., camera phones having short-range RF (or IR) communication capabilities with one another, then such information can be recorded at the time of image capture and later used as guidance for image comparison. This type of information can be used as an aid in identifying the same or similar foreground object(s) (the figures or human faces) among different pictures.
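The selection techniques A), C1) and C2) above can be combined into a single test, sketched below. The record layout `CaptureInfo`, the function names, and every threshold (60 seconds, 25 metres, 20 degrees) are assumptions chosen for illustration; the text does not specify numeric limits or a particular distance formula, and the haversine formula is used here only as one common choice.

```python
import math
from dataclasses import dataclass
from typing import Optional


@dataclass
class CaptureInfo:
    """Hypothetical per-image metadata recorded at capture time."""
    timestamp: float                      # seconds since some epoch
    lat: Optional[float] = None           # degrees
    lon: Optional[float] = None           # degrees
    azimuth_deg: Optional[float] = None   # compass pointing direction
    elevation_deg: Optional[float] = None


def distance_m(a: CaptureInfo, b: CaptureInfo) -> float:
    """Great-circle distance in metres (haversine formula)."""
    p1, p2 = math.radians(a.lat), math.radians(b.lat)
    dlon = math.radians(b.lon - a.lon)
    h = math.sin((p2 - p1) / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlon / 2) ** 2
    return 2 * 6371000.0 * math.asin(math.sqrt(h))


def angle_diff_deg(a: float, b: float) -> float:
    """Smallest absolute difference between two angles, in degrees."""
    d = abs(a - b) % 360.0
    return min(d, 360.0 - d)


def contextually_related(a, b, user_marked=False,
                         max_seconds=60.0, max_metres=25.0, max_angle=20.0):
    if user_marked:                                   # technique A
        return True
    if abs(a.timestamp - b.timestamp) <= max_seconds:  # technique C2
        return True
    have_location = None not in (a.lat, a.lon, b.lat, b.lon)
    if have_location and distance_m(a, b) <= max_metres:  # technique C1
        # When pointing directions are known they must roughly agree.
        for x, y in ((a.azimuth_deg, b.azimuth_deg),
                     (a.elevation_deg, b.elevation_deg)):
            if x is not None and y is not None and angle_diff_deg(x, y) > max_angle:
                return False
        return True
    return False


north = CaptureInfo(0.0, 60.17, 24.94, azimuth_deg=0.0, elevation_deg=0.0)
north_later = CaptureInfo(3600.0, 60.17, 24.94, azimuth_deg=5.0, elevation_deg=0.0)
east_later = CaptureInfo(3600.0, 60.17, 24.94, azimuth_deg=90.0, elevation_deg=0.0)
```

Here `north` and `north_later` are treated as related because they share a location and pointing direction, while `north` and `east_later` are not, mirroring the North versus East example in C1).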
The inter-image compression can be performed in various ways. The compression may be lossless or lossy, and may operate in the time domain or in the transformation domain. Specifically, some concepts used in motion picture compression, such as inter-frame compression and motion compensation, may be used to reduce the inter-image redundancy. One difference between this invention and motion picture compression, however, is that the invention does not require that two images have a temporal relationship to each other. In contrast, there are typically strict timing constraints imposed between two frames compressed using inter-frame compression in video compression. Also, this invention may include a process of selecting which image to use as a base, which is not done in conventional motion picture compression. Thus, "motion compensation" is used herein merely to encode the difference between images efficiently, as there may not in fact be any physical movement between the base image and an image being compared to the base image.
This invention differs from conventional file system compression that typically operates to compress a file by reducing the redundancy within the file. In contrast, the image file compression of this invention reduces inter-file redundancy. In addition, and as was mentioned previously, file system compression techniques are generally not effective for pre-compressed data, such as JPEG images.
This invention differs from conventional cache-based compaction in several significant ways as well. First, this invention requires only one device, while two devices (the proxy and the client) are involved in cache-based compaction. Second, the information used in the two techniques to aid in identifying similarity is quite different. In cache-based compaction, the only information used is the similarity in the URL, while this invention is capable of using a richer and more varied set of information, such as image capture location and/or time, among others.
In distinction to the conventional version control systems that were mentioned above, in this invention there is no notion of "different versions of the same file"; indeed, images are not acquired by manipulating a known file. Thus, the use of delta compression is not appropriate. Also, the text-based compression algorithms used in version control systems are not designed to work with image data.
In the preferred embodiment of this invention the image file compression operation is performed by software executed by the MCU 120, or by the DSP 180, or by a processor that is internal to the camera 125. The program may run periodically (e.g., every hour) or it may be event-triggered (e.g., when the image memory 123 is 80% full). Alternatively, a user may run the program manually. This invention may also be implemented through the use of a computer program that is executed by at least one general purpose computer.
An embodiment of a technique is now described to compress one image (B), also referred to as a target file, based on another image (A), also referred to as a base file. Note that it is preferred to deal with blocks of pixels in each image. Suitable block sizes may be, but are not limited to, 4X4 pixels, 8X8 pixels and 16X16 pixels. Reference is also made to the logic flow diagram of Fig. 3, which may also be viewed as a block diagram of interconnected circuit elements and/or logical units for performing the compression task. Combinations of software elements, circuit elements and/or logical units may also be employed.
Step A. Decompress (if already compressed) images A and B read out from the image memory 123.
Step B. Partition B into non-overlapping blocks of pixels (e.g., into non-overlapping blocks of 16X16 pixels).
Step C. For each block in image B, find the best matching block in image A. This procedure can proceed in a manner similar to a motion compensation technique that is described in: http://icsl.ee.washington.edu/~woobin/papers/General/node5.html.
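The three ways of starting the compression program (periodic, event-triggered, manual) can be expressed as one decision function. This is a sketch only: the function name and parameters are hypothetical, and the default values simply reuse the one-hour and 80% examples given above.

```python
def should_run_compression(used_bytes, capacity_bytes, seconds_since_last_run,
                           user_requested=False,
                           fill_threshold=0.80, period_seconds=3600):
    """Decide whether the inter-image compression pass should start now."""
    if user_requested:                      # manual start by the user
        return True
    if capacity_bytes > 0 and used_bytes / capacity_bytes >= fill_threshold:
        return True                         # event trigger: image memory nearly full
    return seconds_since_last_run >= period_seconds  # periodic run
```

For example, `should_run_compression(80, 100, 10)` is true because the memory has reached the fill threshold, even though the last run was only ten seconds ago.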
More specifically, motion-compensated prediction assumes that a current image may be locally modeled as a translation of the images of some previous time. In the MPEG standard, each image is divided into blocks of 16X16 pixels, each referred to as a macroblock. Each macroblock is predicted from the previous or future frame, by estimating the amount of motion in the macroblock during the frame time interval. The MPEG syntax specifies how to represent the motion information for each macroblock. It does not, however, specify how such vectors are to be computed. Due to the block-based motion representation, many implementations use block-matching techniques, where the motion vector is obtained by minimizing a cost function measuring the mismatch between the reference and the current block. Although any cost function can be used, the most widely-used choice is the absolute difference (AE) defined as in the Equation shown in Fig. 2. In this equation, f(i, j) represents a block of 16X16 pixels (a macroblock) from the current image, and g(i, j) represents the same macroblock from a reference image. The reference macroblock is displaced by a vector (dx, dy), representing the search location. To determine the best matching macroblock that produces a minimum mismatch error, AE is calculated at several locations in the search range. The conceptually simplest, but the most compute-intensive search method, is known as the full search or exhaustive search. This search procedure evaluates the AE at every possible pixel location in the search area. In order to reduce the computational complexity, algorithms having a reduced number of search points have been developed. One such algorithm is known as a Three-Step-Search (TSS). This algorithm first evaluates the AE at the center and eight surrounding locations of a 32 x 32 search area. The location that produces the smallest AE then becomes the center of the next stage, and the search range is reduced by half. This sequence is repeated three times.
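Steps B and C, together with the full search and the Three-Step-Search, can be sketched as follows. Several points are assumptions made for this example rather than statements of Fig. 2, which is not reproduced in this text: AE is taken to be the sum of absolute pixel differences over the block, the displacement is added to the block position in the base image (the figure may use the opposite sign), images are lists of rows of integers of equal size, and the TSS starts with a step of 4 pixels. All function names are hypothetical.

```python
import random


def absolute_error(target, base, top, left, dy, dx, size):
    """AE between the target block at (top, left) and the base block displaced by (dy, dx).

    Returns None when the displaced block falls outside the base image.
    """
    bt, bl = top + dy, left + dx
    if bt < 0 or bl < 0 or bt + size > len(base) or bl + size > len(base[0]):
        return None
    return sum(abs(target[top + i][left + j] - base[bt + i][bl + j])
               for i in range(size) for j in range(size))


def partition(image, size):
    """Step B: top-left corners of the non-overlapping size x size blocks."""
    return [(t, l) for t in range(0, len(image) - size + 1, size)
            for l in range(0, len(image[0]) - size + 1, size)]


def full_search(target, base, top, left, size, radius):
    """Step C, exhaustive: evaluate AE at every displacement within the radius."""
    best = (absolute_error(target, base, top, left, 0, 0, size), 0, 0)
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            ae = absolute_error(target, base, top, left, dy, dx, size)
            if ae is not None and ae < best[0]:
                best = (ae, dy, dx)
    return best  # (AE, dy, dx)


def three_step_search(target, base, top, left, size, step=4):
    """Step C, reduced: centre plus eight neighbours, halving the step each stage."""
    best = (absolute_error(target, base, top, left, 0, 0, size), 0, 0)
    while step >= 1:
        _, cy, cx = best
        for dy in (-step, 0, step):
            for dx in (-step, 0, step):
                ae = absolute_error(target, base, top, left, cy + dy, cx + dx, size)
                if ae is not None and ae < best[0]:
                    best = (ae, cy + dy, cx + dx)
        step //= 2
    return best  # (AE, dy, dx)


# Hypothetical data: the target shows the base content displaced by (2, -3).
rng = random.Random(0)
base = [[rng.randrange(256) for _ in range(32)] for _ in range(32)]
target = [[base[y + 2][x - 3] if 0 <= y + 2 < 32 and 0 <= x - 3 < 32 else 0
           for x in range(32)] for y in range(32)]
blocks = partition(target, 8)
exhaustive = full_search(target, base, 8, 8, 8, radius=7)
fast = three_step_search(target, base, 8, 8, 8)
```

With steps of 4, 2 and 1 the Three-Step-Search evaluates at most 27 positions instead of the 225 visited by the full search over the same range of displacements, at the cost of possibly settling on a poorer match. On noise-like data such as this example it has no smooth error surface to follow, so only the exhaustive result is guaranteed to find the exact displacement.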
Step D. Represent the block in B using the relative location of the matching block in A and the difference between blocks B and A. Motion compensation, a method used in MPEG to encode and decode the difference, is described below. It is noted, however, that other lossy and lossless encoding methods can be used to encode the difference, and are within the scope of this invention.
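Step D, and its reversal for display, can be sketched in a lossless form as follows. Each target block is stored as a record holding the relative location of its best matching base block and the per-pixel difference. The record layout, the function names, the 4X4 block size and the search radius of 2 are assumptions for illustration, and the sketch assumes both images have the same dimensions, which are multiples of the block size.

```python
import random


def encode_image(target, base, size=4, radius=2):
    """Represent each target block as (top, left, dy, dx, residual)."""
    h, w = len(target), len(target[0])
    records = []
    for top in range(0, h, size):
        for left in range(0, w, size):
            best = None
            for dy in range(-radius, radius + 1):
                for dx in range(-radius, radius + 1):
                    bt, bl = top + dy, left + dx
                    if bt < 0 or bl < 0 or bt + size > h or bl + size > w:
                        continue  # displaced block would leave the base image
                    residual = [[target[top + i][left + j] - base[bt + i][bl + j]
                                 for j in range(size)] for i in range(size)]
                    cost = sum(abs(v) for row in residual for v in row)
                    if best is None or cost < best[0]:
                        best = (cost, dy, dx, residual)
            _, dy, dx, residual = best
            records.append((top, left, dy, dx, residual))
    return records


def decode_image(records, base, size=4):
    """Reverse the process: base block at the stored offset plus the residual."""
    out = [[0] * len(base[0]) for _ in range(len(base))]
    for top, left, dy, dx, residual in records:
        for i in range(size):
            for j in range(size):
                out[top + i][left + j] = base[top + dy + i][left + dx + j] + residual[i][j]
    return out


# Hypothetical 8x8 images: the target is the base shifted left by one column,
# with one pixel changed.
rng = random.Random(1)
base = [[rng.randrange(256) for _ in range(8)] for _ in range(8)]
target = [[base[y][min(x + 1, 7)] for x in range(8)] for y in range(8)]
target[6][6] = 255 - target[6][6]

records = encode_image(target, base)
restored = decode_image(records, base)
```

The first block is matched exactly by a displaced base block, so its residual is all zeros and would cost almost nothing to store after entropy coding. A lossy variant would transform and quantize each residual before storing it.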
When using motion compensation in MPEG, the difference between two images, which may be referred to as a prediction error, may be encoded in a manner similar to the JPEG technique (DCT, quantization, followed by entropy coding). Reference may be had, as an example, to a publication "MPEG Digital Video-Coding Standards", T. Sikora, IEEE Signal Processing Magazine, September 1997, pgs. 82-99. Briefly, a first frame in a video sequence is encoded in an intraframe coding mode (I-picture), and each subsequent frame is coded using interframe prediction (P-pictures), and only data from the nearest previously coded I-picture or P-picture is used for prediction. For coding P-pictures, the previously coded I- or P-picture frame N-1 is stored in a frame store (FS) in both the encoder and the decoder. Motion compensation is performed on a macroblock basis, and one motion vector is estimated between frame N and frame N-1 for a particular macroblock to be encoded. These motion vectors are coded and transmitted to the receiver. The motion-compensated prediction error is calculated by subtracting from each pel in a macroblock its motion-shifted counterpart in the previous frame. An 8 X 8 discrete cosine transform (DCT) is then applied to each of the 8 X 8 blocks contained in the macroblock, followed by quantization (Q) of the DCT coefficients with subsequent run-length coding and entropy coding (VLC).
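The transform stage applied to an 8 X 8 block of prediction error can be sketched as below. This is a simplified illustration and not the MPEG bitstream format: it uses an orthonormal DCT-II, a single uniform quantizer step instead of a quantization matrix, a zigzag scan, and a plain list of (zero run, value) pairs in place of variable-length codes. All function names and the step size are assumptions.

```python
import math

N = 8  # block size of the 8 X 8 DCT


def _scale(k):
    return math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N)


def _basis(x, u):
    return math.cos((2 * x + 1) * u * math.pi / (2 * N))


def dct2(block):
    """Orthonormal 2-D DCT-II; returns coeffs[u][v]."""
    return [[_scale(u) * _scale(v) * sum(block[x][y] * _basis(x, u) * _basis(y, v)
                                          for x in range(N) for y in range(N))
             for v in range(N)] for u in range(N)]


def idct2(coeffs):
    """Inverse of dct2; returns block[x][y]."""
    return [[sum(_scale(u) * _scale(v) * coeffs[u][v] * _basis(x, u) * _basis(y, v)
                 for u in range(N) for v in range(N))
             for y in range(N)] for x in range(N)]


def quantize(coeffs, step):
    return [[round(c / step) for c in row] for row in coeffs]


def dequantize(levels, step):
    return [[q * step for q in row] for row in levels]


# Zigzag scan order: anti-diagonals, alternating direction.
ZIGZAG = sorted(((u, v) for u in range(N) for v in range(N)),
                key=lambda p: (p[0] + p[1], p[0] if (p[0] + p[1]) % 2 else p[1]))


def run_length(values):
    """Encode as (number of preceding zeros, nonzero value) pairs plus an end marker."""
    pairs, run = [], 0
    for v in values:
        if v == 0:
            run += 1
        else:
            pairs.append((run, v))
            run = 0
    pairs.append("EOB")
    return pairs


# A flat residual block has only a DC coefficient after the transform.
flat = [[10] * N for _ in range(N)]
levels = quantize(dct2(flat), step=4)
symbols = run_length([levels[u][v] for u, v in ZIGZAG])
restored = idct2(dequantize(levels, step=4))
```

A larger quantizer step discards more detail and produces longer zero runs, which is where the size reduction comes from; a lossless variant would skip quantization and entropy-code the residual directly.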
Step E. Store the encoded difference obtained from Step D in the image memory 123 as a compressed or reduced image data file.
When it is desired to display the image B, the foregoing process is reversed to obtain the original image B.
Note that the reduced image B not only requires less storage space in the image memory 123, but the transfer of the reduced image B over the wireless link (either the cellular link or a local link (e.g., a Bluetooth link)) requires less bandwidth and can be achieved in a more rapid manner than would be the case with the uncompressed, original target image B. In this case the decompression operation can take place at the destination device or system, assuming that the parameters necessary for decompressing the image are also transferred. It is assumed in this case that the receiving device has a copy of the base image in order to decode the target image.
The foregoing procedure is illustrative of a suitable technique for compressing images with slight movements, and is similar to MPEG. For images that can be produced as a result of zooming in/out, one may resample the image with the higher resolution (the "zoomed-in" image) to represent part of the other image.
The foregoing description has provided by way of exemplary and non-limiting examples a full and informative description of the best method and apparatus presently contemplated by the inventors for carrying out the invention. However, various modifications and adaptations may become apparent to those skilled in the relevant arts in view of the foregoing description, when read in conjunction with the accompanying drawings and the appended claims. As but some examples, the use of other similar or equivalent image compression algorithms may be attempted by those skilled in the art. However, all such and similar modifications of the teachings of this invention will still fall within the scope of this invention.
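The zoom case can be sketched as follows, under strong simplifying assumptions that the text does not state: the zoom ratio is an integer factor, the position of the zoomed-in view within the wide image is already known, and box averaging is used as the resampling filter. The function names are hypothetical.

```python
def downsample(image, factor):
    """Box-average `image` by an integer `factor` in each direction."""
    height, width = len(image) // factor, len(image[0]) // factor
    area = factor * factor
    return [[sum(image[y * factor + i][x * factor + j]
                 for i in range(factor) for j in range(factor)) // area
             for x in range(width)] for y in range(height)]


def zoom_residual(wide, zoomed, factor, top, left):
    """Difference between a region of the wide image and the resampled zoomed-in image."""
    prediction = downsample(zoomed, factor)
    return [[wide[top + y][left + x] - prediction[y][x]
             for x in range(len(prediction[0]))] for y in range(len(prediction))]


# Hypothetical data: a 2x zoomed-in view covering the centre 4x4 region of a wide image.
zoomed_in = [[(3 * x + 5 * y) % 256 for x in range(8)] for y in range(8)]
wide = [[0] * 8 for _ in range(8)]
for y, row in enumerate(downsample(zoomed_in, 2)):
    for x, value in enumerate(row):
        wide[2 + y][2 + x] = value

residual = zoom_residual(wide, zoomed_in, 2, top=2, left=2)
```

In this idealized case the residual for the covered region is all zeros, so only the region outside the zoomed-in view would need to be stored in full. With real photographs the residual would be small rather than zero.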
Furthermore, some of the features of the present invention could be used to advantage without the corresponding use of other features. As such, the foregoing description should be considered as merely illustrative of the principles of the present invention, and not in limitation thereof.

Claims

What is claimed is:
1. A method to process image data, comprising:
for a plurality n of files each containing image data representing one of n images;
selecting one file as a base file;
selecting as a target file an image data file that is a contextually-related file to the base file;
comparing the target file and the base file to determine differences therebetween; and
storing the target file as a reduced file that is a representation of differences between the image data of the target file and the image data of the base file.
2. A method as in claim 1, where an image data file is selected as being contextually-related to the base file based on at least an image capture location.
3. A method as in claim 1, where an image data file is selected as being contextually-related to the base file based on at least an image capture time.
4. A method as in claim 1, where an image data file is selected as being contextually-related to the base file based on a user input.
5. A method as in claim 1, where an image data file is selected as being contextually-related to the base file based on information received from an image capture device other than an image capture device that generated the image data file.
6. A method as in claim 1, where storing is performed in a memory device that comprises a part of a wireless communications device.
7. A method as in claim 6, further comprising transmitting the reduced file to a destination using a wireless link.
8. A method as in claim 7, where the wireless link comprises a cellular communication channel.
9. A method as in claim 7, where the wireless link comprises a short range radio frequency (RF) or infrared (IR) communications link.
10. A method as in claim 1, where comparing comprises:
partitioning the target file into non-overlapping blocks of pixels;
for each block in the target file, finding a best matching block in the base file;
representing a block in the target file using a relative location of the best matching block in the base file and as a difference between the blocks; and
encoding the difference between the blocks.
11. A device to process image data, comprising a data processor coupled to an image memory for storing a plurality n of files each containing image data representing one of n images, said data processor operating under control of a stored program to select one file as a base file; to select as a target file an image data file that is a contextually-related file to the base file; to compare the target file and the base file to determine differences therebetween and to store the target file in the image memory as a reduced file that is a representation of differences between the image data of the target file and the image data of the base file.
12. A device as in claim 11, where said data processor selects an image data file to be contextually-related to the base file based on at least an image capture location.
13. A device as in claim 11, where said data processor selects an image data file to be contextually-related to the base file based on at least an image capture time.
14. A device as in claim 11, where said data processor selects an image data file to be contextually-related to the base file based on at least a user input.
15. A device as in claim 11, where said data processor selects an image data file to be contextually-related to the base file based on at least information received from an image capture device other than an image capture device that generated the image data file.
16. A device as in claim 11, where said image memory comprises a part of a wireless communications device.
17. A device as in claim 16, further comprising a transmitter to transmit the reduced file to a destination using a wireless link.
18. A device as in claim 17, where the wireless link comprises a cellular communication channel.
19. A device as in claim 7, where the wireless link comprises a short range radio frequency (RF) or infrared (IR) communications link.
20. A device as in claim 11, where said data processor operates, when comparing the target file and the base file, to partition the target file into non-overlapping blocks of pixels; to find a best matching block in the base file for each block in the target file; to represent a block in the target file using a relative location of the best matching block in the base file and as a difference between the blocks and to encode the difference between the blocks.
21. A camera phone, comprising:
a transceiver;
a controller coupled to said transceiver;
a digital image capture device coupled to an image storage memory for storing n image data files representing one of n images; and
an image processor coupled to said image storage memory and operable to select an image data file as a base file; to select as a target file an image data file that is contextually-related to the base file; to process the target file and the base file to determine differences therebetween and to store a processed target file in the image storage memory as a file of smaller size than the size of the target file.
22. A camera phone as in claim 21, where said image storage memory is detachable from said camera phone.
23. A camera phone as in claim 21, where said image processor selects an image data file to be contextually-related to the base file based on at least an image capture location.
24. A camera phone as in claim 23, where the image capture location is determined by the camera phone.
25. A camera phone as in claim 23, where the image capture location is determined external to said camera phone and is transmitted to said camera phone through said transceiver.
26. A camera phone as in claim 23, where the image capture location comprises an azimuthal pointing direction of said digital image capture device.
27. A camera phone as in claim 23, where the image capture location comprises an elevation angle of a pointing direction of said digital image capture device.
28. A camera phone as in claim 21, where said image processor selects an image data file to be contextually-related to the base file based on at least an image capture time.
29. A camera phone as in claim 21, where said image processor selects an image data file to be contextually-related to the base file based on at least a user input.
30. A camera phone as in claim 21, where said image processor selects an image data file to be contextually-related to the base file based on at least information received from another camera phone.
31. A camera phone as in claim 21, where said processed target file is transmitted from said camera phone through said transceiver.
32. A camera phone as in claim 31, where said transceiver comprises a radio frequency cellular communication transceiver.
33. A camera phone as in claim 31, where said transceiver comprises one of a short range radio frequency or infrared communications transceiver.
34. A computer program stored on a computer-readable medium and comprising computer-executable instructions responsive to n image data files representing one of n images to select an image data file as a base file; to select as a target file an image data file that is contextually-related to the base file; to process the target file and the base file to determine differences therebetween and to store a processed target file as a file of smaller size than the size of the target file.
35. A computer program as in claim 34, where an image data file is selected to be contextually-related to the base file based on at least an image capture location.
36. A computer program as in claim 35, where the image capture location comprises an azimuthal pointing direction of an image capture device.
37. A computer program as in claim 35, where the image capture location comprises an elevation angle of a pointing direction of an image capture device.
38. A computer program as in claim 34, where an image data file is selected to be contextually-related to the base file based on at least an image capture time.
39. A computer program as in claim 34, where an image data file is selected to be contextually-related to the base file based on at least a user input.
40. A computer program as in claim 34, where said computer program is executed by a data processor that comprises a part of a wireless communications device that includes a digital image capture device.
PCT/IB2005/001184 2004-05-05 2005-05-02 Method and apparatus to provide efficient multimedia content storage WO2005107234A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP05740451A EP1745639A1 (en) 2004-05-05 2005-05-02 Method and apparatus to provide efficient multimedia content storage
CN2005800202407A CN1973529B (en) 2004-05-05 2005-05-02 Method and apparatus to provide efficient multimedia content storage

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/839,672 US7805024B2 (en) 2004-05-05 2004-05-05 Method and apparatus to provide efficient multimedia content storage
US10/839,672 2004-05-05

Publications (2)

Publication Number Publication Date
WO2005107234A1 true WO2005107234A1 (en) 2005-11-10
WO2005107234B1 WO2005107234B1 (en) 2006-02-16

Family

ID=35242037

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2005/001184 WO2005107234A1 (en) 2004-05-05 2005-05-02 Method and apparatus to provide efficient multimedia content storage

Country Status (4)

Country Link
US (1) US7805024B2 (en)
EP (1) EP1745639A1 (en)
CN (1) CN1973529B (en)
WO (1) WO2005107234A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006134509A2 (en) * 2005-06-15 2006-12-21 Koninklijke Philips Electronics N.V. Method and apparatus for storing image data files
KR100775217B1 (en) 2006-06-01 2007-11-12 (주) 엘지텔레콤 Still picture encoding method based on similarity in mobile camera

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7593057B2 (en) * 2004-07-28 2009-09-22 Microsoft Corp. Multi-view integrated camera system with housing
US7756193B2 (en) * 2006-09-21 2010-07-13 Broadcom Corporation Time divided pilot channel detection processing in WCDMA terminal having shared memory
WO2012035371A1 (en) * 2010-09-14 2012-03-22 Nokia Corporation A multi frame image processing apparatus
US8655085B2 (en) 2010-10-28 2014-02-18 Microsoft Corporation Burst mode image compression and decompression
US8811756B2 (en) * 2011-07-11 2014-08-19 International Business Machines Corporation Image compression
US10083618B2 (en) * 2012-08-21 2018-09-25 Jacob UKELSON System and method for crowd sourced multi-media lecture capture, sharing and playback
US11317123B2 (en) 2013-04-25 2022-04-26 Vmware, Inc. Systems and methods for using pre-calculated block hashes for image block matching
US20140369413A1 (en) * 2013-06-18 2014-12-18 Vmware, Inc. Systems and methods for compressing video data using image block matching
CN105684035B (en) * 2013-09-16 2019-08-20 英特尔公司 Grouping and compressing similar photos
WO2018023557A1 (en) * 2016-08-04 2018-02-08 Zte Corporation Method and device for storing and loading, including index, restore and display, data related to multiple pictures
US10803593B2 (en) 2016-09-19 2020-10-13 Siemens Healthcare Gmbh Method and system for image compression
CN108241645B (en) * 2016-12-23 2020-03-17 腾讯科技(深圳)有限公司 Image processing method and device
US10809869B2 (en) 2017-09-09 2020-10-20 Apple Inc. Layered image compression
US20210373789A1 (en) * 2020-05-29 2021-12-02 Western Digital Technologies, Inc. Storage System, Host, and Method for Optimizing Storage of a Sequence of Images

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6148031A (en) * 1996-11-27 2000-11-14 Canon Kabushiki Kaisha Image processing apparatus and method
EP1209619A2 (en) * 2000-11-28 2002-05-29 Monolith Co., Ltd. Image interpolating method and apparatus
US20020075310A1 (en) * 2000-12-20 2002-06-20 Prabhu Prasad V. Graphical user interface adapted to allow scene content annotation of groups of pictures in a picture database to promote efficient database browsing
US6625319B1 (en) * 1999-03-30 2003-09-23 Koninklijke Philips Electronics N.V. Image compression using content-based image similarity

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5687095A (en) * 1994-11-01 1997-11-11 Lucent Technologies Inc. Video transmission rate matching for multimedia communication systems
JPH09121358A (en) * 1995-10-25 1997-05-06 Matsushita Electric Ind Co Ltd Picture coding/decoding device and its method
EP0920215A4 (en) * 1997-06-16 2005-09-21 Sony Corp Image processing device and method, and transmission medium, transmission method and image format
US6185314B1 (en) * 1997-06-19 2001-02-06 Ncr Corporation System and method for matching image information to object model information
US6177959B1 (en) * 1997-12-31 2001-01-23 Telecruz Technology, Inc. Circuit and method for generating a clock signal synchronized with time reference signals associated with television signals
US6285995B1 (en) * 1998-06-22 2001-09-04 U.S. Philips Corporation Image retrieval system using a query image
ATE510258T1 (en) * 1999-01-29 2011-06-15 Lg Electronics Inc METHOD FOR SEARCHING OR BROWSING MULTIMEDIA DATA
US6813395B1 (en) * 1999-07-14 2004-11-02 Fuji Photo Film Co., Ltd. Image searching method and image processing method
JP2001290820A (en) * 2000-01-31 2001-10-19 Mitsubishi Electric Corp Video gathering device, video retrieval device, and video gathering and retrieval system
US6914626B2 (en) * 2000-02-21 2005-07-05 Hewlett Packard Development Company, L.P. Location-informed camera
FR2807852B1 (en) * 2000-04-17 2004-10-22 Canon Kk METHODS AND DEVICES FOR INDEXING AND SEARCHING FOR DIGITAL IMAGES TAKING INTO ACCOUNT THE SPATIAL DISTRIBUTION OF IMAGE CONTENT
EP1170953A3 (en) * 2000-07-03 2002-07-10 Pioneer Corporation Portable telephone, remote monitoring system, portable information terminal, and method for using the same
JP2002027145A (en) * 2000-07-05 2002-01-25 Toshiba Corp Radio communication terminal
US7016532B2 (en) * 2000-11-06 2006-03-21 Evryx Technologies Image capture and identification system and process
US20050113113A1 (en) * 2001-11-15 2005-05-26 Reed Mark J. Enhanced wireless phone
US7872669B2 (en) * 2004-01-22 2011-01-18 Massachusetts Institute Of Technology Photo-based mobile deixis system and related techniques
US7289806B2 (en) * 2004-03-30 2007-10-30 Intel Corporation Method and apparatus for context enabled search
US7376265B2 (en) * 2004-06-17 2008-05-20 Seiko Epson Corporation Segmentation-based hybrid compression scheme for scanned documents

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2006134509A2 (en) * 2005-06-15 2006-12-21 Koninklijke Philips Electronics N.V. Method and apparatus for storing image data files
WO2006134509A3 (en) * 2005-06-15 2007-03-01 Koninkl Philips Electronics Nv Method and apparatus for storing image data files
KR100775217B1 (en) 2006-06-01 2007-11-12 (주) 엘지텔레콤 Still picture encoding method based on similarity in mobile camera

Also Published As

Publication number Publication date
WO2005107234B1 (en) 2006-02-16
CN1973529B (en) 2010-10-13
EP1745639A1 (en) 2007-01-24
US7805024B2 (en) 2010-09-28
US20050262543A1 (en) 2005-11-24
CN1973529A (en) 2007-05-30

Similar Documents

Publication Publication Date Title
WO2005107234A1 (en) Method and apparatus to provide efficient multimedia content storage
US10855984B2 (en) Image processing apparatus and method
US10587899B2 (en) Image processing device and method
US10237558B2 (en) Encoder, decoder, encoding method, and decoding method
RU2498523C2 (en) Fast macroblock delta quantisation parameter decision
US8019169B2 (en) Image coding apparatus, image decoding apparatus, image processing apparatus and methods thereof
KR100703283B1 (en) Image encoding apparatus and method for estimating motion using rotation matching
US20070286281A1 (en) Picture Information Encoding Apparatus and Picture Information Encoding Method
US20110164684A1 (en) Image processing apparatus and method
JP4641892B2 (en) Moving picture encoding apparatus, method, and program
US20150172700A1 (en) Moving picture coding apparatus and moving picture decoding apparatus
US20020009143A1 (en) Bandwidth scaling of a compressed video stream
US20050207496A1 (en) Moving picture coding apparatus
WO2006058113A1 (en) Rate control techniques for video encoding using parametric equations
US20110170605A1 (en) Image processing apparatus and image processing method
US20050147375A1 (en) Moving picture coding method and moving picture decoding method
US20120288006A1 (en) Apparatus and method for image processing
US20120288004A1 (en) Image processing apparatus and image processing method
JP2010063092A (en) Image coding apparatus, image coding method, image coding integrated circuit and camera
US20050111551A1 (en) Data processing apparatus and method and encoding device of same
US20130058416A1 (en) Image processing apparatus and method
EP3531700A1 (en) Image coding method, transmission method and image coding device
US20190394478A1 (en) Encoder, decoder, encoding method, and decoding method
JP2005167721A (en) Method and device for data processing, and encoder
US11375214B2 (en) Encoder, decoder, encoding method, and decoding method

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 EP: The EPO has been informed by WIPO that EP was designated in this application
B Later publication of amended claims

Effective date: 20051108

NENP Non-entry into the national phase

Ref country code: DE

WWW WIPO information: withdrawn in national office

Country of ref document: DE

WWE WIPO information: entry into national phase

Ref document number: 2005740451

Country of ref document: EP

WWE WIPO information: entry into national phase

Ref document number: 200580020240.7

Country of ref document: CN

WWP WIPO information: published in national office

Ref document number: 2005740451

Country of ref document: EP