WO2001031569A1 - Automatic bit-rate controlled encoding and decoding of digital images - Google Patents

Automatic bit-rate controlled encoding and decoding of digital images

Info

Publication number
WO2001031569A1
Authority
WO
WIPO (PCT)
Prior art keywords
blocks
digital image
quantized
frequency blocks
bit stream
Prior art date
Application number
PCT/US2000/029378
Other languages
French (fr)
Inventor
Johnny K. Wang
Original Assignee
Divio, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Divio, Inc.
Priority to AU13432/01A
Publication of WO2001031569A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00 Image coding
    • G06T9/007 Transform coding, e.g. discrete cosine transform

Abstract

New and improved apparatus (200) and method for encoding and decoding (400) of a series of digital images. The throughput bit rate of conventional systems varies depending upon the complexity of the images. Applying apparatus (200) and methods described herein, encoding and decoding (400) of a series of digital images may be adjusted in an automated manner to achieve a constant throughput bit rate.

Description

AUTOMATIC BIT-RATE CONTROLLED ENCODING AND DECODING OF DIGITAL IMAGES
CROSS-REFERENCES TO RELATED APPLICATIONS
This application claims priority from U.S. Provisional Patent Application No. 60/077,076, filed March 6, 1998, the full disclosure of which is incorporated herein by reference. This application also claims priority from U.S. Patent Application No. 09/263,589, filed March 5, 1999, the full disclosure of which is incorporated herein by reference.
BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates to the field of image processing. More specifically, the present invention relates to a method and apparatus for automatically adjusting the rate at which a bit stream is processed.
2. Description of the Related Art
This technique is suitable for widely used image compression standards that integrate various algorithms into a compression system. Such image compression standards include video compression standards, such as the digital video standards specified by the Moving Picture Experts Group (the MPEG standards), by the Joint Photographic Experts Group (the Motion JPEG standard), and by the Digital VCR Conference (the "Blue Book" or DV standard), all of which standards are included herein by reference.
The MPEG, JPEG, and DV standards are distinct compression formats based on discrete cosine transform (DCT) technology. The MPEG standard (in particular, MPEG-2) is currently popular as a distribution format for satellite, cable, terrestrial broadcasting and digital video disc (DVD). The MPEG format uses a compression algorithm that utilizes both intraframe and interframe compression. The JPEG standard is most popularly used as a format for still images, but it may also be applied to video (Motion JPEG). The JPEG format uses only intraframe compression. The DV standard is currently gaining popularity as an acquisition format for consumer digital camcorders, as well as professional digital cameras and post-production editing systems. Like the JPEG format, the DV format uses only intraframe compression.
The intraframe compression or encoding process in accordance with the above standards typically proceeds according to the process shown in Fig. 1. Fig. 1 is a flow diagram showing a conventional digital image encoding process (100). In a first step (102), the encoder constructs $N \times N$ (typically 8x8) blocks from a digital image input into the encoder. The result is a matrix of image blocks $i(n_1,n_2)$, where $n_1 = 1,2,3,\ldots,N_1$; $n_2 = 1,2,3,\ldots,N_2$; $N_1$ = the number of rows; and $N_2$ = the number of columns. In a second step (104), the encoder applies a forward transform (typically a cosine related transform) to the matrix of image blocks to generate a matrix of transformed blocks $I(k_1,k_2)$. In a third step (106), the encoder divides the matrix of transformed blocks $I(k_1,k_2)$ by a predetermined, fixed quantization table or matrix $Q(k_1,k_2)$, and subsequently quantizes the result to produce a quantized transformed matrix $S(k_1,k_2)$ which comprises a matrix of symbols. Finally, in a fourth step (108), the encoder applies symbol encoding to transform the matrix of symbols $S(k_1,k_2)$ to an output bit stream.
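Steps 104 and 106 of the conventional process can be sketched in a few lines. This is a minimal illustration, not the encoder of any particular standard: it assumes the orthonormal 2-D DCT-II as the "cosine related transform", plain rounding as the quantizer, and a made-up uniform table (`flat_table`); the names `dct_2d` and `quantize` are hypothetical.

```python
import math

N = 8  # block size; the text says N x N blocks are typically 8x8


def dct_2d(block):
    """Orthonormal 2-D DCT-II of an N x N block (assumed transform for step 104)."""
    def alpha(k):
        return math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N)

    out = [[0.0] * N for _ in range(N)]
    for k1 in range(N):
        for k2 in range(N):
            total = 0.0
            for n1 in range(N):
                for n2 in range(N):
                    total += (block[n1][n2]
                              * math.cos((2 * n1 + 1) * k1 * math.pi / (2 * N))
                              * math.cos((2 * n2 + 1) * k2 * math.pi / (2 * N)))
            out[k1][k2] = alpha(k1) * alpha(k2) * total
    return out


def quantize(coeffs, table):
    """Step 106: divide each coefficient by its table entry, round to an integer symbol."""
    return [[int(round(coeffs[i][j] / table[i][j])) for j in range(N)]
            for i in range(N)]


flat_block = [[100] * N for _ in range(N)]  # a uniform gray block
flat_table = [[16] * N for _ in range(N)]   # hypothetical fixed table, every entry 16
symbols = quantize(dct_2d(flat_block), flat_table)
```

For the uniform block only the DC term survives, so `symbols` holds a single nonzero entry at position (0, 0); busier blocks leave more nonzero symbols, which is why the bit rate of a fixed-table encoder tracks image complexity.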
Fig. 3 is a flow diagram showing a conventional digital image decoding process (300). In a first step (302), the decoder receives an input bit stream and applies symbol decoding to regenerate the symbol matrix representing the quantized transformed matrix $S(k_1,k_2)$. In a second step (304), the decoder multiplies the symbol matrix $S(k_1,k_2)$ by the same predetermined, fixed quantization matrix $Q(k_1,k_2)$ to produce a transformed matrix $I'(k_1,k_2)$. This transformed matrix $I'(k_1,k_2)$ is an approximation of the original transformed matrix $I(k_1,k_2)$, some fidelity having been permanently lost due to the quantization step (106) of the encoding process (100). In a third step (306), an inverse transform is applied to generate an image block matrix $i'(n_1,n_2)$. The image block matrix $i'(n_1,n_2)$ is an approximation of the original image block matrix $i(n_1,n_2)$. Finally, in a fourth step (308), the blocks are merged to form an output digital image which is an approximation of the original image.
SUMMARY OF THE INVENTION
The present invention provides new and improved apparatus and methods for encoding and decoding of a series of digital images. The throughput bit rate of conventional systems varies depending upon the complexity of the images. Applying apparatus and methods of the present invention, encoding and decoding of a series of digital images may be adjusted in an automated manner to achieve a constant throughput bit rate.
For further understanding of the nature and advantages of the present invention, together with other embodiments, reference should be made to the ensuing detailed description taken in conjunction with the accompanying drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
Fig. 1 is a flow diagram showing a conventional digital image encoding process (100).
Fig. 2 is a flow diagram showing a digital image encoding process (200) in accordance with a preferred embodiment of the present invention.
Fig. 3 is a flow diagram showing a conventional digital image decoding process (300).
Fig. 4 is a flow diagram showing a digital image decoding process (400) in accordance with a preferred embodiment of the present invention.
DESCRIPTION OF THE SPECIFIC EMBODIMENTS
Fig. 2 is a flow diagram showing a digital image encoding process (200) in accordance with a preferred embodiment of the present invention. The process (200) is similar to the conventional encoding process (100) in that the first two steps (102 and 104) are the same. However, thereafter the processes differ substantially.
In accordance with a preferred embodiment of the present invention, the third step (202) involves receiving a current quantization matrix or table $Q(k_1,k_2)$. The current quantization matrix $Q(k_1,k_2)$ is not predetermined and fixed, as it is in the conventional process (100). The current quantization matrix $Q(k_1,k_2)$ is used to divide the transformed matrix $I(k_1,k_2)$, and the result is quantized to generate the quantized transformed matrix of symbols $S(k_1,k_2)$.
In a fourth step (204), symbol encoding is applied to generate an output bit stream. However, in addition to outputting the bit stream, the current quantization factor $q_f$ is also output with the bit stream. Furthermore, the current bit rate for the bit stream is fed back to the encoder.
In a fifth step (206), the encoder utilizes the fed back bit rate to determine the next quantization factor $q_f$ by adjustment to the current quantization factor. In a sixth step (208), this next quantization factor $q_f$ is used to determine the next quantization table or matrix $Q(k_1,k_2)$. This next quantization matrix $Q(k_1,k_2)$ becomes the current quantization matrix when it is fed into the third step (202) and used to divide the next transformed matrix $I(k_1,k_2)$.
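The data flow of steps 202 through 208 can be sketched as a loop over the image series. This is only a skeleton under stated assumptions: the three callables are hypothetical stand-ins supplied by the caller, the rate is measured as the bit count of each coded image, and the toy lambdas in the demo exist solely to make the loop runnable.

```python
def encode_series(frames, encode_frame, next_factor, table_from_factor, q_start):
    """Run the feedback loop of Fig. 2 over a series of frames.

    All callables are hypothetical stand-ins:
      encode_frame(frame, table) -> bytes   (steps 102, 104, 202, 204)
      next_factor(q_f, bit_count) -> q_f    (step 206)
      table_from_factor(q_f) -> table       (step 208)
    """
    q_f = q_start
    table = table_from_factor(q_f)
    output = []
    for frame in frames:
        payload = encode_frame(frame, table)
        output.append((q_f, payload))      # q_f is output along with the bit stream
        bit_count = 8 * len(payload)       # measured rate, fed back to the encoder
        q_f = next_factor(q_f, bit_count)  # step 206: adjust the current factor
        table = table_from_factor(q_f)     # step 208: becomes the current table
    return output


# Toy stand-ins, purely for illustration: coarser tables keep fewer bytes.
demo = encode_series(
    frames=[b"aaaa", b"bbbbbbbb"],
    encode_frame=lambda frame, table: frame[: max(1, len(frame) // table)],
    next_factor=lambda q_f, bits: q_f * 2 if bits > 16 else q_f,
    table_from_factor=lambda q_f: q_f,
    q_start=1,
)
```

The point of the skeleton is the ordering: each image is coded with the table derived from the previous image's measured rate, so the correction always lags by one image.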
In a preferred embodiment of the present invention, the next quantization factor $q_f$ is determined using the following equation.
[Equation image imgf000005_0001: $q_f$ expressed in terms of $R_{target}$, $R_{previous}$, $q_{previous}$, and the exponent $b$]
where $R_{target}$ is the target compression ratio set for the automated process, $R_{previous}$ is the previous compression ratio, $q_{previous}$ is the previous quantization factor, and $b = 1/a = 1/0.234 = 4.27$. Compression ratios are reciprocals of bit rates.
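The equation itself appears only as an image in this copy, so the sketch below rests on an assumption rather than on the source: it supposes a power-law model in which the compression ratio varies as the quantization factor raised to the power a, which gives an update of the form q_previous * (R_target / R_previous) ** b. The function names are hypothetical, and the formula should be checked against the original figure before being relied upon.

```python
A = 0.234     # constant a from the text
B = 1.0 / A   # b = 1/a, about 4.27


def compression_ratio(raw_bits, coded_bits):
    """Uncompressed size over coded size; it falls as the coded bit rate rises."""
    return raw_bits / coded_bits


def next_quantization_factor(q_previous, r_previous, r_target, b=B):
    """ASSUMED form: q_next = q_previous * (r_target / r_previous) ** b.

    This is one reading consistent with the listed symbols and with b = 1/a;
    the source equation is an unrecovered image.
    """
    return q_previous * (r_target / r_previous) ** b


unchanged = next_quantization_factor(64, 10.0, 10.0)  # already on target
raised = next_quantization_factor(64, 10.0, 11.0)     # need 10% more compression
```

Under this assumed form, a target ratio 10% above the previous one multiplies the factor by roughly 1.5, so small rate errors produce fairly large corrections; an implementation would likely want to clamp the result to a valid range.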
In a preferred embodiment of the present invention, the next quantization matrix $Q(k_1,k_2)$ is determined using the following equations.
$$Q(k_1,k_2) = \frac{Q_{fixed}(k_1,k_2) \cdot q_f}{64} + 0.5$$
where $Q_{fixed}(k_1,k_2)$ may be, for example, a standard quantization table from the JPEG specification.
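Reading the table equation as Q = Q_fixed * q_f / 64 + 0.5, the scaling can be sketched as follows. The integer truncation and the floor of 1 (to avoid dividing by zero) are assumptions, since the text does not say how fractional entries are handled; `scale_table` is a hypothetical name. The sample row is the first row of the example luminance table in Annex K of the JPEG specification.

```python
def scale_table(q_fixed, q_f):
    """Scale a base table by q_f / 64, add 0.5, and truncate to an integer.

    Truncation and the minimum entry of 1 are assumptions, not from the text.
    """
    return [[max(1, int(entry * q_f / 64 + 0.5)) for entry in row]
            for row in q_fixed]


jpeg_luma_row0 = [[16, 11, 10, 16, 24, 40, 51, 61]]

same = scale_table(jpeg_luma_row0, 64)    # q_f = 64 leaves the table unchanged
finer = scale_table(jpeg_luma_row0, 32)   # smaller q_f: finer quantization
coarser = scale_table(jpeg_luma_row0, 128)
```

With truncation, the added 0.5 acts as round-to-nearest, and q_f = 64 is the neutral setting that reproduces the base table.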
Fig. 4 is a flow diagram showing a digital image decoding process (400) in accordance with a preferred embodiment of the present invention. The process (400) is similar to the conventional decoding process (300) in that the basic steps are the same. However, the processes differ in important aspects. Like the conventional process (300), in a first step (302) symbol decoding is applied to an input bit stream to generate a matrix of symbols $S(k_1,k_2)$. However, here, in a second step (406) the symbol matrix is multiplied by a current quantization table $Q(k_1,k_2)$ rather than a predetermined, fixed quantization table. The current quantization table $Q(k_1,k_2)$ is derived as follows.
First, in parallel to the first step (302), in a first additional step (402), a quantization factor $q_f$ is extracted from the input bit stream, the quantization factor $q_f$ having been output along with the rest of the bit stream in the fourth step (204) of the encoding process in Fig. 2. In a second additional step (404), this quantization factor $q_f$ is used to determine the current quantization table $Q(k_1,k_2)$ for application in the second step (406) discussed above.
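The decoder-side steps 402, 404 and 406 can be sketched as below. The bit stream layout is an assumption made for illustration: the text says only that the quantization factor travels with the bit stream, so the 2-byte big-endian prefix, the function names, and the integer truncation in the table step are all hypothetical.

```python
import struct


def pack_stream(q_f, payload):
    """Encoder side: prefix the coded payload with q_f (hypothetical 2-byte header)."""
    return struct.pack(">H", q_f) + payload


def unpack_stream(stream):
    """Step 402: extract q_f; the remaining bytes go on to symbol decoding."""
    (q_f,) = struct.unpack(">H", stream[:2])
    return q_f, stream[2:]


def table_from_factor(q_fixed, q_f):
    """Step 404: rebuild the current table from q_f (assumed integer truncation)."""
    return [[int(entry * q_f / 64 + 0.5) for entry in row] for row in q_fixed]


def dequantize(symbols, table):
    """Step 406: multiply each symbol by the matching table entry."""
    return [[s * t for s, t in zip(srow, trow)]
            for srow, trow in zip(symbols, table)]


q_f, rest = unpack_stream(pack_stream(32, b"\x01\x02"))
table = table_from_factor([[16, 11]], q_f)
coeffs = dequantize([[50, -3]], table)
```

Because the decoder recomputes the table from the transmitted factor with the same rule as the encoder, only one small number per image needs to be sent instead of a full table.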
As will be understood by those with ordinary skill in the art, the present invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof.

Claims

WHAT IS CLAIMED IS:
1. A method for lossy compression of a series of digital images, the method comprising: receiving frequency blocks for a digital image, the frequency blocks including frequency data from a digital image; dividing the frequency blocks by a quantization table in a quantized manner to generate thresholded and quantized frequency blocks; symbol encoding the thresholded and quantized frequency blocks to generate a bit stream; calculating a bit rate for the bit stream; calculating a quantization factor using the bit rate; recalculating the quantization table using the quantization factor; and repeating the method for a next digital image.
2. The method of claim 1, further comprising, prior to receiving the frequency blocks: receiving the digital image; constructing intensity blocks from the digital image, the intensity blocks including pixel intensity data from the digital image; and forward transforming the intensity blocks to generate the frequency blocks.
3. The method of claim 2, wherein the pixel intensity data comprises data from a group of data including luminance data and chrominance data.
4. The method of claim 2, wherein the forward transforming comprises DCT transforming.
5. The method of claim 1, further comprising: inserting data representing the quantization factor into the bit stream; and outputting the bit stream.
6. A method for decompression of a series of digital images, the method comprising: receiving a bit stream; extracting data representing a quantization factor from the bit stream; calculating a quantization table using the quantization factor; symbol decoding the bit stream to generate thresholded and quantized frequency blocks; and multiplying the thresholded and quantized frequency blocks by the quantization table to generate quantized frequency blocks, the quantized frequency blocks including quantized frequency data from a digital image.
7. The method of claim 6, further comprising: inverse transforming the quantized frequency blocks to generate quantized intensity blocks; and merging the quantized intensity blocks to generate a reconstructed digital image.
8. An encoding apparatus operating with the method of claim 1.
9. A decoding apparatus operating with the method of claim 6.
PCT/US2000/029378 1999-10-25 2000-10-25 Automatic bit-rate controlled encoding and decoding of digital images WO2001031569A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU13432/01A AU1343201A (en) 1999-10-25 2000-10-25 Automatic bit-rate controlled encoding and decoding of digital images

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/426,407 US6442299B1 (en) 1998-03-06 1999-10-25 Automatic bit-rate controlled encoding and decoding of digital images
US09/426,407 1999-10-25

Publications (1)

Publication Number Publication Date
WO2001031569A1 (en) 2001-05-03

Family

ID=23690686

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/029378 WO2001031569A1 (en) 1999-10-25 2000-10-25 Automatic bit-rate controlled encoding and decoding of digital images

Country Status (3)

Country Link
US (1) US6442299B1 (en)
AU (1) AU1343201A (en)
WO (1) WO2001031569A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5263100A (en) * 1991-07-19 1993-11-16 Samsung Electronics Co., Ltd. Image coding method and apparatus
US5301242A (en) * 1991-05-24 1994-04-05 International Business Machines Corporation Apparatus and method for motion video encoding employing an adaptive quantizer
US5905534A (en) * 1993-07-12 1999-05-18 Sony Corporation Picture decoding and encoding method and apparatus for controlling processing speeds

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4302775A (en) 1978-12-15 1981-11-24 Compression Labs, Inc. Digital video compression system and methods utilizing scene adaptive coding with rate buffer feedback
US5341318A (en) 1990-03-14 1994-08-23 C-Cube Microsystems, Inc. System for compression and decompression of video data using discrete cosine transform and coding techniques
EP0735772A3 (en) 1995-03-27 1998-04-01 Hewlett-Packard Company Method for selecting JPEG quantization tables for low bandwidth applications

Also Published As

Publication number Publication date
AU1343201A (en) 2001-05-08
US20020048320A1 (en) 2002-04-25
US6442299B1 (en) 2002-08-27

Similar Documents

Publication Publication Date Title
US5881180A (en) Method and apparatus for the reduction of blocking effects in images
US6058141A (en) Varied frame rate video
US7620261B2 (en) Edge adaptive filtering system for reducing artifacts and method
KR100253931B1 (en) Approximate mpeg decoder with compressed reference frames
US20030007567A1 (en) Method and apparatus for real-time editing of plural content streams
US8411741B2 (en) Picture processing apparatus, picture processing method, picture processing program and recording medium
US20050129130A1 (en) Color space coding framework
US6185253B1 (en) Perceptual compression and robust bit-rate control system
US6865229B1 (en) Method and apparatus for reducing the “blocky picture” effect in MPEG decoded images
JP2002135783A (en) Processing method of variable bit rate for streaming service
US6633673B1 (en) Fast fade operation on MPEG video or other compressed data
JPH08289294A (en) Animation image compression system by adaptive quantization
CN113612898B (en) Robust covert communication device for resisting JPEG image downsampling
US5353060A (en) Process and device for the transformation of image data
US6442299B1 (en) Automatic bit-rate controlled encoding and decoding of digital images
US7656949B1 (en) Methods and apparatus for performing efficient inverse transform operations
Strachan et al. Video compression
US20050129111A1 (en) Transform-domain video editing
JP3469866B2 (en) Method for changing the bit rate of a data stream of an encoded video picture
EP1110408B1 (en) Compression and decompression system for digital video signals
WO2000070878A1 (en) A system and method for performing inverse quantization of a video stream
JPH10336650A (en) Frequency demultiplier for digital signal and frequency demultiplying method for digital signal
JPH06113140A (en) Picture processor
Jeong et al. Development of an experimental full-digital HDTV system: algorithm and implementation
KR100335435B1 (en) Compression coder and / or decoder of video signal and method thereof

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

32PN Ep: public notification in the ep bulletin as address of the addressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 69(1) EPC

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP