WO2001031569A1 - Automatic bit-rate controlled encoding and decoding of digital images

Info

Publication number: WO2001031569A1
Application number: PCT/US2000/029378
Authority: WO
Inventors: Johnny K. Wang
Original assignee: Divio, Inc.
Priority date: 1999-10-25
Filing date: 2000-10-25
Publication date: 2001-05-03
Also published as: AU1343201A; US20020048320A1; US6442299B1

Abstract

New and improved apparatus (200) and method for encoding and decoding (400) of a series of digital images. The throughput bit rate of conventional systems varies depending upon the complexity of the images. Applying apparatus (200) and methods described herein, encoding and decoding (400) of a series of digital images may be adjusted in an automated manner to achieve a constant throughput bit rate.

Description

AUTOMATIC BIT-RATE CONTROLLED ENCODING AND DECODING OF DIGITAL IMAGES

CROSS-REFERENCES TO RELATED APPLICATIONS

This application claims priority from U.S. Provisional Patent Application No. 60/077,076, filed March 6, 1998, the full disclosure of which is incorporated herein by reference. This application also claims priority from U.S. Patent Application No. 09/263,589, filed March 5, 1999, the full disclosure of which is incorporated herein by reference.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to the field of image processing. More specifically, the present invention relates to a method and apparatus for automatically adjusting the rate at which a bit stream is processed.

2. Description of the Related Art

This technique is suitable for widely-used image compression standards that integrate various algorithms into a compression system. Such image compression standards include video compression standards, such as the digital video standards specified by the Moving Picture Experts Group (the MPEG standards), by the Joint Photographic Experts Group (the Motion JPEG standard), and by the Digital VCR Conference (the "Blue Book" or DV standard), all of which standards are included herein by reference.

The MPEG, JPEG, and DV standards are distinct compression formats based on discrete cosine transform (DCT) technology. The MPEG (in particular, MPEG-2) standard is currently popular as a distribution format for satellite, cable, terrestrial broadcasting and digital video disc (DVD). The MPEG format uses a compression algorithm that utilizes both intraframe and interframe compression. The JPEG standard is most popularly used as a format for still images, but it may also be applied to video (Motion JPEG). The JPEG format uses only intraframe compression. The DV standard is currently gaining popularity as an acquisition format for consumer digital camcorders, as well as professional digital cameras and post-production editing systems. Like the JPEG format, the DV format uses only intraframe compression.

The intraframe compression or encoding process in accordance with the above standards typically proceeds according to the process shown in Fig. 1. Fig. 1 is a flow diagram showing a conventional digital image encoding process (100). In a first step (102), the encoder constructs N×N (typically 8×8) blocks from a digital image input into the encoder. The result is a matrix of image blocks i(n₁,n₂), where n₁ = 1,2,3,...,N₁; n₂ = 1,2,3,...,N₂; N₁ = the number of rows; and N₂ = the number of columns. In a second step (104), the encoder applies a forward transform (typically a cosine related transform) to the matrix of image blocks to generate a matrix of transformed blocks I(k₁,k₂). In a third step (106), the encoder divides the matrix of transformed blocks I(k₁,k₂) by a predetermined, fixed quantization table or matrix Q(k₁,k₂), and subsequently quantizes the result to produce a quantized transformed matrix S(k₁,k₂) which comprises a matrix of symbols. Finally, in a fourth step (108), the encoder applies symbol encoding to transform the matrix of symbols S(k₁,k₂) to an output bit stream.
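By way of illustration only, the block construction of step (102) and the divide-and-quantize operation of step (106) might be sketched as follows. The function names are hypothetical, the forward transform of step (104) is omitted, and rounding to the nearest integer is an assumption, since the text above says only that the result is quantized.

```python
import math

def split_into_blocks(image, n=8):
    """Step 102: cut a 2-D image (a list of rows) into a grid of n x n blocks.

    Assumes both image dimensions are exact multiples of n.
    """
    return [
        [[row[c:c + n] for row in image[r:r + n]]
         for c in range(0, len(image[0]), n)]
        for r in range(0, len(image), n)
    ]

def quantize_block(transformed, qtable):
    """Step 106: divide each coefficient by its table entry, then quantize.

    Rounding to nearest (floor(x + 0.5)) is an assumption of this sketch.
    """
    return [
        [math.floor(coef / q + 0.5) for coef, q in zip(row, qrow)]
        for row, qrow in zip(transformed, qtable)
    ]

# Toy data: a 4x4 image cut into 2x2 blocks, and one 2x2 transformed block.
image = [[r * 4 + c for c in range(4)] for r in range(4)]
blocks = split_into_blocks(image, n=2)
symbols = quantize_block([[80.0, -33.0], [12.0, 3.0]], [[16, 11], [12, 14]])
```

The small block size is used only to keep the example readable; the same functions apply unchanged to 8×8 blocks.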

Fig. 3 is a flow diagram showing a conventional digital image decoding process (300). In a first step (302), the decoder receives an input bit stream and applies symbol decoding to regenerate the symbol matrix representing the quantized transformed matrix S(k₁,k₂). In a second step (304), the decoder multiplies the symbol matrix S(k₁,k₂) by the same predetermined, fixed quantization matrix Q(k₁,k₂) to produce a transformed matrix I'(k₁,k₂). This transformed matrix I'(k₁,k₂) is an approximation of the original transformed matrix I(k₁,k₂), some fidelity having been permanently lost due to the quantization step (106) of the encoding process (100). In a third step (306), an inverse transform is applied to generate an image block matrix i'(n₁,n₂). The image block matrix i'(n₁,n₂) is an approximation of the original image block matrix i(n₁,n₂). Finally, in a fourth step (308), the blocks are merged to form an output digital image which is an approximation of the original image.

SUMMARY OF THE INVENTION

The present invention provides new and improved apparatus and methods for encoding and decoding of a series of digital images. The throughput bit rate of conventional systems varies depending upon the complexity of the images. Applying apparatus and methods of the present invention, encoding and decoding of a series of digital images may be adjusted in an automated manner to achieve a constant throughput bit rate.

For further understanding of the nature and advantages of the present invention, together with other embodiments, reference should be made to the ensuing detailed description taken in conjunction with the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

Fig. 1 is a flow diagram showing a conventional digital image encoding process (100).

Fig. 2 is a flow diagram showing a digital image encoding process (200) in accordance with a preferred embodiment of the present invention.

Fig. 3 is a flow diagram showing a conventional digital image decoding process (300).

Fig. 4 is a flow diagram showing a digital image decoding process (400) in accordance with a preferred embodiment of the present invention.

DESCRIPTION OF THE SPECIFIC EMBODIMENTS

Fig. 2 is a flow diagram showing a digital image encoding process (200) in accordance with a preferred embodiment of the present invention. The process (200) is similar to the conventional encoding process (100) in that the first two steps (102 and 104) are the same. However, thereafter the processes differ substantially.

In accordance with a preferred embodiment of the present invention, the third step (202) involves receiving a current quantization matrix or table Q(k₁,k₂). The current quantization matrix Q(k₁,k₂) is not predetermined and fixed, as it is in the conventional process (100). The current quantization matrix Q(k₁,k₂) is used to divide the transformed matrix I(k₁,k₂), and the result is quantized to generate the quantized transformed matrix of symbols S(k₁,k₂).

In a fourth step (204), symbol encoding is applied to generate an output bit stream. However, in addition to outputting the bit stream, the current quantization factor q_f is also output with the bit stream. Furthermore, the current bit rate for the bit stream is fed back to the encoder.

In a fifth step (206), the encoder utilizes the fed back bit rate to determine the next quantization factor q_f by adjustment to the current quantization factor. In a sixth step (208), this next quantization factor q_f is used to determine the next quantization table or matrix Q(k₁,k₂). This next quantization matrix Q(k₁,k₂) becomes the current quantization matrix when it is fed into the third step (202) and used to divide the next transformed matrix I(k₁,k₂).

In a preferred embodiment of the present invention, the next quantization factor q_f is determined using the following equation.

q_f = q_previous × (R_target / R_previous)^b

where R_target is the target compression ratio set for the automated process, R_previous is the previous compression ratio, q_previous is the previous quantization factor, and b = 1/a = 1/0.234 = 4.27. Compression ratios are reciprocals of bit rates.
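The adjustment of the quantization factor may be sketched as follows, assuming the update takes the power-law form q_f = q_previous × (R_target / R_previous)^b with b = 1/a. The function names and the rate model `model_compression_ratio` (R = c × q^a, with an arbitrary constant c) are hypothetical and are introduced only to exercise the update; they are not taken from the text.

```python
A = 0.234        # exponent a given in the text
B = 1.0 / A      # b = 1/a, approximately 4.27

def next_quantization_factor(q_previous, r_previous, r_target, b=B):
    """Scale the previous factor by (R_target / R_previous) ** b."""
    return q_previous * (r_target / r_previous) ** b

def model_compression_ratio(q, c=4.0, a=A):
    """HYPOTHETICAL rate model used only for this demonstration: R = c * q**a."""
    return c * q ** a

# One feedback iteration under the assumed model.
q_prev = 32.0
r_prev = model_compression_ratio(q_prev)   # ratio observed for the previous image
r_target = 10.0                            # desired compression ratio
q_next = next_quantization_factor(q_prev, r_prev, r_target)
r_next = model_compression_ratio(q_next)   # ratio the model predicts for q_next
```

Under this assumed model a single update lands on the target ratio, which is one way to read the choice b = 1/a. With real images the relationship is only approximate, so the factor would continue to be adjusted from image to image.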

In a preferred embodiment of the present invention, the next quantization matrix Q(kι,k₂) is determined using the following equations.

Q(k₁,k₂) = [Q_fixed(k₁,k₂) × q_f / 64 + 0.5]

where Q_fixed(k₁,k₂) may be, for example, a standard quantization table from the JPEG specification.
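The derivation of the next quantization table from the quantization factor may be sketched as follows. This is a minimal sketch which assumes that the table is obtained by scaling a fixed base table by q_f/64, adding 0.5, and taking the integer part. The function name is hypothetical, and the 2×2 base table is only the top-left corner of the example luminance table in the JPEG specification, used here to keep the example short.

```python
import math

def scale_quantization_table(q_fixed, q_f):
    """Return the integer part of Q_fixed * q_f / 64 + 0.5 for every entry.

    Taking the integer part with floor is an assumption of this sketch.
    """
    return [
        [math.floor(entry * q_f / 64 + 0.5) for entry in row]
        for row in q_fixed
    ]

# Top-left 2x2 corner of the JPEG example luminance table (illustration only).
q_fixed = [[16, 11], [12, 12]]

unchanged = scale_quantization_table(q_fixed, 64)    # q_f = 64 leaves the table as is
finer = scale_quantization_table(q_fixed, 32)        # smaller steps, less compression
coarser = scale_quantization_table(q_fixed, 128)     # larger steps, more compression
```

Under this reading, a factor of 64 reproduces the base table. Very small factors can drive entries to zero, so a practical implementation would likely need to keep every entry at or above 1, a point the text does not address.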

Fig. 4 is a flow diagram showing a digital image decoding process (400) in accordance with a preferred embodiment of the present invention. The process (400) is similar to the conventional decoding process (300) in that the basic steps are the same. However, the processes differ in important aspects. Like the conventional process (300), in a first step (302) symbol decoding is applied to an input bit stream to generate a matrix of symbols S(k₁,k₂). However, here, in a second step (406) the symbol matrix is multiplied by a current quantization table Q(k₁,k₂) rather than a predetermined, fixed quantization table. The current quantization table Q(k₁,k₂) is derived as follows.

First, in parallel with the first step (302), in a first additional step (402), a quantization factor q_f is extracted from the input bit stream, the quantization factor q_f having been output along with the rest of the bit stream in the fourth step (204) of the encoding process in Fig. 2. In a second additional step (404), this quantization factor q_f is used to determine the current quantization table Q(k₁,k₂) for application in the second step (406) discussed above.
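The decoder-side steps (402), (404) and (406) may be sketched as follows. The bit stream layout is not specified in the text, so this sketch assumes, purely for illustration, that q_f is carried as a single leading byte. All function names are hypothetical, and the table derivation assumes the same scaling by q_f/64 as on the encoder side.

```python
import math

def extract_quantization_factor(bitstream):
    """Step 402. HYPOTHETICAL layout: the first byte holds q_f, the rest is payload."""
    return bitstream[0], bitstream[1:]

def table_from_factor(q_fixed, q_f):
    """Step 404: rebuild the table the encoder used (assumed q_f / 64 scaling)."""
    return [[math.floor(e * q_f / 64 + 0.5) for e in row] for row in q_fixed]

def dequantize_block(symbols, qtable):
    """Step 406: multiply each decoded symbol by its table entry."""
    return [
        [s * q for s, q in zip(srow, qrow)]
        for srow, qrow in zip(symbols, qtable)
    ]

# Toy stream: q_f = 32 followed by an opaque payload for the symbol decoder.
stream = bytes([32]) + b"payload"
q_f, payload = extract_quantization_factor(stream)
qtable = table_from_factor([[16, 11], [12, 12]], q_f)
approx = dequantize_block([[5, -3], [1, 0]], qtable)   # symbols assumed already decoded
```

Because the decoder rebuilds the table from q_f and the same base table, only the factor needs to travel with each image, not the full table.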

As will be understood by those with ordinary skill in the art, the present invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof.

Claims

WHAT IS CLAIMED IS:

1. A method for lossy compression of a series of digital images, the method comprising: receiving frequency blocks for a digital image, the frequency blocks including frequency data from a digital image; dividing the frequency blocks by a quantization table in a quantized manner to generate thresholded and quantized frequency blocks; symbol encoding the thresholded and quantized frequency blocks to generate a bit stream; calculating a bit rate for the bit stream; calculating a quantization factor using the bit rate; recalculating the quantization table using the quantization factor; and repeating the method for a next digital image.

2. The method of claim 1, further comprising, prior to receiving the frequency blocks: receiving the digital image; constructing intensity blocks from the digital image, the intensity blocks including pixel intensity data from the digital image; and forward transforming the intensity blocks to generate the frequency blocks.

3. The method of claim 2, wherein the pixel intensity data comprises data from a group of data including luminance data and chrominance data.

4. The method of claim 2, wherein the forward transforming comprises DCT transforming.

5. The method of claim 1, further comprising: inserting data representing the quantization factor into the bit stream; and outputting the bit stream.

6. A method for decompression of a series of digital images, the method comprising: receiving a bit stream; extracting data representing a quantization factor from the bit stream; calculating a quantization table using the quantization factor; symbol decoding the bit stream to generate thresholded and quantized frequency blocks; and multiplying the thresholded and quantized frequency blocks by the quantization table to generate quantized frequency blocks, the quantized frequency blocks including quantized frequency data from a digital image.

7. The method of claim 6, further comprising: inverse transforming the quantized frequency blocks to generate quantized intensity blocks; and merging the quantized intensity blocks to generate a reconstructed digital image.

8. An encoding apparatus operating with the method of claim 1.

9. A decoding apparatus operating with the method of claim 6.