WO2001078403A2 - Block based video processing - Google Patents

Block based video processing Download PDF

Info

Publication number
WO2001078403A2
WO2001078403A2 PCT/GB2001/001328 GB0101328W WO0178403A2 WO 2001078403 A2 WO2001078403 A2 WO 2001078403A2 GB 0101328 W GB0101328 W GB 0101328W WO 0178403 A2 WO0178403 A2 WO 0178403A2
Authority
WO
WIPO (PCT)
Prior art keywords
blocks
picture
motion
combining
block
Prior art date
Application number
PCT/GB2001/001328
Other languages
French (fr)
Other versions
WO2001078403A3 (en
Inventor
Roderick Mackenzie Thomson
David Melvyn Banks
Original Assignee
Snell & Wilcox Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from GB0008654A external-priority patent/GB0008654D0/en
Priority claimed from GB0008653A external-priority patent/GB2361135A/en
Application filed by Snell & Wilcox Limited filed Critical Snell & Wilcox Limited
Priority to AU42585/01A priority Critical patent/AU4258501A/en
Priority to EP01915494A priority patent/EP1273176A2/en
Priority to JP2001575727A priority patent/JP2003530790A/en
Priority to CA002404655A priority patent/CA2404655A1/en
Publication of WO2001078403A2 publication Critical patent/WO2001078403A2/en
Publication of WO2001078403A3 publication Critical patent/WO2001078403A3/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation

Definitions

  • This invention relates to video processing and particularly to motion compensation of video processes.
  • FIG. 3 is a diagram of apparatus according to another embodiment of the invention.
  • phase correlation is a particularly useful technique, other and perhaps simpler forms of correlation could alternatively be employed, such as block matching.
  • a gradient approach could also be employed.

Abstract

In video processing, a motion vector is identifed for each of a plurality of overlapping picture blocks. Pictures are then shifted in accordance with the motion vectors to provide multiple shifted pictures, and these are combined with weightings derived from the proximity of the associated pixel to the respective blocks.

Description

BLOCK BASED VIDEO PROCESSING
This invention relates to video processing and particularly to motion compensation of video processes.
It is a well-known technique in video processing, to identify a motion vector for each pixel and to shift pixels in accordance with those vectors. Such motion compensation is of benefit in myriad video processes, of which standards conversion is a good example. A motion compensated process will be expected to perform considerably better than the equivalent linear process, although at a substantial extra cost in terms of hardware complexity or software processing requirement. It is an object of one aspect of the present invention to provide a method of taking motion into account, which is less complex and involves less processing than full motion compensation, but which nonetheless offers significant improvements over the equivalent linear process.
Accordingly, the present invention consists in one aspect in a method of video processing comprising the steps of identifying a motion vector for each of a plurality of overlapping picture blocks, picture shifting in accordance with said motion vectors to provide multiple shifted pictures and combining said multiple shifted pictures.
Preferably, the multiple shifted pictures are combined with respective weightings derived from the proximity of the associated pixel to the respective blocks.
Advantageously, each pixel lies in four overlapping blocks. The major difference between a typical motion compensated system and a system according to one form of this invention (which might be termed a "motion- assisted" system) is that, while a motion compensated system has a vector bandwidth similar to the pixel rate; the motion assisted system may have many fewer vectors per field. Each vector can be associated with a relatively large block.
If the images are constructed using only vectors based on large blocks, the resulting images may look very "blocky" or like independent tiles rather one image. The technique used in one form of this invention to avoid this effect is to construct each point as a mix of four images, which are constructed using the four closest block vectors. The relative distance from the four block centres controls the proportions in which the four images are mixed.
The advantage of this technique is that discontinuities in the vector field result in image blurring rather than image discontinuities at block boundaries. If two adjacent blocks have the same vector then there is no difficulty but when the vector changes between two blocks the resulting pictures are quite different. A conventional block based system will produce an edge or discontinuity at the block boundary which is particularly visible because it is always in the same position on the screen (that is to say: the inherent blocks become very visible). The approach taken in the present invention will cause image "blurring" over the distance from one block centre to the next, which is much less objectionable. The present invention consists in another aspect in a method of video processing comprising the steps of identifying a picture region, combining pixels in a first direction over that region; and performing a one dimensional correlation process upon said combined pixels to identify a motion vector in a second, orthogonal direction. Preferably, the method further comprises the steps of combining pixels in said second direction over that region; and performing a one dimensional correlation process upon said combined pixels to identify a motion vector in said first direction.
In one form of the invention, each individual frame of the sequence is split into a number of blocks. Each m by n block is then summed in one dimension to produce either an m by 1 or a 1 by n block. These two blocks are then analysed for motion in one dimension using phase correlation.
The invention will now be described by way of example with reference to the accompanying drawings, in which:-
Figure 1 is a block diagram of apparatus according to an embodiment of the present invention; Figure 2 is a diagram illustrating the block structure and mix weightings.
Figure 3 is a diagram of apparatus according to another embodiment of the invention; and
Figure 4 is a diagram illustrating a further embodiment of the invention.
The following notation is employed in the figures:
X = motion estimation block centre 0 = current pixel p vec(-,-) / vec(+,-) / vec(-,+) / vec(+,+) are the vectors from the four blocks whose block centres are closest to the current pixel p
p(-,-) = image interpolated at p using vec(-,-) p(-,+)= image interpolated at p using vec(-,+) p(+,-)= image interpolated at p using vec(+,-) p(+,+)= image interpolated at p using vec(+,+)
The output pixel p(out) = vpos(H2) + (1-vpos)(HI)
Where; HI = hpos(p(+,-) + (1-hpos)p(-,-) and
H2 = hpos(p(+,+) + (1 -hpos)p(-,+)
Referring initially to Figure 1 , the input video signal is taken to a block based motion estimator (100). This derives one vector for each block, N vectors per field, utilising phase correlation or simpler motion measurement techniques, which are held in a vector store (102). Figure 2 shows by way of example an image which has 20 measurement blocks arranged on a 5 x 4 grid with the block centres marked "X". It should be noted that the measurement blocks may be overlapping. The vectors vec(-,-), vec(+,-), vec(-,+) and vec(+,+) are then passed from the store (102) to picture shifters (104). The four shifted pictures p(-,-), P(-.+). P(+.-). and P(+.+) are then mixed via blocks 106, 108 and 110 in a two-stage mixing process, first using hpos, and then mixing the two remaining signals using vpos. This produces output picture p(out).
The picture shifts can be regarded as read/write operations with an offset determined by the vector. This offset may be employed on either the read or the write side. Forward or backward vectors can be employed, or combinations thereof.
Each of the picture shifters shown in Figure 1 may comprise a vertical shifter followed by a horizontal shifter. It is not uncommon for horizontal motion to occur more frequently in the pictures to be processed than vertical motion. In this case a saving in hardware complexity may be achieved by reducing the number of vertical shift circuits.
Figure 3 shows an example where only two vertical shifters are used. Because there are fewer vertical shifters, the vertical vector field is subsampled horizontally so as to make the number of required shift values correspond to the number of shifters. For example the four vectors of Figure 2 could be processed as shown below to obtain two vertical shift values and four horizontal shift values.
Let vec(-,-) have horizontal component H(-,-), and vertical component
V(-,-), and vec(+,-) have horizontal component H(+,-), and vertical component V(+,-), etc. Then:
Vertical Shift 1 = 1/2[V(-,-) + V(+,-)]
Vertical Shift 2 = 1/2[V(-,+) + V(+,+)]
Horizontal Shift 1 = H(-,-)
Horizontal Shift 2 = H(+,-) Horizontal Shift 3 = H(-,+)
Horizontal Shift 4 = H(+,+)
The use of these six shift values is shown in Figure 3. The input picture (30) is fed in parallel to two vertical shifters (31) (32). The four vectors from the blocks containing the current pixel are processed as described above in the vector processor (33) so as to derive respective vertical shift values for the two vertical shifters. The shifted output picture from the vertical shifter (31) is fed in parallel to two horizontal shifters (34) (35). These shifters are fed with horizontal shift values from the vector processor (33) in order to create the pictures p(-,-) and p(+,-) for the mixer shown in Figure 1. The output of the vertical shifter (32) is processed in a similar way in the horizontal shifters (36) and (37) to create the other two pictures for the mixer. The output picture mixer has been processed in accordance with motion vectors from four overlapping blocks, but the vertical component of the vectors have been used with reduced resolution to achieve a saving in hardware complexity. Where horizontal motion predominates the subjective quality of the pictures is not adversely affected.
The mixing can of course be conducted in other ways and the relative weighting can take into account other considerations such as the confidence or estimated error in each vector. Referring finally to Figure 4, an input video signal is first organised (400) into b blocks; each block is n pixels by m lines. In one example, there are 63 blocks of 64 x 64 points. These are summed vertically to produce 63 blocks of 64 points. The blocks are 100% overlapping.
In separate horizontal and vertical paths, these blocks are windowed (402, 404) and summed (406, 408) in one direction, "m" and "n" point phase correlations are then conducted (410, 412) for b blocks per picture and the resulting correlation surfaces are then filtered (414, 416). Peaks are then detected (418, 420).
The horizontal and vertical vectors may be used separately or alternatively combined vectorally before use.
The post correlation filtering is optional and is used to increase the reliability of resulting vectors. In one embodiment adjacent blocks are filtered in the H ,V and temporal direction.
The windowing and summing functions could be replaced by other means for combining pixels in one direction. It is preferable to take steps to remove block edge effects and it may sometimes be preferable to weight the sum or other combination to give priority to pixels close to the block centre.
Whilst phase correlation is a particularly useful technique, other and perhaps simpler forms of correlation could alternatively be employed, such as block matching. A gradient approach could also be employed.
In certain applications it will be sufficient to sum in the vertical direction only and to detect only horizontal motion components. It is usually the horizontal motion components that cause the most objectionable motion artefacts in a linear process. In other applications, the horizontal and vertical processing will be time-multiplexed in common hardware.
Processing according to the present invention lends itself particularly well to software implementation or implementation in generic or video-specific digital signal processors.
These techniques can be applied to standards conversion but would be equally applicable in other areas where motion detection is useful. These include prediction based compression systems, interpolators and noise reducers.
It should be understood that this invention has been described by way of examples only and that numerous modifications are possible without departing from the scope of the invention. For example, certain embodiments may make use not merely of the horizontal and vertical components of the vectors, but of further components in other dimensions. For example, one such further dimension would be information regarding depth or distance into the picture, as considered in various special effects systems, and standards such as MPEG-4.

Claims

1. A method of video processing comprising the steps of identifying a motion vector for each of a plurality of overlapping picture blocks, picture shifting in accordance with said motion vectors to provide multiple shifted pictures and combining said multiple shifted pictures.
2. A method according to Claim 1 , in which said multiple shifted pictures are combined with respective weightings derived from the proximity of the associated pixel to the respective blocks.
3. A method according to Claim 1, in which each pixel lies in four overlapping blocks.
4. A method according to any of the preceding claims, further comprising picture shifting in stages, each stage making a shift in a direction orthogonal to the other stages.
5. A method according to Claim 4, wherein a different number of shifts is performed in at least one orthogonal direction.
6. A method of video processing comprising the steps of identifying a picture region, combining pixels in a first direction over that region; and performing a one dimensional correlation process upon said combined pixels to identify a motion vector in a second, orthogonal direction.
7. A method according to Claim 6, further comprises the steps of combining pixels in said second direction over that region; and performing a one dimensional correlation process upon said combined pixels to identify a motion vector in said first direction.
8. A method according to Claim 6 or Claim 7, wherein the step of combining comprises the step of summing.
9. A method according to Claim 8, wherein the sum is a weighted sum.
10. A method according to Claim 8 or Claim 9, wherein the step of combining further comprises the step of windowing.
11. A method according to any one of the preceding claims, in which each picture is split into a number of m by n blocks with each m by n block being summed in one dimension to produce either an m by 1 or a 1 by n block.
12. A method according to Claim 11, in which each pixel lies in four overlapping blocks.
13. A method according to any one of the preceding claims, wherein the correlation process comprises phase correlation.
PCT/GB2001/001328 2000-04-07 2001-03-26 Block based video processing WO2001078403A2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
AU42585/01A AU4258501A (en) 2000-04-07 2001-03-26 Block based video processing
EP01915494A EP1273176A2 (en) 2000-04-07 2001-03-26 Block based video processing
JP2001575727A JP2003530790A (en) 2000-04-07 2001-03-26 Block-based video processing method
CA002404655A CA2404655A1 (en) 2000-04-07 2001-03-26 Block based video processing

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
GB0008654.6 2000-04-07
GB0008654A GB0008654D0 (en) 2000-04-07 2000-04-07 Improvements in video processing
GB0008653.8 2000-04-07
GB0008653A GB2361135A (en) 2000-04-07 2000-04-07 Block based video processing

Publications (2)

Publication Number Publication Date
WO2001078403A2 true WO2001078403A2 (en) 2001-10-18
WO2001078403A3 WO2001078403A3 (en) 2002-03-28

Family

ID=26244063

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB2001/001328 WO2001078403A2 (en) 2000-04-07 2001-03-26 Block based video processing

Country Status (5)

Country Link
EP (1) EP1273176A2 (en)
JP (1) JP2003530790A (en)
AU (1) AU4258501A (en)
CA (1) CA2404655A1 (en)
WO (1) WO2001078403A2 (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5602593A (en) * 1994-02-22 1997-02-11 Nec Corporation Overlapped motion compensation using a window function which varies in response to an input picture

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5602593A (en) * 1994-02-22 1997-02-11 Nec Corporation Overlapped motion compensation using a window function which varies in response to an input picture

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
"MUNICH MEETING OF MPEG-4 WORKING GROUP. REPORT ISO/IEC JTC1/SC29/WG11 MPEG4/N1172" INTERNATIONAL ORGANIZATION FOR STANDARDIZATION - ORGANISATION INTERNATIONALE DE NORMALISATION,XX,XX, 1996, pages 3-49, XP002047798 *
ORCHARD M T ET AL: "OVERLAPPED BLOCK MOTION COMPENSATION: AN ESTIMATION-THEORETIC APPROACH" IEEE TRANSACTIONS ON IMAGE PROCESSING,IEEE INC. NEW YORK,US, vol. 3, no. 5, 1 September 1994 (1994-09-01), pages 693-699, XP000476843 ISSN: 1057-7149 *
SATOSHI NOGAKI ET AL: "AN OVERLAPPED BLOCK MOTION COMPENSATION FOR HIGH QUALITY MOTION PICTURE CODING" SAN DIEGO, MAY 10 - 13, 1992,NEW YORK, IEEE,US, vol. CONF. 25, 10 May 1992 (1992-05-10), pages 184-187, XP000337274 ISBN: 0-7803-0593-0 *
SULLIVAN G J ET AL: "METHODS OF REDUCED-COMPLEXITY OVERLAPPED BLOCK MOTION COMPENSATION" AUSTIN, NOV. 13 - 16, 1994,LOS ALAMITOS, IEEE COMP. SOC. PRESS,US, vol. CONF. 1, 13 November 1994 (1994-11-13), pages 957-961, XP000522757 ISBN: 0-8186-6952-7 *

Also Published As

Publication number Publication date
WO2001078403A3 (en) 2002-03-28
EP1273176A2 (en) 2003-01-08
CA2404655A1 (en) 2001-10-18
JP2003530790A (en) 2003-10-14
AU4258501A (en) 2001-10-23

Similar Documents

Publication Publication Date Title
EP0393906B1 (en) Video signal interpolation
EP0765573B1 (en) Motion-compensated interpolation
US4672442A (en) Frame rate conversion system in television signal
US5600377A (en) Apparatus and method for motion compensating video signals to produce interpolated video signals
JP2611591B2 (en) Motion compensator
EP0294958A2 (en) Motion compensated interpolation of digital television images
US5012337A (en) Motion dependent video signal processing
US5444493A (en) Method and apparatus for providing intra-field interpolation of video signals with adaptive weighting based on gradients of temporally adjacent fields
JPS63313981A (en) Digital television image motion vector processor
EP0395263B1 (en) Motion dependent video signal processing
EP1104970B1 (en) Method and device for converting number of frames of image signals
US5485224A (en) Motion compensated video signal processing by interpolation of correlation surfaces and apparatus for doing the same
GB2313515A (en) Motion compensated video signal processing
KR100968642B1 (en) Method and interpolation device for calculating a motion vector, display device comprising the interpolation device, and computer program
US20030165195A1 (en) Block based video processing
WO2001078403A2 (en) Block based video processing
JPH05207271A (en) Picture magnification equipment
JP3180741B2 (en) Motion detection circuit
GB2277004A (en) Motion compensated video signal processing; motion/no motion flag
GB2361135A (en) Block based video processing
US7202909B2 (en) Video signal processing with two stage motion compensation
KR100628190B1 (en) Converting Method of Image Data's Color Format
GB2315181A (en) Motion compensated video signal processing
JPH0235515B2 (en)
JPH10164600A (en) Motion compensation predicting method

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AU CA JP US

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: A3

Designated state(s): AU CA JP US

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR

WWE Wipo information: entry into national phase

Ref document number: 2404655

Country of ref document: CA

WWE Wipo information: entry into national phase

Ref document number: 2001915494

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 42585/01

Country of ref document: AU

ENP Entry into the national phase in:

Ref country code: JP

Ref document number: 2001 575727

Kind code of ref document: A

Format of ref document f/p: F

WWP Wipo information: published in national office

Ref document number: 2001915494

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 10257156

Country of ref document: US

WWW Wipo information: withdrawn in national office

Ref document number: 2001915494

Country of ref document: EP