WO2000028477A1 - Image processing apparatus - Google Patents

Image processing apparatus Download PDF

Info

Publication number
WO2000028477A1
WO2000028477A1 PCT/GB1999/003716 GB9903716W WO0028477A1 WO 2000028477 A1 WO2000028477 A1 WO 2000028477A1 GB 9903716 W GB9903716 W GB 9903716W WO 0028477 A1 WO0028477 A1 WO 0028477A1
Authority
WO
WIPO (PCT)
Prior art keywords
sub
regions
rendering
tiles
image
Prior art date
Application number
PCT/GB1999/003716
Other languages
French (fr)
Inventor
Cliff Gibson
Original Assignee
Imagination Technologies Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=10842011&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=WO2000028477(A1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Imagination Technologies Limited filed Critical Imagination Technologies Limited
Priority to AT99954179T priority Critical patent/ATE258327T1/en
Priority to EP19990954179 priority patent/EP1125250B1/en
Priority to JP2000581592A priority patent/JP4480895B2/en
Priority to DE69914355T priority patent/DE69914355T2/en
Priority to US09/831,386 priority patent/US6750867B1/en
Publication of WO2000028477A1 publication Critical patent/WO2000028477A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/003D [Three Dimensional] image rendering
    • G06T15/005General purpose rendering architectures

Definitions

  • This invention relates to apparatus for the processing of images.
  • the invention is particularly, though not exclusively, suitable for use in systems for the real-time rendering, texturing or shading of three-dimensional (3D) images.
  • Real-time here means sufficiently quickly for the image to be displayed without appreciable perceptible delay to the viewer.
  • the best known existing system for generating realtime 3D images is the Z-buffer (or depth buffer) image precision algorithm.
  • the Z-buffer algorithm requires a frame buffer in which color values are stored for each pixel (elementary picture element) in an image. In addition to this it requires a Z-buffer with an entry for each pixel. In this Z-buffer, a Z value or depth value is stored for each pixel.
  • polygons are rendered into the frame buffer in arbitrary order. As a subsequent polygon is entered into the frame buffer, if a point on the polygon is nearer to the viewer than the point already in the frame buffer for that pixel, then the new point's color and Z value replace the previously-stored values.
  • the texturing or shading is applied to the polygon before it is rendered into the frame buffer.
  • the system is fed with a list of vertices which define the polygons, and texture and shading operations are performed on each polygon before a depth test is executed.
  • the performance of such systems is limited by various factors, including the input data bandwidth, the speed of texturing and shading, and the local memory interface bandwidth. It has been proposed to improve the system performance by the use of dual rendering devices, which are 'scan line interleaved'.
  • the system can determine whether any surface is visible. If it is visible, that surface is then textured or shaded as desired. However if it is not visible, texturing and shading of the surface is not necessary.
  • One advantage of this system over the Z-buffer is that non-visible surfaces do not have to be textured or shaded. Texturing or shading of images requires a great deal of processing power. The reduction in processing achieved by the system of the above-mentioned United States Patent is therefore very useful, and can be quite dramatic with certain types of image, particularly those having many overlapping polygons or surfaces.
  • each surface is defined as being a forward surface, if it is at the front of the object and thus faces towards the observer, or a reverse surface if it forms part of the back of the object and thus faces away from the observer.
  • a comparison is made of the distances from the observation point to (a) the forward surface intersection with the ray which is furthest from the observation point and (b) the reverse surface intersection with the ray which is closest to the observation point.
  • the technique makes it particularly easy to apply shadows to appropriate parts of the image.
  • the system is also able to deal with transparency which can take various forms.
  • the technique has advantages over the Z-buffer system, but nevertheless processing requirements can still be a constraint. It is proposed in the Patent to improve performance by subdividing the image plane or screen into a plurality of sub-regions or 'tiles'.
  • the tiles are conveniently rectangular (including square) .
  • those objects having surfaces which could fall within the tile are first determined, and only those objects within the tile are processed, thus decreasing the number of surfaces to be processed.
  • the determination of which objects could contribute to each tile may be achieved by surrounding the object with a bounding volume, namely a cuboid which fully contains the object, and comparing the tile area with the bounding volume. To do this, all the bounding volumes are projected onto the image plane and are tested against the corners of the tiles.
  • this preferred embodiment of the invention takes the form of image processing apparatus for rendering (i.e. coloring, texturing or shading) an image includes a tiling device which divides the image into sub-regions or tiles. Two rendering devices are provided, and the tiles are allocated so that some are processed by one rendering device and some by the other. Polygons representing surfaces of objects to be displayed are tested against the tiles. If the surface falls into one sub-region only, the data is sent to one rendering device only. On the other hand, if the surface falls into two sub-regions being handled by the different rendering devices, then the data is sent to both rendering devices. The result is that a substantial proportion of the data need only be supplied to and processed by one rendering device, thereby speeding the operation of the apparatus. The outputs of the two rendering devices are subsequently combined by tile interleaving and image display circuitry.
  • Figure 1 illustrates an portion of an image plane containing several objects and having tiles of variable sizes, taken from U.S. Patent 5,729,672,
  • Figure 2 illustrates a portion of the screen containing tiles m accordance with an embodiment of the present invention
  • Figure 3 is a diagram showing a bounding box around an object
  • Figure 4 is a flow chart of the procedure used to determine the co-ordinates of a bounding box around an object
  • Figure 5 illustrates a portion of the screen containing tiles in a modification of the embodiment of Figure 1; and Figure 6 is a block schematic diagram of hardware used in preferred image processing apparatus embodying the invention .
  • FIG. 2 there is shown a portion 20 of a screen which contains twelve sub-regions or tiles 22 as shown.
  • Each tile is typically 32 pixels or 64 pixels square, that is, using a conventional raster scan, 32 or 64 pixels long by 32 or 64 lines high.
  • two objects are diagrammatically shown on this Figure, namely a house 24 and a bicycle 26. Each of these objects extends, for the sake of illustration, over two of the tiles 22. In the case of the house 24 these are one above the other in the image, and in the case of the bicycle they are side by side. Either way the operation is the same.
  • the tiles are divided into two groups of tiles. As shown, the tiles are split into two groups in checkerboard fashion. As shown in Figure 2, alternate tiles are shaded light or dark on the figure, with each light tile 22A being surrounded by four dark tiles and each dark tile 22B being surrounded by four light tiles. The light and dark tiles thus form diagonals across the screen.
  • the rendering of the objects is split between two processors which may be referred to as processor A and processor B.
  • processor A and processor B Each group of tiles is associated with a respective one of the processors A and B. That is, all the light tiles 22A are associated with processor A and all the dark tiles 22B, as shown in Figure 2, are associated with processor B. All the processing of surfaces which are seen in a light tile is undertaken by processor A and all the processing of surfaces which are seen in a dark tile is undertaken by processor B.
  • Complex objects can be seen to be made up of a group of several smaller polygons.
  • the house 24 is made up of a triangle for the roof 28, a triangle and a square for the chimney 30, and rectangles for the main body 32 of the house, the windows 34, and the door 36.
  • Those polygons which make up the roof 28, the chimney 30, and the upstairs one of the windows 34 lie entirely within a light tile 22A, and so need only be sent to processor or device A.
  • the door 36 and the downstairs window 34 lie entirely within a dark tile 22B, and so need only be sent to processor or device B.
  • the mam body 32 of the house overlaps two tiles. Accordingly it must be sent to both processors or devices as it affects the display m both a light tile and a dark tile.
  • the polygons which make up the front wheel, the front forks and the handlebars are sent to device A only; the rear wheel, the rear forks and the saddle are sent to device B only; while the frame of the bicycle is sent to both devices. Larger objects may extend over three or more tiles, but the processing applied and its effects are the same as with objects that overlap just two tiles.
  • each of the polygons which make up the complex object can be achieved by drawing a rectangle to enclose the entire polygon, and then testing the rectangle against the co-ordinates of the tiles of the screen to determine the extent of the polygon.
  • This is illustrated m Figure 3, which shows an arbitrary polygon 4C.
  • This polygon is shown as a re-entrant polygon and may conveniently be broken into non-re-entrant polygons for processing, if desired, but the principle of establishing the bounding box is the same.
  • Polygon data is supplied to the rendering device by giving the co-ordinates of each vertex of the polygon. The co-ordinates are given in a Cartesian system which has three orthogonal axes, X, Y and Z.
  • the final display screen is assumed to be in the plane of the X-Y axes, while the Z axis represents the depth of the object, as is conventional.
  • the procedure used to determine the minimum values of x and y which define the corners 44 of the bounding box 42 is illustrated m Figure 4.
  • a first step 52 the input values x._, y._ for the first vertex processed are initially assumed to be the desired values x m _, x_, ⁇ , y mil , y-, a • Tne next vertex is then processed in step 54.
  • the new input values x ⁇ n , y ⁇ r are compared with the stored values for x ⁇ ._, x max , y m r , yêt x . If for either x or y the new input value is less than the stored minimum value then it replaces the minimum value, and if the new input value exceeds the maximum value it replaces "the maximum value.
  • step 56 a check is made to see whether the last vertex defining the polygon has been processed. If not, the procedure returns to step 54 to process the next vertex. If it has, then the co-ordinates of the bounding box 42 have now been determined, and the procedure moves to step 58 where a determination is made to see whether the bounding box overlaps the screen tile which is being processed. This determination also is made by simple comparison of the XY co-ordinates of the corners 44 of the bounding box 42 with the XY co-ordinates of the corners of the tile.
  • the tile size in terms of the number of pixels is preferably chosen to be a power of two, e.g. 32 or 64, which has the consequence that the screen tile test 58 reduces to a number of simple binary comparisons.
  • FIG. 5 illustrates a form of division in which a large rectangle 2 contains a small rectangle 1 of shorter height and width.
  • the device B is arranged to process the tiles forming the smaller rectangle 1, and the processor A is arranged to process what is left, that is rectangle 2 with rectangle 1 excluded.
  • Rectangle 1 is thus defined for processor A as an inclusion rectangle and rectangle 2 as an exclusion rectangle. That is, all polygons entirely enclosed in rectangle 1 are sent to device A. Polygons outside rectangle 1 but inside rectangle 2 are sent to device B. Polygons which overlap the two areas are sent to both devices.
  • the division of the screen is such that a substantial number of surfaces have to be sent only to one processor.
  • the tiles are preferentially substantially square with, say, one side not more than twice or three times the length m pixels of the other, so that there is a reasonable HKelihood that a good proportion of the surfaces will fall only in one tile. In this way the processing is reduced by some surfaces having to be sent to one processor only.
  • a split in which alternate scan-lines were processed by different processors would not provide any advantage because the number of surfaces which fall only on one scanlme is zero, or close to it.
  • Another possible split would be in horizontal bands across the screen, but the bands would have to be sufficiently wide, e.g. the screen as a whole might be split into three or four bands.
  • the tile aspect ratio is less than six to one. In any event, there will normally be at least three screen areas, with at least one of the processors being arranged to process at least two discrete separate image areas.
  • Figure 6 shows in block schematic form image processing apparatus 60 embodying the invention and comprising a central processing unit (CPU) 62 connected to a main memory 64.
  • a tiling device 66 defines the tiles and communicates with a local memory 68 as well as with the CPU 62.
  • the tiling device effectively has two outputs to which are attached a first texturing or rendering device 70A and a second texturing or rendering device 70B.
  • the outputs of the two rendering devices 68A and 68B are both applied to tile interleaving and image display circuitry 72.
  • Objects are generated by the user (programmer) . They are defined by their vertices, and by texture codes which indicate the type of texturing required for each surface. This will include the color and other surface effects.
  • a bounding box is generated for each surface, following the method described above.
  • the bounding box is compared against the macro tiling pattern being employed, so as to determine which surfaces fall into which tiles, and hence which of the multiple rendering devices (two in this case) require the data.
  • Surface vertices and texture codes are then stored in the appropriate local memory portion associated with each memory device.
  • step 7 for some tiles is achieved by one processor and the same step for the other (dark) tiles is achieved by the other processor. This reduces the time required for processing by a factor less than but approaching a half.
  • the outputs of the two processors are combined for application to the display buffer and subsequent display on the display screen.
  • the system operates by supplying data defining a group of surfaces representing an object, e.g. the house or the bicycle in Figure 2.
  • the display is subdivided into a large number of tiles, and a determination is made as to which surfaces fall into which tiles.
  • the data is then applied to the two rendering devices in dependence upon which tile the various surfaces fall into.
  • the data of some surfaces will be sent to one rendering device only and the data of other surfaces will be sent to both rendering devices. More particularly, when the surface falls into one tile only, e.g. the roof or the door in Figure 2, the data need only be sent to one rendering or texturing device.
  • the surface data must be sent to both rendering devices .
  • the invention is not limited to the use of two devices; more than two may be used if desired, in which case the screen is split up into an appropriate larger number of regions each comprising a respective group of tiles .
  • the embodiments illustrated have the advantage that, assuming normal images are being processed, the processing time is reduced, due to the fact that the processors can operate simultaneously in parallel, each processor needing to process only some of the surfaces and not all the surfaces in the image.

Abstract

Image processing apparatus (60) for rendering (i.e. coloring, texturing or shading) an image includes a tiling device (66) which divides the image into sub-regions or tiles. Two rendering devices (70A, 70B) are provided, and the tiles are allocated so that some are processed by one rendering device and some by the other. Polygons representing surfaces of objects to be displayed are tested against the tiles. If the surface falls into one sub-region only, the data is sent to one rendering device only. On the other hand, if the surface falls into two sub-regions being handled by the different rendering devices, then the data is sent to both rendering devices. The result is that a substantial proportion of the data need only be supplied to and processed by one rendering device, thereby speeding the operation of the apparatus. The outputs of the two rendering devices (70A, 70B) are subsequently combined by tile interleaving and image display circuitry (72).

Description

IMAGE PROCESSING APPARATUS
Background of the Invention
This invention relates to apparatus for the processing of images. The invention is particularly, though not exclusively, suitable for use in systems for the real-time rendering, texturing or shading of three-dimensional (3D) images. Real-time here means sufficiently quickly for the image to be displayed without appreciable perceptible delay to the viewer.
The best known existing system for generating realtime 3D images is the Z-buffer (or depth buffer) image precision algorithm. The Z-buffer algorithm requires a frame buffer in which color values are stored for each pixel (elementary picture element) in an image. In addition to this it requires a Z-buffer with an entry for each pixel. In this Z-buffer, a Z value or depth value is stored for each pixel. To generate a 3D representation, polygons are rendered into the frame buffer in arbitrary order. As a subsequent polygon is entered into the frame buffer, if a point on the polygon is nearer to the viewer than the point already in the frame buffer for that pixel, then the new point's color and Z value replace the previously-stored values. If texture or shading of the polygon is desired, the texturing or shading is applied to the polygon before it is rendered into the frame buffer. The system is fed with a list of vertices which define the polygons, and texture and shading operations are performed on each polygon before a depth test is executed. The performance of such systems is limited by various factors, including the input data bandwidth, the speed of texturing and shading, and the local memory interface bandwidth. It has been proposed to improve the system performance by the use of dual rendering devices, which are 'scan line interleaved'. That is, there are two processors which process alternate scanlines of the image raster, thus sharing the processing between them Another system for generating real-time 3D images is described m United States Patent US-A-5, 729, 672 assigned to VideoLogic Limited. This system uses a 'ray-casting' technique for the rendering of three-dimensional images rather than conventional polygon-based rendering techniques. In this system, objects are each represented by a set of surfaces which are stored as sets of data. An image plane is deemed to lie between a viewer and the scene to be viewed, and this image plane is composed of a plurality of pixels. A ray is assumed to pass from the viewpoint through a pixel of the screen into the scene to be viewed and will intersect various surfaces which represent objects n the scene. By analysis of these intersections and their distances from the viewer, the system can determine whether any surface is visible. If it is visible, that surface is then textured or shaded as desired. However if it is not visible, texturing and shading of the surface is not necessary. One advantage of this system over the Z-buffer is that non-visible surfaces do not have to be textured or shaded. Texturing or shading of images requires a great deal of processing power. The reduction in processing achieved by the system of the above-mentioned United States Patent is therefore very useful, and can be quite dramatic with certain types of image, particularly those having many overlapping polygons or surfaces.
In the system of the United States Patent, the surfaces defining an object are assumed to extend across the whole of the image plane, that is to say they are 'infinite' in extent. Also, each surface is defined as being a forward surface, if it is at the front of the object and thus faces towards the observer, or a reverse surface if it forms part of the back of the object and thus faces away from the observer. To determine whether any given object is visible at any given pixel, in the ray-casting technique, a comparison is made of the distances from the observation point to (a) the forward surface intersection with the ray which is furthest from the observation point and (b) the reverse surface intersection with the ray which is closest to the observation point. If (a) is greater than (b) , then, as illustrated in the Patent, that indicates that the ray does not intersect with that object, and thus that that particular object is not visible at the pixel in the image plane through which the ray passes. With this technique, the edges of the surfaces do not have to be defined or calculated as such; it is sufficient to know the vertices of the object as a whole and to calculate the planes occupied by the surfaces. It will be appreciated that the technique requires that every object is checked for every pixel in the image plane to determine whether or not that object is visible at that location.
As described in that patent, the technique makes it particularly easy to apply shadows to appropriate parts of the image. The system is also able to deal with transparency which can take various forms.
The technique has advantages over the Z-buffer system, but nevertheless processing requirements can still be a constraint. It is proposed in the Patent to improve performance by subdividing the image plane or screen into a plurality of sub-regions or 'tiles'. The tiles are conveniently rectangular (including square) . Then for each tile, those objects having surfaces which could fall within the tile are first determined, and only those objects within the tile are processed, thus decreasing the number of surfaces to be processed. The determination of which objects could contribute to each tile may be achieved by surrounding the object with a bounding volume, namely a cuboid which fully contains the object, and comparing the tile area with the bounding volume. To do this, all the bounding volumes are projected onto the image plane and are tested against the corners of the tiles. Those objects which have bounding surfaces which are completely outside the tile are discarded for that tile. Thus the number of surfaces which need to be processed per pixel within a tile becomes less, and hence the total time to render an image is reduced, since the total processing time for all the tiles will be reduced. The United States Patent describes the use of tiles of variable size. This reflects the fact that objects are not normally evenly distributed over the entire screen. As shown n Figure 1 of the drawings of the present application, three tiles 10 are 10 pixels square to accommodate three particular objects, and four tiles 12 are 5 pixels square to accommodate four smaller objects. The image portions for the several tiles are processed n pipeline fashion n a processing system as described in the patent.
We have appreciated that even despite all tnese features, processing power can still be a constraint, but can be improved by use of the present invention.
Summary of the Invention
The invention in its various aspects is defined in the appended claims to which reference should now be made. Preferred features of the invention are set forth in the appendant claims.
A preferred embodiment of the invention is described in more detail below with reference to the drawings. Briefly, this preferred embodiment of the invention takes the form of image processing apparatus for rendering (i.e. coloring, texturing or shading) an image includes a tiling device which divides the image into sub-regions or tiles. Two rendering devices are provided, and the tiles are allocated so that some are processed by one rendering device and some by the other. Polygons representing surfaces of objects to be displayed are tested against the tiles. If the surface falls into one sub-region only, the data is sent to one rendering device only. On the other hand, if the surface falls into two sub-regions being handled by the different rendering devices, then the data is sent to both rendering devices. The result is that a substantial proportion of the data need only be supplied to and processed by one rendering device, thereby speeding the operation of the apparatus. The outputs of the two rendering devices are subsequently combined by tile interleaving and image display circuitry.
Brief Description of the Drawings
The invention will now be described in more detail by way of example with reference to the accompanying drawings, in which:
Figure 1 illustrates an portion of an image plane containing several objects and having tiles of variable sizes, taken from U.S. Patent 5,729,672,
Figure 2 illustrates a portion of the screen containing tiles m accordance with an embodiment of the present invention;
Figure 3 is a diagram showing a bounding box around an object;
Figure 4 is a flow chart of the procedure used to determine the co-ordinates of a bounding box around an object;
Figure 5 illustrates a portion of the screen containing tiles in a modification of the embodiment of Figure 1; and Figure 6 is a block schematic diagram of hardware used in preferred image processing apparatus embodying the invention .
Detailed Description of the Preferred Embodiments A method of and apparatus for image processing, more particularly for the real-time texturing or shading of three-dimensional (3D) images, will now be described with reference to Figure 2 et seq. of the drawings.
The introduction above describes a "bounding volume" technique in which objects which have bounding volumes which are completely outside a tile are discarded for that tile. In the following description a specific implementation of this principle is used, namely it is assumed that objects are projected onto the screen or image plane, and their bounding surfaces as seen on that plane are compared with the tiles.
Referring first to Figure 2, there is shown a portion 20 of a screen which contains twelve sub-regions or tiles 22 as shown. Each tile is typically 32 pixels or 64 pixels square, that is, using a conventional raster scan, 32 or 64 pixels long by 32 or 64 lines high. It will be seen that two objects are diagrammatically shown on this Figure, namely a house 24 and a bicycle 26. Each of these objects extends, for the sake of illustration, over two of the tiles 22. In the case of the house 24 these are one above the other in the image, and in the case of the bicycle they are side by side. Either way the operation is the same.
The tiles are divided into two groups of tiles. As shown, the tiles are split into two groups in checkerboard fashion. As shown in Figure 2, alternate tiles are shaded light or dark on the figure, with each light tile 22A being surrounded by four dark tiles and each dark tile 22B being surrounded by four light tiles. The light and dark tiles thus form diagonals across the screen.
In accordance with this invention, the rendering of the objects, that is to say the texturing and shading of the objects, which has hitherto been handled by a single processor, is split between two processors which may be referred to as processor A and processor B. Each group of tiles is associated with a respective one of the processors A and B. That is, all the light tiles 22A are associated with processor A and all the dark tiles 22B, as shown in Figure 2, are associated with processor B. All the processing of surfaces which are seen in a light tile is undertaken by processor A and all the processing of surfaces which are seen in a dark tile is undertaken by processor B.
Complex objects can be seen to be made up of a group of several smaller polygons. For example, the house 24 is made up of a triangle for the roof 28, a triangle and a square for the chimney 30, and rectangles for the main body 32 of the house, the windows 34, and the door 36.
Those polygons which make up the roof 28, the chimney 30, and the upstairs one of the windows 34 lie entirely within a light tile 22A, and so need only be sent to processor or device A. The door 36 and the downstairs window 34 lie entirely within a dark tile 22B, and so need only be sent to processor or device B. However, the mam body 32 of the house overlaps two tiles. Accordingly it must be sent to both processors or devices as it affects the display m both a light tile and a dark tile. Similarly for the bicycle 26, the polygons which make up the front wheel, the front forks and the handlebars are sent to device A only; the rear wheel, the rear forks and the saddle are sent to device B only; while the frame of the bicycle is sent to both devices. Larger objects may extend over three or more tiles, but the processing applied and its effects are the same as with objects that overlap just two tiles.
The extent of each of the polygons which make up the complex object can be achieved by drawing a rectangle to enclose the entire polygon, and then testing the rectangle against the co-ordinates of the tiles of the screen to determine the extent of the polygon. This is illustrated m Figure 3, which shows an arbitrary polygon 4C. This polygon is shown as a re-entrant polygon and may conveniently be broken into non-re-entrant polygons for processing, if desired, but the principle of establishing the bounding box is the same. Polygon data is supplied to the rendering device by giving the co-ordinates of each vertex of the polygon. The co-ordinates are given in a Cartesian system which has three orthogonal axes, X, Y and Z. The final display screen is assumed to be in the plane of the X-Y axes, while the Z axis represents the depth of the object, as is conventional. The procedure used to determine the minimum values of x and y which define the corners 44 of the bounding box 42 is illustrated m Figure 4.
In the procedure 50 of Figure 4, m a first step 52 the input values x._, y._ for the first vertex processed are initially assumed to be the desired values xm _, x_,α , ymil, y-,aTne next vertex is then processed in step 54. The new input values xιn, yιr are compared with the stored values for xτ._, xmax, ym r, y„ x . If for either x or y the new input value is less than the stored minimum value then it replaces the minimum value, and if the new input value exceeds the maximum value it replaces "the maximum value. The procedure then moves to step 56 where a check is made to see whether the last vertex defining the polygon has been processed. If not, the procedure returns to step 54 to process the next vertex. If it has, then the co-ordinates of the bounding box 42 have now been determined, and the procedure moves to step 58 where a determination is made to see whether the bounding box overlaps the screen tile which is being processed. This determination also is made by simple comparison of the XY co-ordinates of the corners 44 of the bounding box 42 with the XY co-ordinates of the corners of the tile. The tile size in terms of the number of pixels is preferably chosen to be a power of two, e.g. 32 or 64, which has the consequence that the screen tile test 58 reduces to a number of simple binary comparisons.
The bounding box procedure described will occasionally cause the system to indicate that a surface falls within a given tile when in fact it does not, but this is not a significant problem. As described, the tiles have been divided between the two processors A and B in a checkerboard pattern. Other methods may however be used to divide up the screen. The optimal method will depend on the image content, and may mean that more tiles are processed by one processor than the other. Figure 5 illustrates a form of division in which a large rectangle 2 contains a small rectangle 1 of shorter height and width. The device B is arranged to process the tiles forming the smaller rectangle 1, and the processor A is arranged to process what is left, that is rectangle 2 with rectangle 1 excluded. Rectangle 1 is thus defined for processor A as an inclusion rectangle and rectangle 2 as an exclusion rectangle. That is, all polygons entirely enclosed in rectangle 1 are sent to device A. Polygons outside rectangle 1 but inside rectangle 2 are sent to device B. Polygons which overlap the two areas are sent to both devices.
The division of the screen is such that a substantial number of surfaces have to be sent only to one processor. Thus the tiles are preferentially substantially square with, say, one side not more than twice or three times the length m pixels of the other, so that there is a reasonable HKelihood that a good proportion of the surfaces will fall only in one tile. In this way the processing is reduced by some surfaces having to be sent to one processor only. Thus, a split in which alternate scan-lines were processed by different processors would not provide any advantage because the number of surfaces which fall only on one scanlme is zero, or close to it. Another possible split would be in horizontal bands across the screen, but the bands would have to be sufficiently wide, e.g. the screen as a whole might be split into three or four bands. Then the tile aspect ratio is less than six to one. In any event, there will normally be at least three screen areas, with at least one of the processors being arranged to process at least two discrete separate image areas.
The hardware required for implementation of the embodiment described takes the form of that shown m Figure 6. Figure 6 shows in block schematic form image processing apparatus 60 embodying the invention and comprising a central processing unit (CPU) 62 connected to a main memory 64. A tiling device 66 defines the tiles and communicates with a local memory 68 as well as with the CPU 62. The tiling device effectively has two outputs to which are attached a first texturing or rendering device 70A and a second texturing or rendering device 70B. The outputs of the two rendering devices 68A and 68B are both applied to tile interleaving and image display circuitry 72. For further description of suitable hardware, reference may be made to our above-mentioned United States Patent, the significant point being that there are two rendering devices 70A, 70B connected in parallel between the tiling device 66 and the circuitry 72. In practice the two outputs of the tiling device 66 may be constituted by a single data bus together with appropriate addressing to identify a required one of the two rendering devices 70A,70B. The steps involved in the operation of the apparatus of Figure 6 may be described in principle as follows:
1. Objects are generated by the user (programmer) . They are defined by their vertices, and by texture codes which indicate the type of texturing required for each surface. This will include the color and other surface effects.
2. A bounding box is generated for each surface, following the method described above.
3. The bounding box is compared against the macro tiling pattern being employed, so as to determine which surfaces fall into which tiles, and hence which of the multiple rendering devices (two in this case) require the data. Surface vertices and texture codes are then stored in the appropriate local memory portion associated with each memory device.
4. At this stage a tile display list for each rendering device is generated, so that when the rendering device operates, it only needs to traverse data for each of its tiles, and not the whole scene display list.
5. For each pixel of the tile the surfaces are sorted by depth, with the nearest surfaces listed first.
6. The ray-casting method of the above-mentioned U.S. Patent is employed to find the front-most opaque surface which is seen at that particular pixel.
7. The thus-located front-most visible surfaces are then rendered, to provide the desired surface color, texture, and shading.
8. The resultant is stored in a display buffer. In accordance with this invention, some of the above steps are executed by two processors operating in parallel. The operations are distributed between the two processors in the manner described above. That is to say, step 7 for some tiles (the light tiles as referred to above) is achieved by one processor and the same step for the other (dark) tiles is achieved by the other processor. This reduces the time required for processing by a factor less than but approaching a half. The outputs of the two processors are combined for application to the display buffer and subsequent display on the display screen.
It will therefore be seen that the system operates by supplying data defining a group of surfaces representing an object, e.g. the house or the bicycle in Figure 2. The display is subdivided into a large number of tiles, and a determination is made as to which surfaces fall into which tiles. The data is then applied to the two rendering devices in dependence upon which tile the various surfaces fall into. The data of some surfaces will be sent to one rendering device only and the data of other surfaces will be sent to both rendering devices. More particularly, when the surface falls into one tile only, e.g. the roof or the door in Figure 2, the data need only be sent to one rendering or texturing device. When the surface falls into two tiles being handled by the different rendering devices, the surface data must be sent to both rendering devices .
It may be that the surface falls into two tiles which are both handled by the same rendering device or processor, in which case, again, the data need be sent only to one rendering device. This is unlikely with the tile arrangement of Figure 2, but could more easily happen with other arrangements.
The embodiments of the invention illustrated assume that there are two processors instead of the usual one to provide the rendering (texturing or shading for instance).
However, the invention is not limited to the use of two devices; more than two may be used if desired, in which case the screen is split up into an appropriate larger number of regions each comprising a respective group of tiles .
The embodiments illustrated have the advantage that, assuming normal images are being processed, the processing time is reduced, due to the fact that the processors can operate simultaneously in parallel, each processor needing to process only some of the surfaces and not all the surfaces in the image.
It will be appreciated that many other modifications may be made to the system described and illustrated purely by way of example.

Claims

1. Image processing apparatus, comprising: supply means (62,64) for supplying data defining a group of surfaces representing each object in an image; sub-division means (66,68) coupled to the supply means for sub-dividing the display into a plurality of sub-regions, for determining which surfaces may fall into which sub-regions, and for applying surface data to a plurality of output means in dependence upon which sub- regions the surfaces fall; a corresponding plurality of rendering devices (70A, 70B), each coupled to a respective output of the subdivision means; and combining means (72) coupled to the plurality of rendering devices to receive and combine the respective outputs of the plurality of rendering devices for display; wherein the data of some surfaces is sent to one rendering device only and the data of other surfaces is sent to more than one rendering device, in dependence upon whether the surface falls into one sub-region only or into a plurality of sub-regions being handled by different rendering devices.
2. Apparatus according to claim 1, in which the supply means comprises a CPU (62) and a memory device (64) .
3. Apparatus according to claim 1, in which the subdivision means (66) has associated memory means (68) .
4. Apparatus according to claim 1, in which the subdivision means (66) determines which surfaces may fall into which sub-region by determining a rectangular bounding volume for each surface, and determining which sub-regions the bounding volumes fall into.
5. Apparatus according to claim 1, in which the subdivision means (66) uses a ray-casting method to determine which surfaces may fall into which sub-regions in an image plane .
6. Apparatus according to claim 1, in which the surfaces are sorted by depth before rendering by the rendering means (70A, 70B) .
7. Apparatus according to claim 1, in which the surfaces are polygons.
8. Apparatus according to claim 1, in which there are two and only two rendering devices (70A, 70B) .
9. Apparatus according to claim 8, in which the sub- regions are associated with the rendering devices (70A, 70B) in a checkerboard pattern.
10. Apparatus according to claim 1, in which the sub- regions consist of respective bands across the image.
11. Apparatus according to claim 1, in which the sub- regions are rectangular with an aspect ratio of less than one in six.
12. Apparatus according to claim 1, in which the sub- regions are rectangular with an aspect ratio of less than one in three .
13. Apparatus according to claim 1, in which at least some of the sub-regions are substantially square.
14. Apparatus according to claim 1, in which at least one sub-region is defined by inclusion in a first rectangle and exclusion from a second rectangle contained within the first rectangle.
15. Apparatus according to claim 1, in which at least some of the sub-regions are from 32 to 64 pixels in width and height.
PCT/GB1999/003716 1998-11-06 1999-11-08 Image processing apparatus WO2000028477A1 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
AT99954179T ATE258327T1 (en) 1998-11-06 1999-11-08 IMAGE PROCESSING DEVICE
EP19990954179 EP1125250B1 (en) 1998-11-06 1999-11-08 Image processing apparatus
JP2000581592A JP4480895B2 (en) 1998-11-06 1999-11-08 Image processing device
DE69914355T DE69914355T2 (en) 1998-11-06 1999-11-08 IMAGE PROCESSING UNIT
US09/831,386 US6750867B1 (en) 1998-11-06 1999-11-08 Image processing apparatus

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB9824406.4 1998-11-06
GB9824406A GB2343598B (en) 1998-11-06 1998-11-06 Image processing apparatus

Publications (1)

Publication Number Publication Date
WO2000028477A1 true WO2000028477A1 (en) 2000-05-18

Family

ID=10842011

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB1999/003716 WO2000028477A1 (en) 1998-11-06 1999-11-08 Image processing apparatus

Country Status (7)

Country Link
US (1) US6750867B1 (en)
EP (1) EP1125250B1 (en)
JP (1) JP4480895B2 (en)
AT (1) ATE258327T1 (en)
DE (1) DE69914355T2 (en)
GB (1) GB2343598B (en)
WO (1) WO2000028477A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002052506A1 (en) * 2000-12-22 2002-07-04 Volume Interactions Pte. Ltd. A method of rendering a graphics image

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7061495B1 (en) * 2002-11-18 2006-06-13 Ati Technologies, Inc. Method and apparatus for rasterizer interpolation
US7796133B1 (en) 2002-11-18 2010-09-14 Ati Technologies Ulc Unified shader
US7633506B1 (en) * 2002-11-27 2009-12-15 Ati Technologies Ulc Parallel pipeline graphics system
US8933945B2 (en) * 2002-11-27 2015-01-13 Ati Technologies Ulc Dividing work among multiple graphics pipelines using a super-tiling technique
JP2005100176A (en) * 2003-09-25 2005-04-14 Sony Corp Image processor and its method
US8133115B2 (en) 2003-10-22 2012-03-13 Sony Computer Entertainment America Llc System and method for recording and displaying a graphical path in a video game
US7403661B2 (en) * 2004-02-12 2008-07-22 Xerox Corporation Systems and methods for generating high compression image data files having multiple foreground planes
US20060071933A1 (en) 2004-10-06 2006-04-06 Sony Computer Entertainment Inc. Application binary interface for multi-pass shaders
DE602004011234T2 (en) * 2004-11-19 2009-01-02 Telefonaktiebolaget Lm Ericsson (Publ) Method and device for generating three-dimensional images
US7636126B2 (en) 2005-06-22 2009-12-22 Sony Computer Entertainment Inc. Delay matching in audio/video systems
US7965859B2 (en) 2006-05-04 2011-06-21 Sony Computer Entertainment Inc. Lighting control of a user environment via a display device
US7880746B2 (en) 2006-05-04 2011-02-01 Sony Computer Entertainment Inc. Bandwidth management through lighting control of a user environment via a display device
GB2449398B (en) * 2006-09-29 2009-02-11 Imagination Tech Ltd Improvements in memory management for systems for generating 3-dimensional computer images
US8797320B2 (en) * 2007-08-02 2014-08-05 Disney Enterprises, Inc. Surface shading of computer-generated object using multiple surfaces
GB2452731B (en) * 2007-09-12 2010-01-13 Imagination Tech Ltd Methods and systems for generating 3-dimensional computer images
GB0723536D0 (en) 2007-11-30 2008-01-09 Imagination Tech Ltd Multi-core geometry processing in a tile based rendering system
JP5120174B2 (en) * 2008-09-18 2013-01-16 富士通株式会社 Drawing device
US8152613B2 (en) * 2009-01-14 2012-04-10 Sony Computer Entertainment America Llc Tiled objects in digital environments
GB0922126D0 (en) 2009-12-17 2010-02-03 Advanced Risc Mach Ltd Graphics processing systems
JP5393574B2 (en) 2010-04-08 2014-01-22 キヤノン株式会社 Image processing apparatus, image processing method, and program
US10786736B2 (en) 2010-05-11 2020-09-29 Sony Interactive Entertainment LLC Placement of user information in a game space
JP5595151B2 (en) 2010-07-13 2014-09-24 キヤノン株式会社 Image processing apparatus, compression method in image processing apparatus, and program
JP5643574B2 (en) * 2010-08-26 2014-12-17 キヤノン株式会社 Image processing apparatus and image processing method
JP5664052B2 (en) 2010-09-15 2015-02-04 富士通セミコンダクター株式会社 Graphic processing apparatus and graphic processing program
US9342817B2 (en) 2011-07-07 2016-05-17 Sony Interactive Entertainment LLC Auto-creating groups for sharing photos
GB2494903B (en) 2011-09-22 2017-12-27 Advanced Risc Mach Ltd Graphics processing systems
EP2648107B1 (en) * 2012-04-05 2016-09-28 Siemens Healthcare GmbH Volume rendering on shared memory systems with multiple processors by optimizing cache reuse

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB8828342D0 (en) * 1988-12-05 1989-01-05 Rediffusion Simulation Ltd Image generator
US5008838A (en) * 1989-11-17 1991-04-16 Digital Corporation Method for simultaneous initialization of a double buffer and a frame buffer
US5729672A (en) * 1993-07-30 1998-03-17 Videologic Limited Ray tracing method and apparatus for projecting rays through an object represented by a set of infinite surfaces
US6313841B1 (en) * 1998-04-13 2001-11-06 Terarecon, Inc. Parallel volume rendering system with a resampling module for parallel and perspective projections

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
COPPEN D ET AL: "A DISTRIBUTED FRAME BUFFER FOR RAPID DYNAMIC CHANGES TO 3D SCENES", COMPUTERS AND GRAPHICS,GB,PERGAMON PRESS LTD. OXFORD, vol. 19, no. 2, March 1995 (1995-03-01), pages 247-250, XP000546629, ISSN: 0097-8493 *
KAWAI T ET AL: "EVALUATION OF PARELLE RAY TRACING ALGORITHM USING SCREEN SPACE SUBDIVISION FOR IMAGE GENERATION SYSTEM MAGG", SYSTEMS & COMPUTERS IN JAPAN,US,SCRIPTA TECHNICA JOURNALS. NEW YORK, vol. 25, no. 11, October 1994 (1994-10-01), pages 78-86, XP000499051, ISSN: 0882-1666 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002052506A1 (en) * 2000-12-22 2002-07-04 Volume Interactions Pte. Ltd. A method of rendering a graphics image

Also Published As

Publication number Publication date
DE69914355T2 (en) 2004-11-11
DE69914355D1 (en) 2004-02-26
ATE258327T1 (en) 2004-02-15
GB2343598B (en) 2003-03-19
GB9824406D0 (en) 1998-12-30
EP1125250A1 (en) 2001-08-22
GB2343598A (en) 2000-05-10
US6750867B1 (en) 2004-06-15
JP4480895B2 (en) 2010-06-16
EP1125250B1 (en) 2004-01-21
JP2002529865A (en) 2002-09-10

Similar Documents

Publication Publication Date Title
EP1125250B1 (en) Image processing apparatus
EP0610004B1 (en) Image generating apparatus and method of generating an image
JP4499292B2 (en) Shading of 3D computer generated images
EP1127337B1 (en) Shading 3-dimensional computer generated images
US4825391A (en) Depth buffer priority processing for real time computer image generating systems
US5963210A (en) Graphics processor, system and method for generating screen pixels in raster order utilizing a single interpolator
JP5185418B2 (en) Method and apparatus for rendering translucent 3D graphics
US6509897B1 (en) Method and system for providing implicit edge antialiasing
US20160292910A1 (en) Untransformed display lists in a tile based rendering system
EP2548176B1 (en) Processing of 3d computer graphics data on multiple shading engines
US6906715B1 (en) Shading and texturing 3-dimensional computer generated images
EP1295256B1 (en) Method and system for image rendering with tiles
US20040100471A1 (en) Dividing work among multiple graphics pipelines using a super-tiling technique
US5719598A (en) Graphics processor for parallel processing a plurality of fields of view for multiple video displays
WO2009068893A1 (en) Multi-core rasterisation in a tile based rendering system
EP1127338B1 (en) Depth sorting for use in 3-dimensional computer shading and texturing systems
EP0725365A1 (en) Shading three-dimensional images
GB2465812A (en) Distributed processing for rendering 3D images

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): JP US

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 1999954179

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 09831386

Country of ref document: US

ENP Entry into the national phase

Ref country code: JP

Ref document number: 2000 581592

Kind code of ref document: A

Format of ref document f/p: F

WWP Wipo information: published in national office

Ref document number: 1999954179

Country of ref document: EP

WWG Wipo information: grant in national office

Ref document number: 1999954179

Country of ref document: EP