US20100302348A1 - VTV System - Google Patents
VTV System Download PDFInfo
- Publication number
- US20100302348A1 US20100302348A1 US12/732,671 US73267110A US2010302348A1 US 20100302348 A1 US20100302348 A1 US 20100302348A1 US 73267110 A US73267110 A US 73267110A US 2010302348 A1 US2010302348 A1 US 2010302348A1
- Authority
- US
- United States
- Prior art keywords
- image
- video
- audio
- information
- data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/10—Geometric effects
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/002—Special television systems not provided for by H04N7/007 - H04N7/18
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/16—Analogue secrecy systems; Analogue subscription systems
- H04N7/173—Analogue secrecy systems; Analogue subscription systems with two-way working, e.g. subscriber sending a programme selection signal
- H04N7/17309—Transmission or handling of upstream communications
- H04N7/17318—Direct or substantially direct transmission and handling of requests
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N9/00—Details of colour television systems
- H04N9/79—Processing of colour television signals in connection with recording
- H04N9/80—Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback
- H04N9/82—Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback the individual colour picture signal components being recorded simultaneously only
- H04N9/8205—Transformation of the television signal for recording, e.g. modulation, frequency changing; Inverse transformation for playback the individual colour picture signal components being recorded simultaneously only involving the multiplexing of an additional signal and the colour video signal
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2215/00—Indexing scheme for image rendering
- G06T2215/08—Gnomonic or central projection
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/426—Internal components of the client ; Characteristics thereof
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N9/00—Details of colour television systems
- H04N9/64—Circuits for processing colour signals
- H04N9/641—Multi-purpose receivers, e.g. for auxiliary information
Definitions
- the following patent relates to an overall hardware configuration that produces an enhanced spatial television-like viewing experience. Unlike normal television, with this system the viewer is able to control both the viewing direction and relative position of the viewer with respect to the movie action. In addition to a specific hardware configuration, this patent also relates to a new video format which makes possible this virtual reality like experience. Additionally, several proprietary video compression standards are also defined which facilitate this goal.
- the VTV system is designed to be an intermediary technology between conventional two-dimensional cinematography and true virtual reality.
- the overall VTV system consists of a central graphics processing device (the VTV processor), a range of video input devices (DVD, VCR, satellite, terrestrial television, remote video cameras), infrared remote control, digital network connection and several output device connections.
- the VTV unit In its most basic configuration as shown in FIG. 2 , the VTV unit would output imagery to a conventional television device. In such a configuration a remote control device (possibly infrared) would be used to control the desired viewing direction and position of the viewer within the VTV environment.
- the advantage of this “basic system configuration” is that it is implementable utilizing current audiovisual technology.
- the VTV graphics standard is a forwards compatible graphics standard which can be thought of as a “layer” above that of standard video.
- VTV video represents a subset of the new VTV graphics standard.
- VTV can be introduced without requiring any major changes in the television and/or audiovisual manufacturer's specifications.
- VTV compatible television decoding units will inherently be compatible with conventional television transmissions.
- the VTV system uses a wireless HMD as the display device.
- the wireless HMD can be used as a tracking device in addition to simply displaying images.
- This tracking information in the most basic form could consist of simply controlling the direction of view.
- both direction of view and position of the viewer within the virtual environment can be determined.
- remote cameras on the HMD will provide to the VTV system, real world images which it will interpret into spatial objects, the spatial objects can then be replaced with virtual objects thus providing an “environment aware” augmented reality system.
- the wireless HMD is connected to the VTV processor by virtue of a wireless data link “Cybernet link”.
- this link is capable of transmitting video information from the VTV processor, to the HMD and transmitting tracking information from the HMD to the VTV processor.
- the cybernet link would transmit video information both to and from the HMD in addition to transferring tracking information from the HMD to the VTV processor. Additionally certain components of the VTV processor may be incorporated in the remote HMD thus reducing the data transfer requirement through the cybernet link.
- This wireless data link can be implemented in a number of different ways utilizing either analog or digital video transmission (in either an un-compressed or a digitally compressed format) with a secondary digitally encoded data stream for tracking information.
- a purely digital un-directional or bidirectional data link which carries both of these channels could be incorporated.
- the actual medium for data transfer would probably be microwave or optical. However either transfer medium may be utilized as appropriate.
- the preferred embodiment of this system is one which utilizes on-board panoramic cameras fitted to the HMD in conjunction with image analysis hardware on board the HMD or possibly on the VTV base station to provide real-time tracking information.
- retroflective markers may also the utilized in the “real world environment”.
- switchable light sources placed near to the optical axis of the on-board cameras would be utilized in conjunction with these cameras to form a “differential image analysis” system.
- Such a system features considerably higher recognition accuracy than one utilizing direct video images alone.
- VTV system will transfer graphic information utilizing a “universal graphics standard”.
- a “universal graphics standard” will incorporate an object based graphics description language which achieves a high degree of compression by virtue of a “common graphics knowledge base” between subsystems.
- This patent describes in basic terms three levels of progressive sophistication in the evolution of this graphics language.
- VTV system In its most basic format the VTV system can be thought of as a 360 Degree panoramic display screen which surrounds the viewer.
- This “virtual display screen” consists of a number of “video Pages”. Encoded in the video image is a “Page key code” which instructs the VTV processor to place the graphic information into specific locations within this “virtual display screen”.
- Page key code which instructs the VTV processor to place the graphic information into specific locations within this “virtual display screen”.
- FIG. 1 is a schematic diagram of an overall VTV system.
- FIG. 2 is a schematic diagram of a VTV system according to its basic configuration.
- FIG. 3 is a schematic diagram of a VTV system according to an advanced configuration.
- FIG. 4 is an illustration of a cylindrical virtual display field.
- FIG. 5 is an illustration of a truncated spherical virtual display field.
- FIG. 6 is an illustration of a virtual representation of a 4 track sound system.
- FIG. 7 is an illustration of a virtual representation of an 8 track sound system.
- FIG. 8 is a depiction of a VTV memory map for a system utilizing both augmented reality memory and virtual reality memory.
- FIG. 9 is a VTV graphics engine diagram showing the data write side of the VTV processor.
- FIG. 10 is a VTV graphics engine diagram showing the data read side of the VTV processor.
- FIG. 11 is an example of an analogue video compatible VTV encoded video line shown containing digital data.
- FIG. 12 is an example of an analogue video compatible VTV encoded video line shown containing audio data.
- FIG. 13 is a diagram of an optical tracking system for detecting changes in position and orientation.
- FIG. 14 is a diagram of an optical tracking system for detecting azimuth changes in orientation.
- FIG. 15 is a diagram of an optical tracking system for detecting elevation changes in orientation.
- FIG. 16 is a diagram of an optical tracking system for detecting roll changes in orientation.
- FIG. 17 is a diagram of an optical tracking system for detecting forwards/backwards changes in position.
- FIG. 18 is a diagram of an optical tracking system for detecting left/right changes in position.
- FIG. 19 is a diagram of an optical tracking system for detecting up/down changes in position.
- FIG. 20 is a block diagram of hardware for an optical tracking system according to a simplified version.
- FIG. 21 is a table showing one possible configuration of VTV digital header data.
- the VTV graphics standard consists of a virtual 360 degree panoramic display screen upon which video images can be rendered from an external video source such as VCR, DVD, satellite, camera or terrestrial television receiver such that each video frame contains not only the video information but also information that defines its location within the virtual display screen.
- an external video source such as VCR, DVD, satellite, camera or terrestrial television receiver
- each video frame contains not only the video information but also information that defines its location within the virtual display screen.
- Such a system is remarkably versatile as it provides not only variable resolution images but also frame rate independent imagery. That is to say, the actual update rate within a particular virtual image (entire virtual display screen) may vary within the display screen itself. This is inherently accomplished by virtue of each frame containing its virtual location information. This allows active regions of the virtual image to be updated quickly at the nominal perception cost of not updating sections on the image which have little or no change.
- FIG. 4 Such a system is shown in FIG. 4 .
- the basic VTV system can be enhanced to the format shown in FIG. 5 .
- the cylindrical virtual display screen is interpreted by the VTV processor as a truncated sphere. This effect can be easily generated through the use of a geometry translator or “Warp Engine” within the digital processing hardware component of the VTV processor.
- the VTV standard also supports either 4 track (quadraphonic) or 8 track (octaphonic) spatial audio.
- a virtual representation of the 4 track system is shown in FIG. 6 .
- sound through the left and right speakers of the sound system (or headphones, in the case of an HMD based system) is scaled according to the azimuth the of the view port (direction of view within the VR environment).
- the 8 track audio system sound through the left and right speakers of the sound system (or headphones, in the case of an HMD based system) is scaled according to both the azimuth and elevation of the view port, as shown in the virtual representation of the system, FIG. 7 .
- the VTV standard encodes the multi-track audio channels as part of the video information in a digital/analogue hybrid format as shown in FIG. 12 .
- the audio data is stored in a compressed analogue coded format such that each video scan line contains 512 audio samples.
- each audio scan line contains a three bit digital code that is used to “pre-scale” the audio information. That is to say that the actual audio sample value is X*S where X is the pre-scale number and S is the sample value.
- the dynamic range of the audio system can be extended from about 43 dB to over 60 dB.
- this extending of the dynamic range is done at relatively “low cost” to the audio quality because we are relatively insensitive to audio distortion when the overall signal level is high.
- the start bit is an important component in the system. Its function is to set the maximum level for the scan line (i.e. the 100% or white level) This level in conjunction with the black level (this can be sampled just after the colour burst) forms the 0% and 100% range for each line.
- the system becomes much less sensitive to variations in black level due to AC-coupling of video sub modules and/or recording and play back of the video media in addition to improving the accuracy of the decoding of the digital component of the scan line.
- an audio control bit (AS) is included in each field (at line 21). This control bit sets the audio buffer sequence to 0 when it is set. This provides a way to synchronize the 4 or 8 track audio information so that the correct track is always being updated from the current data regardless of the sequence of the video Page updates.
- this spatial audio system/standard could also be used in audio only mode by the combination of a suitable compact tracking device and a set of cordless headphones to realize a spatial-audio system for advanced hi-fi equipment.
- the first two standards relate to the definitions of spatial graphics objects where as the third graphics standard relates to a complete VR environment definition language which utilizes the first standards as a subset and incorporates additional environment definitions and control algorithms.
- the VTV graphic standard (in its basic form) can be thought of as a control layer above that of the conventional video standard (NTSC, PAL etc.). As such, it is not limited purely to conventional analog video transmission standards. Using basically identical techniques, the VTV standard can 30 operate with the HDTV standard as well as many of the computer graphic and industry audiovisual standards.
- the VTV graphics processor is the heart of the VTV system.
- this module is responsible for the real-time generation of the graphics which is output to the display device (either conventional TV/HDTV or HMD).
- a video media provision device such as VCR, DVD, satellite, camera or terrestrial television receiver.
- More sophisticated versions of this module may real-time render graphics from a “universal graphics language” passed to it via the Internet or other network connection.
- the VTV processor can also perform image analysis. Early versions of this system will use this image analysis function for the purpose of determining tracking coordinates of the HMD.
- More sophisticated versions of this module will in addition to providing this tracking information, also interpret the real world images from the HMD as physical three-dimensional objects. These three-dimensional objects will be defined in the universal graphics language which can then be recorded or communicated to similar remote display devices via the Internet or other network or alternatively be replaced by other virtual objects of similar physical size thus creating a true augmented reality experience.
- the VTV hardware itself consists of a group of sub modules as follows:
- the exact configuration of these modules is dependent upon other external hardware. For example, if digital video sources are used then the video digitizing module becomes relatively trivial and may consist of no more than a group of latch's or FIFO buffer. However, if composite or Y/C video inputs are utilized then additional hardware is required to convert these signals into digital format. Additionally, if a digital HDTV signal is used as the video input source then an HDTV, decoder is required as the front end of the system (as HDTV signals cannot be processed in compressed format).
- VTV graphics engine In the case of a field based video system such as analogue TV, the basic operation of the VTV graphics engine is as follows:
- the memory write side of the VTV processor shows two separate video input stages (ADC's).
- ADC video input stages
- ADC-2 would generally be used for virtual reality video feeds from pre-rendered video material
- both video input stages have full access to both augmented reality and virtual reality memory (Le. they use a memory pool).
- This hardware configuration allows for more versatility in the design and allows several unusual display modes (which will be covered in more detail in later sections).
- the video output stages (DAC-0 and DAC-1) have total access to both virtual and augmented reality memory.
- the memory pool style of design means that the system can function with either one or two input and/or output stages (although with reduced capabilities) and as such the presence of either one or two input or output stages in a particular implementation should not limit the generality of the specification.
- the digital processing hardware would take the form of one or more field programmable logic arrays or custom ASIC.
- the advantage of using field programmable logic arrays is that the hardware can be updated at anytime.
- the main disadvantage of this technology is that it is not quite as fast as an ASIC.
- high speed conventional digital processors may' also be utilized to perform this image analysis and/or graphics generation task.
- VTV base station hardware would act only as a link between the HMD and the Internet or other network with all graphics image generation, image analysis and spatial object recognition occurring within the HMD itself.
- the low order bits of the viewport address generator are run through a look up table address translator for the X and Y image axies which impose barrel distortion on the generated images. This provides the correct image distortion for the current field of view for the viewport.
- This hardware is not shown explicitly in FIG. 10 because it will probably be implemented within an FPGA or ASIC logic and thus comprises a part of the viewport address generator functional block. Likewise roll of the final image will likely be implemented in a similar fashion.
- viewport-0 is affected by the translation engine (Warp Engine)
- Viewport-1 is read out undistorted. This is necessary when using the superimpose and overlay augmented reality modes because VR-video material being played from storage has already been “flattened” (Le. pincushion distorted) prior to being stored whereas the live video from the panoramic cameras on the HMD require distortion correction prior to being displayed by the system in Augmented Reality mode. After this preliminary distortion, images recorded by the panoramic cameras in the HMD should be geometrically accurate and suitable for storage as new VR material in their own right (Le. they can become VR material).
- One of the primary roles of the Warp Engine is then to provide geometry correction and trimming of the panoramic camera's on the HMD. This includes the complex task of providing a seamless transition between camera views.
- a VTV image frame consists of either a cylinder or a truncated sphere. This space subtends only a finite vertical angle to the viewer (+/ ⁇ 45 degrees in the prototype). This is an intentional limitation designed to make the most of the available data bandwidth of the video storage and transmission media and thus maintain compatibility with existing video systems. However, as a result of this compromise, there can exist a situation in which the view port exceeds the scope of the image data. There are several different ways in which this exception can be handled. Firstly, the simplest way to handle this exception is to simply make out of bounds video data black. This will give the appearance of being in a room with a black ceiling and floor.
- VRM Virtual reality memory
- FIG. 8 The basic memory map for the system utilizing both augmented reality memory and virtual reality memory (in addition to translation memory) is shown in FIG. 8 .
- the translation memory area must have sufficient range to cover a full 360 degree*180 degrees and ideally have the same angular resolution as that of the augmented reality memory bank (which covers 360 degree*90 degree).
- the backgrounds can be either static or dynamic and can be updated in basically the same way as foreground (augmented reality memory) by utilizing a Paged format.
- the VTV system has two basic modes of operation. Within these two modes there also exist several sub modes.
- the two basic modes are as follows:
- augmented reality mode 1 selective components of “real world imagery” are overlaid upon a virtual reality background.
- this process involves first removing all of the background components from the “real world” imagery. This can be easily done by using differential imaging techniques. i.e. by comparing current “real world” imagery against a stored copy taken previously and detecting differences between the two. After the two images have been correctly aligned, the regions that differ are new or foreground objects and those that remain the same are static background objects.
- This is the simplest of the augmented reality modes and is generally not sufficiently interesting as most of the background will be removed in the process.
- the augmented reality memory will generally be updated in sequential Page order (Le. updated in whole system frames) rather than random Page updates. This is because constant variations in the position and orientation of the panoramic camera system during filming will probably cause mis-matches in the image Pages if they are handled separately.
- Augmented reality mode 2 differs from mode 1 in that, in addition to automatically extracting foreground and moving objects and placing these in an artificial background environment, the system also utilizes the Warp Engine to “push” additional “real world” objects into the background. In addition to simply adding these “real world” objects into the virtual environment the Warp Engine is also capable of scaling and translating these objects so that they match into the virtual environment more effectively. These objects can be handled as opaque overlays or transparencies.
- Augmented reality mode 3 differs from the mode 2 in that, in this case, the Warp Engine is used to “pull” the background objects into the foreground to replace “real world” objects.
- these objects can be translated and scaled and can be handled as r either opaque overlays or transparencies. This gives the user to the ability to “match” the physical size and position of a “real world” object with a virtual object. By doing so, the user is able to interact and navigate within the augmented reality environment as they would in the “real world” environment.
- This mode is probably the most likely mode to be utilized for entertainment and gaming purposes as it would allow a Hollywood production to be brought into the users own living room.
- Virtual reality mode is a functionally simpler mode than the previous augmented reality modes.
- “pre-filmed” or computer-generated graphics are loaded into augmented reality memory on a random Page by Page basis. This is possible because the virtual camera planes of reference are fixed.
- virtual reality memory is loaded with a fixed or dynamic background at a lower resolution. The use of both foreground and background image planes makes possible more sophisticated graphics techniques such as motion parallax.
- the versatility of virtual reality memory can be improved by utilizing an enhanced form of “blue-screening”.
- background memory a sample of the “chroma-key” color is provided at the beginning of each scan line in the background field (area outside of the active image area).
- This provides a versatile system in which any color is allowable in the image.
- problems and inaccuracies associated with the “cutting and pasting” of this object by the Warp Engine are greatly reduced.
- the use of “transparent” chroma-keyed regions within foreground virtual reality images allows easy generation of complex sharp edged and/or dynamic foreground regions with no additional information overhead.
- the Camera System The Camera System:
- the images are first processed by a VTV encoder module.
- This device provides video distortion correction and also inserts video Page information, orientation tracking data and spatial audio into the video stream. This can be done without altering the video standard, thereby maintaining compatibility with existing recording and playback devices.
- this module could be incorporated within the VTV processor, having this module as a separate entity is advantageous for use in remote camera applications where the video information must ultimately be either stored or transmitted through some form of wireless network
- tracking information must comprise part of the resultant video stream in order that an “absolute” azimuth and elevation coordinate system be maintained.
- this data is not required as the camera orientation is a theoretical construct known to the computer system at render time.
- the basic tracking system of the VTV HMD utilizes on-board panoramic video cameras to capture the required 360 degree visual information of the surrounding real world environment. This information is then analyzed by the VTV processor (whether it exists within the HMD or as a base station unit) utilizing computationally intensive yet relatively algorithmically simple techniques such as auto correlation. Examples of a possible algorithm are shown in FIGS. 13-19 .
- the simple tracking system outlined in FIGS. 13-19 detects only changes in position and orientation.
- Such absolute reference points would probably be located at the extremities of the environmental region (i.e. confines of the user space) however they could be placed anywhere within the real environment, provided the VTV hardware is aware of the real world coordinates of these markers.
- the combination of these absolute reference points and differential movement makes possible the generation of absolute real world coordinate information at full video frame rates.
- active optical beacons could be employed as an alternative to the placement of retroflective targets at known spatial coordinates.
- FIG. 20 shows a simplistic representation of the tracking hardware in which the auto correlators simply detect the presence or absence of a particular movement.
- a practical system would probably incorporate a number of auto correlators for each class of movement (for example there may be 16 or more separate auto correlators to detect horizontal movement). Such as system would then be able to detect different levels or amounts of movement in all of the directions.
- This tracking system is possible utilizing a similar image analysis technique to track a pattern on the ceiling to achieve spatial positioning information and simple “tilt sensors” to detect angular orientation of the HMD/Pan-Cam system.
- the advantage of this system is that it is considerably simpler and less expensive than the full six axis optical tracker previously described.
- the fact that the ceiling is at a constant distance and known orientation from the HMD greatly implifies the optical system, the quality of the required imaging device and the complexity of the subsequent image analysis.
- this spatial positioning information is inherently in the form of relative movement only.
- absolute reference points allows such a system to re-calibrate its absolute references and thus achieve an overall absolute coordinate system.
- This absolute reference point calibration can be achieved relatively easily utilizing several different techniques.
- the first, and perhaps simplest technique is to use color sensitive retroflective spots as previously described.
- active optical beacons such as LED beacons
- a further alternative absolute reference calibration system which could be used is based on a bi-directional infrared beacon. Such as system would communicate a unique code between the HMD and the beacon, such that calibration would occur only once each time the HMD passed under any of these “known spatial reference points”. This is required to avoid “dead tracking regions” within the vicinity of the calibration beacons due to multiple origin resets.
- the basic auto correlation technique used to locate movement within the image can be simplified into reasonably straightforward image processing steps.
- rotation detection can be simplified into a group of lateral shifts (up, down, left, right) symmetrical around the center of the image (optical axis of the camera).
- sample points for lateral movement do not necessarily have to be very large. They do however have to contain unique picture information. For example a blank featureless wall will yield no useful tracking information
- an image with high contrast regions such as edges of objects or bright highlight points is relatively easily tracked. Taking this thinking one step further, it is possible to first reduce the entire image into highlight points/edges. The image can then be processed as a series of horizontal and vertical strips such that auto correlation regions are bounded between highlight points/edges.
- small highlight regions can very easily be tracked by comparing previous image frames against current images and determining “closest possible fit” between the images (i.e. minimum movement of highlight points).
- Such techniques are relatively easy and well within the capabilities of most moderate speed micro-processors, provided some of-the image pre-processing overhead is handled by hardware.
Abstract
The following patent relates to an overall hardware configuration that produces an enhanced spatial television-like viewing experience. Unlike normal television, with this system the viewer is able to control both the viewing direction and relative position of the viewer with respect to the movie action. In addition to a specific hardware configuration, this patent also relates to a new video format which makes possible this virtual reality like experience.
Description
- This application is a continuation of application Ser. No. 09/891,733, filed Jun. 25, 2001.
- The following patent relates to an overall hardware configuration that produces an enhanced spatial television-like viewing experience. Unlike normal television, with this system the viewer is able to control both the viewing direction and relative position of the viewer with respect to the movie action. In addition to a specific hardware configuration, this patent also relates to a new video format which makes possible this virtual reality like experience. Additionally, several proprietary video compression standards are also defined which facilitate this goal. The VTV system is designed to be an intermediary technology between conventional two-dimensional cinematography and true virtual reality. There are several stages in the evolution of the VTV system ranging from, in its most basic form, a panoramic display system to, in its most sophisticated form featuring full object based virtual reality utilizing animated texture maps and featuring live actors and/or computer-generated characters in a full “environment aware” augmented reality system.
- As can be seen in
FIG. 1 the overall VTV system consists of a central graphics processing device (the VTV processor), a range of video input devices (DVD, VCR, satellite, terrestrial television, remote video cameras), infrared remote control, digital network connection and several output device connections. In its most basic configuration as shown inFIG. 2 , the VTV unit would output imagery to a conventional television device. In such a configuration a remote control device (possibly infrared) would be used to control the desired viewing direction and position of the viewer within the VTV environment. The advantage of this “basic system configuration” is that it is implementable utilizing current audiovisual technology. The VTV graphics standard is a forwards compatible graphics standard which can be thought of as a “layer” above that of standard video. That is to say conventional video represents a subset of the new VTV graphics standard. As a result of this standard's compatibility, VTV can be introduced without requiring any major changes in the television and/or audiovisual manufacturer's specifications. Additionally, VTV compatible television decoding units will inherently be compatible with conventional television transmissions. - In a more sophisticated configuration, as shown in
FIG. 3 , the VTV system uses a wireless HMD as the display device. In such a configuration, the wireless HMD can be used as a tracking device in addition to simply displaying images. This tracking information in the most basic form could consist of simply controlling the direction of view. In a more sophisticated system, both direction of view and position of the viewer within the virtual environment can be determined. Ultimately, in the most sophisticated implementation, remote cameras on the HMD will provide to the VTV system, real world images which it will interpret into spatial objects, the spatial objects can then be replaced with virtual objects thus providing an “environment aware” augmented reality system. - The wireless HMD is connected to the VTV processor by virtue of a wireless data link “Cybernet link”. In its most basic form this link is capable of transmitting video information from the VTV processor, to the HMD and transmitting tracking information from the HMD to the VTV processor. In its most sophisticated form the cybernet link would transmit video information both to and from the HMD in addition to transferring tracking information from the HMD to the VTV processor. Additionally certain components of the VTV processor may be incorporated in the remote HMD thus reducing the data transfer requirement through the cybernet link. This wireless data link can be implemented in a number of different ways utilizing either analog or digital video transmission (in either an un-compressed or a digitally compressed format) with a secondary digitally encoded data stream for tracking information. Alternately, a purely digital un-directional or bidirectional data link which carries both of these channels could be incorporated. The actual medium for data transfer would probably be microwave or optical. However either transfer medium may be utilized as appropriate. The preferred embodiment of this system is one which utilizes on-board panoramic cameras fitted to the HMD in conjunction with image analysis hardware on board the HMD or possibly on the VTV base station to provide real-time tracking information. To further improve system accuracy, retroflective markers may also the utilized in the “real world environment”. In such a configuration, switchable light sources placed near to the optical axis of the on-board cameras would be utilized in conjunction with these cameras to form a “differential image analysis” system. Such a system features considerably higher recognition accuracy than one utilizing direct video images alone.
- Ultimately, the VTV system will transfer graphic information utilizing a “universal graphics standard”. Such a standard will incorporate an object based graphics description language which achieves a high degree of compression by virtue of a “common graphics knowledge base” between subsystems. This patent describes in basic terms three levels of progressive sophistication in the evolution of this graphics language.
- These three compression standards will for the purpose of this patent be described as:
- a) c-com
b) s-com
c) v-com - In its most basic format the VTV system can be thought of as a 360 Degree panoramic display screen which surrounds the viewer.
- This “virtual display screen” consists of a number of “video Pages”. Encoded in the video image is a “Page key code” which instructs the VTV processor to place the graphic information into specific locations within this “virtual display screen”. As a result of this ability to place images dynamically it is possible to achieve the effective equivalent to both high-resolution and high frame rates without significant sacrifice to either. For example, only sections of the image which are rapidly changing require rapid image updates whereas the majority of the image is generally static. Unlike conventional cinematography in which key elements (which are generally moving) are located in the primary scene, the majority of a panoramic image is generally static.
-
FIG. 1 is a schematic diagram of an overall VTV system. -
FIG. 2 is a schematic diagram of a VTV system according to its basic configuration. -
FIG. 3 is a schematic diagram of a VTV system according to an advanced configuration. -
FIG. 4 is an illustration of a cylindrical virtual display field. -
FIG. 5 is an illustration of a truncated spherical virtual display field. -
FIG. 6 is an illustration of a virtual representation of a 4 track sound system. -
FIG. 7 is an illustration of a virtual representation of an 8 track sound system. -
FIG. 8 is a depiction of a VTV memory map for a system utilizing both augmented reality memory and virtual reality memory. -
FIG. 9 is a VTV graphics engine diagram showing the data write side of the VTV processor. -
FIG. 10 is a VTV graphics engine diagram showing the data read side of the VTV processor. -
FIG. 11 is an example of an analogue video compatible VTV encoded video line shown containing digital data. -
FIG. 12 is an example of an analogue video compatible VTV encoded video line shown containing audio data. -
FIG. 13 is a diagram of an optical tracking system for detecting changes in position and orientation. -
FIG. 14 is a diagram of an optical tracking system for detecting azimuth changes in orientation. -
FIG. 15 is a diagram of an optical tracking system for detecting elevation changes in orientation. -
FIG. 16 is a diagram of an optical tracking system for detecting roll changes in orientation. -
FIG. 17 is a diagram of an optical tracking system for detecting forwards/backwards changes in position. -
FIG. 18 is a diagram of an optical tracking system for detecting left/right changes in position. -
FIG. 19 is a diagram of an optical tracking system for detecting up/down changes in position. -
FIG. 20 is a block diagram of hardware for an optical tracking system according to a simplified version. -
FIG. 21 is a table showing one possible configuration of VTV digital header data. - In its most basic form the VTV graphics standard consists of a virtual 360 degree panoramic display screen upon which video images can be rendered from an external video source such as VCR, DVD, satellite, camera or terrestrial television receiver such that each video frame contains not only the video information but also information that defines its location within the virtual display screen. Such a system is remarkably versatile as it provides not only variable resolution images but also frame rate independent imagery. That is to say, the actual update rate within a particular virtual image (entire virtual display screen) may vary within the display screen itself. This is inherently accomplished by virtue of each frame containing its virtual location information. This allows active regions of the virtual image to be updated quickly at the nominal perception cost of not updating sections on the image which have little or no change. Such a system is shown in
FIG. 4 . - To further improve the realism of the imagery, the basic VTV system can be enhanced to the format shown in
FIG. 5 . In this configuration the cylindrical virtual display screen is interpreted by the VTV processor as a truncated sphere. This effect can be easily generated through the use of a geometry translator or “Warp Engine” within the digital processing hardware component of the VTV processor. - Due to constant variation of absolute planes of reference, mobile camera applications (either HMO based or Pan-Cam based) require additional tracking information for azimuth and elevation of the camera system to be included with the visual information in order that the images can be correctly decoded by the VTV graphics engine. In such a system, absolute camera azimuth and elevation becomes part of the image frame information. There are several possible techniques for the interpretation of this absolute reference data. Firstly, the coordinate data could be used to define the origins of the image planes within the memory during the memory writing process. Unfortunately this approach will tend to result in remnant image fragments being left in memory from previous frames with different alignment values. A more practical solution is simply to write the video information into memory with an assumed reference point of 0 azimuth, 0 elevation. This video information is then correctly displayed by correcting the display viewport for the camera angular offsets. One possible data format for such a system is shown in
FIG. 11 andFIG. 21 . - In addition to 360 Degree panoramic video, the VTV standard also supports either 4 track (quadraphonic) or 8 track (octaphonic) spatial audio. A virtual representation of the 4 track system is shown in
FIG. 6 . In the case of the simple 4 track audio system sound through the left and right speakers of the sound system (or headphones, in the case of an HMD based system) is scaled according to the azimuth the of the view port (direction of view within the VR environment). In the case of the 8 track audio system sound through the left and right speakers of the sound system (or headphones, in the case of an HMD based system) is scaled according to both the azimuth and elevation of the view port, as shown in the virtual representation of the system,FIG. 7 . - In its most basic form, the VTV standard encodes the multi-track audio channels as part of the video information in a digital/analogue hybrid format as shown in
FIG. 12 . As a result, video compatibility with existing equipment can be achieved. As can be seen in this illustration, the audio data is stored in a compressed analogue coded format such that each video scan line contains 512 audio samples. In addition to this analogue coded audio information, each audio scan line contains a three bit digital code that is used to “pre-scale” the audio information. That is to say that the actual audio sample value is X*S where X is the pre-scale number and S is the sample value. Using this dual-coding scheme the dynamic range of the audio system can be extended from about 43 dB to over 60 dB. Secondly, this extending of the dynamic range is done at relatively “low cost” to the audio quality because we are relatively insensitive to audio distortion when the overall signal level is high. The start bit is an important component in the system. Its function is to set the maximum level for the scan line (i.e. the 100% or white level) This level in conjunction with the black level (this can be sampled just after the colour burst) forms the 0% and 100% range for each line. By dynamically adjusting the 0% and 100% marks for each line on a line by line basis, the system becomes much less sensitive to variations in black level due to AC-coupling of video sub modules and/or recording and play back of the video media in addition to improving the accuracy of the decoding of the digital component of the scan line. - In addition to this pre-scaling of the digital information, an audio control bit (AS) is included in each field (at line 21). This control bit sets the audio buffer sequence to 0 when it is set. This provides a way to synchronize the 4 or 8 track audio information so that the correct track is always being updated from the current data regardless of the sequence of the video Page updates.
- In more sophisticated multimedia data formats such as computer AV files and digital television transmissions, these additional audio tracks could be stored in other ways which may be more efficient or otherwise advantageous.
- should be noted that, in addition to it's use as an audiovisual device, this spatial audio system/standard could also be used in audio only mode by the combination of a suitable compact tracking device and a set of cordless headphones to realize a spatial-audio system for advanced hi-fi equipment.
- In addition to this simplistic graphics standard, There a are number of enhancements which can be used alone or in conjunction with the basic VTV graphics standard. These three graphics standards will be described in detail in subsequent patents, however for the purpose of this patent, they are known as:
-
- a) c-corn
- b) s-corn
- c) v-corn
- The first two standards relate to the definitions of spatial graphics objects where as the third graphics standard relates to a complete VR environment definition language which utilizes the first standards as a subset and incorporates additional environment definitions and control algorithms.
- The VTV graphic standard (in its basic form) can be thought of as a control layer above that of the conventional video standard (NTSC, PAL etc.). As such, it is not limited purely to conventional analog video transmission standards. Using basically identical techniques, the VTV standard can 30 operate with the HDTV standard as well as many of the computer graphic and industry audiovisual standards.
- The VTV graphics processor is the heart of the VTV system. In its most basic form this module is responsible for the real-time generation of the graphics which is output to the display device (either conventional TV/HDTV or HMD). In addition to digitizing raw graphics information input from a video media provision device such as VCR, DVD, satellite, camera or terrestrial television receiver. More sophisticated versions of this module may real-time render graphics from a “universal graphics language” passed to it via the Internet or other network connection. In addition to this digitizing and graphics rendering task, the VTV processor can also perform image analysis. Early versions of this system will use this image analysis function for the purpose of determining tracking coordinates of the HMD. More sophisticated versions of this module will in addition to providing this tracking information, also interpret the real world images from the HMD as physical three-dimensional objects. These three-dimensional objects will be defined in the universal graphics language which can then be recorded or communicated to similar remote display devices via the Internet or other network or alternatively be replaced by other virtual objects of similar physical size thus creating a true augmented reality experience.
- The VTV hardware itself consists of a group of sub modules as follows:
-
- a) video digitizing module
- b) Augmented Reality Memory (ARM)
- c) Virtual Reality Memory (VRM)
- d) Translation Memory (TM)
- e) digital processing hardware
- f) video generation module
- The exact configuration of these modules is dependent upon other external hardware. For example, if digital video sources are used then the video digitizing module becomes relatively trivial and may consist of no more than a group of latch's or FIFO buffer. However, if composite or Y/C video inputs are utilized then additional hardware is required to convert these signals into digital format. Additionally, if a digital HDTV signal is used as the video input source then an HDTV, decoder is required as the front end of the system (as HDTV signals cannot be processed in compressed format).
- In the case of a field based video system such as analogue TV, the basic operation of the VTV graphics engine is as follows:
-
- a) Video information is digitized and placed in the augmented reality memory on a field by field basis assuming an absolute Page reference of 0 degree azimuth, 0 degree elevation with the origin of each Page being determined by the state of the Page number bits (P3-PO).
- b) Auxiliary video information for background and/or floor/ceiling maps is loaded into the virtual reality memory on a field by field basis dependent upon the state of the “field type” bits (F3-FO) and Page number bits (P3PO).
- c) The digital processing hardware interprets this information held in augmented reality and virtual reality memory and utilizing a combination of a geometry processing engine (Warp Engine), digital subtractive image processing and a new versatile form of “blue-screening”, translates and selectively combines this data into an image substantially similar to that which would be seen by the viewer if they were standing in the same location as that of the panoramic camera when the video material was filmed. The main differences between this image and that available utilizing conventional video techniques being that it is not only 360 degree panoramic but also has the ability to have elements of both virtual reality and “real world” imagery melded together to form a complex immersive augmented reality experience.
- d) The exact way in which the virtual reality and “real world imagery” is combined depends upon the mode that the VTV processor is operating in and is discussed in more detail in later sections of this specification. The particular VTV processor mode is determined by additional control information present in the source media and thus the processing and display modes can change dynamically while displaying a source of VTV media.
- e) The video generation module then generates a single or pair of video images for display on a conventional television or HMD display device. Although the VTV image field will be updated at less than full frame rates (unless multi-spin DVD devices are used as the image media) graphics rendering will still occur at full video frame rates, as will the updates of the spatial audio. This is possible because each “Image Sphere” contains all of the required information for both video and audio for any viewer orientation (azimuth and elevation).
- As can be seen in
FIG. 9 . The memory write side of the VTV processor shows two separate video input stages (ADC's). It should be noted that although ADC-0 would generally be used for live panoramic video feeds and ADC-2 would generally be used for virtual reality video feeds from pre-rendered video material, both video input stages have full access to both augmented reality and virtual reality memory (Le. they use a memory pool). This hardware configuration allows for more versatility in the design and allows several unusual display modes (which will be covered in more detail in later sections). Similarly, the video output stages (DAC-0 and DAC-1) have total access to both virtual and augmented reality memory. - Although having two input and two output stages improves the versatility of the design, the memory pool style of design means that the system can function with either one or two input and/or output stages (although with reduced capabilities) and as such the presence of either one or two input or output stages in a particular implementation should not limit the generality of the specification.
- For ease of design, high-speed static RAM was utilized as the video memory in the prototype device. However, other memory technologies may be utilized without limiting the generality of the design specification.
- In the preferred embodiment, the digital processing hardware would take the form of one or more field programmable logic arrays or custom ASIC. The advantage of using field programmable logic arrays is that the hardware can be updated at anytime. The main disadvantage of this technology is that it is not quite as fast as an ASIC. Alternatively, high speed conventional digital processors may' also be utilized to perform this image analysis and/or graphics generation task.
- As previously described, certain sections of this hardware may be incorporated in the HMD, possibly even to the, point at which the entire VTV hardware exists within the portable HMD device. In such a case the VTV base station hardware would act only as a link between the HMD and the Internet or other network with all graphics image generation, image analysis and spatial object recognition occurring within the HMD itself.
- Note: The low order bits of the viewport address generator are run through a look up table address translator for the X and Y image axies which impose barrel distortion on the generated images. This provides the correct image distortion for the current field of view for the viewport. This hardware is not shown explicitly in
FIG. 10 because it will probably be implemented within an FPGA or ASIC logic and thus comprises a part of the viewport address generator functional block. Likewise roll of the final image will likely be implemented in a similar fashion. - It should be noted that only viewport-0 is affected by the translation engine (Warp Engine), Viewport-1 is read out undistorted. This is necessary when using the superimpose and overlay augmented reality modes because VR-video material being played from storage has already been “flattened” (Le. pincushion distorted) prior to being stored whereas the live video from the panoramic cameras on the HMD require distortion correction prior to being displayed by the system in Augmented Reality mode. After this preliminary distortion, images recorded by the panoramic cameras in the HMD should be geometrically accurate and suitable for storage as new VR material in their own right (Le. they can become VR material). One of the primary roles of the Warp Engine is then to provide geometry correction and trimming of the panoramic camera's on the HMD. This includes the complex task of providing a seamless transition between camera views.
- As can be seen in
FIGS. 4 , 5 a VTV image frame consists of either a cylinder or a truncated sphere. This space subtends only a finite vertical angle to the viewer (+/−45 degrees in the prototype). This is an intentional limitation designed to make the most of the available data bandwidth of the video storage and transmission media and thus maintain compatibility with existing video systems. However, as a result of this compromise, there can exist a situation in which the view port exceeds the scope of the image data. There are several different ways in which this exception can be handled. Firstly, the simplest way to handle this exception is to simply make out of bounds video data black. This will give the appearance of being in a room with a black ceiling and floor. However, an alternative and preferable configuration is to use a secondary video memory store to store a full 360 degree*180 degree background image map at reduced resolution. This memory area is known as Virtual reality memory (VRM). The basic memory map for the system utilizing both augmented reality memory and virtual reality memory (in addition to translation memory) is shown inFIG. 8 . As can be seen in this illustration, the translation memory area must have sufficient range to cover a full 360 degree*180 degrees and ideally have the same angular resolution as that of the augmented reality memory bank (which covers 360 degree*90 degree). With such a configuration, it is possible to provide both floor and ceiling exception handling and variable transparency imagery such as looking through windows in the foreground and showing the background behind them. The backgrounds can be either static or dynamic and can be updated in basically the same way as foreground (augmented reality memory) by utilizing a Paged format. - The VTV system has two basic modes of operation. Within these two modes there also exist several sub modes. The two basic modes are as follows:
-
- a) Augmented reality mode
- b) Virtual reality mode
- In
augmented reality mode 1, selective components of “real world imagery” are overlaid upon a virtual reality background. In general, this process involves first removing all of the background components from the “real world” imagery. This can be easily done by using differential imaging techniques. i.e. by comparing current “real world” imagery against a stored copy taken previously and detecting differences between the two. After the two images have been correctly aligned, the regions that differ are new or foreground objects and those that remain the same are static background objects. This is the simplest of the augmented reality modes and is generally not sufficiently interesting as most of the background will be removed in the process. It should be noted that, when operated in mobile Pan-Cam (telepresense) or augmented reality mode the augmented reality memory will generally be updated in sequential Page order (Le. updated in whole system frames) rather than random Page updates. This is because constant variations in the position and orientation of the panoramic camera system during filming will probably cause mis-matches in the image Pages if they are handled separately. -
Augmented reality mode 2 differs frommode 1 in that, in addition to automatically extracting foreground and moving objects and placing these in an artificial background environment, the system also utilizes the Warp Engine to “push” additional “real world” objects into the background. In addition to simply adding these “real world” objects into the virtual environment the Warp Engine is also capable of scaling and translating these objects so that they match into the virtual environment more effectively. These objects can be handled as opaque overlays or transparencies. -
Augmented reality mode 3 differs from themode 2 in that, in this case, the Warp Engine is used to “pull” the background objects into the foreground to replace “real world” objects. As in mode 2: these objects can be translated and scaled and can be handled as r either opaque overlays or transparencies. This gives the user to the ability to “match” the physical size and position of a “real world” object with a virtual object. By doing so, the user is able to interact and navigate within the augmented reality environment as they would in the “real world” environment. This mode is probably the most likely mode to be utilized for entertainment and gaming purposes as it would allow a Hollywood production to be brought into the users own living room. - 3.16) Clearly the key to making
augmented reality modes - Virtual reality mode is a functionally simpler mode than the previous augmented reality modes. In this mode “pre-filmed” or computer-generated graphics are loaded into augmented reality memory on a random Page by Page basis. This is possible because the virtual camera planes of reference are fixed. As in the previous examples, virtual reality memory is loaded with a fixed or dynamic background at a lower resolution. The use of both foreground and background image planes makes possible more sophisticated graphics techniques such as motion parallax.
- The versatility of virtual reality memory (background memory) can be improved by utilizing an enhanced form of “blue-screening”. In such a system, a sample of the “chroma-key” color is provided at the beginning of each scan line in the background field (area outside of the active image area). This provides a versatile system in which any color is allowable in the image. Thus, by surrounding individual objects with the “transparent” chroma-key color, problems and inaccuracies associated with the “cutting and pasting” of this object by the Warp Engine are greatly reduced. Additionally, the use of “transparent” chroma-keyed regions within foreground virtual reality images allows easy generation of complex sharp edged and/or dynamic foreground regions with no additional information overhead.
- As can be seen in the definition of the graphic standard, additional Page placement and tracking information is required for the correct placement and subsequent display of the imagery captured by mobile Pan-Cam or HMD based video systems. Additionally, if Spatial audio is to be recorded in real-time then this information must also be encoded as part of the video stream. In the case of computer-generated imagery this additional video information can easily be inserted at render-stage. However, in the case of live video capture, this additional tracking and audio information must be inserted into the video stream prior to recording. This can effectively be achieved through a graphics processing module herein after referred to as the VTV encoder module.
- In the case of imagery collected by mobile panoramic camera systems, the images are first processed by a VTV encoder module. This device provides video distortion correction and also inserts video Page information, orientation tracking data and spatial audio into the video stream. This can be done without altering the video standard, thereby maintaining compatibility with existing recording and playback devices. Although this module could be incorporated within the VTV processor, having this module as a separate entity is advantageous for use in remote camera applications where the video information must ultimately be either stored or transmitted through some form of wireless network
- For any mobile panoramic camera system such as a “Pan-Cam” or HMD based camera system, tracking information must comprise part of the resultant video stream in order that an “absolute” azimuth and elevation coordinate system be maintained. In the case of computer-generated imagery this data is not required as the camera orientation is a theoretical construct known to the computer system at render time.
- The basic tracking system of the VTV HMD utilizes on-board panoramic video cameras to capture the required 360 degree visual information of the surrounding real world environment. This information is then analyzed by the VTV processor (whether it exists within the HMD or as a base station unit) utilizing computationally intensive yet relatively algorithmically simple techniques such as auto correlation. Examples of a possible algorithm are shown in
FIGS. 13-19 . - The simple tracking system outlined in
FIGS. 13-19 detects only changes in position and orientation. With the addition of several retroflective targets, which can be easily distinguished from the background images using differential imaging techniques, it is possible to gain absolute reference points. Such absolute reference points would probably be located at the extremities of the environmental region (i.e. confines of the user space) however they could be placed anywhere within the real environment, provided the VTV hardware is aware of the real world coordinates of these markers. The combination of these absolute reference points and differential movement (from the image analysis data) makes possible the generation of absolute real world coordinate information at full video frame rates. As an alternative to the placement of retroflective targets at known spatial coordinates, active optical beacons could be employed. These devices would operate in a similar fashion to the retroflective targets in that they would be configured' to strobe light in synchronism with the video capture rate thus allowing differential video analysis to be performed on the resultant images. However, unlike passive retroflective targets, active optical beacons could, in addition to strobing in time with the video capture, transmit additional information describing their real world coordinates to the HMD. As a result, the system would not have to explicitly know the locations of these beacons as this data could be extracted “on the fly”. Such a system is very versatile and somewhat more rugged than the simpler retroflective configuration. - Note:
FIG. 20 shows a simplistic representation of the tracking hardware in which the auto correlators simply detect the presence or absence of a particular movement. A practical system would probably incorporate a number of auto correlators for each class of movement (for example there may be 16 or more separate auto correlators to detect horizontal movement). Such as system would then be able to detect different levels or amounts of movement in all of the directions. - An alternative implementation of this tracking system is possible utilizing a similar image analysis technique to track a pattern on the ceiling to achieve spatial positioning information and simple “tilt sensors” to detect angular orientation of the HMD/Pan-Cam system. The advantage of this system is that it is considerably simpler and less expensive than the full six axis optical tracker previously described. The fact that the ceiling is at a constant distance and known orientation from the HMD greatly implifies the optical system, the quality of the required imaging device and the complexity of the subsequent image analysis. As in the previous six-axis optical tracking system, this spatial positioning information is inherently in the form of relative movement only. However, the addition of “absolute reference points” allows such a system to re-calibrate its absolute references and thus achieve an overall absolute coordinate system. This absolute reference point calibration can be achieved relatively easily utilizing several different techniques. The first, and perhaps simplest technique is to use color sensitive retroflective spots as previously described. Alternately, active optical beacons (such as LED beacons) could also be utilized. A further alternative absolute reference calibration system which could be used is based on a bi-directional infrared beacon. Such as system would communicate a unique code between the HMD and the beacon, such that calibration would occur only once each time the HMD passed under any of these “known spatial reference points”. This is required to avoid “dead tracking regions” within the vicinity of the calibration beacons due to multiple origin resets.
- The basic auto correlation technique used to locate movement within the image can be simplified into reasonably straightforward image processing steps. Firstly, rotation detection can be simplified into a group of lateral shifts (up, down, left, right) symmetrical around the center of the image (optical axis of the camera). Additionally, these “sample points” for lateral movement do not necessarily have to be very large. They do however have to contain unique picture information. For example a blank featureless wall will yield no useful tracking information However an image with high contrast regions such as edges of objects or bright highlight points is relatively easily tracked. Taking this thinking one step further, it is possible to first reduce the entire image into highlight points/edges. The image can then be processed as a series of horizontal and vertical strips such that auto correlation regions are bounded between highlight points/edges. Additionally, small highlight regions can very easily be tracked by comparing previous image frames against current images and determining “closest possible fit” between the images (i.e. minimum movement of highlight points). Such techniques are relatively easy and well within the capabilities of most moderate speed micro-processors, provided some of-the image pre-processing overhead is handled by hardware.
Claims (32)
1) An interactive image capture and display system comprising
a) an image input means including an array of electronic image capture devices distributed in a horizontal plane such that their fields of view partially overlap and collectively cover a full 360 degrees; and
b) an image storage and playback means compatible with existing television standards;
c) a signal processing means including
1) a means of producing graphical imagery depicting a panoramic image such that said panoramic image is composed of a plurality of smaller image sections;
2) a means for cropping, distorting and aligning individual images produced by the said image capture devices to produce an overall 360 degree panoramic image with negligible distortion and overlap between the individual image sections and wherein each pixel in the resulting 360 degree panoramic image has the same effective width, where each pixel subtends an equal horizontal angle to the center of said panoramic image;
3) a means for generating an image representing a subset of the said 360 degree panoramic image, whereby the azimuth and elevation of the center of said subset is adjustable by user control;
4) a means for selectively combining and geometrically altering real time imagery from said capture devices and prerecorded imagery to create a composite augmented reality experience;
5) a means for determining the correct location of said image sections within said 360 degree panoramic image utilizing additional information present in the source media;
6) a means for inserting tracking information to describe at least the current orientation of said array of electronic image capture devices into an outgoing video stream;
7) a means for encoding multi-track audio such that it maintains compatibility with standard video storage, playback and transmission systems; and
8) a means for producing orientation-sensitive audio in real-time, utilizing multi-track audio information and controlled by coordinates of a viewport within said panoramic image;
d) an image output means capable of outputting an image in a format compatible with existing television standards;
e) an audio output means capable of outputting at least 2 channels of audio;
f) a display means including at least one display device;
g) a user control means including an input device allowing the user to control said signal processing means; and
h) A tracking means capable of measuring at least azimuth and elevation of said array of electronic image capture devices.
2) The system according to claim 1 further comprising signal processing means for applying distortion correction to the images, wherein each pixel in the resulting 360 degree panoramic image has the same effective height, where each pixel subtends an equal vertical angle to the center of said panoramic image.
3) A system according to claim 1 in which said display means is a conventional television type display device and the user input means is an infrared or radio based manually operated remote control device.
4) A system according to claim 1 in which said display means is a helmet mounted display device and the user input means is an automatic tracking device that calculates at least azimuth and elevation of the user's head.
5) A system according to claim 1 which utilizes a modified television protocol comprising a plurality of video fields or frames such that each field or frame includes at least one of graphical data, sound data, and control information, wherein the signal from said image playback means is compatible with at least one widely accepted television standard.
6) A system according to claim 5 wherein said modified television protocol further comprises, within one or more scan lines of a standard video image, additional coded data defining control parameters and image manipulation data for a signal processing means.
7) A system according to claim 5 wherein said graphical data comprises sections of said 360 degree panoramic image.
8) A system according to claim 5 further comprising, within one or more scan lines of a standard video image, additional coded data providing information defining the placement position of image sections within said 360 degree panoramic image.
9) A system according to claim 5 further comprising within one or more scan lines of a standard video image, additional coded data providing information for the generation of four or more real-time audio tracks.
10) A system according to 5 further comprising within one or more scan lines of a standard video image, additional coded data providing
a) audio information for generation of four or more real-time audio tracks; and
b) data descriptive of a number of employed audio tracks, an employed audio data format, an employed audio sampling rate,
and track synchronization, whereby said signal processing means can decode the audio information into position and orientation sensitive sound.
11) A system according to claim 5 further comprising, within one or more scan lines of a standard video image, additional coded data which provides information as to absolute orientation and X-Y-Z position of said capture device array.
12) A system according to claim 1 further comprising
a) means for mathematically combining information about azimuth
and elevation of a viewer; and
b) means for encoding multi-track audio for use with standard video storage and transmission systems such that the combined information can be subsequently decoded by specific hardware to produce a left and right audio channel with spatially correct three-dimensional audio for the left and right ears of a viewer.
13) A system according to claim 1 further comprising means for varying angular field of view of said viewport within said panoramic image responsive to runtime user control.
14) A system according to claim 1 further comprising means for varying the position of a viewpoint within a three-dimensional virtual space responsive to runtime user control.
15) A system according to claim 1 further comprising:
a) a tracking device for continuously calculating a viewer's physical position; and
b) means for varying the position of a viewpoint within a three-dimensional virtual space responsive to said position.
16) A system according to claim 1 further comprising means for providing orientation-sensitive audio in real-time, controlled by the direction of the viewer's head.
17) A system according to claim 1 further comprising means for providing orientation-sensitive audio in real-time, controlled by coordinates of a viewport within said panoramic image.
18) A system according to claim 1 further comprising means for providing position-sensitive audio in real-time, controlled by the virtual position of a viewpoint within a three-dimensional virtual space.
19) A system according to claim 1 wherein said signal processing means comprises
a) one or more video digitizing modules;
b) one or more memory areas selected from the group consisting of ARM, VRM, and TM;
c) digital processing means for
1) altering address mapping of data held in at least one of ARM and VRM so as to effectively move graphical information from one location to another therein; and
2) mathematically combining and altering data from both a source location and a destination location, thereby achieving the functions of compositing and transformation; and
d) one or more video, generation modules.
20) A system according to claim 19 wherein said ARM is mapped to occupy a smaller vertical field of view than said VRM and said TM, thereby reducing the amount of data required for the generation of a high-quality image.
21) A system according to claim 19 further comprising means for mapping ARM, VRM, and TM at different resolutions, whereby pixels in each memory region can represent different degrees of angular deviation.
22) A system according to claim 1 further comprising
a) means for displaying imagery;
b) means for placing said real-time video imagery into ARM and source information from said video playback means into VRM; and
c) means for combining imagery from ARM and VRM according to a pattern of data held in TM into a composite image before display.
23) A system according to claim 1 further comprising:
a) means for displaying imagery;
b) means for placing source information from said video playback means into ARM and VRM; and
c) means for combining imagery from ARM and VRM according to a translation map included in the source media.
24) The system according to claim 1 further comprising
a) means for displaying imagery;
b) means for placing source information from said video playback means into ARM and VRM; and
c) means for combining imagery from ARM and VRM in accordance with a geometric interpretation of said real-time video imagery.
25) A system according to claim 1 further comprising signal processing means for inserting identification information to describe the location of individual image sections that comprise said 360 degree panoramic image into said outgoing video stream.
26) A system according to claim 1 wherein said tracking information also describes the current spatial position of said array of electronic image capture devices into said outgoing video stream.
27) A system according to claim 1 whereby said signal processing means utilizes data received from said array of electronic image capture devices and, by performing a series of image analysis processes, calculates changes in the orientation of said array of electronic image capture devices.
28) A system according to claim 1 whereby said signal processing means utilizes data received from said array of electronic image capture devices and, by performing a series of image analysis processes, calculates changes in the position of said array of electronic image capture devices.
29) A system according to claim 1 wherein said tracking means comprises
a) a plurality of reflective targets placed at predetermined coordinates;
b) a plurality of on-axis light sources strobed in synchronization with the capture rate of said array of electronic image capture devices; and
c) means for computing absolute angular and spatial data based on said predetermined coordinates and relative angular and spatial data determined by said array of electronic image capture devices.
30) A system according to claim 29 further comprising a plurality of color filters positioned over said reflective targets, whereby the ability of said system to correctly identify and maintain tracking of said reflective targets is improved.
31) A system according to claim 29 wherein said light sources are color-controllable, whereby the ability of the system to correctly identify and maintain tracking of said reflective targets is improved.
32) A system according to claim 1 wherein said tracking means incorporates active beacons which utilize at least one of pulse timing and color of light to transmit spatial coordinates of each beacon to said array of electronic image capture devices, whereby relative angular and spatial data can be determined by said array of electronic image capture devices and converted into absolute angular and spatial data.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/732,671 US20100302348A1 (en) | 2001-06-25 | 2010-03-26 | VTV System |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/891,733 US20010056574A1 (en) | 2000-06-26 | 2001-06-25 | VTV system |
US11/230,173 US7688346B2 (en) | 2001-06-25 | 2005-09-19 | VTV system |
US12/732,671 US20100302348A1 (en) | 2001-06-25 | 2010-03-26 | VTV System |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US11/230,173 Continuation US7688346B2 (en) | 2001-06-25 | 2005-09-19 | VTV system |
Publications (1)
Publication Number | Publication Date |
---|---|
US20100302348A1 true US20100302348A1 (en) | 2010-12-02 |
Family
ID=25398728
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/891,733 Abandoned US20010056574A1 (en) | 2000-06-26 | 2001-06-25 | VTV system |
US11/230,173 Active 2024-09-19 US7688346B2 (en) | 2001-06-25 | 2005-09-19 | VTV system |
US12/732,671 Abandoned US20100302348A1 (en) | 2001-06-25 | 2010-03-26 | VTV System |
Family Applications Before (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/891,733 Abandoned US20010056574A1 (en) | 2000-06-26 | 2001-06-25 | VTV system |
US11/230,173 Active 2024-09-19 US7688346B2 (en) | 2001-06-25 | 2005-09-19 | VTV system |
Country Status (4)
Country | Link |
---|---|
US (3) | US20010056574A1 (en) |
JP (1) | JP2005500721A (en) |
DE (1) | DE10197255T5 (en) |
WO (1) | WO2003001803A1 (en) |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080117287A1 (en) * | 2006-11-16 | 2008-05-22 | Park Michael C | Distributed video sensor panoramic imaging system |
US20090033772A1 (en) * | 2007-08-01 | 2009-02-05 | Shaka Ramsay | Methods, systems, and computer program products for implementing a personalized, image capture and display system |
US20100045869A1 (en) * | 2008-08-19 | 2010-02-25 | Sony Computer Entertainment Europe Ltd. | Entertainment Device, System, and Method |
US20120013609A1 (en) * | 2009-12-11 | 2012-01-19 | Nokia Corporation | Method and apparatus for presenting a first person world view of content |
US8265426B2 (en) * | 2008-08-04 | 2012-09-11 | Kabushiki Kaisha Toshiba | Image processor and image processing method for increasing video resolution |
US20130333633A1 (en) * | 2012-06-14 | 2013-12-19 | Tai Cheung Poon | Systems and methods for testing dogs' hearing, vision, and responsiveness |
WO2014176115A1 (en) * | 2013-04-22 | 2014-10-30 | Ar Tables, Llc | Apparatus for hands-free augmented reality viewing |
US8982472B2 (en) * | 2013-05-21 | 2015-03-17 | Matvey Lvovskiy | Method of widening of angular field of view of collimating optical systems |
US20150128104A1 (en) * | 2013-06-10 | 2015-05-07 | Pixel Press Technology, LLC | Systems and Methods for Creating a Playable Video Game From A Three-Dimensional Model |
US20170105053A1 (en) * | 2012-04-24 | 2017-04-13 | Skreens Entertainment Technologies, Inc. | Video display system |
US9743119B2 (en) | 2012-04-24 | 2017-08-22 | Skreens Entertainment Technologies, Inc. | Video display system |
US10363486B2 (en) | 2013-06-10 | 2019-07-30 | Pixel Press Technology, LLC | Smart video game board system and methods |
US10547704B2 (en) * | 2017-04-06 | 2020-01-28 | Sony Interactive Entertainment Inc. | Predictive bitrate selection for 360 video streaming |
US11284137B2 (en) | 2012-04-24 | 2022-03-22 | Skreens Entertainment Technologies, Inc. | Video processing systems and methods for display, selection and navigation of a combination of heterogeneous sources |
Families Citing this family (115)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2374505B (en) * | 2001-01-29 | 2004-10-20 | Hewlett Packard Co | Audio announcements with range indications |
JP3880561B2 (en) * | 2002-09-05 | 2007-02-14 | 株式会社ソニー・コンピュータエンタテインメント | Display system |
US7734070B1 (en) | 2002-12-31 | 2010-06-08 | Rajeev Sharma | Method and system for immersing face images into a video sequence |
US9063633B2 (en) * | 2006-03-30 | 2015-06-23 | Arjuna Indraeswaran Rajasingham | Virtual navigation system for virtual and real spaces |
DE20305278U1 (en) * | 2003-04-02 | 2003-06-12 | Daimler Chrysler Ag | Device for taking into account the viewer's position when displaying 3D image content on 2D display devices |
US7256779B2 (en) * | 2003-05-08 | 2007-08-14 | Nintendo Co., Ltd. | Video game play using panoramically-composited depth-mapped cube mapping |
US7118228B2 (en) | 2003-11-04 | 2006-10-10 | Hewlett-Packard Development Company, L.P. | Image display system |
DE10361512A1 (en) * | 2003-12-23 | 2005-07-28 | Siemens Ag | Device and method for positionally precise superposition of the real field of view |
JP4522129B2 (en) * | 2004-03-31 | 2010-08-11 | キヤノン株式会社 | Image processing method and image processing apparatus |
US20070182812A1 (en) * | 2004-05-19 | 2007-08-09 | Ritchey Kurtis J | Panoramic image-based virtual reality/telepresence audio-visual system and method |
US20060069591A1 (en) * | 2004-09-29 | 2006-03-30 | Razzano Michael R | Dental image charting system and method |
US9101279B2 (en) | 2006-02-15 | 2015-08-11 | Virtual Video Reality By Ritchey, Llc | Mobile user borne brain activity data and surrounding environment data correlation system |
CN101496387B (en) | 2006-03-06 | 2012-09-05 | 思科技术公司 | System and method for access authentication in a mobile wireless network |
US9958934B1 (en) | 2006-05-01 | 2018-05-01 | Jeffrey D. Mullen | Home and portable augmented reality and virtual reality video game consoles |
US20080007617A1 (en) * | 2006-05-11 | 2008-01-10 | Ritchey Kurtis J | Volumetric panoramic sensor systems |
JP5074777B2 (en) * | 2006-05-22 | 2012-11-14 | キヤノン株式会社 | Display device with imaging function, image processing device, image processing method, and image display system |
US20080117288A1 (en) * | 2006-11-16 | 2008-05-22 | Imove, Inc. | Distributed Video Sensor Panoramic Imaging System |
US8570373B2 (en) * | 2007-06-08 | 2013-10-29 | Cisco Technology, Inc. | Tracking an object utilizing location information associated with a wireless device |
US8717412B2 (en) * | 2007-07-18 | 2014-05-06 | Samsung Electronics Co., Ltd. | Panoramic image production |
US9703369B1 (en) | 2007-10-11 | 2017-07-11 | Jeffrey David Mullen | Augmented reality video game systems |
US8797377B2 (en) | 2008-02-14 | 2014-08-05 | Cisco Technology, Inc. | Method and system for videoconference configuration |
US8355041B2 (en) * | 2008-02-14 | 2013-01-15 | Cisco Technology, Inc. | Telepresence system for 360 degree video conferencing |
US10229389B2 (en) * | 2008-02-25 | 2019-03-12 | International Business Machines Corporation | System and method for managing community assets |
US8319819B2 (en) * | 2008-03-26 | 2012-11-27 | Cisco Technology, Inc. | Virtual round-table videoconference |
US8390667B2 (en) | 2008-04-15 | 2013-03-05 | Cisco Technology, Inc. | Pop-up PIP for people not in picture |
US8694658B2 (en) * | 2008-09-19 | 2014-04-08 | Cisco Technology, Inc. | System and method for enabling communication sessions in a network environment |
US8529346B1 (en) * | 2008-12-30 | 2013-09-10 | Lucasfilm Entertainment Company Ltd. | Allocating and managing software assets |
US8659637B2 (en) | 2009-03-09 | 2014-02-25 | Cisco Technology, Inc. | System and method for providing three dimensional video conferencing in a network environment |
US8477175B2 (en) * | 2009-03-09 | 2013-07-02 | Cisco Technology, Inc. | System and method for providing three dimensional imaging in a network environment |
US20130176192A1 (en) * | 2011-09-30 | 2013-07-11 | Kenneth Varga | Extra-sensory perception sharing force capability and unknown terrain identification system |
WO2010124074A1 (en) * | 2009-04-22 | 2010-10-28 | Terrence Dashon Howard | System for merging virtual reality and reality to provide an enhanced sensory experience |
US8659639B2 (en) | 2009-05-29 | 2014-02-25 | Cisco Technology, Inc. | System and method for extending communications between participants in a conferencing environment |
US8762982B1 (en) * | 2009-06-22 | 2014-06-24 | Yazaki North America, Inc. | Method for programming an instrument cluster |
US9082297B2 (en) | 2009-08-11 | 2015-07-14 | Cisco Technology, Inc. | System and method for verifying parameters in an audiovisual environment |
US9225916B2 (en) | 2010-03-18 | 2015-12-29 | Cisco Technology, Inc. | System and method for enhancing video images in a conferencing environment |
USD626103S1 (en) | 2010-03-21 | 2010-10-26 | Cisco Technology, Inc. | Video unit with integrated features |
USD626102S1 (en) | 2010-03-21 | 2010-10-26 | Cisco Tech Inc | Video unit with integrated features |
US9313452B2 (en) | 2010-05-17 | 2016-04-12 | Cisco Technology, Inc. | System and method for providing retracting optics in a video conferencing environment |
US8771064B2 (en) | 2010-05-26 | 2014-07-08 | Aristocrat Technologies Australia Pty Limited | Gaming system and a method of gaming |
US8896655B2 (en) | 2010-08-31 | 2014-11-25 | Cisco Technology, Inc. | System and method for providing depth adaptive video conferencing |
US8599934B2 (en) | 2010-09-08 | 2013-12-03 | Cisco Technology, Inc. | System and method for skip coding during video conferencing in a network environment |
US20120075466A1 (en) * | 2010-09-29 | 2012-03-29 | Raytheon Company | Remote viewing |
WO2012048252A1 (en) | 2010-10-07 | 2012-04-12 | Aria Glassworks, Inc. | System and method for transitioning between interface modes in virtual and augmented reality applications |
US8599865B2 (en) | 2010-10-26 | 2013-12-03 | Cisco Technology, Inc. | System and method for provisioning flows in a mobile network environment |
US8699457B2 (en) | 2010-11-03 | 2014-04-15 | Cisco Technology, Inc. | System and method for managing flows in a mobile network environment |
US9143725B2 (en) | 2010-11-15 | 2015-09-22 | Cisco Technology, Inc. | System and method for providing enhanced graphics in a video environment |
US8902244B2 (en) | 2010-11-15 | 2014-12-02 | Cisco Technology, Inc. | System and method for providing enhanced graphics in a video environment |
US9338394B2 (en) | 2010-11-15 | 2016-05-10 | Cisco Technology, Inc. | System and method for providing enhanced audio in a video environment |
US8730297B2 (en) | 2010-11-15 | 2014-05-20 | Cisco Technology, Inc. | System and method for providing camera functions in a video environment |
US8576276B2 (en) | 2010-11-18 | 2013-11-05 | Microsoft Corporation | Head-mounted display device which provides surround video |
US8542264B2 (en) | 2010-11-18 | 2013-09-24 | Cisco Technology, Inc. | System and method for managing optics in a video environment |
US8723914B2 (en) | 2010-11-19 | 2014-05-13 | Cisco Technology, Inc. | System and method for providing enhanced video processing in a network environment |
US9017163B2 (en) | 2010-11-24 | 2015-04-28 | Aria Glassworks, Inc. | System and method for acquiring virtual and augmented reality scenes by a user |
US9111138B2 (en) | 2010-11-30 | 2015-08-18 | Cisco Technology, Inc. | System and method for gesture interface control |
USD682864S1 (en) | 2010-12-16 | 2013-05-21 | Cisco Technology, Inc. | Display screen with graphical user interface |
USD682293S1 (en) | 2010-12-16 | 2013-05-14 | Cisco Technology, Inc. | Display screen with graphical user interface |
USD678320S1 (en) | 2010-12-16 | 2013-03-19 | Cisco Technology, Inc. | Display screen with graphical user interface |
USD682294S1 (en) | 2010-12-16 | 2013-05-14 | Cisco Technology, Inc. | Display screen with graphical user interface |
USD682854S1 (en) | 2010-12-16 | 2013-05-21 | Cisco Technology, Inc. | Display screen for graphical user interface |
USD678894S1 (en) | 2010-12-16 | 2013-03-26 | Cisco Technology, Inc. | Display screen with graphical user interface |
USD678307S1 (en) | 2010-12-16 | 2013-03-19 | Cisco Technology, Inc. | Display screen with graphical user interface |
USD678308S1 (en) | 2010-12-16 | 2013-03-19 | Cisco Technology, Inc. | Display screen with graphical user interface |
US8845107B1 (en) | 2010-12-23 | 2014-09-30 | Rawles Llc | Characterization of a scene with structured light |
US8905551B1 (en) | 2010-12-23 | 2014-12-09 | Rawles Llc | Unpowered augmented reality projection accessory display device |
US8845110B1 (en) * | 2010-12-23 | 2014-09-30 | Rawles Llc | Powered augmented reality projection accessory display device |
US9721386B1 (en) | 2010-12-27 | 2017-08-01 | Amazon Technologies, Inc. | Integrated augmented reality environment |
US9607315B1 (en) | 2010-12-30 | 2017-03-28 | Amazon Technologies, Inc. | Complementing operation of display devices in an augmented reality environment |
US9508194B1 (en) | 2010-12-30 | 2016-11-29 | Amazon Technologies, Inc. | Utilizing content output devices in an augmented reality environment |
US8953022B2 (en) | 2011-01-10 | 2015-02-10 | Aria Glassworks, Inc. | System and method for sharing virtual and augmented reality scenes between users and viewers |
US8692862B2 (en) | 2011-02-28 | 2014-04-08 | Cisco Technology, Inc. | System and method for selection of video data in a video conference environment |
US8670019B2 (en) | 2011-04-28 | 2014-03-11 | Cisco Technology, Inc. | System and method for providing enhanced eye gaze in a video conferencing environment |
US8786631B1 (en) | 2011-04-30 | 2014-07-22 | Cisco Technology, Inc. | System and method for transferring transparency information in a video environment |
US20120281128A1 (en) * | 2011-05-05 | 2012-11-08 | Sony Corporation | Tailoring audio video output for viewer position and needs |
US8934026B2 (en) | 2011-05-12 | 2015-01-13 | Cisco Technology, Inc. | System and method for video coding in a dynamic environment |
JP5591281B2 (en) * | 2011-06-03 | 2014-09-17 | 任天堂株式会社 | Information processing system, information processing apparatus, information processing program, and moving image reproduction control method |
JP5918618B2 (en) | 2011-06-03 | 2016-05-18 | 任天堂株式会社 | Information processing program, information processing apparatus, information processing system, and information processing method |
US8947493B2 (en) | 2011-11-16 | 2015-02-03 | Cisco Technology, Inc. | System and method for alerting a participant in a video conference |
US8682087B2 (en) | 2011-12-19 | 2014-03-25 | Cisco Technology, Inc. | System and method for depth-guided image filtering in a video conference environment |
US20130250040A1 (en) * | 2012-03-23 | 2013-09-26 | Broadcom Corporation | Capturing and Displaying Stereoscopic Panoramic Images |
US9179126B2 (en) * | 2012-06-01 | 2015-11-03 | Ostendo Technologies, Inc. | Spatio-temporal light field cameras |
US9626799B2 (en) * | 2012-10-02 | 2017-04-18 | Aria Glassworks, Inc. | System and method for dynamically displaying multiple virtual and augmented reality scenes on a single display |
US9681154B2 (en) | 2012-12-06 | 2017-06-13 | Patent Capital Group | System and method for depth-guided filtering in a video conference environment |
US10769852B2 (en) | 2013-03-14 | 2020-09-08 | Aria Glassworks, Inc. | Method for simulating natural perception in virtual and augmented reality scenes |
US9843621B2 (en) | 2013-05-17 | 2017-12-12 | Cisco Technology, Inc. | Calendaring activities based on communication processing |
FR3006841B1 (en) * | 2013-06-07 | 2015-07-03 | Kolor | FUSION OF SEVERAL VIDEO STREAMS |
KR20150068299A (en) * | 2013-12-09 | 2015-06-19 | 씨제이씨지브이 주식회사 | Method and system of generating images for multi-surface display |
KR20150068298A (en) * | 2013-12-09 | 2015-06-19 | 씨제이씨지브이 주식회사 | Method and system of generating images for multi-surface display |
US10977864B2 (en) | 2014-02-21 | 2021-04-13 | Dropbox, Inc. | Techniques for capturing and displaying partial motion in virtual or augmented reality scenes |
US9392212B1 (en) | 2014-04-17 | 2016-07-12 | Visionary Vr, Inc. | System and method for presenting virtual reality content to a user |
US20170127035A1 (en) * | 2014-04-22 | 2017-05-04 | Sony Corporation | Information reproducing apparatus and information reproducing method, and information recording apparatus and information recording method |
US9665170B1 (en) | 2015-06-10 | 2017-05-30 | Visionary Vr, Inc. | System and method for presenting virtual reality content to a user based on body posture |
DE102015116868A1 (en) | 2015-10-05 | 2017-04-06 | Christoph Greiffenbach | Presentation system for advertising purposes and for displaying a product |
AU2017214748B9 (en) * | 2016-02-05 | 2021-05-27 | Magic Leap, Inc. | Systems and methods for augmented reality |
FR3057430B1 (en) * | 2016-10-10 | 2022-02-18 | Immersion | DEVICE FOR IMMERSION IN A REPRESENTATION OF AN ENVIRONMENT RESULTING FROM A SET OF IMAGES |
US20170329394A1 (en) * | 2016-05-13 | 2017-11-16 | Benjamin Lloyd Goldstein | Virtual and augmented reality systems |
WO2017205642A1 (en) * | 2016-05-25 | 2017-11-30 | Livit Media Inc. | Methods and systems for live sharing 360-degree video streams on a mobile device |
JP2018101452A (en) * | 2016-12-20 | 2018-06-28 | カシオ計算機株式会社 | Output control device, content storage device, output control method, content storage method, program and data structure |
US10593012B2 (en) * | 2017-03-22 | 2020-03-17 | Mediatek Inc. | Method and apparatus for generating and encoding projection-based frame with 360-degree content represented in projection faces packed in segmented sphere projection layout |
US10796723B2 (en) * | 2017-05-26 | 2020-10-06 | Immersive Licensing, Inc. | Spatialized rendering of real-time video data to 3D space |
US11361640B2 (en) | 2017-06-30 | 2022-06-14 | Johnson Controls Tyco IP Holdings LLP | Security camera system with multi-directional mount and method of operation |
US20190007672A1 (en) * | 2017-06-30 | 2019-01-03 | Bobby Gene Burrough | Method and Apparatus for Generating Dynamic Real-Time 3D Environment Projections |
US11288937B2 (en) | 2017-06-30 | 2022-03-29 | Johnson Controls Tyco IP Holdings LLP | Security camera system with multi-directional mount and method of operation |
US20190104282A1 (en) * | 2017-09-29 | 2019-04-04 | Sensormatic Electronics, LLC | Security Camera System with Multi-Directional Mount and Method of Operation |
CN111201837B (en) | 2017-10-16 | 2022-10-11 | 昕诺飞控股有限公司 | Method and controller for controlling a plurality of lighting devices |
US10712810B2 (en) * | 2017-12-08 | 2020-07-14 | Telefonaktiebolaget Lm Ericsson (Publ) | System and method for interactive 360 video playback based on user location |
US10217488B1 (en) * | 2017-12-15 | 2019-02-26 | Snap Inc. | Spherical video editing |
CN109996060B (en) * | 2017-12-30 | 2021-09-03 | 深圳多哚新技术有限责任公司 | Virtual reality cinema system and information processing method |
KR102157160B1 (en) * | 2018-12-27 | 2020-09-17 | 주식회사 다윈테크 | 360°virtual image experience system |
US11683464B2 (en) * | 2018-12-28 | 2023-06-20 | Canon Kabushiki Kaisha | Electronic device, control method, and non-transitorycomputer readable medium |
JP7349793B2 (en) * | 2019-02-15 | 2023-09-25 | キヤノン株式会社 | Image processing device, image processing method, and program |
US11372474B2 (en) * | 2019-07-03 | 2022-06-28 | Saec/Kinetic Vision, Inc. | Systems and methods for virtual artificial intelligence development and testing |
US11503227B2 (en) | 2019-09-18 | 2022-11-15 | Very 360 Vr Llc | Systems and methods of transitioning between video clips in interactive videos |
US11816757B1 (en) * | 2019-12-11 | 2023-11-14 | Meta Platforms Technologies, Llc | Device-side capture of data representative of an artificial reality environment |
CN112233048B (en) * | 2020-12-11 | 2021-03-02 | 成都成电光信科技股份有限公司 | Spherical video image correction method |
CN113824746B (en) * | 2021-11-25 | 2022-02-18 | 山东信息职业技术学院 | Virtual reality information transmission method and virtual reality system |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5130794A (en) * | 1990-03-29 | 1992-07-14 | Ritchey Kurtis J | Panoramic display system |
US5850352A (en) * | 1995-03-31 | 1998-12-15 | The Regents Of The University Of California | Immersive video, including video hypermosaicing to generate from multiple video views of a scene a three-dimensional video mosaic from which diverse virtual video scene images are synthesized, including panoramic, scene interactive and stereoscopic images |
US6654019B2 (en) * | 1998-05-13 | 2003-11-25 | Imove, Inc. | Panoramic movie which utilizes a series of captured panoramic images to display movement as observed by a viewer looking in a selected direction |
US6765569B2 (en) * | 2001-03-07 | 2004-07-20 | University Of Southern California | Augmented-reality tool employing scene-feature autocalibration during camera motion |
Family Cites Families (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3580978A (en) * | 1968-06-06 | 1971-05-25 | Singer General Precision | Visual display method and apparatus |
US3872238A (en) * | 1974-03-11 | 1975-03-18 | Us Navy | 360 Degree panoramic television system |
JPS5124211A (en) * | 1974-08-23 | 1976-02-27 | Victor Company Of Japan | Onseishingono shuhasuhenkansochi |
JPS60141087A (en) * | 1983-12-28 | 1985-07-26 | Tsutomu Ohashi | Reproducer of environment |
US5130815A (en) * | 1990-07-20 | 1992-07-14 | Mti Associates | Method and apparatus for encoding a video signal having multi-language capabilities |
US5148310A (en) * | 1990-08-30 | 1992-09-15 | Batchko Robert G | Rotating flat screen fully addressable volume display system |
ES2043549B1 (en) * | 1992-04-30 | 1996-10-01 | Jp Producciones Sl | INTEGRAL RECORDING SYSTEM, PROJECTION-VISUALIZATION-AUDITION OF IMAGES AND / OR PERFECTED VIRTUAL REALITY. |
US5495576A (en) * | 1993-01-11 | 1996-02-27 | Ritchey; Kurtis J. | Panoramic image based virtual reality/telepresence audio-visual system and method |
JPH06301390A (en) * | 1993-04-12 | 1994-10-28 | Sanyo Electric Co Ltd | Stereoscopic sound image controller |
US5619255A (en) * | 1994-08-19 | 1997-04-08 | Cornell Research Foundation, Inc. | Wide-screen video system |
US5991085A (en) * | 1995-04-21 | 1999-11-23 | I-O Display Systems Llc | Head-mounted personal visual display apparatus with image generator and holder |
US5703604A (en) * | 1995-05-22 | 1997-12-30 | Dodeca Llc | Immersive dodecaherdral video viewing system |
US5999220A (en) * | 1997-04-07 | 1999-12-07 | Washino; Kinya | Multi-format audio/video production system with frame-rate conversion |
JP3232408B2 (en) * | 1997-12-01 | 2001-11-26 | 日本エルエスアイカード株式会社 | Image generation device, image presentation device, and image generation method |
JPH11308608A (en) * | 1998-02-19 | 1999-11-05 | Nippon Lsi Card Co Ltd | Dynamic image generating method, dynamic image generator, and dynamic image display method |
US6064423A (en) * | 1998-02-12 | 2000-05-16 | Geng; Zheng Jason | Method and apparatus for high resolution three dimensional display |
JP2002516121A (en) * | 1998-03-03 | 2002-06-04 | アリーナ, インコーポレイテッド | System and method for tracking and evaluating exercise techniques in a multidimensional space |
JP3449937B2 (en) * | 1999-01-14 | 2003-09-22 | 日本電信電話株式会社 | Panorama image creation method, surrounding situation transmission method using panorama image, and recording medium recording these methods |
JP4453119B2 (en) * | 1999-06-08 | 2010-04-21 | ソニー株式会社 | Camera calibration apparatus and method, image processing apparatus and method, program providing medium, and camera |
GB9914914D0 (en) * | 1999-06-26 | 1999-08-25 | British Aerospace | Measurement apparatus for measuring the position and orientation of a first part to be worked, inspected or moved |
JP2001108421A (en) * | 1999-10-13 | 2001-04-20 | Sanyo Electric Co Ltd | Method and apparatus for three-dimensional modeling, and medium recording three-dimensional modeling program |
-
2001
- 2001-06-25 US US09/891,733 patent/US20010056574A1/en not_active Abandoned
- 2001-12-21 DE DE10197255T patent/DE10197255T5/en not_active Withdrawn
- 2001-12-21 JP JP2003508064A patent/JP2005500721A/en active Pending
- 2001-12-21 WO PCT/US2001/049287 patent/WO2003001803A1/en active Application Filing
-
2005
- 2005-09-19 US US11/230,173 patent/US7688346B2/en active Active
-
2010
- 2010-03-26 US US12/732,671 patent/US20100302348A1/en not_active Abandoned
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5130794A (en) * | 1990-03-29 | 1992-07-14 | Ritchey Kurtis J | Panoramic display system |
US5850352A (en) * | 1995-03-31 | 1998-12-15 | The Regents Of The University Of California | Immersive video, including video hypermosaicing to generate from multiple video views of a scene a three-dimensional video mosaic from which diverse virtual video scene images are synthesized, including panoramic, scene interactive and stereoscopic images |
US6654019B2 (en) * | 1998-05-13 | 2003-11-25 | Imove, Inc. | Panoramic movie which utilizes a series of captured panoramic images to display movement as observed by a viewer looking in a selected direction |
US6765569B2 (en) * | 2001-03-07 | 2004-07-20 | University Of Southern California | Augmented-reality tool employing scene-feature autocalibration during camera motion |
Cited By (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8094182B2 (en) * | 2006-11-16 | 2012-01-10 | Imove, Inc. | Distributed video sensor panoramic imaging system |
US20080117287A1 (en) * | 2006-11-16 | 2008-05-22 | Park Michael C | Distributed video sensor panoramic imaging system |
US20090033772A1 (en) * | 2007-08-01 | 2009-02-05 | Shaka Ramsay | Methods, systems, and computer program products for implementing a personalized, image capture and display system |
US8098283B2 (en) * | 2007-08-01 | 2012-01-17 | Shaka Ramsay | Methods, systems, and computer program products for implementing a personalized, image capture and display system |
US8265426B2 (en) * | 2008-08-04 | 2012-09-11 | Kabushiki Kaisha Toshiba | Image processor and image processing method for increasing video resolution |
US20100045869A1 (en) * | 2008-08-19 | 2010-02-25 | Sony Computer Entertainment Europe Ltd. | Entertainment Device, System, and Method |
US8542250B2 (en) * | 2008-08-19 | 2013-09-24 | Sony Computer Entertainment Europe Limited | Entertainment device, system, and method |
US8812990B2 (en) * | 2009-12-11 | 2014-08-19 | Nokia Corporation | Method and apparatus for presenting a first person world view of content |
US20120013609A1 (en) * | 2009-12-11 | 2012-01-19 | Nokia Corporation | Method and apparatus for presenting a first person world view of content |
US9743119B2 (en) | 2012-04-24 | 2017-08-22 | Skreens Entertainment Technologies, Inc. | Video display system |
US20170105053A1 (en) * | 2012-04-24 | 2017-04-13 | Skreens Entertainment Technologies, Inc. | Video display system |
US11284137B2 (en) | 2012-04-24 | 2022-03-22 | Skreens Entertainment Technologies, Inc. | Video processing systems and methods for display, selection and navigation of a combination of heterogeneous sources |
US10499118B2 (en) * | 2012-04-24 | 2019-12-03 | Skreens Entertainment Technologies, Inc. | Virtual and augmented reality system and headset display |
US20130333633A1 (en) * | 2012-06-14 | 2013-12-19 | Tai Cheung Poon | Systems and methods for testing dogs' hearing, vision, and responsiveness |
US9786246B2 (en) | 2013-04-22 | 2017-10-10 | Ar Tables, Llc | Apparatus for hands-free augmented reality viewing |
WO2014176115A1 (en) * | 2013-04-22 | 2014-10-30 | Ar Tables, Llc | Apparatus for hands-free augmented reality viewing |
US8982472B2 (en) * | 2013-05-21 | 2015-03-17 | Matvey Lvovskiy | Method of widening of angular field of view of collimating optical systems |
US9579573B2 (en) * | 2013-06-10 | 2017-02-28 | Pixel Press Technology, LLC | Systems and methods for creating a playable video game from a three-dimensional model |
US10071316B2 (en) | 2013-06-10 | 2018-09-11 | Pixel Press Technology, LLC | Systems and methods for creating a playable video game from a three-dimensional model |
US10363486B2 (en) | 2013-06-10 | 2019-07-30 | Pixel Press Technology, LLC | Smart video game board system and methods |
US20150128104A1 (en) * | 2013-06-10 | 2015-05-07 | Pixel Press Technology, LLC | Systems and Methods for Creating a Playable Video Game From A Three-Dimensional Model |
US10547704B2 (en) * | 2017-04-06 | 2020-01-28 | Sony Interactive Entertainment Inc. | Predictive bitrate selection for 360 video streaming |
US11128730B2 (en) | 2017-04-06 | 2021-09-21 | Sony Interactive Entertainment Inc. | Predictive bitrate selection for 360 video streaming |
Also Published As
Publication number | Publication date |
---|---|
WO2003001803A1 (en) | 2003-01-03 |
US7688346B2 (en) | 2010-03-30 |
JP2005500721A (en) | 2005-01-06 |
DE10197255T5 (en) | 2004-10-14 |
US20010056574A1 (en) | 2001-12-27 |
US20060082643A1 (en) | 2006-04-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7688346B2 (en) | VTV system | |
US7719563B2 (en) | VTV system | |
JP2005500721A5 (en) | ||
US10958890B2 (en) | Method and apparatus for rendering timed text and graphics in virtual reality video | |
CN106165415B (en) | Stereoscopic viewing | |
EP0793392B1 (en) | Method and apparatus for the transmission and the reception of three-dimensional television signals of stereoscopic images | |
US7551770B2 (en) | Image conversion and encoding techniques for displaying stereoscopic 3D images | |
US5077608A (en) | Video effects system able to intersect a 3-D image with a 2-D image | |
KR20170017700A (en) | Electronic Apparatus generating 360 Degrees 3D Stereoscopic Panorama Images and Method thereof | |
US20060165310A1 (en) | Method and apparatus for a virtual scene previewing system | |
US11310459B2 (en) | Image capturing device, image capturing system, image processing method, and recording medium | |
AU1463597A (en) | Method and apparatus for converting a two-dimensional motion picture into a three-dimensional motion picture | |
CN113099204A (en) | Remote live-action augmented reality method based on VR head-mounted display equipment | |
EP3301933A1 (en) | Methods, devices and stream to provide indication of mapping of omnidirectional images | |
US11250540B2 (en) | Image processing apparatus, image capturing system, image processing method, and recording medium | |
JP2018033107A (en) | Video distribution device and distribution method | |
Zheng et al. | Research on panoramic stereo live streaming based on the virtual reality | |
CN114040097A (en) | Large-scene interactive action capturing system based on multi-channel image acquisition and fusion | |
JP3520318B2 (en) | Video composition arithmetic processing apparatus, method and system thereof | |
KR20230035780A (en) | Content video production system based on extended reality | |
JP2004048803A (en) | Video composition processing system | |
JP2022021886A (en) | Vr video generation device and program | |
Yamazawa et al. | Telepresence by real‐time view‐dependent image generation from omnidirectional images | |
KR20200010909A (en) | Panorama chroma-key synthesis system and method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |