WO2001047251A2 - System, apparatus and method for controlling communications in a multimedia communication network - Google Patents

System, apparatus and method for controlling communications in a multimedia communication network Download PDF

Info

Publication number
WO2001047251A2
WO2001047251A2 PCT/US2000/042352 US0042352W WO0147251A2 WO 2001047251 A2 WO2001047251 A2 WO 2001047251A2 US 0042352 W US0042352 W US 0042352W WO 0147251 A2 WO0147251 A2 WO 0147251A2
Authority
WO
WIPO (PCT)
Prior art keywords
accordance
processor
video
conttol
camera
Prior art date
Application number
PCT/US2000/042352
Other languages
French (fr)
Other versions
WO2001047251A3 (en
Inventor
Bruce Norman Zieper
Joseph Jay Fitzpatrick
Ilya Gorelik
Original Assignee
Videoscape.Net, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Videoscape.Net, Inc. filed Critical Videoscape.Net, Inc.
Priority to AU49033/01A priority Critical patent/AU4903301A/en
Publication of WO2001047251A2 publication Critical patent/WO2001047251A2/en
Publication of WO2001047251A3 publication Critical patent/WO2001047251A3/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/147Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/148Interfacing a video terminal to a particular transmission medium, e.g. ISDN

Definitions

  • the invention relates in general to multimedia systems and more specifically to systems and methods for controlling a processor in a multimedia communication system.
  • Multimedia commumcation systems are used for transmitting and receiving a wide variety of audio, video, data and control signals.
  • Video conferencing systems and other multimedia systems frequendy utilize existing communication networks including circuit switched networks such as ISDN (Integrated Service Digital Networks) and POTS (Plain Old Telephone Service) and packet switched networks such as the Internet and intranets as well as private commumcation networks and LANs (Local Area Networks) for establishing a commumcation link.
  • circuit switched networks such as ISDN (Integrated Service Digital Networks) and POTS (Plain Old Telephone Service)
  • packet switched networks such as the Internet and intranets as well as private commumcation networks and LANs (Local Area Networks) for establishing a commumcation link.
  • the ITU International Telecommunication Umon
  • H.32X recommended standards for communicating over POTS
  • ISDN H.310, H.320, H.321
  • ATM H.310
  • switched digital and leased lines H.320
  • the Internet H.323
  • LANs H.323, H.322
  • Isochronous Ethernet H.322
  • the related ITU H.281 standard provides a common protocol for controlling a remote camera through a network connection that has been established using one of the H.32X standards
  • Multimedia systems often utilize a central location as a processing and distribution center for video calls and other multimedia commumcation.
  • the video processing system at the central location typically includes a computer or other processor that interfaces to the commumcation network through a codec (COder- DECoder).
  • COder- DECoder codec
  • Video and audio signals are transmitted from a remote user location through the central location to one or more final destinations.
  • a user at the remote location can send instructions to the video processing system to control the distribution and content of multimedia signals.
  • Conventional systems are limited, however, in that the user at the remote location does not receive visual or audible feedback regarding the requested instruction or command unless the device receiving the commands is a properly equipped camera. Further, conventional systems that provide for control communications to be received by the video processing system typically are not compatible with similar systems provided by other manufacturers.
  • the camera control format in accepted use today is the ITU-T H.281 standard which provides protocol for camera preset (location preset), pan, tilt, and zoom controls.
  • Most conventional video conferencing equipment does not include the necessary hardware and software for utilizing a widely accepted control protocol or method for controlling devices other than cameras.
  • Remote multimedia equipment at a remote user location transmits processor commands through a commumcation network to a video processing system using a camera control format.
  • Visual feedback is provided to the remote user through a graphic user interface by transmitting a video signal from the video processing system to the remote user.
  • the remote user accesses the processor at the central location through the commumcation network and provides commands using the ITU-T H.281 standard format for camera control.
  • a command entered through a control input device is translated to the camera control format and transmitted through the commumcation network to the video processing system.
  • the video processing system converts the processor control command from a camera control format to a computer input format that can be interpreted by the processor.
  • the processor control commands correspond to a movement of cursor or image on a graphical user interface (GUI)
  • GUI graphical user interface
  • the video signal produced by the video processing system corresponding to the GUI is transmitted to the remote equipment through the commumcation network and displayed on a video output device such as video monitor at the remote user location.
  • Audio information contained in the GUI is transmitted through the commumcation network and produced through an audio output device such as a speaker at the remote user location.
  • the GUI which contains visual and audio information, provides the remote user with selection options that can be selected by observing the cursor and using the control input device.
  • Fig. 1 is a block diagram of a multimedia commumcation system in accordance with the exemplary embodiment of the invention.
  • Fig. 2 is a flowchart of a method of initializing the video processing system in accordance with the exemplary embodiment of the invention.
  • Fig. 3 is a flow chart of a method of providing a graphical user interface to a remote user in accordance with the exemplary embodiment of the invention.
  • FIG. 1 is block diagram of a multimedia commumcation system 100 in accordance with the exemplary embodiment of the invention.
  • the multimedia commumcation system utilizes video conferencing compression, communication and control techmques to provide various types of multimedia services to a remote user. These services may include multimedia on demand, recording, web-casting, video-mail, video streaming, video messaging, video conferencing, user tracking, storage and forwarding services.
  • the remote user may select the type of service, enter user data, and provide other instructional commands to a video processing system 104 providing the services
  • the remote user enters processor control commands through a control input device 110 such as an infrared remote control.
  • the commands are transmitted through a commumcation network 106 using a standard camera control format to the video processing system 104.
  • the video processing system 104 provides a graphical user interface (GUI) to the remote user by transmitting audio and video signals through the commumcation network 106.
  • GUI graphical user interface
  • the GUI includes various multimedia information such as video images, sounds and text.
  • Remote multimedia system equipment (remote equipment) 102 at the remote user location communicates with the video processing system 104 through the commumcation network 106.
  • the remote equipment 102 includes at least a processor 108, a control input device 110 and one video output device 112. Other devices may be connected to the processor 108.
  • the remote equipment may include video input devices 114 such as cameras, image scanners, video cassette recorder (VCR), digital video disc (DVD) players or any other device (114) capable of providing a video image signal or image data .
  • video output devices 112 is a video momtor (112) such as a television or a computer momtor.
  • video output devices 112 may also be connected to the processor 108 such as printers, projectors or VCRs.
  • the output devices 116 include audio output devices such as speakers.
  • Other types of input devices may be coupled to the processor 108 in addition to the video input devices 114.
  • the input devices 118 may include audio input devices (118) such as microphones, compact disc (CD) players, tape players, computers or any other type of device that provides an audio signal.
  • the control mput device 110 may be a joy stick, keyboard, mouse or any other type of apparatus that provides a control signal to the processor.
  • the control input device 110 is an infrared remote control implemented as part of video conferencing system (102).
  • control input device 110 translates mechamcal movements performed by the user into signals that are accepted by the processor 108.
  • the signals may be transmitted to the processor 108 using any of several known techniques.
  • the signals may be transmitted to the processor 108 through a cable, wire, radio frequency link, optical link, infrared link or any other wired or wireless commumcation link.
  • the processor 108 is any processor, microprocessor, processor arrangement or computer capable of performing the functions described herein.
  • Software running on the processor facilitates the overall operation of the remote equipment 102 in addition to performing the specific functions described below.
  • the processor 108 may be implemented as a separate computer such as personal computer (PC) or may be implemented as part of video conferencing system equipment.
  • An audio/video process 120 running on the processor 108 communicates with the input and output devices (110-118) using commumcation interfaces and performs various signal processing as known in the art.
  • the audio/video process 120 is illustrated as part of the processor 108, the functions of the audio /video process 120 may be performed on a separate video processor, computer or ASIC (Application Specific Integrated Circuit) as is known.
  • ASIC Application Specific Integrated Circuit
  • a commumcation interface 122 provides an interface to the commumcation network 106 and facilitates the transmission and reception of audio, video, data and control signals.
  • the commumcation interface 122 may be implemented in a variety of ways depending on the type of multimedia system 100 and commumcation network. For example, if the commumcation network 106 is an ISDN network, the commumcation interface 122 formats the outgoing video, audio, data or conttol signals usmg the appropriate signaling and protocol for ISDN commumcation. Incoming signals are deciphered in accordance with the ISDN protocol.
  • the communication network 106 is a computer network such as the Internet
  • the commumcation interface 122 formats outgoing signals and deciphers incoming signals using the TCP/IP protocols.
  • communications through the commumcation network comply with the ITU-T video conferencing standards.
  • a conttol process 124 receives and interprets the signals produced by the conttol input device 110. If the signals are intended for a user device located at the user's location such as a camera (114), the conttol process forwards the signals to the appropriate interface or device in accordance with known techniques. Otherwise, the conttol signals are translated into the digital format for transmission through the commumcation interface 122.
  • the input conttol device 110 is a remote conttol device which includes function keys for controlling the pan, tilt, and zoom parameters of a remote or user camera (114). The input conttol device 110 may also be used to select a video source or preset camera location command sets in addition to providing an interface for performing various call commands.
  • the input control device 110 may be used to start or end a video call, or to dial a phone number.
  • the input control device 110 may mclude other types of function keys such as a key pad for entering numbers.
  • the processor conttol commands entered using the camera conttol functions therefore, are interpreted as camera conttol signals.
  • the input conttol process 124 formats the signals in accordance with a camera conttol format to produce standard camera control signals.
  • the camera conttol format complies with the ITU-T H.281 standard governing camera conttol signaling.
  • Other types of camera conttol formats may be used to transmit the conttol signals generated by the control input device 110.
  • the camera conttol format may be a standard camera signaling scheme accepted by an industry or a standard adopted or recommended by a standardization body.
  • the commumcation network 106 may be any one of several suitable commumcation networks including circuit switched networks such as POTS and ISDN or packets switched networks such, LANs, WANs, intranets, or internets. In addition to the various wire-line networks, the commumcation network 106 may be a wireless network. Those skilled in the art will recognize the various protocols, systems and interfaces that may be used for the commumcation network 106.
  • Video, audio, data and conttol signals are transmitted from the remote equipment 102 through the commumcation network 106 in accordance with the format and protocols required by the particular commumcation network 106 and multimedia system 100.
  • the H.32X standard applies to the various types of standard protocols used for video commumcation as set out by the ITU-T standards.
  • the video processing system 104 includes a multimedia codec (COder- DECoder) 126 for translating the compressed digital signals received through the commumcation network 106 to the appropriate signal format and converts outgoing signals for transmission through the network 106 to the appropriate format dictated by the particular multimedia system 100 and communication network 106.
  • the multimedia codec 126 is a Polycom ViewStation that includes a conttol translator process 128 as described below.
  • the multimedia codec 126 may be any configuration of hardware and software for implementing the functions described herein.
  • a processor 130 coupled to the multimedia codec 126 performs various signal routing, processmg, managing and conttol functions for facilitating multimedia services to the remote user. These functions may include transmitting or receiving video, audio, data or conttol signals to and from various sources. For example, video signals may be ret ⁇ eved or forwarded to an external video device 132 such as a VCR, DVD player, computer, memory device, camera, image scanner, printer, projector, video monitor or any other type of device (132) capable of transmitting or receiving video signals. Audio signals may be transmitted and received from an external audio device 134 such as a tape player, CD player, microphone, or speaker. Data and control signals may be coupled between the processor and the video and audio devices 132, 134.
  • video signals may be ret ⁇ eved or forwarded to an external video device 132 such as a VCR, DVD player, computer, memory device, camera, image scanner, printer, projector, video monitor or any other type of device (132) capable of transmitting or receiving video signals.
  • Audio signals may
  • controls such as PLAY, FAST FORWARD, REVERSE, RECORD and STOP may be transmitted to a VCR (132) or digital recording device such as multimedia computer coupled to the processor 130.
  • Data devices 136 such as computers, processors, databases, and memory devices may also be connected to the processor 130.
  • Various data may be transmitted and received from the data device 136 such as stored video images, graphics or text messages.
  • Instructions and software running on the processor 130 allow the processor 130 to provide various services to the remote user such as multimedia on demand, recording, web-casting, video-mail, video streaming, video messaging, video conferencing, user tracking, storage and forwarding services.
  • an audience 154 is coupled to the video processing system 104 through a multimedia server 150 and another commumcation network 152 to provide multimedia services such as web-casting or video conferencing.
  • the two commumcation networks 106, 152 may be the same network, may share equipment and resources or may be connected to each other.
  • Those skilled m the art will recognize the various types of video services that can be provided using the multimedia system 100 and the various configurations of networks (106, 152) , multimedia servers (104, 150) and multimedia equipment.
  • the invention is not intended to be limited to any particular video service or embodiment.
  • the processor 130 is a personal computer (PC) having dual 500 MHz Pentium III processors and running a Windows NT operating system.
  • the processor 130 may by any computer, processor, microprocessor, or processor arrangement having sufficient speed and memory for performing the functions desc ⁇ bed herein. Further, other operating systems may be used such as Windows 2000 or LINUX.
  • the conttol translator process 128 is implemented in software code running on the multimedia codec 126.
  • This code is typically provided as part of the multimedia codec 126 to provide the appropnate signals (mechamsm conttol signals) for controlling mechanical mechamsms of a camera.
  • Other types of software code may be used, however, to perform the functions of the conttol translator process 128.
  • the conttol translator process 128 translates the processor control commands received at the multimedia codec 126 in the standard format to a camera mechamsm format. Therefore, the signals received at the multimedia codec 12(> are converted into the appropriate signaling format (camera mechanism format) to conttol a camera.
  • the camera mechanism formats typically employ various voltage levels applied to control inputs on the camera.
  • the camera mechanism signals may comply with any type of signaling scheme used to conttol the camera.
  • the camera mechanism control signals may correspond to a digital zoom for a digital camera rather than a mechanical zoom performed by mechanically adjusting the position of a lens in the camera.
  • the processor conttol commands are preferably transmitted in accordance with the ITU-T H.281 Standard.
  • the control translator process 128 converts the H.281 conttol signals into a camera mechanism format in accordance with the ViscaTM format that can be interpreted by a Sony camera.
  • the various camera manufacturers use different camera mechanism formats for controlling the camera. Other types of standard or unique mechanism control formats may be used.
  • the camera mechanism signals are coupled to the processor 130 by connecting the output of a conttol connector at the multimedia codec 126 to a serial input port on the processor 130.
  • a printed circuit board designed to couple the multimedia codec 126 to a camera is adapted to provide an interface to the serial port on the processor 130 and a video interface to a video connector on the processor 130.
  • the camera mechanism signals are transmitted from the multimedia codec 126 to the processor 130 and acknowledgment messages are transmitted from the processor 130 to the multimedia codec 126 through the serial data cable 138.
  • video signals are transmitted from the processor 130 to the multimedia codec 126 through a multimedia cable 140 and communication and data signals are coupled between the multimedia codec 126 and the processor using Signaling Network Management Protocol (SNMP) and Transmission Control Protocol/ Internet Protocol (TCP/IP) through a data interface 142.
  • SNMP Signaling Network Management Protocol
  • TCP/IP Transmission Control Protocol/ Internet Protocol
  • a graphic user interface (GUI) process 144 running on the processor 130 provides a GUI to the remote user by producing an audio and video signal at the multimedia output of the processor 130 that is received by the multimedia codec 126, translated into the approp ⁇ ate digital format and transmitted to the remote equipment 102.
  • the GUI may provide other types of multimedia information to the remote user and is preferably displayed on a video momtor (112).
  • the GUI process 144 is implemented in computer software code that runs on the processor 130.
  • the GUI process 144 mteracts with a signal adapter process 146 and a GUI interface process 148 running on the processor 130.
  • the signal adapter process 146 and the GUI interface process 148 are implemented as Dynamic Link Library (DLL) software modules.
  • DLL modules are executable code modules that are loaded on demand and linked at run time This allows library code to be field- updated ttanêtdy to application software and unloaded on completion.
  • Other types of software code may be used to perform the functions of the signal adapter process 146 and the GUI interface process 148.
  • Commumcation between the processes (128, 144, 146, 148) are executed using known techniques
  • the various boxes representing the processes (144, 146, 148) running on the processor 130 are drawn with dashed lines to illustrate that the processes (144, 146, 148) may be implemented in other ways or on other processors.
  • the described process functions may be distributed between the processes (144, 146, 148) or other code in other arrangements Further, the described functions may be implemented m code of a single process Also, the signal adapter process 146 may be implemented in code residing in the multimedia codec 126
  • the GUI interface process 148 couples the GUI process 146 to the multimedia codec 126 by communicating with the multimedia codec 126 using SNMP and TCP/IP over the data commumcation interface 142.
  • the multimedia codec 126 negotiates the va ⁇ ous parameters for communication with the remote equipment 102 and starts a video call according to the appropriate communication standards and in accordance with known techmques.
  • the GUI interface process 148 receives, through the data commumcation mterface 142, a call notification indicating that a video call has been initiated and notifies the GUI process 144.
  • the GUI process 144 monitors the appropnate se ⁇ al port (e.g. COMM1) for activity through the signal adapter process 146.
  • processor conttol commands entered by the remote user are ttansmitted through the commumcation network 106 m a camera control format, received by the multimedia codec 126 and translated into a camera mechamsm format.
  • the camera mechamsm signals are received by the processor 130 through the serial port.
  • the signal adapter process 146 converts the camera mechanism signals into computer input signals that can be interpreted by the GUI process 144.
  • the resultant computer input signals in the computer input format correspond to cursor movement commands and relate to the position of a cursor produced on the GUI
  • the signal adapter process 146 sends an appropnate acknowledgement message to the multimedia codec 126 using the camera mechamsm format when the camera mechamsm signal is received.
  • the cursor may be any type of video image, symbol or visual indicator that can be perceived by the remote user.
  • suitable cursors include visible arrows, lines, and shapes such as rectangles, circles and ovals.
  • the GUI generated by the GUI process 144 and including the cursor are produced at the multimedia output connector on the processor 130.
  • the resulting audio and video signals are transmitted through the multimedia cable 140 to the multimedia codec 126.
  • the multimedia codec 126 compresses and formats the GUI multimedia signals in accordance with the transmission protocol used in the commumcation link between the remote equipment 102 and the multimedia codec 126.
  • the formatted digital signals are ttansmitted through the commumcation network 106 to the remote equipment 102.
  • the formatted digital signals are converted to an appropriate analog or digital output and displayed to the remote user.
  • the formatted digital signals are converted into either NTSC or VGA compatible signals and supplied to the video momtor (112).
  • the multimedia signals ttansmitted from the processor 130 to the multimedia codec 126 may include audio, video or data signals and may be in an analog or digital format.
  • the connection may be a multimedia connection often referred to as a "fire-wire" link. Therefore, processor conttol commands entered by the remote user and ttansmitted in a camera control format are translated mto cursor movement commands as interpreted by the processor 130 at the video processing system 104 and displayed to the remote user by transmitting a GUI through the commumcation network 106.
  • an action by the remote user corresponding to a camera pan left command is ttansmitted using the standard camera conttol format such as H.281 to the multimedia codec 126 through the communication network 106.
  • the conttol translator process 128 translates the standard camera conttol signal in the camera control format to a camera mechamsm format and forwards the resulting camera mechanism conttol signal to the processor 130.
  • the signal adapter process 146 adapts the camera mechamsm control signal to a computer input signal corresponding to a cursor command to pan left.
  • the GUI process 144 running on the processor 130 displays the cursor movement and location on the GUI.
  • the video signal corresponding to the GUI is transmitted to the multimedia codec 126, converted into a digital signal and ttansmitted through the commumcation network 106.
  • the remote equipment 102 interprets the digital signals and produces a video image on the video momtor displaying the GUI with the cursor moving left at the appropriate location in the GUI.
  • the remote user therefore, receives visual feedback of the control mput device 110 output as a cursor movement on the GUI.
  • Other multimedia information mcluding sound may be included in the GUI and presented to the remote user.
  • FIG. 2 is a flow chart of a method of initializing the video processing system 104 when a connection is established between the remote equipment 102 and the video processing system 104.
  • the multimedia codec 126 receives a call request message indicating that the remote user is attempting to establish a video call.
  • the protocol used to transmit the message depends on the particular communication network 106 and multimedia system 100.
  • the call request message is ttansmitted m accordance with the ITU-T H.32X standards.
  • the message is received and deciphered using known techmques.
  • the multimedia codec 126 communicates with the remote equipment 102 to establish video call parameters.
  • the negotiation methods used in establishing the video call parameters are in accordance with known methods and protocols.
  • the multimedia codec 126 through the data communication link
  • the GUI interface process 148 running on the processor 130 interprets the notification ttansmitted from the multimedia codec 126 through the data communication link 142 using the TCP/IP protocols.
  • the GUI process 144 receives the video call notification through the GUI interface process 148.
  • the GUI process 144 in response to the video call notification, begins a new user session.
  • the GUI process 144 monitors signals received through the signal adapter process 146.
  • the signal adapter 146 process launches, it opens the processor 130 serial port that is connected to the multimedia codec 126 camera conttol connector. Camera mechanism signals received from the multimedia codec
  • GUI process 126 are translated into a computer format and forwarded to the GUI process 144.
  • FIG. 3 a is flow chart of a method of providing the GUI to the remote user in accordance with the exemplary embodiment of the invention.
  • a processor conttol command in a camera conttol format is received at the video processing system 104.
  • the processor control command is produced by an conttol input device 110 coupled within the remote equipment 102 as described above.
  • the processor control commands are transmitted though the communication network 106 in accordance with the ITU- T H.281 Standard.
  • the processor conttol command is converted to a camera mechamsm signal conforming to a camera mechanism format.
  • the control translator process 128 running on a processor in the multimedia codec 126 converts the processor conttol commands to the camera mechanism signal complying with the ViscaTM signaling format.
  • the camera mechanism signal is received at the processor 130.
  • the signal adapter process 146 running on the processor 130 converts the camera mechanism signal to a computer input signal corresponding to a cursor movement.
  • the signal adapter process 146 sends the appropriate acknowledgment message to the multimedia codec 126 at step 308.
  • the camera mechanism conttol signaling involves a two way connection where the camera acknowledges received camera conttol signals. Accordingly, the signal adapter process 146 emulating a camera generates and transmits the acknowledgment message corresponding to the camera conttol command usmg the camera mechanism format.
  • the GUI process 144 generates the GUI.
  • the GUI mcludes options that may be selected by the remote user. Examples of selection options that may be provided and displayed to the remote user include choosing a multimedia service, signal routing options, secu ⁇ ty options, or choosing a file for review.
  • the GUI provides an interface for the user to enter user information such as a name, password or PIN.
  • the GUI may provide HELP information that may be presented as text, visual demonstration, audible mstructions, or other types or combinations of multimedia information.
  • a short audio/visual demonstration may be presented through the GUI to provide the HELP information
  • the remote user may select from a plurality of selection options or perform a variety of functions usmg a GUI similar to a GUI typically provided by or through a PC.
  • a multimedia signal corresponding to the GUI is produced at the processor 130.
  • known techniques are used to create the multimedia signal at the standard video connector of the processor 130.
  • the multimedia signal mcludes at least the various images representing the options to the user and the cursor and may include other suitable information to the remote user.
  • the video images from an external video device 132 is combmed in the GUI.
  • the video image produced by a VCR (132) may be included m a portion of the GUI.
  • multiple video images produced by multiple video devices 134 may be displayed within the GUI.
  • the multimedia signal is coupled to the multimedia codec 126.
  • a standard multimedia cable connecting the processor 130 to the multimedia codec 126 is used to convey the multimedia signal to the multimedia codec 126 in a standard signaling format such as those complying with NTSC (National Television Standards Committee), MPEG (Motion Picture Experts Group), AUI, ASI (Advanced Services Implementation), Quicktime, IEEE 1394, and USB (Universal Serial Buss).
  • NTSC National Television Standards Committee
  • MPEG Motion Picture Experts Group
  • AUI Advanced Services Implementation
  • ASI Advanced Services Implementation
  • Quicktime IEEE 1394
  • USB Universal Serial Buss
  • Other wired or wireless signaling formats may be used to transmit the multimedia information from the processor 130 to the multimedia code 126.
  • the multimedia signal is compressed and formatted in accordance with the transmission protocol of the commumcation network 106 and required by the multimedia system 100. As explamed above, the transmission protocol in the exemplary embodiment complies with the ITU-T H.32X standards.
  • the compressed digital signal representing the GUI is transmitted through the commumcation network
  • a video processmg system 104 receives processor conttol commands m a standard camera conttol format through a commumcation network 106 from a remote user.
  • the commands are converted to computer mput signals readable by a processor 130 in the video processmg system 104.
  • the computer mput signals are interpreted by the processor 130 and executed.
  • Visual feedback is provided to the remote user by transmitting a video signal reflecting the remote users commands.
  • the remote user therefore, can use a standard camera controlling device (110) or other control mput device 110 to manipulate a cursor on a GUI produced by the processor 130 at the video processmg system 104 through a commumcation network 106 without special software or hardware at the user's remote location.
  • Smce standard camera control formats are used to convey the conttol commands, the video processmg system 104 can be controlled by any multimedia equipment complying with the standards.

Abstract

Camera control signals generated at a remote user location are received through a communication network at a video processing system. The camera control signals are adapted to provide input controls to a processor at the video processing system. Computer functions are controlled using the camera control signals. The camera control signals may be used to control a cursor on a computer video output display. Feedback is provided to the remote user through the communication network by transmitting the output of the processor through the network as video data. Any video equipment at the remote user premises using a camera control protocol conforming to an accepted camera control signaling standard can be used to provide computer control information through the camera control device.

Description

SYSTEM, APPARATUS AND METHOD FOR CONTROLLING COMMUNICATIONS IN A MULTIMEDIA COMMUNICATION
NETWORK
BACKGROUND OF THE INVENTION
The invention relates in general to multimedia systems and more specifically to systems and methods for controlling a processor in a multimedia communication system. Multimedia commumcation systems are used for transmitting and receiving a wide variety of audio, video, data and control signals. Video conferencing systems and other multimedia systems frequendy utilize existing communication networks including circuit switched networks such as ISDN (Integrated Service Digital Networks) and POTS (Plain Old Telephone Service) and packet switched networks such as the Internet and intranets as well as private commumcation networks and LANs (Local Area Networks) for establishing a commumcation link. The ITU (International Telecommunication Umon) has developed a set of recommended standards for multimedia commumcation that provide camera control protocols in order to foster compatibility between various multimedia system manufacturers and service providers. For example, a family of ITU standards often referred to as the H.32X standards include recommended standards for communicating over POTS (H.324), ISDN (H.310, H.320, H.321), ATM (H.310), switched digital and leased lines (H.320), the Internet (H.323), LANs (H.323, H.322), and Isochronous Ethernet (H.322) The related ITU H.281 standard provides a common protocol for controlling a remote camera through a network connection that has been established using one of the H.32X standards
Multimedia systems often utilize a central location as a processing and distribution center for video calls and other multimedia commumcation. The video processing system at the central location typically includes a computer or other processor that interfaces to the commumcation network through a codec (COder- DECoder). Video and audio signals are transmitted from a remote user location through the central location to one or more final destinations. A user at the remote location can send instructions to the video processing system to control the distribution and content of multimedia signals. Conventional systems are limited, however, in that the user at the remote location does not receive visual or audible feedback regarding the requested instruction or command unless the device receiving the commands is a properly equipped camera. Further, conventional systems that provide for control communications to be received by the video processing system typically are not compatible with similar systems provided by other manufacturers. The camera control format in accepted use today is the ITU-T H.281 standard which provides protocol for camera preset (location preset), pan, tilt, and zoom controls. Most conventional video conferencing equipment does not include the necessary hardware and software for utilizing a widely accepted control protocol or method for controlling devices other than cameras.
Therefore, there is a need for a multimedia commumcation system, apparatus and method for controlling communications through a video processing system at a central location.
SUMMARY OF THE INVENTION Remote multimedia equipment at a remote user location transmits processor commands through a commumcation network to a video processing system using a camera control format. Visual feedback is provided to the remote user through a graphic user interface by transmitting a video signal from the video processing system to the remote user. In an exemplary embodiment, the remote user accesses the processor at the central location through the commumcation network and provides commands using the ITU-T H.281 standard format for camera control. A command entered through a control input device is translated to the camera control format and transmitted through the commumcation network to the video processing system. The video processing system converts the processor control command from a camera control format to a computer input format that can be interpreted by the processor. In the exemplary embodiment, the processor control commands correspond to a movement of cursor or image on a graphical user interface (GUI) The video signal produced by the video processing system corresponding to the GUI is transmitted to the remote equipment through the commumcation network and displayed on a video output device such as video monitor at the remote user location. Audio information contained in the GUI is transmitted through the commumcation network and produced through an audio output device such as a speaker at the remote user location. The GUI, which contains visual and audio information, provides the remote user with selection options that can be selected by observing the cursor and using the control input device.
BRIEF DESCRIPTION OF THE DRAWINGS
Fig. 1 is a block diagram of a multimedia commumcation system in accordance with the exemplary embodiment of the invention.
Fig. 2 is a flowchart of a method of initializing the video processing system in accordance with the exemplary embodiment of the invention. Fig. 3 is a flow chart of a method of providing a graphical user interface to a remote user in accordance with the exemplary embodiment of the invention.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
FIG. 1 is block diagram of a multimedia commumcation system 100 in accordance with the exemplary embodiment of the invention. Although the invention may be implemented in other types of multimedia commumcation systems, the multimedia commumcation system utilizes video conferencing compression, communication and control techmques to provide various types of multimedia services to a remote user. These services may include multimedia on demand, recording, web-casting, video-mail, video streaming, video messaging, video conferencing, user tracking, storage and forwarding services.
As will be discussed below in more detail, the remote user may select the type of service, enter user data, and provide other instructional commands to a video processing system 104 providing the services The remote user enters processor control commands through a control input device 110 such as an infrared remote control. The commands are transmitted through a commumcation network 106 using a standard camera control format to the video processing system 104. The video processing system 104 provides a graphical user interface (GUI) to the remote user by transmitting audio and video signals through the commumcation network 106. In addition to providing feedback to the remote user in response to the processor control command, the GUI includes various multimedia information such as video images, sounds and text.
Remote multimedia system equipment (remote equipment) 102 at the remote user location communicates with the video processing system 104 through the commumcation network 106. The remote equipment 102 includes at least a processor 108, a control input device 110 and one video output device 112. Other devices may be connected to the processor 108. For example, the remote equipment may include video input devices 114 such as cameras, image scanners, video cassette recorder (VCR), digital video disc (DVD) players or any other device (114) capable of providing a video image signal or image data . Preferably, at least one of the video output devices 112 is a video momtor (112) such as a television or a computer momtor. Other types of video output devices 112 may also be connected to the processor 108 such as printers, projectors or VCRs. In the exemplary embodiment, the output devices 116 include audio output devices such as speakers. Other types of input devices may be coupled to the processor 108 in addition to the video input devices 114. For example, the input devices 118 may include audio input devices (118) such as microphones, compact disc (CD) players, tape players, computers or any other type of device that provides an audio signal. The control mput device 110 may be a joy stick, keyboard, mouse or any other type of apparatus that provides a control signal to the processor. In the exemplary embodiment, the control input device 110 is an infrared remote control implemented as part of video conferencing system (102). Preferably, the control input device 110 translates mechamcal movements performed by the user into signals that are accepted by the processor 108. The signals may be transmitted to the processor 108 using any of several known techniques. The signals may be transmitted to the processor 108 through a cable, wire, radio frequency link, optical link, infrared link or any other wired or wireless commumcation link.
The processor 108 is any processor, microprocessor, processor arrangement or computer capable of performing the functions described herein. Software running on the processor facilitates the overall operation of the remote equipment 102 in addition to performing the specific functions described below. The processor 108 may be implemented as a separate computer such as personal computer (PC) or may be implemented as part of video conferencing system equipment. An audio/video process 120 running on the processor 108 communicates with the input and output devices (110-118) using commumcation interfaces and performs various signal processing as known in the art. Although, the audio/video process 120 is illustrated as part of the processor 108, the functions of the audio /video process 120 may be performed on a separate video processor, computer or ASIC (Application Specific Integrated Circuit) as is known.
A commumcation interface 122 provides an interface to the commumcation network 106 and facilitates the transmission and reception of audio, video, data and control signals. The commumcation interface 122 may be implemented in a variety of ways depending on the type of multimedia system 100 and commumcation network. For example, if the commumcation network 106 is an ISDN network, the commumcation interface 122 formats the outgoing video, audio, data or conttol signals usmg the appropriate signaling and protocol for ISDN commumcation. Incoming signals are deciphered in accordance with the ISDN protocol. If, on the other hand, the communication network 106 is a computer network such as the Internet, the commumcation interface 122 formats outgoing signals and deciphers incoming signals using the TCP/IP protocols. In the exemplary embodiment, communications through the commumcation network comply with the ITU-T video conferencing standards.
A conttol process 124 receives and interprets the signals produced by the conttol input device 110. If the signals are intended for a user device located at the user's location such as a camera (114), the conttol process forwards the signals to the appropriate interface or device in accordance with known techniques. Otherwise, the conttol signals are translated into the digital format for transmission through the commumcation interface 122. In the exemplary embodiment, the input conttol device 110 is a remote conttol device which includes function keys for controlling the pan, tilt, and zoom parameters of a remote or user camera (114). The input conttol device 110 may also be used to select a video source or preset camera location command sets in addition to providing an interface for performing various call commands. For example, the input control device 110 may be used to start or end a video call, or to dial a phone number. In addition to function keys such as up- down, left-right arrows, the input control device 110 may mclude other types of function keys such as a key pad for entering numbers. The processor conttol commands entered using the camera conttol functions, therefore, are interpreted as camera conttol signals. The input conttol process 124 formats the signals in accordance with a camera conttol format to produce standard camera control signals. In the exemplary embodiment, the camera conttol format complies with the ITU-T H.281 standard governing camera conttol signaling. Other types of camera conttol formats, however, may be used to transmit the conttol signals generated by the control input device 110. The camera conttol format may be a standard camera signaling scheme accepted by an industry or a standard adopted or recommended by a standardization body. The commumcation network 106 may be any one of several suitable commumcation networks including circuit switched networks such as POTS and ISDN or packets switched networks such, LANs, WANs, intranets, or internets. In addition to the various wire-line networks, the commumcation network 106 may be a wireless network. Those skilled in the art will recognize the various protocols, systems and interfaces that may be used for the commumcation network 106.
Video, audio, data and conttol signals are transmitted from the remote equipment 102 through the commumcation network 106 in accordance with the format and protocols required by the particular commumcation network 106 and multimedia system 100. As is known, the H.32X standard applies to the various types of standard protocols used for video commumcation as set out by the ITU-T standards.
The video processing system 104 includes a multimedia codec (COder- DECoder) 126 for translating the compressed digital signals received through the commumcation network 106 to the appropriate signal format and converts outgoing signals for transmission through the network 106 to the appropriate format dictated by the particular multimedia system 100 and communication network 106. In the exemplary embodiment, the multimedia codec 126 is a Polycom ViewStation that includes a conttol translator process 128 as described below. The multimedia codec 126 however, may be any configuration of hardware and software for implementing the functions described herein.
A processor 130 coupled to the multimedia codec 126 performs various signal routing, processmg, managing and conttol functions for facilitating multimedia services to the remote user. These functions may include transmitting or receiving video, audio, data or conttol signals to and from various sources. For example, video signals may be retπeved or forwarded to an external video device 132 such as a VCR, DVD player, computer, memory device, camera, image scanner, printer, projector, video monitor or any other type of device (132) capable of transmitting or receiving video signals. Audio signals may be transmitted and received from an external audio device 134 such as a tape player, CD player, microphone, or speaker. Data and control signals may be coupled between the processor and the video and audio devices 132, 134. For example, controls such as PLAY, FAST FORWARD, REVERSE, RECORD and STOP may be transmitted to a VCR (132) or digital recording device such as multimedia computer coupled to the processor 130. Data devices 136 such as computers, processors, databases, and memory devices may also be connected to the processor 130. Various data may be transmitted and received from the data device 136 such as stored video images, graphics or text messages.
Instructions and software running on the processor 130 allow the processor 130 to provide various services to the remote user such as multimedia on demand, recording, web-casting, video-mail, video streaming, video messaging, video conferencing, user tracking, storage and forwarding services. In the exemplary embodiment, an audience 154 is coupled to the video processing system 104 through a multimedia server 150 and another commumcation network 152 to provide multimedia services such as web-casting or video conferencing. The two commumcation networks 106, 152 may be the same network, may share equipment and resources or may be connected to each other. Those skilled m the art will recognize the various types of video services that can be provided using the multimedia system 100 and the various configurations of networks (106, 152) , multimedia servers (104, 150) and multimedia equipment. The invention is not intended to be limited to any particular video service or embodiment.
In the exemplary embodiment, the processor 130 is a personal computer (PC) having dual 500 MHz Pentium III processors and running a Windows NT operating system. The processor 130 may by any computer, processor, microprocessor, or processor arrangement having sufficient speed and memory for performing the functions descπbed herein. Further, other operating systems may be used such as Windows 2000 or LINUX.
In the exemplary embodiment, the conttol translator process 128 is implemented in software code running on the multimedia codec 126. This code is typically provided as part of the multimedia codec 126 to provide the appropnate signals (mechamsm conttol signals) for controlling mechanical mechamsms of a camera. Other types of software code may be used, however, to perform the functions of the conttol translator process 128. The conttol translator process 128 translates the processor control commands received at the multimedia codec 126 in the standard format to a camera mechamsm format. Therefore, the signals received at the multimedia codec 12(> are converted into the appropriate signaling format (camera mechanism format) to conttol a camera. The camera mechanism formats typically employ various voltage levels applied to control inputs on the camera. Although many cameras include electric motors for controlling the various parameters, the camera mechanism signals may comply with any type of signaling scheme used to conttol the camera. For example the camera mechanism control signals may correspond to a digital zoom for a digital camera rather than a mechanical zoom performed by mechanically adjusting the position of a lens in the camera. As discussed above, the processor conttol commands are preferably transmitted in accordance with the ITU-T H.281 Standard. In the exemplary embodiment, the control translator process 128 converts the H.281 conttol signals into a camera mechanism format in accordance with the Visca™ format that can be interpreted by a Sony camera. As is known, the various camera manufacturers use different camera mechanism formats for controlling the camera. Other types of standard or unique mechanism control formats may be used. Although standard connectors may be used in various multimedia codecs (126) and cameras, the pin configurations on the connectors and voltage levels may differ between different manufacturers. The camera mechanism signals are coupled to the processor 130 by connecting the output of a conttol connector at the multimedia codec 126 to a serial input port on the processor 130. In the exemplary embodiment, a printed circuit board designed to couple the multimedia codec 126 to a camera is adapted to provide an interface to the serial port on the processor 130 and a video interface to a video connector on the processor 130. A serial data cable 138 connected between the multimedia codec 126 and the processor that provides a bi-directional communication interface between the two devices (126, 130). The camera mechanism signals are transmitted from the multimedia codec 126 to the processor 130 and acknowledgment messages are transmitted from the processor 130 to the multimedia codec 126 through the serial data cable 138. As explained below in further detail, video signals are transmitted from the processor 130 to the multimedia codec 126 through a multimedia cable 140 and communication and data signals are coupled between the multimedia codec 126 and the processor using Signaling Network Management Protocol (SNMP) and Transmission Control Protocol/ Internet Protocol (TCP/IP) through a data interface 142.
A graphic user interface (GUI) process 144 running on the processor 130 provides a GUI to the remote user by producing an audio and video signal at the multimedia output of the processor 130 that is received by the multimedia codec 126, translated into the appropπate digital format and transmitted to the remote equipment 102. In addition to the vaπous selection options, the GUI may provide other types of multimedia information to the remote user and is preferably displayed on a video momtor (112). In the exemplary embodiment, the GUI process 144 is implemented in computer software code that runs on the processor 130. The GUI process 144 mteracts with a signal adapter process 146 and a GUI interface process 148 running on the processor 130. In the exemplary embodiment, the signal adapter process 146 and the GUI interface process 148 are implemented as Dynamic Link Library (DLL) software modules. DLL modules are executable code modules that are loaded on demand and linked at run time This allows library code to be field- updated ttansparendy to application software and unloaded on completion. Other types of software code may be used to perform the functions of the signal adapter process 146 and the GUI interface process 148. Commumcation between the processes (128, 144, 146, 148) are executed using known techniques The various boxes representing the processes (144, 146, 148) running on the processor 130 are drawn with dashed lines to illustrate that the processes (144, 146, 148) may be implemented in other ways or on other processors. For example, the described process functions may be distributed between the processes (144, 146, 148) or other code in other arrangements Further, the described functions may be implemented m code of a single process Also, the signal adapter process 146 may be implemented in code residing in the multimedia codec 126
The GUI interface process 148 couples the GUI process 146 to the multimedia codec 126 by communicating with the multimedia codec 126 using SNMP and TCP/IP over the data commumcation interface 142. When the remote user initiates commumcation with the multimedia codec 126, the multimedia codec 126 negotiates the vaπous parameters for communication with the remote equipment 102 and starts a video call according to the appropriate communication standards and in accordance with known techmques. The GUI interface process 148 receives, through the data commumcation mterface 142, a call notification indicating that a video call has been initiated and notifies the GUI process 144.
The GUI process 144 monitors the appropnate seπal port (e.g. COMM1) for activity through the signal adapter process 146. As explained above, processor conttol commands entered by the remote user are ttansmitted through the commumcation network 106 m a camera control format, received by the multimedia codec 126 and translated into a camera mechamsm format. The camera mechamsm signals are received by the processor 130 through the serial port. The signal adapter process 146 converts the camera mechanism signals into computer input signals that can be interpreted by the GUI process 144. In the exemplary embodiment, the resultant computer input signals in the computer input format correspond to cursor movement commands and relate to the position of a cursor produced on the GUI The signal adapter process 146 sends an appropnate acknowledgement message to the multimedia codec 126 using the camera mechamsm format when the camera mechamsm signal is received.
The cursor may be any type of video image, symbol or visual indicator that can be perceived by the remote user. Examples of suitable cursors include visible arrows, lines, and shapes such as rectangles, circles and ovals.
The GUI generated by the GUI process 144 and including the cursor are produced at the multimedia output connector on the processor 130. The resulting audio and video signals are transmitted through the multimedia cable 140 to the multimedia codec 126. The multimedia codec 126 compresses and formats the GUI multimedia signals in accordance with the transmission protocol used in the commumcation link between the remote equipment 102 and the multimedia codec 126. The formatted digital signals are ttansmitted through the commumcation network 106 to the remote equipment 102. The formatted digital signals are converted to an appropriate analog or digital output and displayed to the remote user. In the exemplary embodiment, the formatted digital signals are converted into either NTSC or VGA compatible signals and supplied to the video momtor (112). The multimedia signals ttansmitted from the processor 130 to the multimedia codec 126 may include audio, video or data signals and may be in an analog or digital format. For example, the connection may be a multimedia connection often referred to as a "fire-wire" link. Therefore, processor conttol commands entered by the remote user and ttansmitted in a camera control format are translated mto cursor movement commands as interpreted by the processor 130 at the video processing system 104 and displayed to the remote user by transmitting a GUI through the commumcation network 106. For example, an action by the remote user corresponding to a camera pan left command is ttansmitted using the standard camera conttol format such as H.281 to the multimedia codec 126 through the communication network 106. The conttol translator process 128 translates the standard camera conttol signal in the camera control format to a camera mechamsm format and forwards the resulting camera mechanism conttol signal to the processor 130. The signal adapter process 146 adapts the camera mechamsm control signal to a computer input signal corresponding to a cursor command to pan left. The GUI process 144 running on the processor 130 displays the cursor movement and location on the GUI. The video signal corresponding to the GUI is transmitted to the multimedia codec 126, converted into a digital signal and ttansmitted through the commumcation network 106. The remote equipment 102 interprets the digital signals and produces a video image on the video momtor displaying the GUI with the cursor moving left at the appropriate location in the GUI. The remote user, therefore, receives visual feedback of the control mput device 110 output as a cursor movement on the GUI. Other multimedia information mcluding sound may be included in the GUI and presented to the remote user.
FIG. 2 is a flow chart of a method of initializing the video processing system 104 when a connection is established between the remote equipment 102 and the video processing system 104. At step 202 , the multimedia codec 126 receives a call request message indicating that the remote user is attempting to establish a video call. The protocol used to transmit the message depends on the particular communication network 106 and multimedia system 100. In the exemplary embodiment, the call request message is ttansmitted m accordance with the ITU-T H.32X standards. The message is received and deciphered using known techmques.
At step 204, the multimedia codec 126 communicates with the remote equipment 102 to establish video call parameters. The negotiation methods used in establishing the video call parameters are in accordance with known methods and protocols. At step 206, the multimedia codec 126, through the data communication link
142, notifies the processor 130 that a video call has been established. The GUI interface process 148 running on the processor 130 interprets the notification ttansmitted from the multimedia codec 126 through the data communication link 142 using the TCP/IP protocols.
At step 208, the GUI process 144 receives the video call notification through the GUI interface process 148. The GUI process 144, in response to the video call notification, begins a new user session.
At step 210, the GUI process 144 monitors signals received through the signal adapter process 146. When the signal adapter 146 process is launched, it opens the processor 130 serial port that is connected to the multimedia codec 126 camera conttol connector. Camera mechanism signals received from the multimedia codec
126 are translated into a computer format and forwarded to the GUI process 144.
FIG. 3 a is flow chart of a method of providing the GUI to the remote user in accordance with the exemplary embodiment of the invention.
At step 302, a processor conttol command in a camera conttol format is received at the video processing system 104. The processor control command is produced by an conttol input device 110 coupled within the remote equipment 102 as described above. In the exemplary embodiment, the processor control commands are transmitted though the communication network 106 in accordance with the ITU- T H.281 Standard.
At step 304, the processor conttol command is converted to a camera mechamsm signal conforming to a camera mechanism format. In the exemplary embodiment, the control translator process 128 running on a processor in the multimedia codec 126 converts the processor conttol commands to the camera mechanism signal complying with the Visca™ signaling format.
At step 306, the camera mechanism signal is received at the processor 130. Preferably, the signal adapter process 146 running on the processor 130 converts the camera mechanism signal to a computer input signal corresponding to a cursor movement.
The signal adapter process 146 sends the appropriate acknowledgment message to the multimedia codec 126 at step 308. In the exemplary embodiment, the camera mechanism conttol signaling involves a two way connection where the camera acknowledges received camera conttol signals. Accordingly, the signal adapter process 146 emulating a camera generates and transmits the acknowledgment message corresponding to the camera conttol command usmg the camera mechanism format.
At step 310, the GUI process 144 generates the GUI. The GUI mcludes options that may be selected by the remote user. Examples of selection options that may be provided and displayed to the remote user include choosing a multimedia service, signal routing options, secuπty options, or choosing a file for review. Further, the GUI provides an interface for the user to enter user information such as a name, password or PIN. In addition, the GUI may provide HELP information that may be presented as text, visual demonstration, audible mstructions, or other types or combinations of multimedia information. For example, m response to a user selection of a HELP topic, a short audio/visual demonstration may be presented through the GUI to provide the HELP information Accordingly, the remote user may select from a plurality of selection options or perform a variety of functions usmg a GUI similar to a GUI typically provided by or through a PC.
At step 312, a multimedia signal corresponding to the GUI is produced at the processor 130. In the exemplary embodiment, known techniques are used to create the multimedia signal at the standard video connector of the processor 130. The multimedia signal mcludes at least the various images representing the options to the user and the cursor and may include other suitable information to the remote user.
In an alternate embodiment, the video images from an external video device 132 is combmed in the GUI. For example, the video image produced by a VCR (132) may be included m a portion of the GUI. Further, multiple video images produced by multiple video devices 134 may be displayed within the GUI. At step 314, the multimedia signal is coupled to the multimedia codec 126.
Preferably, a standard multimedia cable connecting the processor 130 to the multimedia codec 126 is used to convey the multimedia signal to the multimedia codec 126 in a standard signaling format such as those complying with NTSC (National Television Standards Committee), MPEG (Motion Picture Experts Group), AUI, ASI (Advanced Services Implementation), Quicktime, IEEE 1394, and USB (Universal Serial Buss). Other wired or wireless signaling formats may be used to transmit the multimedia information from the processor 130 to the multimedia code 126. At step 316, the multimedia signal is compressed and formatted in accordance with the transmission protocol of the commumcation network 106 and required by the multimedia system 100. As explamed above, the transmission protocol in the exemplary embodiment complies with the ITU-T H.32X standards. At step 318, the compressed digital signal representing the GUI is transmitted through the commumcation network 106 to the remote equipment 102, where the signal is decompressed, deciphered and displayed.
Therefore, a video processmg system 104 receives processor conttol commands m a standard camera conttol format through a commumcation network 106 from a remote user. The commands are converted to computer mput signals readable by a processor 130 in the video processmg system 104. The computer mput signals are interpreted by the processor 130 and executed. Visual feedback is provided to the remote user by transmitting a video signal reflecting the remote users commands. The remote user, therefore, can use a standard camera controlling device (110) or other control mput device 110 to manipulate a cursor on a GUI produced by the processor 130 at the video processmg system 104 through a commumcation network 106 without special software or hardware at the user's remote location. Smce standard camera control formats are used to convey the conttol commands, the video processmg system 104 can be controlled by any multimedia equipment complying with the standards.
Clearly, other embodiments and modifications of this invention will occur readily to those of ordinary skill in the art in view of these teachings. Therefore, this invention is to be limited only by following claims, which include all such embodiments and modifications when viewed in conjunction with the above specification and accompanying drawings. WE CLAIM:

Claims

1. A method compπsmg: receiving, at a processor, a processor conttol command ttansmitted from remote multimedia equipment through a commumcation network in accordance with camera conttol format; and executing the processor conttol command at the processor.
2. A method in accordance with claim 1 , further compnsing: converting the processor conttol command from the camera control format to a computer mput format.
3. A method m accordance with claim 2, wherein the camera control format is adopted as a standard by a standardization body.
4. A method m accordance with claim 3, wherein the camera conttol format is in accordance with an International Telecommumcation Umon Standardization Sector (ITU-T) standard.
5. A method in accordance with claim 4, wherein the International
Telecommumcation Umon Standardization Sector standard is a H.281 standard.
6. A method m accordance with claim 3, wherein the camera conttol format is adopted as a standard by an industry.
7. A method in accordance with claim 1, wherein the implementing the computer instruction at the computer compnses: generating a cursor image on a video image, the cursor having a position within the video image in accordance with the computer instruction
8. A method m accordance with claim 7, further compπsmg: transmitting a video signal corresponding to the video image to the remote multimedia device.
9. A method in accordance with claim 8, wherein the video image corresponds to a graphic user interface conveying selection options.
10. A method in accordance with claim 7, wherein the implementing the computer mstruction at the computer further compnses selecting a function m accordance with the computer instruction.
11. A video processmg system compnsing: a multimedia codec adapted to communicate with remote equipment through a commumcation network and to receive a processor conttol command ttansmitted m a camera control format; and a processor coupled to the multimedia codec, the processor adapted to convert the processor conttol command to a computer mput format
12. A video processing system m accordance with claim 11, wherein the processor is further adapted to generate a graphic user interface in accordance with the processor conttol command.
13. A video processmg system in accordance with claim 12, wherein the multimedia codec is further adapted to ttansmit a video signal correspondmg to the graphic user interface to the remote equipment.
14. A video processmg system m accordance with clam 13, wherein the processor conttol command corresponds to a cursor movement.
15. A video processmg system in accordance with claim 14, wherem the multimedia codec is further adapted to convert the processor control command to a camera mechamsm signal, the processor adapted to convert the camera mechamsm signal to a signal m the computer input format.
16 A video processmg system in accordance with claim 15, wherem the camera control format is adopted as a standard by a standardization body
17. A video processmg system in accordance with claim 16, wherein the camera conttol format is in accordance with an International Telecommumcation Umon Standardization Sector (ITU-T) standard.
18. A video processmg system in accordance with claim 17, wherem the
International Telecommumcation Umon Standardization Sector standard is a H.281 standard.
19. A video processmg system in accordance with claim 15, wherem the camera control format is adopted as a standard by an industry
20. A multimedia system comprising: a control mput device coupled to remote equipment, the remote eqmpment adapted to produce a processor conttol command signal in accordance with an output of the control mput device, wherem the processor conttol command signal complies with a camera conttol format; and a video processmg system coupled to the remote eqmpment through a commumcation network, the video processmg system adapted to ttansmit a graphic user mterface to the remote equipment, wherem the graphic user mterface mcludes a cursor havmg a position based on the processor command signal.
21. A multimedia system m accordance with claim 20, wherem the camera control format is adopted as a standard by a standardization body
22 A multimedia system m accordance with claim 21 , wherem the camera control format is in accordance with an International Telecommunication Umon Standardization Sector (ITU-T) standard.
23 A multimedia system in accordance with claim 22, wherem the International Telecommumcation Umon Standardization Sector standard is a H.281 standard.
24. A multimedia system in accordance with claim 23, wherem the camera control format is adopted as a standard by an industry.
PCT/US2000/042352 1999-12-21 2000-11-29 System, apparatus and method for controlling communications in a multimedia communication network WO2001047251A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU49033/01A AU4903301A (en) 1999-12-21 2000-11-29 System, apparatus and method for controlling communications in a multimedia communication network

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US46897199A 1999-12-21 1999-12-21
US09/468,971 1999-12-21

Publications (2)

Publication Number Publication Date
WO2001047251A2 true WO2001047251A2 (en) 2001-06-28
WO2001047251A3 WO2001047251A3 (en) 2002-03-28

Family

ID=23861932

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/042352 WO2001047251A2 (en) 1999-12-21 2000-11-29 System, apparatus and method for controlling communications in a multimedia communication network

Country Status (2)

Country Link
AU (1) AU4903301A (en)
WO (1) WO2001047251A2 (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0738076A2 (en) * 1995-04-14 1996-10-16 Canon Kabushiki Kaisha Camera control system and method of controlling same
EP0781049A2 (en) * 1995-12-19 1997-06-25 Canon Kabushiki Kaisha Communication apparatus, image processing apparatus, communication method, and image processing method
US5802281A (en) * 1994-09-07 1998-09-01 Rsi Systems, Inc. Peripheral audio/video communication system that interfaces with a host computer and determines format of coded audio/video signals
US5850340A (en) * 1996-04-05 1998-12-15 York; Matthew Integrated remote controlled computer and television system
US5884096A (en) * 1995-08-25 1999-03-16 Apex Pc Solutions, Inc. Interconnection system for viewing and controlling remotely connected computers with on-screen video overlay for controlling of the interconnection switch
US5920694A (en) * 1993-03-19 1999-07-06 Ncr Corporation Annotation of computer video displays
US5956028A (en) * 1995-09-14 1999-09-21 Fujitsu Ltd. Virtual space communication system, three-dimensional image display method, and apparatus therefor

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5920694A (en) * 1993-03-19 1999-07-06 Ncr Corporation Annotation of computer video displays
US5802281A (en) * 1994-09-07 1998-09-01 Rsi Systems, Inc. Peripheral audio/video communication system that interfaces with a host computer and determines format of coded audio/video signals
EP0738076A2 (en) * 1995-04-14 1996-10-16 Canon Kabushiki Kaisha Camera control system and method of controlling same
US5884096A (en) * 1995-08-25 1999-03-16 Apex Pc Solutions, Inc. Interconnection system for viewing and controlling remotely connected computers with on-screen video overlay for controlling of the interconnection switch
US5956028A (en) * 1995-09-14 1999-09-21 Fujitsu Ltd. Virtual space communication system, three-dimensional image display method, and apparatus therefor
EP0781049A2 (en) * 1995-12-19 1997-06-25 Canon Kabushiki Kaisha Communication apparatus, image processing apparatus, communication method, and image processing method
US5850340A (en) * 1996-04-05 1998-12-15 York; Matthew Integrated remote controlled computer and television system

Also Published As

Publication number Publication date
AU4903301A (en) 2001-07-03
WO2001047251A3 (en) 2002-03-28

Similar Documents

Publication Publication Date Title
JP4372558B2 (en) Telecommunications system
US7231603B2 (en) Communication apparatus, communication system, video image display control method, storage medium and program
US6646677B2 (en) Image sensing control method and apparatus, image transmission control method, apparatus, and system, and storage means storing program that implements the method
RU2398362C2 (en) Connection of independent multimedia sources into conference communication
KR101633100B1 (en) Information processing system, information processing apparatus, information processing method, and recording medium
JP2001500331A (en) Video camera with general-purpose serial bus interface
US6009470A (en) Encoded multi-media terminal
US20060188096A1 (en) Systems and methods for remotely controlling computer applications
KR20080014329A (en) Apparatus for image display and control method thereof
EP2506571A1 (en) Analog and digital video monitoring fusion system and fusion method thereof
US5956076A (en) Multimedia enabled network
US7339604B1 (en) Videoconferencing arrangement having multi-purpose digital still camera
US7573497B2 (en) Computer-based multimedia communication system having an external unit
US20060143314A1 (en) Remote USB videophone communication
US8174556B1 (en) Videoconferencing arrangement having multi-purpose digital still camera
WO2001047251A2 (en) System, apparatus and method for controlling communications in a multimedia communication network
US6772197B1 (en) Apparatus for enabling communication between a computer terminal and a servicing station for remote, real-time, controlled repair
CA2577087A1 (en) Systems and methods for remotely controlling computer applications
KR100729628B1 (en) Unification home gateway apparatus
JP2000152203A (en) Video compliant computer/telephone device
CN116136751B (en) Mirror image method for cross-operating system of primary screen and secondary screen
JP2009159351A (en) Image recording/reproducing/distributing apparatus
JP4818309B2 (en) Video phone terminal
US20080320535A1 (en) Ip-Enabled Terminal for Combined Video-Based Entertainment and Communication Services
WO1999056457A2 (en) Portable data transmission system for global and local computer networks

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase in:

Ref country code: JP