WO2013012552A1 - A side channel for employing descriptive audio commentary about a video conference - Google Patents
- Publication number
- WO2013012552A1 (PCT/US2012/045204)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- side channel
- video conference
- audio
- descriptive
- real
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/141—Systems for two-way working between two video terminals, e.g. videophone
- H04N7/147—Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B21/00—Teaching, or communicating with, the blind, deaf or mute
- G09B21/001—Teaching or communicating with blind persons
- G09B21/006—Teaching or communicating with blind persons using audible presentation of the information
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/56—Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
- H04M3/567—Multimedia conference systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
Abstract
A side channel, networked with a real-time video conference feed, includes descriptive audio commentary. The side channel also includes an adjustable data handler for adjusting amount of the descriptive audio data correlated to audio commentary regarding the real-time video conference; and a means for prioritizing associated conversation links tied to the real-time video conference.
Description
A SIDE CHANNEL FOR EMPLOYING DESCRIPTIVE AUDIO COMMENTARY ABOUT A VIDEO CONFERENCE
FIELD OF THE DISCLOSURE
[001] The present disclosure relates generally to video conferencing and more particularly to communicating descriptive audio commentary about a real-time video conference to visually-impaired attendees.
BACKGROUND
[002] Persons having limited sight are disadvantaged in a video conference because much information may not be communicated to them, for example, knowing whether a participant looks tired, or is nodding their acceptance of the presented information, or knowing whether a participant is an authority figure, or whether a participant has worn a special garment or accessory as a signal to other video conference participants.
[003] Accordingly, there is a need for an apparatus that informs a visually-impaired video conference attendee of pertinent information related to the video conference and its purpose.
BRIEF DESCRIPTION OF THE FIGURES
[004] The accompanying figures, where like reference numerals refer to identical or functionally similar elements throughout the separate views, together with the detailed description below, are incorporated in and form part of the specification, and serve to further illustrate embodiments of concepts that include the claimed invention, and explain various principles and advantages of those embodiments.
[005] FIG. 1 is an exemplary pictorial illustration of local participants in a video conference setting.
[006] FIG. 2 is an exemplary block diagram of an electronic system employed in a mobile computing device.
[007] FIG. 3 is an exemplary flowchart.
[008] Skilled artisans will appreciate that elements in the figures are illustrated for simplicity and clarity and have not necessarily been drawn to scale. For example, the dimensions of some of the elements in the figures may be exaggerated relative to other elements to help to improve understanding of embodiments of the present invention.
[009] The apparatus and method components have been represented where appropriate by conventional symbols in the drawings, showing only those specific details that are pertinent to understanding the embodiments of the present invention so as not to obscure the disclosure with details that will be readily apparent to those of ordinary skill in the art having the benefit of the description herein.
DETAILED DESCRIPTION
[0010] An electronic device, such as a mobile computing device with an audio/video electronic component, is communicatively coupled to at least one video camera during a video conference, and also includes a touchscreen display on which video conference participant representations are displayed. A side channel, networked with the video conference feed, includes descriptive audio commentary. The side channel also includes an adjustable data handler for adjusting the amount of the descriptive audio data correlated to audio commentary regarding a real-time video conference; and a means for prioritizing associated conversation links tied to the real-time video conference.
[0011] FIG. 1 shows an exemplary pictorial illustration of local participants in a video conference setting 100. Local video conference participants may be in attendance to view and hear a presentation. The term "local" is with respect to a reference point associated with the video conference setup. Likewise, remote video conference participants can be in attendance via a communication network to view and hear a presentation. In video conference setting 100, local participants (hereinafter termed "lp") include a first male lp 105 displaying a presentation or document 140; a female lp 110; a second male lp 115 at the far end of a table; and a third male lp 120 seated across from first male lp 105. A camera 130 is communicatively coupled to the video conference's communication network and captures video images and audio of the local participants and the room in which the video conference is being held. The captured video can be streamed to remote video conference participants. It is envisioned that there may be two groups of video conference participants, one at each end of a video stream, and each group is local to its reference point.
[0012] FIG. 2 shows an exemplary block diagram of an electronic system employed in a mobile computing device 200 for receiving the streamed video from camera 130 in FIG. 1. Mobile computing device 200 includes a display 210 having an integrated haptics feedback system 220; a camera 230; a microphone 240; and a controller 250 electronically and communicatively coupled to display 210, haptics feedback system 220, camera 230, and microphone 240. Controller 250 can be comprised of separate, but linked controllers such as a speech-to-text controller and an ongoing commentary controller. A transceiver 260 is also electronically and communicatively coupled to controller 250 for receiving and transmitting data. Data can include image processing data, metadata, audio data, user input data, and communication data (e.g., Braille, texting, email), for example. Memory 270 can store the data either permanently or temporarily and is electronically and communicatively coupled to controller 250. The electronic system may also include a speaker communicatively coupled to the controller (not shown).
[0013] Functions in controller 250 can include a speech-to-text function that converts video conference participants' speech into text and creates an identification tag for each video conference participant. Controller 250 may analyze video data received and determine descriptive information about the video such as descriptions of clothes, body language, the location, movements of the participants and other related descriptive data. Other functions may include an ongoing commentary controller that provides feedback on non-verbal cues about the video conference participants. Moreover, controller 250 can prioritize the nonverbal cues and description feedback to avoid unnecessary or unwanted feedback, such as a participant doing excessive scratching of a body part or a participant blowing her nose several times throughout the video conference.
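As an illustration only, the prioritization of non-verbal cues described above might be sketched as follows. The cue kinds, the numeric priority scale, the suppressed set, and the names Cue and select_cues are all assumptions made for this sketch; the disclosure does not specify how controller 250 ranks or suppresses cues.

```python
from dataclasses import dataclass

# Hypothetical set of cue kinds treated as unwanted feedback (assumption).
SUPPRESSED_KINDS = {"scratching", "nose_blowing"}


@dataclass
class Cue:
    participant_tag: str  # identification tag created for the participant
    kind: str             # e.g. "nodding", "frown", "scratching"
    priority: int         # higher means more relevant (assumed scale)


def select_cues(cues, max_items=3):
    """Drop suppressed cue kinds, then keep the highest-priority cues."""
    wanted = [c for c in cues if c.kind not in SUPPRESSED_KINDS]
    # sorted() is stable, so cues of equal priority keep their arrival order.
    return sorted(wanted, key=lambda c: c.priority, reverse=True)[:max_items]
```

In this sketch a repeated scratching cue is never voiced, regardless of its priority, while the remaining cues are capped at max_items so that the commentary does not crowd out the conference audio.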
[0014] FIG. 3 shows an exemplary flowchart of a method 300 for providing descriptive audio data about a real-time video conference to an output device.
[0015] A mobile computing device with an audio/video electronic component uses Operation 310 to receive a real-time video-conferencing feed. The video-conferencing feed can be protected by authorization codes and/or passwords during an initialization period. The real-time video-conferencing feed includes streamed audio and video content of an ongoing videoconference. The real-time videoconference can show the seating arrangement of participants captured by an image capturing device. The real-time videoconference can show actual participants and therefore the garments the actual participants are wearing, including jewelry, headdress, accessories, and uncovered tattoos. The real-time videoconference can show authority figures and preferred interested persons designated by title or seating arrangement or subject matter, for example.
[0016] Operation 320 enables the mobile computing device to receive a side channel data feed in addition to the real-time video-conferencing feed. The side channel data feed may or may not be synchronized with the video conference feed and may include an audio feed. In most cases, the side channel feed can be distinct from the real-time video-conferencing to enable greater description of the events and actions of the participants in the real-time video conference. The side channel feed includes descriptive audio commentary comprising information about the real-time videoconference during and after its broadcast to the mobile computing device.
[0017] The descriptive data may be generated at the site of the video conferencing, or may be generated at a server connected to the video conferencing equipment via a network, or the descriptive data may be generated at the site receiving the video conferencing data.
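By way of a hedged example, one item of side channel data could be represented as shown below. The type names Origin and SideChannelItem and the field video_timestamp_s are hypothetical; the disclosure states only that the feed may or may not be synchronized with the video conference feed and that the descriptive data may be generated at any of three sites.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Origin(Enum):
    """Where the descriptive data was generated (the three options in the text)."""
    CONFERENCE_SITE = "conference_site"
    SERVER = "server"
    RECEIVING_SITE = "receiving_site"


@dataclass
class SideChannelItem:
    text: str                                  # descriptive commentary to be voiced
    origin: Origin
    video_timestamp_s: Optional[float] = None  # None when not synchronized

    def is_synchronized(self) -> bool:
        """True when the item is tied to a position in the conference feed."""
        return self.video_timestamp_s is not None
```

An item generated after the broadcast would simply carry no timestamp, which is one way to accommodate commentary delivered both during and after the conference.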
[0018] Operation 330 provides descriptive audio commentary that can, for example, include descriptive information about the emotions of one or more videoconference participants (e.g., a participant displays a pleasing smile versus a frown or a confused look, swallows excessively, has involuntary facial tics, or is sweating profusely); about movement by the participants (e.g., shifting in a seat, leaning forward, backwards, or sideways towards another participant, finger tapping, entering or leaving the video-conference room, or entering or leaving the viewable area of the camera); about clothes, garments, or accessories worn by participants; about body language of participants (e.g., slouching versus ram-rod or stiff posture); and about the seating arrangements or the aesthetic ambiance of the room or geographical location where the meeting is being conducted (e.g., low light versus bright light, indoor versus outdoor, paint on walls versus scenery through a window).
[0019] Operation 340 may provide audio commentary on the real-time video-conferencing while being subject to rules that enable filtering of the descriptive audio data or information corresponding to the descriptive audio commentary. The rules can be heuristic (i.e., learned over time and experience), or can be predetermined. The rules may limit the descriptive audio data if the user or operator of the mobile computing device with audio/video electronic component is talking. The rules may enable more descriptive audio data to be delivered to the mobile computing device if the operator of the mobile computing device is not talking. The rules may prioritize the descriptive data based on an input that one or more persons are decision makers or authority figures or influential persons or persons of interest. Likewise, the rules may prioritize the descriptive data based on a specific participant speaking. That is, specific conversation links associated with the real-time video conferencing and attributable to one or more speakers can be given higher or lower priority. In addition, the operator may establish his or her own rules that impact the amount of descriptive data about the real-time videoconferencing that is provided to the mobile computing device.
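A minimal sketch of predetermined rules of this kind is given below, assuming a simple scoring scheme. The weights, the per-state limits, and the names Description and apply_rules are illustrative assumptions rather than part of the disclosure, and heuristic (learned) rules are not modeled.

```python
from dataclasses import dataclass


@dataclass
class Description:
    subject: str  # identification tag of the participant being described
    text: str


def apply_rules(items, operator_talking, persons_of_interest,
                current_speaker=None,
                limit_while_talking=1, limit_while_listening=5):
    """Order descriptions by priority, then cap the count by operator state."""
    def rank(item):
        score = 0
        if item.subject in persons_of_interest:
            score += 2  # decision makers and persons of interest come first
        if item.subject == current_speaker:
            score += 1  # then whoever is currently speaking
        return -score   # negate so that higher scores sort first

    ordered = sorted(items, key=rank)
    # Less descriptive data while the operator talks, more while listening.
    limit = limit_while_talking if operator_talking else limit_while_listening
    return ordered[:limit]
```

Operator-defined rules could be added in the same style, for example by letting the operator supply the two limits or the set of persons of interest.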
[0020] Operation 350 enables the side channel feed to be controlled by an operator of the mobile computing device via a selectivity apparatus, such as a slider mechanism, dial, or other graphical user input, for example. The selectivity apparatus enables the operator to maximize the descriptive audio data received by the mobile computing device and minimize the real-time video conference audio. Alternatively, the selectivity apparatus enables the operator to maximize the real-time video conference audio and severely limit or stop the descriptive audio data sent to the mobile computing device. That is, the selectivity apparatus can reduce either the descriptive data or the real-time video-conferencing audio to a quantitative amount approaching nearly zero.
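The behavior of such a slider could, for instance, be modeled as a linear crossfade between the two audio sources. The function name mix_gains, the 0-to-1 slider range, and the floor value of 0.02 are assumptions chosen for illustration; the disclosure says only that either source can be reduced to an amount approaching nearly zero.

```python
def mix_gains(slider, floor=0.02):
    """Map a slider position in [0, 1] to (conference_gain, descriptive_gain).

    0.0 favors the real-time conference audio; 1.0 favors the descriptive
    audio. Neither gain drops below `floor`, an assumed "nearly zero" level.
    """
    if not 0.0 <= slider <= 1.0:
        raise ValueError("slider must be between 0.0 and 1.0")
    conference_gain = max(floor, 1.0 - slider)
    descriptive_gain = max(floor, slider)
    return conference_gain, descriptive_gain
```

Setting floor to 0.0 would correspond to the alternative in which the descriptive audio data is stopped entirely at one end of the range.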
[0021] Operation 360 enables the side channel feed to be transmitted to an output device, such as a headset comprised of one or more speakers, a stand-alone speaker system comprised of one or more speakers, or an integrated speaker set that is integrated with or communicatively coupled to a controller of the mobile computing device. Any speaker system can be electronically tethered or wireless, for example, via Bluetooth. Additional information may be coupled or transmitted along with the side channel feed. The additional information can include non-verbal cues that can be commented upon or fed back to a person with limited sight via Braille, large format text, haptic output, or audio.
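One way to picture the delivery of additional information over several output modalities is a small dispatcher, sketched below. The function make_dispatcher, the modality names, and the handler callables are hypothetical stand-ins; no real Braille, haptic, or audio API is implied.

```python
from typing import Callable, Dict, List


def make_dispatcher(handlers: Dict[str, Callable[[str], None]]):
    """Build a function that sends text to each requested, available modality."""
    def dispatch(text: str, modalities: List[str]) -> List[str]:
        delivered = []
        for modality in modalities:
            handler = handlers.get(modality)
            if handler is not None:  # skip modalities the device lacks
                handler(text)
                delivered.append(modality)
        return delivered
    return dispatch
```

A device without a Braille display would simply register no "braille" handler, and the same commentary would still reach the audio or large format text outputs.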
[0022] In the foregoing specification, specific embodiments have been described. However, one of ordinary skill in the art appreciates that various modifications and changes can be made without departing from the scope of the invention as set forth in the claims below. Accordingly, the specification and figures are to be regarded in an illustrative rather than a restrictive sense, and all such modifications are intended to be included within the scope of present teachings.
[0023] The benefits, advantages, solutions to problems, and any element(s) that may cause any benefit, advantage, or solution to occur or become more pronounced are not to be construed as critical, required, or essential features or elements of any or all the claims. The invention is defined solely by the appended claims including any amendments made during the pendency of this application and all equivalents of those claims as issued.
[0024] Moreover, in this document, relational terms such as first and second, top and bottom, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. The terms "comprises," "comprising," "has," "having," "includes," "including," "contains," "containing" or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises, has, includes, contains a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. An element preceded by "comprises ...a", "has ...a", "includes ...a", "contains ...a" does not, without more constraints, preclude the existence of additional identical elements in the process, method, article, or apparatus that comprises, has, includes, contains the element. The terms "a" and "an" are defined as one or more unless explicitly stated otherwise herein. The terms "substantially", "essentially", "approximately", "about" or any other version thereof, are defined as being close to as understood by one of ordinary skill in the art, and in one non-limiting embodiment the term is defined to be within 10%, in another embodiment within 5%, in another embodiment within 1% and in another embodiment within 0.5%. The term "coupled" as used herein is defined as connected, although not necessarily directly and not necessarily mechanically. A device or structure that is "configured" in a certain way is configured in at least that way, but may also be configured in ways that are not listed.
[0025] It will be appreciated that some embodiments may be comprised of one or more generic or specialized processors (or "processing devices") such as microprocessors, digital signal processors, customized processors and field programmable gate arrays (FPGAs) and unique stored program instructions (including both software and firmware) that control the one or more processors to implement, in conjunction with certain non-processor circuits, some, most, or all of the functions of the method and/or apparatus described herein. Alternatively, some or all functions could be implemented by a state machine that has no stored program instructions, or in one or more application specific integrated circuits (ASICs), in which each function or some combinations of certain of the functions are implemented as custom logic. Of course, a combination of the two approaches could be used. Moreover, an embodiment can be implemented as a computer-readable storage medium having computer readable code stored thereon for programming a computer (e.g., comprising a processor) to perform a method as described and claimed herein. Examples of such computer-readable storage mediums include, but are not limited to, a hard disk, a CD-ROM, an optical storage device, a magnetic storage device, a ROM (Read Only Memory), a PROM (Programmable Read Only Memory), an EPROM (Erasable Programmable Read Only Memory), an EEPROM (Electrically Erasable Programmable Read Only Memory) and a Flash memory. Additionally, a non-transitory machine readable storage device, having stored thereon a computer program that includes a plurality of code sections comprising code for implementing the method described herein, can be used.
[0026] Further, it is expected that one of ordinary skill, notwithstanding possibly significant effort and many design choices motivated by, for example, available time, current technology, and economic considerations, when guided by the concepts and principles disclosed herein will be readily capable of generating such software instructions and programs and ICs with minimal experimentation.
The Abstract of the Disclosure is provided to allow the reader to quickly ascertain the nature of the technical disclosure. It is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims. In addition, in the foregoing Detailed Description, it can be seen that various features are grouped together in various embodiments for the purpose of streamlining the disclosure. This method of disclosure is not to be interpreted as reflecting an intention that the claimed embodiments require more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive subject matter lies in less than all features of a single disclosed embodiment. Thus the following claims are hereby incorporated into the Detailed Description, with each claim standing on its own as a separately claimed subject matter.
Claims
1. A side channel, networked with a real-time video conference feed, for employing descriptive audio commentary, comprising:
an adjustable data handler for adjusting amount of the descriptive audio data correlated to audio commentary regarding the real-time video conference; and
a means for prioritizing associated conversation links tied to the real-time video conference.
2. The side channel according to claim 1, wherein the side channel provides additional information to enhance visual aspect of the real-time video conference.
3. The side channel according to claim 2, wherein the additional information is selected from the group consisting of audio, large format text, Braille output, or haptic output.
4. The side channel according to claim 3, wherein the audio is transmitted over a stand-alone speaker or a headset.
5. The side channel according to claim 2, wherein the additional information is generated from a server or at a mobile computing device.
6. The side channel according to claim 1, further comprising a slider mechanism coupled to the adjustable data handler.
7. The side channel according to claim 6 wherein the slider mechanism enables a range of descriptive data that includes full audio from the real-time video conference audio with nearly zero descriptive data through an entire descriptive audio commentary on video conference participants with nearly zero real-time video conference audio.
8. The side channel according to claim 1, wherein the amount of descriptive audio data is decreased as a user of the side channel is communicating with other real-time video conference participants.
9. The side channel according to claim 8, wherein the adjustable data handler automatically reduces descriptive data associated with the real-time video conference and the communicative participation by a user of the side channel.
10. The side channel according to claim 9, wherein the amount of reduction of descriptive data is dependent on the amount of communicative participation generated by the user of the side channel.
11. The side channel according to claim 1, further comprising a selectivity filter wherein the descriptive audio commentary is selectable according to the group consisting of at least one of: side channel user interest, video conference participant deemed a decision maker, and side channel user selection.
12. The side channel according to claim 1, wherein the descriptive audio commentary includes descriptive information about video conference participants selected from the group consisting of at least one of: emotions, body movement, body posture, garments, and seating arrangement.
13. A method for employing descriptive audio commentary of a real-time video conference feed via a side channel, comprising the steps of:
adjusting amount of descriptive audio data correlated to the real-time video conference; and
prioritizing associated conversation links tied to the real-time video conference.
14. The method according to claim 13, further comprising the step of:
automatically reducing descriptive audio commentary associated with the real-time video conference and communicative participation by a user of the side channel.
15. The method according to claim 14, wherein the amount of reduction of descriptive data is dependent on the amount of communicative participation generated by the user of the side channel.
16. The method according to claim 13, further comprising the step of:
selecting the descriptive audio commentary according to the group consisting of at least one of: side channel user interest, video conference participant deemed a decision maker, and side channel user selection.
17. The method according to claim 13, further comprising the step of:
employing descriptive information about video conference participants selected from the group consisting of at least one of: emotions, body movement, body posture, garments, and seating arrangement.
18. The method according to claim 13, further comprising:
receiving the real-time video conference on a mobile computing device comprising an audio/video electronic component;
receiving a side channel audio feed comprising descriptive audio commentary about the real-time video conference;
controlling the side channel audio feed with a set of rules;
controlling the side channel audio feed with a user selectivity apparatus coupled to an adjustable data handler for adjusting amount of the descriptive audio data correlated to the descriptive audio commentary about the real-time video conference; and
transmitting the side channel audio feed to the output device.
19. The method according to claim 18, wherein the side channel audio feed is transmitted over a stand-alone speaker or a headset.
20. The method according to claim 13, further comprising:
transmitting additional information to enhance visual aspect of the real-time video conference; wherein the additional information is selected from the group consisting of audio, large format text, Braille output, or haptic output.
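By way of illustration only, the adjustable data handler of claims 6 to 10 and the selectivity filter of claim 11 might be sketched as follows. This is a minimal sketch under stated assumptions, not the disclosed implementation: the names `mix_gains`, `reduce_for_participation`, `Commentary`, `priority`, and `select_commentary` are hypothetical, the linear crossfade and the proportional reduction are assumed mappings, and the priority weights are arbitrary choices that the claims do not specify.

```python
from dataclasses import dataclass


def clamp01(x: float) -> float:
    """Clamp a value to the closed interval [0, 1]."""
    return min(1.0, max(0.0, x))


def mix_gains(slider: float) -> tuple[float, float]:
    """Return (conference_gain, descriptive_gain) for a slider position.

    0.0 means conference audio only and 1.0 means descriptive commentary
    only. The linear crossfade is an assumption of this sketch.
    """
    s = clamp01(slider)
    return 1.0 - s, s


def reduce_for_participation(descriptive_gain: float, talk_ratio: float) -> float:
    """Lower the descriptive gain as the side channel user talks more.

    talk_ratio is the assumed fraction of recent time the user spent
    speaking; the proportional reduction is also an assumption.
    """
    return descriptive_gain * (1.0 - clamp01(talk_ratio))


@dataclass
class Commentary:
    """One hypothetical item of descriptive commentary about a participant."""
    text: str
    user_selected: bool = False
    about_decision_maker: bool = False
    matches_interest: bool = False


def priority(item: Commentary) -> int:
    """Score an item; the weights 4, 2, 1 are arbitrary illustrative values."""
    return (4 * item.user_selected
            + 2 * item.about_decision_maker
            + 1 * item.matches_interest)


def select_commentary(items: list[Commentary], limit: int) -> list[Commentary]:
    """Keep items matching at least one criterion, highest priority first."""
    relevant = [item for item in items if priority(item) > 0]
    return sorted(relevant, key=priority, reverse=True)[:limit]
```

In such a sketch, a caller would first obtain the two gains from the slider position, then pass the descriptive gain through `reduce_for_participation` whenever the user is speaking, and finally use `select_commentary` to decide which descriptive items fit into the remaining commentary budget.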
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/183,846 US9077848B2 (en) | 2011-07-15 | 2011-07-15 | Side channel for employing descriptive audio commentary about a video conference |
US13/183,846 | 2011-07-15 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2013012552A1 (en) | 2013-01-24 |
Family
ID=46466969
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2012/045204 WO2013012552A1 (en) | 2011-07-15 | 2012-07-02 | A side channel for employing descriptive audio commentary about a video conference |
Country Status (2)
Country | Link |
---|---|
US (1) | US9077848B2 (en) |
WO (1) | WO2013012552A1 (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3053000A4 (en) | 2013-09-30 | 2017-05-03 | Schneider Electric USA, Inc. | Systems and methods of data acquisition |
EP3066483A4 (en) | 2013-11-08 | 2017-04-19 | Schneider Electric USA, Inc. | Sensor-based facility energy modeling |
NO339354B1 (en) * | 2014-02-04 | 2016-12-05 | Parcels In Sport As | System and procedure for improved visual impairment experience |
US9735274B2 (en) * | 2015-11-20 | 2017-08-15 | Taiwan Semiconductor Manufacturing Co., Ltd. | Semiconductor device including a stacked wire structure |
US11573999B2 (en) * | 2020-07-31 | 2023-02-07 | Adobe Inc. | Accessible multimedia content |
CN113873195B (en) * | 2021-08-18 | 2023-04-18 | 荣耀终端有限公司 | Video conference control method, device and storage medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050152524A1 (en) * | 2004-01-13 | 2005-07-14 | International Business Machines Corporation | System and method for server based conference call volume management |
EP1916833A1 (en) * | 2006-10-27 | 2008-04-30 | Nortel Networks Limited | Source selection for conference bridges |
US20100253689A1 (en) * | 2009-04-07 | 2010-10-07 | Avaya Inc. | Providing descriptions of non-verbal communications to video telephony participants who are not video-enabled |
Family Cites Families (90)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6738697B2 (en) | 1995-06-07 | 2004-05-18 | Automotive Technologies International Inc. | Telematics system for vehicle diagnostics |
US6850252B1 (en) | 1999-10-05 | 2005-02-01 | Steven M. Hoffberg | Intelligent electronic appliance system and method |
US7185054B1 (en) | 1993-10-01 | 2007-02-27 | Collaboration Properties, Inc. | Participant display and selection in video conference calls |
EP0724362B1 (en) * | 1995-01-30 | 2000-03-22 | International Business Machines Corporation | Priority controlled transmission of multimedia streams via a telecommunication line |
US6205716B1 (en) | 1995-12-04 | 2001-03-27 | Diane P. Peltz | Modular video conference enclosure |
US6304648B1 (en) | 1998-12-21 | 2001-10-16 | Lucent Technologies Inc. | Multimedia conference call participant identification system and method |
US6795106B1 (en) | 1999-05-18 | 2004-09-21 | Intel Corporation | Method and apparatus for controlling a video camera in a video conferencing system |
US6688891B1 (en) | 1999-08-27 | 2004-02-10 | Inter-Tares, Llc | Method and apparatus for an electronic collaborative education process model |
US7006893B2 (en) | 1999-09-22 | 2006-02-28 | Telepharmacy Solutions, Inc. | Systems for dispensing medical products |
ATE497721T1 (en) | 1999-10-27 | 2011-02-15 | Dimicine Res It Llc | MEDICAL DATA CONTROL SYSTEM |
US6606744B1 (en) | 1999-11-22 | 2003-08-12 | Accenture, Llp | Providing collaborative installation management in a network-based supply chain environment |
US7716077B1 (en) | 1999-11-22 | 2010-05-11 | Accenture Global Services Gmbh | Scheduling and planning maintenance and service in a network-based supply chain environment |
WO2001039028A2 (en) | 1999-11-22 | 2001-05-31 | Accenture Llp | Method for affording a market space interface between a plurality of manufacturers and service providers and installation management via a market space interface |
US8271336B2 (en) | 1999-11-22 | 2012-09-18 | Accenture Global Services Gmbh | Increased visibility during order management in a network-based supply chain environment |
US6671818B1 (en) | 1999-11-22 | 2003-12-30 | Accenture Llp | Problem isolation through translating and filtering events into a standard object format in a network based supply chain |
WO2001039029A2 (en) | 1999-11-22 | 2001-05-31 | Accenture Llp | Collaborative capacity planning and reverse inventory management during demand and supply planning in a network-based supply chain environment and method thereof |
US7124101B1 (en) | 1999-11-22 | 2006-10-17 | Accenture Llp | Asset tracking in a network-based supply chain environment |
US7130807B1 (en) | 1999-11-22 | 2006-10-31 | Accenture Llp | Technology sharing during demand and supply planning in a network-based supply chain environment |
US7167844B1 (en) | 1999-12-22 | 2007-01-23 | Accenture Llp | Electronic menu document creator in a virtual financial environment |
US7069234B1 (en) | 1999-12-22 | 2006-06-27 | Accenture Llp | Initiating an agreement in an e-commerce environment |
US7610233B1 (en) | 1999-12-22 | 2009-10-27 | Accenture, Llp | System, method and article of manufacture for initiation of bidding in a virtual trade financial environment |
US6629081B1 (en) | 1999-12-22 | 2003-09-30 | Accenture Llp | Account settlement and financing in an e-commerce environment |
US6778533B1 (en) | 2000-01-24 | 2004-08-17 | Ati Technologies, Inc. | Method and system for accessing packetized elementary stream data |
US6999424B1 (en) | 2000-01-24 | 2006-02-14 | Ati Technologies, Inc. | Method for displaying data |
US6988238B1 (en) | 2000-01-24 | 2006-01-17 | Ati Technologies, Inc. | Method and system for handling errors and a system for receiving packet stream data |
US6885680B1 (en) | 2000-01-24 | 2005-04-26 | Ati International Srl | Method for synchronizing to a data stream |
US6804266B1 (en) | 2000-01-24 | 2004-10-12 | Ati Technologies, Inc. | Method and apparatus for handling private data from transport stream packets |
US6763390B1 (en) | 2000-01-24 | 2004-07-13 | Ati Technologies, Inc. | Method and system for receiving and framing packetized data |
US6785336B1 (en) | 2000-01-24 | 2004-08-31 | Ati Technologies, Inc. | Method and system for retrieving adaptation field data associated with a transport packet |
US7087015B1 (en) | 2000-01-31 | 2006-08-08 | Panmedix, Inc. | Neurological pathology diagnostic apparatus and methods |
US7370983B2 (en) | 2000-03-02 | 2008-05-13 | Donnelly Corporation | Interior mirror assembly with display |
US7113546B1 (en) | 2000-05-02 | 2006-09-26 | Ati Technologies, Inc. | System for handling compressed video data and method thereof |
US20020165894A1 (en) | 2000-07-28 | 2002-11-07 | Mehdi Kashani | Information processing apparatus and method |
US20020078459A1 (en) | 2000-08-30 | 2002-06-20 | Mckay Brent | Interactive electronic directory service, public information and general content delivery system and method |
WO2002049311A2 (en) | 2000-11-14 | 2002-06-20 | Tritrust.Com, Inc. | Pseudonym credentialing system |
US7194411B2 (en) | 2001-02-26 | 2007-03-20 | Benjamin Slotznick | Method of displaying web pages to enable user access to text information that the user has difficulty reading |
US7253732B2 (en) | 2001-09-10 | 2007-08-07 | Osann Jr Robert | Home intrusion confrontation avoidance system |
EP1306735A1 (en) | 2001-10-25 | 2003-05-02 | ABB Installationen AG | Control of a meeting room |
US7404001B2 (en) | 2002-03-27 | 2008-07-22 | Ericsson Ab | Videophone and method for a video call |
US8581688B2 (en) | 2002-06-11 | 2013-11-12 | Intelligent Technologies International, Inc. | Coastal monitoring techniques |
EP1381185A1 (en) | 2002-07-12 | 2004-01-14 | BRITISH TELECOMMUNICATIONS public limited company | Mediated communications |
US6931113B2 (en) | 2002-11-08 | 2005-08-16 | Verizon Services Corp. | Facilitation of a conference call |
US7756923B2 (en) | 2002-12-11 | 2010-07-13 | Siemens Enterprise Communications, Inc. | System and method for intelligent multimedia conference collaboration summarization |
US7266189B1 (en) | 2003-01-27 | 2007-09-04 | Cisco Technology, Inc. | Who said that? teleconference speaker identification apparatus and method |
US7607097B2 (en) | 2003-09-25 | 2009-10-20 | International Business Machines Corporation | Translating emotion to braille, emoticons and other special symbols |
US20050131744A1 (en) | 2003-12-10 | 2005-06-16 | International Business Machines Corporation | Apparatus, system and method of automatically identifying participants at a videoconference who exhibit a particular expression |
IL160429A0 (en) | 2004-02-16 | 2005-11-20 | Home Comfort Technologies Ltd | Environmental control system |
US20050235032A1 (en) | 2004-04-15 | 2005-10-20 | Mason Wallace R Iii | System and method for haptic based conferencing |
US9820658B2 (en) | 2006-06-30 | 2017-11-21 | Bao Q. Tran | Systems and methods for providing interoperability among healthcare devices |
US8210848B1 (en) * | 2005-03-07 | 2012-07-03 | Avaya Inc. | Method and apparatus for determining user feedback by facial expression |
US20060248210A1 (en) | 2005-05-02 | 2006-11-02 | Lifesize Communications, Inc. | Controlling video display mode in a video conferencing system |
AU2006335151A1 (en) | 2005-12-30 | 2007-07-19 | Steven Kays | Genius adaptive design |
US20080017136A1 (en) | 2006-01-10 | 2008-01-24 | Chevron U.S.A. Inc. | Method of controlling combustion in an hcci engine |
US7558622B2 (en) | 2006-05-24 | 2009-07-07 | Bao Tran | Mesh network stroke monitoring appliance |
US7539532B2 (en) | 2006-05-12 | 2009-05-26 | Bao Tran | Cuffless blood pressure monitoring appliance |
US7539533B2 (en) | 2006-05-16 | 2009-05-26 | Bao Tran | Mesh network monitoring appliance |
US20070271338A1 (en) * | 2006-05-18 | 2007-11-22 | Thomas Anschutz | Methods, systems, and products for synchronizing media experiences |
US20070294263A1 (en) | 2006-06-16 | 2007-12-20 | Ericsson, Inc. | Associating independent multimedia sources into a conference call |
EP2261828A3 (en) | 2006-07-11 | 2012-12-26 | PCAS Patient Care Automation Services Inc | Method, system and apparatus for dispensing drugs |
US8182267B2 (en) | 2006-07-18 | 2012-05-22 | Barry Katz | Response scoring system for verbal behavior within a behavioral stream with a remote central processing system and associated handheld communicating devices |
CN2923916Y (en) | 2006-07-19 | 2007-07-18 | 陈能森 | Folding double-top iron-handicraft mat-awning |
US8687037B2 (en) | 2006-09-12 | 2014-04-01 | Savant Systems, Llc | Telephony services for programmable multimedia controller |
WO2010093503A2 (en) | 2007-01-05 | 2010-08-19 | Myskin, Inc. | Skin analysis methods |
US8065240B2 (en) | 2007-10-31 | 2011-11-22 | The Invention Science Fund I | Computational user-health testing responsive to a user interaction with advertiser-configured content |
US20090112621A1 (en) | 2007-10-30 | 2009-04-30 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Computational user-health testing responsive to a user interaction with advertiser-configured content |
US20090119154A1 (en) | 2007-11-07 | 2009-05-07 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Determining a demographic characteristic based on computational user-health testing of a user interaction with advertiser-specified content |
US20090132275A1 (en) | 2007-11-19 | 2009-05-21 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Determining a demographic characteristic of a user based on computational user-health testing |
US20080243005A1 (en) | 2007-03-30 | 2008-10-02 | Searete Llc, A Limited Liability Corporation Of The State Of Delaware | Computational user-health testing |
CN101309390B (en) | 2007-05-17 | 2012-05-23 | 华为技术有限公司 | Visual communication system, apparatus and subtitle displaying method |
KR101526963B1 (en) | 2007-09-19 | 2015-06-11 | 엘지전자 주식회사 | Mobile terminal, method of displaying data in the mobile terminal, and method of editting data in the mobile terminal |
US20090140887A1 (en) | 2007-11-29 | 2009-06-04 | Breed David S | Mapping Techniques Using Probe Vehicles |
US20090193345A1 (en) * | 2008-01-28 | 2009-07-30 | Apeer Inc. | Collaborative interface |
US8421840B2 (en) | 2008-06-09 | 2013-04-16 | Vidyo, Inc. | System and method for improved view layout management in scalable video and audio communication systems |
US8594290B2 (en) | 2008-06-20 | 2013-11-26 | International Business Machines Corporation | Descriptive audio channel for use with multimedia conferencing |
KR101495172B1 (en) | 2008-07-29 | 2015-02-24 | 엘지전자 주식회사 | Mobile terminal and method for controlling image thereof |
US8487975B2 (en) | 2009-01-27 | 2013-07-16 | Lifesize Communications, Inc. | Conferencing system utilizing a mobile communication device as an interface |
US20110238753A1 (en) | 2009-03-04 | 2011-09-29 | Lueth Jacquelynn R | System and Method for Providing a Real-Time Digital Impact Virtual Audience |
US8386255B2 (en) * | 2009-03-17 | 2013-02-26 | Avaya Inc. | Providing descriptions of visually presented information to video teleconference participants who are not video-enabled |
US20100257462A1 (en) | 2009-04-01 | 2010-10-07 | Avaya Inc | Interpretation of gestures to provide visual queues |
US20100299134A1 (en) * | 2009-05-22 | 2010-11-25 | Microsoft Corporation | Contextual commentary of textual images |
US8174932B2 (en) | 2009-06-11 | 2012-05-08 | Hewlett-Packard Development Company, L.P. | Multimodal object localization |
US8416715B2 (en) | 2009-06-15 | 2013-04-09 | Microsoft Corporation | Interest determination for auditory enhancement |
US9154730B2 (en) | 2009-10-16 | 2015-10-06 | Hewlett-Packard Development Company, L.P. | System and method for determining the active talkers in a video conference |
US8427521B2 (en) * | 2009-10-21 | 2013-04-23 | At&T Intellectual Property I, L.P. | Method and apparatus for providing a collaborative workspace |
KR101681321B1 (en) | 2009-11-17 | 2016-11-30 | 엘지전자 주식회사 | Method for user authentication, video communication apparatus and display apparatus thereof |
US8670018B2 (en) | 2010-05-27 | 2014-03-11 | Microsoft Corporation | Detecting reactions and providing feedback to an interaction |
US8630854B2 (en) | 2010-08-31 | 2014-01-14 | Fujitsu Limited | System and method for generating videoconference transcriptions |
US9237305B2 (en) | 2010-10-18 | 2016-01-12 | Apple Inc. | Overlay for a video conferencing application |
US8812510B2 (en) * | 2011-05-19 | 2014-08-19 | Oracle International Corporation | Temporally-correlated activity streams for conferences |
US8976218B2 (en) * | 2011-06-27 | 2015-03-10 | Google Technology Holdings LLC | Apparatus for providing feedback on nonverbal cues of video conference participants |
- 2011-07-15: US application US13/183,846, published as US9077848B2 (en), status: Active
- 2012-07-02: WO application PCT/US2012/045204, published as WO2013012552A1 (en), status: Application Filing
Also Published As
Publication number | Publication date |
---|---|
US20130016175A1 (en) | 2013-01-17 |
US9077848B2 (en) | 2015-07-07 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
US9077848B2 (en) | Side channel for employing descriptive audio commentary about a video conference | |
US10057542B2 (en) | System for immersive telepresence | |
US10089769B2 (en) | Augmented display of information in a device view of a display screen | |
CN106791893B (en) | Video live broadcasting method and device | |
KR101988279B1 (en) | Operating Method of User Function based on a Face Recognition and Electronic Device supporting the same | |
EP3358835B1 (en) | Improved method and system for video conferences with hmds | |
JP7056055B2 (en) | Information processing equipment, information processing systems and programs | |
US8976218B2 (en) | Apparatus for providing feedback on nonverbal cues of video conference participants | |
US8477174B2 (en) | Automatic video switching for multimedia conferencing | |
US9398258B1 (en) | Method and system for video conferencing units | |
CN105608715B (en) | online group photo method and system | |
US20160155474A1 (en) | Information processing apparatus and recording medium | |
US20140063176A1 (en) | Adjusting video layout | |
JP2010529738A (en) | Home video communication system | |
CN108762501B (en) | AR display method, intelligent terminal, AR device and AR system | |
CN103945121A (en) | Information processing method and electronic equipment | |
KR20190121758A (en) | Information processing apparatus, information processing method, and program | |
KR20140063673A (en) | Augmenting a video conference | |
CN112312042A (en) | Display control method, display control device, electronic equipment and storage medium | |
US9407871B2 (en) | Apparatus and method for controlling eye-to-eye contact function | |
JP2011152593A (en) | Robot operation device | |
TW202018649A (en) | Asymmetric video conferencing system and method thereof | |
EP4044589A1 (en) | Context dependent focus in a video feed | |
CN113676693B (en) | Picture presentation method, video conference system, and readable storage medium | |
JP2010004480A (en) | Imaging apparatus, control method thereof and program |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 12733396 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 12733396 Country of ref document: EP Kind code of ref document: A1 |