WO2012089832A1 - Communication system and method - Google Patents

Publication number
WO2012089832A1
Authority
WO
WIPO (PCT)
Prior art keywords
terminals
terminal
client
user
call
Prior art date
Application number
PCT/EP2011/074304
Other languages
French (fr)
Inventor
Manrique Brenes
Derek Macdonald
Original Assignee
Skype
Application filed by Skype
Priority to CN201180063497.6A (patent CN103416023B)
Priority to EP11802455.3A (patent EP2649753B1)
Publication of WO2012089832A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/1066 Session management
    • H04L65/1069 Session establishment or de-establishment
    • H04L65/1083 In-session procedures
    • H04L65/1094 Inter-user-equipment sessions transfer or sharing
    • H04L65/40 Support for services or applications
    • H04L65/401 Support for services or applications wherein the services involve a main real-time session and one or more additional parallel real-time or time sensitive sessions, e.g. white board sharing or spawning of a subconference
    • H04L65/4015 Support for services or applications wherein the services involve a main real-time session and one or more additional parallel real-time or time sensitive sessions, e.g. white board sharing or spawning of a subconference where at least one of the additional parallel sessions is real time or time sensitive, e.g. white board sharing, collaboration or spawning of a subconference
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/50 Network services
    • H04L67/52 Network services specially adapted for the location of the user terminal
    • H04W WIRELESS COMMUNICATION NETWORKS
    • H04W4/00 Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/02 Services making use of location information
    • H04W4/023 Services making use of location information using mutual or relative location information between multiple location based services [LBS] targets or of distance thresholds

Definitions

  • the present invention relates to a communication system and a corresponding method for handling voice and/or video calls when multiple audio or video transducers or terminals are potentially available for use in the call.
  • each end user first installs a client application onto a memory of his or her user terminal such that the client application is arranged for execution on a processor of that terminal.
  • the client application indicates a username of at least one other user (the callee) to the client application.
  • the client application can then control its respective terminal to access a database mapping usernames to IP addresses, and thus uses the indicated username to look up the IP address of the callee.
  • the database may be implemented using either a server or a peer-to-peer (P2P) distributed database, or a combination of the two.
  • Once the caller's client has retrieved the callee's IP address, it can then use the IP address to request establishment of a live voice and/or video stream between the caller and callee terminals via the Internet or other such packet-based network, thus establishing a call.
  • An authentication procedure is typically also required, which may involve the user providing credentials via the client to be centrally authenticated by a server, and/or may involve the exchange of authentication certificates between the two or more users' client applications according to a P2P type authentication scheme.
  • In such cases it may be necessary to consider how to coordinate the operation of the multiple transducers and/or multiple terminals when making or receiving a call, or rather how to best exploit these multiple resources to improve the user's experience of the communication system.
  • the present invention provides at least three different aspects, each relating to a communication system, terminal and client application.
  • the communication system is a packet-based communication system such as the Internet and the terminal and client are arranged to conduct the calls via the packet based network using a suitable packet-based protocol such as internet protocol (IP).
  • terminal and/or client application configured to receive an input from multiple different audio and/or video input transducers of the same terminal, to analyse said inputs in relation to one another, and based on said analysis to select at least one audio and/or video input transducer and/or output transducer of that terminal for use in conducting a voice or video call.
  • a communication system configured to receive an input from multiple different audio or video input transducers of different terminals of the same user, to analyse said inputs in relation to one another, and based on said analysis to select a suitable one of multiple instances of the client application running on the different terminals for use in conducting a voice or video call.
  • the different instances may be logged in with the same user identity.
  • a first user terminal installed with an instance of a client application configured to determine an availability of one or more other secondary user terminals installed with other instances of the client application, and to present the user with an option to select one of said other secondary terminals for use in conducting a voice or video call in conjunction with the first terminal.
  • the first, second and third aspects of the invention may be used either independently or in combination.
  • a method comprising: providing a packet-based communication system for conducting voice or video calls over a packet-based network; and providing an instance of a client application enabling a first user terminal to access the packet-based communication system, the client application being configured so as when executed on the first terminal to receive an input from one or more audio and/or video input transducers of the first terminal, and to operate in conjunction with one or more other instances of the client application executed on one or more respective second terminals so as to participate in an analysis of said one or more inputs in relation to an input from one or more audio and/or video input transducers of the one or more second terminals; thereby enabling selection of one of the first and second terminals, based on said analysis, for use by a near-end user in conducting a voice or video call with a far-end user of a third user terminal via the respective client instance and packet-based communication system.
  • the analysis may relate to a relative proximity of a user to the first and second terminals.
  • the analysis may comprise a comparison of the energy or power level of the audio input from audio input transducers of the first and second terminals.
  • the analysis may comprise a Fourier analysis applied to the audio or video inputs of the first and second terminals.
  • the analysis may comprise a voice recognition algorithm applied to the audio input from audio input transducers of the first and second terminals.
  • the analysis may comprise a facial recognition algorithm applied to the video input from video input transducers of the first and second terminals.
  • the analysis may comprise a motion recognition algorithm applied to the video input from video input transducers of the first terminal. Said selection may be made upon answering or initiating a call.
  • Said selection may be made during an ongoing call.
  • the client application may be configured to recognise voice commands for controlling the call, and said selection may be made based on the analysis of audio inputs received due to one or more voice commands.
  • the instance of the client application on the first terminal may be configured to determine a local selection of a most relevant input from one of a plurality of said input transducers of the first terminal, and said analysis may comprise comparing the local selection from the first terminal with a local selection from the one or more other instances on the respective one or more second terminals, said selection of one of the first and second terminals being based on the comparison of the selected local inputs.
  • the client application may be configured to perform an initial calibration process to determine relative input response properties of the different input transducers.
  • the instance of the client on the first user terminal may be configured to automatically discover a respective identity of each of the one or more second user terminals.
  • the instance of the client on the first user terminal may be configured to automatically discover a respective address of each of the one or more second user terminals for use in said analysis and/or call.
  • the instance of the client on the first user terminal may be configured to automatically discover a respective media capability of each of the one or more second user terminals for use in said call.
  • the instance of the client on the first user terminal may be configured to automatically discover a respective online status of each of the one or more second user terminals for the purpose of said analysis and/or call.
  • the method may comprise making involvement of the one or more second terminals in conjunction with the first user terminal conditional on an authorisation procedure.
  • a terminal or system comprising apparatus configured in accordance with any of the above features.
  • a computer program product comprising code embodied on a non-transient computer-readable medium and configured so as when executed on a processing apparatus to operate in accordance with any of the above features.
  • Figure 1 is a schematic representation of a communication network
  • Figure 2 is a schematic block diagram of a user terminal
  • Figure 3 is a schematic illustration of a headset.
  • FIG. 1 is a schematic diagram of a communication system implemented over a packet-based network such as the Internet 101 .
  • the communication system comprises respective end-user communication apparatus 103 for each of a plurality of users.
  • the communication apparatus 103 of each user is connected to or communicable with the Internet 101 via a suitable transceiver such as a wired or wireless modem.
  • Each communication apparatus 103 comprises at least one user terminal 102.
  • Each terminal 102 is installed with an instance of the client application for accessing the communication system and thereby establishing a live packet-based voice or video call with the client of another user running on another such terminal 102.
  • that user's respective communication apparatus 103 comprises an arrangement or collection of multiple terminals 102.
  • the communication apparatus 103 of one user comprises: a mobile handset type terminal 102a such as a mobile phone, a laptop computer 102b, a desktop computer 102c, and a television set or television with set-top box 102d.
  • Other types of terminal 102 that may be installed with a communication client include photo frames, tablets, car audio systems, printers, home control systems, cameras, or other such household appliances or end-user devices, etc.
  • Each of the multiple terminals 102a-102d of the same user is installed with a respective instance of the communication client application which the same user may be logged into concurrently, i.e. so the same user may be logged into multiple instances of the same client application on two or more different terminals 102a-102d simultaneously. This will be discussed in more detail below.
  • Each of the different end-user terminals 102a-102d of the same user may be provided with individual connections to the internet 101 and packet-based communication system, and/or some or all of those different terminals 102a-102d may connect via a common router 105 and thus form a local network such as a household network. Either way, it is envisaged that in certain preferred
  • some or all of the different terminals 102a-102d of the same user will be located at different points around the house, e.g. with the television 102d in the living room, the desktop 102c in the study, the laptop 102b open in the kitchen, and the handheld 102a at any other location the user may happen to find themselves (e.g. garden or WC).
  • a data store 104 in the form of either a server, a distributed peer-to-peer database, or a combination of the two.
  • a peer-to-peer database is distributed amongst a plurality of end-user terminals of a plurality of different users, typically including one or more users who are not actually participants of the call. However, this is not the only option and a central server can be used as an alternative or in addition. Either way, the data store 104 is connected so as to be accessible via the internet 101 to each of the client applications or instances of client applications running on each of the terminals 102 of each user's communication apparatus 103 .
  • the data store 104 is arranged to provide a mapping of usernames to IP addresses (or other such network addresses) so as to allow the client applications of different users to establish communication channels with one another over the Internet 101 (or other packet-based network) for the purpose of establishing voice or video calls, or indeed other types of communication such as instant messaging (IM) or voicemail.
  • the data store 104 may be arranged to map the same username (user ID) to all of those multiple instances but also to map a separate sub-identifier (sub-ID) to each particular individual instance.
  • the communication system is capable of distinguishing between the different instances whilst still maintaining a consistent identity for the user within the communication system.
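  • The user ID and sub-ID mapping described above can be sketched as a small in-memory lookup table. This is only an illustrative sketch: the class name `DataStore`, its method names, the sub-ID strings and the example addresses are all assumptions introduced here, and a real data store 104 could equally be a server or a distributed P2P database.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class DataStore:
    """Hypothetical in-memory stand-in for the data store 104."""

    # username (user ID) -> {sub-ID of a client instance -> network address}
    _instances: Dict[str, Dict[str, str]] = field(default_factory=dict)

    def register(self, username: str, sub_id: str, address: str) -> None:
        """Record that one client instance of `username` runs at `address`."""
        self._instances.setdefault(username, {})[sub_id] = address

    def lookup_all(self, username: str) -> Dict[str, str]:
        """Return every instance of the user, keyed by sub-ID."""
        return dict(self._instances.get(username, {}))

    def lookup(self, username: str, sub_id: str) -> Optional[str]:
        """Return the address of one particular instance, if known."""
        return self._instances.get(username, {}).get(sub_id)


# Example: one user identity, two concurrently logged-in instances.
store = DataStore()
store.register("alice", "laptop", "192.0.2.10")
store.register("alice", "tv", "192.0.2.11")
```

  • A caller would look up the single username, while the selection logic can address each instance separately through its sub-ID.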
  • FIG. 2 shows a schematic block diagram of an exemplary end-user terminal 102 according to embodiments of the present invention, which may correspond to any of those mentioned above.
  • the user terminal 102 comprises a memory 202 such as an internal or external hard drive or flash memory, and a processing apparatus 204 in the form of a single or multi core processor.
  • the memory 202 is installed with an instance of the communication client 206, is coupled to the processing apparatus 204, and is arranged such that the communication client 206 can be executed on the processing apparatus 204.
  • the terminal 102 also comprises a transceiver 220 for communicating data on the up and downlink to and from the client 206 via the Internet 101 or other such packet-based network, e.g.
  • the terminal 102 further comprises a plurality of AV transducers, e.g. an internal microphone 208, an internal speaker 210, an internal camera 212 and a screen 214.
  • the terminal 102 may then also comprise further AV transducers plugged into the main body of the terminal 102, e.g. an external or peripheral webcam 216 and a headset 218.
  • the headset 218 preferably comprises an earpiece or headphones 302 and microphone 304 integrated into the same unit.
  • the term AV transducer may be used herein to refer to any means of audio or video input or output.
  • Terminal is meant as a discrete unit of user equipment whereas a transducer is a component or peripheral of a given terminal. In some situations such as that of a handset and docking station the categorisation may not be immediately apparent, but for the purpose of this application a terminal is considered distinct if it executes its own instance of the communication client.
  • Each of the transducers 208-218 is operatively coupled to the processing apparatus 204 such that the client is able to receive input from any or all of the input transducers 208, 212, 216, 218 and supply outputs to any or all of the output transducers 210, 214, 218.
  • the terminal of Figure 2 is therefore notable in one respect in that it comprises multiple audio input transducers, multiple audio output transducers and/or multiple video input transducers, and that each of these is potentially available to the client application 206 for conducting voice or video calls over the packet-based network. Multiple video output transducers are also a possibility.
  • the client application 206 is configured so as when executed to receive an input from multiple different input transducers of the same terminal 102, to analyse the input signals from the different input transducers in relation to one another, and based on the analysis to select a suitable input transducer and/or output transducer of that terminal for use in conducting a call.
  • the instances of the client application 206 on the different terminals 102a-102d of the same user are configured so as when executed to operate in conjunction with one another, to thereby receive an input from input transducers of the different terminals 102a-102d, analyse the input signals from the different terminals 102a-102d in relation to one another, and select a suitable one of the multiple instances of the client application 206 running on different terminals 102a-102d for use in conducting a call.
  • the second aspect is concerned not just with the selection of a particular input or output transducer 208-218, but also with the routing of the voice and/or video stream of a call to and from a selected terminal 102a-102d.
  • the terminals 102a-102d together form one end of a call (the "near end") communicating with the client running on a further, third user terminal 102f (the "far end") via the Internet 101 or other such packet-based network.
  • the analysis applied to the inputs may include:
  • a voice recognition algorithm applied to the received audio signal from two or more audio input transducers from the same terminal 102 and/or from different terminals 102;
  • the client application 206 is arranged to perform one or more such analysis processes and, based on the said analysis, to select one or more of the following for use in conducting the packet-based voice or video call: an input audio transducer, an input video transducer, an output audio transducer, an output video transducer, and/or an instance of the client application running on one of multiple terminals of the same user. This process may be performed upon making a call, answering a call, and/or dynamically during an ongoing call.
  • the invention may advantageously be used in conjunction with a client 206 or terminal 102 capable of recognising voice activated commands, e.g. so that the user can control the client 206 vocally with commands such as "call "answer call", and "hang up".
  • the optimal microphone for use in making or answering a call may vary.
  • the client 206 may therefore analyse the inputs from two or more microphones 208, 304 of the same terminal in the same room to determine which input signal has the largest energy in the human vocal frequency range, e.g. when the user speaks to answer the call using a voice command, and then select that microphone to use in the call.
  • the audio inputs from the microphones may also determine a suitable audio output transducer, e.g. by selecting between a loudspeaker 210 and the headphones 302 of a headset 218 depending on which microphone is generating the most vocal energy.
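  • The comparison of vocal-range energy between microphones could look roughly like the following sketch. It uses a naive discrete Fourier transform from the standard library so that it stays self-contained. The 300 to 3400 Hz band (a conventional telephony voice band), the function names, the transducer labels and the synthetic test signals are all assumptions made for illustration, not details given in the text.

```python
import cmath
import math


def band_energy(samples, sample_rate, low_hz=300.0, high_hz=3400.0):
    """Sum of squared DFT magnitudes for bins inside [low_hz, high_hz]."""
    n = len(samples)
    energy = 0.0
    for k in range(1, n // 2):
        freq = k * sample_rate / n
        if low_hz <= freq <= high_hz:
            coeff = sum(
                x * cmath.exp(-2j * math.pi * k * i / n)
                for i, x in enumerate(samples)
            )
            energy += abs(coeff) ** 2
    return energy


def pick_microphone(inputs, sample_rate):
    """Return the name of the input with the most energy in the vocal band."""
    return max(inputs, key=lambda name: band_energy(inputs[name], sample_rate))


def tone(freq_hz, amplitude, n, sample_rate):
    """Synthetic sine wave used only to exercise the sketch."""
    return [amplitude * math.sin(2 * math.pi * freq_hz * i / sample_rate)
            for i in range(n)]


# Assumed pairing of each microphone with a co-located output transducer.
OUTPUT_FOR_MIC = {"internal_208": "loudspeaker_210", "headset_304": "headphones_302"}

RATE, N = 8000, 200
hum = tone(40, 2.0, N, RATE)  # loud low-frequency noise outside the vocal band
inputs = {
    "internal_208": [a + b for a, b in zip(tone(1000, 0.2, N, RATE), hum)],
    "headset_304": tone(1000, 1.0, N, RATE),
}
chosen_mic = pick_microphone(inputs, RATE)
chosen_output = OUTPUT_FOR_MIC[chosen_mic]
```

  • Restricting the comparison to the vocal band means the loud hum on the internal microphone does not outweigh the stronger speech-band signal at the headset.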
  • detection of a rustling or scrabbling sound at the headset microphone 304 may be taken as indicative of the user fumbling for their headset to answer an incoming call, and this can be used to select the headset 218 for audio input and/or output.
  • the audio inputs from the microphones may be used to switch between different audio or video output transducers during the call, e.g. if the user moves around the room or puts on or removes the headset.
  • the invention will have an application in situations where the user has different terminals 102a, 102b, etc. located in different places around the house.
  • an analysis of the energy levels from different microphones or cameras of different terminals may be used to determine the presence of the user in a particular room or the proximity to a particular terminal, and hence determine the best terminal for answering or making a call, or to switch between the terminals during an ongoing call as the user roams about the house.
  • Other techniques that can be used to detect the presence or proximity of a particular user include motion estimation to detect the presence of a suitably sized moving object (which may be taken as a human moving between rooms), a Fourier analysis to determine an overall colour property of an image or moving object (e.g. based on the assumption that the moving user wears the same colour clothes as they move between rooms), or a voice or facial recognition algorithm (to help distinguish between multiple people and/or background noise), or indeed any combination of these.
  • it will be necessary for the client instance 206 on at least one of the terminals to send information about the input received from its respective transducer(s) to one or more other terminals or other network elements for comparison.
  • both (or all) of the user's client instances 206 that are involved in the comparison are preferably logged into with the same username and have the concept of being in the same call.
  • communication between the instances and/or controller may be enabled by reference to the system of user IDs and sub-IDs mapped to IP addresses or other such network addresses by the data store 104.
  • the list of sub-IDs for each user allows the different client instances to be identified, and the mapping allows a client instance, server or other network element to determine the address of each terminal on which one or more other different instances is running.
  • the same mechanism may also be used to signal or negotiate the selection of the required instance for conducting the call.
  • communication set up may be enabled by maintaining a list of only the terminal identities rather than the corresponding client identities, the list being maintained on an accessible network element for the purpose of address look-up.
  • a list of all the different terminals 102a-102d may be maintained on an element of the local home network, 105, 102a-102d, in which case only the local network addresses and terminal identities need be maintained in the list, and a system of IDs and separate sub-IDs would then not necessarily be required.
  • the local list could be stored at each terminal 102a-102d or on a local server of the home network (not shown), and each client instance would be arranged to determine the necessary identities and addresses of the other instances' terminals by accessing the list over the local network.
  • the selection may be performed in a number of ways. For example, in one
  • all of the client instances 206 in question may be arranged to transmit their respective transducer input information (e.g. input energy levels, motion vectors or an FFT result) to a central server or other central control element, which could be implemented on a server along with the data store 104 or on an element of the local home network.
  • the central controller would then be arranged to run an algorithm to analyse the different received inputs in relation to one another and thereby select the instance of a particular terminal, e.g. 102b, to make or answer the call.
  • the controller then instructs the instances 206 on their behaviour (are they involved in the call or not and to what extent) using the chosen signalling mechanism.
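  • The central-controller variant described above can be sketched as a single function that receives one score per client instance and returns an instruction for each. The scalar score (for example an input energy level), the instruction strings "active" and "idle", and the function name are assumptions for illustration; the text leaves the algorithm and the signalling mechanism open.

```python
def select_instance(reports):
    """Pick the instance with the highest reported score.

    `reports` maps an instance identifier (e.g. a sub-ID) to a scalar
    score derived from its transducer input. Returns an instruction per
    instance: the winner handles the call, the others stay out of it.
    """
    if not reports:
        raise ValueError("no client instance reported any input")
    chosen = max(reports, key=reports.get)
    return {
        sub_id: ("active" if sub_id == chosen else "idle")
        for sub_id in reports
    }


# Example: the laptop 102b reports the strongest input.
instructions = select_instance({"102a": 0.1, "102b": 0.7, "102c": 0.3})
```

  • The same function could run on a designated master instance instead of a server, since it only needs the collected reports.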
  • the different client instances 206 may share the information directly between each other and either mutually negotiate the selected terminal according to some predetermined protocol or act under the control of one instance that has been designated as the master instance (e.g. by the user or by default).
  • the analysis of the transducer inputs may be used to switch between different instances of the client 206 running on different terminals 102a, 102b of the same user during an ongoing call, e.g. as the user walks between rooms (rather than just selecting an instance to initiate an outgoing call or answer an incoming call).
  • This may involve each client instance 206 periodically sampling or otherwise monitoring its respective input transducer or transducers 208, 212, 218 and sharing the monitored information throughout the call in a similar manner to that described above.
  • each client may only share new transducer input information in response to some detected event such as the input energy level or motion vector exceeding some threshold.
  • the controller or master instance may thus apply the selection algorithm to the received input information at multiple times throughout the call, in either a scheduled or event-based manner, in order to make an ongoing, dynamic selection as to which instance 206 on which terminal 102 should be used for the call.
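  • A minimal sketch of the event-based variant follows: an instance shares a sample only when it exceeds a threshold, and the controller re-runs the selection on every shared sample. The freshness window used to discard stale reports is an assumption added here so that an old loud reading does not pin the selection. The class name, parameters and example values are likewise hypothetical.

```python
class DynamicSelector:
    """Hypothetical controller that re-selects a terminal during a call."""

    def __init__(self, threshold, window):
        self.threshold = threshold  # minimum level worth sharing
        self.window = window        # seconds a report stays relevant (assumed)
        self.reports = {}           # sub_id -> (timestamp, level)
        self.selected = None

    def on_sample(self, sub_id, level, now):
        """Process one monitored sample and return the current selection."""
        if level <= self.threshold:
            # Below threshold: the instance would not share this sample.
            return self.selected
        self.reports[sub_id] = (now, level)
        fresh = {
            s: lvl for s, (t, lvl) in self.reports.items()
            if now - t <= self.window
        }
        self.selected = max(fresh, key=fresh.get)
        return self.selected


selector = DynamicSelector(threshold=0.5, window=5.0)
first = selector.on_sample("102d", 0.9, now=0.0)    # user near the television
second = selector.on_sample("102b", 0.2, now=1.0)   # too quiet, not shared
third = selector.on_sample("102b", 0.8, now=10.0)   # user now near the laptop
```

  • Timestamps are passed in explicitly so the behaviour is deterministic; a scheduled variant would simply call `on_sample` at fixed intervals.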
  • the switchover may be completed in a similar manner to known call forwarding techniques as described for example in US application no. 12/290232, publication no. US 2009-0136016, but with the call being transferred between different terminals of the same user based on different sub-IDs, rather than the call being transferred between different users based on different user IDs.
  • the client 206 on the initial terminal, e.g. 102b, may continue to receive the audio and/or video stream of the call but then route the audio and/or video stream onwards to the newly selected terminal, e.g. 102c (such that the endpoint of the call appears to be the same, 102b, from the perspective of the client on the other end of the call).
  • This latter option would be particularly applicable if the initial and new terminals 102b and 102c have a local wireless connection such as a wi-fi or Bluetooth connection available between them.
  • the type of transducer being selected is not necessarily the same as the type of transducer used to make the selection.
  • a microphone may indicate the presence of a user in a particular room or location and be used to select a suitably located camera, or a camera may indicate the presence of a user in a particular room or location and be used to select a suitably located microphone, etc.
  • the voice and video streams are routed to and from the same user terminals 102a, 102b, etc.
  • the selection algorithm running on the client instance 206 and/or central controller would then instruct the two streams to be routed to different terminals 102a, 102b.
  • the first and second aspects of the invention may be used either independently or in combination.
  • the instance of the client application 206 running on each of multiple terminals 102 to determine its own best local input transducer (e.g. highest audio input energy level or best Fourier analysis match), then for the different instances to compare their best local input results to find the best global input result.
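  • The two-stage comparison (best local input per terminal, then best input overall) can be sketched as follows. The scores are assumed to be comparable scalars such as audio input energy levels, and the function names and transducer labels are hypothetical.

```python
def local_best(transducer_scores):
    """Return (transducer, score) for the best input of one terminal."""
    name = max(transducer_scores, key=transducer_scores.get)
    return name, transducer_scores[name]


def global_best(terminals):
    """Compare each terminal's local winner and return (terminal, transducer)."""
    winners = {t: local_best(scores) for t, scores in terminals.items()}
    terminal = max(winners, key=lambda t: winners[t][1])
    return terminal, winners[terminal][0]


# Example: each client instance reports scores for its own transducers.
terminals = {
    "102b": {"mic_208": 0.3, "headset_304": 0.6},
    "102c": {"mic_208": 0.4},
}
best_terminal, best_transducer = global_best(terminals)
```

  • Only the local winners need to be exchanged between instances, which keeps the shared information small.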
  • an initial calibration phase may be useful, e.g. to determine the relative levels that the different microphones generate when the user is at different distances from those microphones. That is to say, the client 206 is preferably configured to determine the relative gains of the different microphones. For example upon initial installation the user may be asked to sit or stand at two or more predetermined distances from the terminal and speak at a fixed level, which the client 206 can then use for calibration.
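  • One way such a calibration could be used is sketched below: each microphone records the same fixed-level speech, the RMS level of each recording is divided by that of a reference microphone to give a relative gain, and later measurements are divided by that gain before being compared. The use of RMS, the choice of reference microphone and all names are assumptions for illustration.

```python
import math


def rms(samples):
    """Root-mean-square level of a block of samples."""
    return math.sqrt(sum(x * x for x in samples) / len(samples))


def relative_gains(calibration, reference):
    """Gain of each microphone relative to `reference` for the same speech."""
    ref_level = rms(calibration[reference])
    return {name: rms(samples) / ref_level
            for name, samples in calibration.items()}


def normalise(level, gain):
    """Remove a microphone's relative gain from a measured level."""
    return level / gain


# Calibration recordings of the same utterance (synthetic values).
calibration = {
    "internal_208": [0.5, -0.5, 0.5, -0.5],
    "headset_304": [1.0, -1.0, 1.0, -1.0],
}
gains = relative_gains(calibration, reference="internal_208")

# Later, a raw headset level of 0.8 is comparable to 0.4 on the internal mic.
headset_normalised = normalise(0.8, gains["headset_304"])
```

  • Repeating the recording at each predetermined distance would give one gain table per distance, which the sketch does not model.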
  • the client or system may be configured to use a default transducer and/or terminal for making or answering calls.
  • a primary terminal 102 running an instance of the communication client 206 is able to select the resources of one or more secondary terminals 102 installed with or running another instance of the client 206 to be used during a conversation.
  • the user may use the terminal 102 that provides the best interfaces for what they are doing. For example if the user is participating in a video call, he may use his TV 102d for receiving and transmitting video, while using a mobile device 102a for controlling the call and transmitting and receiving the audio stream.
  • a stereo system or speaker phone while he sends/receives video from a handheld or portable device like a photo-frame; he may even choose to send/receive a data file going directly from/to his NAS (network attached storage device).
  • the term “consume” is sometimes used to generally mean playing a live audio or video stream or storing a file transfer, i.e. using the stream for its ultimate purpose at its end-point destination.
  • the term “generate” may refer to capturing a live audio or video stream or retrieving a file transfer from memory for transmission over a network, i.e. at the origin of the stream.
  • the third aspect of the invention can be used to allow the second terminal to consume or generate at least one stream of the call whilst the first user terminal concurrently generates or consumes at least another stream of the call.
  • one of the streams may be a file transfer performed in conjunction with the voice or video call as part of the same session between the same near and far end users.
  • only live voice and/or video streams may be involved, e.g. with one terminal capturing the outgoing video stream and another terminal playing the incoming video stream, and/or with different terminals handling audio and video streams.
  • the client instances could be "network aware" and could be provided with an API enabled to facilitate not only the discovery of the different devices but also the easy transfer/usage of different media streams in a conversation from one end point to the next or the combination of two end points. This allows a user to configure how multi-device resources should be allocated for handling communication events.
  • This third aspect of the present invention advantageously allows a user to be presented with a list of available user terminals 102 and to select at least one secondary terminal 102 with the most appropriate capabilities to handle part of the call or a particular type of communication, for example a live video stream or file transfer.
  • a terminal 102 such as a mobile phone 102a installed with an instance of the client application 206 is arranged to discover other resources that are available on other such user terminals 102. The user may select to use the resources of one or more of the discovered terminals 102.
  • the terminal 102 that is used by a user to perform the selection will be referred to as a primary terminal.
  • Each selected terminal will be referred to as the secondary terminal.
  • In the case of an outgoing call the primary terminal is preferably the initiator of a call, and in the case of an incoming call the primary terminal is preferably the terminal used to answer the call.
  • the primary terminal is also preferably the terminal which controls the call, e.g. to choose when to terminate the call or activate other features.
  • the primary terminal may remain the master terminal for the purpose of controlling the call, or primary status could be transferred during the call to another terminal such as one of the secondary terminals.
  • (secondary) user terminal such as 102c may be of the same user as that on the primary terminal (i.e. logged in with the same user ID), or may be another terminal 102e borrowed from a different user (logged in with a different user ID). Either way, the primary and secondary terminals 102a-102e together form one end of the call (the "near end") communicating with the client running on a further, third user terminal 102f (the "far end") via the Internet 101 or other such packet-based network.
  • In order for one client instance 206 to use the resources of another client instance 206 it is desirable to require the primary client instance to be authorised to use the resources of the other client instance.
  • One client instance 206 can authorise another client instance, as follows.
  • One method is for instances of the client 206 that are logged on with the same username to be automatically authorised to share resources, e.g. the clients on terminals 102b and 102a.
  • If the terminals 102 are logged on using different usernames, it may be necessary for the client on one terminal such as 102b to authorise the client of another terminal 102e to use resources.
  • This may be configured locally at the terminal 102, for example the user may input the usernames of authorised clients 206, or select contacts from the contact list that are authorised to use the terminal resources.
  • this may be configured remotely, for example a primary client may send an authorisation request to a secondary client to allow the primary client to use the resources of the secondary client.
  • a client 206 may be configured to authorise any contact to use the resources of the terminal 102 - i.e. any contact that has been approved by the user to be included in a contact list.
  • the primary terminal 102b that initiates a VoIP conversation will have the capability to direct any media stream to any of the secondary terminals that it is authorized to use.
  • the media stream of the conversation will be directed to the designated secondary terminal 102e that the primary user selects.
  • the primary user may choose to move all the media streams to a terminal and/or designate an alternative terminal as the primary terminal.
  • Each client 206 that can act as a primary client is preferably configured with a mechanism for resource discovery for discovering the presence of other potential secondary terminals 102a, 102e, etc. and/or for discovering the media capability of the potential secondary terminals (what input and output transducers they have available).
  • Resources available to the primary client are presented in a list.
  • the list of available resources may indicate the terminal type (e.g. TV, printer) such that the user can select the most appropriate device to handle the communication event. For example the user may select a TV for a video call, a stereo system for a voice call, or a Network Attached Storage (NAS) device for a file transfer.
  • a user terminal 102 installed with a suitable client application 206 or suitable instance of the client application may be referred to in the following as an "enabled terminal".
  • a server stores the location of each terminal having an instance of the client 206.
  • the client is arranged to provide its location and terminal type/capabilities to the server.
  • the location could be defined as IP address, NAT or postal address input by the user.
  • the server is arranged to return a list of terminals proximate to that of the primary client in response to the primary client transmitting a "find suitable terminals" request to the server.
  • the server may provide a client 206 with a list of available terminals 102 enabled with the client 206 in response to receiving a configuration message from a client 206.
  • a client may provide a list of one or more usernames that are authorised to use the resources of the device on which the client is installed.
  • the user of the device may configure access to be limited to certain types of resource, e.g. audio output only, and may additionally control the times during which the resource may be accessed.
  • the server is arranged to provide each authorised client in the list with a message indicating which resources the client is authorised to use and when.
  • the server could instead be replaced with a distributed database for maintaining the list, or a combination of the two may be used.
  • the system of usernames and sub-identifiers may be used to distinguish between the different instances in a similar manner to that discussed above.
  • that is not essential and instead other means of listing the available terminals could be used, e.g. by listing only the terminal identity rather than the corresponding client identity, or in the case where the primary and secondary terminals are of different users (e.g. 102b and 102e) then the sub-identifier would not necessarily be needed.
  • the primary client is arranged to present to the user a list of terminals 102a, 102c, 102d enabled with the client 206 that are discovered on the local network; this may be in response to the user actuating a find 'Enabled Terminals' instruction or in response to the user connecting to the network.
  • Any IP enabled terminal that registers into a given network receives a unique IP address within that network.
  • As an enabled terminal joins it will broadcast a presence message to all enabled terminals in that network announcing a given username / ID and a list of authorized users that have rights to access its capabilities.
  • All the enabled terminals 102 that receive this message and have a common authorized user will reply back to authenticate themselves and establish a secure communication channel through which they will announce their IP addresses and available resources to the primary user.
  • the primary user will have rights to access the media interfaces of all the enabled terminals for which it has been authenticated.
  • Another method is to select resources from the contact list (i.e. a list of contacts approved by the user for communicating with that user, typically maintained via the user's own client 206).
  • resources of other terminals such as 102e may be indicated on the user's contact list.
  • each contact listed in the contact list will indicate the type of terminal 102 on which the client is executed. This may be determined automatically, for example the client 206 may detect the type of device by detection of the operating system.
  • the user may input the device type directly.
  • the device type may be presented in the contact list in the form of the contact name (e.g. 'John's TV'), the mood message, or an icon.
  • the online status of the other or secondary client application may be relevant, e.g. on other user terminal 102a, 102c or 102e. In order for the resources of an other or secondary terminal 102 to be accessed, it will likely be necessary for the device to be online at the time the
  • If the secondary terminal is not online the primary client should preferably be prevented from selecting the resources of the offline secondary device.
  • the availability of resources may be determined using the presence.
  • the client applications 206 are arranged to search for contacts' network addresses in a peer-to-peer network or server. Once the address is determined the client 206 sends a status request command to the specified address. If the contact is online, it responds with a reply reporting its status. If no reply is received, the contact is deemed to be offline. Presence requests are preferably sent periodically so that the status can be updated when necessary.
  • each client 206 is arranged to transmit "keep alive" messages periodically. If a keep alive message is not received from a client, the client is determined to be offline.
  • a further matter which may be relevant to the third embodiment is the primary and secondary client status.
  • the identity of the primary client is stored at the secondary device.
  • the client will behave as a secondary client for any calls or communication events associated with the primary client.
  • the secondary client will store the network location/username of the primary terminal.
  • the secondary client will handle media and call set up instructions from the primary client in accordance with predetermined rules configurable by the user. For example the user may configure the TV client to allow the mobile client to use the video input and output resources of the TV. If the TV receives a call set up request from a third party that identifies the mobile client as the primary client, the TV will only handle the video media during the call. In particular the TV client will not attempt to capture and transmit audio during the call. Furthermore the TV will not display a user interface to the user for controlling the call.
  • Call or communication set up can proceed as follows.
  • the following will be described with reference to the laptop or tablet style computer 102b as the primary terminal and the mobile handset type terminal 102a as the secondary terminal.
  • the client running on the primary terminal may be used to answer the call and direct the media stream to the desired selected secondary terminal 102a.
  • the audio and video streams of a video call are not necessarily generated or played out by the same user terminal 102, and nor are the received and outbound streams necessarily generated and played by the same terminal 102.
  • the primary terminal 102b can be used to generate and/or play out the audio stream whilst the secondary terminal 102a is used to generate and/or play out the video stream, or vice versa - i.e. the second terminal 102a handles whichever of the received and/or outbound audio and/or video streams is not handled by the primary terminal 102b.
  • Other streams such as file transfers forming part of the same session as the call may also be directed to a secondary terminal, e.g. 102c or 102e
  • the primary client on the primary terminal 102b may instruct the far end party (on the other end of the call) to route the relevant media stream directly to the secondary client on the secondary terminal 102a (e.g. by referring the far end terminal to the mapping of user IDs and Sub-IDs to addresses in the data store 104, or by sending the address of the secondary terminal 102a to the far end terminal directly).
  • all streams may be routed via the primary terminal 102b, with the primary client then routing any required streams onwards to and from the secondary terminal 102a (i.e. so from the perspective of the far end client and user terminal the primary terminal 102b is still the end point for all streams of the call, and the routing to or from the secondary terminal is handled solely by the client on the primary terminal 102b).
  • the secondary client will handle the call in response to instructions received from the client (e.g. end call). These instructions may be sent as IM messages.
  • the primary terminal may input call handling instructions such as 'increase volume', 'answer call', 'turn off webcam' or 'end call' using predetermined IM messages, recognised by the secondary client.
  • call set up may be handled by the server.
  • the server will provide the address of the TV to the far end node such that video data can be sent directly to the TV client. Again the TV client will be aware that the call is associated with the primary device and will accordingly only use the resources that are authorised for use.
  • the primary client or the server may determine the address of the secondary terminal from a list or database mapping addresses to terminal identities or user IDs and Sub-IDs, which may be implemented on a server, distributed database or local network element.
  • the mechanism for transmitting control signals and/or responses in the third aspect of the invention may be the same or a similar mechanism to that used to share information on transducer inputs according to the second aspect of the present invention, or in other embodiments different mechanisms may be used for the two different aspects.
  • a client 206 installed at any terminal 102 may present the user with a list of enabled terminals 102 which are installed with an instance of the client application 206.
  • the terminal on which the user selects a terminal becomes the primary terminal and the selected terminal becomes the secondary device.
  • a client 206 as either a primary client or a secondary client will depend on whether it has been selected for use as a secondary client. It will be appreciated that the above embodiments have been described only by way of example. Other variants or implementations may become apparent to a person skilled in the art given the disclosure herein. For example, the invention is not limited by any particular method of resource discovery or authorisation, and any of the above-described examples could be used, or indeed others. Further, any of the first, second and/or third aspects of the invention may be implemented either independently or in combination. Where reference is made to a server, this is not necessarily intended to limit to a discrete server unit housed within a single housing or located at a single site.
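By way of illustration only, the authorisation options described above (automatic authorisation for instances sharing a username, a locally configured list of authorised usernames, or authorising any approved contact, optionally limited to certain resource types) might be sketched as follows. The `SecondaryPolicy` class, its field names, the resource labels and `is_authorised` are hypothetical and are not taken from the application; applying the resource restriction to every requester, including the owner, is likewise an assumption.

```python
from dataclasses import dataclass, field


@dataclass
class SecondaryPolicy:
    """Hypothetical authorisation settings held by a secondary client."""
    owner: str                                            # username logged in on this terminal
    authorised_users: set = field(default_factory=set)    # usernames entered locally by the user
    contacts: set = field(default_factory=set)            # the owner's approved contact list
    allow_any_contact: bool = False                       # authorise every approved contact
    allowed_resources: set = field(
        default_factory=lambda: {"audio_out", "video_out"}
    )                                                     # resource types that may be shared


def is_authorised(policy: SecondaryPolicy, requester: str, resource: str) -> bool:
    """Return True if `requester` may use `resource` on the secondary terminal."""
    # Assumption: the resource restriction applies to every requester.
    if resource not in policy.allowed_resources:
        return False
    # Instances logged on with the same username are authorised automatically.
    if requester == policy.owner:
        return True
    # Usernames explicitly authorised at the terminal.
    if requester in policy.authorised_users:
        return True
    # Optionally, any contact approved by the user.
    return policy.allow_any_contact and requester in policy.contacts
```

For example, a TV client owned by "alice" that lists "bob" as authorised would accept a request from "bob" for "video_out" but reject one for a resource type that has not been shared.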

Abstract

There is provided an instance of a client application enabling a first user terminal to access a packet-based communication system to conduct voice or video calls over a packet-based network. The client application is configured to receive an input from one or more audio and/or video input transducers of the first terminal, and to operate in conjunction with one or more other instances of the client application executed on one or more respective second terminals so as to participate in an analysis of the one or more inputs in relation to an input from one or more audio and/or video input transducers of the one or more second terminals; thereby enabling selection of one of the first and second terminals for use by a near-end user in conducting a call with a far-end user of a third user terminal via the respective client instance and packet-based communication system.

Description

Communication System and Method
Field of the Invention
The present invention relates to a communication system and a corresponding method for handling voice and/or video calls when multiple audio or video transducers or terminals are potentially available for use in the call.
Background
Communication systems exist which allow a live voice and/or video call to be conducted between two or more end-user terminals over a packet-based network such as the Internet, using a packet-based protocol such as internet protocol (IP). This type of communication is sometimes referred to as "voice over IP" (VoIP) or "video over IP".
To use the communication system, each end user first installs a client application onto a memory of his or her user terminal such that the client application is arranged for execution on a processor of that terminal. To establish a call, one user (the caller) indicates a username of at least one other user (the callee) to the client application. When executed the client application can then control its respective terminal to access a database mapping usernames to IP addresses, and thus uses the indicated username to look up the IP address of the callee. The database may be implemented using either a server or a peer-to-peer (P2P) distributed database, or a combination of the two. Once the caller's client has retrieved the callee's IP address, it can then use the IP address to request establishment of a live voice and/or video stream between the caller and callee terminals via the Internet or other such packet-based network, thus establishing a call. An authentication procedure is typically also required, which may involve the user providing credentials via the client to be centrally authenticated by a server, and/or may involve the exchange of authentication certificates between the two or more users' client applications according to a P2P type authentication scheme.
In the simple case where each of the end users has only one client application installed on one terminal with only one microphone, one speaker, one webcam and one screen, then the handling of the call is relatively straightforward in this respect.
However, with the increasing prevalence of electronic devices capable of executing communication software, both around the home and in portable devices on the move, it is possible that the same end user may have multiple instances of the same client application installed on different terminals, and/or that a user may have an instance of the client application installed on a terminal with multiple means of audio and/or video input and/or output, i.e. multiple audio or video transducers. In such cases it may be necessary to consider how to coordinate the operation of the multiple transducers and/or multiple terminals when making or receiving a call, or rather how to best exploit these multiple resources to improve the user's experience of the communication system.
The matter has been explored to some extent in some preceding patent applications by the applicant: GB 1005386.6, US 12/843527 (GB 1005462.5), and GB 0919592.6. Further, there are some existing arrangements that provide a remote interface for a call. For example Bluetooth headsets provide an input/output interface that is remote from the phone that handles the call. DECT phones (Digital Enhanced Cordless Telephones) provide handsets that are remote from the base station. Nonetheless, the inventors believe there is scope to further improve the coordination between the operation of multiple audio or video transducers or terminals for the purpose of making or receiving packet-based calls.
Summary
The present invention provides at least three different aspects, each relating to a communication system, terminal and client application. The communication system is a packet-based communication system such as the Internet and the terminal and client are arranged to conduct the calls via the packet-based network using a suitable packet-based protocol such as internet protocol (IP).
According to a first aspect of the present invention, there is provided a communication system, terminal and/or client application configured to receive an input from multiple different audio and/or video input transducers of the same terminal, to analyse said inputs in relation to one another, and based on said analysis to select at least one audio and/or video input transducer and/or output transducer of that terminal for use in conducting a voice or video call.
According to a second aspect of the present invention, there is provided a communication system configured to receive an input from multiple different audio or video input transducers of different terminals of the same user, to analyse said inputs in relation to one another, and based on said analysis to select a suitable one of multiple instances of the client application running on the different terminals for use in conducting a voice or video call. The different instances may be logged in with the same user identity.
According to a third aspect of the present invention there is provided a first user terminal installed with an instance of a client application configured to determine an availability of one or more other secondary user terminals installed with other instances of the client application, and to present the user with an option to select one of said other secondary terminals for use in conducting a voice or video call in conjunction with the first terminal.
The first, second and third aspects of the invention may be used either independently or in combination.
According to the second aspect of the present invention, there may be provided a method comprising: providing a packet-based communication system for conducting voice or video calls over a packet-based network; and providing an instance of a client application enabling a first user terminal to access the packet-based communication system, the client application being configured so as when executed on the first terminal to receive an input from one or more audio and/or video input transducers of the first terminal, and to operate in conjunction with one or more other instances of the client application executed on one or more respective second terminals so as to participate in an analysis of said one or more inputs in relation to an input from one or more audio and/or video input transducers of the one or more second terminals; thereby enabling selection of one of the first and second terminals, based on said analysis, for use by a near-end user in conducting a voice or video call with a far-end user of a third user terminal via the respective client instance and packet-based communication system.
In embodiments the analysis may relate to a relative proximity of a user to the first and second terminals.
The analysis may comprise a comparison of the energy or power level of the audio input from audio input transducers of the first and second terminals.
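As a minimal sketch of such an energy comparison, the root-mean-square (RMS) level of one frame of audio from each terminal could be computed and the loudest terminal taken as the one the user is probably nearest to. The function names and the dictionary keyed by terminal label are hypothetical, and the sketch assumes the frames cover the same time window and that any difference in microphone gain has already been compensated.

```python
import math
from typing import Dict, Sequence


def rms_level(samples: Sequence[float]) -> float:
    """Root-mean-square level of one frame of audio samples."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def loudest_terminal(frames: Dict[str, Sequence[float]]) -> str:
    """Return the label of the terminal whose frame has the highest RMS level.

    Assumes `frames` is non-empty and that the frames are time-aligned and
    gain-compensated (both are assumptions of this sketch).
    """
    return max(frames, key=lambda name: rms_level(frames[name]))
```

A practical client would presumably average over several frames before switching, so that a brief noise near one terminal does not move the call; that smoothing is omitted here.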
The analysis may comprise a Fourier analysis applied to the audio or video inputs of the first and second terminals.
The analysis may comprise a voice recognition algorithm applied to the audio input from audio input transducers of the first and second terminals.
The analysis may comprise a facial recognition algorithm applied to the video input from video input transducers of the first and second terminals.
The analysis may comprise a motion recognition algorithm applied to the video input from video input transducers of the first terminal.
Said selection may be made upon answering or initiating a call.
Said selection may be made during an ongoing call.
The client application may be configured to recognise voice commands for controlling the call, and said selection may be made based on the analysis of audio inputs received due to one or more voice commands.
The instance of the client application on the first terminal may be configured to determine a local selection of a most relevant input from one of a plurality of said input transducers of the first terminal, and said analysis may comprise comparing the local selection from the first terminal with a local selection from the one or more other instances on the respective one or more second terminals, said selection of one of the first and second terminals being based on the comparison of the selected local inputs.
The client application may be configured to perform an initial calibration process to determine relative input response properties of the different input transducers.
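One way such a calibration could be used, sketched under stated assumptions: while the user speaks at a fixed level from a known position, the client records the level seen by each microphone, derives each microphone's gain relative to a chosen reference, and later divides those gains out so that levels from different microphones can be compared fairly. The function names and microphone labels below are hypothetical and do not come from the application.

```python
from typing import Dict


def relative_gains(calibration_levels: Dict[str, float], reference: str) -> Dict[str, float]:
    """Gain of each microphone relative to `reference`.

    `calibration_levels` holds the level each microphone measured while the
    user spoke at a fixed level from the same position.
    """
    ref_level = calibration_levels[reference]
    if ref_level <= 0:
        raise ValueError("reference microphone recorded no signal")
    return {mic: level / ref_level for mic, level in calibration_levels.items()}


def normalise(levels: Dict[str, float], gains: Dict[str, float]) -> Dict[str, float]:
    """Divide out each microphone's relative gain so levels are comparable.

    Microphones with a non-positive calibrated gain are skipped.
    """
    return {
        mic: level / gains[mic]
        for mic, level in levels.items()
        if gains.get(mic, 0.0) > 0
    }
```

Repeating the measurement at two or more predetermined distances would additionally allow a level-versus-distance relationship to be estimated per microphone; that step is not shown.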
The instance of the client on the first user terminal may be configured to automatically discover a respective identity of each of the one or more second user terminals. The instance of the client on the first user terminal may be configured to automatically discover a respective address of each of the one or more second user terminals for use in said analysis and/or call.
The instance of the client on the first user terminal may be configured to automatically discover a respective media capability of each of the one or more second user terminals for use in said call. The instance of the client on the first user terminal may be configured to automatically discover a respective online status of each of the one or more second user terminals for the purpose of said analysis and/or call.
The method may comprise making involvement of the one or more second terminals in conjunction with the first user terminal conditional on an authorisation procedure.
According to another aspect of the invention there may be provided a terminal or system comprising apparatus configured in accordance with any of the above features.
According to another aspect, there may be provided a computer program product comprising code embodied on a non-transient computer-readable medium and configured so as when executed on a processing apparatus to operate in accordance with any of the above features.
Brief Description of the Drawings
For a better understanding of the present invention and to show how it may be put into effect, reference will be made by way of example to the accompanying drawings in which:
Figure 1 is a schematic representation of a communication network,
Figure 2 is a schematic block diagram of a user terminal, and
Figure 3 is a schematic illustration of a headset.
Detailed Description of Preferred Embodiments
Figure 1 is a schematic diagram of a communication system implemented over a packet-based network such as the Internet 101. The communication system comprises respective end-user communication apparatus 103 for each of a plurality of users. The communication apparatus 103 of each user is connected to or communicable with the Internet 101 via a suitable transceiver such as a wired or wireless modem. Each communication apparatus 103 comprises at least one user terminal 102. Each terminal 102 is installed with an instance of the client application for accessing the communication system and thereby establishing a live packet-based voice or video call with the client of another user running on another such terminal 102.
Furthermore, in the case of at least one user of the communication system, that user's respective communication apparatus 103 comprises an arrangement or collection of multiple terminals 102. For example, in the illustrative embodiment of Figure 1 the communication apparatus 103 of one user comprises: a mobile handset type terminal 102a such as a mobile phone, a laptop computer 102b, a desktop computer 102c, and a television set or television with set-top box 102d. Other types of terminal 102 that may be installed with a communication client include photo frames, tablets, car audio systems, printers, home control systems, cameras, or other such household appliances or end-user devices, etc. Each of the multiple terminals 102a-102d of the same user is installed with a respective instance of the communication client application which the same user may be logged into concurrently, i.e. so the same user may be logged into multiple instances of the same client application on two or more different terminals 102a-102d simultaneously. This will be discussed in more detail below.
Each of the different end-user terminals 102a-102d of the same user may be provided with individual connections to the internet 101 and packet-based communication system, and/or some or all of those different terminals 102a-102d may connect via a common router 105 and thus form a local network such as a household network. Either way, it is envisaged that in certain preferred embodiments some or all of the different terminals 102a-102d of the same user will be located at different points around the house, e.g. with the television 102d in the living room, the desktop 102c in the study, the laptop 102b open in the kitchen, and the handheld 102a at any other location the user may happen to find themselves (e.g. garden or WC).
Also shown connected to the internet 101 is a data store 104 in the form of either a server, a distributed peer-to-peer database, or a combination of the two. A peer-to-peer database is distributed amongst a plurality of end-user terminals of a plurality of different users, typically including one or more users who are not actually participants of the call. However, this is not the only option and a central server can be used as an alternative or in addition. Either way, the data store 104 is connected so as to be accessible via the internet 101 to each of the client applications or instances of client applications running on each of the terminals 102 of each user's communication apparatus 103. The data store 104 is arranged to provide a mapping of usernames to IP addresses (or other such network addresses) so as to allow the client applications of different users to establish communication channels with one another over the Internet 101 (or other packet-based network) for the purpose of establishing voice or video calls, or indeed other types of communication such as instant messaging (IM) or voicemail.
In the case where the same user can be simultaneously logged in to multiple instances of the same client application on different terminals 102a-102d, in embodiments the data store 104 may be arranged to map the same username (user ID) to all of those multiple instances but also to map a separate sub-identifier (sub-ID) to each particular individual instance. Thus the communication system is capable of distinguishing between the different instances whilst still maintaining a consistent identity for the user within the communication system.
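The user ID and sub-ID mapping just described can be pictured as a two-level lookup. The following is a minimal in-memory sketch only: the `DataStore` class and its method names are hypothetical stand-ins for data store 104 (which the description says may be a server, a distributed database or both), the sub-ID strings are invented for illustration, and the addresses are drawn from the 192.0.2.0/24 documentation range.

```python
from collections import defaultdict
from typing import Dict, Optional


class DataStore:
    """Hypothetical in-memory stand-in for data store 104."""

    def __init__(self) -> None:
        # user ID -> {sub-ID -> network address}
        self._instances: Dict[str, Dict[str, str]] = defaultdict(dict)

    def register(self, user_id: str, sub_id: str, address: str) -> None:
        """Record the address of one client instance of a user."""
        self._instances[user_id][sub_id] = address

    def instances(self, user_id: str) -> Dict[str, str]:
        """All instances logged in under one username, keyed by sub-ID."""
        return dict(self._instances.get(user_id, {}))

    def lookup(self, user_id: str, sub_id: str) -> Optional[str]:
        """Address of one particular instance, or None if unknown."""
        return self._instances.get(user_id, {}).get(sub_id)


store = DataStore()
store.register("alice", "mobile-102a", "192.0.2.10")
store.register("alice", "tv-102d", "192.0.2.13")
```

A caller that knows only the username can enumerate every instance with `store.instances("alice")`, while a client that has already chosen a terminal can resolve that single instance with `store.lookup("alice", "tv-102d")`.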
Figure 2 shows a schematic block diagram of an exemplary end-user terminal 102 according to embodiments of the present invention, which may correspond to any of those mentioned above. The user terminal 102 comprises a memory 202 such as an internal or external hard drive or flash memory, and a processing apparatus 204 in the form of a single or multi core processor. The memory 202 is installed with an instance of the communication client 206, is coupled to the processing apparatus 204, and is arranged such that the communication client 206 can be executed on the processing apparatus 204. The terminal 102 also comprises a transceiver 220 for communicating data on the up and downlink to and from the client 206 via the Internet 101 or other such packet-based network, e.g. a wireless transceiver for wirelessly connecting to the Internet 101 via the wireless router 105. The terminal 102 further comprises a plurality of AV transducers e.g. an internal microphone 208, an internal speaker 210, an internal camera 212 and a screen 214. The terminal 102 may then also comprise further AV transducers plugged into the main body of the terminal 102, e.g. an external or peripheral webcam 216 and a headset 218. As shown in Figure 3 the headset 218 preferably comprises an earpiece or headphones 302 and microphone 304 integrated into the same unit. The term AV transducer may be used herein to refer to any means of audio or video input or output. Terminal is meant as a discrete unit of user equipment whereas a transducer is a component or peripheral of a given terminal. In some situations such as that of a handset and docking station the categorisation may not be immediately apparent, but for the purpose of this application a terminal is considered distinct if it executes its own instance of the communication client.
Each of the transducers 208-218 is operatively coupled to the processing apparatus 204 such that the client is able to receive input from any or all of the input transducers 208, 212, 216, 218 and supply outputs to any or all of the output transducers 210, 214, 218. The terminal of Figure 2 is therefore notable in one respect in that it comprises multiple audio input transducers, multiple audio output transducers and/or multiple video input transducers, and that each of these is potentially available to the client application 206 for conducting voice or video calls over the packet-based network. Multiple video output transducers are also a possibility.
According to a first aspect of the present invention, the client application 206 is configured so as when executed to receive an input from multiple different input transducers of the same terminal 102, to analyse the input signals from the different input transducers in relation to one another, and based on the analysis to select a suitable input transducer and/or output transducer of that terminal for use in conducting a call. According to a second aspect of the present invention, the instances of the client application 206 on the different terminals 102a-102d of the same user are configured so as when executed to operate in conjunction with one another, to thereby receive an input from input transducers of the different terminals 102a-102d, analyse the input signals from the different terminals 102a-102d in relation to one another, and select a suitable one of the multiple instances of the client application 206 running on different terminals 102a-102d for use in conducting a call. That is to say, the second aspect is concerned not just with the selection of a particular input or output transducer 208-218, but also with the routing of the voice and/or video stream of a call to and from a selected terminal 102a-102d. In this case the terminals 102a-102d together form one end of a call (the "near end") communicating with the client running on a further, third user terminal 102f (the "far end") via the Internet 101 or other such packet-based network.
In either case, the analysis applied to the inputs may include:
• a comparison of the energy or power level of the received audio signal from two or more audio input transducers from the same terminal 102 and/or from different terminals 102;
• comparison of a Fourier analysis of the input signal received from two or more different audio or video inputs from the same terminal and/or different terminals;
• a voice recognition algorithm applied to the received audio signal from two or more audio input transducers from the same terminal 102 and/or from different terminals 102;
• a facial recognition algorithm applied to the received video signal from two or more video input transducers from the same terminal 102 and/or from different terminals 102; and/or
• a motion recognition algorithm applied to the received video signal from two or more video input transducers from the same terminal 102 and/or from different terminals 102.
The client application 206 is arranged to perform one or more such analysis processes and, based on the said analysis, to select one or more of the following for use in conducting the packet-based voice or video call: an input audio transducer, an input video transducer, an output audio transducer, an output video transducer, and/or an instance of the client application running on one of multiple terminals of the same user. This process may be performed upon making a call, answering a call, and/or dynamically during an ongoing call. For example, in one embodiment the invention may advantageously be used in conjunction with a client 206 or terminal 102 capable of recognising voice activated commands, e.g. so that the user can control the client 206 vocally with commands such as "call", "answer call", and "hang up". Depending on the approximate location of a user within a room, or depending on whether or not the user is wearing his or her headset 218, the optimal microphone for use in making or answering a call may vary. The client 206 may therefore analyse the inputs from two or more microphones 208, 304 of the same terminal in the same room to determine which input signal has the largest energy in the human vocal frequency range, e.g. when the user speaks to answer the call using a voice command, and then select that microphone to use in the call.
The audio inputs from the microphones may also determine a suitable audio output transducer, e.g. by selecting between a loudspeaker 210 and the headphones 302 of a headset 218 depending on which microphone is generating the most vocal energy.
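A minimal sketch of the microphone comparison described above follows. It assumes that the "human vocal frequency range" can be approximated by the 300 to 3400 Hz telephony band, uses a plain discrete Fourier transform rather than any particular library, and pairs each microphone with an output transducer in a hypothetical table. All function and variable names are illustrative, and the test signals are synthetic tones standing in for speech and mains hum.

```python
import cmath
import math

# Assumption for this sketch: approximate the vocal range by the telephony band.
VOCAL_BAND_HZ = (300.0, 3400.0)

# Illustrative pairing of each microphone with the output transducer it implies.
OUTPUT_FOR_MIC = {
    "internal_mic_208": "loudspeaker_210",
    "headset_mic_304": "headphones_302",
}


def band_energy(samples, sample_rate, band=VOCAL_BAND_HZ):
    """Sum of squared DFT magnitudes over the bins that fall inside `band`."""
    n = len(samples)
    low_hz, high_hz = band
    energy = 0.0
    for k in range(n // 2 + 1):
        if low_hz <= k * sample_rate / n <= high_hz:
            coeff = sum(
                x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(samples)
            )
            energy += abs(coeff) ** 2
    return energy


def select_microphone(inputs, sample_rate):
    """Return the name of the input with the most energy in the vocal band."""
    return max(inputs, key=lambda name: band_energy(inputs[name], sample_rate))


def tone(freq_hz, amplitude, sample_rate, n):
    """Synthetic sine wave used here in place of a real microphone capture."""
    return [
        amplitude * math.sin(2 * math.pi * freq_hz * t / sample_rate)
        for t in range(n)
    ]


SAMPLE_RATE = 8000
N = 200
speech_near = tone(1000, 1.0, SAMPLE_RATE, N)  # user speaking into the headset
speech_far = tone(1000, 0.2, SAMPLE_RATE, N)   # same voice, fainter at the internal mic
hum = tone(40, 1.0, SAMPLE_RATE, N)            # loud low-frequency noise, outside the band

inputs = {
    "internal_mic_208": [s + h for s, h in zip(speech_far, hum)],
    "headset_mic_304": speech_near,
}
chosen_mic = select_microphone(inputs, SAMPLE_RATE)
chosen_output = OUTPUT_FOR_MIC[chosen_mic]
```

Restricting the comparison to the vocal band means the internal microphone is not favoured merely because it picks up loud low-frequency noise; in this example the headset microphone is selected and the headphones follow from it.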
In another example, detection of a rustling or scrabbling sound at the headset microphone 304 may be taken as indicative of the user fumbling for their headset to answer an incoming call, and this can be used to select the headset 218 for audio input and/or output.
In yet another embodiment, the audio inputs from the microphones may be used to switch between different audio or video output transducers during the call, e.g. if the user moves around the room or puts on or removes the headset.
In an example of the second aspect of the invention, it is envisaged that the invention will have an application in situations where the user has different terminals 102a, 102b, etc. located in different places around the house. In this case an analysis of the energy levels from different microphones or cameras of different terminals may be used to determine the presence of the user in a particular room or the proximity to a particular terminal, and hence determine the best terminal for answering or making a call, or to switch between the terminals during an ongoing call as the user roams about the house. Other techniques that can be used to detect the presence or proximity of a particular user include motion estimation to detect the presence of a suitably sized moving object (which may be taken as a human moving between rooms), a Fourier analysis to determine an overall colour property of an image or moving object (e.g. based on the assumption that the moving user wears the same colour clothes as they move between rooms), or a voice or facial recognition algorithm (to help distinguish between multiple people and/or background noise), or indeed any combination of these.
In the case of the second aspect of the invention, it will be necessary for the client instance 206 on at least one of the terminals to send information about the input received from its respective transducer(s) to one or more other terminals or other network elements for comparison. To achieve this, both (or all) of the user's client instances 206 that are involved in the comparison are preferably logged in with the same username and have the concept of being in the same call.
For example, communication between the instances and/or controller may be enabled by reference to the system of user IDs and sub-IDs mapped to IP addresses or other such network addresses by the data store 104. Thus the list of sub-IDs for each user allows the different client instances to be identified, and the mapping allows a client instance, server or other network element to determine the address of each terminal on which one or more other different instances is running. In this manner it is possible to establish communications between one client and another or between the client and a server or other network element for the purpose of sharing information on the input signals from the audio and/or video input transducers, e.g. to share input energy levels, motion vectors or FFT results. The same mechanism may also be used to signal or negotiate the selection of the required instance for conducting the call.
Alternatively, communication set up may be enabled by maintaining a list of only the terminal identities rather than the corresponding client identities, the list being maintained on an accessible network element for the purpose of address look-up. For example a list of all the different terminals 102a-102d may be maintained on an element of the local home network, 105, 102a-102d, in which case only the local network addresses and terminal identities need be maintained in the list, and a system of IDs and separate sub-IDs would then not necessarily be required. The local list could be stored at each terminal 102a-102d or on a local server of the home network (not shown), and each client instance would be arranged to determine the necessary identities and addresses of the other instances' terminals by accessing the list over the local network.
Once a suitable mechanism has been put in place for identifying the different client instances and the addresses of their respective terminals 102a-102d, the selection may be performed in a number of ways. For example, in one implementation all of the client instances 206 in question may be arranged to transmit their respective transducer input information (e.g. input energy levels, motion vectors or an FFT result) to a central server or other central control element, which could be implemented on a server along with the data store 104 or on an element of the local home network. The central controller would then be arranged to run an algorithm to analyse the different received inputs in relation to one another and thereby select the instance of a particular terminal, e.g. 102b, to make or answer the call. The controller then instructs the instances 206 on their behaviour (are they involved in the call or not and to what extent) using the chosen signalling mechanism. In another implementation, the different client instances 206 may share the information directly between each other and either mutually negotiate the selected terminal according to some predetermined protocol or act under the control of one instance that has been designated as the master instance (e.g. by the user or by default).
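The central-controller variant can be sketched as follows. This is a hypothetical illustration only: the `Report` structure, the use of a single energy figure per instance, the `min_energy` cut-off and the instruction strings are assumptions, and the fallback to a default instance reflects the default behaviour mentioned elsewhere in the description.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Report:
    sub_id: str    # which client instance sent the report
    energy: float  # e.g. locally measured input energy level


def select_instance(reports, default_sub_id, min_energy=0.0):
    """Pick the instance reporting the highest energy, or the default if none qualifies."""
    candidates = [r for r in reports if r.energy > min_energy]
    if not candidates:
        return default_sub_id
    return max(candidates, key=lambda r: r.energy).sub_id


def instructions(reports, selected):
    """Tell every reporting instance whether it is involved in the call."""
    return {
        r.sub_id: ("handle_call" if r.sub_id == selected else "stand_by")
        for r in reports
    }


reports = [Report("102a", 0.1), Report("102b", 0.7), Report("102c", 0.3)]
selected = select_instance(reports, default_sub_id="102a", min_energy=0.05)
orders = instructions(reports, selected)
```

The same two functions could equally run on a designated master instance rather than on a server, since they depend only on the shared reports.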
As mentioned, in some embodiments the analysis of the transducer inputs may be used to switch between different instances of the client 206 running on different terminals 102a, 102b of the same user during an ongoing call, e.g. as the user walks between rooms (rather than just selecting an instance to initiate an outgoing call or answer an incoming call). This may involve each client instance 206 periodically sampling or otherwise monitoring its respective input transducer or transducers 208, 212, 218 and sharing the monitored information throughout the call in a similar manner to that described above. Alternatively each client may only share new transducer input information in response to some detected event such as the input energy level or motion vector exceeding some threshold. Either way, the controller or master instance may thus apply the selection algorithm to the received input information at multiple times throughout the call, in either a scheduled or event-based manner, in order to make an ongoing, dynamic selection as to which instance 206 on which terminal 102 should be used for the call. In one implementation of the dynamic switching case, once the desired instance has been identified as the selected endpoint for the call, then the switchover may be completed in a similar manner to known call forwarding techniques as described for example in US application no. 12/290232, publication no. US 2009-0136016, but with the call being transferred between different terminals of the same user based on different sub-IDs, rather than the call being transferred between different users based on different user IDs. In another implementation, the client 206 on the initial terminal, e.g. 102b, may continue to receive the audio and/or video stream of the call but then route the audio and/or video stream onwards to the newly selected terminal, e.g. 102c (such that the endpoint of the call appears to be the same, 102b, from the perspective of the client on the other end of the call). This latter option would be particularly applicable if the initial and new terminals 102b and 102c have a local wireless connection such as a wi-fi or Bluetooth connection available between them.
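The scheduled form of the dynamic selection might look like the sketch below, in which the selection is re-run at each sampling instant of the call. The switching margin is an assumption added here to avoid rapid flapping between terminals of similar level; the description does not specify such a margin, and the names and energy values are illustrative.

```python
def track_active_terminal(samples, initial, margin=1.5):
    """Re-run the selection per sample; switch only when another terminal
    exceeds the active one by the (assumed) factor `margin`."""
    active = initial
    history = []
    for energies in samples:
        best = max(energies, key=energies.get)
        if best != active and energies[best] > margin * energies.get(active, 0.0):
            active = best
        history.append(active)
    return history


# The user walks from the room containing 102b towards the room containing 102c.
samples = [
    {"102b": 9.0, "102c": 1.0},
    {"102b": 5.0, "102c": 4.0},
    {"102b": 3.0, "102c": 4.0},
    {"102b": 1.0, "102c": 8.0},
]
history = track_active_terminal(samples, initial="102b")
```

At the third sample 102c is already slightly louder, but the call only moves once the difference is clear at the fourth sample.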
In both the first and second aspects of the invention, note that the type of transducer being selected is not necessarily the same as the type of transducer used to make the selection. E.g. a microphone may indicate the presence of a user in a particular room or location and be used to select a suitably located camera, or a camera may indicate the presence of a user in a particular room or location and be used to select a suitably located microphone, etc.
Further, it is not necessarily the case that the voice and video streams are routed to and from the same user terminals 102a, 102b, etc. In embodiments it could be possible to determine that, say, a television 102d is best placed to display video to the user whilst a laptop 102b or mobile terminal 102a is best placed to handle the audio part of the call. The selection algorithm running on the client instance 206 and/or central controller would then instruct the two streams to be routed to different terminals 102a, 102b.
The first and second aspects of the invention may be used either independently or in combination. For example, it would be possible for the instance of the client application 206 running on each of multiple terminals 102 to determine its own best local input transducer (e.g. highest audio input energy level or best Fourier analysis match), then for the different instances to compare their best local input results to find the best global input result.
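The combination of the two aspects amounts to a two-stage comparison, which can be sketched as follows. The measure used (a single energy value per transducer) and all names are assumptions for illustration.

```python
def best_local(inputs):
    """First aspect: each instance picks its own best transducer."""
    name = max(inputs, key=inputs.get)
    return name, inputs[name]


def best_global(terminals):
    """Second aspect: instances compare only their best local results."""
    local = {terminal: best_local(inputs) for terminal, inputs in terminals.items()}
    terminal = max(local, key=lambda t: local[t][1])
    return terminal, local[terminal][0]


terminals = {
    "102a": {"internal_mic": 0.2},
    "102b": {"internal_mic": 0.3, "headset_mic": 0.6},
}
winner = best_global(terminals)
```

Only one value per terminal needs to be shared over the network in this arrangement, rather than one per transducer.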
In either case, an initial calibration phase may be useful, e.g. to determine the relative levels that the different microphones generate when the user is at different distances from those microphones. That is to say, the client 206 is preferably configured to determine the relative gains of the different microphones. For example upon initial installation the user may be asked to sit or stand at two or more predetermined distances from the terminal and speak at a fixed level, which the client 206 can then use for calibration.
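One possible reading of the calibration phase is sketched below: the levels recorded at the predetermined distances are averaged per microphone and expressed relative to a reference microphone, and later readings are divided by that relative gain before being compared. The averaging scheme, the function names and the numbers are assumptions, not details given in the description.

```python
from statistics import mean


def relative_gains(calibration, reference):
    """Gain of each microphone relative to `reference`, from calibration levels."""
    ref_level = mean(calibration[reference])
    return {mic: mean(levels) / ref_level for mic, levels in calibration.items()}


def normalise(raw_levels, gains):
    """Remove each microphone's relative gain so that levels are comparable."""
    return {mic: level / gains[mic] for mic, level in raw_levels.items()}


# Levels recorded while the user spoke at a fixed level from two distances.
calibration = {
    "internal_mic_208": [0.5, 0.25],
    "headset_mic_304": [1.0, 0.5],
}
gains = relative_gains(calibration, reference="internal_mic_208")
corrected = normalise({"internal_mic_208": 0.4, "headset_mic_304": 0.6}, gains)
closest = max(corrected, key=corrected.get)
```

Here the headset microphone reports the larger raw level, but after its higher gain is removed the internal microphone turns out to be receiving the stronger signal.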
In absence of a clear determination in any of the above examples, the client or system may be configured to use a default transducer and/or terminal for making or answering calls.
According to a third aspect of the present invention, a primary terminal 102 running an instance of the communication client 206 is able to select the resources of one or more secondary terminals 102 installed with or running another instance of the client 206 to be used during a conversation. The user may use the terminal 102 that provides the best interfaces for what they are doing. For example if the user is participating in a video call, he may use his TV 102d for receiving and transmitting video, while using a mobile device 102a for controlling the call and transmitting and receiving the audio stream. Similarly if he wanted to maintain a conversation while working in his living room he may direct the call to a stereo system or speaker phone while he sends/receives video from a handheld or portable device like a photo-frame. He may even choose to send/receive a data file going directly from/to his NAS (network attached storage device).
The term "consume" is sometimes used to generally mean playing a live audio or video stream or storing a file transfer, i.e. using the stream for its ultimate purpose at its end-point destination. Similarly the term "generate" may refer to capturing a live audio or video stream or retrieving a file transfer from memory for transmission over a network, i.e. at the origin of the stream. The third aspect of the invention can be used to allow the second terminal to consume or generate at least one stream of the call whilst the first user terminal concurrently generates or consumes at least another stream of the call. As mentioned, in embodiments one of the streams may be a file transfer performed in conjunction with the voice or video call as part of the same session between the same near and far end users. In other embodiments only live voice and/or video streams may be involved, e.g. with one terminal capturing the outgoing video stream and another terminal playing the incoming video stream, and/or with different terminals handling audio and video streams.
In one example, the client instances could be "network aware" and could be provided with an API enabled to facilitate not only the discovery of the different devices but also the easy transfer/usage of different media streams in a conversation from one end point to the next or the combination of two end points. This allows a user to configure how multi device resources should be allocated for handling communication events.
This third aspect of the present invention advantageously allows a user to be presented with a list of available user terminals 102 and to select at least one secondary terminal 102 with the most appropriate capabilities to handle part of the call or a particular type of communication, for example a live video stream or file transfer. According to an embodiment of the invention, a terminal 102 such as a mobile phone 102a installed with an instance of the client application 206 is arranged to discover other resources that are available on other such user terminals 102. The user may select to use the resources of one or more of the discovered terminals 102.
The selection of "resources" could refer to selecting a particular other (secondary) user terminal 102, or selecting a particular audio or video output transducer 208-218 of a particular other user terminal 102.
The terminal 102 that is used by a user to perform the selection will be referred to as a primary terminal. Each selected terminal will be referred to as the secondary terminal. In the case of an outgoing call the primary terminal is preferably the initiator of a call, and in the case of an incoming call the primary terminal is preferably the terminal used to answer the call. The primary terminal is also preferably the terminal which controls the call, e.g. to choose when to terminate the call or activate other features. The primary terminal may remain the master terminal for the purpose of controlling the call, or primary status could be transferred during the call to another terminal such as one of the secondary terminals.
A similar terminology may be used to describe the primary and secondary clients running on the primary and secondary terminals respectively. According to the third aspect of the invention, the client 206 on the other (secondary) user terminal such as 102c may be of the same user as that on the primary terminal (i.e. logged in with the same user ID), or may be another terminal 102e borrowed from a different user (logged in with a different user ID). Either way, the primary and secondary terminals 102a-102e together form one end of the call (the "near end") communicating with the client running on a further, third user terminal 102f (the "far end") via the Internet 101 or other such packet-based network.
In order for one client instance 206 to use the resources of another client instance 206 it is desirable to require the primary client instance to be authorised to use the resources of the other client instance. There are different methods by which one client instance 206 can authorise another client instance, as follows.
One method is for instances of the client 206 that are logged on with the same username to be automatically authorised to share resources, e.g. the clients on terminals 102b and 102a.
In another authorisation method, if the terminals 102 are logged on using different usernames, it may be necessary for the client on one terminal such as 102b to authorise the client of another terminal 102e to use resources. This may be configured locally at the terminal 102, for example the user may input the usernames of authorised clients 206, or select contacts from the contact list that are authorised to use the terminal resources. Alternatively this may be configured remotely, for example a primary client may send an authorisation request to a secondary client to allow the primary client to use the resources of the secondary client.
Alternatively a client 206 may be configured to authorise any contact to use the resources of the terminal 102 - i.e. any contact that has been approved by the user to be included in a contact list.
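The three authorisation methods described above (same username, an explicitly configured list, or any approved contact) can be combined into a single check, sketched below. The class and field names are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class SecondaryClient:
    username: str
    authorised_users: set = field(default_factory=set)  # configured locally or on request
    contacts: set = field(default_factory=set)          # the user's approved contact list
    allow_any_contact: bool = False

    def authorises(self, primary_username):
        """Decide whether a primary client may use this terminal's resources."""
        if primary_username == self.username:
            return True  # same user: authorised automatically
        if primary_username in self.authorised_users:
            return True  # different user, explicitly authorised
        return self.allow_any_contact and primary_username in self.contacts


tv = SecondaryClient("bob", authorised_users={"alice"}, contacts={"alice", "carol"})
open_tv = SecondaryClient("bob", contacts={"carol"}, allow_any_contact=True)
```

With these settings `tv` accepts "bob" and "alice" but not "carol", whereas `open_tv` also accepts "carol" because she is on the contact list.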
As such the primary terminal 102b that initiates a VoIP conversation will have the capability to direct any media stream to any of the secondary terminals that it is authorised to use. The media stream of the conversation will be directed to the designated secondary terminal 102e that the primary user selects. The primary user may choose to move all the media streams to a terminal and/or designate an alternative terminal as the primary terminal.
Each client 206 that can act as a primary client, e.g. on primary terminal 102b, is preferably configured with a mechanism for resource discovery for discovering the presence of other potential secondary terminals 102a, 102e, etc. and/or for discovering the media capability of the potential secondary terminals (what input and output transducers they have available). Resources available to the primary client are presented in a list. The list of available resources may indicate the terminal type (e.g. TV, printer) such that the user can select the most appropriate device to handle the communication event. For example the user may select a TV for a video call, a stereo system for a voice call, or a Network Attached Storage (NAS) device for a file transfer.
The available resources of other terminals installed with instances of the client 206 may be discovered using a number of alternative methods, for example as follows. A user terminal 102 installed with a suitable client application 206 or suitable instance of the client application may be referred to in the following as an "enabled terminal".
One such method is server assisted resource discovery. In one embodiment of the invention a server stores the location of each terminal having an instance of the client 206. When a user logs in, the client is arranged to provide its location and terminal type/capabilities to the server. The location could be defined as IP address, NAT or postal address input by the user. In this embodiment of the invention the server is arranged to return a list of terminals proximate to that of the primary client in response to the primary client transmitting a 'find suitable terminals' request to the server.
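A sketch of server assisted discovery is given below. The description does not define what counts as "proximate", so this example simply assumes that two terminals are proximate when they reported the same location string; the class names, the `find_suitable_terminals` signature and the capability labels are likewise hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Registration:
    terminal: str            # e.g. "102d"
    location: str            # as reported at login (address form is up to the client)
    terminal_type: str       # e.g. "TV"
    capabilities: frozenset  # e.g. {"video_out"}


class DiscoveryServer:
    def __init__(self):
        self._registrations = {}

    def login(self, registration):
        # Each client reports its location and type/capabilities when logging in.
        self._registrations[registration.terminal] = registration

    def find_suitable_terminals(self, primary, capability):
        # Assumption: "proximate" means "registered with the same location".
        here = self._registrations[primary].location
        return sorted(
            r.terminal
            for r in self._registrations.values()
            if r.terminal != primary
            and r.location == here
            and capability in r.capabilities
        )


server = DiscoveryServer()
server.login(Registration("102b", "home", "laptop", frozenset({"audio_in", "audio_out"})))
server.login(Registration("102d", "home", "TV", frozenset({"video_in", "video_out"})))
server.login(Registration("102e", "office", "TV", frozenset({"video_out"})))
nearby_screens = server.find_suitable_terminals("102b", "video_out")
```

The office television is capable of video output but is excluded because it did not register at the same location as the primary terminal.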
Alternatively the server may provide a client 206 with a list of available terminals 102 enabled with the client 206 in response to receiving a configuration message from a client 206. In this case a client may provide a list of one or more usernames that are authorised to use the resources of the device on which the client is installed. The user of the device may configure access to be limited to certain types of resource, e.g. audio output only, and may additionally control the times during which the resource may be accessed. The server is arranged to provide each authorised client in the list with a message indicating which resources the client is authorised to use and when.
In either of the above options the server could instead be replaced with a distributed database for maintaining the list, or a combination of the two may be used. In the case where the primary and secondary terminals are of the same user, i.e. running clients logged in with the same username, the system of usernames and sub-identifiers may be used to distinguish between the different instances in a similar manner to that discussed above. However, that is not essential and instead other means of listing the available terminals could be used, e.g. by listing only the terminal identity rather than the corresponding client identity, or in the case where the primary and secondary terminals are of different users (e.g. 102b and 102e) then the sub-identifier would not necessarily be needed.
Another possible method is common local network device discovery. In an alternative embodiment the primary client is arranged to present to the user a list of terminals 102a, 102c, 102d enabled with the client 206 that are discovered on the local network; this may be in response to the user actuating a find 'Enabled Terminals' instruction or in response to the user connecting to the network. Any IP enabled terminal that registers into a given network receives a unique IP address within that network. As an enabled terminal joins it will broadcast a presence message to all enabled terminals in that network announcing a given username / ID and a list of authorised users that have rights to access its capabilities. All the enabled terminals 102 that receive this message and have a common authorised user will reply back to authenticate themselves and establish a secure communication channel through which they will announce their IP addresses and available resources to the primary user. The primary user will have rights to access the media interfaces of all the enabled terminals for which it has been authenticated.
Another method is to select resources from the contact list (i.e. a list of contacts approved by the user for communicating with that user, typically maintained via the user's own client 206). In this case resources of other terminals such as 102e may be indicated on the user's contact list. For example each contact listed in the contact list will indicate the type of terminal 102 on which the client is executed. This may be determined automatically, for example the client 206 may detect the type of device by detection of the operating system. Alternatively the user may input the device type directly. The device type may be presented in the contact list in the form of the contact name, e.g. 'John's TV', the mood message, or an icon.
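The local network discovery exchange, in which a joining terminal broadcasts a presence message and peers with a common authorised user reply, can be simulated in memory as follows. No real networking, authentication or encryption is performed; the class name, the addresses and the resource labels are illustrative assumptions.

```python
from dataclasses import dataclass


@dataclass
class EnabledTerminal:
    name: str
    ip: str
    authorised_users: frozenset  # users with rights to access this terminal
    resources: tuple


def discover(joining, network):
    """Simulate the presence broadcast: peers sharing an authorised user with
    the joining terminal reply with their IP address and resources."""
    found = {}
    for peer in network:
        if peer is joining:
            continue
        if joining.authorised_users & peer.authorised_users:
            found[peer.name] = (peer.ip, peer.resources)
    return found


mobile = EnabledTerminal("102a", "192.168.1.10", frozenset({"alice"}), ("audio_in", "audio_out"))
tv = EnabledTerminal("102d", "192.168.1.13", frozenset({"alice", "bob"}), ("video_in", "video_out"))
guest = EnabledTerminal("102e", "192.168.1.20", frozenset({"carol"}), ("audio_out",))
visible = discover(mobile, [mobile, tv, guest])
```

Only the television replies to the mobile terminal here, because the guest terminal has no authorised user in common with it.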
In both the second as well as the third aspects of the invention, the online status of the other or secondary client application may be relevant, e.g. on other user terminal 102a, 102c or 102e. In order for the resources of an other or secondary terminal 102 to be accessed, it will likely be necessary for the device to be online at the time the communication event is received. If the secondary terminal is not online the primary client should preferably be prevented from selecting the resources of the offline secondary device.
The availability of resources may be determined using the presence. In this case the client applications 206 are arranged to search for contacts' network addresses in a peer-to-peer network or server. Once the address is determined the client 206 sends a status request command to the specified address. If the contact is online, it responds with a reply reporting its status. If no reply is received, the contact is deemed to be offline. Presence requests are preferably sent periodically so that the status can be updated when necessary.
In an alternative embodiment, in the case where the secondary terminals 102 are located on the same local network, each client 206 is arranged to transmit "keep alive" messages periodically. If a keep alive message is not received from a client, the client is determined to be offline.
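The keep-alive variant can be sketched with a small tracker that records when each client was last heard from. The timeout value and the names are assumptions; timestamps are passed in explicitly so that the behaviour is easy to follow.

```python
class PresenceTracker:
    """Treat a client as offline once no keep-alive arrives within `timeout_s`."""

    def __init__(self, timeout_s):
        self.timeout_s = timeout_s
        self.last_seen = {}

    def keep_alive(self, client_id, now):
        # Record the arrival time of a periodic keep-alive message.
        self.last_seen[client_id] = now

    def is_online(self, client_id, now):
        seen = self.last_seen.get(client_id)
        return seen is not None and now - seen <= self.timeout_s


tracker = PresenceTracker(timeout_s=30.0)
tracker.keep_alive("102d", now=0.0)
tracker.keep_alive("102c", now=0.0)
tracker.keep_alive("102d", now=25.0)
online_at_40 = {cid: tracker.is_online(cid, now=40.0) for cid in ("102c", "102d", "102e")}
```

A primary client could consult such a tracker before listing a secondary terminal, so that offline devices are not offered for selection.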
A further matter which may be relevant to the third aspect is the primary and secondary client status.
When a client is selected as a secondary client, the identity of the primary client is stored at the secondary device. The client will behave as a secondary client for any calls or communication events associated with the primary client. For example, in response to receiving a selection request at the secondary terminal, the secondary client will store the network location/username of the primary terminal. The secondary client will handle media and call set up instructions from the primary client in accordance with predetermined rules configurable by the user. For example the user may configure the TV client to allow the mobile client to use the video input and output resources of the TV. If the TV receives a call set up request from a third party that identifies the mobile client as the primary client, the TV will only handle the video media during the call. In particular the TV client will not attempt to capture and transmit audio during the call. Furthermore the TV will not display a user interface to the user for controlling the call.
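The secondary-client behaviour described above can be illustrated with the sketch below, in which a terminal stores the identity of its primary client and then limits itself to the permitted resources for calls associated with that primary. The class names, the resource labels and the returned dictionary are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SecondaryRole:
    primary_id: str               # stored identity of the primary client
    allowed_resources: frozenset  # what the user permits that primary to use


class TerminalClient:
    def __init__(self, resources):
        self.resources = frozenset(resources)
        self.roles = {}  # primary_id -> SecondaryRole

    def select_as_secondary(self, primary_id, allowed):
        # On a selection request, remember the primary and the permitted resources.
        permitted = frozenset(allowed) & self.resources
        self.roles[primary_id] = SecondaryRole(primary_id, permitted)

    def handle_call_setup(self, primary_id):
        role = self.roles.get(primary_id)
        if role is None:
            # Not acting as a secondary for this call: behave as a normal client.
            return {"resources": self.resources, "show_call_ui": True}
        # Acting as secondary: only the permitted media, and no call-control UI.
        return {"resources": role.allowed_resources, "show_call_ui": False}


tv = TerminalClient({"video_in", "video_out", "audio_in", "audio_out"})
tv.select_as_secondary("mobile_102a", {"video_in", "video_out"})
as_secondary = tv.handle_call_setup("mobile_102a")
standalone = tv.handle_call_setup("someone_else")
```

For the call associated with the mobile client the television handles video only and shows no call-control interface, matching the example in the text.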
The above embodiments provide example mechanisms by which one or more user terminals can be selected for use in conducting a call or part of a call according to the second or third aspects of the invention. Once a suitable mechanism is put in place, call or communication set up can proceed as follows. By way of example, the following will be described with reference to the laptop or tablet style computer 102b as the primary terminal and the mobile handset type terminal 102a as the secondary terminal.
When a call is received at the primary terminal 102b, the client running on the primary terminal may be used to answer the call and direct the media stream to the desired selected secondary terminal 102a. Note that the audio and video streams of a video call are not necessarily generated or played out by the same user terminal 102, and nor are the received and outbound streams necessarily generated and played by the same terminal 102. Indeed, it is one advantageous use of the present invention that the primary terminal 102b can be used to generate and/or play out the audio stream whilst the secondary terminal 102a is used to generate and/or play out the video stream, or vice versa - i.e. the secondary terminal 102a handles whichever of the received and/or outbound audio and/or video streams is not handled by the primary terminal 102b. Other streams such as file transfers forming part of the same session as the call may also be directed to a secondary terminal, e.g. 102c or 102e.
In a preferred embodiment of the invention, the primary client on the primary terminal 102b may instruct the far end party (on the other end of the call) to route the relevant media stream directly to the secondary client on the secondary terminal 102a (e.g. by referring the far end terminal to the mapping of user IDs and Sub-IDs to addresses in the data store 104, or by sending the address of the secondary terminal 102a to the far end terminal directly). Alternatively however all streams may be routed via the primary terminal 102b, with the primary client then routing any required streams onwards to and from the secondary terminal 102a (i.e. so from the perspective of the far end client and user terminal the primary terminal 102b is still the end point for all streams of the call, and the routing to or from the secondary terminal is handled solely by the client on the primary terminal 102b).
In order to retain control of the call at the primary device, the secondary client will handle the call in response to instructions received from the primary client (e.g. end call). These instructions may be sent as IM messages. The primary terminal may issue call handling instructions such as 'increase volume', 'answer call', 'turn off webcam' or 'end call' using predetermined IM messages recognised by the secondary client.
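How a secondary client might recognise predetermined IM control messages is sketched below. The command strings are those given as examples in the text; the rule that only the stored primary client may issue them, the case-insensitive matching and the function name are assumptions.

```python
# Predetermined control messages, taken from the examples in the description.
COMMANDS = {"increase volume", "answer call", "turn off webcam", "end call"}


def handle_im(sender, text, primary_id):
    """Return the recognised command, or None if the IM is not a valid instruction."""
    command = text.strip().lower()
    if sender != primary_id or command not in COMMANDS:
        return None  # ordinary chat text, or not from the primary client
    return command


accepted = handle_im("laptop_102b", " End call ", primary_id="laptop_102b")
ignored_sender = handle_im("stranger", "end call", primary_id="laptop_102b")
ignored_text = handle_im("laptop_102b", "hello there", primary_id="laptop_102b")
```

Anything that is not a recognised instruction from the primary client is left to be treated as a normal instant message.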
In an alternative embodiment of the invention, call set up may be handled by the server. For example if the user has previously configured the system to send video to the television, the server will provide the address of the TV to the far end node such that video data can be sent directly to the TV client. Again the TV client will be aware that the call is associated with the primary device and will accordingly only use the resources that are authorised for use. Either way, in order to direct control signals to instruct the secondary terminal, the primary client or the server may determine the address of the secondary terminal from a list or database mapping addresses to terminal identities or user IDs and Sub-IDs, which may be implemented on a server, distributed database or local network element. In embodiments the mechanism for transmitting control signals and/or responses in the third aspect of the invention may be the same or a similar mechanism to that used to share information on transducer inputs according to the second aspect of the present invention, or in other embodiments different mechanisms may be used for the two different aspects.
Note that in preferred embodiments a client 206 installed at any terminal 102 may present the user with a list of enabled terminals 102 which are installed with an instance of the client application 206. In this case the terminal at which the user makes the selection becomes the primary terminal and the selected terminal becomes the secondary terminal.
The behaviour of a client 206 as either a primary client or a secondary client will depend on whether it has been selected for use as a secondary client.
It will be appreciated that the above embodiments have been described only by way of example. Other variants or implementations may become apparent to a person skilled in the art given the disclosure herein. For example, the invention is not limited by any particular method of resource discovery or authorisation, and any of the above-described examples could be used, or indeed others. Further, any of the first, second and/or third aspects of the invention may be implemented either independently or in combination. Where reference is made to a server, this is not necessarily intended to limit to a discrete server unit housed within a single housing or located at a single site. Further, where reference is made to an application, this is not necessarily intended to refer to a discrete, stand-alone, separately executable unit of software, but could alternatively refer to any portion of code such as a plug-in or add-on to an existing application. The invention is not limited by the described embodiments but only by the appended claims.

Claims

1. A method comprising:
providing a packet-based communication system for conducting voice or video calls over a packet-based network; and
providing an instance of a client application enabling a first user terminal to access the packet-based communication system, the client application being configured so as when executed on the first terminal to receive an input from one or more audio and/or video input transducers of the first terminal, and to operate in conjunction with one or more other instances of the client application executed on one or more respective second terminals so as to participate in an analysis of said one or more inputs in relation to an input from one or more audio and/or video input transducers of the one or more second terminals;
thereby enabling selection of one of the first and second terminals, based on said analysis, for use by a near-end user in conducting a voice or video call with a far-end user of a third user terminal via the respective client instance and packet-based communication system.
2. The method of claim 1, wherein the analysis relates to a relative proximity of a user to the first and second terminals.
3. The method of claim 1 or 2, wherein the analysis comprises a comparison of the energy or power level of the audio input from audio input transducers of the first and second terminals.
4. The method of any preceding claim, wherein the analysis comprises a Fourier analysis applied to the audio or video inputs of the first and second terminals.
5. The method of any preceding claim, wherein the analysis comprises a voice recognition algorithm applied to the audio input from audio input transducers of the first and second terminals.
6. The method of any preceding claim, wherein the analysis comprises a facial recognition algorithm applied to the video input from video input transducers of the first and second terminals.
7. The method of any preceding claim, wherein the analysis comprises a motion recognition algorithm applied to the video input from video input transducers of the first terminal.
8. The method of any preceding claim, wherein said selection is made upon answering or initiating a call.
9. The method of any preceding claim, wherein said selection is made during an ongoing call.
10. The method of any preceding claim, wherein the client application is configured to recognise voice commands for controlling the call, and said selection is made based on the analysis of audio inputs received due to one or more voice commands.
11. The method of any preceding claim, wherein the instance of the client application on the first terminal is configured to determine a local selection of a most relevant input from one of a plurality of said input transducers of the first terminal, and said analysis comprises comparing the local selection from the first terminal with a local selection from the one or more other instances on the respective one or more second terminals, said selection of one of the first and second terminals being based on the comparison of the selected local inputs.
12. The method of any preceding claim, wherein the client application is configured to perform an initial calibration process to determine relative input response properties of the different input transducers.
13. The method of any preceding claim, wherein the instance of the client on the first user terminal is configured to automatically discover a respective identity of each of the one or more second user terminals.
14. The method of any preceding claim, wherein the instance of the client on the first user terminal is configured to automatically discover a respective address of each of the one or more second user terminals for use in said analysis and/or call.
15. The method of any preceding claim, wherein the instance of the client on the first user terminal is configured to automatically discover a respective media capability of each of the one or more second user terminals for use in said call.
16. The method of any preceding claim, wherein the instance of the client on the first user terminal is configured to automatically discover a respective online status of each of the one or more second user terminals for the purpose of said analysis and/or call.
17. The method of any preceding claim, comprising making involvement of the one or more second terminals in conjunction with the first user terminal conditional on an authorisation procedure.
18. A client application comprising code embodied on a computer-readable medium and configured so as when executed on a first terminal to perform operations of:
accessing a packet-based communication system to conduct voice or video calls over a packet-based network;
receiving an input from one or more audio and/or video input transducers of the first terminal; and
operating in conjunction with one or more other instances of the client application executed on one or more respective second terminals so as to participate in an analysis of said one or more inputs in relation to an input from one or more audio and/or video input transducers of the one or more second terminals;
thereby enabling selection of one of the first and second terminals, based on said analysis, for use by a near-end user in conducting a voice or video call with a far-end user of a third user terminal via the respective client instance and packet-based communication system.
19. The client application of claim 18, wherein the code is further configured to perform operations in accordance with any of claims 2 to 17.
20. A first user terminal comprising:
a transceiver operable to access a packet-based communication system to conduct voice or video calls over a packet-based network;
a storage medium storing an instance of a client application enabling the first user terminal to access the packet-based communication system; and
processing apparatus arranged to execute the instance of the client application, the client application being configured so as when executed on the first terminal to receive an input from one or more audio and/or video input transducers of the first terminal, and to operate in conjunction with one or more other instances of the client application executed on one or more respective second terminals so as to participate in an analysis of said one or more inputs in relation to an input from one or more audio and/or video input transducers of the one or more second terminals;
thereby enabling selection of one of the first and second terminals, based on said analysis, for use by a near-end user in conducting a voice or video call with a far-end user of a third user terminal via the respective client instance and packet-based communication system.
21. The first user terminal of claim 20, wherein the client application is further configured to perform operations in accordance with any of claims 2 to 17.
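As an illustration of the kind of analysis recited in claim 3, the sketch below compares the energy level of audio captured at each terminal and picks the terminal with the strongest signal, on the assumption that a louder input suggests the user is closer. It is a simplified sketch, not the claimed method itself: the `rms_energy` and `select_terminal` helpers, the per-terminal `gains` (standing in for a calibration of the kind mentioned in claim 12) and the sample values are all hypothetical.

```python
import math


def rms_energy(samples):
    """Root-mean-square level of a block of audio samples."""
    if not samples:
        return 0.0
    return math.sqrt(sum(x * x for x in samples) / len(samples))


def select_terminal(inputs, gains=None):
    """Pick the terminal whose (optionally calibrated) audio input is strongest.

    inputs: dict mapping a terminal label to a list of audio samples.
    gains:  optional dict of per-terminal calibration factors (default 1.0).
    """
    if not inputs:
        raise ValueError("at least one terminal input is required")
    gains = gains or {}
    scores = {
        terminal: rms_energy(samples) * gains.get(terminal, 1.0)
        for terminal, samples in inputs.items()
    }
    return max(scores, key=scores.get)


captured = {
    "102a": [0.1, -0.1, 0.1, -0.1],  # quiet: user presumably further away
    "102b": [0.5, -0.5, 0.5, -0.5],  # loud: user presumably nearby
}
chosen = select_terminal(captured)
```

The calibration factors matter because microphones differ in sensitivity; without them a terminal with a more sensitive microphone could be selected even when the user is further from it.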
PCT/EP2011/074304 2010-12-31 2011-12-30 Communication system and method WO2012089832A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201180063497.6A CN103416023B (en) 2010-12-31 2011-12-30 Communication system and method
EP11802455.3A EP2649753B1 (en) 2010-12-31 2011-12-30 Communication system and method

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201061428952P 2010-12-31 2010-12-31
US61/428,952 2010-12-31

Publications (1)

Publication Number Publication Date
WO2012089832A1 (en) 2012-07-05

Family

ID=45420678

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2011/074304 WO2012089832A1 (en) 2010-12-31 2011-12-30 Communication system and method

Country Status (4)

Country Link
US (1) US10291660B2 (en)
EP (1) EP2649753B1 (en)
CN (1) CN103416023B (en)
WO (1) WO2012089832A1 (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB201005454D0 (en) 2010-03-31 2010-05-19 Skype Ltd Television apparatus
US8963982B2 (en) * 2010-12-31 2015-02-24 Skype Communication system and method
US10404762B2 (en) 2010-12-31 2019-09-03 Skype Communication system and method
US9717090B2 (en) 2010-12-31 2017-07-25 Microsoft Technology Licensing, Llc Providing notifications of call-related services
US9019336B2 (en) 2011-12-30 2015-04-28 Skype Making calls using an additional terminal
GB201301452D0 (en) 2013-01-28 2013-03-13 Microsoft Corp Providing notifications of call-related services
JP6561433B2 (en) * 2014-05-15 2019-08-21 ソニー株式会社 Method, system, terminal device, and server for realizing a function by operating a plurality of hardware elements in a coordinated manner
PH12016000086A1 (en) * 2016-03-01 2017-10-18 Bluewave Global Innovations Pte Ltd Converged communication device
US9749583B1 (en) * 2016-03-31 2017-08-29 Amazon Technologies, Inc. Location based device grouping with voice control
CN109286643A (en) * 2017-07-20 2019-01-29 西门子公司 The method and apparatus for reading the configuration parameter of an application example
US10616419B1 (en) * 2018-12-12 2020-04-07 Mitel Networks Corporation Devices, systems and methods for communications that include social media clients
EP3912312B1 (en) * 2019-01-15 2022-07-20 Telefonaktiebolaget Lm Ericsson (Publ) Providing communication services using sets of i/o devices
CN111818291B (en) * 2020-07-06 2022-07-12 北京字节跳动网络技术有限公司 Method and device for establishing multimedia call and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080069087A1 (en) 2006-09-07 2008-03-20 Technology, Patents & Licensing, Inc. VoIP Interface Using a Wireless Home Entertainment Hub
US20090177601A1 (en) * 2008-01-08 2009-07-09 Microsoft Corporation Status-aware personal information management
US20090175509A1 (en) 2008-01-03 2009-07-09 Apple Inc. Personal computing device control using face detection and recognition
US20090280789A1 (en) 2005-06-01 2009-11-12 Sanyo Electric Co., Ltd. Telephone and method of controlling telephone

Family Cites Families (149)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5305244B2 (en) 1992-04-06 1997-09-23 Computer Products & Services I Hands-free user-supported portable computer
US6061434A (en) 1996-08-14 2000-05-09 Joseph C. Corbett Video caller identification systems and methods
US6449284B1 (en) 1997-03-21 2002-09-10 Avaya Technology Corp. Methods and means for managing multimedia call flow
US6243129B1 (en) 1998-01-09 2001-06-05 8×8, Inc. System and method for videoconferencing and simultaneously viewing a supplemental video source
US6532482B1 (en) 1998-09-25 2003-03-11 Xybernaut Corporation Mobile computer with audio interrupt system
US20020040377A1 (en) 1998-09-25 2002-04-04 Newman Edward G. Computer with audio interrupt system
US6425131B2 (en) 1998-12-30 2002-07-23 At&T Corp. Method and apparatus for internet co-browsing over cable television and controlled through computer telephony
US6321080B1 (en) * 1999-03-15 2001-11-20 Lucent Technologies, Inc. Conference telephone utilizing base and handset transducers
CA2271828A1 (en) 1999-05-11 2000-11-11 Infointeractive Inc. Internet based telephone line
US7039205B1 (en) 1999-05-19 2006-05-02 Siemens Communications, Inc. Techniques for audio transducer switching under programmatic and off hook interrupt control
US6636269B1 (en) 1999-08-18 2003-10-21 Webtv Networks, Inc. Video timing system and method
US6904025B1 (en) 1999-10-12 2005-06-07 Telefonaktiebolaget Lm Ericsson (Publ) Wide area network mobility for IP based networks
US7120692B2 (en) 1999-12-02 2006-10-10 Senvid, Inc. Access and control system for network-enabled devices
US20040194146A1 (en) 2000-02-15 2004-09-30 Bates Cary Lee Set top box and methods for using the same
US6778528B1 (en) 2000-05-17 2004-08-17 Cisco Technology, Inc. Dial-out with dynamic IP address assignment
FI20001293A (en) 2000-05-30 2001-12-01 Nokia Networks Oy Transmission of IP speech in a wireless telecommunications network
US6654722B1 (en) 2000-06-19 2003-11-25 International Business Machines Corporation Voice over IP protocol based speech system
JP4543513B2 (en) 2000-07-17 2010-09-15 ソニー株式会社 Bidirectional communication system, display device, base device, and bidirectional communication method
US7126939B2 (en) 2000-07-24 2006-10-24 Nortel Networks Limited Packet-based calls in a wireless network
JP4658374B2 (en) 2001-05-10 2011-03-23 株式会社リコー Wireless communication method and master terminal thereof
US20030023730A1 (en) 2001-07-27 2003-01-30 Michael Wengrovitz Multiple host arrangement for multimedia sessions using session initiation protocol (SIP) communication
US20030058805A1 (en) 2001-09-24 2003-03-27 Teleware Inc. Multi-media communication management system with enhanced video conference services
US7031443B2 (en) 2001-11-19 2006-04-18 Inter-Tel, Inc. System and method for remote access to a telephone
US6985961B1 (en) 2001-12-04 2006-01-10 Nortel Networks Limited System for routing incoming message to various devices based on media capabilities and type of media session
US7092385B2 (en) 2002-03-12 2006-08-15 Mci, Llc Policy control and billing support for call transfer in a session initiation protocol (SIP) network
US7240214B2 (en) 2002-10-25 2007-07-03 Yahoo!, Inc. Centrally controllable instant messaging system
DE10252989A1 (en) 2002-11-14 2004-06-03 Siemens Ag Support of fax and modem in SIP / SIP-T networks and in the interworking of these networks with ISUP + / BICC
US7920690B2 (en) 2002-12-20 2011-04-05 Nortel Networks Limited Interworking of multimedia and telephony equipment
US7751546B2 (en) 2003-01-22 2010-07-06 Avaya Canada Corp. Call transfer system, method and network devices
US7549924B2 (en) 2003-05-09 2009-06-23 Microsoft Corporation Instant messaging embedded games
JP2007535193A (en) 2003-07-16 2007-11-29 スカイプ・リミテッド Peer-to-peer telephone system and method
ATE487987T1 (en) 2003-07-16 2010-11-15 Joltid Ltd DISTRIBUTED DATABASE SYSTEM
US8140980B2 (en) 2003-08-05 2012-03-20 Verizon Business Global Llc Method and system for providing conferencing services
CN1283125C (en) 2003-08-05 2006-11-01 株式会社日立制作所 Telephone communication system
JP4339056B2 (en) 2003-09-11 2009-10-07 シャープ株式会社 TV receiver, mobile phone, and TV receiver-integrated mobile phone device
US7673001B1 (en) 2003-11-21 2010-03-02 Microsoft Corporation Enterprise management of public instant message communications
KR100590867B1 (en) 2003-12-05 2006-06-19 삼성전자주식회사 Video/voice communication system and call transfer/pick-up method using thereof
US7260186B2 (en) 2004-03-23 2007-08-21 Telecommunication Systems, Inc. Solutions for voice over internet protocol (VoIP) 911 location services
US7634072B2 (en) 2004-02-13 2009-12-15 Yahoo! Inc. Integrated instant messaging, routing and telephone services billing system
US7634533B2 (en) 2004-04-30 2009-12-15 Microsoft Corporation Systems and methods for real-time audio-visual communication and data collaboration in a network conference environment
JP4505257B2 (en) 2004-05-12 2010-07-21 京セラ株式会社 Mobile phone with broadcast reception function
US20050278778A1 (en) 2004-05-28 2005-12-15 D Agostino Anthony Method and apparatus for credential management on a portable device
US7840681B2 (en) * 2004-07-30 2010-11-23 International Business Machines Corporation Method and apparatus for integrating wearable devices within a SIP infrastructure
US8499027B2 (en) 2004-09-02 2013-07-30 Gryphon Networks Corp. System and method for exchanging information with a relationship management system
US8364125B2 (en) 2004-11-09 2013-01-29 Avaya, Inc. Content delivery to a telecommunications terminal that is associated with a call in progress
JP2008521267A (en) 2004-11-15 2008-06-19 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ Method and network device for supporting user content selection
US8064968B2 (en) 2004-11-22 2011-11-22 The Invention Science Fund I Llc Transfer then sleep
US7424288B2 (en) 2004-11-22 2008-09-09 Searete Llc Call transfer to proximate devices
US7356567B2 (en) * 2004-12-30 2008-04-08 Aol Llc, A Delaware Limited Liability Company Managing instant messaging sessions on multiple devices
US7693133B2 (en) 2004-12-30 2010-04-06 Alcatel-Lucent Usa Inc. System and method for conference calling with VOIP terminal
US20060153198A1 (en) * 2005-01-10 2006-07-13 Siemens Communications, Inc. Systems and methods for uninterrupted communication sessions
FR2884023B1 (en) 2005-03-31 2011-04-22 Erocca DEVICE FOR COMMUNICATION BY PERSONS WITH DISABILITIES OF SPEECH AND / OR HEARING
US7116349B1 (en) 2005-04-04 2006-10-03 Leadtek Research Inc. Method of videophone data transmission
FI119863B (en) 2005-08-22 2009-04-15 Teliasonera Ab Verifying the authenticity and rights of the remote customer
EP1758334A1 (en) 2005-08-26 2007-02-28 Matsushita Electric Industrial Co., Ltd. Establishment of media sessions with media adaptation
CN101147343B (en) * 2005-09-28 2012-06-20 桥扬科技有限公司 Method for multi-carrier packet communication with reduced overhead
US8019279B2 (en) 2005-10-25 2011-09-13 International Business Machines Corporation System and method for using mobile phones as handsets for IP softphones
US20070115348A1 (en) 2005-10-27 2007-05-24 Cisco Technology, Inc. Method and system for automatic scheduling of a conference
US20070120949A1 (en) 2005-11-22 2007-05-31 Inventec Multimedia & Telecom Corporation Video, sound, and voice over IP integration system
US7751848B2 (en) 2005-11-23 2010-07-06 Envio Networks Inc. Systems and methods for providing concurrent mobile applications to mobile communication devices
US8544058B2 (en) 2005-12-29 2013-09-24 Nextlabs, Inc. Techniques of transforming policies to enforce control in an information management system
US20070183396A1 (en) 2006-02-07 2007-08-09 Bennett James D Set top box supporting bridging between a packet switched network and the public switched telephone network
US8619953B2 (en) 2006-03-15 2013-12-31 Polycom, Inc. Home videoconferencing system
CN101039307A (en) 2006-03-17 2007-09-19 深圳市朗科科技有限公司 Wireless network system and operation method thereof
US8102812B2 (en) 2006-03-21 2012-01-24 Motorola Mobility, Inc. Methods and apparatus for data packet transmission on a network
GB2437592A (en) 2006-04-10 2007-10-31 Skype Ltd Indicating communication events on an alternative interface whilst running an application on another interface
US7729489B2 (en) 2006-04-12 2010-06-01 Cisco Technology, Inc. Transferring a communications exchange
US7769508B2 (en) 2006-04-14 2010-08-03 Snap-On Incorporated Vehicle diagnostic tool with packet and voice over packet communications and systems incorporating such a tool
US20070263824A1 (en) 2006-04-18 2007-11-15 Cisco Technology, Inc. Network resource optimization in a video conference
US7734470B2 (en) 2006-05-22 2010-06-08 Accenture Global Services Gmbh Interactive voice response system
CN101080083A (en) 2006-05-26 2007-11-28 华为技术有限公司 A call forward method and system
US20070280200A1 (en) 2006-05-31 2007-12-06 Patel Mehul B System and method for controlling a voip client using a wireless personal-area-network enabled device
US20070286202A1 (en) 2006-06-08 2007-12-13 Latitude Broadband Global, Inc. Methods and Systems for Call Admission Control and Providing Quality of Service in Broadband Wireless Access Packet-Based Networks
WO2008015369A1 (en) 2006-08-01 2008-02-07 Nds Limited Call management
US9049253B2 (en) 2006-08-09 2015-06-02 Cisco Technology, Inc. Resetting / restarting SIP endpoint devices
US20080075240A1 (en) 2006-09-06 2008-03-27 Microsoft Corporation Consultative call transfer using non-voice consultation modes
US7934156B2 (en) * 2006-09-06 2011-04-26 Apple Inc. Deletion gestures on a portable multifunction device
US7711370B2 (en) 2006-09-20 2010-05-04 Cisco Technology, Inc. Method for establishing voice communications using a mobile handset
FR2906099A1 (en) 2006-09-20 2008-03-21 France Telecom METHOD OF TRANSFERRING AN AUDIO STREAM BETWEEN SEVERAL TERMINALS
US7861175B2 (en) 2006-09-29 2010-12-28 Research In Motion Limited IM contact list entry as a game in progress designate
US7656836B2 (en) 2006-10-05 2010-02-02 Avaya Inc. Centralized controller for distributed handling of telecommunications features
US8032764B2 (en) 2006-11-14 2011-10-04 Texas Instruments Incorporated Electronic devices, information products, processes of manufacture and apparatus for enabling code decryption in a secure mode using decryption wrappers and key programming applications, and other structures
JP2008166980A (en) 2006-12-27 2008-07-17 Funai Electric Co Ltd Television system, and remote control unit
US7958276B2 (en) 2007-01-22 2011-06-07 Counterpath Corporation Automatic configuration of peripheral devices
TWI334721B (en) 2007-01-26 2010-12-11 Asustek Comp Inc Mobile phone capable of making internet calls, system and method using the same
US8705720B2 (en) 2007-02-08 2014-04-22 Avaya Inc. System, method and apparatus for clientless two factor authentication in VoIP networks
US20080235587A1 (en) 2007-03-23 2008-09-25 Nextwave Broadband Inc. System and method for content distribution
US7609170B2 (en) 2007-03-26 2009-10-27 Jon Andrew Bickel Interactive interface within a monitoring and control device
US8045489B2 (en) 2007-03-30 2011-10-25 Cisco Technology, Inc. Method and system for the automatic configuration of conference resources
US7747010B1 (en) * 2007-04-05 2010-06-29 Avaya Inc. Telephony software client application for handling the insertion and removal of logical audio devices
US8301757B2 (en) 2007-06-11 2012-10-30 Enghouse Interactive Inc. System and method for obtaining in-use statistics for voice applications in interactive voice response systems
US7953038B2 (en) 2007-07-20 2011-05-31 Broadcom Corporation Method and system for environment configuration by a device based on auto-discovery of local resources and generating preference information for those resources
US8553623B2 (en) 2007-07-20 2013-10-08 Broadcom Corporation Method and system for utilizing standardized interface in a wireless device to discover and use local and remote resources
US20090049190A1 (en) 2007-08-16 2009-02-19 Yahoo!, Inc. Multiple points of presence in real time communications
US8644842B2 (en) 2007-09-04 2014-02-04 Nokia Corporation Personal augmented reality advertising
US20100046731A1 (en) * 2007-10-02 2010-02-25 Douglas Gisby Method, apparatus and system for use of presence and location information in intelligent call routing
US20090094684A1 (en) 2007-10-05 2009-04-09 Microsoft Corporation Relay server authentication service
KR101413563B1 (en) 2007-11-05 2014-07-04 삼성전자주식회사 A method to provide seeing and hearing information for displaying channels watched by users
US20090136016A1 (en) 2007-11-08 2009-05-28 Meelik Gornoi Transferring a communication event
US8161299B2 (en) 2007-12-20 2012-04-17 Intel Corporation Location based policy system and method for changing computing environments
US20090185792A1 (en) 2008-01-18 2009-07-23 Rutan & Tucker, LLP Digital video camcorder with wireless transmission built-in
US8447303B2 (en) 2008-02-07 2013-05-21 Research In Motion Limited Method and system for automatic seamless mobility
EP2088735A1 (en) 2008-02-11 2009-08-12 Siemens Schweiz AG Client side media splitting function
US8687626B2 (en) * 2008-03-07 2014-04-01 CenturyLink Intellectual Property, LLC System and method for remote home monitoring utilizing a VoIP phone
US20090238170A1 (en) 2008-03-19 2009-09-24 Rajan Muralidhar Method and system for providing voice over ip (voip) to wireless mobile communication devices
CN101242663B (en) 2008-03-20 2012-04-04 华为技术有限公司 Switching method, system and device for call between mobile terminal and soft terminal with same number
US20090282130A1 (en) 2008-05-12 2009-11-12 Nokia Corporation Resource sharing via close-proximity wireless communication
US20100008523A1 (en) 2008-07-14 2010-01-14 Sony Ericsson Mobile Communications Ab Handheld Devices Including Selectively Enabled Audio Transducers
US8203977B2 (en) 2008-07-28 2012-06-19 Broadcom Corporation Method and system for half duplex audio in a bluetooth stereo headset
US8112037B2 (en) 2008-09-02 2012-02-07 Nissaf Ketari Bluetooth assistant
GB2463109B (en) 2008-09-05 2013-03-13 Skype Communication system and method
GB2463108B (en) 2008-09-05 2012-08-29 Skype Communication system and method
GB2463105A (en) 2008-09-05 2010-03-10 Skype Ltd Viewer activity dependent video telephone call ringing
GB2463103A (en) 2008-09-05 2010-03-10 Skype Ltd Video telephone call using a television receiver
GB2463110B (en) 2008-09-05 2013-01-16 Skype Communication system and method
GB2463124B (en) 2008-09-05 2012-06-20 Skype Ltd A peripheral device for communication over a communications sytem
GB2463104A (en) 2008-09-05 2010-03-10 Skype Ltd Thumbnail selection of telephone contact using zooming
GB2463107A (en) 2008-09-05 2010-03-10 Skype Ltd A remote control unit of a media device for placing/receiving calls, comprising activating one of the two wireless transceivers when needed.
KR101229034B1 (en) 2008-09-10 2013-02-01 성준형 Multimodal unification of articulation for device interfacing
US8339438B2 (en) 2008-12-24 2012-12-25 Rockstar Consortium Us Lp Web based access to video associated with calls
US8249056B2 (en) 2009-03-12 2012-08-21 At&T Intellectual Property I, L.P. Converged telephone number mapping for call completion among hybrid communication services
KR101510723B1 (en) 2009-04-20 2015-04-20 삼성전자주식회사 Mobile terminal having projector and method for displaying data thereof
WO2011022662A2 (en) * 2009-08-21 2011-02-24 Genband Us Llc Systems, methods, and computer readable media for selecting an optimal media-adaptation resource for latency-sensitive applications
GB2475237B (en) 2009-11-09 2016-01-06 Skype Apparatus and method for controlling communication signalling and media
GB2475236A (en) 2009-11-09 2011-05-18 Skype Ltd Authentication arrangement for a packet-based communication system covering public and private networks
GB2476077A (en) 2009-12-10 2011-06-15 Skype Ltd Estimating VoIP call Quality before a call is set up
US8446453B2 (en) 2010-01-06 2013-05-21 Cisco Technology, Inc. Efficient and on demand convergence of audio and non-audio portions of a communication session for phones
US9043474B2 (en) 2010-01-20 2015-05-26 Microsoft Technology Licensing, Llc Communication sessions among devices and interfaces with mixed capabilities
EP2355474B1 (en) * 2010-01-21 2018-09-05 BlackBerry Limited Transfer of telephony functions associated with a wireless handheld telephony device to another telephony device
US8797999B2 (en) * 2010-03-10 2014-08-05 Apple Inc. Dynamically adjustable communications services and communications links
US8547877B2 (en) 2010-03-30 2013-10-01 Telefonaktiebolaget L M Ericsson (Publ) RSTP tracking
GB201005386D0 (en) 2010-03-31 2010-05-12 Skype Ltd Communication using a user terminal
US20110242268A1 (en) 2010-03-31 2011-10-06 Jin Kim Television Appliance
GB2479180B (en) 2010-03-31 2016-06-01 Skype System of user devices
GB201005454D0 (en) 2010-03-31 2010-05-19 Skype Ltd Television apparatus
GB201005465D0 (en) 2010-03-31 2010-05-19 Skype Ltd Television set
GB201005458D0 (en) 2010-03-31 2010-05-19 Skype Ltd Media appliance
GB201006796D0 (en) 2010-04-23 2010-06-09 Skype Ltd Viewing apparatus
US9210528B2 (en) 2010-07-21 2015-12-08 Tksn Holdings, Llc System and method for control and management of resources for consumers of information
US10187509B2 (en) 2010-09-14 2019-01-22 At&T Intellectual Property I, L.P. Enhanced video sharing
US8730294B2 (en) 2010-10-05 2014-05-20 At&T Intellectual Property I, Lp Internet protocol television audio and video calling
US9143533B2 (en) 2010-10-12 2015-09-22 Skype Integrating communications
US8698843B2 (en) * 2010-11-02 2014-04-15 Google Inc. Range of focus in an augmented reality application
US8451315B2 (en) 2010-11-30 2013-05-28 Hewlett-Packard Development Company, L.P. System and method for distributed meeting capture
US9717090B2 (en) 2010-12-31 2017-07-25 Microsoft Technology Licensing, Llc Providing notifications of call-related services
US8963982B2 (en) 2010-12-31 2015-02-24 Skype Communication system and method
US10404762B2 (en) 2010-12-31 2019-09-03 Skype Communication system and method
US9019336B2 (en) 2011-12-30 2015-04-28 Skype Making calls using an additional terminal
US9258172B2 (en) 2012-10-24 2016-02-09 Microsoft Technology Licensing, Llc Calling an unready terminal

Also Published As

Publication number Publication date
EP2649753A1 (en) 2013-10-16
EP2649753B1 (en) 2018-05-16
CN103416023A (en) 2013-11-27
CN103416023B (en) 2019-04-09
US10291660B2 (en) 2019-05-14
US20120207147A1 (en) 2012-08-16

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11802455

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2011802455

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE