WO2009030128A1 - A method and media server of obtaining the present active speaker in conference - Google Patents

A method and media server of obtaining the present active speaker in conference Download PDF

Info

Publication number
WO2009030128A1
WO2009030128A1 PCT/CN2008/071438 CN2008071438W WO2009030128A1 WO 2009030128 A1 WO2009030128 A1 WO 2009030128A1 CN 2008071438 W CN2008071438 W CN 2008071438W WO 2009030128 A1 WO2009030128 A1 WO 2009030128A1
Authority
WO
WIPO (PCT)
Prior art keywords
speaker
media
current
information
user
Prior art date
Application number
PCT/CN2008/071438
Other languages
French (fr)
Chinese (zh)
Inventor
Jiangping Feng
Yan Li
Xiangwei Liu
Liping Gao
Original Assignee
Huawei Technologies Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co., Ltd. filed Critical Huawei Technologies Co., Ltd.
Publication of WO2009030128A1 publication Critical patent/WO2009030128A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/56Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities

Definitions

  • the present invention relates to the field of multimedia communication technologies, and in particular, to a method for obtaining an event speaker in a conference, a media server, a media controller, and a media processor.
  • the media server device can be functionally divided into two functional entities: a media controller (MC) and a media processor (MP).
  • the physical device corresponding to the MC is the Media Gateway Controller (MGC), which is mainly used to complete the management of the conference control, and implement the functions of resource management and access control for the MP device.
  • the physical device corresponding to the MP is a media gateway (MG, Media Gateway), which is mainly used to complete the collection and distribution of conference audio, video and data media, and to implement audio mixing, video picture switching, codec and rate adaptation, and the like. Handle related functions.
  • the interface protocol between MC and MP is the H.248 protocol.
  • the H.248 protocol is the Media Gateway Control Protocol H, which is used to separate call control from media conversion.
  • the H.248 protocol describes the link between media control and the media processor.
  • the media gateway controller controls and manages the media gateway and the H.248 endpoint through the protocol, and the media gateway also converts different types of access modes into RTP/RTCP streams suitable for transmission over the IP network, thereby implementing a packet network. Interoperate with the PSTN network.
  • Termination and Context There are two basic components in the H.248 protocol: Termination and Context.
  • the various resources on the media processor are abstractly represented as endpoints.
  • the combination between endpoints is abstracted as a context.
  • the context can contain multiple endpoints, and thus the topology is used to describe the interrelationship between the endpoints.
  • the connection of the call is actually the operation of the endpoint and context. This operation is done by commands (Request) and Request (Reply) between the MC and the MP.
  • the parameters carried by the command also called descriptors, are classified into categories: Property, Signal, Event, and Statistic. The parameters with business relevance are logically aggregated into a package.
  • Meetings can be divided into audio conferencing, video conferencing, and data conferencing, depending on the media used for the meeting.
  • the endpoint state is referred to as an active speaking state, and the user is referred to as an active speaker.
  • the MP will perform the corresponding mixing processing on the audio from multiple endpoints according to the mixing method specified by the MC, and then Outputs the mixed audio stream to the specified endpoint, and the specified endpoint hears the mix audio stream of the currently active speaker.
  • the MP has the ability to detect the event spokesperson, the MC cannot obtain the information of the current event spokesperson from the MP, and therefore cannot display whether the user is the current active spokesperson in the user terminal.
  • the MC cannot obtain the current event speaker information from the MP.
  • the technical problem to be solved by the embodiments of the present invention is to provide a method for obtaining a current active speaker in a conference, a media server, a media controller, and a media processor, so that the media controller can obtain an event speaker in the current conference.
  • the embodiment of the present invention provides a method for obtaining a current active speaker in a conference, including: the media controller instructs the media processor to detect and report The information about the current speaker is received by the media controller; the media controller receives the information related to the current speaker detected and reported by the media processor; and the media controller acquires the event speaker information in the current conference according to the information related to the current speaker.
  • An embodiment of the present invention further provides a media server, including a media controller and a media processor connected to the media controller.
  • the media controller is configured to instruct the media processor to detect and report information related to the current speaker, and obtain event speaker information in the current conference according to the information about the current speaker reported by the media processor;
  • the media processor is configured to detect and report information related to the current speaker to the media controller according to the indication of detecting and reporting information related to the current speaker.
  • the embodiment of the invention further provides a media controller, including:
  • a sending unit configured to send an indication that is required to detect and report information related to the current speaker
  • a receiving unit configured to receive information related to the current speaker
  • a parsing unit configured to parse the information about the current speaker received by the receiving unit, and obtain event speaker information in the current conference.
  • the embodiment of the invention further provides a media processor, including:
  • a receiving unit configured to receive an indication that the user is required to detect and report information related to the current speaker
  • a detecting unit configured to detect, according to the indication received by the receiving unit, information related to the current speaker
  • a sending unit configured to send information about the user detected by the detecting unit to the current speaker to the media controller.
  • the media controller instructs the media processor to detect and report information related to the current speaker, and the media controller is configured according to The user is related to the current speaker, and obtains an event speaker in the current meeting.
  • FIG. 1 is a signaling flowchart of a first embodiment of a method for acquiring a current active speaker in a conference according to the present invention
  • FIG. 2 is a signaling flowchart of a second embodiment of a method for acquiring a current active speaker in a conference according to the present invention
  • FIG. 4 is a structural diagram of an embodiment of the media server of the present invention.
  • Embodiments of the present invention provide a method for obtaining a current active speaker in a conference, a media server, a media controller, and a media processor.
  • a media server a media server
  • a media controller a media controller
  • a media processor a media processor
  • FIG. 1 is a flow chart of a signal of a first embodiment of a method for obtaining a current active speaker in a conference of the present invention.
  • the conference in this embodiment is an audio conference, and the media processor reports the current active speaker list of each user endpoint in the conference.
  • Step 101 The media controller instructs the media processor to detect and report information related to the current speaker.
  • the information related to the current speaker may be a list of currently active speakers for each user endpoint, including other user endpoint identities that are heard by the user endpoint in the current participating conference, or wildcards used to represent the user's punctuation;
  • the information related to the current speaker may also be a list of listeners of the current event speaker, including the user endpoint identifier of the current active speaker's audience or a wildcard used to indicate the client's punctuation;
  • the information related to the current speaker may also be user information in a speaking state, including a user endpoint identifier in an uttered state or a wildcard character used to represent the user's punctuation.
  • the information related to the current speaker is a list of currently active speakers for each user endpoint.
  • the media controller can instruct the media processor to periodically report a list of currently active speakers for each user endpoint.
  • the media controller can also instruct the media processor to report the current activity of each user endpoint that has changed. List of speakers.
  • the media controller can also instruct the media processor to detect and periodically report the list of currently active speakers for each user endpoint that has changed. If the active speaker list of a user endpoint does not change, even if the reporting interval is reached, the media processor does not report the active speaker list of the user endpoint.
  • the essence of the change in the list of active speakers is that the content of the mix list changes, and the situation that causes the change is: the user changes from speaking to stopping, or the user changes from not speaking to starting to speak, or the volume of the user speaking is increased or decreased.
  • the media controller instructs the media processor to add the specified user endpoints A, B, C, and D to the conference context by adding an ADD. Request command, and simultaneously instructs the media processor to detect the user endpoints A, B. , C, D, the list of active speakers and the time interval for periodic reporting.
  • Step 102 The media processor detects information related to the current speaker
  • the media processor detects a current active speaker list of each user endpoint. After receiving the ADD.Request command of the media controller, the media processor returns an ADD.Reply message to the media controller, and starts detecting the active speaker list of the user endpoints A, B, C, and D. Step 103: Report the information related to the current speaker to the media controller. In this embodiment, the media controller instructs the media processor to periodically report the current active speaker list of each user endpoint that has changed.
  • the media processor reports the current active speaker list of the changed user endpoint according to the reporting time interval carried in the indication; if the indication does not carry the reporting time interval information
  • the reporting time interval may be set in the media processor, and the media processor reports the current active speaker list of the changed user endpoint according to the reporting interval set by the media processor.
  • the endpoints 8, B, C, and D representing the user participate in the conference
  • the media controller sets the number of mixes n of endpoints A, B, C, and D to be 2, 3, 3, and 1, respectively.
  • the current speaker is endpoints A, B, D
  • the volume is sorted as endpoints A, B, and D because the active speaking state endpoint itself does not participate in the mix.
  • endpoint A hears the speech of endpoint B, and the list of reported active speakers includes endpoint B; endpoint B hears the endpoints A and D, and reports
  • the event speaker list contains endpoints A and D; endpoint C hears the mix of endpoints A, B, and D.
  • the reported event speaker list contains endpoints A, B, and D; endpoint D can only hear The speech of the endpoint A with the highest volume, the endpoint A of the reported event speaker list.
  • Step 104 The media controller acquires event speaker information in the current conference according to the information about the current speaker.
  • the media controller after receiving the Notify.Request request reported by the media processor, the media controller returns a notification response (Notify.Reply) message to the media processor, and performs a speech according to the reported user endpoints eight, B, C, and D. For the list of people, get the spokesperson for the event in the current meeting as endpoints eight, B, and D.
  • the event speaker in the current meeting After the event speaker in the current meeting is obtained, the event speaker in the current meeting can be displayed or processed in the user terminal.
  • the media controller can notify the user server through the conference event defined in RFC4722
  • AS Application Server
  • AS the AS is responsible for controlling the user endpoints A, B, C, D to show which current active speakers are.
  • the event spokesperson itself can also be displayed.
  • the user terminal represented by endpoint A can display endpoints A and B as current active speakers in the conference.
  • the existing Multipoint Conference Control Protocol H.248 defines the following packages for audio conferencing: Volume Control Package):
  • the volume control package is used to define a feature to set the volume of media received from the participant.
  • the MC can instruct the MP to set the volume level of a certain participant by the volume level threshold.
  • the MP can be mixed using the volume level.
  • the volume detection package defines an event that indicates that the participant's volume exceeds the threshold.
  • the MC can specify the volume level threshold through the volume detection packet. When the audio volume level exceeds the threshold, the MP should send a Notify message to the MC.
  • volume Level Mixing Package An algorithm that defines the endpoint for audio mixing.
  • the parameters include the mix volume level and the maximum mix.
  • the MC instructs the MP to set the mix volume level for a particular conference. When the volume of a participant exceeds or equals the volume level, the media stream sent by the participant will be used for the mix. When the volume of a participant is below the threshold, the media stream sent by the participant will not be used for the mix.
  • the MC instructs the MP to set the maximum number of mixes n for a particular conference. This parameter is assigned to all participants who are willing to hear the first n loudest speakers. At this time, the MP will mix the voices of the speakers with the highest volume of the first n volumes whose volume is greater than or equal to the threshold, and output the mixed sounds to the respective relevant endpoints.
  • Each of the endpoints in the conference can set different mix volume level and mix value.
  • the mixing strategy is the same.
  • the mixing volume level of each endpoint is different, the number of mixing squares heard is not the same. At the same time, it is called a mix strategy.
  • the Mixing Volume Level Control Package defines a set of attributes that control the volume level of each participant to specify a participant output according to a certain mixing algorithm. Remix media.
  • the event packet of the current active speaker list is reported in the H.248 protocol, and the event packet indicates that the MP reports the current user endpoints. List of event speakers.
  • the event package may further include an optional reporting time interval.
  • the media processor reports the current active speaker list of each user endpoint according to the reporting interval in the event packet; when there is no reporting interval in the event packet, the media processor The current active speaker list of each user endpoint can be reported according to the reporting interval set in the media processor.
  • the event spit list may further include an indication that the media processor is required to report the change of the current active speaker list of each user endpoint.
  • the media processor may report the current active speaker list of each user endpoint that changes according to the indication information.
  • the list of active speakers is the list of users currently participating in the mix, and the list contents are the endpoint IDs indicating the currently active speakers.
  • FIG. 2 it is a flow chart of a signal of a second embodiment of a method for displaying a current active speaker in a conference of the present invention.
  • the conference in this embodiment is an audio conference, and the media processor reports the audience list of the currently active speaker in the conference.
  • the specific process includes:
  • Step 201 The media controller instructs the media processor to detect and report information related to the current speaker.
  • the information related to the current speaker is a listener list of the currently active speaker, and the listener list of the current active speaker includes other user endpoint identifiers that hear the user's voice or wildcards used to indicate the user's punctuation.
  • the media controller can instruct the media processor to periodically report the listener list of the currently active speaker. Since each user endpoint separately reports the audience list of the event speaker, there is a large amount of report information. To reduce the amount of reported data, the media controller can instruct the media processor to report the listener list of the current active speaker who has changed.
  • the media controller can also instruct the media processor to detect and periodically report a list of listeners of the currently active speaker who have changed. If the audience list of an event spokesperson has not changed, even if the escalation time is reached, the media processor does not report the audience list of the event spokesperson.
  • the media controller instructs the media processor to add the specified user endpoints A, B, C, and D to the conference context by adding an ADD. Request command, and simultaneously instructs the media processor to detect the user endpoints A, B. , C, D event speaker list and periodic reporting interval.
  • Step 202 The media processor detects information related to the current speaker
  • the media controller indicates a list of listeners of the current active speaker whose media processor cycle has changed.
  • Step 203 Report the information related to the current speaker to the media controller.
  • the media controller indicates the listener list of the current active speaker whose media processor cycle changes.
  • the media processor reports the listener list of the currently active speaker that has changed according to the reporting time interval carried in the indication; if the reporting time interval information is not carried in the indication, The reporting time interval can be set in the media processor, and the media processor reports the current active speaker list of the changed user endpoint according to the reporting interval set by itself.
  • the media controller sets the endpoint A,
  • the number of mixes of B, C, and D are 2, 3, 3, and 1, respectively.
  • A, B, D because the active speech state endpoint itself does not participate in the mix, so according to the number of mixes set by each endpoint, the speech of endpoint A is heard by endpoints B, C, D, and the listener of endpoint A is reported.
  • the list contains endpoints B, C, D; the endpoint B's speech is heard by endpoints A, C, the reported endpoint B's listener list contains endpoints A, C; endpoint C does not speak, will not be heard, no listeners; endpoint D
  • the speech is heard by endpoints B, C, and the listener list of endpoint D is reported to contain endpoints B and C.
  • Step 204 The media controller acquires event speaker information in the current conference according to the information about the current speaker.
  • the media controller After receiving the Notify.Request request reported by the media processor, the media controller returns a notification response (Notify.Reply) message to the media processor, and the media controller obtains the current active speaker endpoint VIII, B,
  • endpoint A appears in the endpoint B array, indicating that endpoint A is currently listening to endpoint B speaking; endpoint B appears in the array of endpoints A and D, indicating that endpoint B is currently hearing endpoint eight D speaks; endpoint C appears in all arrays, indicating that endpoint C is currently listening to endpoints A, B, and D; endpoint D appears in the endpoint A array, indicating that endpoint D is currently hearing endpoint A speaking, and the media controller gets current The user represented by endpoints A, B, and D is speaking, that is, the current active speaker is the endpoint,
  • the terminal After getting the event spokesperson in the current meeting, you can put the event spokesperson in the current meeting in the user.
  • the terminal performs display or performs other processing.
  • the media controller can notify the AS through the conference event defined in RFC4722, and the AS is responsible for controlling the user endpoints eight, B, C, and D to show which current active speakers are.
  • the event spokesperson itself can also be displayed.
  • the user terminal represented by endpoint A can display endpoints A, B, and D as the current active speaker in the conference.
  • the listener list event package of the current active speaker is reported in the H.248 protocol, and the event packet indicates that the MP reports the current active speaker. Listener list.
  • the event package may further include an optional reporting interval.
  • the media processor reports the listener list of the currently active speaker according to the reporting interval in the event packet; when there is no reporting interval in the event packet, the media processor may The listener list of the currently active speaker is reported according to the reporting interval set in the media processor.
  • the event package may also include indication information that requires the media processor to report a change in the list of pre-active speaker listeners.
  • the media processor can report the listener list of the current active speaker that has changed according to the indication information.
  • FIG. 3 is a signaling flow diagram of a third embodiment of a method for acquiring a current active speaker according to the present invention.
  • the conference in this embodiment is an audio conference, and the media processor reports a single user information in a conference state in the conference.
  • the specific process includes:
  • Step 301 The media controller instructs the media processor to detect and report information related to the current speaker.
  • the information related to the current speaker is the user information in the speaking state.
  • the indication may also carry a volume level threshold.
  • the media controller instructs the media processor to detect and report the user information in the speaking state.
  • the specific manner is that the media controller instructs the media processor to report the volume is greater than or equal to the level. The value of the user's information.
  • the media controller may also instruct the media processor to periodically report information of the user who is in the active speaking state, that is, the volume is greater than or equal to the threshold. Since each user endpoint separately reports the audience list of the event speaker, there is a large amount of report information. To reduce the amount of reported data, the media controller can instruct the media processor to report the changed user information in the active speaking state.
  • the media controller can also instruct the media processor to detect and periodically report the changed user information in the active speaking state. If the user information in the active speaking state has not changed, the media processor does not report the user information even if the reporting time interval is reached.
  • the user information includes a user endpoint identifier in a speaking state or a wildcard character used to represent the client endpoint.
  • the media controller instructs the media processor to report the current active speaker at a single location.
  • the media controller uses the ADD.Request command to instruct the media processor to add the endpoints A, B, C, and D to the conference context, and instruct the media processor to detect the user represented by the endpoints A, B, C, and D.
  • the state changes and the volume level is wide.
  • Step 302 The media processor detects a current speaking state of the user.
  • the media processor After receiving the ADD.Request command of the media controller, the media processor returns an ADD.Reply message to the media controller, and starts detecting the speaking states of the user endpoints A, B, C, and D, That is, the volume level value.
  • Step 303 Report the information related to the current speaker to the media controller.
  • the media controller instructs the media processor to periodically report the user information in the speaking state.
  • the media processor performs the user information in the speaking state according to the upper time interval carried in the indication; if the reporting time interval information is not carried in the indication, When the reporting time interval is set in the media processor, the media processor reports the user information in the speaking state according to the reporting interval set by the media processor.
  • endpoint A speaks first, the volume exceeds the threshold, the media processor reports endpoint A to the media controller, endpoint D then speaks, and the volume exceeds the threshold, and the media processor reports endpoint D to the media controller.
  • the media controller retains endpoint D.
  • the user represented by multiple endpoints becomes the active speaker and the volume is greater than or equal to the volume level threshold, the user representing the endpoint with the highest volume is reported to the MC.
  • endpoints A and D speak at the same time, the volume exceeds the threshold, and the endpoint D The volume is greater than the volume of endpoint A, and the media processor reports endpoint D to the media controller.
  • Step 304 The media controller acquires event speaker information in the current conference according to the user information in the speaking state.
  • the event speaker in the current meeting After the event speaker in the current meeting is obtained, the event speaker in the current meeting can be displayed or processed in the user terminal.
  • the media controller can also notify the AS through the conference event defined in RFC4722, and the AS is responsible for controlling the user endpoints A, B, C, and D to show which current active speakers are.
  • endpoint D can display endpoint D as the current active speaker in the conference.
  • the H.248 protocol adds a user information event packet that is reported in the speaking state, and the event packet indicates the user who is in the speaking state on the MP. information.
  • the event packet may further include an optional reporting time interval.
  • the media processor reports the user information in the speaking state according to the reporting interval in the event packet; if there is no reporting interval in the event packet, if the media processor is in the media processor
  • the reporting interval is set, the user information in the speaking state is reported according to the reporting interval set in the media processor.
  • the event package may further include user information in the speaking state that requires the media processor to change.
  • the media processor may report the changed user information in the speaking state according to the indication information.
  • the user information in the speaking state is the endpoint identifier indicating the currently active speaker.
  • the method for displaying the current active speaker in the conference provided by the embodiment of the present invention can not only enable the media controller to obtain the current event speaker in the audio conference, but also can be applied to other types of multimedia conferences.
  • the media controller can instruct the media processor to report the user information in the speaking state, and obtain the active speech in the current conference according to the user information in the speaking state. people.
  • FIG. 4 is a structural diagram of an embodiment of a media server according to the present invention.
  • the media server A media controller 40, and a media processor 41 coupled to the media controller 40 are included.
  • the media controller 40 is configured to instruct the media processor 41 to detect and report information related to the current speaker, and obtain an event speaker in the current conference according to the information about the current speaker reported by the media processor 41. information;
  • the media processor 41 is configured to detect and report information related to the current speaker to the media controller 40 according to the indication sent by the media controller 40 to detect and report information related to the current speaker.
  • the media controller 40 includes a transmitting unit 401, a receiving unit 402, and a parsing unit 403.
  • the sending unit 401 is configured to send an indication that the information related to the current speaker is required to be detected and reported to the media processor 41.
  • the receiving unit 402 is configured to receive information about the current speaker reported by the media processor 41.
  • the parsing unit 403 is configured to parse the information about the current speaker received by the receiving unit 402, and obtain event speaker information in the current conference.
  • the media controller 40 may further include a first transmission control unit, configured to control the sending unit 401 to send an indication that the information related to the current speaker is required to be detected and periodically reported.
  • the media controller 40 may further include a second transmission control unit for controlling the sending unit 401 to send an indication that the information related to the current speaker is required to be detected and reported.
  • the media controller 40 may further include a third transmission control unit for controlling the sending unit 401 to send an indication of information related to the current speaker that is required to detect and periodically report the change.
  • the information related to the current speaker may be a current active speaker list of each user endpoint, a listener list of the current active speaker, and user information in a speaking state.
  • the indication sent by the sending unit 401 may carry the reporting time interval information, which is used to instruct the media processor 41 to periodically report information related to the current speaker.
  • the media processor 41 includes a receiving unit 411, a detecting unit 412, and a transmitting unit 413.
  • the receiving unit 411 is configured to receive an indication sent by the media controller 40 to detect and report information related to the current speaker.
  • the detecting unit 412 is configured to detect according to the indication received by the receiving unit 411.
  • the information about the current speaker is sent to the media controller 40 by the sending unit 413 for transmitting the information about the current speaker detected by the detecting unit 412.
  • the media processor 41 may further include a monitoring unit configured to monitor whether information related to the current speaker changes, and if so, the control sending unit 413 sends the detecting unit 412 to detect The information related to the current speaker is given to the media controller 40.
  • the media processor 41 may further include a periodic control unit for controlling the sending unit 413 to periodically send the information about the current speaker detected by the detecting unit 412 to the media controller 40.
  • the media controller instructing the media processor to report the information about the user and the current speaker, and according to the user and the user
  • the information about the current speaker gets the event spokesperson in the current meeting, so the media controller can obtain the current event spokesperson in the meeting, and the obtained event spokesperson in the current meeting can display or perform other processing in the user terminal.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)

Abstract

A method and media server of obtaining the present speaker in conference.The method includes the following steps : the media controller indicates the media processor to detect and up-report the information about the present speaker; the media controller receives the information about the present speaker detected and up-reported by the media processor; the media controller acquires the active speaker in the present conference according to the user's information associated with the present speaker.

Description

会议中获取当前活动发言人的方法、 媒体服务器  Method for obtaining the current event spokesperson in the conference, media server
本申请要求于 2007 年 9 月 7 日提交中国专利局、 申请号为 200710146076.5、 发明名称为"会议中获取当前活动发言人的方法、 媒体服务 器"的中国专利申请的优先权, 其全部内容通过引用结合在本申请中。  This application claims the priority of the Chinese Patent Application filed on September 7, 2007, the Chinese Patent Office, Application No. 200710146076.5, entitled "Method for obtaining the current event spokesperson, media server" in the conference, the entire contents of which are incorporated by reference. Combined in this application.
技术领域 Technical field
本发明涉及多媒体通信技术领域,特别涉及一种会议中获取活动发言人的 方法、 媒体服务器、 媒体控制器、 媒体处理器。  The present invention relates to the field of multimedia communication technologies, and in particular, to a method for obtaining an event speaker in a conference, a media server, a media controller, and a media processor.
背景技术 Background technique
媒体服务器设备从功能上可以分为媒体控制器 (MC, Media Controller)和 媒体处理器 (MP, Media Processor)两个功能实体。 MC对应的物理设备是媒体网 关控制器 (MGC, Media Gateway Controller), 主要用于完成会议控制的管理、 以及实现对 MP设备的资源管理和接入控制等功能。 MP对应的物理设备是媒体 网关 (MG, Media Gateway),主要用于完成会议音频、视频和数据媒体的汇集和 分发, 以及实现音频混音、 视频画面切换、 编解码和速率适配等与媒体处理相 关的功能。  The media server device can be functionally divided into two functional entities: a media controller (MC) and a media processor (MP). The physical device corresponding to the MC is the Media Gateway Controller (MGC), which is mainly used to complete the management of the conference control, and implement the functions of resource management and access control for the MP device. The physical device corresponding to the MP is a media gateway (MG, Media Gateway), which is mainly used to complete the collection and distribution of conference audio, video and data media, and to implement audio mixing, video picture switching, codec and rate adaptation, and the like. Handle related functions.
MC和 MP之间的接口协议是 H.248协议。 H.248协议为媒体网关控制协议 H, 用于把呼叫控制从媒体转换中分离出来。 H.248协议说明了媒体控制和媒 体处理器的联系。媒体网关控制器通过该协议对媒体网关及 H.248端点进行控 制和管理,媒体网关也通过该协议将不同类型的接入方式转化为适合在 IP 网 络上传送的 RTP/RTCP 流, 实现分组网络和 PSTN 网络的业务互通。 H.248 协议中有两个基本组成部分: 端点 (Termination)和上下文 (Context)。 媒体处理 器上的各种资源被抽象表示为端点。端点之间的组合被抽象表示为上下文。上 下文可以包含多个端点, 因而以拓朴 (Topology)来描述端点间的相互关系。 基 于协议的这种抽象模型, 呼叫的接续实际上就是对端点和上下文的操作。这种 操作通过 MC和 MP之间的命令 (Command), 请求 (Request)和响应 (Reply)来完 成。 命令所携带的参数, 也称为描述符 (Descriptor), 被划分为属性 (Property), 信号 (Signal), 事件 (Event)、 统计 (Statistic)等类别。 具有业务相关性的参数逻辑 上聚合成为包 ( Package)。  The interface protocol between MC and MP is the H.248 protocol. The H.248 protocol is the Media Gateway Control Protocol H, which is used to separate call control from media conversion. The H.248 protocol describes the link between media control and the media processor. The media gateway controller controls and manages the media gateway and the H.248 endpoint through the protocol, and the media gateway also converts different types of access modes into RTP/RTCP streams suitable for transmission over the IP network, thereby implementing a packet network. Interoperate with the PSTN network. There are two basic components in the H.248 protocol: Termination and Context. The various resources on the media processor are abstractly represented as endpoints. The combination between endpoints is abstracted as a context. The context can contain multiple endpoints, and thus the topology is used to describe the interrelationship between the endpoints. Based on this abstract model of the protocol, the connection of the call is actually the operation of the endpoint and context. This operation is done by commands (Request) and Request (Reply) between the MC and the MP. The parameters carried by the command, also called descriptors, are classified into categories: Property, Signal, Event, and Statistic. The parameters with business relevance are logically aggregated into a package.
根据会议使用媒体的不同, 会议可分为音频会议、 视频会议和数据会议。 在音频会议业务中,当代表用户的端点状态由不发言转为发言并被混音器 混音时, 端点状态被称为活动发言状态, 该用户称为活动发言人。 当会议发言 人数大于 1时, 与会者应该能同时听到几个发言人的声音, 此时 MP就要根据 MC指定的混音方法, 将来自多个端点的音频进行相应的混音处理, 然后将混 音后的音频流输出到指定端点,指定端点听到的是当前活动发言人的混音音频 流。 Meetings can be divided into audio conferencing, video conferencing, and data conferencing, depending on the media used for the meeting. In the audio conference service, when the state of the endpoint representing the user is changed from no speaking to speaking and being mixed by the mixer, the endpoint state is referred to as an active speaking state, and the user is referred to as an active speaker. When the number of conference speakers is greater than 1, the participants should be able to hear the voices of several speakers at the same time. At this time, the MP will perform the corresponding mixing processing on the audio from multiple endpoints according to the mixing method specified by the MC, and then Outputs the mixed audio stream to the specified endpoint, and the specified endpoint hears the mix audio stream of the currently active speaker.
在包含音频媒体的会议中,可能需要在用户终端显示该用户是否为当前活 动发言人, 便于与会者明确当前发言人都是哪些用户。 在现有技术中, MP虽 然具备检测活动发言人的能力, 但是 MC无法从 MP获取当前活动发言人的信 息, 因此无法在用户终端显示该用户是否为当前活动发言人。  In a conference that includes audio media, it may be necessary to display at the user terminal whether the user is a current event speaker, so that the participant can know which users are currently the speaker. In the prior art, although the MP has the ability to detect the event spokesperson, the MC cannot obtain the information of the current event spokesperson from the MP, and therefore cannot display whether the user is the current active spokesperson in the user terminal.
因此,在进行本发明创造过程中,发明人发现现有技术中至少存在如下问 题: MC无法从 MP处获取当前的活动发言人信息。  Therefore, in carrying out the creation process of the present invention, the inventors found that at least the following problems exist in the prior art: The MC cannot obtain the current event speaker information from the MP.
发明内容 Summary of the invention
本发明实施例要解决的技术问题为提供一种会议中获取当前活动发言人 的方法、 媒体服务器、 媒体控制器、 媒体处理器, 使媒体控制器能够获取当前 会议中的活动发言人。  The technical problem to be solved by the embodiments of the present invention is to provide a method for obtaining a current active speaker in a conference, a media server, a media controller, and a media processor, so that the media controller can obtain an event speaker in the current conference.
为解决上述技术问题, 本发明实施例的目的是通过以下技术方案实现的: 本发明实施例提供一种会议中获取当前活动发言人的方法, 包括: 媒体控制器指示媒体处理器检测并上报与当前发言人有关的信息; 媒体控制器接收媒体处理器检测并上报的与当前发言人有关的信息; 媒体控制器根据所述与当前发言人有关的信息,获取当前会议中的活动发 言人信息。  In order to solve the above technical problem, the object of the present invention is achieved by the following technical solutions: The embodiment of the present invention provides a method for obtaining a current active speaker in a conference, including: the media controller instructs the media processor to detect and report The information about the current speaker is received by the media controller; the media controller receives the information related to the current speaker detected and reported by the media processor; and the media controller acquires the event speaker information in the current conference according to the information related to the current speaker.
本发明实施例还提供一种媒体服务器, 包括媒体控制器、 与媒体控制器相 连的媒体处理器,  An embodiment of the present invention further provides a media server, including a media controller and a media processor connected to the media controller.
所述媒体控制器,用于指示媒体处理器检测并上报与当前发言人有关的信 息, 并根据所述媒体处理器上报的与当前发言人有关的信息, 获取当前会议中 的活动发言人信息;  The media controller is configured to instruct the media processor to detect and report information related to the current speaker, and obtain event speaker information in the current conference according to the information about the current speaker reported by the media processor;
所述媒体处理器,用于根据所述检测并上报与当前发言人有关的信息的指 示, 检测并上报与当前发言人有关的信息给媒体控制器。 本发明实施例还提供一种媒体控制器, 包括: The media processor is configured to detect and report information related to the current speaker to the media controller according to the indication of detecting and reporting information related to the current speaker. The embodiment of the invention further provides a media controller, including:
发送单元, 用于发送要求检测并上报与当前发言人有关的信息的指示; 接收单元, 用于接收与当前发言人有关的信息;  a sending unit, configured to send an indication that is required to detect and report information related to the current speaker; and a receiving unit, configured to receive information related to the current speaker;
解析单元, 用于解析所述接收单元接收到的与当前发言人有关的信息, 获 取当前会议中的活动发言人信息。  And a parsing unit, configured to parse the information about the current speaker received by the receiving unit, and obtain event speaker information in the current conference.
本发明实施例还提供一种媒体处理器, 包括:  The embodiment of the invention further provides a media processor, including:
接收单元, 用于接收要求检测并上报用户与当前发言人有关的信息的指 示;  a receiving unit, configured to receive an indication that the user is required to detect and report information related to the current speaker;
检测单元, 用于根据所述接收单元所接收的指示,检测用户与当前发言人 有关的信息;  a detecting unit, configured to detect, according to the indication received by the receiving unit, information related to the current speaker;
发送单元,用于发送所述检测单元检测得到的用户与当前发言人有关的信 息给媒体控制器。  And a sending unit, configured to send information about the user detected by the detecting unit to the current speaker to the media controller.
通过本发明实施例提供的会议中获取当前活动发言人方法及媒体服务器、 媒体控制器、媒体处理器,媒体控制器指示媒体处理器检测并上报用户与当前 发言人有关的信息,媒体控制器根据所述用户与当前发言人有关的信息, 获取 当前会议中的活动发言人。  Obtaining the current active speaker method and the media server, the media controller, and the media processor in the conference provided by the embodiment of the present invention, the media controller instructs the media processor to detect and report information related to the current speaker, and the media controller is configured according to The user is related to the current speaker, and obtains an event speaker in the current meeting.
附图说明 DRAWINGS
图 1为本发明会议中获取当前活动发言人方法第一实施例的信令流程图; 图 2为本发明会议中获取当前活动发言人方法第二实施例的信令流程图; 图 3为本发明会议中获取当前活动发言人方法第三实施例的信令流程图; 图 4为本发明媒体服务器一个实施例的结构图。  FIG. 1 is a signaling flowchart of a first embodiment of a method for acquiring a current active speaker in a conference according to the present invention; FIG. 2 is a signaling flowchart of a second embodiment of a method for acquiring a current active speaker in a conference according to the present invention; A signaling flow chart for obtaining a third embodiment of the current active speaker method in the invention meeting; FIG. 4 is a structural diagram of an embodiment of the media server of the present invention.
具体实施方式 detailed description
本发明实施例提供一种会议中获取当前活动发言人的方法及媒体服务器、 媒体控制器、媒体处理器。 为使本发明的技术方案更加清楚明白, 以下参照附 图并列举实施例, 对本发明进一步详细说明。  Embodiments of the present invention provide a method for obtaining a current active speaker in a conference, a media server, a media controller, and a media processor. In order to make the technical solutions of the present invention clearer, the present invention will be further described in detail below with reference to the accompanying drawings.
请参照图 1 , 为本发明会议中获取当前活动发言人的方法第一实施例的信 令流程图。 本实施例中的会议为音频会议,媒体处理器上报会议中各用户端点 的当前活动发言人列表。  Please refer to FIG. 1 , which is a flow chart of a signal of a first embodiment of a method for obtaining a current active speaker in a conference of the present invention. The conference in this embodiment is an audio conference, and the media processor reports the current active speaker list of each user endpoint in the conference.
具体过程包括: 步骤 101 : 媒体控制器指示媒体处理器检测并上报与当前发言人有关的信 息; The specific process includes: Step 101: The media controller instructs the media processor to detect and report information related to the current speaker.
与当前发言人有关的信息可以为各用户端点的当前活动发言人列表,包括 用户端点听到的当前参与会议中混音的其他用户端点标识或者用于表示用户 端标点的通配符;  The information related to the current speaker may be a list of currently active speakers for each user endpoint, including other user endpoint identities that are heard by the user endpoint in the current participating conference, or wildcards used to represent the user's punctuation;
与当前发言人有关的信息还可以为当前活动发言人的听众列表,包括当前 活动发言人听众的用户端点标识或者用于表示用户端标点的通配符;  The information related to the current speaker may also be a list of listeners of the current event speaker, including the user endpoint identifier of the current active speaker's audience or a wildcard used to indicate the client's punctuation;
与当前发言人有关的信息也可以为处于发言状态的用户信息,包括处于发 言状态的用户端点标识或者用于表示用户端标点的通配符。  The information related to the current speaker may also be user information in a speaking state, including a user endpoint identifier in an uttered state or a wildcard character used to represent the user's punctuation.
本实施例中,与当前发言人有关的信息为各用户端点的当前活动发言人列 表。  In this embodiment, the information related to the current speaker is a list of currently active speakers for each user endpoint.
媒体控制器可以指示媒体处理器周期上报各用户端点的当前活动发言人 列表。  The media controller can instruct the media processor to periodically report a list of currently active speakers for each user endpoint.
由于每个用户端点如果分别上报自己能够听到的活动发言人列表,则存在 大量的上报信息, 为减少上报数据量,媒体控制器还可以指示媒体处理器上报 发生变化的各用户端点的当前活动发言人列表。  Since each user endpoint separately reports the list of active speakers that can be heard by the user, there is a large amount of report information. To reduce the amount of reported data, the media controller can also instruct the media processor to report the current activity of each user endpoint that has changed. List of speakers.
媒体控制器还可以指示媒体处理器检测并周期上报发生变化的各用户端 点的当前活动发言人列表。如果某用户端点的活动发言人列表没有变化, 即使 上报时间间隔到了, 媒体处理器也不上报该用户端点的活动发言人列表。  The media controller can also instruct the media processor to detect and periodically report the list of currently active speakers for each user endpoint that has changed. If the active speaker list of a user endpoint does not change, even if the reporting interval is reached, the media processor does not report the active speaker list of the user endpoint.
活动发言人列表发生变化的实质就是混音列表内容发生变化,引起相关变 化的情形有: 用户由发言变为停止发言, 或用户由不发言变为开始发言, 或者 用户发言音量升高或降低。  The essence of the change in the list of active speakers is that the content of the mix list changes, and the situation that causes the change is: the user changes from speaking to stopping, or the user changes from not speaking to starting to speak, or the volume of the user speaking is increased or decreased.
本实施例中, 媒体控制器通过增加请求 (ADD.Request)命令指示媒体处理 器向会议上下文中添加指定的用户端点 A、 B、 C、 D, 并同时指示媒体处理器 检测用户端点 A、 B、 C、 D的活动发言人列表和周期上报的时间间隔。  In this embodiment, the media controller instructs the media processor to add the specified user endpoints A, B, C, and D to the conference context by adding an ADD. Request command, and simultaneously instructs the media processor to detect the user endpoints A, B. , C, D, the list of active speakers and the time interval for periodic reporting.
步骤 102: 媒体处理器检测与当前发言人有关的信息;  Step 102: The media processor detects information related to the current speaker;
本实施例中,媒体处理器检测各用户端点的当前活动发言人列表。媒体处 理器收到媒体控制器的 ADD.Request命令后, 向媒体控制器返回增加响应 (ADD.Reply)消息, 并开始检测用户端点 A、 B、 C、 D的活动发言人列表。 步骤 103: 将所述用户与当前发言人有关的信息上报给媒体控制器; 本实施例中,媒体控制器指示媒体处理器周期上报发生变化的各用户端点 的当前活动发言人列表。 In this embodiment, the media processor detects a current active speaker list of each user endpoint. After receiving the ADD.Request command of the media controller, the media processor returns an ADD.Reply message to the media controller, and starts detecting the active speaker list of the user endpoints A, B, C, and D. Step 103: Report the information related to the current speaker to the media controller. In this embodiment, the media controller instructs the media processor to periodically report the current active speaker list of each user endpoint that has changed.
如果在所述指示中携带上报时间间隔信息,则媒体处理器按照指示中携带 的上报时间间隔, 上报发生变化的用户端点的当前活动发言人列表; 如果在所 述指示中没有携带上报时间间隔信息, 可以在媒体处理器中设置上报时间间 隔, 则媒体处理器按照自身设置的上报时间间隔, 上报发生变化的用户端点的 当前活动发言人列表。  If the indication carries the reporting time interval information, the media processor reports the current active speaker list of the changed user endpoint according to the reporting time interval carried in the indication; if the indication does not carry the reporting time interval information The reporting time interval may be set in the media processor, and the media processor reports the current active speaker list of the changed user endpoint according to the reporting interval set by the media processor.
例如, 代表用户的端点八、 B、 C、 D参与会议, 媒体控制器设置端点 A、 B、 C、 D的混音方数 n分别是 2、 3、 3和 1。 当上 4艮时间间隔达到并且用户的活 动发言人列表发生变化时, 当前发言人是端点 A、 B、 D, 音量大小排序为端点 A、 B、 D, 因为活动发言状态端点本身不参与混音, 这样根据每个端点设定的 混音方数,端点 A听到的是端点 B的发言,上报的活动发言人列表中包含端点 B; 端点 B听到的是端点 A、 D的发言, 上报的活动发言人列表中包含端点 A、 D; 端点 C听到的是端点 A、 B、 D三方的混音, 上报的活动发言人列表中包含端点 A、 B、 D; 端点 D只能听到音量最大的端点 A的发言, 上报的活动发言人列表 中包含端点 A。  For example, the endpoints 8, B, C, and D representing the user participate in the conference, and the media controller sets the number of mixes n of endpoints A, B, C, and D to be 2, 3, 3, and 1, respectively. When the upper 4 艮 interval is reached and the user's active speaker list changes, the current speaker is endpoints A, B, D, and the volume is sorted as endpoints A, B, and D because the active speaking state endpoint itself does not participate in the mix. According to the number of mixes set by each endpoint, endpoint A hears the speech of endpoint B, and the list of reported active speakers includes endpoint B; endpoint B hears the endpoints A and D, and reports The event speaker list contains endpoints A and D; endpoint C hears the mix of endpoints A, B, and D. The reported event speaker list contains endpoints A, B, and D; endpoint D can only hear The speech of the endpoint A with the highest volume, the endpoint A of the reported event speaker list.
步骤 104: 媒体控制器根据所述与当前发言人有关的信息, 获取当前会议 中的活动发言人信息。  Step 104: The media controller acquires event speaker information in the current conference according to the information about the current speaker.
本实施例中, 媒体控制器接收到媒体处理器上报的 Notify.Request请求后, 向媒体处理器返回通知响应 (Notify.Reply)消息, 并根据上报的用户端点八、 B、 C、 D活动发言人列表, 获取当前会议中活动的发言人为端点八、 B、 D。  In this embodiment, after receiving the Notify.Request request reported by the media processor, the media controller returns a notification response (Notify.Reply) message to the media processor, and performs a speech according to the reported user endpoints eight, B, C, and D. For the list of people, get the spokesperson for the event in the current meeting as endpoints eight, B, and D.
获取当前会议中的活动发言人后,可以将当前会议中的活动发言人在用户 终端进行显示或者进行其他处理。  After the event speaker in the current meeting is obtained, the event speaker in the current meeting can be displayed or processed in the user terminal.
例如, 媒体控制器可以通过 RFC4722中定义的会议事件通知用户服务器 For example, the media controller can notify the user server through the conference event defined in RFC4722
(AS, Application Server), 由 AS负责控制用户端点 A、 B、 C、 D显示当前活动发 言人有哪些。 同时, 还可以将活动发言人本身也显示出来。 例如, 端点 A代表 的用户终端上可以显示端点 A和 B为会议中的当前活动发言人。 (AS, Application Server), the AS is responsible for controlling the user endpoints A, B, C, D to show which current active speakers are. At the same time, the event spokesperson itself can also be displayed. For example, the user terminal represented by endpoint A can display endpoints A and B as current active speakers in the conference.
现有的多点会议控制协议 H.248关于音频会议定义了下列包: 音量控制包Volume Control Package): 音量控制包用于定义一个特性来设 置从与会者端接收到的媒体音量。 MC可以通过音量电平阔值来指示 MP对某个 与会者的音量电平进行设置。 MP可以使用音量电平进行混音。 The existing Multipoint Conference Control Protocol H.248 defines the following packages for audio conferencing: Volume Control Package): The volume control package is used to define a feature to set the volume of media received from the participant. The MC can instruct the MP to set the volume level of a certain participant by the volume level threshold. The MP can be mixed using the volume level.
音量检测包 (Volume Detection Package): 音量检测包定义了用于指示与会 者音量超过阔值的事件。 MC通过音量检测包可以指定音量电平阔值, 当音频 音量电平超过阔值时, MP应向 MC发送 Notify消息。  Volume Detection Package: The volume detection package defines an event that indicates that the participant's volume exceeds the threshold. The MC can specify the volume level threshold through the volume detection packet. When the audio volume level exceeds the threshold, the MP should send a Notify message to the MC.
混音包 (Volume Level Mixing Package): 定义端点进行音频混音的算法,参 数包含混音音量电平阔值和最大混音方数。 MC指示 MP为特定会议设置混音音 量电平阔值, 当某个参会者的音量超出或等于音量电平阔值时, 则该参会者发 来的媒体流将被用于混音。 当某个参会者的音量低于阔值时, 则该参会者发来 的媒体流将不会被用于混音。 MC指示 MP为特定会议设置最大混音方数 n, 该 参数分配给所有愿意听到前 n个音量最大发言者的每一个与会者。 此时, MP 将把音量超过或等于阔值的前 n个音量最大的发言者的声音进行混音, 并将混 合后的声音输出到各个相关的端点。  Volume Level Mixing Package: An algorithm that defines the endpoint for audio mixing. The parameters include the mix volume level and the maximum mix. The MC instructs the MP to set the mix volume level for a particular conference. When the volume of a participant exceeds or equals the volume level, the media stream sent by the participant will be used for the mix. When the volume of a participant is below the threshold, the media stream sent by the participant will not be used for the mix. The MC instructs the MP to set the maximum number of mixes n for a particular conference. This parameter is assigned to all participants who are willing to hear the first n loudest speakers. At this time, the MP will mix the voices of the speakers with the highest volume of the first n volumes whose volume is greater than or equal to the threshold, and output the mixed sounds to the respective relevant endpoints.
会议中每个端点都可以设置不同的混音音量电平阔值和混音方数值。当各 端点混音音量电平阔值相同, 听到的混音方数相同时, 称为混音策略相同, 当 各端点混音音量电平阔值不同,听到的混音方数也不同时,称为混音策略不同。  Each of the endpoints in the conference can set different mix volume level and mix value. When the mixing volume level of each endpoint is the same, and the number of mixing squares heard is the same, the mixing strategy is the same. When the mixing volume level of each endpoint is different, the number of mixing squares heard is not the same. At the same time, it is called a mix strategy.
混音电平控制包 (Mixing Volume Level Control Package): 混音电平控制包 定义一组属性用来控制每个与会者的音量电平,以便按照一定的混音算法为某 个与会者输出指定的混音媒体。  Mixing Volume Level Control Package: The Mix Level Control Package defines a set of attributes that control the volume level of each participant to specify a participant output according to a certain mixing algorithm. Remix media.
为了实现媒体处理器上报各用户端点当前活动发言人列表给媒体控制器, 因此在 H.248协议中增加上报各用户端点当前活动发言人列表事件包, 所述事 件包指示 MP上报各用户端点当前活动发言人列表。  In order to implement the media processor to report the current active speaker list of each user endpoint to the media controller, the event packet of the current active speaker list is reported in the H.248 protocol, and the event packet indicates that the MP reports the current user endpoints. List of event speakers.
为了实现媒体控制器指示媒体处理器周期上报各用户端点当前活动发言 人列表, 所述事件包还可以包括可选的上报时间间隔。 当所述事件包中带有上 报时间间隔时,媒体处理器按照事件包中的上报时间间隔上报各用户端点当前 活动发言人列表; 当所述事件包中没有带上报时间间隔时,媒体处理器可以按 照媒体处理器中设置的上报时间间隔上报各用户端点当前活动发言人列表。  In order to implement that the media controller instructs the media processor to periodically report the current active speaker list of each user endpoint, the event package may further include an optional reporting time interval. When the event packet has a reporting interval, the media processor reports the current active speaker list of each user endpoint according to the reporting interval in the event packet; when there is no reporting interval in the event packet, the media processor The current active speaker list of each user endpoint can be reported according to the reporting interval set in the media processor.
为了实现媒体控制器指示媒体处理器上报发生变化的各用户端点当前活 动发言人列表,所述事件包还可以包括要求媒体处理器上报发生变化的各用户 端点当前活动发言人列表的指示信息。媒体处理器可以按照所述指示信息上报 发生变化的各用户端点当前活动发言人列表。 In order to implement the media controller to instruct the media processor to report the change of each user endpoint currently live The event spit list may further include an indication that the media processor is required to report the change of the current active speaker list of each user endpoint. The media processor may report the current active speaker list of each user endpoint that changes according to the indication information.
活动发言人列表即当前参与混音的用户列表,列表内容为表示当前活动发 言人的端点标识。  The list of active speakers is the list of users currently participating in the mix, and the list contents are the endpoint IDs indicating the currently active speakers.
请参照图 2, 为本发明会议中显示当前活动发言人的方法第二实施例的信 令流程图。 本实施例中的会议为音频会议,媒体处理器上报会议中当前活动发 言人的听众列表。  Referring to FIG. 2, it is a flow chart of a signal of a second embodiment of a method for displaying a current active speaker in a conference of the present invention. The conference in this embodiment is an audio conference, and the media processor reports the audience list of the currently active speaker in the conference.
具体过程包括:  The specific process includes:
步骤 201 : 媒体控制器指示媒体处理器检测并上报与当前发言人有关的信 息;  Step 201: The media controller instructs the media processor to detect and report information related to the current speaker.
本实施例中, 与当前发言人有关的信息为当前活动发言人的听众列表, 所 述当前活动发言人的听众列表包括听到用户话音的其他用户端点标识或者用 于表示用户端标点的通配符。  In this embodiment, the information related to the current speaker is a listener list of the currently active speaker, and the listener list of the current active speaker includes other user endpoint identifiers that hear the user's voice or wildcards used to indicate the user's punctuation.
媒体控制器可以指示媒体处理器周期上报当前活动发言人的听众列表。 由于每个用户端点如果分别上报活动发言人的听众列表,则存在大量的上 报信息, 为减少上报数据量,媒体控制器可以指示媒体处理器上报发生变化的 当前活动发言人的听众列表。  The media controller can instruct the media processor to periodically report the listener list of the currently active speaker. Since each user endpoint separately reports the audience list of the event speaker, there is a large amount of report information. To reduce the amount of reported data, the media controller can instruct the media processor to report the listener list of the current active speaker who has changed.
媒体控制器还可以指示媒体处理器检测并周期上报发生变化的当前活动 发言人的听众列表。如果某活动发言人的听众列表没有变化, 即使上报时间间 隔到了, 媒体处理器也不上报该活动发言人的听众列表。  The media controller can also instruct the media processor to detect and periodically report a list of listeners of the currently active speaker who have changed. If the audience list of an event spokesperson has not changed, even if the escalation time is reached, the media processor does not report the audience list of the event spokesperson.
本实施例中, 媒体控制器通过增加请求 (ADD.Request)命令指示媒体处理 器向会议上下文中添加指定的用户端点 A、 B、 C、 D, 并同时指示媒体处理器 检测用户端点 A、 B、 C、 D的活动发言人听众列表和周期上报的时间间隔。  In this embodiment, the media controller instructs the media processor to add the specified user endpoints A, B, C, and D to the conference context by adding an ADD. Request command, and simultaneously instructs the media processor to detect the user endpoints A, B. , C, D event speaker list and periodic reporting interval.
步骤 202: 媒体处理器检测与当前发言人有关的信息;  Step 202: The media processor detects information related to the current speaker;
本实施例中,媒体控制器指示媒体处理器周期上 ^艮发生变化的当前活动发 言人的听众列表。  In this embodiment, the media controller indicates a list of listeners of the current active speaker whose media processor cycle has changed.
媒体处理器收到媒体控制器的 ADD.Request命令后, 向媒体控制器返回增 加响应 (ADD.Reply)消息, 并开始检测活动发言人的听众列表。 步骤 203: 将所述与当前发言人有关的信息上报给媒体控制器; 本实施例中,媒体控制器指示媒体处理器周期上 ^艮发生变化的当前活动发 言人的听众列表。 After receiving the ADD.Request command from the media controller, the media processor returns an ADD.Reply message to the media controller and begins to detect the listener list of the active speaker. Step 203: Report the information related to the current speaker to the media controller. In this embodiment, the media controller indicates the listener list of the current active speaker whose media processor cycle changes.
如果在所述指示中携带上报时间间隔信息,则媒体处理器按照指示中携带 的上报时间间隔, 上报发生变化的当前活动发言人的听众列表; 如果在所述指 示中没有携带上报时间间隔信息, 可以在媒体处理器中设置上报时间间隔, 则 媒体处理器按照自身设置的上报时间间隔,上报发生变化的用户端点的当前活 动发言人列表。  If the reporting time interval information is carried in the indication, the media processor reports the listener list of the currently active speaker that has changed according to the reporting time interval carried in the indication; if the reporting time interval information is not carried in the indication, The reporting time interval can be set in the media processor, and the media processor reports the current active speaker list of the changed user endpoint according to the reporting interval set by itself.
当用户端点发言状态发生变化时, 其听众列表的内容会发生变化, 可以通 过检测混音器, 得到当前活动发言人的听众列表。  When the state of the user's endpoint changes, the content of the listener list changes. You can check the mixer to get the list of listeners of the current active speaker.
例如, 代表用户的端点八、 B、 C、 D参与会议, 媒体控制器设置端点 A、 For example, on behalf of the user's endpoints eight, B, C, D participate in the conference, the media controller sets the endpoint A,
B、 C、 D的混音方数 n分别是 2、 3、 3和 1。 当上 时间间隔达到并且用户的活 动发言人列表发生变化时, 当前发言人是端点 A、 B、 D, 音量大小排序为端点The number of mixes of B, C, and D are 2, 3, 3, and 1, respectively. When the upper time interval is reached and the user's active speaker list changes, the current speaker is the endpoints A, B, D, and the volume is sorted as the endpoint.
A、 B、 D, 因为活动发言状态端点本身不参与混音, 这样根据每个端点设定的 混音方数, 端点 A的发言被端点 B、 C、 D听到, 上报的端点 A的听众列表包含 端点 B、 C、 D; 端点 B的发言被端点 A、 C听到, 上报的端点 B的听众列表包含 端点 A、 C; 端点 C不发言, 不会被听到, 没有听众; 端点 D的发言被端点 B、 C 听到, 上报的端点 D的听众列表包含端点 B和 C。 A, B, D, because the active speech state endpoint itself does not participate in the mix, so according to the number of mixes set by each endpoint, the speech of endpoint A is heard by endpoints B, C, D, and the listener of endpoint A is reported. The list contains endpoints B, C, D; the endpoint B's speech is heard by endpoints A, C, the reported endpoint B's listener list contains endpoints A, C; endpoint C does not speak, will not be heard, no listeners; endpoint D The speech is heard by endpoints B, C, and the listener list of endpoint D is reported to contain endpoints B and C.
步骤 204: 媒体控制器根据所述与当前发言人有关的信息, 获取当前会议 中的活动发言人信息。  Step 204: The media controller acquires event speaker information in the current conference according to the information about the current speaker.
媒体控制器接收到媒体处理器上报的 Notify.Request请求后,向媒体处理器 返回通知响应 (Notify.Reply)消息, 媒体控制器得到当前活动发言人端点八、 B、 After receiving the Notify.Request request reported by the media processor, the media controller returns a notification response (Notify.Reply) message to the media processor, and the media controller obtains the current active speaker endpoint VIII, B,
C、 D的听众列表后进行统计, 发现端点 A在端点 B数组中出现, 表明端点 A当 前听到端点 B发言; 端点 B在端点 A和 D的数组中出现, 表明端点 B当前听到端 点八、 D发言; 端点 C在所有数组中出现, 表明端点 C当前听到端点 A、 B、 D发 言; 端点 D在端点 A数组中出现, 表明端点 D当前听到端点 A发言, 媒体控制器 获取当前端点 A、 B、 D代表的用户正在发言, 即得到当前活动发言人为端点 、C, D listener list statistics, found that endpoint A appears in the endpoint B array, indicating that endpoint A is currently listening to endpoint B speaking; endpoint B appears in the array of endpoints A and D, indicating that endpoint B is currently hearing endpoint eight D speaks; endpoint C appears in all arrays, indicating that endpoint C is currently listening to endpoints A, B, and D; endpoint D appears in the endpoint A array, indicating that endpoint D is currently hearing endpoint A speaking, and the media controller gets current The user represented by endpoints A, B, and D is speaking, that is, the current active speaker is the endpoint,
B、 D。 B, D.
获取当前会议中的活动发言人后,可以将当前会议中的活动发言人在用户 终端进行显示或者进行其他处理。 After getting the event spokesperson in the current meeting, you can put the event spokesperson in the current meeting in the user. The terminal performs display or performs other processing.
例如, 媒体控制器可以通过 RFC4722中定义的会议事件通知 AS, 由 AS负 责控制用户端点八、 B、 C、 D显示当前活动发言人有哪些。 同时, 还可以将活 动发言人本身也显示出来。例如,端点 A代表的用户终端上可以显示端点 A、 B、 D为会议中的当前活动发言人。  For example, the media controller can notify the AS through the conference event defined in RFC4722, and the AS is responsible for controlling the user endpoints eight, B, C, and D to show which current active speakers are. At the same time, the event spokesperson itself can also be displayed. For example, the user terminal represented by endpoint A can display endpoints A, B, and D as the current active speaker in the conference.
为了实现媒体处理器上报当前活动发言人的听众列表给媒体控制器,因此 在 H.248协议中增加上报当前活动发言人的听众列表事件包, 所述事件包包指 示 MP上报当前活动发言人的听众列表。  In order to implement the media processor to report the listener list of the current active speaker to the media controller, the listener list event package of the current active speaker is reported in the H.248 protocol, and the event packet indicates that the MP reports the current active speaker. Listener list.
为了实现媒体控制器指示媒体处理器周期上报当前活动发言人的听众列 表, 所述事件包还可以包括可选的上报时间间隔。 当所述事件包中带有上报时 间间隔时,媒体处理器按照事件包中的上报时间间隔上报当前活动发言人的听 众列表; 当所述事件包中没有带上报时间间隔时,媒体处理器可以按照媒体处 理器中设置的上报时间间隔上报当前活动发言人的听众列表。  In order to implement that the media controller instructs the media processor to periodically report the audience list of the currently active speaker, the event package may further include an optional reporting interval. When the event packet has a reporting interval, the media processor reports the listener list of the currently active speaker according to the reporting interval in the event packet; when there is no reporting interval in the event packet, the media processor may The listener list of the currently active speaker is reported according to the reporting interval set in the media processor.
为了实现媒体控制器指示媒体处理器上报发生变化的当前活动发言人的 听众列表,所述事件包还可以包括要求媒体处理器上报发生变化的前活动发言 人听众列表的指示信息。媒体处理器可以按照所述指示信息上报发生变化的当 前活动发言人的听众列表。  In order to enable the media controller to instruct the media processor to report the listener list of the currently active speaker that has changed, the event package may also include indication information that requires the media processor to report a change in the list of pre-active speaker listeners. The media processor can report the listener list of the current active speaker that has changed according to the indication information.
请参照图 3 , 为本发明获取当前活动发言人方法第三个实施例的信令流程 图。本实施例中的会议为音频会议,媒体处理器上报会议中处于发言状态的单 一用户信息。  Please refer to FIG. 3 , which is a signaling flow diagram of a third embodiment of a method for acquiring a current active speaker according to the present invention. The conference in this embodiment is an audio conference, and the media processor reports a single user information in a conference state in the conference.
具体过程包括:  The specific process includes:
步骤 301 : 媒体控制器指示媒体处理器检测并上报与当前发言人有关的信 息;  Step 301: The media controller instructs the media processor to detect and report information related to the current speaker.
本实施例中, 与当前发言人有关的信息为处于发言状态的用户信息。 所述 指示中还可以携带音量电平阔值,所述媒体控制器指示媒体处理器检测并上报 处于发言状态的用户信息具体方式为:媒体控制器指示媒体处理器上报音量大 于或等于电平阔值的用户的信息。  In this embodiment, the information related to the current speaker is the user information in the speaking state. The indication may also carry a volume level threshold. The media controller instructs the media processor to detect and report the user information in the speaking state. The specific manner is that the media controller instructs the media processor to report the volume is greater than or equal to the level. The value of the user's information.
媒体控制器还可以指示媒体处理器周期上报处于活动发言状态即音量大 于或等于电平阔值的用户的信息。 由于每个用户端点如果分别上报活动发言人的听众列表,则存在大量的上 报信息, 为减少上报数据量,媒体控制器可以指示媒体处理器上报发生变化的 处于活动发言状态的用户信息。 The media controller may also instruct the media processor to periodically report information of the user who is in the active speaking state, that is, the volume is greater than or equal to the threshold. Since each user endpoint separately reports the audience list of the event speaker, there is a large amount of report information. To reduce the amount of reported data, the media controller can instruct the media processor to report the changed user information in the active speaking state.
媒体控制器还可以指示媒体处理器检测并周期上报发生变化的处于活动 发言状态的用户信息。如果处于活动发言状态的用户信息没有变化, 即使上报 时间间隔到了, 媒体处理器也不上报用户信息。  The media controller can also instruct the media processor to detect and periodically report the changed user information in the active speaking state. If the user information in the active speaking state has not changed, the media processor does not report the user information even if the reporting time interval is reached.
所述用户信息包括处于发言状态的用户端点标识或者用于表示用户端标 点的通配符。  The user information includes a user endpoint identifier in a speaking state or a wildcard character used to represent the client endpoint.
本实施例中, 媒体控制器指示媒体处理器上报单一处当前活动发言人。 本实施例中, 媒体控制器用 ADD.Request命令指示媒体处理器向会议上下 文中添加端点 A、 B、 C、 D, 同时指示媒体处理器检测所述端点 A、 B、 C、 D 代表的用户发言状态的变化和音量电平阔值。  In this embodiment, the media controller instructs the media processor to report the current active speaker at a single location. In this embodiment, the media controller uses the ADD.Request command to instruct the media processor to add the endpoints A, B, C, and D to the conference context, and instruct the media processor to detect the user represented by the endpoints A, B, C, and D. The state changes and the volume level is wide.
步骤 302: 媒体处理器检测用户当前的发言状态;  Step 302: The media processor detects a current speaking state of the user.
本实施例中, 媒体处理器收到媒体控制器的 ADD.Request命令后, 向媒体 控制器返回增加响应 (ADD.Reply)消息, 并开始检测用户端点 A、 B、 C、 D的 发言状态, 即音量电平值。  In this embodiment, after receiving the ADD.Request command of the media controller, the media processor returns an ADD.Reply message to the media controller, and starts detecting the speaking states of the user endpoints A, B, C, and D, That is, the volume level value.
步骤 303: 将所述与当前发言人有关的信息上报给媒体控制器;  Step 303: Report the information related to the current speaker to the media controller.
本实施例中,媒体控制器指示媒体处理器周期上报处于发言状态的用户信 息。  In this embodiment, the media controller instructs the media processor to periodically report the user information in the speaking state.
如果在所述指示中携带上报时间间隔信息,则媒体处理器按照指示中携带 的上 时间间隔, 上"¾处于发言状态的用户信息; 如果在所述指示中没有携带 上报时间间隔信息, 可以在媒体处理器中设置上报时间间隔, 则媒体处理器按 照自身设置的上报时间间隔, 上报处于发言状态的用户信息。  If the reporting time interval information is carried in the indication, the media processor performs the user information in the speaking state according to the upper time interval carried in the indication; if the reporting time interval information is not carried in the indication, When the reporting time interval is set in the media processor, the media processor reports the user information in the speaking state according to the reporting interval set by the media processor.
例如, 当前会议中, 端点 A先发言, 音量超过阔值, 媒体处理器上报端点 A给媒体控制器, 端点 D随后发言, 且音量超过阔值, 媒体处理器上报端点 D 给媒体控制器, 此时媒体控制器保留端点 D。  For example, in the current conference, endpoint A speaks first, the volume exceeds the threshold, the media processor reports endpoint A to the media controller, endpoint D then speaks, and the volume exceeds the threshold, and the media processor reports endpoint D to the media controller. The media controller retains endpoint D.
如果多个端点代表的用户变为活动发言人且音量大于或者等于音量电平 阔值时, 选择音量最大的端点代表的用户向 MC上报。  If the user represented by multiple endpoints becomes the active speaker and the volume is greater than or equal to the volume level threshold, the user representing the endpoint with the highest volume is reported to the MC.
例如, 当前会议中, 端点 A、 D同时发言, 音量均超过阔值, 且端点 D的 音量大于端点 A的音量, 媒体处理器上报端点 D给媒体控制器。 For example, in the current conference, endpoints A and D speak at the same time, the volume exceeds the threshold, and the endpoint D The volume is greater than the volume of endpoint A, and the media processor reports endpoint D to the media controller.
步骤 304: 媒体控制器根据所述处于发言状态的用户信息, 获取当前会议 中的活动发言人信息。  Step 304: The media controller acquires event speaker information in the current conference according to the user information in the speaking state.
获取当前会议中的活动发言人后,可以将当前会议中的活动发言人在用户 终端进行显示或者进行其他处理。  After the event speaker in the current meeting is obtained, the event speaker in the current meeting can be displayed or processed in the user terminal.
例如, 媒体控制器还可以通过 RFC4722中定义的会议事件通知 AS, 由 AS 负责控制用户端点 A、 B、 C、 D显示当前活动发言人有哪些。 例如, 端点 A代 表的用户终端上可以显示端点 D为会议中的当前活动发言人。  For example, the media controller can also notify the AS through the conference event defined in RFC4722, and the AS is responsible for controlling the user endpoints A, B, C, and D to show which current active speakers are. For example, endpoint D can display endpoint D as the current active speaker in the conference.
为了实现媒体处理器上报处于发言状态的用户信息给媒体控制器,因此在 H.248协议中增加上报处于发言状态的用户信息事件包, 所述事件包指示 MP 上才艮各处于发言状态的用户信息。  In order to implement the media processor to report the user information in the speaking state to the media controller, the H.248 protocol adds a user information event packet that is reported in the speaking state, and the event packet indicates the user who is in the speaking state on the MP. information.
为了实现媒体控制器指示媒体处理器周期上报处于发言状态的用户信息, 所述事件包还可以包括可选的上报时间间隔。当所述事件包中带有上报时间间 隔时, 媒体处理器按照事件包中的上报时间间隔上报处于发言状态的用户信 息; 当所述事件包中没有带上报时间间隔时,如果媒体处理器中设置了上报时 间间隔,则按照媒体处理器中设置的上报时间间隔上报处于发言状态的用户信 息。  In order to implement the media controller to indicate that the media processor periodically reports the user information in the speaking state, the event packet may further include an optional reporting time interval. When the event packet has a reporting interval, the media processor reports the user information in the speaking state according to the reporting interval in the event packet; if there is no reporting interval in the event packet, if the media processor is in the media processor When the reporting interval is set, the user information in the speaking state is reported according to the reporting interval set in the media processor.
为了实现媒体控制器指示媒体处理器上 ^艮发生变化的处于发言状态的用 户信息,所述事件包还可以包括要求媒体处理器上^艮发生变化的处于发言状态 的用户信息。媒体处理器可以按照所述指示信息上报发生变化的处于发言状态 的用户信息。  In order to implement the user information indicating that the media controller is in the speaking state on the media processor, the event package may further include user information in the speaking state that requires the media processor to change. The media processor may report the changed user information in the speaking state according to the indication information.
处于发言状态的用户信息为表示当前活动发言人的端点标识。  The user information in the speaking state is the endpoint identifier indicating the currently active speaker.
本发明实施例提供的会议中显示当前活动发言人的方法不仅能够使媒体 控制器在音频会议中获取会议当前活动发言人,还能够应用于其他类型的多媒 体会议中。  The method for displaying the current active speaker in the conference provided by the embodiment of the present invention can not only enable the media controller to obtain the current event speaker in the audio conference, but also can be applied to other types of multimedia conferences.
通过上述本发明实施例提供的会议中获取当前活动发言人方法,媒体控制 器能够指示媒体处理器上报处于发言状态的用户信息,并根据所述处于发言状 态的用户信息获取当前会议中的活动发言人。  The method for obtaining the current active speaker in the conference provided by the foregoing embodiment of the present invention, the media controller can instruct the media processor to report the user information in the speaking state, and obtain the active speech in the current conference according to the user information in the speaking state. people.
请参照图 4, 为本发明媒体服务器一个实施例的结构图。 所述媒体服务器 包括媒体控制器 40、 以及与媒体控制器 40相连的媒体处理器 41。 所述媒体控制 器 40用于指示媒体处理器 41检测并上报与当前发言人有关的信息,并根据所述 媒体处理器 41上报的与当前发言人有关的信息,获取当前会议中的活动发言人 信息; Please refer to FIG. 4, which is a structural diagram of an embodiment of a media server according to the present invention. The media server A media controller 40, and a media processor 41 coupled to the media controller 40 are included. The media controller 40 is configured to instruct the media processor 41 to detect and report information related to the current speaker, and obtain an event speaker in the current conference according to the information about the current speaker reported by the media processor 41. information;
所述媒体处理器 41用于根据所述媒体控制器 40发送的检测并上报与当前 发言人有关的信息的指示,检测并上报与当前发言人有关的信息给媒体控制器 40。  The media processor 41 is configured to detect and report information related to the current speaker to the media controller 40 according to the indication sent by the media controller 40 to detect and report information related to the current speaker.
所述媒体控制器 40包括发送单元 401、 接收单元 402、 解析单元 403。  The media controller 40 includes a transmitting unit 401, a receiving unit 402, and a parsing unit 403.
所述发送单元 401用于发送要求检测并上报与当前发言人有关的信息的指 示给媒体处理器 41 ; 所述接收单元 402用于接收媒体处理器 41上报的与当前发 言人有关的信息;所述解析单元 403用于解析所述接收单元 402接收到的与当前 发言人有关的信息, 获取当前会议中的活动发言人信息。  The sending unit 401 is configured to send an indication that the information related to the current speaker is required to be detected and reported to the media processor 41. The receiving unit 402 is configured to receive information about the current speaker reported by the media processor 41. The parsing unit 403 is configured to parse the information about the current speaker received by the receiving unit 402, and obtain event speaker information in the current conference.
所述媒体控制器 40还可以包括第一发送控制单元, 用于控制发送单元 401 发送要求检测并周期上报与当前发言人有关的信息的指示。  The media controller 40 may further include a first transmission control unit, configured to control the sending unit 401 to send an indication that the information related to the current speaker is required to be detected and periodically reported.
所述媒体控制器 40还可以包括第二发送控制单元, 用于控制发送单元 401 发送要求检测并上报发生变化的与当前发言人有关的信息的指示。  The media controller 40 may further include a second transmission control unit for controlling the sending unit 401 to send an indication that the information related to the current speaker is required to be detected and reported.
所述媒体控制器 40还可以包括第三发送控制单元, 用于控制发送单元 401 发送要求检测并周期上报发生变化的与当前发言人有关的信息的指示。  The media controller 40 may further include a third transmission control unit for controlling the sending unit 401 to send an indication of information related to the current speaker that is required to detect and periodically report the change.
所述与当前发言人有关的信息可以为各用户端点的当前活动发言人列表、 当前活动发言人的听众列表、 处于发言状态的用户信息。  The information related to the current speaker may be a current active speaker list of each user endpoint, a listener list of the current active speaker, and user information in a speaking state.
所述发送单元 401发送的指示中可以携带上报时间间隔信息, 用于指示媒 体处理器 41周期上报与当前发言人有关的信息。  The indication sent by the sending unit 401 may carry the reporting time interval information, which is used to instruct the media processor 41 to periodically report information related to the current speaker.
所述媒体处理器 41包括接收单元 411、 检测单元 412、 发送单元 413。  The media processor 41 includes a receiving unit 411, a detecting unit 412, and a transmitting unit 413.
所述接收单元 411用于接收所述媒体控制器 40发送的要求检测并上报用户 与当前发言人有关的信息的指示;所述检测单元 412用于根据所述接收单元 411 所接收的指示, 检测与当前发言人有关的信息; 所述发送单元 413用于发送所 述检测单元 412检测得到的与当前发言人有关的信息给媒体控制器 40。  The receiving unit 411 is configured to receive an indication sent by the media controller 40 to detect and report information related to the current speaker. The detecting unit 412 is configured to detect according to the indication received by the receiving unit 411. The information about the current speaker is sent to the media controller 40 by the sending unit 413 for transmitting the information about the current speaker detected by the detecting unit 412.
所述媒体处理器 41还可以包括监控单元,用于监控与当前发言人有关的信 息是否发生变化, 如果是, 则控制发送单元 413发送所述检测单元 412检测得到 的与当前发言人有关的信息给媒体控制器 40。 The media processor 41 may further include a monitoring unit configured to monitor whether information related to the current speaker changes, and if so, the control sending unit 413 sends the detecting unit 412 to detect The information related to the current speaker is given to the media controller 40.
所述媒体处理器 41还可以包括周期上 4艮控制单元, 用于控制发送单元 413 周期发送所述检测单元 412检测得到的与当前发言人有关的信息给媒体控制器 40。  The media processor 41 may further include a periodic control unit for controlling the sending unit 413 to periodically send the information about the current speaker detected by the detecting unit 412 to the media controller 40.
通过本发明实施例提供的会议中获取当前活动发言人方法及媒体服务器、 媒体控制器、媒体处理器,媒体控制器指示媒体处理器上报用户与当前发言人 有关的信息, 并根据所述用户与当前发言人有关的信息, 获取当前会议中的活 动发言人, 因此媒体控制器能够在会议中获取当前活动发言人, 所获取的当前 会议中的活动发言人可以在用户终端进行显示或者进行其他处理。  Obtaining the current active speaker method and the media server, the media controller, and the media processor in the conference provided by the embodiment of the present invention, the media controller instructing the media processor to report the information about the user and the current speaker, and according to the user and the user The information about the current speaker gets the event spokesperson in the current meeting, so the media controller can obtain the current event spokesperson in the meeting, and the obtained event spokesperson in the current meeting can display or perform other processing in the user terminal. .
需要说明的是,本领域普通技术人员可以理解实现上述实施例方法中的全 部或部分步骤是可以通过程序来指令相关的硬件来完成,所述的程序可以存储 于一计算机可读取存储介质中, 该程序在执行时, 包括上述方法的步骤; 所述 的存储介质, 如: ROM/RAM、 磁碟、 光盘等。  It should be noted that those skilled in the art can understand that all or part of the steps of implementing the above embodiments can be completed by a program to instruct related hardware, and the program can be stored in a computer readable storage medium. The program, when executed, includes the steps of the above method; the storage medium, such as: ROM/RAM, disk, optical disk, and the like.
以上对本发明所提供的一种会议中获取当前活动发言人方法及媒体服务 器、媒体控制器、媒体处理器进行了详细介绍, 本文中应用了具体个例对本发 明的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本发明 所揭示的技术方案; 同时, 对于本领域的一般技术人员, 依据本发明的思想, 在具体实施方式及应用范围上均会有改变之处, 综上所述, 本说明书内容不应 理解为对本发明的限制。  The method for obtaining the current event speaker and the media server, the media controller, and the media processor in the conference provided by the present invention is described in detail. The principles and implementation manners of the present invention are described in the following. The description of the above embodiments is only for helping to understand the technical solutions disclosed by the present invention; at the same time, for those skilled in the art, according to the idea of the present invention, there will be changes in the specific embodiments and application scopes. The description herein is not to be construed as limiting the invention.

Claims

权 利 要 求 Rights request
1. 一种会议中获取当前活动发言人的方法, 其特征在于, 所述方法包括: 媒体控制器指示媒体处理器检测并上报与当前发言人有关的信息; 媒体控制器接收媒体处理器检测并上报的与当前发言人有关的信息; 媒体控制器根据所述与当前发言人有关的信息,获取当前会议中的活动发 言人信息。  A method for obtaining a current active speaker in a conference, the method comprising: the media controller instructing the media processor to detect and report information related to the current speaker; and the media controller receiving the media processor to detect The reported information related to the current speaker; the media controller obtains the event speaker information in the current conference according to the information related to the current speaker.
2. 根据权利要求 1所述的会议中获取活动发言人的方法, 其特征在于, 所 述用户与当前发言人有关的信息为: 各用户端点的当前活动发言人列表。  2. The method for obtaining an event speaker in a conference according to claim 1, wherein the information about the user and the current speaker is: a list of currently active speakers of each user endpoint.
3. 根据权利要求 1所述的会议中获取活动发言人的方法, 其特征在于, 所 述用户与当前发言人有关的信息为: 当前活动发言人的听众列表。  3. The method for obtaining an event speaker in a conference according to claim 1, wherein the information about the user and the current speaker is: a list of listeners of the current active speaker.
4. 根据权利要求 1所述的会议中获取活动发言人的方法, 其特征在于, 所 述用户与当前发言人有关的信息为: 处于发言状态的用户信息。  The method for obtaining an event speaker in a conference according to claim 1, wherein the information about the user and the current speaker is: user information in a speaking state.
5. 根据权利要求 4所述的会议中获取活动发言人的方法, 其特征在于, 所述媒体控制器指示媒体处理器检测并上报处于发言状态的用户信息具体方 式为: 媒体控制器指示媒体处理器上报音量大于或等于电平阔值的用户信息。  The method for obtaining an event speaker in a conference according to claim 4, wherein the media controller instructs the media processor to detect and report the user information in the speaking state, the specific manner is: the media controller instructs the media processing The user reports the user information whose volume is greater than or equal to the level threshold.
6. 根据权利要求 1至 5中任一项所述的会议中获取当前活动发言人的方 法, 其特征在于, 所述媒体控制器指示媒体处理器检测并上报与当前发言人有 关的信息具体为:  The method for obtaining a current active speaker in a conference according to any one of claims 1 to 5, wherein the media controller instructs the media processor to detect and report information related to the current speaker, specifically :
媒体控制器指示媒体处理器检测并周期上报用户与当前发言人有关的信 息;  The media controller instructs the media processor to detect and periodically report information about the user related to the current speaker;
所述媒体控制器接收媒体处理器检测并上报的与当前发言人有关的信息 具体为:  The media controller receives information related to the current speaker detected and reported by the media processor, specifically:
媒体控制器接收媒体处理器检测并周期上报的与当前发言人有关的信息。 The media controller receives information related to the current speaker detected by the media processor and periodically reported.
7. 根据权利要求 6所述的会议中获取活动发言人的方法, 其特征在于, 所述媒体控制器指示媒体处理器检测并周期上报用户与当前发言人有关 的信息具体方式为: The method for obtaining an event speaker in a conference according to claim 6, wherein the media controller instructs the media processor to detect and periodically report information related to the current speaker to the current speaker:
在所述指示中携带上报时间间隔信息;  Carrying the reporting time interval information in the indication;
所述媒体处理器周期上报与当前发言人有关的信息的具体过程为:根据所 述指示中携带的上报时间间隔信息,媒体处理器周期上报与当前发言人有关的 信息。 The specific process of the media processor periodically reporting the information related to the current speaker is: according to the reporting interval information carried in the indication, the media processor periodically reports the current speaker Information.
8. 根据权利要求 6所述的会议中获取活动发言人的方法, 其特征在于, 所述媒体处理器周期上报与当前发言人有关的信息的具体过程为:根据媒体处 理器设置的上报时间间隔信息, 媒体处理器周期上报与当前发言人有关的信 息。  The method for obtaining an event spokesperson in a conference according to claim 6, wherein the media processor periodically reports the information related to the current spokesperson according to the reporting time interval set by the media processor. Information, the media processor cycle reports information related to the current speaker.
9. 根据权利要求 1至 5中任一项所述的会议中获取活动发言人的方法, 其特征在于,所述媒体控制器指示媒体处理器检测并上报用户与当前发言人有 关的信息具体为:  The method for obtaining an event spokesperson in a conference according to any one of claims 1 to 5, wherein the media controller instructs the media processor to detect and report information related to the current spokesperson of the user, specifically :
媒体控制器指示媒体处理器检测并上报发生变化的与当前发言人有关的 信息。  The media controller instructs the media processor to detect and report the changed information related to the current speaker.
10. 根据权利要求 1至 5中任一项所述的会议中获取当前活动发言人的方 法, 其特征在于, 所述媒体控制器指示媒体处理器检测并上报用户与当前发言 人有关的信息具体为:  The method for obtaining a current active speaker in a conference according to any one of claims 1 to 5, wherein the media controller instructs the media processor to detect and report information related to the current speaker and the current speaker. For:
媒体控制器指示媒体处理器检测并周期上报发生变化的与当前发言人有 关的信息。  The media controller instructs the media processor to detect and periodically report information relating to the current speaker that has changed.
11. 根据权利要求 1至 5中任一所述的会议中获取活动发言人的方法, 其 特征在于,所述与当前发言人有关的信息包括用户端点标识或者用于表示用户 端标点的通配符。  11. A method of obtaining an event spokesperson in a conference according to any one of claims 1 to 5, wherein said information relating to the current spokesperson comprises a user endpoint identifier or a wildcard for indicating a user end point.
12. 一种媒体服务器, 包括媒体控制器、 媒体处理器, 其特征在于, 所述媒体控制器, 用于指示媒体处理器检测并上报与当前发言人有关的信 息, 并根据所述媒体处理器上报的与当前发言人有关的信息, 获取当前会议中 的活动发言人信息;  A media server, comprising a media controller and a media processor, wherein the media controller is configured to instruct the media processor to detect and report information related to the current speaker, and according to the media processor The reported information about the current spokesperson, and information about the event spokesperson in the current meeting;
所述媒体处理器, 用于根据所述检测并上报与当前发言人有关的信息的指 示, 检测并上报与当前发言人有关的信息给媒体控制器。  The media processor is configured to detect and report information related to the current speaker to the media controller according to the indication of detecting and reporting information related to the current speaker.
13. 根据权利要求 12所述的媒体服务器, 其特征在于, 所述与当前发言 人有关的信息为: 各用户端点的当前活动发言人列表。  The media server according to claim 12, wherein the information related to the current speaker is: a current active speaker list of each user endpoint.
14. 根据权利要求 12所述的媒体服务器, 其特征在于, 所述与当前发言 人有关的信息为: 当前活动发言人的听众列表。  14. The media server according to claim 12, wherein the information related to the current speaker is: a list of listeners of the current active speaker.
15. 根据权利要求 12所述的媒体服务器, 其特征在于, 所述与当前发言 人有关的信息为: 处于发言状态的用户信息。 15. The media server according to claim 12, wherein the said and the current speaking The person-related information is: User information in the speaking state.
16. 一种媒体控制器, 其特征在于, 所述媒体控制器包括:  16. A media controller, wherein the media controller comprises:
发送单元, 用于发送要求检测并上报与当前发言人有关的信息的指示; 接收单元, 用于接收与当前发言人有关的信息;  a sending unit, configured to send an indication that is required to detect and report information related to the current speaker; and a receiving unit, configured to receive information related to the current speaker;
解析单元, 用于解析所述接收单元接收到的与当前发言人有关的信息, 获 取当前会议中的活动发言人信息。  And a parsing unit, configured to parse the information about the current speaker received by the receiving unit, and obtain event speaker information in the current conference.
17. 根据权利要求 16所述的媒体控制器, 其特征在于, 所述媒体控制器 还包括: 第一发送控制单元, 用于控制发送单元发送要求检测并周期上报与当 前发言人有关的信息的指示。  The media controller according to claim 16, wherein the media controller further comprises: a first sending control unit, configured to control the sending unit to send a request for detecting and periodically reporting information related to the current speaker. Instructions.
18. 根据权利要求 17所述的媒体控制器, 其特征在于, 所述要求检测并 周期上报与当前发言人有关的信息的指示携带上报时间间隔信息。  The media controller according to claim 17, wherein the indication of detecting and periodically reporting information related to the current speaker carries the reporting time interval information.
19. 根据权利要求 16所述的媒体控制器, 其特征在于, 所述媒体控制器 还包括: 第二发送控制单元, 用于控制发送单元发送要求检测并上报发生变化 的与当前发言人有关的信息的指示。  The media controller according to claim 16, wherein the media controller further comprises: a second sending control unit, configured to control the sending unit to send a request to detect and report the change related to the current speaker. An indication of the information.
20. 根据权利要求 16所述的媒体控制器, 其特征在于, 所述媒体控制器 还包括: 第三发送控制单元, 用于控制发送单元发送要求检测并周期上报发生 变化的与当前发言人有关的信息的指示。  The media controller according to claim 16, wherein the media controller further comprises: a third sending control unit, configured to control the sending unit to send a request for detection and periodically report a change related to the current speaker. Instructions for the information.
21. 一种媒体处理器, 其特征在于, 所述媒体处理器包括:  21. A media processor, wherein the media processor comprises:
接收单元, 用于接收要求检测并上报用户与当前发言人有关的信息的指 示;  a receiving unit, configured to receive an indication that the user is required to detect and report information related to the current speaker;
检测单元, 用于根据所述接收单元所接收的指示,检测与当前发言人有关 的信息;  a detecting unit, configured to detect information related to the current speaker according to the indication received by the receiving unit;
发送单元,用于发送所述检测单元检测得到的与当前发言人有关的信息给 媒体控制器。  And a sending unit, configured to send information about the current speaker detected by the detecting unit to the media controller.
22. 根据权利要求 21 所述的媒体处理器, 其特征在于, 所述媒体处理器 还包括监控单元, 用于监控与当前发言人有关的信息是否发生变化, 如果是, 则控制发送单元发送所述检测单元检测得到的与当前发言人有关的信息给媒 体控制器。  The media processor according to claim 21, wherein the media processor further comprises a monitoring unit, configured to monitor whether information related to the current speaker changes, and if yes, control the sending unit to send the location The detecting unit detects the information about the current speaker and sends it to the media controller.
23. 根据权利要求 21 所述的媒体处理器, 其特征在于, 所述媒体处理器 还包括周期上报控制单元,用于控制发送单元周期发送所述检测单元检测得 ^1 的与当前发言人有关的信息给媒体控制器。 23. The media processor of claim 21, wherein the media processor The method further includes a periodic reporting control unit, configured to control the sending unit to periodically send the information related to the current speaker detected by the detecting unit to the media controller.
PCT/CN2008/071438 2007-09-07 2008-06-26 A method and media server of obtaining the present active speaker in conference WO2009030128A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CNA2007101460765A CN101383876A (en) 2007-09-07 2007-09-07 Method, media server acquiring current active speaker in conference
CN200710146076.5 2007-09-07

Publications (1)

Publication Number Publication Date
WO2009030128A1 true WO2009030128A1 (en) 2009-03-12

Family

ID=40428460

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2008/071438 WO2009030128A1 (en) 2007-09-07 2008-06-26 A method and media server of obtaining the present active speaker in conference

Country Status (2)

Country Link
CN (1) CN101383876A (en)
WO (1) WO2009030128A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10348778B2 (en) * 2013-02-08 2019-07-09 Avaya Inc. Dynamic device pairing with media server audio substitution

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102111278A (en) * 2010-12-28 2011-06-29 华为技术有限公司 Conference media quality monitoring method, device and system
CN103581608B (en) * 2012-07-20 2019-02-01 Polycom 通讯技术(北京)有限公司 Spokesman's detection system, spokesman's detection method and audio/video conferencingasystem figureu
CN104639777A (en) * 2013-11-14 2015-05-20 中兴通讯股份有限公司 Conference control method, conference control device and conference system
CN107302640B (en) * 2017-06-08 2019-10-01 携程旅游信息技术(上海)有限公司 Videoconference control system and its control method
CN108920127A (en) * 2018-06-26 2018-11-30 广州视源电子科技股份有限公司 Target volume Threshold, device, equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5991277A (en) * 1995-10-20 1999-11-23 Vtel Corporation Primary transmission site switching in a multipoint videoconference environment based on human voice
US6304648B1 (en) * 1998-12-21 2001-10-16 Lucent Technologies Inc. Multimedia conference call participant identification system and method
WO2002089458A1 (en) * 2001-04-30 2002-11-07 Octave Communications, Inc. Audio conference platform with dynamic speech detection threshold
CN1805489A (en) * 2005-01-10 2006-07-19 华为技术有限公司 Method and system of implementing report of current speaker during conference

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5991277A (en) * 1995-10-20 1999-11-23 Vtel Corporation Primary transmission site switching in a multipoint videoconference environment based on human voice
US6304648B1 (en) * 1998-12-21 2001-10-16 Lucent Technologies Inc. Multimedia conference call participant identification system and method
WO2002089458A1 (en) * 2001-04-30 2002-11-07 Octave Communications, Inc. Audio conference platform with dynamic speech detection threshold
CN1805489A (en) * 2005-01-10 2006-07-19 华为技术有限公司 Method and system of implementing report of current speaker during conference

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10348778B2 (en) * 2013-02-08 2019-07-09 Avaya Inc. Dynamic device pairing with media server audio substitution

Also Published As

Publication number Publication date
CN101383876A (en) 2009-03-11

Similar Documents

Publication Publication Date Title
US7257641B1 (en) Multipoint processing unit
US9055155B2 (en) Collaborative conference experience improvement
US7283154B2 (en) Systems and methods for videoconference and/or data collaboration initiation
US7236580B1 (en) Method and system for conducting a conference call
US20070263824A1 (en) Network resource optimization in a video conference
US7359725B2 (en) Push-to-talk apparatus and method for communication between an application server and media resource function processor
US8149261B2 (en) Integration of audio conference bridge with video multipoint control unit
US7719975B2 (en) Method and system for communication session under conditions of bandwidth starvation
CN100571367C (en) Audio/video file playing method
JP3984929B2 (en) VoIP system, VoIP server, and multicast packet communication method
US8180933B2 (en) Dynamic call handling from multiple attached devices wherein devices advertize its capabililes before facilitating call through appropriate device
WO2009030128A1 (en) A method and media server of obtaining the present active speaker in conference
US10187432B2 (en) Replaying content of a virtual meeting
WO2012079510A1 (en) Mute indication method and device applied to video conferencing
CN113114688B (en) Multimedia conference management method and device, storage medium and electronic equipment
JP7463552B2 (en) SESSION CREATION METHOD, ELECTRONIC DEVICE, AND READABLE STORAGE MEDIUM
KR100556914B1 (en) Video communication service method for pear to pear type mobile phone
US8180358B2 (en) Joining a session in a communication network supporting multimedia group calls
JP4644813B2 (en) Multi-party call system, call terminal and call server in multi-party call system, multi-party call method
WO2008017250A1 (en) Method?system and device for communicating seat information under separate framework
KR100585718B1 (en) Multimedia streaming service method for mobile communication terminal
JP2010504675A (en) How to communicate between several terminals
WO2015024525A1 (en) Conference information processing method, device and system
KR20030003433A (en) The conference aparatus and method applied between another networks
CN115842809A (en) Data stream real-time transmission method and conference system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08757847

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 08757847

Country of ref document: EP

Kind code of ref document: A1