WO2015127793A1

WO2015127793A1 - Recording method, voice exchanging device, recording server, and recording system

Info

Publication number: WO2015127793A1
Application number: PCT/CN2014/089748
Authority: WO
Inventors: 诸宏亮
Original assignee: 华为技术有限公司
Priority date: 2014-02-25
Filing date: 2014-10-29
Publication date: 2015-09-03
Also published as: CN104869106A

Abstract

Disclosed are a recording method, a voice exchanging device, a recording server, and a recording system. The method comprises: before a recording starting response of a recording server is received, a voice exchanging device storing a first media stream generated after voice mixing is performed into a buffer; after the recording starting response of the recording server is received, the voice exchanging device sending a second media stream generated after voice mixing is performed to the recording server; and the voice exchanging device sending the first media stream stored the buffer to the recording server, and after the recording server receives the first media stream and the second media stream, the recording server merging the first media stream and the second media stream and performing recording. By means of the foregoing method, the present invention can avoid losing a record before a recording server prepares for recording, and accordingly, a complete record can be obtained.

Description

Recording method, voice switching device, recording server and recording system

Technical field

The present invention relates to the field of communication technologies, and in particular, to a recording method, a voice switching device, a recording server, and a recording system.

Background technique

In the unified communication and call center (UC&CC, Unified Communication & Call Center) application scenario, it is often necessary to perform on-demand recording of the user's point-to-point and conference calls. The user initiates recording on the IP phone during the call, and the IP phone sends a recording request to the IP-based voice switch (IP-PBX, IP Private Branch eXchange), IP-PBX, when the communication terminal of the call is in the conference site. The IP-PBX then joins the recording server to the conference site in a "speak only" manner, and sends a request to initiate recording to the recording server, and after receiving the startup recording response returned by the recording server, the communication terminal from the call is received. The media stream is mixed, and the media stream generated after the mixing is sent to the recording server, and the recording server acquires the media stream and performs recording, thereby realizing recording.

However, during the process in which the IP-PBX receives the recording request and receives the start recording response sent by the recording server, a number of signaling interactions are involved, during which the recording server cannot acquire the media stream generated by the communication terminal after mixing. The media stream, so recording is not possible.

Summary of the invention

The technical problem to be solved by the present invention is to provide a recording method, a voice exchange device, a recording server and a recording system, which can prevent the recording of the recording server before the recording is prepared, so that a complete recording can be obtained.

In a first aspect, the present invention provides a recording method, the method comprising: a voice switching device receiving a recording request from a first communication terminal, wherein the first communication terminal and other communication terminals communicate via the voice switching device; The voice exchange device receives the recording At the request, mixing the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and storing the media stream generated by the mixing as a first media stream in the cache; The voice switching device adds a recording server to a site created by the voice switching device, including the first communication terminal and the other communication terminal; the voice switching device sends a start recording request to the recording server; After receiving the startup recording response of the recording server, the voice switching device mixes the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and mixes the generated media. Sending, by the second media stream, the stream to the recording server, and sending, to the recording server, the first media stream saved in the cache before receiving the start recording response, so that the recording server The first media stream and the second media stream are combined and recorded.

In a first possible implementation manner of the first aspect, the method further includes: the voice switching device creates the cache when receiving the recording request.

In conjunction with the first aspect or the first possible implementation of the first aspect, in a second possible implementation manner of the first aspect, the method further includes: the voice switching device receiving the start recording response Stops saving the media stream generated by the mix in the cache.

With reference to the first aspect to the second possible implementation of the first aspect, in a third possible implementation manner of the first aspect, the media packet in the first media stream carries a serial number identifier, So that the recording server records the media packets in the first media stream in chronological order according to the serial number identifier.

In a second aspect, the present invention provides a recording method, the method comprising: a recording server joining a conference site including a first communication terminal and other communication terminals created by a voice switching device, the first communication terminal and the other communication terminal Communicating by the voice switching device; the recording server receives a startup recording request sent by the voice switching device and sends a startup recording response to the voice switching device; after transmitting the startup recording response to the voice switching device, the recording server receives a first media stream and a second media stream from the voice switching device, the first media stream being from the first communication terminal and the other before the voice switching device receives the start recording response a media stream generated by the media stream of the communication terminal after the mixing process, where the second media stream is received by the voice switching device a media stream generated by mixing a media stream from the first communication terminal and the other communication terminal after the motion recording response; the recording server performing the first media stream and the second media stream Combined recording.

In a third aspect, the present invention provides a voice switching device, where the voice switching device includes: a receiving module, a mixing module, a saving module, a joining module, a first sending module, a second sending module, and a third sending module; The receiving module is configured to receive a recording request from the first communication terminal and a startup recording response from the recording server, where the first communication terminal and the other communication terminal communicate through the voice switching device; the mixing module is configured to receive At the time of the recording request, the currently received media stream from the first communication terminal and the media stream from the other communication terminal are mixed; the saving module is configured to receive the After the recording request, before receiving the start recording response, the media stream generated by the mixing of the mixing module is saved in the cache as a first media stream; the joining module is configured to join the recording server to the voice switching device. In the conference site including the first communication terminal and the other communication terminal; the first sending module is configured to The sound server sends a start recording request; the second sending module is configured to send, after the receiving module receives the start recording response, the media stream generated by the current mixing of the mixing module as the second media stream to the a third sending module, configured to send the first media stream saved in the cache to the recording server after the receiving module receives the startup recording response, so that the recording server is The first media stream and the second media stream are combined and recorded.

In a first possible implementation manner of the third aspect, the voice switching device further includes a creating module, where the creating module is configured to create the cache when the recording request is received.

In conjunction with the third aspect, or the first possible implementation manner of the third aspect, in a second possible implementation manner of the third aspect, the saving module is further configured to stop mixing when receiving the startup recording response The media stream generated by the tone is saved in the cache.

With reference to the second possible implementation of the third aspect to the third aspect, in a third possible implementation manner of the third aspect, the media packet in the first media stream is identified by a serial number to facilitate the The recording server records the media packets in the first media stream in chronological order according to the serial number identifier.

In a fourth aspect, the present invention provides a recording server, where the recording server includes: a joining module, a first receiving module, a sending module, a second receiving module, a third receiving module, and a combined recording module; In the conference site of the first communication terminal and the other communication terminal created by the voice switching device, the first communication terminal and the other communication terminal communicate through the voice switching device; the first receiving module receives the voice exchange Sending a recording request sent by the device; the sending module is configured to send a startup recording response to the voice switching device after the first receiving module receives the startup recording request sent by the voice switching device; the second receiving module Receiving, after the sending module sends a startup recording response to the voice switching device, receiving a second media stream from the voice switching device, where the second media stream is the voice switching device receiving the startup Performing media streams from the first communication terminal and the other communication terminals after the recording response a media stream generated after the tone processing; the third receiving module is configured to receive, after the sending module sends a start recording response to the voice switching device, a first media stream that is buffered from the voice switching device, The first media stream is a media stream generated by the voice switching device mixing the media streams from the first communication terminal and the other communication terminal before receiving the start recording response; the combined recording The module is configured to perform combined recording on the second media stream received by the second receiving module and the first media stream received by the third receiving module.

In a fifth aspect, the present invention provides a recording system, the system comprising: a voice switching device and a recording server; the voice switching device is configured to receive a recording request from the first communication terminal, the first communication terminal and other communication The terminal performs communication through the voice switching device; when receiving the recording request, mixes the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and mixes The sound generated media stream is saved in the cache as the first media stream; the recording server is added to the conference site including the first communication terminal and the other communication terminal created by the voice switching device; and the recording server is sent to the recording server. Recording request; after receiving the start recording response of the recording server, mixing the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and mixing the generated media The stream is sent to the recording server as a second media stream, and is saved in the location before receiving the start recording response A first media cache audio stream to the server, so that the recording medium of said first server The volume stream and the second media stream are combined and recorded; the recording server is used to join a conference site including a first communication terminal and other communication terminals created by the voice switching device, the first communication terminal and the other communication terminal Communicating by the voice switching device; receiving a startup recording request sent by the voice switching device and transmitting a startup recording response to the voice switching device; receiving the initiated recording response from the voice switching device, receiving the voice from the voice a first media stream and a second media stream of the switching device, the first media stream being a medium from the first communication terminal and the other communication terminal before the voice switching device receives the start recording response Flowing a media stream generated after the mixing process, wherein the second media stream is that the voice switching device performs a media stream from the first communication terminal and the other communication terminal after receiving the start recording response a media stream generated after the mixing process; combining the first media stream and the second media stream.

The invention has the beneficial effects that, prior to the prior art, the voice exchange device saves the first media stream after the mixing process in the buffer before receiving the recording response of the recording server; After the server initiates the recording response, the voice switching device sends the second media stream after the mixing process to the recording server; the voice switching device sends the first media stream stored in the buffer to the recording server, and the recording server receives the first media stream and After the second media stream, the first media stream and the second media stream are combined and recorded. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.

DRAWINGS

1 is a schematic structural diagram of a networking of a scene for recording a point-to-point call in the prior art;

2 is a schematic structural diagram of a networking of a scene in which a recording method of the present invention records a point-to-point call;

3 is a flow chart of an embodiment of a recording method of the present invention;

4 is a flow chart of another embodiment of the recording method of the present invention;

Figure 5 is a flow chart of still another embodiment of the recording method of the present invention;

6 is a flow of interaction between network elements in a network in the application scenario of the recording method of the present invention in a peer-to-peer manner Cheng Tu

7 is a schematic structural diagram of an embodiment of a voice switching device according to the present invention;

8 is a schematic structural diagram of another embodiment of a voice switching device according to the present invention;

9 is a schematic structural diagram of an embodiment of a recording server of the present invention;

10 is a schematic structural view of an embodiment of a recording system of the present invention;

11 is a schematic diagram of a physical structure of still another embodiment of a voice switching device according to the present invention;

FIG. 12 is a schematic diagram showing the physical structure of another embodiment of the recording server of the present invention.

detailed description

The invention will now be described in detail in conjunction with the drawings and embodiments.

Referring to FIG. 1 , FIG. 1 is a schematic diagram of a typical network structure for recording a point-to-point call in a prior art. During a call, a user presses a button on the IP phone 11 to start recording, and the IP phone 11 sends a recording request to the IP- PBX 12, IP-PBX 12 creates a site to add the IP phone of the user and the IP phone of another user to the site. The IP-PBX 12 sends an Invite message to the SIP signaling server 13, inviting the recording server 14 to join the site, SIP signaling. The server 13 selects an appropriate recording server 14 in the recording server cluster, and returns the IP address of the recording server 14 to the IP-PBX 12 in the 200 OK message, the IP-PBX 12 adds the recording server 14 to the conference site, and the IP-PBX 12 sends the SIP. The INFO message is sent to the SIP signaling server 13, and the recording server 14 is notified to start the recording. After the IP-PBX 12 receives the 200 OK response from the recording server 14 by the SIP signaling server 13, the IP-PBX 12 sends the media stream from both parties in the conference. The mixing process is performed, and the media stream generated after the mixing is sent to the recording server 14, and the recording server 14 acquires the media stream and records the media stream to realize recording. In the process that the IP-PBX 12 receives the recording request until the IP-PBX receives the start recording response sent by the recording server 14, a number of signaling interactions are involved, during which the recording server 14 cannot obtain the media stream of both parties of the call. The media stream in this process will be lost, making the recording incomplete.

Referring to FIG. 2, FIG. 2 is a structural diagram of a networking of a scene in which a recording method of the present invention records a point-to-point call. In the method of the present invention, the IP-PBX 21 creates a site, and adds an IP phone that initiates the recording request and another IP phone to the site, and creates a cache for the recording. twenty two. The IP-PBX 21 sends an Invite message to the SIP signaling server 23, invites the recording server 24 to join the conference site, and the SIP signaling server 23 selects an appropriate recording server 24 in the recording server cluster, and sets the IP address of the recording server 24 in the 200 OK message. Replying to the IP-PBX 21, the IP-PBX 21 joins the recording server 24 to the conference site, and the IP-PBX 21 sends a SIP INFO message to the SIP signaling server 23, informing the recording server 24 to start recording. Wherein, before receiving the recording response of the recording server 24, the IP-PBX 21 mixes the media streams from both parties of the call, and sends the media stream generated after the mixing to the buffer 22; the recording server 24 After joining the conference site, the IP-PBX 21 directly transmits the media stream generated by the mixing process from both parties in the conference site to the recording server 24 through the IP address of the recording server 24, and is not sent to the buffer 22. The IP-PBX 21 transmits the media stream held in the cache 22 to the recording server 24. Therefore, after receiving the media stream, the recording server can not complete the recording of the recording server before preparing for recording, thereby achieving complete recording.

3 is a flowchart of an embodiment of a recording method of the present invention. The embodiment is a flowchart of a voice switching device, and includes:

Step S101: The voice switching device receives the recording request from the first communication terminal, and the first communication terminal and the other communication terminal communicate through the voice switching device.

A voice switching device is a network device used for voice electrical signal forwarding. Its main functions are to process user registration, call, outgoing relay, create a conference site, and interact with commands of the recording server.

The first communication terminal is a communication terminal that actively initiates a recording request, and the other communication terminals are communication terminals that participate in a recording process in addition to the first communication terminal. When communicating via a conference call created on a voice switching device, there are typically at least two other communication terminals; when peer-to-peer communication is through a voice switching device, the other communication terminals are one.

When the first communication terminal sends a recording request to the voice switching device, the voice switching device receives a recording request from the first communication terminal, wherein the first communication terminal and the other communication terminal communicate through the voice switching device.

Step S102: When receiving the recording request, the voice switching device mixes the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and uses the media stream generated by the mixing as the first media stream. Saved in the cache.

The first media stream is a media stream generated after the voice switching device mixes the media stream from the first communication terminal and the media stream from the other communication terminal before receiving the recording response of the recording server.

The voice switching device is not ready for recording until it receives a recording response from the recording server. However, the first communication terminal and the other communication terminal have started the session, and the voice switching device saves the first media stream in the cache, which can prevent the media stream before the recording server is prepared for recording, wherein the cache is already created in advance. .

Step S103: The voice switching device adds the recording server to the conference site that is created by the voice switching device, including the first communication terminal and other communication terminals.

The recording server is a device that acquires a media stream and implements user recording. After the voice switching device receives the recording request from the first communication terminal, the recording server needs to be added to the conference site to enable recording. The venue is created by the voice switching device and includes the first communication terminal and other communication terminals. If it is a point-to-point session, after the voice switching device receives the recording request of the first communication terminal, the voice switching device creates a site, and joins the first communication terminal and other communication terminals to the conference site; if it is a conference call, the conference site is before the conference call starts. The conference site that has been created, that is, the conference call, does not need to create a conference site after the voice switching device receives the recording request of the first communication terminal.

Specifically, the voice switching device sends an Invite message to the signaling server, invites the recording server to join the conference site, and the signaling server selects an appropriate recording server in the recording server cluster, and returns the IP address of the recording server to the voice exchange in the 200 OK message. The device and the voice switching device join the recording server to the site.

The signaling server is used to process the signaling and recording instructions from the voice switching device and is responsible for interaction with the recording server.

The voice switching device is a voice switching device based on an IP network, and the signaling server is a session initiation protocol SIP signaling server. Of course, the signaling server may also be a signaling server of the H.323 protocol, and is not limited herein.

Of course, the function of signaling interaction by the signaling server can also be integrated on the recording server, so that the voice switching device directly performs signaling interaction with the recording server.

Step S104: The voice switching device sends a start recording request to the recording server and receives the recording. The start recording response sent by the tone server.

Specifically, after the recording server joins the conference site, the voice switching device sends a recording request to the recording server to prepare the recording server for recording (for example, reserve recording resources for the recording). For example, the specific implementation manner of step S104 is: the voice switching device sends a SIP INFO message to the signaling server, and notifies the recording server to start recording, and the signaling server sends a message to the recording server to start recording to the recording server, and the signaling server receives the recording. After the server initiates the recording response, the recording response of the recording server is sent to the voice switching device through the 200 OK message, and after receiving the recording response of the recording server, the voice switching device can determine that the recording server is ready for the recording.

Specifically, in the process of joining the recording server to the conference site, the recording request is initiated and the recording response is started by signaling the interaction when the recording server is added to the conference site. For example, the specific implementation manner of the step S104 is: the voice switching device carries the start recording instruction by inviting the recording server to join the SIP INVITE message of the conference site, and the signaling server sends a message to the recording server to start the recording to the recording server, and receives the recording server. After the start of the recording response, the recording response of the recording server is sent to the voice switching device through the 200 OK message of the SIP INVITE, and the voice switching device completes the process of joining the recording server to the site after receiving the 200 OK message, and determines the recording. The server is ready for recording.

Step S105: After receiving the startup recording response of the recording server, the voice switching device mixes the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and uses the media stream generated by the mixing as the media stream. The second media stream is sent to the recording server.

The second media stream is a media stream generated after the voice switching device receives the recording response of the recording server and mixes the media stream from the first communication terminal with the media stream from the other communication terminal.

After the voice switching device receives the recording response from the recording server, the recording server has prepared the recording for the recording. The voice switching device mixes the currently received media stream from the first communication terminal with the media stream from the other communication terminal. At this time, the media stream generated by the mixing is sent to the recording server as the second media stream.

Step S106: The voice switching device sends the first media stream saved in the cache before receiving the recording response to the recording server, so that the recording server performs combined recording on the first media stream and the second media stream.

During the time when the recording server is not ready for recording, the first media stream is saved in the cache. In order to facilitate the recording server to obtain a complete media stream, the voice switching device sends the first media stream stored in the buffer to the recording server, so that The first media stream and the second media stream are combined and recorded by the recording server. Step S106 may specifically be implemented in multiple manners. For example, the voice switching device simultaneously sends the first media stream and the second media stream, and the recording server combines the first media stream and the second media stream and records the same as a recording file; The voice switching device first sends the second media stream and then sends the first media stream, and the recording server records the first media stream and the second media stream as one recording file respectively, and combines the two recording files into one recording file.

Referring to FIG. 4, the recording method of the present invention further includes:

Step S107: The voice switching device creates a cache when receiving the recording request.

Cache refers to the temporary file swap area, which has an extremely fast access rate, which is a buffer between the internal storage and the external interface.

Among them, the cache is a first-in, first-out FIFO buffer. The FIFO buffer means that when a read operation is performed on the cache, the data first written into the buffer is first read. In this way, the cache can be managed automatically.

The voice switching device creates a cache when it receives a recording request. For example, it may be created after receiving the recording request of the first communication terminal. Of course, it may be created before receiving the recording request of the first communication terminal, and no limitation is imposed here.

Step S108: The voice switching device stops storing the media stream generated by the mixing in the cache when receiving the startup recording response.

After receiving the recording response of the recording server, the recording server is ready for recording. Therefore, the voice switching device can stop storing the media stream generated after the mixing processing in the cache, and directly generate the media after the mixing processing. The stream is sent as a second media stream to the recording server. This way, you can avoid wasting the cached storage space.

The media packet in the first media stream carries a serial number identifier to facilitate the recording server root. The media packets in the first media stream are recorded in chronological order according to the serial number identifier.

The media stream is a real-time transport protocol RTP media stream.

Before receiving the recording response of the recording server, the voice exchange device saves the first media stream generated after the mixing process in the cache; after receiving the recording response of the recording server, the voice switching device will perform the mixing processing. The generated second media stream is sent to the recording server; the voice switching device sends the first media stream saved in the cache to the recording server, so that the recording server performs combined recording on the first media stream and the second media stream. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.

Referring to FIG. 5, FIG. 5 is a flowchart of still another embodiment of the recording method of the present invention. The embodiment is a flowchart of the recording server, and includes:

Step S301: The recording server joins the conference site including the first communication terminal and other communication terminals created by the voice switching device, and the first communication terminal and other communication terminals communicate through the voice switching device.

The recording server is a device that acquires a media stream and implements user recording. A voice switching device is a network device used for voice electrical signal forwarding. The first communication terminal is a communication terminal that actively initiates a recording request, and the other communication terminals are communication terminals that participate in a recording process in addition to the first communication terminal. When communicating via a conference call created on a voice switching device, there are typically at least two other communication terminals; when peer-to-peer communication is through a voice switching device, the other communication terminals are one. The first communication terminal and the other communication terminals communicate through the voice switching device.

Step S302: The recording server receives the startup recording request sent by the voice switching device and sends a startup recording response to the voice switching device.

After the recording server joins the conference site, it receives a startup recording request sent by the voice switching device. At this time, the recording server receives the startup recording request sent by the voice switching device, and sends a startup recording response to the voice switching device.

Step S303: After transmitting the startup recording response to the voice switching device, the recording server receives the first media stream and the second media stream from the voice switching device.

The second media stream is a voice switching device that receives the recording response of the recording server. Thereafter, the media stream generated from the first communication terminal and the media stream from the other communication terminal are subjected to a mixing process to generate a media stream. After transmitting the initiate recording response to the voice switching device, the recording server is ready for recording, at which point the recording server receives the second media stream from the voice switching device.

The first media stream is a media stream generated by the voice switching device before the voice switching device receives the recording response of the recording server, and the voice switching device mixes the media stream from the first communication terminal with the media stream from the other communication terminal. The first media stream is pre-stored in the cache, and after the recording server is ready for the recording, the first media stream saved in the cache sent by the voice switching device can be received.

Step S304: The recording server performs combined recording on the first media stream and the second media stream.

The recording server performs combined recording on the first media stream and the second media stream. For example, the voice switching device simultaneously sends the first media stream and the second media stream, and the recording server combines the first media stream and the second media stream into one recording file; for example, the voice switching device sends the second media stream first. The first media stream is sent again, and the recording server records the first media stream and the second media stream as one recording file respectively, and combines the two recording files into one recording file.

After receiving the first media stream and the second media stream, the recording server of the present invention performs combined recording on the first media stream and the second media stream. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.

The recording method of the present invention is described below by taking a point-to-point application scenario and a conference application scenario as an example.

Referring to FIG. 6, FIG. 6 is a flowchart of interaction between network elements in a networking in a scenario where the recording method of the present invention is peer-to-peer. Take IP-PBX, FIFO buffer, SIP signaling server as an example.

(1) The user of the first communication terminal uses the first communication terminal to call the user of the other communication terminal through the IP-PBX, and performs peer-to-peer communication with the user of the other communication terminal, and the user of the first communication terminal presses on the first communication terminal. The recording button starts recording and sends a recording request to the IP-PBX.

The first communication terminal and the other communication terminals may each be an IP phone, and the users of the first communication terminal and other communication terminals may be internal users.

If it is a conference call scenario, this step should be: the user of the first communication terminal initiates recording by pressing the record button on the first communication terminal during the conference, and sends a recording request to the IP-PBX.

(2) After receiving the recording request, the IP-PBX creates a conference site, adds the first communication terminal and other communication terminals to the conference site, and mixes the media streams from the first communication terminal and other communication terminals, after the mixing process. The generated media stream acts as the first media stream.

If it is a conference call scenario, this step should be: After the IP-PBX receives the recording request, since the site has been created before the conference call begins, there is no need to create a site at this time, and the first communication terminal and other communication terminals will be used. The media stream is subjected to mixing processing, and the media stream generated after the mixing processing is used as the first media stream.

(2) IP-PBX creates a FIFO buffer for this recording. Before the IP-PBX receives the recording response of the recording server, the IP-PBX sends the first media stream to the FIFO buffer for storage.

(3) The IP-PBX sends an Invite message to the SIP server, and invites the recording server to join the site.

(4) The SIP Server selects a suitable recording server in the recording server cluster, and replies the IP address of the recording server to the IP-PBX in the 200 OK message.

(5) IP-PBX joins the recording server to the conference site. Then, the IP-PBX sends a SIP INFO message to the SIP server to notify the recording server to start recording through the SIP server. The SIPServer notifies the recording server to start recording, and the SIP server receives the recording server. After the recording response, the recording response of the recording server is sent to the voice switching device in the 200 OK message.

(6) After the IP-PBX receives the 200 OK response from the recording server, the IP-PBX mixes the media streams from the first communication terminal and other communication terminals, and the media stream generated after the mixing process is used as the second media stream. Directly sent to the recording server, no longer sent to the FIFO buffer.

(7) The first media stream in the FIFO buffer is sent to the recording server. The first media stream has a serial number identifier, and the recording server receives the out-of-order first media stream, and can record the media packets in the first media stream in time sequence according to the serial number identifier, so that the lost recording server is ready for recording. Prepare the previous recording.

(8) The recording server performs combined recording on the first media stream and the second media stream.

Referring to FIG. 7, FIG. 7 is a schematic structural diagram of an embodiment of a voice switching device according to the present invention. The voice switching device includes: a receiving module 101, a mixing module 102, a saving module 103, a joining module 104, a first sending module 105, and a second The transmitting module 106 and the third transmitting module 107.

The receiving module 101 is configured to receive a recording request from the first communication terminal and a startup recording response from the recording server, and the first communication terminal and the other communication terminal communicate through the voice switching device.

In addition, after the language exchange device sends a recording request to the recording server, it can receive a startup recording response from the recording server. At this time, the recording server is ready for recording.

The mixing module 102 is configured to mix the currently received media stream from the first communication terminal and the media stream from other communication terminals upon receiving the recording request from the first communication terminal.

The saving module 103 is configured to save the media stream generated by the mixing of the mixing module 102 as a first media stream in the cache after the receiving module 101 receives the recording request and before receiving the starting recording response.

The first media stream is a voice switching device that receives a recording response from the recording server. Before, the media stream generated from the first communication terminal and the media stream from the other communication terminal are mixed and processed.

The adding module 104 is configured to join the recording server into the conference site of the first communication terminal and other communication terminals created by the voice switching device.

The first sending module 105 is configured to send a start recording request to the recording server.

After the recording server joins the conference site, the voice switching device sends a recording request to the recording server to prepare the recording server for the recording.

The second sending module 106 is configured to send the media stream generated by the current mixing of the mixing module 102 to the recording server as the second media stream after the receiving module 101 receives the startup recording response.

The third sending module 107 is configured to send the first media stream saved in the cache to the recording server after the receiving module 101 receives the startup recording response, so that the recording server performs combined recording on the first media stream and the second media stream.

During the time when the recording server is not ready for recording, the first media stream is saved in the cache. In order to facilitate the recording server to obtain a complete media stream, the voice switching device sends the first media stream stored in the buffer to the recording server, so that The first media stream and the second media stream are combined and recorded by the recording server.

It should be noted that, in practical applications, the modules or units of the present embodiment may be added, subtracted, or combined, and will not be further described herein.

Referring to Figure 8, the voice switching device also includes a creation module 108 for creating a cache upon receipt of a recording request.

The saving module 103 is configured to stop saving the media stream generated by the mixing in the cache when receiving the startup recording response of the recording server.

The media packet in the first media stream carries a serial number identifier, so that the recording server records the media packets in the first media stream in time sequence according to the serial number identifier.

It should be noted that the voice switching device of this embodiment may perform the steps in FIG. 3 and FIG. 4.

Before receiving the recording response of the recording server, the voice exchange device saves the first media stream generated after the mixing process in the cache; after receiving the recording response of the recording server, the voice switching device will perform the mixing processing. After the generated second media stream recording service Transmitting; the voice switching device sends the first media stream saved in the cache to the recording server, so that the recording server performs combined recording on the first media stream and the second media stream. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.

Referring to FIG. 9, FIG. 9 is a schematic structural diagram of an embodiment of a recording server according to the present invention. The recording server includes: a joining module 201, a first receiving module 202, a sending module 203, a second receiving module 204, a third receiving module 205, and a merge. Recording module 206.

It should be noted that the recording server of the present embodiment can perform the steps in FIG. 5.

The joining module 201 is configured to join a conference site that includes a first communication terminal and other communication terminals created by the voice switching device, and the first communication terminal and other communication terminals communicate through the voice switching device.

The recording server can be prepared for recording by joining the conference site including the first communication terminal and other communication terminals created by the voice switching device.

The first receiving module 202 is configured to receive a startup recording request sent by the voice switching device.

The sending module 203 is configured to send a start recording response to the voice switching device after the first receiving module 202 receives the startup recording request sent by the voice switching device.

The second receiving module 204 is configured to receive the second media stream from the voice switching device after the sending module 203 sends the start recording response to the voice switching device.

The third receiving module 205 is configured to receive the buffered first media stream from the voice switching device after the sending module 203 sends the start recording response to the voice switching device.

The merge recording module 206 is configured to perform combined recording on the second media stream received by the second receiving module 204 and the first media stream received by the third receiving module 205.

Referring to FIG. 10, FIG. 10 is a schematic structural diagram of an embodiment of a recording system according to the present invention. The system includes a voice switching device 31 and a recording server 32.

The voice switching device is configured to receive a recording request from the first communication terminal, and the first communication terminal and the other communication terminal communicate through the voice switching device; when receiving the recording request, the currently received media stream from the first communication terminal and The media streams from other communication terminals are mixed, and the media stream generated by the mixing is saved in the cache as the first media stream; the first communication terminal and other communication terminals are created by adding the recording server to the voice switching device. In the conference site; sending a start recording request to the recording server; after receiving the recording response of the recording server, mixing the currently received media stream from the first communication terminal and the media stream from the other communication terminal, mixing the sound The generated media stream is sent to the recording server as a second media stream, and the first media stream saved in the cache before receiving the start recording response is sent to the recording server, so that the recording server can access the first media stream and the second media stream. Make a combined recording.

The recording server is used to join the conference site including the first communication terminal and the other communication terminal created by the voice switching device, and the first communication terminal and the other communication terminal communicate through the voice switching device; receive the startup recording request sent by the voice switching device and send the voice to the voice The switching device sends a startup recording response; after transmitting the startup recording response to the voice switching device, receiving the first media stream and the second media stream from the voice switching device, the first media stream is the voice switching device before receiving the start recording response a media stream generated by the media stream from the first communication terminal and other communication terminals after the mixing process, and the second media stream is a media stream from the first communication terminal and other communication terminals after the voice switching device receives the start recording response a media stream generated after the mixing process is performed; the first media stream and the second media stream are combined and recorded.

Before receiving the recording response of the recording server, the voice exchange device saves the first media stream after the mixing process in the cache; after receiving the recording response of the recording server, the voice switching device will process the mixing The second media stream is sent to the recording server; the voice switching device sends the first media stream saved in the cache to the recording server, and after receiving the first media stream and the second media stream, the recording server processes the first media stream and the second media. The stream is combined for recording. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.

Referring to FIG. 11, FIG. 11 is a schematic diagram showing the physical structure of still another embodiment of the voice switching device of the present invention. The voice switching device 40 includes a processor 41, a memory 42 coupled to the processor 41, a receiver 43, and a transmitter 44.

The receiver 43 is for receiving a recording request from the first communication terminal, and the first communication terminal and the other communication terminal communicate via the receiver 43 and the transmitter 44 of the voice switching device 40.

The processor 41 connects to the current receiver 43 when the receiver 43 receives the recording request. The received media stream from the first communication terminal and the media stream from the other communication terminal are mixed, and the transmitter 44 is controlled to save the media stream generated by the mixing as a first media stream in the buffer of the memory 42. .

The processor 41 adds the recording server to the conference site that is created by the voice switching device and includes the first communication terminal and the other communication terminal.

The transmitter 44 sends a start recording request to the recording server.

The processor 41 mixes the currently received media stream from the first communication terminal and the media stream from the other communication terminal after the receiver 43 receives the startup recording response of the recording server, and controls transmission. The processor 44 transmits the media stream generated by the mixing as the second media stream to the recording server.

The processor 41 acquires the first media stream stored in the buffer of the memory 42, and the control transmitter 44 sends the first media stream saved in the cache before the receiver 43 receives the start recording response to the recording server. In order to facilitate the combined recording of the first media stream and the second media stream by the recording server.

Referring to FIG. 12, FIG. 12 is a block diagram showing another embodiment of a recording server of the present invention. The recording server 50 includes a processor 51, a memory 52 coupled to the processor 51, a receiver 53, and a transmitter 54.

The processor 51 is configured to join a conference site that is created by the voice switching device, including the first communications terminal and the other communications terminal, where the first communications terminal and the other communications terminal communicate through the voice switching device;

The receiver 53 receives a startup recording request sent by the voice switching device, and the transmitter 54 sends a startup recording response to the voice switching device.

After transmitting a startup recording response to the voice switching device, the receiver 53 receives a first media stream and a second media stream from the voice switching device, the first media stream being the voice switching device receiving the location a media stream generated by performing a mixing process on a media stream from the first communication terminal and the other communication terminal before the recording response is started, the second media stream being the voice switching device receiving the startup a media stream generated after the sound recording process is performed on the media streams from the first communication terminal and the other communication terminals after the recording response;

The processor 51 performs combined recording on the first media stream and the second media stream.

The above is only the embodiment of the present invention, and is not intended to limit the scope of the invention, and the equivalent structure or equivalent process transformations made by the description of the invention and the drawings are directly or indirectly applied to other related technologies. The fields are all included in the scope of patent protection of the present invention.

Claims

A recording method, characterized in that the method comprises:

The voice switching device receives a recording request from the first communication terminal, and the first communication terminal and the other communication terminal communicate through the voice switching device;

The voice switching device, when receiving the recording request, mixes a currently received media stream from the first communication terminal and a media stream from the other communication terminal, and mixes the generated media stream Stored in the cache as the first media stream;

The voice switching device adds a recording server to a conference site that is created by the voice switching device, including the first communication terminal and the other communication terminal;

The voice switching device sends a start recording request to the recording server;

After receiving the startup recording response of the recording server, the voice switching device mixes the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and generates a mix The media stream is sent to the recording server as a second media stream, and the first media stream saved in the cache before receiving the start recording response is sent to the recording server, so that the recording server is The first media stream and the second media stream are combined and recorded.
The method of claim 1, further comprising the voice switching device creating the cache upon receiving the recording request.
The method according to claim 1 or 2, wherein the method further comprises: the voice switching device stops storing the media stream generated by the mixing in the cache upon receiving the start recording response.
The method according to any one of claims 1 to 3, wherein the media package in the first media stream has a serial number identifier, so that the recording server records the chronological order according to the serial number identifier. The media package in the first media stream.
A recording method, characterized in that the method comprises:

The recording server is added to the conference site including the first communication terminal and the other communication terminal created by the voice switching device, and the first communication terminal and the other communication terminal communicate through the voice switching device;

Receiving, by the recording server, a startup recording request sent by the voice switching device The voice switching device sends a start recording response;

After transmitting a startup recording response to the voice switching device, the recording server receives a first media stream and a second media stream from the voice switching device, the first media stream being the voice switching device receiving the a media stream generated by mixing a media stream from the first communication terminal and the other communication terminal before the recording response is started, the second media stream being the voice switching device receiving the startup recording a media stream generated after the mixing of the media streams from the first communication terminal and the other communication terminals after the response;

The recording server performs combined recording on the first media stream and the second media stream.
A voice switching device, comprising: a receiving module, a mixing module, a saving module, a joining module, a first sending module, a second sending module, and a third sending module;

The receiving module is configured to receive a recording request from the first communication terminal and a startup recording response from the recording server, where the first communication terminal and the other communication terminal communicate through the voice switching device;

The mixing module is configured to mix a currently received media stream from the first communication terminal and a media stream from the other communication terminal when receiving the recording request;

The saving module is configured to save the media stream generated by the mixing of the mixing module as a first media stream in a cache after the receiving module receives the recording request, and before receiving the starting recording response;

The joining module is configured to join a recording server to a conference site that is created by the voice switching device, including the first communications terminal and the other communications terminal;

The first sending module is configured to send a start recording request to the recording server;

The second sending module is configured to send the media stream generated by the current mixing of the mixing module to the recording server as a second media stream after the receiving module receives the startup recording response;

The third sending module is configured to send the first media stream saved in the cache to the recording server after the receiving module receives the startup recording response, so as to facilitate The recording server performs combined recording on the first media stream and the second media stream.
The voice switching device according to claim 6, wherein the voice switching device further comprises a creating module, the creating module configured to create the cache when the recording request is received.
The voice switching device according to claim 6 or 7, wherein the saving module is further configured to stop saving the media stream generated by the mixing in the cache when receiving the startup recording response.
The voice switching device according to any one of claims 6 to 8, wherein the media packet in the first media stream has a serial number identifier, so that the recording server records the time sequence according to the serial number identifier. a media package in the first media stream.
A recording server, comprising: a joining module, a first receiving module, a sending module, a second receiving module, a third receiving module, and a combined recording module;

The joining module is configured to join a conference site that is created by the voice switching device, including the first communications terminal and the other communications terminal, where the first communications terminal and the other communications terminal communicate through the voice switching device;

The first receiving module is configured to receive a startup recording request sent by the voice switching device;

The sending module is configured to send a start recording response to the voice switching device after the first receiving module receives the startup recording request sent by the voice switching device;

The second receiving module is configured to receive a second media stream from the voice switching device after the sending module sends a start recording response to the voice switching device, where the second media stream is the voice switching device a media stream generated by performing a mixing process on a media stream from the first communication terminal and the other communication terminal after receiving the start recording response;

The third receiving module is configured to receive, after the sending module sends a startup recording response to the voice switching device, a first media stream that is buffered from the voice switching device, where the first media stream is the voice The switching device mixes the media streams from the first communication terminal and the other communication terminals before receiving the start recording response Generated media stream;

The merge recording module is configured to perform combined recording on the second media stream received by the second receiving module and the first media stream received by the third receiving module.
A recording system, characterized in that the system comprises: a voice switching device and a recording server;

The voice switching device is configured to receive a recording request from a first communication terminal, where the first communication terminal and other communication terminals communicate through the voice switching device; when the recording request is received, the current received Mixing the media stream of the first communication terminal with the media stream from the other communication terminal, and storing the media stream generated by the mixing as a first media stream in a cache; adding the recording server to the voice switching device Created in the conference site including the first communication terminal and the other communication terminal; sending a startup recording request to the recording server; after receiving the recording response of the recording server, the currently received from the first a media stream of a communication terminal and a media stream from the other communication terminal are mixed, and the media stream generated by the mixing is sent to the recording server as a second media stream, and is saved before receiving the start recording response Transmitting, in the cache, the first media stream to the recording server, so that the recording server is to the first medium Stream and the second stream merge recording media;

The recording server is configured to join a conference site that includes a first communication terminal and other communication terminals created by the voice switching device, where the first communication terminal and the other communication terminal communicate through the voice switching device; receive the voice Receiving a recording request sent by the switching device and sending a startup recording response to the voice switching device; after transmitting the startup recording response to the voice switching device, receiving the first media stream and the second media stream from the voice switching device, The first media stream is a media stream generated by the voice switching device after performing a mixing process on a media stream from the first communication terminal and the other communication terminal before receiving the start recording response, The second media stream is a media stream generated by the voice switching device after performing the mixing process on the media streams from the first communication terminal and the other communication terminal after receiving the start recording response; A media stream and the second media stream are combined for recording.