WO2015127793A1 - Recording method, voice exchanging device, recording server, and recording system - Google Patents

Recording method, voice exchanging device, recording server, and recording system Download PDF

Info

Publication number
WO2015127793A1
WO2015127793A1 PCT/CN2014/089748 CN2014089748W WO2015127793A1 WO 2015127793 A1 WO2015127793 A1 WO 2015127793A1 CN 2014089748 W CN2014089748 W CN 2014089748W WO 2015127793 A1 WO2015127793 A1 WO 2015127793A1
Authority
WO
WIPO (PCT)
Prior art keywords
recording
media stream
switching device
communication terminal
voice switching
Prior art date
Application number
PCT/CN2014/089748
Other languages
French (fr)
Chinese (zh)
Inventor
诸宏亮
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Publication of WO2015127793A1 publication Critical patent/WO2015127793A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/42221Conversation recording systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2203/00Aspects of automatic or semi-automatic exchanges
    • H04M2203/30Aspects of automatic or semi-automatic exchanges related to audio recordings in general
    • H04M2203/301Management of recordings
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/42314Systems providing special services or facilities to subscribers in private branch exchanges
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/50Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers ; Centralised arrangements for recording messages
    • H04M3/51Centralised call answering arrangements requiring operator intervention, e.g. call or contact centers for telemarketing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M7/00Arrangements for interconnection between switching centres
    • H04M7/006Networks other than PSTN/ISDN providing telephone service, e.g. Voice over Internet Protocol (VoIP), including next generation networks with a packet-switched transport layer

Definitions

  • the present invention relates to the field of communication technologies, and in particular, to a recording method, a voice switching device, a recording server, and a recording system.
  • IP-PBX IP Private Branch eXchange
  • IP-PBX IP Private Branch eXchange
  • the IP-PBX then joins the recording server to the conference site in a "speak only" manner, and sends a request to initiate recording to the recording server, and after receiving the startup recording response returned by the recording server, the communication terminal from the call is received.
  • the media stream is mixed, and the media stream generated after the mixing is sent to the recording server, and the recording server acquires the media stream and performs recording, thereby realizing recording.
  • the IP-PBX receives the recording request and receives the start recording response sent by the recording server
  • a number of signaling interactions are involved, during which the recording server cannot acquire the media stream generated by the communication terminal after mixing. The media stream, so recording is not possible.
  • the technical problem to be solved by the present invention is to provide a recording method, a voice exchange device, a recording server and a recording system, which can prevent the recording of the recording server before the recording is prepared, so that a complete recording can be obtained.
  • the present invention provides a recording method, the method comprising: a voice switching device receiving a recording request from a first communication terminal, wherein the first communication terminal and other communication terminals communicate via the voice switching device;
  • the voice exchange device receives the recording At the request, mixing the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and storing the media stream generated by the mixing as a first media stream in the cache;
  • the voice switching device adds a recording server to a site created by the voice switching device, including the first communication terminal and the other communication terminal; the voice switching device sends a start recording request to the recording server; After receiving the startup recording response of the recording server, the voice switching device mixes the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and mixes the generated media.
  • the first media stream and the second media stream are combined
  • the method further includes: the voice switching device creates the cache when receiving the recording request.
  • the method further includes: the voice switching device receiving the start recording response Stops saving the media stream generated by the mix in the cache.
  • the media packet in the first media stream carries a serial number identifier, So that the recording server records the media packets in the first media stream in chronological order according to the serial number identifier.
  • the present invention provides a recording method, the method comprising: a recording server joining a conference site including a first communication terminal and other communication terminals created by a voice switching device, the first communication terminal and the other communication terminal Communicating by the voice switching device; the recording server receives a startup recording request sent by the voice switching device and sends a startup recording response to the voice switching device; after transmitting the startup recording response to the voice switching device, the recording server receives a first media stream and a second media stream from the voice switching device, the first media stream being from the first communication terminal and the other before the voice switching device receives the start recording response a media stream generated by the media stream of the communication terminal after the mixing process, where the second media stream is received by the voice switching device a media stream generated by mixing a media stream from the first communication terminal and the other communication terminal after the motion recording response; the recording server performing the first media stream and the second media stream Combined recording.
  • the present invention provides a voice switching device, where the voice switching device includes: a receiving module, a mixing module, a saving module, a joining module, a first sending module, a second sending module, and a third sending module;
  • the receiving module is configured to receive a recording request from the first communication terminal and a startup recording response from the recording server, where the first communication terminal and the other communication terminal communicate through the voice switching device;
  • the mixing module is configured to receive At the time of the recording request, the currently received media stream from the first communication terminal and the media stream from the other communication terminal are mixed;
  • the saving module is configured to receive the After the recording request, before receiving the start recording response, the media stream generated by the mixing of the mixing module is saved in the cache as a first media stream;
  • the joining module is configured to join the recording server to the voice switching device.
  • the first sending module is configured to The sound server sends a start recording request;
  • the second sending module is configured to send, after the receiving module receives the start recording response, the media stream generated by the current mixing of the mixing module as the second media stream to the a third sending module, configured to send the first media stream saved in the cache to the recording server after the receiving module receives the startup recording response, so that the recording server is The first media stream and the second media stream are combined and recorded.
  • the voice switching device further includes a creating module, where the creating module is configured to create the cache when the recording request is received.
  • the saving module is further configured to stop mixing when receiving the startup recording response
  • the media stream generated by the tone is saved in the cache.
  • the media packet in the first media stream is identified by a serial number to facilitate the
  • the recording server records the media packets in the first media stream in chronological order according to the serial number identifier.
  • the present invention provides a recording server, where the recording server includes: a joining module, a first receiving module, a sending module, a second receiving module, a third receiving module, and a combined recording module;
  • the first communication terminal and the other communication terminal communicate through the voice switching device;
  • the first receiving module receives the voice exchange Sending a recording request sent by the device;
  • the sending module is configured to send a startup recording response to the voice switching device after the first receiving module receives the startup recording request sent by the voice switching device;
  • the second receiving module Receiving, after the sending module sends a startup recording response to the voice switching device, receiving a second media stream from the voice switching device, where the second media stream is the voice switching device receiving the startup Performing media streams from the first communication terminal and the other communication terminals after the recording response a media stream generated after the tone processing;
  • the third receiving module is configured to receive, after the sending module sends a start recording response to the voice switching device, a first
  • the present invention provides a recording system, the system comprising: a voice switching device and a recording server; the voice switching device is configured to receive a recording request from the first communication terminal, the first communication terminal and other communication The terminal performs communication through the voice switching device; when receiving the recording request, mixes the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and mixes The sound generated media stream is saved in the cache as the first media stream; the recording server is added to the conference site including the first communication terminal and the other communication terminal created by the voice switching device; and the recording server is sent to the recording server.
  • the recording server is used to join a conference site including a first communication terminal and other communication terminals created by the voice switching device, the first communication terminal and the other communication terminal Communicating by the voice switching device; receiving a startup recording request sent by the voice switching device and transmitting a startup recording response to the voice switching device; receiving the initiated recording response from the voice switching device, receiving the voice from the voice a first media stream and a second media stream of the switching device, the first media stream being a medium from the first communication terminal and the other communication terminal before the voice switching device receives the start recording response Flowing a media stream generated after the mixing process, wherein the second media stream is that
  • the invention has the beneficial effects that, prior to the prior art, the voice exchange device saves the first media stream after the mixing process in the buffer before receiving the recording response of the recording server; After the server initiates the recording response, the voice switching device sends the second media stream after the mixing process to the recording server; the voice switching device sends the first media stream stored in the buffer to the recording server, and the recording server receives the first media stream and After the second media stream, the first media stream and the second media stream are combined and recorded. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.
  • FIG. 1 is a schematic structural diagram of a networking of a scene for recording a point-to-point call in the prior art
  • FIG. 2 is a schematic structural diagram of a networking of a scene in which a recording method of the present invention records a point-to-point call;
  • FIG. 3 is a flow chart of an embodiment of a recording method of the present invention.
  • FIG. 5 is a flow chart of still another embodiment of the recording method of the present invention.
  • 6 is a flow of interaction between network elements in a network in the application scenario of the recording method of the present invention in a peer-to-peer manner Cheng Tu
  • FIG. 7 is a schematic structural diagram of an embodiment of a voice switching device according to the present invention.
  • FIG. 8 is a schematic structural diagram of another embodiment of a voice switching device according to the present invention.
  • FIG. 9 is a schematic structural diagram of an embodiment of a recording server of the present invention.
  • FIG. 10 is a schematic structural view of an embodiment of a recording system of the present invention.
  • FIG. 11 is a schematic diagram of a physical structure of still another embodiment of a voice switching device according to the present invention.
  • FIG. 12 is a schematic diagram showing the physical structure of another embodiment of the recording server of the present invention.
  • FIG. 1 is a schematic diagram of a typical network structure for recording a point-to-point call in a prior art.
  • a user presses a button on the IP phone 11 to start recording, and the IP phone 11 sends a recording request to the IP- PBX 12, IP-PBX 12 creates a site to add the IP phone of the user and the IP phone of another user to the site.
  • the IP-PBX 12 sends an Invite message to the SIP signaling server 13, inviting the recording server 14 to join the site, SIP signaling.
  • the server 13 selects an appropriate recording server 14 in the recording server cluster, and returns the IP address of the recording server 14 to the IP-PBX 12 in the 200 OK message, the IP-PBX 12 adds the recording server 14 to the conference site, and the IP-PBX 12 sends the SIP.
  • the INFO message is sent to the SIP signaling server 13, and the recording server 14 is notified to start the recording.
  • the IP-PBX 12 receives the 200 OK response from the recording server 14 by the SIP signaling server 13, the IP-PBX 12 sends the media stream from both parties in the conference.
  • the mixing process is performed, and the media stream generated after the mixing is sent to the recording server 14, and the recording server 14 acquires the media stream and records the media stream to realize recording.
  • the IP-PBX 12 receives the recording request until the IP-PBX receives the start recording response sent by the recording server 14, a number of signaling interactions are involved, during which the recording server 14 cannot obtain the media stream of both parties of the call. The media stream in this process will be lost, making the recording incomplete.
  • FIG. 2 is a structural diagram of a networking of a scene in which a recording method of the present invention records a point-to-point call.
  • the IP-PBX 21 creates a site, and adds an IP phone that initiates the recording request and another IP phone to the site, and creates a cache for the recording. twenty two.
  • the IP-PBX 21 sends an Invite message to the SIP signaling server 23, invites the recording server 24 to join the conference site, and the SIP signaling server 23 selects an appropriate recording server 24 in the recording server cluster, and sets the IP address of the recording server 24 in the 200 OK message.
  • the IP-PBX 21 joins the recording server 24 to the conference site, and the IP-PBX 21 sends a SIP INFO message to the SIP signaling server 23, informing the recording server 24 to start recording.
  • the IP-PBX 21 mixes the media streams from both parties of the call, and sends the media stream generated after the mixing to the buffer 22; the recording server 24 After joining the conference site, the IP-PBX 21 directly transmits the media stream generated by the mixing process from both parties in the conference site to the recording server 24 through the IP address of the recording server 24, and is not sent to the buffer 22.
  • the IP-PBX 21 transmits the media stream held in the cache 22 to the recording server 24. Therefore, after receiving the media stream, the recording server can not complete the recording of the recording server before preparing for recording, thereby achieving complete recording.
  • FIG. 3 is a flowchart of an embodiment of a recording method of the present invention.
  • the embodiment is a flowchart of a voice switching device, and includes:
  • Step S101 The voice switching device receives the recording request from the first communication terminal, and the first communication terminal and the other communication terminal communicate through the voice switching device.
  • a voice switching device is a network device used for voice electrical signal forwarding. Its main functions are to process user registration, call, outgoing relay, create a conference site, and interact with commands of the recording server.
  • the first communication terminal is a communication terminal that actively initiates a recording request
  • the other communication terminals are communication terminals that participate in a recording process in addition to the first communication terminal.
  • the other communication terminals are typically at least two other communication terminals; when peer-to-peer communication is through a voice switching device, the other communication terminals are one.
  • the voice switching device When the first communication terminal sends a recording request to the voice switching device, the voice switching device receives a recording request from the first communication terminal, wherein the first communication terminal and the other communication terminal communicate through the voice switching device.
  • Step S102 When receiving the recording request, the voice switching device mixes the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and uses the media stream generated by the mixing as the first media stream. Saved in the cache.
  • the first media stream is a media stream generated after the voice switching device mixes the media stream from the first communication terminal and the media stream from the other communication terminal before receiving the recording response of the recording server.
  • the voice switching device is not ready for recording until it receives a recording response from the recording server.
  • the first communication terminal and the other communication terminal have started the session, and the voice switching device saves the first media stream in the cache, which can prevent the media stream before the recording server is prepared for recording, wherein the cache is already created in advance. .
  • Step S103 The voice switching device adds the recording server to the conference site that is created by the voice switching device, including the first communication terminal and other communication terminals.
  • the recording server is a device that acquires a media stream and implements user recording. After the voice switching device receives the recording request from the first communication terminal, the recording server needs to be added to the conference site to enable recording.
  • the venue is created by the voice switching device and includes the first communication terminal and other communication terminals. If it is a point-to-point session, after the voice switching device receives the recording request of the first communication terminal, the voice switching device creates a site, and joins the first communication terminal and other communication terminals to the conference site; if it is a conference call, the conference site is before the conference call starts.
  • the conference site that has been created, that is, the conference call does not need to create a conference site after the voice switching device receives the recording request of the first communication terminal.
  • the voice switching device sends an Invite message to the signaling server, invites the recording server to join the conference site, and the signaling server selects an appropriate recording server in the recording server cluster, and returns the IP address of the recording server to the voice exchange in the 200 OK message.
  • the device and the voice switching device join the recording server to the site.
  • the signaling server is used to process the signaling and recording instructions from the voice switching device and is responsible for interaction with the recording server.
  • the voice switching device is a voice switching device based on an IP network
  • the signaling server is a session initiation protocol SIP signaling server.
  • the signaling server may also be a signaling server of the H.323 protocol, and is not limited herein.
  • the function of signaling interaction by the signaling server can also be integrated on the recording server, so that the voice switching device directly performs signaling interaction with the recording server.
  • Step S104 The voice switching device sends a start recording request to the recording server and receives the recording.
  • the voice switching device sends a recording request to the recording server to prepare the recording server for recording (for example, reserve recording resources for the recording).
  • the specific implementation manner of step S104 is: the voice switching device sends a SIP INFO message to the signaling server, and notifies the recording server to start recording, and the signaling server sends a message to the recording server to start recording to the recording server, and the signaling server receives the recording.
  • the recording response of the recording server is sent to the voice switching device through the 200 OK message, and after receiving the recording response of the recording server, the voice switching device can determine that the recording server is ready for the recording.
  • the recording request is initiated and the recording response is started by signaling the interaction when the recording server is added to the conference site.
  • the specific implementation manner of the step S104 is: the voice switching device carries the start recording instruction by inviting the recording server to join the SIP INVITE message of the conference site, and the signaling server sends a message to the recording server to start the recording to the recording server, and receives the recording server.
  • the recording response of the recording server is sent to the voice switching device through the 200 OK message of the SIP INVITE, and the voice switching device completes the process of joining the recording server to the site after receiving the 200 OK message, and determines the recording.
  • the server is ready for recording.
  • Step S105 After receiving the startup recording response of the recording server, the voice switching device mixes the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and uses the media stream generated by the mixing as the media stream. The second media stream is sent to the recording server.
  • the second media stream is a media stream generated after the voice switching device receives the recording response of the recording server and mixes the media stream from the first communication terminal with the media stream from the other communication terminal.
  • the recording server After the voice switching device receives the recording response from the recording server, the recording server has prepared the recording for the recording.
  • the voice switching device mixes the currently received media stream from the first communication terminal with the media stream from the other communication terminal. At this time, the media stream generated by the mixing is sent to the recording server as the second media stream.
  • Step S106 The voice switching device sends the first media stream saved in the cache before receiving the recording response to the recording server, so that the recording server performs combined recording on the first media stream and the second media stream.
  • Step S106 may specifically be implemented in multiple manners.
  • the voice switching device simultaneously sends the first media stream and the second media stream, and the recording server combines the first media stream and the second media stream and records the same as a recording file;
  • the voice switching device first sends the second media stream and then sends the first media stream, and the recording server records the first media stream and the second media stream as one recording file respectively, and combines the two recording files into one recording file.
  • the recording method of the present invention further includes:
  • Step S107 The voice switching device creates a cache when receiving the recording request.
  • Cache refers to the temporary file swap area, which has an extremely fast access rate, which is a buffer between the internal storage and the external interface.
  • the cache is a first-in, first-out FIFO buffer.
  • the FIFO buffer means that when a read operation is performed on the cache, the data first written into the buffer is first read. In this way, the cache can be managed automatically.
  • the voice switching device creates a cache when it receives a recording request. For example, it may be created after receiving the recording request of the first communication terminal. Of course, it may be created before receiving the recording request of the first communication terminal, and no limitation is imposed here.
  • Step S108 The voice switching device stops storing the media stream generated by the mixing in the cache when receiving the startup recording response.
  • the recording server After receiving the recording response of the recording server, the recording server is ready for recording. Therefore, the voice switching device can stop storing the media stream generated after the mixing processing in the cache, and directly generate the media after the mixing processing. The stream is sent as a second media stream to the recording server. This way, you can avoid wasting the cached storage space.
  • the media packet in the first media stream carries a serial number identifier to facilitate the recording server root.
  • the media packets in the first media stream are recorded in chronological order according to the serial number identifier.
  • the media stream is a real-time transport protocol RTP media stream.
  • the voice exchange device Before receiving the recording response of the recording server, the voice exchange device saves the first media stream generated after the mixing process in the cache; after receiving the recording response of the recording server, the voice switching device will perform the mixing processing.
  • the generated second media stream is sent to the recording server; the voice switching device sends the first media stream saved in the cache to the recording server, so that the recording server performs combined recording on the first media stream and the second media stream. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.
  • FIG. 5 is a flowchart of still another embodiment of the recording method of the present invention.
  • the embodiment is a flowchart of the recording server, and includes:
  • Step S301 The recording server joins the conference site including the first communication terminal and other communication terminals created by the voice switching device, and the first communication terminal and other communication terminals communicate through the voice switching device.
  • the recording server is a device that acquires a media stream and implements user recording.
  • a voice switching device is a network device used for voice electrical signal forwarding.
  • the first communication terminal is a communication terminal that actively initiates a recording request, and the other communication terminals are communication terminals that participate in a recording process in addition to the first communication terminal.
  • the first communication terminal and the other communication terminals communicate through the voice switching device.
  • Step S302 The recording server receives the startup recording request sent by the voice switching device and sends a startup recording response to the voice switching device.
  • the recording server After the recording server joins the conference site, it receives a startup recording request sent by the voice switching device. At this time, the recording server receives the startup recording request sent by the voice switching device, and sends a startup recording response to the voice switching device.
  • Step S303 After transmitting the startup recording response to the voice switching device, the recording server receives the first media stream and the second media stream from the voice switching device.
  • the second media stream is a voice switching device that receives the recording response of the recording server. Thereafter, the media stream generated from the first communication terminal and the media stream from the other communication terminal are subjected to a mixing process to generate a media stream. After transmitting the initiate recording response to the voice switching device, the recording server is ready for recording, at which point the recording server receives the second media stream from the voice switching device.
  • the first media stream is a media stream generated by the voice switching device before the voice switching device receives the recording response of the recording server, and the voice switching device mixes the media stream from the first communication terminal with the media stream from the other communication terminal.
  • the first media stream is pre-stored in the cache, and after the recording server is ready for the recording, the first media stream saved in the cache sent by the voice switching device can be received.
  • Step S304 The recording server performs combined recording on the first media stream and the second media stream.
  • the recording server performs combined recording on the first media stream and the second media stream.
  • the voice switching device simultaneously sends the first media stream and the second media stream, and the recording server combines the first media stream and the second media stream into one recording file; for example, the voice switching device sends the second media stream first.
  • the first media stream is sent again, and the recording server records the first media stream and the second media stream as one recording file respectively, and combines the two recording files into one recording file.
  • the recording server of the present invention After receiving the first media stream and the second media stream, the recording server of the present invention performs combined recording on the first media stream and the second media stream. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.
  • the recording method of the present invention is described below by taking a point-to-point application scenario and a conference application scenario as an example.
  • FIG. 6 is a flowchart of interaction between network elements in a networking in a scenario where the recording method of the present invention is peer-to-peer. Take IP-PBX, FIFO buffer, SIP signaling server as an example.
  • the user of the first communication terminal uses the first communication terminal to call the user of the other communication terminal through the IP-PBX, and performs peer-to-peer communication with the user of the other communication terminal, and the user of the first communication terminal presses on the first communication terminal.
  • the recording button starts recording and sends a recording request to the IP-PBX.
  • the first communication terminal and the other communication terminals may each be an IP phone, and the users of the first communication terminal and other communication terminals may be internal users.
  • this step should be: the user of the first communication terminal initiates recording by pressing the record button on the first communication terminal during the conference, and sends a recording request to the IP-PBX.
  • the IP-PBX After receiving the recording request, the IP-PBX creates a conference site, adds the first communication terminal and other communication terminals to the conference site, and mixes the media streams from the first communication terminal and other communication terminals, after the mixing process.
  • the generated media stream acts as the first media stream.
  • this step should be: After the IP-PBX receives the recording request, since the site has been created before the conference call begins, there is no need to create a site at this time, and the first communication terminal and other communication terminals will be used.
  • the media stream is subjected to mixing processing, and the media stream generated after the mixing processing is used as the first media stream.
  • IP-PBX creates a FIFO buffer for this recording. Before the IP-PBX receives the recording response of the recording server, the IP-PBX sends the first media stream to the FIFO buffer for storage.
  • the IP-PBX sends an Invite message to the SIP server, and invites the recording server to join the site.
  • the SIP Server selects a suitable recording server in the recording server cluster, and replies the IP address of the recording server to the IP-PBX in the 200 OK message.
  • IP-PBX joins the recording server to the conference site. Then, the IP-PBX sends a SIP INFO message to the SIP server to notify the recording server to start recording through the SIP server. The SIPServer notifies the recording server to start recording, and the SIP server receives the recording server. After the recording response, the recording response of the recording server is sent to the voice switching device in the 200 OK message.
  • the IP-PBX After the IP-PBX receives the 200 OK response from the recording server, the IP-PBX mixes the media streams from the first communication terminal and other communication terminals, and the media stream generated after the mixing process is used as the second media stream. Directly sent to the recording server, no longer sent to the FIFO buffer.
  • the first media stream in the FIFO buffer is sent to the recording server.
  • the first media stream has a serial number identifier, and the recording server receives the out-of-order first media stream, and can record the media packets in the first media stream in time sequence according to the serial number identifier, so that the lost recording server is ready for recording. Prepare the previous recording.
  • the recording server performs combined recording on the first media stream and the second media stream.
  • FIG. 7 is a schematic structural diagram of an embodiment of a voice switching device according to the present invention.
  • the voice switching device includes: a receiving module 101, a mixing module 102, a saving module 103, a joining module 104, a first sending module 105, and a second The transmitting module 106 and the third transmitting module 107.
  • the receiving module 101 is configured to receive a recording request from the first communication terminal and a startup recording response from the recording server, and the first communication terminal and the other communication terminal communicate through the voice switching device.
  • a voice switching device is a network device used for voice electrical signal forwarding. Its main functions are to process user registration, call, outgoing relay, create a conference site, and interact with commands of the recording server.
  • the first communication terminal is a communication terminal that actively initiates a recording request
  • the other communication terminals are communication terminals that participate in a recording process in addition to the first communication terminal.
  • the other communication terminals are typically at least two other communication terminals; when peer-to-peer communication is through a voice switching device, the other communication terminals are one.
  • the voice switching device When the first communication terminal sends a recording request to the voice switching device, the voice switching device receives a recording request from the first communication terminal, wherein the first communication terminal and the other communication terminal communicate through the voice switching device.
  • the language exchange device After the language exchange device sends a recording request to the recording server, it can receive a startup recording response from the recording server. At this time, the recording server is ready for recording.
  • the mixing module 102 is configured to mix the currently received media stream from the first communication terminal and the media stream from other communication terminals upon receiving the recording request from the first communication terminal.
  • the saving module 103 is configured to save the media stream generated by the mixing of the mixing module 102 as a first media stream in the cache after the receiving module 101 receives the recording request and before receiving the starting recording response.
  • the first media stream is a voice switching device that receives a recording response from the recording server. Before, the media stream generated from the first communication terminal and the media stream from the other communication terminal are mixed and processed.
  • the voice switching device is not ready for recording until it receives a recording response from the recording server.
  • the first communication terminal and the other communication terminal have started the session, and the voice switching device saves the first media stream in the cache, which can prevent the media stream before the recording server is prepared for recording, wherein the cache is already created in advance. .
  • the adding module 104 is configured to join the recording server into the conference site of the first communication terminal and other communication terminals created by the voice switching device.
  • the recording server is a device that acquires a media stream and implements user recording. After the voice switching device receives the recording request from the first communication terminal, the recording server needs to be added to the conference site to enable recording.
  • the venue is created by the voice switching device and includes the first communication terminal and other communication terminals. If it is a point-to-point session, after the voice switching device receives the recording request of the first communication terminal, the voice switching device creates a site, and joins the first communication terminal and other communication terminals to the conference site; if it is a conference call, the conference site is before the conference call starts.
  • the conference site that has been created, that is, the conference call does not need to create a conference site after the voice switching device receives the recording request of the first communication terminal.
  • the first sending module 105 is configured to send a start recording request to the recording server.
  • the voice switching device After the recording server joins the conference site, the voice switching device sends a recording request to the recording server to prepare the recording server for the recording.
  • the second sending module 106 is configured to send the media stream generated by the current mixing of the mixing module 102 to the recording server as the second media stream after the receiving module 101 receives the startup recording response.
  • the second media stream is a media stream generated after the voice switching device receives the recording response of the recording server and mixes the media stream from the first communication terminal with the media stream from the other communication terminal.
  • the recording server After the voice switching device receives the recording response from the recording server, the recording server has prepared the recording for the recording.
  • the voice switching device mixes the currently received media stream from the first communication terminal with the media stream from the other communication terminal. At this time, the media stream generated by the mixing is sent to the recording server as the second media stream.
  • the third sending module 107 is configured to send the first media stream saved in the cache to the recording server after the receiving module 101 receives the startup recording response, so that the recording server performs combined recording on the first media stream and the second media stream.
  • the voice switching device sends the first media stream stored in the buffer to the recording server, so that The first media stream and the second media stream are combined and recorded by the recording server.
  • the voice switching device also includes a creation module 108 for creating a cache upon receipt of a recording request.
  • Cache refers to the temporary file swap area, which has an extremely fast access rate, which is a buffer between the internal storage and the external interface.
  • the voice switching device creates a cache when it receives a recording request. For example, it may be created after receiving the recording request of the first communication terminal. Of course, it may be created before receiving the recording request of the first communication terminal, and no limitation is imposed here.
  • the saving module 103 is configured to stop saving the media stream generated by the mixing in the cache when receiving the startup recording response of the recording server.
  • the recording server After receiving the recording response of the recording server, the recording server is ready for recording. Therefore, the voice switching device can stop storing the media stream generated after the mixing processing in the cache, and directly generate the media after the mixing processing. The stream is sent as a second media stream to the recording server. This way, you can avoid wasting the cached storage space.
  • the media packet in the first media stream carries a serial number identifier, so that the recording server records the media packets in the first media stream in time sequence according to the serial number identifier.
  • the voice switching device of this embodiment may perform the steps in FIG. 3 and FIG. 4.
  • the voice exchange device Before receiving the recording response of the recording server, the voice exchange device saves the first media stream generated after the mixing process in the cache; after receiving the recording response of the recording server, the voice switching device will perform the mixing processing. After the generated second media stream recording service Transmitting; the voice switching device sends the first media stream saved in the cache to the recording server, so that the recording server performs combined recording on the first media stream and the second media stream. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.
  • FIG. 9 is a schematic structural diagram of an embodiment of a recording server according to the present invention.
  • the recording server includes: a joining module 201, a first receiving module 202, a sending module 203, a second receiving module 204, a third receiving module 205, and a merge.
  • Recording module 206 includes: a joining module 201, a first receiving module 202, a sending module 203, a second receiving module 204, a third receiving module 205, and a merge.
  • Recording module 206 includes: a joining module 201, a first receiving module 202, a sending module 203, a second receiving module 204, a third receiving module 205, and a merge.
  • the recording server of the present embodiment can perform the steps in FIG. 5.
  • the joining module 201 is configured to join a conference site that includes a first communication terminal and other communication terminals created by the voice switching device, and the first communication terminal and other communication terminals communicate through the voice switching device.
  • the recording server is a device that acquires a media stream and implements user recording.
  • a voice switching device is a network device used for voice electrical signal forwarding.
  • the first communication terminal is a communication terminal that actively initiates a recording request, and the other communication terminals are communication terminals that participate in a recording process in addition to the first communication terminal.
  • the first communication terminal and the other communication terminals communicate through the voice switching device.
  • the recording server can be prepared for recording by joining the conference site including the first communication terminal and other communication terminals created by the voice switching device.
  • the first receiving module 202 is configured to receive a startup recording request sent by the voice switching device.
  • the sending module 203 is configured to send a start recording response to the voice switching device after the first receiving module 202 receives the startup recording request sent by the voice switching device.
  • the recording server After the recording server joins the conference site, it receives a startup recording request sent by the voice switching device. At this time, the recording server receives the startup recording request sent by the voice switching device, and sends a startup recording response to the voice switching device.
  • the second receiving module 204 is configured to receive the second media stream from the voice switching device after the sending module 203 sends the start recording response to the voice switching device.
  • the second media stream is a voice switching device that receives the recording response of the recording server. Thereafter, the media stream generated from the first communication terminal and the media stream from the other communication terminal are subjected to a mixing process to generate a media stream. After transmitting the initiate recording response to the voice switching device, the recording server is ready for recording, at which point the recording server receives the second media stream from the voice switching device.
  • the third receiving module 205 is configured to receive the buffered first media stream from the voice switching device after the sending module 203 sends the start recording response to the voice switching device.
  • the first media stream is a media stream generated by the voice switching device before the voice switching device receives the recording response of the recording server, and the voice switching device mixes the media stream from the first communication terminal with the media stream from the other communication terminal.
  • the first media stream is pre-stored in the cache, and after the recording server is ready for the recording, the first media stream saved in the cache sent by the voice switching device can be received.
  • the merge recording module 206 is configured to perform combined recording on the second media stream received by the second receiving module 204 and the first media stream received by the third receiving module 205.
  • the recording server performs combined recording on the first media stream and the second media stream.
  • the voice switching device simultaneously sends the first media stream and the second media stream, and the recording server combines the first media stream and the second media stream into one recording file; for example, the voice switching device sends the second media stream first.
  • the first media stream is sent again, and the recording server records the first media stream and the second media stream as one recording file respectively, and combines the two recording files into one recording file.
  • the recording server of the present invention After receiving the first media stream and the second media stream, the recording server of the present invention performs combined recording on the first media stream and the second media stream. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.
  • FIG. 10 is a schematic structural diagram of an embodiment of a recording system according to the present invention.
  • the system includes a voice switching device 31 and a recording server 32.
  • the voice switching device is configured to receive a recording request from the first communication terminal, and the first communication terminal and the other communication terminal communicate through the voice switching device; when receiving the recording request, the currently received media stream from the first communication terminal and The media streams from other communication terminals are mixed, and the media stream generated by the mixing is saved in the cache as the first media stream; the first communication terminal and other communication terminals are created by adding the recording server to the voice switching device.
  • the recording server is used to join the conference site including the first communication terminal and the other communication terminal created by the voice switching device, and the first communication terminal and the other communication terminal communicate through the voice switching device; receive the startup recording request sent by the voice switching device and send the voice to the voice
  • the switching device sends a startup recording response; after transmitting the startup recording response to the voice switching device, receiving the first media stream and the second media stream from the voice switching device, the first media stream is the voice switching device before receiving the start recording response a media stream generated by the media stream from the first communication terminal and other communication terminals after the mixing process, and the second media stream is a media stream from the first communication terminal and other communication terminals after the voice switching device receives the start recording response a media stream generated after the mixing process is performed; the first media stream and the second media stream are combined and recorded.
  • the voice exchange device Before receiving the recording response of the recording server, the voice exchange device saves the first media stream after the mixing process in the cache; after receiving the recording response of the recording server, the voice switching device will process the mixing The second media stream is sent to the recording server; the voice switching device sends the first media stream saved in the cache to the recording server, and after receiving the first media stream and the second media stream, the recording server processes the first media stream and the second media.
  • the stream is combined for recording. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.
  • FIG. 11 is a schematic diagram showing the physical structure of still another embodiment of the voice switching device of the present invention.
  • the voice switching device 40 includes a processor 41, a memory 42 coupled to the processor 41, a receiver 43, and a transmitter 44.
  • the receiver 43 is for receiving a recording request from the first communication terminal, and the first communication terminal and the other communication terminal communicate via the receiver 43 and the transmitter 44 of the voice switching device 40.
  • the processor 41 connects to the current receiver 43 when the receiver 43 receives the recording request.
  • the received media stream from the first communication terminal and the media stream from the other communication terminal are mixed, and the transmitter 44 is controlled to save the media stream generated by the mixing as a first media stream in the buffer of the memory 42. .
  • the processor 41 adds the recording server to the conference site that is created by the voice switching device and includes the first communication terminal and the other communication terminal.
  • the transmitter 44 sends a start recording request to the recording server.
  • the processor 41 mixes the currently received media stream from the first communication terminal and the media stream from the other communication terminal after the receiver 43 receives the startup recording response of the recording server, and controls transmission.
  • the processor 44 transmits the media stream generated by the mixing as the second media stream to the recording server.
  • the processor 41 acquires the first media stream stored in the buffer of the memory 42, and the control transmitter 44 sends the first media stream saved in the cache before the receiver 43 receives the start recording response to the recording server. In order to facilitate the combined recording of the first media stream and the second media stream by the recording server.
  • the voice exchange device Before receiving the recording response of the recording server, the voice exchange device saves the first media stream generated after the mixing process in the cache; after receiving the recording response of the recording server, the voice switching device will perform the mixing processing.
  • the generated second media stream is sent to the recording server; the voice switching device sends the first media stream saved in the cache to the recording server, so that the recording server performs combined recording on the first media stream and the second media stream. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.
  • FIG. 12 is a block diagram showing another embodiment of a recording server of the present invention.
  • the recording server 50 includes a processor 51, a memory 52 coupled to the processor 51, a receiver 53, and a transmitter 54.
  • the processor 51 is configured to join a conference site that is created by the voice switching device, including the first communications terminal and the other communications terminal, where the first communications terminal and the other communications terminal communicate through the voice switching device;
  • the receiver 53 receives a startup recording request sent by the voice switching device, and the transmitter 54 sends a startup recording response to the voice switching device.
  • the receiver 53 After transmitting a startup recording response to the voice switching device, the receiver 53 receives a first media stream and a second media stream from the voice switching device, the first media stream being the voice switching device receiving the location a media stream generated by performing a mixing process on a media stream from the first communication terminal and the other communication terminal before the recording response is started, the second media stream being the voice switching device receiving the startup a media stream generated after the sound recording process is performed on the media streams from the first communication terminal and the other communication terminals after the recording response;
  • the processor 51 performs combined recording on the first media stream and the second media stream.
  • the recording server of the present invention After receiving the first media stream and the second media stream, the recording server of the present invention performs combined recording on the first media stream and the second media stream. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.

Abstract

Disclosed are a recording method, a voice exchanging device, a recording server, and a recording system. The method comprises: before a recording starting response of a recording server is received, a voice exchanging device storing a first media stream generated after voice mixing is performed into a buffer; after the recording starting response of the recording server is received, the voice exchanging device sending a second media stream generated after voice mixing is performed to the recording server; and the voice exchanging device sending the first media stream stored the buffer to the recording server, and after the recording server receives the first media stream and the second media stream, the recording server merging the first media stream and the second media stream and performing recording. By means of the foregoing method, the present invention can avoid losing a record before a recording server prepares for recording, and accordingly, a complete record can be obtained.

Description

录音方法、语音交换设备、录音服务器及录音系统Recording method, voice switching device, recording server and recording system 技术领域Technical field
本发明涉及通信技术领域,特别是涉及一种录音方法、语音交换设备、录音服务器及录音系统。The present invention relates to the field of communication technologies, and in particular, to a recording method, a voice switching device, a recording server, and a recording system.
背景技术Background technique
在统一通信和呼叫中心(UC&CC,Unified Communication&Call Center)应用场景,经常需要对用户的点对点通话、会议通话进行按需录音。用户在通话过程中在IP话机上按键启动录音,IP话机发送录音请求给基于IP网络的语音交换机(IP-PBX,IP Private Branch eXchange),IP-PBX,当通话的通信终端都在会场中后,IP-PBX再把录音服务器以“只听不说”的方式加入会场,并向该录音服务器发送启动录音请求,并在收到该录音服务器返回的启动录音响应后对来自通话的通信终端的媒体流进行混音,并把混音后生成的媒体流发给录音服务器,录音服务器获取媒体流并进行录制,从而实现录音。In the unified communication and call center (UC&CC, Unified Communication & Call Center) application scenario, it is often necessary to perform on-demand recording of the user's point-to-point and conference calls. The user initiates recording on the IP phone during the call, and the IP phone sends a recording request to the IP-based voice switch (IP-PBX, IP Private Branch eXchange), IP-PBX, when the communication terminal of the call is in the conference site. The IP-PBX then joins the recording server to the conference site in a "speak only" manner, and sends a request to initiate recording to the recording server, and after receiving the startup recording response returned by the recording server, the communication terminal from the call is received. The media stream is mixed, and the media stream generated after the mixing is sent to the recording server, and the recording server acquires the media stream and performs recording, thereby realizing recording.
但是,在IP-PBX接收到录音请求至收到录音服务器发送的启动录音响应的过程中,涉及诸多信令交互,这段时间内录音服务器无法获取对通信终端的媒体流进行混音后生成的媒体流,因此无法实现录音。However, during the process in which the IP-PBX receives the recording request and receives the start recording response sent by the recording server, a number of signaling interactions are involved, during which the recording server cannot acquire the media stream generated by the communication terminal after mixing. The media stream, so recording is not possible.
发明内容Summary of the invention
本发明主要解决的技术问题是提供一种录音方法、语音交换设备、录音服务器及录音系统,能够防止丢失录音服务器为录音做好准备之前的录音,从而可以获得完整的录音。The technical problem to be solved by the present invention is to provide a recording method, a voice exchange device, a recording server and a recording system, which can prevent the recording of the recording server before the recording is prepared, so that a complete recording can be obtained.
第一方面,本发明提供一种录音方法,所述方法包括:语音交换设备接收来自第一通信终端的录音请求,所述第一通信终端和其它通信终端通过所述语音交换设备进行通信;所述语音交换设备在收到所述录音 请求时,对将当前接收的来自所述第一通信终端的媒体流和来自所述其它通信终端的媒体流进行混音,并将混音生成的媒体流作为第一媒体流保存在缓存中;所述语音交换设备将录音服务器加入所述语音交换设备创建的包括所述第一通信终端和所述其它通信终端的会场中;所述语音交换设备向所述录音服务器发送启动录音请求;所述语音交换设备在收到所述录音服务器的启动录音响应之后,对当前接收的来自所述第一通信终端的媒体流和来自所述其它通信终端的媒体流进行混音,将混音生成的媒体流作为第二媒体流发送给所述录音服务器,并将收到所述启动录音响应之前保存在所述缓存中的第一媒体流发送给所述录音服务器,以便于所述录音服务器对所述第一媒体流和所述第二媒体流进行合并录制。In a first aspect, the present invention provides a recording method, the method comprising: a voice switching device receiving a recording request from a first communication terminal, wherein the first communication terminal and other communication terminals communicate via the voice switching device; The voice exchange device receives the recording At the request, mixing the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and storing the media stream generated by the mixing as a first media stream in the cache; The voice switching device adds a recording server to a site created by the voice switching device, including the first communication terminal and the other communication terminal; the voice switching device sends a start recording request to the recording server; After receiving the startup recording response of the recording server, the voice switching device mixes the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and mixes the generated media. Sending, by the second media stream, the stream to the recording server, and sending, to the recording server, the first media stream saved in the cache before receiving the start recording response, so that the recording server The first media stream and the second media stream are combined and recorded.
在第一方面的第一种可能的实现方式中,所述方法还包括:所述语音交换设备在接收到所述录音请求时创建所述缓存。In a first possible implementation manner of the first aspect, the method further includes: the voice switching device creates the cache when receiving the recording request.
结合第一方面或第一方面的第一种可能的实现方式,在第一方面的第二种可能的实现方式中,所述方法还包括:所述语音交换设备在收到所述启动录音响应时停止将混音生成的媒体流保存在缓存中。In conjunction with the first aspect or the first possible implementation of the first aspect, in a second possible implementation manner of the first aspect, the method further includes: the voice switching device receiving the start recording response Stops saving the media stream generated by the mix in the cache.
结合第一方面至第一方面的第二种中任一种可能的实现方式,在第一方面的第三种可能的实现方式中,所述第一媒体流中的媒体包带有序号标识,以便于所述录音服务器根据所述序号标识按时间顺序录制所述第一媒体流中的媒体包。With reference to the first aspect to the second possible implementation of the first aspect, in a third possible implementation manner of the first aspect, the media packet in the first media stream carries a serial number identifier, So that the recording server records the media packets in the first media stream in chronological order according to the serial number identifier.
第二方面,本发明提供一种录音方法,所述方法包括:录音服务器加入语音交换设备创建的包括第一通信终端和其它通信终端的会场中,所述第一通信终端和所述其它通信终端通过所述语音交换设备进行通信;所述录音服务器接收所述语音交换设备发送的启动录音请求并向所述语音交换设备发送启动录音响应;在向语音交换设备发送启动录音响应之后,录音服务器接收来自所述语音交换设备的第一媒体流和第二媒体流,所述第一媒体流是所述语音交换设备在收到所述启动录音响应之前对来自所述第一通信终端和所述其它通信终端的媒体流进行混音处理后生成的媒体流,所述第二媒体流是所述语音交换设备在收到所述启 动录音响应之后对来自所述第一通信终端和所述其它通信终端的媒体流进行混音处理后生成的媒体流;所述录音服务器对所述第一媒体流和所述第二媒体流进行合并录制。In a second aspect, the present invention provides a recording method, the method comprising: a recording server joining a conference site including a first communication terminal and other communication terminals created by a voice switching device, the first communication terminal and the other communication terminal Communicating by the voice switching device; the recording server receives a startup recording request sent by the voice switching device and sends a startup recording response to the voice switching device; after transmitting the startup recording response to the voice switching device, the recording server receives a first media stream and a second media stream from the voice switching device, the first media stream being from the first communication terminal and the other before the voice switching device receives the start recording response a media stream generated by the media stream of the communication terminal after the mixing process, where the second media stream is received by the voice switching device a media stream generated by mixing a media stream from the first communication terminal and the other communication terminal after the motion recording response; the recording server performing the first media stream and the second media stream Combined recording.
第三方面,本发明提供一种语音交换设备,所述语音交换设备包括:接收模块、混音模块、保存模块、加入模块、第一发送模块、第二发送模块以及第三发送模块;所述接收模块用于接收来自第一通信终端的录音请求和来自录音服务器的启动录音响应,所述第一通信终端和其它通信终端通过所述语音交换设备进行通信;所述混音模块用于在收到所述录音请求时,将当前接收的来自所述第一通信终端的媒体流和来自所述其它通信终端的媒体流进行混音;所述保存模块用于在所述接收模块接收到所述录音请求之后、接收到所述启动录音响应之前将所述混音模块混音生成的媒体流作为第一媒体流保存在缓存中;所述加入模块用于将录音服务器加入所述语音交换设备创建的包括所述第一通信终端和所述其它通信终端的会场中;所述第一发送模块用于向所述录音服务器发送启动录音请求;所述第二发送模块用于在所述接收模块接收到所述启动录音响应之后将所述混音模块当前混音生成的媒体流作为第二媒体流发送给所述录音服务器;所述第三发送模块用于在所述接收模块接收到所述启动录音响应之后将保存在所述缓存中的第一媒体流向所述录音服务器发送,以便于所述录音服务器对所述第一媒体流和所述第二媒体流进行合并录制。In a third aspect, the present invention provides a voice switching device, where the voice switching device includes: a receiving module, a mixing module, a saving module, a joining module, a first sending module, a second sending module, and a third sending module; The receiving module is configured to receive a recording request from the first communication terminal and a startup recording response from the recording server, where the first communication terminal and the other communication terminal communicate through the voice switching device; the mixing module is configured to receive At the time of the recording request, the currently received media stream from the first communication terminal and the media stream from the other communication terminal are mixed; the saving module is configured to receive the After the recording request, before receiving the start recording response, the media stream generated by the mixing of the mixing module is saved in the cache as a first media stream; the joining module is configured to join the recording server to the voice switching device. In the conference site including the first communication terminal and the other communication terminal; the first sending module is configured to The sound server sends a start recording request; the second sending module is configured to send, after the receiving module receives the start recording response, the media stream generated by the current mixing of the mixing module as the second media stream to the a third sending module, configured to send the first media stream saved in the cache to the recording server after the receiving module receives the startup recording response, so that the recording server is The first media stream and the second media stream are combined and recorded.
在第三方面的第一种可能的实现方式中,所述语音交换设备还包括创建模块,所述创建模块用于在接收到所述录音请求时创建所述缓存。In a first possible implementation manner of the third aspect, the voice switching device further includes a creating module, where the creating module is configured to create the cache when the recording request is received.
结合第三方面或第三方面的第一种可能的实现方式,在第三方面的第二种可能的实现方式中,所述保存模块还用于在收到所述启动录音响应时停止将混音生成的媒体流保存在缓存中。In conjunction with the third aspect, or the first possible implementation manner of the third aspect, in a second possible implementation manner of the third aspect, the saving module is further configured to stop mixing when receiving the startup recording response The media stream generated by the tone is saved in the cache.
结合第三方面至第三方面的第二种可能的实现方式,在第三方面的第三种可能的实现方式中,所述第一媒体流中的媒体包带有序号标识,以便于所述录音服务器根据所述序号标识按时间顺序录制所述第一媒体流中的媒体包。 With reference to the second possible implementation of the third aspect to the third aspect, in a third possible implementation manner of the third aspect, the media packet in the first media stream is identified by a serial number to facilitate the The recording server records the media packets in the first media stream in chronological order according to the serial number identifier.
第四方面,本发明提供一种录音服务器,所述录音服务器包括:加入模块、第一接收模块、发送模块、第二接收模块、第三接收模块以及合并录制模块;所述加入模块用于加入语音交换设备创建的包括第一通信终端和其它通信终端的会场中,所述第一通信终端和所述其它通信终端通过所述语音交换设备进行通信;所述第一接收模块接收所述语音交换设备发送的启动录音请求;所述发送模块用于在所述第一接收模块接收所述语音交换设备发送的启动录音请求后,向所述语音交换设备发送启动录音响应;所述第二接收模块用于在所述发送模块向所述语音交换设备发送启动录音响应之后,接收来自所述语音交换设备的第二媒体流,所述第二媒体流是所述语音交换设备在收到所述启动录音响应之后对来自所述第一通信终端和所述其它通信终端的媒体流进行混音处理后生成的媒体流;所述第三接收模块用于在所述发送模块向所述语音交换设备发送启动录音响应之后,接收来自所述语音交换设备的缓存的第一媒体流,所述第一媒体流是所述语音交换设备在收到所述启动录音响应之前对来自所述第一通信终端和所述其它通信终端的媒体流进行混音处理后生成的媒体流;所述合并录制模块用于对所述第二接收模块接收的第二媒体流和所述第三接收模块接收的第一媒体流进行合并录制。In a fourth aspect, the present invention provides a recording server, where the recording server includes: a joining module, a first receiving module, a sending module, a second receiving module, a third receiving module, and a combined recording module; In the conference site of the first communication terminal and the other communication terminal created by the voice switching device, the first communication terminal and the other communication terminal communicate through the voice switching device; the first receiving module receives the voice exchange Sending a recording request sent by the device; the sending module is configured to send a startup recording response to the voice switching device after the first receiving module receives the startup recording request sent by the voice switching device; the second receiving module Receiving, after the sending module sends a startup recording response to the voice switching device, receiving a second media stream from the voice switching device, where the second media stream is the voice switching device receiving the startup Performing media streams from the first communication terminal and the other communication terminals after the recording response a media stream generated after the tone processing; the third receiving module is configured to receive, after the sending module sends a start recording response to the voice switching device, a first media stream that is buffered from the voice switching device, The first media stream is a media stream generated by the voice switching device mixing the media streams from the first communication terminal and the other communication terminal before receiving the start recording response; the combined recording The module is configured to perform combined recording on the second media stream received by the second receiving module and the first media stream received by the third receiving module.
第五方面,本发明提供一种录音系统,所述系统包括:语音交换设备和录音服务器;所述语音交换设备用于接收来自第一通信终端的录音请求,所述第一通信终端和其它通信终端通过所述语音交换设备进行通信;在收到所述录音请求时,对当前接收的来自所述第一通信终端的媒体流和来自所述其它通信终端的媒体流进行混音,并将混音生成的媒体流作为第一媒体流保存在缓存中;将录音服务器加入所述语音交换设备创建的包括所述第一通信终端和所述其它通信终端的会场中;向所述录音服务器发送启动录音请求;在收到所述录音服务器的启动录音响应之后,对当前接收的来自所述第一通信终端的媒体流和来自所述其它通信终端的媒体流进行混音,将混音生成的媒体流作为第二媒体流发送给所述录音服务器,并将收到所述启动录音响应之前保存在所述缓存中的第一媒体流发送给所述录音服务器,以便于所述录音服务器对所述第一媒 体流和所述第二媒体流进行合并录制;所述录音服务器用于加入语音交换设备创建的包括第一通信终端和其它通信终端的会场中,所述第一通信终端和所述其它通信终端通过所述语音交换设备进行通信;接收所述语音交换设备发送的启动录音请求并向所述语音交换设备发送启动录音响应;在向所述语音交换设备发送启动录音响应之后,接收来自所述语音交换设备的第一媒体流和第二媒体流,所述第一媒体流是所述语音交换设备在收到所述启动录音响应之前对来自所述第一通信终端和所述其它通信终端的媒体流进行混音处理后生成的媒体流,所述第二媒体流是所述语音交换设备在收到所述启动录音响应之后对来自所述第一通信终端和所述其它通信终端的媒体流进行混音处理后生成的媒体流;对所述第一媒体流和所述第二媒体流进行合并录制。In a fifth aspect, the present invention provides a recording system, the system comprising: a voice switching device and a recording server; the voice switching device is configured to receive a recording request from the first communication terminal, the first communication terminal and other communication The terminal performs communication through the voice switching device; when receiving the recording request, mixes the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and mixes The sound generated media stream is saved in the cache as the first media stream; the recording server is added to the conference site including the first communication terminal and the other communication terminal created by the voice switching device; and the recording server is sent to the recording server. Recording request; after receiving the start recording response of the recording server, mixing the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and mixing the generated media The stream is sent to the recording server as a second media stream, and is saved in the location before receiving the start recording response A first media cache audio stream to the server, so that the recording medium of said first server The volume stream and the second media stream are combined and recorded; the recording server is used to join a conference site including a first communication terminal and other communication terminals created by the voice switching device, the first communication terminal and the other communication terminal Communicating by the voice switching device; receiving a startup recording request sent by the voice switching device and transmitting a startup recording response to the voice switching device; receiving the initiated recording response from the voice switching device, receiving the voice from the voice a first media stream and a second media stream of the switching device, the first media stream being a medium from the first communication terminal and the other communication terminal before the voice switching device receives the start recording response Flowing a media stream generated after the mixing process, wherein the second media stream is that the voice switching device performs a media stream from the first communication terminal and the other communication terminal after receiving the start recording response a media stream generated after the mixing process; combining the first media stream and the second media stream.
本发明的有益效果是:区别于现有技术的情况,本发明在收到录音服务器的启动录音响应之前,语音交换设备将混音处理后的第一媒体流保存在缓存中;在收到录音服务器的启动录音响应之后,语音交换设备将混音处理后的第二媒体流向录音服务器发送;语音交换设备将保存在缓存中的第一媒体流向录音服务器发送,录音服务器接收到第一媒体流和第二媒体流后,对第一媒体流和第二媒体流进行合并录制。通过这种方式,能够防止丢失录音服务器为录音做好准备之前的录音,从而可以获得完整的录音。The invention has the beneficial effects that, prior to the prior art, the voice exchange device saves the first media stream after the mixing process in the buffer before receiving the recording response of the recording server; After the server initiates the recording response, the voice switching device sends the second media stream after the mixing process to the recording server; the voice switching device sends the first media stream stored in the buffer to the recording server, and the recording server receives the first media stream and After the second media stream, the first media stream and the second media stream are combined and recorded. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.
附图说明DRAWINGS
图1是现有技术中典型的对点对点通话进行录音的场景的组网一结构示意图;1 is a schematic structural diagram of a networking of a scene for recording a point-to-point call in the prior art;
图2是本发明录音方法对点对点会通话进行录音的场景的组网一结构示意图;2 is a schematic structural diagram of a networking of a scene in which a recording method of the present invention records a point-to-point call;
图3是本发明录音方法一实施方式的流程图;3 is a flow chart of an embodiment of a recording method of the present invention;
图4是本发明录音方法另一实施方式的流程图;4 is a flow chart of another embodiment of the recording method of the present invention;
图5是本发明录音方法又一实施方式的流程图;Figure 5 is a flow chart of still another embodiment of the recording method of the present invention;
图6是本发明录音方法点对点的应用场景在组网中各网元交互的流 程图;6 is a flow of interaction between network elements in a network in the application scenario of the recording method of the present invention in a peer-to-peer manner Cheng Tu
图7是本发明语音交换设备一实施方式的结构示意图;7 is a schematic structural diagram of an embodiment of a voice switching device according to the present invention;
图8是本发明语音交换设备另一实施方式的结构示意图;8 is a schematic structural diagram of another embodiment of a voice switching device according to the present invention;
图9是本发明录音服务器一实施方式的结构示意图;9 is a schematic structural diagram of an embodiment of a recording server of the present invention;
图10是本发明录音系统一实施方式的结构示意图;10 is a schematic structural view of an embodiment of a recording system of the present invention;
图11是本发明语音交换设备又一实施方式的实体结构示意图;11 is a schematic diagram of a physical structure of still another embodiment of a voice switching device according to the present invention;
图12是本发明录音服务器另一实施方式的实体结构示意图。FIG. 12 is a schematic diagram showing the physical structure of another embodiment of the recording server of the present invention.
具体实施方式detailed description
下面结合附图和实施方式对本发明进行详细说明。The invention will now be described in detail in conjunction with the drawings and embodiments.
参阅图1,图1是现有技术中典型的对点对点通话进行录音的场景的组网一结构示意图,用户在通话过程中在IP话机11上按键启动录音,IP话机11发送录音请求给IP-PBX 12,IP-PBX 12创建会场把该用户的IP话机和另一用户的IP话机加入会场中,IP-PBX 12发送Invite消息给SIP信令服务器13,邀请录音服务器14加入会场,SIP信令服务器13在录音服务器集群中选择合适的录音服务器14,并把录音服务器14的IP地址在200OK消息中回复给IP-PBX 12,IP-PBX 12将录音服务器14加入会场,IP-PBX 12发送SIP INFO消息给SIP信令服务器13,通知录音服务器14启动录音,IP-PBX 12通过SIP信令服务器13收到录音服务器14启动录音200OK响应后,IP-PBX 12把来自会场中通话双方的媒体流进行混音处理,将混音后生成的媒体流发送给录音服务器14,录音服务器14获取媒体流,对该媒体流进行录制,从而实现录音。在IP-PBX 12收到录音请求至IP-PBX收到录音服务器14发送的启动录音响应的过程中,涉及诸多信令交互,这段时间内录音服务器14无法获取通话双方的的媒体流,因此会丢失这个过程中的媒体流,从而使得录音不完整。Referring to FIG. 1 , FIG. 1 is a schematic diagram of a typical network structure for recording a point-to-point call in a prior art. During a call, a user presses a button on the IP phone 11 to start recording, and the IP phone 11 sends a recording request to the IP- PBX 12, IP-PBX 12 creates a site to add the IP phone of the user and the IP phone of another user to the site. The IP-PBX 12 sends an Invite message to the SIP signaling server 13, inviting the recording server 14 to join the site, SIP signaling. The server 13 selects an appropriate recording server 14 in the recording server cluster, and returns the IP address of the recording server 14 to the IP-PBX 12 in the 200 OK message, the IP-PBX 12 adds the recording server 14 to the conference site, and the IP-PBX 12 sends the SIP. The INFO message is sent to the SIP signaling server 13, and the recording server 14 is notified to start the recording. After the IP-PBX 12 receives the 200 OK response from the recording server 14 by the SIP signaling server 13, the IP-PBX 12 sends the media stream from both parties in the conference. The mixing process is performed, and the media stream generated after the mixing is sent to the recording server 14, and the recording server 14 acquires the media stream and records the media stream to realize recording. In the process that the IP-PBX 12 receives the recording request until the IP-PBX receives the start recording response sent by the recording server 14, a number of signaling interactions are involved, during which the recording server 14 cannot obtain the media stream of both parties of the call. The media stream in this process will be lost, making the recording incomplete.
参阅图2,图2是本发明录音方法对点对点通话进行录音的场景的组网一结构示意图。在本发明方法中,IP-PBX 21创建会场,将发起录音请求的IP话机和另一IP话机加入会场中,同时为本次录音创建缓存 22。IP-PBX 21发送Invite消息给SIP信令服务器23,邀请录音服务器24加入会场,SIP信令服务器23在录音服务器集群中选择合适的录音服务器24,并把录音服务器24的IP地址在200OK消息中回复给IP-PBX21,IP-PBX 21将录音服务器24加入会场,IP-PBX 21发送SIP INFO消息给SIP信令服务器23,通知录音服务器24启动录音。其中,IP-PBX21在收到录音服务器24的启动录音响应前,IP-PBX 21把来自通话双方的媒体流进行混音处理,将混音后生成的媒体流发送到缓存22中;录音服务器24加入会场后,IP-PBX 21把来自会场中通话双方经过混音处理后生成的媒体流通过录音服务器24的IP地址直接发送给录音服务器24,不再发送到缓存22。IP-PBX 21将缓存22中保存的媒体流发送给录音服务器24。因此,录音服务器收到媒体流后,能够不丢失录音服务器在做好录音准备之前的录音,从而实现完整的录音。Referring to FIG. 2, FIG. 2 is a structural diagram of a networking of a scene in which a recording method of the present invention records a point-to-point call. In the method of the present invention, the IP-PBX 21 creates a site, and adds an IP phone that initiates the recording request and another IP phone to the site, and creates a cache for the recording. twenty two. The IP-PBX 21 sends an Invite message to the SIP signaling server 23, invites the recording server 24 to join the conference site, and the SIP signaling server 23 selects an appropriate recording server 24 in the recording server cluster, and sets the IP address of the recording server 24 in the 200 OK message. Replying to the IP-PBX 21, the IP-PBX 21 joins the recording server 24 to the conference site, and the IP-PBX 21 sends a SIP INFO message to the SIP signaling server 23, informing the recording server 24 to start recording. Wherein, before receiving the recording response of the recording server 24, the IP-PBX 21 mixes the media streams from both parties of the call, and sends the media stream generated after the mixing to the buffer 22; the recording server 24 After joining the conference site, the IP-PBX 21 directly transmits the media stream generated by the mixing process from both parties in the conference site to the recording server 24 through the IP address of the recording server 24, and is not sent to the buffer 22. The IP-PBX 21 transmits the media stream held in the cache 22 to the recording server 24. Therefore, after receiving the media stream, the recording server can not complete the recording of the recording server before preparing for recording, thereby achieving complete recording.
图3是本发明录音方法一实施方式的流程图,本实施方式是语音交换设备端的流程图,包括:3 is a flowchart of an embodiment of a recording method of the present invention. The embodiment is a flowchart of a voice switching device, and includes:
步骤S101:语音交换设备接收来自第一通信终端的录音请求,第一通信终端和其它通信终端通过语音交换设备进行通信。Step S101: The voice switching device receives the recording request from the first communication terminal, and the first communication terminal and the other communication terminal communicate through the voice switching device.
语音交换设备是一种用于语音电信号转发的网络设备,主要作用是处理用户的注册、呼叫、出局中继、创建会场以及与录音服务器的指令交互等。A voice switching device is a network device used for voice electrical signal forwarding. Its main functions are to process user registration, call, outgoing relay, create a conference site, and interact with commands of the recording server.
第一通信终端是主动发起录音请求的通信终端,其它通信终端是除了第一通信终端外参与录音过程的通信终端。当通过语音交换设备上创建的电话会议进行通信时,其它通信终端一般至少有两个;当通过语音交换设备进行点对点通信时,其它通信终端是一个。The first communication terminal is a communication terminal that actively initiates a recording request, and the other communication terminals are communication terminals that participate in a recording process in addition to the first communication terminal. When communicating via a conference call created on a voice switching device, there are typically at least two other communication terminals; when peer-to-peer communication is through a voice switching device, the other communication terminals are one.
第一通信终端向语音交换设备发送录音请求时,语音交换设备接收来自第一通信终端的录音请求,其中,第一通信终端和其它通信终端通过语音交换设备进行通信。When the first communication terminal sends a recording request to the voice switching device, the voice switching device receives a recording request from the first communication terminal, wherein the first communication terminal and the other communication terminal communicate through the voice switching device.
步骤S102:语音交换设备在收到录音请求时,对当前接收的来自第一通信终端的媒体流和来自其它通信终端的媒体流进行混音,并将混音生成的媒体流作为第一媒体流保存在缓存中。 Step S102: When receiving the recording request, the voice switching device mixes the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and uses the media stream generated by the mixing as the first media stream. Saved in the cache.
第一媒体流是语音交换设备在收到录音服务器的启动录音响应之前,将来自第一通信终端的媒体流和来自其它通信终端的媒体流进行混音处理后生成的媒体流。The first media stream is a media stream generated after the voice switching device mixes the media stream from the first communication terminal and the media stream from the other communication terminal before receiving the recording response of the recording server.
语音交换设备在收到录音服务器的启动录音响应之前,录音服务器还没有为录音做好准备。但是第一通信终端和其它通信终端已经开始会话,语音交换设备将第一媒体流保存在缓存中,可以防止丢失录音服务器为录音做好准备之前的媒体流,其中,缓存是预先已经创建好的。The voice switching device is not ready for recording until it receives a recording response from the recording server. However, the first communication terminal and the other communication terminal have started the session, and the voice switching device saves the first media stream in the cache, which can prevent the media stream before the recording server is prepared for recording, wherein the cache is already created in advance. .
步骤S103:语音交换设备将录音服务器加入语音交换设备创建的包括第一通信终端和其它通信终端的会场中。Step S103: The voice switching device adds the recording server to the conference site that is created by the voice switching device, including the first communication terminal and other communication terminals.
录音服务器是获取媒体流,实现用户录音的设备。语音交换设备收到来自第一通信终端的录音请求后,需要将录音服务器加入会场中才能够实现录音。会场是语音交换设备创建的,包括第一通信终端和其它通信终端。如果是点对点会话,当语音交换设备接收第一通信终端的录音请求后,语音交换设备创建会场,将第一通信终端和其它通信终端加入会场中;如果是电话会议,会场是在电话会议开始之前已经创建好的,即为电话会议的会场,当语音交换设备接收第一通信终端的录音请求后,不需要再创建会场。The recording server is a device that acquires a media stream and implements user recording. After the voice switching device receives the recording request from the first communication terminal, the recording server needs to be added to the conference site to enable recording. The venue is created by the voice switching device and includes the first communication terminal and other communication terminals. If it is a point-to-point session, after the voice switching device receives the recording request of the first communication terminal, the voice switching device creates a site, and joins the first communication terminal and other communication terminals to the conference site; if it is a conference call, the conference site is before the conference call starts. The conference site that has been created, that is, the conference call, does not need to create a conference site after the voice switching device receives the recording request of the first communication terminal.
具体可以是:语音交换设备发送Invite消息给信令服务器,邀请录音服务器加入会场,信令服务器在录音服务器集群中选择合适的录音服务器,并把录音服务器的IP地址在200OK消息中回复给语音交换设备,语音交换设备将录音服务器加入会场。Specifically, the voice switching device sends an Invite message to the signaling server, invites the recording server to join the conference site, and the signaling server selects an appropriate recording server in the recording server cluster, and returns the IP address of the recording server to the voice exchange in the 200 OK message. The device and the voice switching device join the recording server to the site.
信令服务器用于处理来自语音交换设备的信令和录音指令的设备,同时负责与录音服务器之间的交互。The signaling server is used to process the signaling and recording instructions from the voice switching device and is responsible for interaction with the recording server.
其中,语音交换设备是基于IP网络的语音交换设备,信令服务器是会话发起协议SIP信令服务器。当然,信令服务器还可以是H.323协议的信令服务器,在此不做限制。The voice switching device is a voice switching device based on an IP network, and the signaling server is a session initiation protocol SIP signaling server. Of course, the signaling server may also be a signaling server of the H.323 protocol, and is not limited herein.
当然,信令服务器进行信令交互的功能也可以集成在录音服务器上,使得语音交换设备直接与录音服务器进行信令的交互。Of course, the function of signaling interaction by the signaling server can also be integrated on the recording server, so that the voice switching device directly performs signaling interaction with the recording server.
步骤S104:语音交换设备向录音服务器发送启动录音请求并接收录 音服务器发送的启动录音响应。Step S104: The voice switching device sends a start recording request to the recording server and receives the recording. The start recording response sent by the tone server.
具体可以是,当录音服务器加入会场后,语音交换设备再向录音服务器发送启动录音请求,以便于录音服务器为录音做好准备(如为该录音预留录音资源)。如,步骤S104的具体实现方式为:语音交换设备发送SIP INFO消息给信令服务器,通知录音服务器启动录音,信令服务器将通知录音服务器启动录音的消息发送给录音服务器,信令服务器收到录音服务器的启动录音响应后,通过200OK消息将该录音服务器的启动录音响应发送给语音交换设备,语音交换设备在收到录音服务器的启动录音响应后,即可确定录音服务器已经为录音做好准备。Specifically, after the recording server joins the conference site, the voice switching device sends a recording request to the recording server to prepare the recording server for recording (for example, reserve recording resources for the recording). For example, the specific implementation manner of step S104 is: the voice switching device sends a SIP INFO message to the signaling server, and notifies the recording server to start recording, and the signaling server sends a message to the recording server to start recording to the recording server, and the signaling server receives the recording. After the server initiates the recording response, the recording response of the recording server is sent to the voice switching device through the 200 OK message, and after receiving the recording response of the recording server, the voice switching device can determine that the recording server is ready for the recording.
具体还可以是,在将录音服务器加入会场的过程中,通过将录音服务器加入会场时交互的信令发送启动录音请求和启动录音响应。如,步骤S104的具体实现方式为:语音交换设备通过邀请录音服务器加入会场的SIP INVITE消息携带启动录音指令,信令服务器将通知录音服务器启动录音的消息发送给录音服务器,并在收到录音服务器的启动录音响应后,通过SIP INVITE的200OK消息将该录音服务器的启动录音响应发送给语音交换设备,语音交换设备在收到该200OK消息后即完成将该录音服务器加入会场的过程,并确定录音服务器已经为录音做好准备。Specifically, in the process of joining the recording server to the conference site, the recording request is initiated and the recording response is started by signaling the interaction when the recording server is added to the conference site. For example, the specific implementation manner of the step S104 is: the voice switching device carries the start recording instruction by inviting the recording server to join the SIP INVITE message of the conference site, and the signaling server sends a message to the recording server to start the recording to the recording server, and receives the recording server. After the start of the recording response, the recording response of the recording server is sent to the voice switching device through the 200 OK message of the SIP INVITE, and the voice switching device completes the process of joining the recording server to the site after receiving the 200 OK message, and determines the recording. The server is ready for recording.
步骤S105:语音交换设备在收到录音服务器的启动录音响应之后,对当前接收的来自第一通信终端的媒体流和来自其它通信终端的媒体流进行混音,并将混音生成的媒体流作为第二媒体流发送给录音服务器。Step S105: After receiving the startup recording response of the recording server, the voice switching device mixes the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and uses the media stream generated by the mixing as the media stream. The second media stream is sent to the recording server.
第二媒体流是语音交换设备在收到录音服务器的启动录音响应之后,将来自第一通信终端的媒体流和来自其它通信终端的媒体流进行混音处理后生成的媒体流。The second media stream is a media stream generated after the voice switching device receives the recording response of the recording server and mixes the media stream from the first communication terminal with the media stream from the other communication terminal.
语音交换设备在收到录音服务器的启动录音响应之后,录音服务器已经为录音做好了录音准备。语音交换设备将当前接收的来自第一通信终端的媒体流和来自其它通信终端的媒体流进行混音,此时,将混音生成的媒体流作为第二媒体流发送给录音服务器。 After the voice switching device receives the recording response from the recording server, the recording server has prepared the recording for the recording. The voice switching device mixes the currently received media stream from the first communication terminal with the media stream from the other communication terminal. At this time, the media stream generated by the mixing is sent to the recording server as the second media stream.
步骤S106:语音交换设备将收到启动录音响应之前保存在缓存中的第一媒体流发送给录音服务器,以便于录音服务器对第一媒体流和第二媒体流进行合并录制。Step S106: The voice switching device sends the first media stream saved in the cache before receiving the recording response to the recording server, so that the recording server performs combined recording on the first media stream and the second media stream.
在录音服务器没有为录音做好准备的时间内,第一媒体流保存在缓存中,为了便于录音服务器获得完整的媒体流,语音交换设备将保存在缓存中的第一媒体流向录音服务器发送,以便于录音服务器对第一媒体流和第二媒体流进行合并录制。步骤S106具体可以有多种实现方式,如,语音交换设备同时发送第一媒体流和第二媒体流,录音服务器将第一媒体流和第二媒体流进行合并并录制为一个录音文件;又如,语音交换设备先发送第二媒体流再发送第一媒体流,录音服务器将第一媒体流和第二媒体流分别录制为一个录音文件,并将这两个录音文件合并为一个录音文件。During the time when the recording server is not ready for recording, the first media stream is saved in the cache. In order to facilitate the recording server to obtain a complete media stream, the voice switching device sends the first media stream stored in the buffer to the recording server, so that The first media stream and the second media stream are combined and recorded by the recording server. Step S106 may specifically be implemented in multiple manners. For example, the voice switching device simultaneously sends the first media stream and the second media stream, and the recording server combines the first media stream and the second media stream and records the same as a recording file; The voice switching device first sends the second media stream and then sends the first media stream, and the recording server records the first media stream and the second media stream as one recording file respectively, and combines the two recording files into one recording file.
参阅图4,本发明录音方法还包括:Referring to FIG. 4, the recording method of the present invention further includes:
步骤S107:语音交换设备在接收到录音请求时创建缓存。Step S107: The voice switching device creates a cache when receiving the recording request.
缓存是指临时文件交换区,具有极快的存取速率,它是内部存储和外界接口之间的缓冲器。Cache refers to the temporary file swap area, which has an extremely fast access rate, which is a buffer between the internal storage and the external interface.
其中,缓存是先进先出FIFO缓存。FIFO缓存是指在对缓存进行读操作时,先被写入缓存里的数据先被读出来。通过这种方式,能够自动管理缓存。Among them, the cache is a first-in, first-out FIFO buffer. The FIFO buffer means that when a read operation is performed on the cache, the data first written into the buffer is first read. In this way, the cache can be managed automatically.
语音交换设备在接收到录音请求时创建缓存。例如:可以是在接收到第一通信终端的录音请求后进行创建。当然,也可以是在接收到第一通信终端的录音请求之前进行创建,在此不做限制。The voice switching device creates a cache when it receives a recording request. For example, it may be created after receiving the recording request of the first communication terminal. Of course, it may be created before receiving the recording request of the first communication terminal, and no limitation is imposed here.
步骤S108:语音交换设备在收到启动录音响应时停止将混音生成的媒体流保存在缓存中。Step S108: The voice switching device stops storing the media stream generated by the mixing in the cache when receiving the startup recording response.
在收到录音服务器的启动录音响应之后,录音服务器已经为录音做好准备,因此,语音交换设备可以停止将混音处理后生成的媒体流保存在缓存中,直接将混音处理后生成的媒体流作为第二媒体流向录音服务器发送。这样,可以避免浪费缓存的存储空间。After receiving the recording response of the recording server, the recording server is ready for recording. Therefore, the voice switching device can stop storing the media stream generated after the mixing processing in the cache, and directly generate the media after the mixing processing. The stream is sent as a second media stream to the recording server. This way, you can avoid wasting the cached storage space.
其中,第一媒体流中的媒体包带有序号标识,以便于录音服务器根 据序号标识按时间顺序录制第一媒体流中的媒体包。The media packet in the first media stream carries a serial number identifier to facilitate the recording server root. The media packets in the first media stream are recorded in chronological order according to the serial number identifier.
其中,媒体流是实时传输协议RTP媒体流。The media stream is a real-time transport protocol RTP media stream.
本发明在收到录音服务器的启动录音响应之前,语音交换设备将混音处理后生成的第一媒体流保存在缓存中;在收到录音服务器的启动录音响应之后,语音交换设备将混音处理后生成的第二媒体流向录音服务器发送;语音交换设备将保存在缓存中的第一媒体流向录音服务器发送,以便于录音服务器对第一媒体流和第二媒体流进行合并录制。通过这种方式,能够防止丢失录音服务器为录音做好准备之前的录音,从而可以获得完整的录音。Before receiving the recording response of the recording server, the voice exchange device saves the first media stream generated after the mixing process in the cache; after receiving the recording response of the recording server, the voice switching device will perform the mixing processing. The generated second media stream is sent to the recording server; the voice switching device sends the first media stream saved in the cache to the recording server, so that the recording server performs combined recording on the first media stream and the second media stream. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.
参阅图5,图5是本发明录音方法又一实施方式的流程图,本实施方式是录音服务器端的流程图,包括:Referring to FIG. 5, FIG. 5 is a flowchart of still another embodiment of the recording method of the present invention. The embodiment is a flowchart of the recording server, and includes:
步骤S301:录音服务器加入语音交换设备创建的包括第一通信终端和其它通信终端的会场中,第一通信终端和其它通信终端通过语音交换设备进行通信。Step S301: The recording server joins the conference site including the first communication terminal and other communication terminals created by the voice switching device, and the first communication terminal and other communication terminals communicate through the voice switching device.
录音服务器是获取媒体流,实现用户录音的设备。语音交换设备是一种用于语音电信号转发的网络设备。第一通信终端是主动发起录音请求的通信终端,其它通信终端是除了第一通信终端外参与录音过程的通信终端。当通过语音交换设备上创建的电话会议进行通信时,其它通信终端一般至少有两个;当通过语音交换设备进行点对点通信时,其它通信终端是一个。其中,第一通信终端和其它通信终端通过语音交换设备进行通信。The recording server is a device that acquires a media stream and implements user recording. A voice switching device is a network device used for voice electrical signal forwarding. The first communication terminal is a communication terminal that actively initiates a recording request, and the other communication terminals are communication terminals that participate in a recording process in addition to the first communication terminal. When communicating via a conference call created on a voice switching device, there are typically at least two other communication terminals; when peer-to-peer communication is through a voice switching device, the other communication terminals are one. The first communication terminal and the other communication terminals communicate through the voice switching device.
步骤S302:录音服务器接收语音交换设备发送的启动录音请求并向语音交换设备发送启动录音响应。Step S302: The recording server receives the startup recording request sent by the voice switching device and sends a startup recording response to the voice switching device.
在录音服务器加入会场后,接收语音交换设备发送的启动录音请求,此时录音服务器接收语音交换设备发送的启动录音请求,并向语音交换设备发送启动录音响应。After the recording server joins the conference site, it receives a startup recording request sent by the voice switching device. At this time, the recording server receives the startup recording request sent by the voice switching device, and sends a startup recording response to the voice switching device.
步骤S303:在向语音交换设备发送启动录音响应之后,录音服务器接收来自语音交换设备的第一媒体流和第二媒体流。Step S303: After transmitting the startup recording response to the voice switching device, the recording server receives the first media stream and the second media stream from the voice switching device.
第二媒体流是语音交换设备在收到录音服务器的启动录音响应之 后,将来自第一通信终端的媒体流和来自其它通信终端的媒体流进行混音处理后生成的媒体流。在向语音交换设备发送启动录音响应之后,录音服务器已经为录音做好准备,此时,录音服务器接收来自语音交换设备的第二媒体流。The second media stream is a voice switching device that receives the recording response of the recording server. Thereafter, the media stream generated from the first communication terminal and the media stream from the other communication terminal are subjected to a mixing process to generate a media stream. After transmitting the initiate recording response to the voice switching device, the recording server is ready for recording, at which point the recording server receives the second media stream from the voice switching device.
第一媒体流是语音交换设备在收到录音服务器的启动录音响应之前,语音交换设备将来自第一通信终端的媒体流和来自其它通信终端的媒体流进行混音处理后生成的媒体流。第一媒体流预先保存在缓存中,待录音服务器为录音做好准备后,即可接收语音交换设备发送的保存在缓存中的第一媒体流。The first media stream is a media stream generated by the voice switching device before the voice switching device receives the recording response of the recording server, and the voice switching device mixes the media stream from the first communication terminal with the media stream from the other communication terminal. The first media stream is pre-stored in the cache, and after the recording server is ready for the recording, the first media stream saved in the cache sent by the voice switching device can be received.
步骤S304:录音服务器对第一媒体流和第二媒体流进行合并录制。Step S304: The recording server performs combined recording on the first media stream and the second media stream.
录音服务器对第一媒体流和第二媒体流进行合并录制。如,语音交换设备同时发送第一媒体流和第二媒体流,录音服务器将第一媒体流和第二媒体流进行合并并录制为一个录音文件;又如,语音交换设备先发送第二媒体流再发送第一媒体流,录音服务器将第一媒体流和第二媒体流分别录制为一个录音文件,并将这两个录音文件合并为一个录音文件。The recording server performs combined recording on the first media stream and the second media stream. For example, the voice switching device simultaneously sends the first media stream and the second media stream, and the recording server combines the first media stream and the second media stream into one recording file; for example, the voice switching device sends the second media stream first. The first media stream is sent again, and the recording server records the first media stream and the second media stream as one recording file respectively, and combines the two recording files into one recording file.
本发明录音服务器接收到第一媒体流和第二媒体流后,对第一媒体流和第二媒体流进行合并录制。通过这种方式,能够防止丢失录音服务器为录音做好准备之前的录音,从而可以获得完整的录音。After receiving the first media stream and the second media stream, the recording server of the present invention performs combined recording on the first media stream and the second media stream. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.
下面以点对点的应用场景和电话会议的应用场景为例具有说明本发明录音方法。The recording method of the present invention is described below by taking a point-to-point application scenario and a conference application scenario as an example.
参阅图6,图6是本发明录音方法点对点的应用场景在组网中各网元交互的流程图。以IP-PBX、FIFO缓存、SIP信令服务器为例说明。Referring to FIG. 6, FIG. 6 is a flowchart of interaction between network elements in a networking in a scenario where the recording method of the present invention is peer-to-peer. Take IP-PBX, FIFO buffer, SIP signaling server as an example.
(1)第一通信终端的用户利用第一通信终端通过IP-PBX呼叫其它通信终端的用户,并与该其它通信终端的用户进行点对点通信,第一通信终端的用户在第一通信终端上按录音键启动录音,向IP-PBX发送录音请求。(1) The user of the first communication terminal uses the first communication terminal to call the user of the other communication terminal through the IP-PBX, and performs peer-to-peer communication with the user of the other communication terminal, and the user of the first communication terminal presses on the first communication terminal. The recording button starts recording and sends a recording request to the IP-PBX.
第一通信终端和其它通信终端均可以为IP话机,第一通信终端和其它通信终端的用户均可以为内部用户。 The first communication terminal and the other communication terminals may each be an IP phone, and the users of the first communication terminal and other communication terminals may be internal users.
如果是电话会议场景,这一步骤应该是:第一通信终端的用户在会议过程中在第一通信终端上按录音键启动录音,向IP-PBX发送录音请求。If it is a conference call scenario, this step should be: the user of the first communication terminal initiates recording by pressing the record button on the first communication terminal during the conference, and sends a recording request to the IP-PBX.
(2)IP-PBX接收录音请求后,创建会场,把第一通信终端和其它通信终端加入会场中,并将来自第一通信终端和其它通信终端的媒体流进行混音处理,混音处理后生成的媒体流作为第一媒体流。(2) After receiving the recording request, the IP-PBX creates a conference site, adds the first communication terminal and other communication terminals to the conference site, and mixes the media streams from the first communication terminal and other communication terminals, after the mixing process. The generated media stream acts as the first media stream.
如果是电话会议场景,这一步骤应该是:IP-PBX接收录音请求后,由于会场在电话会议开始前已经创建好,此时不需要再创建会场,将来自第一通信终端和其它通信终端的媒体流进行混音处理,混音处理后生成的媒体流作为第一媒体流。If it is a conference call scenario, this step should be: After the IP-PBX receives the recording request, since the site has been created before the conference call begins, there is no need to create a site at this time, and the first communication terminal and other communication terminals will be used. The media stream is subjected to mixing processing, and the media stream generated after the mixing processing is used as the first media stream.
(2)IP-PBX为本次录音创建FIFO缓存。在IP-PBX收到录音服务器的启动录音响应前,IP-PBX把第一媒体流发送到FIFO缓存中进行保存。(2) IP-PBX creates a FIFO buffer for this recording. Before the IP-PBX receives the recording response of the recording server, the IP-PBX sends the first media stream to the FIFO buffer for storage.
(3)IP-PBX发送Invite消息给SIP Server,邀请录音服务器加入会场。(3) The IP-PBX sends an Invite message to the SIP server, and invites the recording server to join the site.
(4)SIP Server在录音服务器集群中选择合适的录音服务器,并把该录音服务器IP地址在200OK消息中回复给IP-PBX。(4) The SIP Server selects a suitable recording server in the recording server cluster, and replies the IP address of the recording server to the IP-PBX in the 200 OK message.
(5)IP-PBX将录音服务器加入会场,然后,IP-PBX发送SIP INFO消息给SIP Server,以通过SIP Server通知录音服务器启动录音,SIPServer通知录音服务器启动录音,SIP Server收到录音服务器的启动录音响应后,在200OK消息中将该录音服务器的启动录音响应发送给语音交换设备。(5) IP-PBX joins the recording server to the conference site. Then, the IP-PBX sends a SIP INFO message to the SIP server to notify the recording server to start recording through the SIP server. The SIPServer notifies the recording server to start recording, and the SIP server receives the recording server. After the recording response, the recording response of the recording server is sent to the voice switching device in the 200 OK message.
(6)IP-PBX收到录音服务器启动录音200OK响应后,IP-PBX把来自第一通信终端和其它通信终端的媒体流进行混音处理,混音处理后生成的媒体流作为第二媒体流直接发给录音服务器,不再发送到FIFO缓存。(6) After the IP-PBX receives the 200 OK response from the recording server, the IP-PBX mixes the media streams from the first communication terminal and other communication terminals, and the media stream generated after the mixing process is used as the second media stream. Directly sent to the recording server, no longer sent to the FIFO buffer.
(7)FIFO缓存中的第一媒体流发送给录音服务器。第一媒体流有序号标识,录音服务器收到乱序的第一媒体流,能够根据序号标识按时间顺序录制第一媒体流中的媒体包,实现丢失录音服务器为录音做好准 备之前的录音。(7) The first media stream in the FIFO buffer is sent to the recording server. The first media stream has a serial number identifier, and the recording server receives the out-of-order first media stream, and can record the media packets in the first media stream in time sequence according to the serial number identifier, so that the lost recording server is ready for recording. Prepare the previous recording.
(8)录音服务器对第一媒体流和第二媒体流进行合并录制。(8) The recording server performs combined recording on the first media stream and the second media stream.
参阅图7,图7是本发明语音交换设备一实施方式的结构示意图,该语音交换设备包括:接收模块101、混音模块102、保存模块103、加入模块104、第一发送模块105、第二发送模块106以及第三发送模块107。Referring to FIG. 7, FIG. 7 is a schematic structural diagram of an embodiment of a voice switching device according to the present invention. The voice switching device includes: a receiving module 101, a mixing module 102, a saving module 103, a joining module 104, a first sending module 105, and a second The transmitting module 106 and the third transmitting module 107.
接收模块101用于接收来自第一通信终端的录音请求和来自录音服务器的启动录音响应,第一通信终端和其它通信终端通过语音交换设备进行通信。The receiving module 101 is configured to receive a recording request from the first communication terminal and a startup recording response from the recording server, and the first communication terminal and the other communication terminal communicate through the voice switching device.
语音交换设备是一种用于语音电信号转发的网络设备,主要作用是处理用户的注册、呼叫、出局中继、创建会场以及与录音服务器的指令交互等。A voice switching device is a network device used for voice electrical signal forwarding. Its main functions are to process user registration, call, outgoing relay, create a conference site, and interact with commands of the recording server.
第一通信终端是主动发起录音请求的通信终端,其它通信终端是除了第一通信终端外参与录音过程的通信终端。当通过语音交换设备上创建的电话会议进行通信时,其它通信终端一般至少有两个;当通过语音交换设备进行点对点通信时,其它通信终端是一个。The first communication terminal is a communication terminal that actively initiates a recording request, and the other communication terminals are communication terminals that participate in a recording process in addition to the first communication terminal. When communicating via a conference call created on a voice switching device, there are typically at least two other communication terminals; when peer-to-peer communication is through a voice switching device, the other communication terminals are one.
第一通信终端向语音交换设备发送录音请求时,语音交换设备接收来自第一通信终端的录音请求,其中,第一通信终端和其它通信终端通过语音交换设备进行通信。When the first communication terminal sends a recording request to the voice switching device, the voice switching device receives a recording request from the first communication terminal, wherein the first communication terminal and the other communication terminal communicate through the voice switching device.
另外,在语言交换设备向录音服务器发送录音请求后,可以接收到来自录音服务器的启动录音响应,此时,录音服务器已经为录音做好准备。In addition, after the language exchange device sends a recording request to the recording server, it can receive a startup recording response from the recording server. At this time, the recording server is ready for recording.
混音模块102用于在收到来自第一通信终端的录音请求时,将当前接收的来自第一通信终端的媒体流和来自其它通信终端的媒体流进行混音。The mixing module 102 is configured to mix the currently received media stream from the first communication terminal and the media stream from other communication terminals upon receiving the recording request from the first communication terminal.
保存模块103用于在接收模块101接收到所述录音请求之后、接收到所述启动录音响应之前将混音模块102混音生成的媒体流作为第一媒体流保存在缓存中。The saving module 103 is configured to save the media stream generated by the mixing of the mixing module 102 as a first media stream in the cache after the receiving module 101 receives the recording request and before receiving the starting recording response.
第一媒体流是语音交换设备在收到录音服务器的启动录音响应之 前,将来自第一通信终端的媒体流和来自其它通信终端的媒体流进行混音处理后生成的媒体流。The first media stream is a voice switching device that receives a recording response from the recording server. Before, the media stream generated from the first communication terminal and the media stream from the other communication terminal are mixed and processed.
语音交换设备在收到录音服务器的启动录音响应之前,录音服务器还没有为录音做好准备。但是第一通信终端和其它通信终端已经开始会话,语音交换设备将第一媒体流保存在缓存中,可以防止丢失录音服务器为录音做好准备之前的媒体流,其中,缓存是预先已经创建好的。The voice switching device is not ready for recording until it receives a recording response from the recording server. However, the first communication terminal and the other communication terminal have started the session, and the voice switching device saves the first media stream in the cache, which can prevent the media stream before the recording server is prepared for recording, wherein the cache is already created in advance. .
加入模块104用于将录音服务器加入语音交换设备创建的包括第一通信终端和其它通信终端的会场中。The adding module 104 is configured to join the recording server into the conference site of the first communication terminal and other communication terminals created by the voice switching device.
录音服务器是获取媒体流,实现用户录音的设备。语音交换设备收到来自第一通信终端的录音请求后,需要将录音服务器加入会场中才能够实现录音。会场是语音交换设备创建的,包括第一通信终端和其它通信终端。如果是点对点会话,当语音交换设备接收第一通信终端的录音请求后,语音交换设备创建会场,将第一通信终端和其它通信终端加入会场中;如果是电话会议,会场是在电话会议开始之前已经创建好的,即为电话会议的会场,当语音交换设备接收第一通信终端的录音请求后,不需要再创建会场。The recording server is a device that acquires a media stream and implements user recording. After the voice switching device receives the recording request from the first communication terminal, the recording server needs to be added to the conference site to enable recording. The venue is created by the voice switching device and includes the first communication terminal and other communication terminals. If it is a point-to-point session, after the voice switching device receives the recording request of the first communication terminal, the voice switching device creates a site, and joins the first communication terminal and other communication terminals to the conference site; if it is a conference call, the conference site is before the conference call starts. The conference site that has been created, that is, the conference call, does not need to create a conference site after the voice switching device receives the recording request of the first communication terminal.
第一发送模块105用于向录音服务器发送启动录音请求。The first sending module 105 is configured to send a start recording request to the recording server.
当录音服务器加入会场后,语音交换设备再向录音服务器发送启动录音请求,以便于录音服务器为录音做好准备。After the recording server joins the conference site, the voice switching device sends a recording request to the recording server to prepare the recording server for the recording.
第二发送模块106用于在接收模块101接收到所述启动录音响应之后将混音模块102当前混音生成的媒体流作为第二媒体流发送给录音服务器。The second sending module 106 is configured to send the media stream generated by the current mixing of the mixing module 102 to the recording server as the second media stream after the receiving module 101 receives the startup recording response.
第二媒体流是语音交换设备在收到录音服务器的启动录音响应之后,将来自第一通信终端的媒体流和来自其它通信终端的媒体流进行混音处理后生成的媒体流。The second media stream is a media stream generated after the voice switching device receives the recording response of the recording server and mixes the media stream from the first communication terminal with the media stream from the other communication terminal.
语音交换设备在收到录音服务器的启动录音响应之后,录音服务器已经为录音做好了录音准备。语音交换设备将当前接收的来自第一通信终端的媒体流和来自其它通信终端的媒体流进行混音,此时,将混音生成的媒体流作为第二媒体流发送给录音服务器。 After the voice switching device receives the recording response from the recording server, the recording server has prepared the recording for the recording. The voice switching device mixes the currently received media stream from the first communication terminal with the media stream from the other communication terminal. At this time, the media stream generated by the mixing is sent to the recording server as the second media stream.
第三发送模块107用于在接收模块101接收到启动录音响应之后将保存在缓存中的第一媒体流向录音服务器发送,以便于录音服务器对第一媒体流和第二媒体流进行合并录制。The third sending module 107 is configured to send the first media stream saved in the cache to the recording server after the receiving module 101 receives the startup recording response, so that the recording server performs combined recording on the first media stream and the second media stream.
在录音服务器没有为录音做好准备的时间内,第一媒体流保存在缓存中,为了便于录音服务器获得完整的媒体流,语音交换设备将保存在缓存中的第一媒体流向录音服务器发送,以便于录音服务器对第一媒体流和第二媒体流进行合并录制。During the time when the recording server is not ready for recording, the first media stream is saved in the cache. In order to facilitate the recording server to obtain a complete media stream, the voice switching device sends the first media stream stored in the buffer to the recording server, so that The first media stream and the second media stream are combined and recorded by the recording server.
需要说明的是,在实际应用中,可以对本实施方式的模块或单元进行增减或合并,在此不再一一赘叙。It should be noted that, in practical applications, the modules or units of the present embodiment may be added, subtracted, or combined, and will not be further described herein.
参阅图8,语音交换设备还包括创建模块108,创建模块108用于在接收到录音请求时创建缓存。Referring to Figure 8, the voice switching device also includes a creation module 108 for creating a cache upon receipt of a recording request.
缓存是指临时文件交换区,具有极快的存取速率,它是内部存储和外界接口之间的缓冲器。Cache refers to the temporary file swap area, which has an extremely fast access rate, which is a buffer between the internal storage and the external interface.
语音交换设备在接收到录音请求时创建缓存。例如:可以是在接收到第一通信终端的录音请求后进行创建。当然,也可以是在接收到第一通信终端的录音请求之前进行创建,在此不做限制。The voice switching device creates a cache when it receives a recording request. For example, it may be created after receiving the recording request of the first communication terminal. Of course, it may be created before receiving the recording request of the first communication terminal, and no limitation is imposed here.
保存模块103用于在收到录音服务器的启动录音响应时停止将混音生成的媒体流保存在缓存中。The saving module 103 is configured to stop saving the media stream generated by the mixing in the cache when receiving the startup recording response of the recording server.
在收到录音服务器的启动录音响应之后,录音服务器已经为录音做好准备,因此,语音交换设备可以停止将混音处理后生成的媒体流保存在缓存中,直接将混音处理后生成的媒体流作为第二媒体流向录音服务器发送。这样,可以避免浪费缓存的存储空间。After receiving the recording response of the recording server, the recording server is ready for recording. Therefore, the voice switching device can stop storing the media stream generated after the mixing processing in the cache, and directly generate the media after the mixing processing. The stream is sent as a second media stream to the recording server. This way, you can avoid wasting the cached storage space.
其中,第一媒体流中的媒体包带有序号标识,以便于录音服务器根据序号标识按时间顺序录制第一媒体流中的媒体包。The media packet in the first media stream carries a serial number identifier, so that the recording server records the media packets in the first media stream in time sequence according to the serial number identifier.
需要说明的是,本实施方式的语音交换设备可以执行图3和图4中的步骤。It should be noted that the voice switching device of this embodiment may perform the steps in FIG. 3 and FIG. 4.
本发明在收到录音服务器的启动录音响应之前,语音交换设备将混音处理后生成的第一媒体流保存在缓存中;在收到录音服务器的启动录音响应之后,语音交换设备将混音处理后生成的第二媒体流向录音服务 器发送;语音交换设备将保存在缓存中的第一媒体流向录音服务器发送,以便于录音服务器对第一媒体流和第二媒体流进行合并录制。通过这种方式,能够防止丢失录音服务器为录音做好准备之前的录音,从而可以获得完整的录音。Before receiving the recording response of the recording server, the voice exchange device saves the first media stream generated after the mixing process in the cache; after receiving the recording response of the recording server, the voice switching device will perform the mixing processing. After the generated second media stream recording service Transmitting; the voice switching device sends the first media stream saved in the cache to the recording server, so that the recording server performs combined recording on the first media stream and the second media stream. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.
参阅图9,图9是本发明录音服务器一实施方式的结构示意图,该录音服务器包括:加入模块201、第一接收模块202、发送模块203、第二接收模块204、第三接收模块205以及合并录制模块206。Referring to FIG. 9, FIG. 9 is a schematic structural diagram of an embodiment of a recording server according to the present invention. The recording server includes: a joining module 201, a first receiving module 202, a sending module 203, a second receiving module 204, a third receiving module 205, and a merge. Recording module 206.
需要说明的是,本实施方式的录音服务器可以执行图5中的步骤。It should be noted that the recording server of the present embodiment can perform the steps in FIG. 5.
加入模块201用于加入语音交换设备创建的包括第一通信终端和其它通信终端的会场中,第一通信终端和其它通信终端通过语音交换设备进行通信。The joining module 201 is configured to join a conference site that includes a first communication terminal and other communication terminals created by the voice switching device, and the first communication terminal and other communication terminals communicate through the voice switching device.
录音服务器是获取媒体流,实现用户录音的设备。语音交换设备是一种用于语音电信号转发的网络设备。第一通信终端是主动发起录音请求的通信终端,其它通信终端是除了第一通信终端外参与录音过程的通信终端。当通过语音交换设备上创建的电话会议进行通信时,其它通信终端一般至少有两个;当通过语音交换设备进行点对点通信时,其它通信终端是一个。其中,第一通信终端和其它通信终端通过语音交换设备进行通信。The recording server is a device that acquires a media stream and implements user recording. A voice switching device is a network device used for voice electrical signal forwarding. The first communication terminal is a communication terminal that actively initiates a recording request, and the other communication terminals are communication terminals that participate in a recording process in addition to the first communication terminal. When communicating via a conference call created on a voice switching device, there are typically at least two other communication terminals; when peer-to-peer communication is through a voice switching device, the other communication terminals are one. The first communication terminal and the other communication terminals communicate through the voice switching device.
录音服务器加入语音交换设备创建的包括第一通信终端和其它通信终端的会场中,才可以为录音做准备。The recording server can be prepared for recording by joining the conference site including the first communication terminal and other communication terminals created by the voice switching device.
第一接收模块202用于接收语音交换设备发送的启动录音请求。The first receiving module 202 is configured to receive a startup recording request sent by the voice switching device.
发送模块203用于在第一接收模块202接收语音交换设备发送的启动录音请求后,向语音交换设备发送启动录音响应。The sending module 203 is configured to send a start recording response to the voice switching device after the first receiving module 202 receives the startup recording request sent by the voice switching device.
在录音服务器加入会场后,接收语音交换设备发送的启动录音请求,此时录音服务器接收语音交换设备发送的启动录音请求,并向语音交换设备发送启动录音响应。After the recording server joins the conference site, it receives a startup recording request sent by the voice switching device. At this time, the recording server receives the startup recording request sent by the voice switching device, and sends a startup recording response to the voice switching device.
第二接收模块204用于在发送模块203向语音交换设备发送启动录音响应之后,接收来自语音交换设备的第二媒体流。The second receiving module 204 is configured to receive the second media stream from the voice switching device after the sending module 203 sends the start recording response to the voice switching device.
第二媒体流是语音交换设备在收到录音服务器的启动录音响应之 后,将来自第一通信终端的媒体流和来自其它通信终端的媒体流进行混音处理后生成的媒体流。在向语音交换设备发送启动录音响应之后,录音服务器已经为录音做好准备,此时,录音服务器接收来自语音交换设备的第二媒体流。The second media stream is a voice switching device that receives the recording response of the recording server. Thereafter, the media stream generated from the first communication terminal and the media stream from the other communication terminal are subjected to a mixing process to generate a media stream. After transmitting the initiate recording response to the voice switching device, the recording server is ready for recording, at which point the recording server receives the second media stream from the voice switching device.
第三接收模块205用于在发送模块203向所述语音交换设备发送启动录音响应之后,接收来自语音交换设备的缓存的第一媒体流。The third receiving module 205 is configured to receive the buffered first media stream from the voice switching device after the sending module 203 sends the start recording response to the voice switching device.
第一媒体流是语音交换设备在收到录音服务器的启动录音响应之前,语音交换设备将来自第一通信终端的媒体流和来自其它通信终端的媒体流进行混音处理后生成的媒体流。第一媒体流预先保存在缓存中,待录音服务器为录音做好准备后,即可接收语音交换设备发送的保存在缓存中的第一媒体流。The first media stream is a media stream generated by the voice switching device before the voice switching device receives the recording response of the recording server, and the voice switching device mixes the media stream from the first communication terminal with the media stream from the other communication terminal. The first media stream is pre-stored in the cache, and after the recording server is ready for the recording, the first media stream saved in the cache sent by the voice switching device can be received.
合并录制模块206用于对第二接收模块204接收的第二媒体流和第三接收模块205接收的第一媒体流进行合并录制。The merge recording module 206 is configured to perform combined recording on the second media stream received by the second receiving module 204 and the first media stream received by the third receiving module 205.
录音服务器对第一媒体流和第二媒体流进行合并录制。如,语音交换设备同时发送第一媒体流和第二媒体流,录音服务器将第一媒体流和第二媒体流进行合并并录制为一个录音文件;又如,语音交换设备先发送第二媒体流再发送第一媒体流,录音服务器将第一媒体流和第二媒体流分别录制为一个录音文件,并将这两个录音文件合并为一个录音文件。The recording server performs combined recording on the first media stream and the second media stream. For example, the voice switching device simultaneously sends the first media stream and the second media stream, and the recording server combines the first media stream and the second media stream into one recording file; for example, the voice switching device sends the second media stream first. The first media stream is sent again, and the recording server records the first media stream and the second media stream as one recording file respectively, and combines the two recording files into one recording file.
本发明录音服务器接收到第一媒体流和第二媒体流后,对第一媒体流和第二媒体流进行合并录制。通过这种方式,能够防止丢失录音服务器为录音做好准备之前的录音,从而可以获得完整的录音。After receiving the first media stream and the second media stream, the recording server of the present invention performs combined recording on the first media stream and the second media stream. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.
参阅图10,图10是本发明录音系统一实施方式的结构示意图,该系统包括:语音交换设备31和录音服务器32。Referring to FIG. 10, FIG. 10 is a schematic structural diagram of an embodiment of a recording system according to the present invention. The system includes a voice switching device 31 and a recording server 32.
语音交换设备用于接收来自第一通信终端的录音请求,第一通信终端和其它通信终端通过语音交换设备进行通信;在收到录音请求时,对当前接收的来自第一通信终端的媒体流和来自其它通信终端的媒体流进行混音,并将混音生成的媒体流作为第一媒体流保存在缓存中;将录音服务器加入语音交换设备创建的包括第一通信终端和其它通信终端 的会场中;向录音服务器发送启动录音请求;在收到录音服务器的启动录音响应之后,对当前接收的来自第一通信终端的媒体流和来自其它通信终端的媒体流进行混音,将混音生成的媒体流作为第二媒体流发送给录音服务器,并将收到启动录音响应之前保存在缓存中的第一媒体流发送给录音服务器,以便于录音服务器对第一媒体流和第二媒体流进行合并录制。The voice switching device is configured to receive a recording request from the first communication terminal, and the first communication terminal and the other communication terminal communicate through the voice switching device; when receiving the recording request, the currently received media stream from the first communication terminal and The media streams from other communication terminals are mixed, and the media stream generated by the mixing is saved in the cache as the first media stream; the first communication terminal and other communication terminals are created by adding the recording server to the voice switching device. In the conference site; sending a start recording request to the recording server; after receiving the recording response of the recording server, mixing the currently received media stream from the first communication terminal and the media stream from the other communication terminal, mixing the sound The generated media stream is sent to the recording server as a second media stream, and the first media stream saved in the cache before receiving the start recording response is sent to the recording server, so that the recording server can access the first media stream and the second media stream. Make a combined recording.
录音服务器用于加入语音交换设备创建的包括第一通信终端和其它通信终端的会场中,第一通信终端和其它通信终端通过语音交换设备进行通信;接收语音交换设备发送的启动录音请求并向语音交换设备发送启动录音响应;在向语音交换设备发送启动录音响应之后,接收来自语音交换设备的第一媒体流和第二媒体流,第一媒体流是语音交换设备在收到启动录音响应之前对来自第一通信终端和其它通信终端的媒体流进行混音处理后生成的媒体流,第二媒体流是语音交换设备在收到启动录音响应之后对来自第一通信终端和其它通信终端的媒体流进行混音处理后生成的媒体流;对第一媒体流和第二媒体流进行合并录制。The recording server is used to join the conference site including the first communication terminal and the other communication terminal created by the voice switching device, and the first communication terminal and the other communication terminal communicate through the voice switching device; receive the startup recording request sent by the voice switching device and send the voice to the voice The switching device sends a startup recording response; after transmitting the startup recording response to the voice switching device, receiving the first media stream and the second media stream from the voice switching device, the first media stream is the voice switching device before receiving the start recording response a media stream generated by the media stream from the first communication terminal and other communication terminals after the mixing process, and the second media stream is a media stream from the first communication terminal and other communication terminals after the voice switching device receives the start recording response a media stream generated after the mixing process is performed; the first media stream and the second media stream are combined and recorded.
本发明在收到录音服务器的启动录音响应之前,语音交换设备将混音处理后的第一媒体流保存在缓存中;在收到录音服务器的启动录音响应之后,语音交换设备将混音处理后的第二媒体流向录音服务器发送;语音交换设备将保存在缓存中的第一媒体流向录音服务器发送,录音服务器接收到第一媒体流和第二媒体流后,对第一媒体流和第二媒体流进行合并录制。通过这种方式,能够防止丢失录音服务器为录音做好准备之前的录音,从而可以获得完整的录音。Before receiving the recording response of the recording server, the voice exchange device saves the first media stream after the mixing process in the cache; after receiving the recording response of the recording server, the voice switching device will process the mixing The second media stream is sent to the recording server; the voice switching device sends the first media stream saved in the cache to the recording server, and after receiving the first media stream and the second media stream, the recording server processes the first media stream and the second media. The stream is combined for recording. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.
参阅图11,图11是本发明语音交换设备又一实施方式的实体结构示意图,该语音交换设备40包括处理器41、与处理器41耦合的存储器42、接收器43以及发送器44。Referring to FIG. 11, FIG. 11 is a schematic diagram showing the physical structure of still another embodiment of the voice switching device of the present invention. The voice switching device 40 includes a processor 41, a memory 42 coupled to the processor 41, a receiver 43, and a transmitter 44.
接收器43用于接收来自第一通信终端的录音请求,所述第一通信终端和其它通信终端通过所述语音交换设备40的接收器43和发送器44进行通信。The receiver 43 is for receiving a recording request from the first communication terminal, and the first communication terminal and the other communication terminal communicate via the receiver 43 and the transmitter 44 of the voice switching device 40.
处理器41在接收器43收到所述录音请求时,对当前接收器43接 收的来自所述第一通信终端的媒体流和来自所述其它通信终端的媒体流进行混音,并控制发送器44将混音生成的媒体流作为第一媒体流保存在存储器42的缓存中。The processor 41 connects to the current receiver 43 when the receiver 43 receives the recording request. The received media stream from the first communication terminal and the media stream from the other communication terminal are mixed, and the transmitter 44 is controlled to save the media stream generated by the mixing as a first media stream in the buffer of the memory 42. .
处理器41将录音服务器加入所述语音交换设备创建的包括所述第一通信终端和所述其它通信终端的会场中。The processor 41 adds the recording server to the conference site that is created by the voice switching device and includes the first communication terminal and the other communication terminal.
发送器44向所述录音服务器发送启动录音请求。The transmitter 44 sends a start recording request to the recording server.
处理器41在接收器43收到所述录音服务器的启动录音响应之后,对当前接收的来自所述第一通信终端的媒体流和来自所述其它通信终端的媒体流进行混音,并控制发送器44将混音生成的媒体流作为第二媒体流发送给所述录音服务器。The processor 41 mixes the currently received media stream from the first communication terminal and the media stream from the other communication terminal after the receiver 43 receives the startup recording response of the recording server, and controls transmission. The processor 44 transmits the media stream generated by the mixing as the second media stream to the recording server.
处理器41获取保存在存储器42的缓存中的第一媒体流,控制发送器44将接收器43收到所述启动录音响应之前保存在所述缓存中的第一媒体流发送给所述录音服务器,以便于所述录音服务器对所述第一媒体流和所述第二媒体流进行合并录制。The processor 41 acquires the first media stream stored in the buffer of the memory 42, and the control transmitter 44 sends the first media stream saved in the cache before the receiver 43 receives the start recording response to the recording server. In order to facilitate the combined recording of the first media stream and the second media stream by the recording server.
本发明在收到录音服务器的启动录音响应之前,语音交换设备将混音处理后生成的第一媒体流保存在缓存中;在收到录音服务器的启动录音响应之后,语音交换设备将混音处理后生成的第二媒体流向录音服务器发送;语音交换设备将保存在缓存中的第一媒体流向录音服务器发送,以便于录音服务器对第一媒体流和第二媒体流进行合并录制。通过这种方式,能够防止丢失录音服务器为录音做好准备之前的录音,从而可以获得完整的录音。Before receiving the recording response of the recording server, the voice exchange device saves the first media stream generated after the mixing process in the cache; after receiving the recording response of the recording server, the voice switching device will perform the mixing processing. The generated second media stream is sent to the recording server; the voice switching device sends the first media stream saved in the cache to the recording server, so that the recording server performs combined recording on the first media stream and the second media stream. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.
参阅图12,图12是本发明录音服务器另一实施方式的结构示意图,该录音服务器50包括:处理器51、与处理器51耦合的存储器52、接收器53以及发送器54。Referring to FIG. 12, FIG. 12 is a block diagram showing another embodiment of a recording server of the present invention. The recording server 50 includes a processor 51, a memory 52 coupled to the processor 51, a receiver 53, and a transmitter 54.
处理器51用于加入语音交换设备创建的包括第一通信终端和其它通信终端的会场中,所述第一通信终端和所述其它通信终端通过所述语音交换设备进行通信;The processor 51 is configured to join a conference site that is created by the voice switching device, including the first communications terminal and the other communications terminal, where the first communications terminal and the other communications terminal communicate through the voice switching device;
接收器53接收所述语音交换设备发送的启动录音请求,发送器54向所述语音交换设备发送启动录音响应; The receiver 53 receives a startup recording request sent by the voice switching device, and the transmitter 54 sends a startup recording response to the voice switching device.
在向所述语音交换设备发送启动录音响应之后,接收器53接收来自所述语音交换设备的第一媒体流和第二媒体流,所述第一媒体流是所述语音交换设备在收到所述启动录音响应之前对来自所述第一通信终端和所述其它通信终端的媒体流进行混音处理后生成的媒体流,所述第二媒体流是所述语音交换设备在收到所述启动录音响应之后对来自所述第一通信终端和所述其它通信终端的媒体流进行混音处理后生成的媒体流;After transmitting a startup recording response to the voice switching device, the receiver 53 receives a first media stream and a second media stream from the voice switching device, the first media stream being the voice switching device receiving the location a media stream generated by performing a mixing process on a media stream from the first communication terminal and the other communication terminal before the recording response is started, the second media stream being the voice switching device receiving the startup a media stream generated after the sound recording process is performed on the media streams from the first communication terminal and the other communication terminals after the recording response;
处理器51对所述第一媒体流和所述第二媒体流进行合并录制。The processor 51 performs combined recording on the first media stream and the second media stream.
本发明录音服务器接收到第一媒体流和第二媒体流后,对第一媒体流和第二媒体流进行合并录制。通过这种方式,能够防止丢失录音服务器为录音做好准备之前的录音,从而可以获得完整的录音。After receiving the first media stream and the second media stream, the recording server of the present invention performs combined recording on the first media stream and the second media stream. In this way, it is possible to prevent the recording of the recording server from being lost before the recording is prepared, so that a complete recording can be obtained.
以上所述仅为本发明的实施方式,并非因此限制本发明的专利范围,凡是利用本发明说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本发明的专利保护范围内。 The above is only the embodiment of the present invention, and is not intended to limit the scope of the invention, and the equivalent structure or equivalent process transformations made by the description of the invention and the drawings are directly or indirectly applied to other related technologies. The fields are all included in the scope of patent protection of the present invention.

Claims (11)

  1. 一种录音方法,其特征在于,所述方法包括:A recording method, characterized in that the method comprises:
    语音交换设备接收来自第一通信终端的录音请求,所述第一通信终端和其它通信终端通过所述语音交换设备进行通信;The voice switching device receives a recording request from the first communication terminal, and the first communication terminal and the other communication terminal communicate through the voice switching device;
    所述语音交换设备在收到所述录音请求时,对当前接收的来自所述第一通信终端的媒体流和来自所述其它通信终端的媒体流进行混音,并将混音生成的媒体流作为第一媒体流保存在缓存中;The voice switching device, when receiving the recording request, mixes a currently received media stream from the first communication terminal and a media stream from the other communication terminal, and mixes the generated media stream Stored in the cache as the first media stream;
    所述语音交换设备将录音服务器加入所述语音交换设备创建的包括所述第一通信终端和所述其它通信终端的会场中;The voice switching device adds a recording server to a conference site that is created by the voice switching device, including the first communication terminal and the other communication terminal;
    所述语音交换设备向所述录音服务器发送启动录音请求;The voice switching device sends a start recording request to the recording server;
    所述语音交换设备在收到所述录音服务器的启动录音响应之后,对当前接收的来自所述第一通信终端的媒体流和来自所述其它通信终端的媒体流进行混音,将混音生成的媒体流作为第二媒体流发送给所述录音服务器,并将收到所述启动录音响应之前保存在所述缓存中的第一媒体流发送给所述录音服务器,以便于所述录音服务器对所述第一媒体流和所述第二媒体流进行合并录制。After receiving the startup recording response of the recording server, the voice switching device mixes the currently received media stream from the first communication terminal and the media stream from the other communication terminal, and generates a mix The media stream is sent to the recording server as a second media stream, and the first media stream saved in the cache before receiving the start recording response is sent to the recording server, so that the recording server is The first media stream and the second media stream are combined and recorded.
  2. 根据权利要求1所述的方法,其特征在于,所述方法还包括:所述语音交换设备在接收到所述录音请求时创建所述缓存。The method of claim 1, further comprising the voice switching device creating the cache upon receiving the recording request.
  3. 根据权利要求1或2所述的方法,其特征在于,所述方法还包括:所述语音交换设备在收到所述启动录音响应时停止将混音生成的媒体流保存在缓存中。The method according to claim 1 or 2, wherein the method further comprises: the voice switching device stops storing the media stream generated by the mixing in the cache upon receiving the start recording response.
  4. 根据权利要求1至3任一项所述的方法,其特征在于,所述第一媒体流中的媒体包带有序号标识,以便于所述录音服务器根据所述序号标识按时间顺序录制所述第一媒体流中的媒体包。The method according to any one of claims 1 to 3, wherein the media package in the first media stream has a serial number identifier, so that the recording server records the chronological order according to the serial number identifier. The media package in the first media stream.
  5. 一种录音方法,其特征在于,所述方法包括:A recording method, characterized in that the method comprises:
    录音服务器加入语音交换设备创建的包括第一通信终端和其它通信终端的会场中,所述第一通信终端和所述其它通信终端通过所述语音交换设备进行通信;The recording server is added to the conference site including the first communication terminal and the other communication terminal created by the voice switching device, and the first communication terminal and the other communication terminal communicate through the voice switching device;
    所述录音服务器接收所述语音交换设备发送的启动录音请求并向 所述语音交换设备发送启动录音响应;Receiving, by the recording server, a startup recording request sent by the voice switching device The voice switching device sends a start recording response;
    在向所述语音交换设备发送启动录音响应之后,录音服务器接收来自所述语音交换设备的第一媒体流和第二媒体流,所述第一媒体流是所述语音交换设备在收到所述启动录音响应之前对来自所述第一通信终端和所述其它通信终端的媒体流进行混音处理后生成的媒体流,所述第二媒体流是所述语音交换设备在收到所述启动录音响应之后对来自所述第一通信终端和所述其它通信终端的媒体流进行混音处理后生成的媒体流;After transmitting a startup recording response to the voice switching device, the recording server receives a first media stream and a second media stream from the voice switching device, the first media stream being the voice switching device receiving the a media stream generated by mixing a media stream from the first communication terminal and the other communication terminal before the recording response is started, the second media stream being the voice switching device receiving the startup recording a media stream generated after the mixing of the media streams from the first communication terminal and the other communication terminals after the response;
    所述录音服务器对所述第一媒体流和所述第二媒体流进行合并录制。The recording server performs combined recording on the first media stream and the second media stream.
  6. 一种语音交换设备,其特征在于,所述语音交换设备包括:接收模块、混音模块、保存模块、加入模块、第一发送模块、第二发送模块以及第三发送模块;A voice switching device, comprising: a receiving module, a mixing module, a saving module, a joining module, a first sending module, a second sending module, and a third sending module;
    所述接收模块用于接收来自第一通信终端的录音请求和来自录音服务器的启动录音响应,所述第一通信终端和其它通信终端通过所述语音交换设备进行通信;The receiving module is configured to receive a recording request from the first communication terminal and a startup recording response from the recording server, where the first communication terminal and the other communication terminal communicate through the voice switching device;
    所述混音模块用于在收到所述录音请求时,将当前接收的来自所述第一通信终端的媒体流和来自所述其它通信终端的媒体流进行混音;The mixing module is configured to mix a currently received media stream from the first communication terminal and a media stream from the other communication terminal when receiving the recording request;
    所述保存模块用于在所述接收模块接收到所述录音请求之后、接收到所述启动录音响应之前将所述混音模块混音生成的媒体流作为第一媒体流保存在缓存中;The saving module is configured to save the media stream generated by the mixing of the mixing module as a first media stream in a cache after the receiving module receives the recording request, and before receiving the starting recording response;
    所述加入模块用于将录音服务器加入所述语音交换设备创建的包括所述第一通信终端和所述其它通信终端的会场中;The joining module is configured to join a recording server to a conference site that is created by the voice switching device, including the first communications terminal and the other communications terminal;
    所述第一发送模块用于向所述录音服务器发送启动录音请求;The first sending module is configured to send a start recording request to the recording server;
    所述第二发送模块用于在所述接收模块接收到所述启动录音响应之后将所述混音模块当前混音生成的媒体流作为第二媒体流发送给所述录音服务器;The second sending module is configured to send the media stream generated by the current mixing of the mixing module to the recording server as a second media stream after the receiving module receives the startup recording response;
    所述第三发送模块用于在所述接收模块接收到所述启动录音响应之后将保存在所述缓存中的第一媒体流向所述录音服务器发送,以便于 所述录音服务器对所述第一媒体流和所述第二媒体流进行合并录制。The third sending module is configured to send the first media stream saved in the cache to the recording server after the receiving module receives the startup recording response, so as to facilitate The recording server performs combined recording on the first media stream and the second media stream.
  7. 根据权利要求6所述的语音交换设备,其特征在于,所述语音交换设备还包括创建模块,所述创建模块用于在接收到所述录音请求时创建所述缓存。The voice switching device according to claim 6, wherein the voice switching device further comprises a creating module, the creating module configured to create the cache when the recording request is received.
  8. 根据权利要求6或7所述的语音交换设备,其特征在于,所述保存模块,还用于在收到所述启动录音响应时停止将混音生成的媒体流保存在缓存中。The voice switching device according to claim 6 or 7, wherein the saving module is further configured to stop saving the media stream generated by the mixing in the cache when receiving the startup recording response.
  9. 根据权利要求6至8任一项所述的语音交换设备,其特征在于,所述第一媒体流中的媒体包带有序号标识,以便于所述录音服务器根据所述序号标识按时间顺序录制所述第一媒体流中的媒体包。The voice switching device according to any one of claims 6 to 8, wherein the media packet in the first media stream has a serial number identifier, so that the recording server records the time sequence according to the serial number identifier. a media package in the first media stream.
  10. 一种录音服务器,其特征在于,所述录音服务器包括:加入模块、第一接收模块、发送模块、第二接收模块、第三接收模块以及合并录制模块;A recording server, comprising: a joining module, a first receiving module, a sending module, a second receiving module, a third receiving module, and a combined recording module;
    所述加入模块用于加入语音交换设备创建的包括第一通信终端和其它通信终端的会场中,所述第一通信终端和所述其它通信终端通过所述语音交换设备进行通信;The joining module is configured to join a conference site that is created by the voice switching device, including the first communications terminal and the other communications terminal, where the first communications terminal and the other communications terminal communicate through the voice switching device;
    所述第一接收模块用于接收所述语音交换设备发送的启动录音请求;The first receiving module is configured to receive a startup recording request sent by the voice switching device;
    所述发送模块用于在所述第一接收模块接收所述语音交换设备发送的启动录音请求后,向所述语音交换设备发送启动录音响应;The sending module is configured to send a start recording response to the voice switching device after the first receiving module receives the startup recording request sent by the voice switching device;
    所述第二接收模块用于在所述发送模块向所述语音交换设备发送启动录音响应之后,接收来自所述语音交换设备的第二媒体流,所述第二媒体流是所述语音交换设备在收到所述启动录音响应之后对来自所述第一通信终端和所述其它通信终端的媒体流进行混音处理后生成的媒体流;The second receiving module is configured to receive a second media stream from the voice switching device after the sending module sends a start recording response to the voice switching device, where the second media stream is the voice switching device a media stream generated by performing a mixing process on a media stream from the first communication terminal and the other communication terminal after receiving the start recording response;
    所述第三接收模块用于在所述发送模块向所述语音交换设备发送启动录音响应之后,接收来自所述语音交换设备的缓存的第一媒体流,所述第一媒体流是所述语音交换设备在收到所述启动录音响应之前对来自所述第一通信终端和所述其它通信终端的媒体流进行混音处理后 生成的媒体流;The third receiving module is configured to receive, after the sending module sends a startup recording response to the voice switching device, a first media stream that is buffered from the voice switching device, where the first media stream is the voice The switching device mixes the media streams from the first communication terminal and the other communication terminals before receiving the start recording response Generated media stream;
    所述合并录制模块用于对所述第二接收模块接收的第二媒体流和所述第三接收模块接收的第一媒体流进行合并录制。The merge recording module is configured to perform combined recording on the second media stream received by the second receiving module and the first media stream received by the third receiving module.
  11. 一种录音系统,其特征在于,所述系统包括:语音交换设备和录音服务器;A recording system, characterized in that the system comprises: a voice switching device and a recording server;
    所述语音交换设备用于接收来自第一通信终端的录音请求,所述第一通信终端和其它通信终端通过所述语音交换设备进行通信;在收到所述录音请求时,对当前接收的来自所述第一通信终端的媒体流和来自所述其它通信终端的媒体流进行混音,并将混音生成的媒体流作为第一媒体流保存在缓存中;将录音服务器加入所述语音交换设备创建的包括所述第一通信终端和所述其它通信终端的会场中;向所述录音服务器发送启动录音请求;在收到所述录音服务器的启动录音响应之后,对当前接收的来自所述第一通信终端的媒体流和来自所述其它通信终端的媒体流进行混音,将混音生成的媒体流作为第二媒体流发送给所述录音服务器,并将收到所述启动录音响应之前保存在所述缓存中的第一媒体流发送给所述录音服务器,以便于所述录音服务器对所述第一媒体流和所述第二媒体流进行合并录制;The voice switching device is configured to receive a recording request from a first communication terminal, where the first communication terminal and other communication terminals communicate through the voice switching device; when the recording request is received, the current received Mixing the media stream of the first communication terminal with the media stream from the other communication terminal, and storing the media stream generated by the mixing as a first media stream in a cache; adding the recording server to the voice switching device Created in the conference site including the first communication terminal and the other communication terminal; sending a startup recording request to the recording server; after receiving the recording response of the recording server, the currently received from the first a media stream of a communication terminal and a media stream from the other communication terminal are mixed, and the media stream generated by the mixing is sent to the recording server as a second media stream, and is saved before receiving the start recording response Transmitting, in the cache, the first media stream to the recording server, so that the recording server is to the first medium Stream and the second stream merge recording media;
    所述录音服务器用于加入语音交换设备创建的包括第一通信终端和其它通信终端的会场中,所述第一通信终端和所述其它通信终端通过所述语音交换设备进行通信;接收所述语音交换设备发送的启动录音请求并向所述语音交换设备发送启动录音响应;在向所述语音交换设备发送启动录音响应之后,接收来自所述语音交换设备的第一媒体流和第二媒体流,所述第一媒体流是所述语音交换设备在收到所述启动录音响应之前对来自所述第一通信终端和所述其它通信终端的媒体流进行混音处理后生成的媒体流,所述第二媒体流是所述语音交换设备在收到所述启动录音响应之后对来自所述第一通信终端和所述其它通信终端的媒体流进行混音处理后生成的媒体流;对所述第一媒体流和所述第二媒体流进行合并录制。 The recording server is configured to join a conference site that includes a first communication terminal and other communication terminals created by the voice switching device, where the first communication terminal and the other communication terminal communicate through the voice switching device; receive the voice Receiving a recording request sent by the switching device and sending a startup recording response to the voice switching device; after transmitting the startup recording response to the voice switching device, receiving the first media stream and the second media stream from the voice switching device, The first media stream is a media stream generated by the voice switching device after performing a mixing process on a media stream from the first communication terminal and the other communication terminal before receiving the start recording response, The second media stream is a media stream generated by the voice switching device after performing the mixing process on the media streams from the first communication terminal and the other communication terminal after receiving the start recording response; A media stream and the second media stream are combined for recording.
PCT/CN2014/089748 2014-02-25 2014-10-29 Recording method, voice exchanging device, recording server, and recording system WO2015127793A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201410065530.4A CN104869106A (en) 2014-02-25 2014-02-25 Sound recording method, voice switching equipment, sound recording server and sound recording system
CN201410065530.4 2014-02-25

Publications (1)

Publication Number Publication Date
WO2015127793A1 true WO2015127793A1 (en) 2015-09-03

Family

ID=53914631

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/089748 WO2015127793A1 (en) 2014-02-25 2014-10-29 Recording method, voice exchanging device, recording server, and recording system

Country Status (2)

Country Link
CN (1) CN104869106A (en)
WO (1) WO2015127793A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107395904A (en) * 2017-07-28 2017-11-24 河北远东通信系统工程有限公司 A kind of distributed call recording method

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6146827B2 (en) * 2015-09-10 2017-06-14 Necプラットフォームズ株式会社 Telephone exchange apparatus and method, and program
CN105162801B (en) * 2015-09-24 2018-03-02 河北远东通信系统工程有限公司 A kind of method for realizing that IP terminals independently record with conferencing
CN105681532B (en) * 2016-03-07 2019-03-12 携程旅游信息技术(上海)有限公司 The recording system and method for call center based on Softpbx
CN107888790B (en) * 2016-09-29 2020-09-08 南京中兴新软件有限责任公司 Recording method and device for teleconference
CN110352591B (en) * 2017-12-05 2021-10-26 Nec平台株式会社 Communication device, communication data recording system, communication method, and program
CN108833727A (en) * 2018-07-04 2018-11-16 广东商路信息科技有限公司 Sound record management method and system based on telephone exchange

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102769633A (en) * 2012-08-03 2012-11-07 深圳中兴网信科技有限公司 Call recording system and call recording method
US8463870B1 (en) * 2000-03-08 2013-06-11 Music Choice Personalized audio system and method
CN103166913A (en) * 2011-12-09 2013-06-19 华为技术有限公司 Recording method of encrypted media streams and swapping control equipment

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8463870B1 (en) * 2000-03-08 2013-06-11 Music Choice Personalized audio system and method
CN103166913A (en) * 2011-12-09 2013-06-19 华为技术有限公司 Recording method of encrypted media streams and swapping control equipment
CN102769633A (en) * 2012-08-03 2012-11-07 深圳中兴网信科技有限公司 Call recording system and call recording method

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107395904A (en) * 2017-07-28 2017-11-24 河北远东通信系统工程有限公司 A kind of distributed call recording method
CN107395904B (en) * 2017-07-28 2019-07-02 河北远东通信系统工程有限公司 A kind of distributed call recording method

Also Published As

Publication number Publication date
CN104869106A (en) 2015-08-26

Similar Documents

Publication Publication Date Title
WO2015127793A1 (en) Recording method, voice exchanging device, recording server, and recording system
TWI551112B (en) Non-transitory tangible machine-readable medium and client device for transitioning between a circuit switched audio call and a video call
WO2017129129A1 (en) Instant call method, device, and system
US20090279455A1 (en) Method, a device and a system for converging ip message
WO2012000347A1 (en) Method, apparatus and sysyem for cross-platform conference convergence
CN109802913B (en) Fusion conference implementation method and device, electronic equipment and readable storage medium
WO2013097457A1 (en) Method, device, and system for realizing voip call in cloud computing environment
WO2015131750A1 (en) Method, device and system for establishing multi-party call based on web rtc
WO2015062454A1 (en) Audio and video call method, device and system
WO2011017889A1 (en) Multimedia conference implementation method and system
WO2012113193A1 (en) Method and system for implementing multiparty call service
KR101589195B1 (en) METHOD AND APPARATUS FOR SEAMLESSlY IMPLEMENTING TRNASFERRING DUAL-PARTY CALL INTO CONFERENCE
WO2016045321A1 (en) Method, device, system and computer storage medium for multi-party call
WO2010069176A1 (en) A method for calling a conference when hard terminals have been bound to pc clients, a login server thereof, a conference server thereof and a pc client thereof
JP2012015797A (en) Telephone exchange device and control method used for telephone terminal and telephone system
US20120089680A1 (en) Communication apparatus, communication system and session control method
CN110943910A (en) WebSocket-based interphone implementation method
CN106331396B (en) Multimedia processing method and system for telephone conference
JP2010081279A (en) Receiving apparatus, transmitting and receiving system, and receiving method
WO2011140744A1 (en) Multimedia conference system in next generation network and realization method thereof
WO2016183774A1 (en) Call recording method, device and system
WO2017000781A1 (en) Video communication method and apparatus
WO2017032108A1 (en) Video communication method, network device, terminal and system
WO2023016172A1 (en) Call processing method, apparatus and system
US20210281474A1 (en) Replacement of collaboration endpoints

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14883572

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14883572

Country of ref document: EP

Kind code of ref document: A1