WO2015062284A1 - 自然表达处理方法、处理及回应方法、设备及系统 - Google Patents
自然表达处理方法、处理及回应方法、设备及系统 Download PDFInfo
- Publication number
- WO2015062284A1 WO2015062284A1 PCT/CN2014/079945 CN2014079945W WO2015062284A1 WO 2015062284 A1 WO2015062284 A1 WO 2015062284A1 CN 2014079945 W CN2014079945 W CN 2014079945W WO 2015062284 A1 WO2015062284 A1 WO 2015062284A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- expression
- standard
- natural
- response
- information
- Prior art date
Links
- 230000014509 gene expression Effects 0.000 title claims abstract description 352
- 238000012545 processing Methods 0.000 title claims abstract description 71
- 238000003672 processing method Methods 0.000 title claims abstract description 31
- 230000004044 response Effects 0.000 title claims description 179
- 238000000034 method Methods 0.000 title claims description 60
- 238000006243 chemical reaction Methods 0.000 claims abstract description 95
- 238000012549 training Methods 0.000 claims description 58
- 241000282414 Homo sapiens Species 0.000 claims description 18
- 238000013473 artificial intelligence Methods 0.000 claims description 12
- 238000012795 verification Methods 0.000 claims description 7
- 238000005520 cutting process Methods 0.000 claims description 6
- 230000009466 transformation Effects 0.000 claims description 4
- 238000013519 translation Methods 0.000 abstract description 81
- 230000014616 translation Effects 0.000 description 77
- 239000003795 chemical substances by application Substances 0.000 description 36
- 238000005516 engineering process Methods 0.000 description 33
- 230000008520 organization Effects 0.000 description 21
- 238000012546 transfer Methods 0.000 description 21
- 230000008569 process Effects 0.000 description 13
- 238000003058 natural language processing Methods 0.000 description 12
- 230000001788 irregular Effects 0.000 description 9
- 238000004458 analytical method Methods 0.000 description 6
- 230000008859 change Effects 0.000 description 5
- 239000000203 mixture Substances 0.000 description 5
- 238000004590 computer program Methods 0.000 description 4
- 230000006870 function Effects 0.000 description 4
- 230000010365 information processing Effects 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 241000282412 Homo Species 0.000 description 3
- 230000008901 benefit Effects 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 230000002452 interceptive effect Effects 0.000 description 3
- 238000004904 shortening Methods 0.000 description 3
- XEEYBQQBJWHFJM-UHFFFAOYSA-N Iron Chemical compound [Fe] XEEYBQQBJWHFJM-UHFFFAOYSA-N 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 238000012937 correction Methods 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 230000003252 repetitive effect Effects 0.000 description 2
- 238000007619 statistical method Methods 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 206010035148 Plague Diseases 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 210000000988 bone and bone Anatomy 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 230000009194 climbing Effects 0.000 description 1
- 230000019771 cognition Effects 0.000 description 1
- 238000007418 data mining Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000008451 emotion Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 229910052742 iron Inorganic materials 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000036651 mood Effects 0.000 description 1
- 230000001459 mortal effect Effects 0.000 description 1
- 230000008447 perception Effects 0.000 description 1
- 235000013550 pizza Nutrition 0.000 description 1
- 208000003580 polydactyly Diseases 0.000 description 1
- 210000004258 portal system Anatomy 0.000 description 1
- 238000013139 quantization Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/903—Querying
- G06F16/9032—Query formulation
- G06F16/90332—Natural language query formulation or dialogue systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/42—Data-driven translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/42—Data-driven translation
- G06F40/47—Machine-assisted translation, e.g. using translation memory
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/58—Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/004—Artificial life, i.e. computing arrangements simulating life
- G06N3/008—Artificial life, i.e. computing arrangements simulating life based on physical entities controlled by simulated intelligence so as to replicate intelligent life forms, e.g. based on robots replicating pets or humans in their appearance or behaviour
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computing arrangements based on specific mathematical models
- G06N7/01—Probabilistic graphical models, e.g. probabilistic networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/22—Interactive procedures; Man-machine interfaces
- G10L17/24—Interactive procedures; Man-machine interfaces the user being prompted to utter a password or a predefined phrase
Definitions
- the present invention relates to an information processing method, and in particular to a processing method for natural expression of humans, a processing and response method for the natural expression, and an information processing device and an information processing system to which the processing and response method are applied.
- Machine translation (MT, Machine Translation, commonly known as “machine turning”) belongs to computational linguistics
- Some current translation tools such as Alta Vista Babelfish, sometimes get understandable translation results, but in order to get more meaningful translation results, it is often necessary to properly edit the input statements to facilitate computer program analysis.
- the purpose of using machine translation by the public may be simply to know the gist of the original sentence or paragraph, not the exact translation.
- machine translation has not yet reached the level that can replace professional (manual) translation, and it has not yet become a formal translation.
- Natural Language Processing is a branch of the field of artificial intelligence and linguistics. In this field, we explore how to deal with and apply natural language. Natural language recognition refers to letting computers "understand” the substantive meaning behind human language.
- Natural language generation systems transform computer data into natural language.
- Natural language understanding systems transform natural language into a more manageable form of computer programs.
- NLP is a very attractive way of human-computer interaction.
- Early language processing systems such as SHRDLU, worked quite well when they were in a limited "building block world" with limited vocabulary sessions. This makes researchers very optimistic about this system.
- SHRDLU high-language processing systems
- researchers very optimistic about this system.
- the system expanded into an ambiguous and uncertain environment filled with the real world, they quickly lost confidence. Understanding the natural language requires extensive knowledge of the external world and the ability to manipulate it, natural language cognition, and is also considered an AI-Complete problem.
- Statistical-based natural language processing uses probabilistic and statistical methods to solve the problems of natural language processing based on grammar rules. Especially for long sentences that are highly ambiguous, there are thousands of possibilities when applying actual grammar analysis. The methods used to deal with these highly ambiguous sentences are often applied to corpora and Markov models.
- the technology of statistical natural language processing is mainly derived from the sub-fields related to learning behavior in artificial intelligence technology - Machine Learning and Data Mining.
- Natural language processing technology has a wide range of applications in practice. For example, interactive voice response and Internet call center systems.
- Interactive Voice Response is a general term for voice-based value-added services based on telephone.
- Many institutions such as banks, credit card centers, telecom operators, etc.
- IVRS Interactive Voice Response System
- Customers can dial a specific phone number, enter the system, and type the appropriate options or profiles according to the system's instructions.
- Preset transactions (such as transfer, change password, change contact phone number, etc.).
- the Internet Call Center System (ICCS) is a new type of call center system that has emerged in recent years. It uses popular instant messaging (IM) technology to allow organizations and their customers to conduct major Internet problems. Text-based real-time communication is applied to the organization's customer service and remote sales. A human agent using ICCS can communicate with more than two customers at the same time.
- IM instant messaging
- the text-based ICC system is a variant of the voice-based IVR system, which is a necessary tool for communication between the organization and its customers (whether customer service or remote sales), both of which require the height of the artificial seat. participate. Therefore, like the IVR system, the ICC system is also difficult to meet the needs of the organization to effectively increase the customer self-service ratio and significantly replace the manual workload.
- the traditional speech recognition technology is based on the lack of accuracy and stability of the speech recognition results, using the keyword search technology, using the "exhaustive method” to do the semantic analysis of speech.
- many speech recognition technology companies have spent a lot of manpower and money on "Transcription” and “Keyword Spotting”, and they have been training voice robots for a long time, but the actual effect It is often far from the ideal effect.
- a natural expression processing method comprising: The natural expression from the user is identified, and some form of language information that can be processed by the computer is obtained; and the recognized language information is converted into a standard expression of the encoded form.
- the standard expression includes a demand code reflecting the user's needs.
- the demand code is represented by a digital code.
- the standard expression further includes a demand parameter that further reflects the specific needs of the user.
- the language information is optionally composed of language information units obtained by cutting and converting the natural expression of the voice form using a modeling tool.
- the language information is optionally composed of one of a phoneme, a word, and a phrase.
- the conversion from the language information to the standard expression is alternatively implemented based on the language information and the standard expressed MT (Machine Translation Machine Translation) training data table.
- information related to the natural expression is obtained at the same time as the natural expression is recognized, and this information is converted into a part of the standard expression.
- a training method for an artificial intelligence robot including: establishing an MT training data table, the MT training data table including: computer-processable language information obtained by converting natural expression, a standard expression of the encoding and a correspondence between the language information and the standard expression; causing the artificial intelligence robot to perform various permutation combinations and standard expressions of elements of the language information already existing in the MT training data table
- the various permutation combinations of the elements are subjected to a loop iterative comparison to find a correspondence between the permutation combination of the language information elements and the permutation combination of the standard expression elements.
- the data of the MT training data table may be imported from an external database, or may be generated and added by manual assisted understanding.
- a natural expression processing method comprising: inputting a natural expression; identifying the natural expression to obtain some form of language information that can be processed by a computer; Determining whether the language information can be converted into a coded standard expression by machine conversion; if it is judged that the required standard expression cannot be obtained by machine conversion, manual conversion processing is performed; and a standard expression of machine conversion or manual conversion is output.
- the determination is a judgment as to whether the robot understands maturity, wherein the judgment of whether the robot understands maturity is based on the evaluation of the robot understanding accuracy rate in a certain time interval. ongoing.
- a natural expression processing and response method including: inputting natural expression; identifying the natural expression, obtaining some form of language information and related expression type information that can be processed by a computer; Whether the recognized natural expression and expression type information can be converted into a coded standard expression by machine conversion; if it is judged that the required standard expression cannot be obtained by machine conversion, manual conversion processing is performed; calling or generating conversion with the machine or The standard response of the manually converted standard entry matches; the generated standard response is output in a manner corresponding to the expression type information.
- the standard response is fixed data pre-stored in a database, or the standard data and variable parameters are stored based on standard responses stored in a database in advance. Respond.
- a natural expression processing and response apparatus comprising: a conversation gateway, a central controller, a MAU workstation, a robot, an expression database, a response database, and a response generator, wherein the conversation gateway receives from The user's natural expression is sent to the central controller for subsequent processing, and the response to the natural expression is sent to the user; the central controller receives the natural expression from the conversation gateway and works with the robot and the MAU workstation to convert the natural expression A standard expression for coding, and a standard response indicating that the response generator generates a standard response corresponding to the standard expression; the robot recognizes the natural expression according to the instruction of the central controller, and obtains some form of language information that can be processed by the computer, and The language information is converted into a standard expression by using an expression database; the MAU workstation presents the recognized natural expression or the natural expression from the user to an external MAU artificial agent, and the MAU artificial agent inputs or selects a standard expression through the MAU workstation.
- the MAU workstation then sends the standard expression to the central controller;
- the expression database is used to store the table view data, including: the language information data related to the natural table, and the standard expression data related to the standard table And data related to the association between the language information and the standard expression;
- the response database stores the response related data, including for invoking The standard response data and/or data used to generate the response;
- the response generator receives instructions from the central controller to generate a response to the user's natural representation by invoking and/or running the data in the response database.
- the central controller updates the database and/or the response database.
- the device further includes a trainer for training the robot to convert the natural expression into a standard expression.
- the conversation gateway further includes an identity authenticator, configured to identify and verify the identity of the user before receiving the natural expression information, where the user identity verification method includes at least the cipher language and Voiceprint recognition.
- a natural expression processing and response system comprising: an intelligent answering device, and a calling device, the user communicates with the smart answering device through the calling device, and the MAU artificial agent operates the smart answering device, wherein
- the intelligent answering device comprises: a conversation gateway, a central controller, a MAU workstation, a robot, an expression database, a response database and a response generator, and the conversation gateway receives the natural expression from the user from the calling device and sends it to the central controller;
- the controller instructs the robot to recognize some form of language information and related expression information that the computer can process from the natural expression, and then instructs the robot to convert the language information and related expression information into a standard expression; if the understanding of the robot is not mature enough, However, the conversion of the standard expression cannot be completed, and the central controller instructs the MAU workstation to prompt the MAU artificial agent to perform manual conversion of the standard expression, and the MAU artificial agent converts the language information and related expression information recognized by the robot into a standard table.
- the intelligent answering device comprises: a
- natural expression can be converted into a coded standard expression, since the conversion of the standard expression converts the semantics of the natural expression into encoding and parameters without precise verbatim translation, It can reduce the accuracy requirements of machine translation, reduce the complexity of the database used to implement expression conversion (machine translation), improve the speed of data query and update, and improve the performance of intelligent processing.
- the relatively simple coding expression can also reduce the workload of manual intervention and improve the efficiency of manual intervention.
- the natural expression processing and response method, device and system can express a fast pointing response by using a standard expression, so that the client does not need to spend a long time traversing complex conventional functions. Menu to find the self-service you need.
- a standardized natural expression-standard expression-standard response database can be established, and the system automatically realizes and responds gradually. And the database can also have the advantages of small granularity, knowledge category, high data fidelity, etc., thereby reducing the difficulty of robot training and shortening the robot intelligence.
- FIG. 1 schematically shows a flow of a natural expression processing method according to an embodiment of the present invention
- FIG. 2 is a flow chart schematically showing a natural expression processing and response method according to an embodiment of the present invention
- FIG. 3 is a schematic illustration of an intelligent response system in accordance with an embodiment of the present invention.
- Figure 4 further illustrates a portion of the structure of the intelligent answering device of the system of Figure 3;
- Fig. 5 schematically shows an example of an operation interface presented to a human agent by a MAU workstation
- Figure 6 shows an example of identifying voice information
- Fig. 7 shows an example of converting a collected sound wave into an X element using a Gaussian mixture model
- Fig. 8 shows an example of conversion from a collected sound wave (A language information) to Y language information
- machine translation As mentioned earlier, in such applications, the requirement for machine translation is not literally exact, but rather the need to translate the customer's natural expression into information that the system can understand, thus providing the customer with a response corresponding to their expression. That is to say, the machine translation here focuses on the understanding of the substantive meaning behind the human language, so that the actual intention or demand of the customer is "understood" from the natural expression in a form that is easier to handle in a computer program.
- the natural expression from the user is recognized or converted, and some form of language information that can be processed by the computer is obtained, and then the recognized language information is converted into a standard of some form. expression.
- a language information we call it "physical layer language information", hereinafter also referred to as "A language information”.
- a language information we call it "physical layer language information", hereinafter also referred to as "A language information”.
- a language information Through a certain modeling tool, basic automatic recognition or conversion is performed to obtain a first logical layer language (hereinafter referred to as "X language”) information expressed in a combination of several basic elements (hereinafter referred to as "X elements").
- Y language information A standard expression of some form generated by re-converting X language information obtained or converted by A language information
- Human natural expression methods are various.
- the natural expression from customers that is, "A language information” can be divided into the following four categories: text information, voice information, image information, and animation information.
- the text information expression can be: The customer expresses himself by inputting text through the keyboard. For example, the customer types "How much money is in my savings account?" on the Internet channel call center user interface of a bank; the image information expression can be: Customers express themselves through images, for example, customer access Through the computer desktop screen capture tool, the error information of a certain software will be used to express the problem encountered by the image; the voice information expression can be: The customer expresses himself by speaking, for example, the customer and a bank's service hotline (telephone) The channel call center) customer service commissioner talks, during the phone said: "What do you mean by that? I don't understand too much"; animation (or "video”) information can be: Customers shake their heads in front of the camera to express I don't agree.
- the customer's natural expression is automatically recognized or converted to obtain information expressed in a certain language.
- the A language information is voice information
- the sound wave waveform information can be collected by the modeling tool and automatically recognized or converted into a certain (corresponding to voice information) X language by the system (smart robot);
- the A language information is a graphic Information, for example, the graphical pixel information can be collected by the modeling tool and automatically recognized or converted to (in the image information) X language by the system (smart robot);
- the A language information is animation information, for example, by modeling The tool collects graphic pixel information and image change speed information and automatically recognizes or converts to (in correspondence with the animation information information) X language by the system (smart robot); if the A language information is text information, no conversion is required.
- the X language information automatically converted from the A language information or the text information that is not converted is "translated” into a regularized standard expression (Y language information) that the computer or other processing device can "understand".
- the Y language information can be automatically processed by the computer business system.
- the regularized standard expression can be implemented with regularized coding
- Y language information For example, the following coding methods are used, which include industry code, industry service code, organization code, agency business code, and express information code.
- Dialect code (3 digits 1-999)
- the industry code indicates the industry in which the service provider is directed by the irregular natural expression (A language information) from the customer.
- a language information For example, it can be represented by 2 English letters, which can cover 676 industries, optionally, increase A sub-industry code of 3 English letters can add 17576 sub-industries covering each industry.
- the code can basically cover all common industries;
- the industry business code represents the service demand pointed to by the customer's A language information, and can also be represented by multiple digits, for example, encoding with 10 digits, can cover More industry business areas;
- the organization code indicates the entity that provides the service from the customer's A language information, for example, can identify the country and city where the organization is located;
- the agency business code indicates the internal personalized business division of the service provider. It is convenient for the organization to carry out personalized internal management;
- the expression information code indicates the identification information of the customer's A language information itself, which may include the type of information, the type of language, etc., represented by numbers and letters.
- Example 1 FSBNK27100000000860109558800000000000000000002zh-CN003 where,
- the agency code is,
- the agency business code is,
- the A language information corresponding to the Y language information may be, for example, "My credit card amount is too small”, “I want to increase my credit card amount”, “I want to reduce my credit card limit”, “I need to adjust the credit card limit” and other voice messages.
- the industry code, agency code, and agency business code described above may be preset as system defaults. That is to say, the business code and the expression information code can be obtained only from the A language information provided by the client, in which case the Y language information can be expressed as "271000000002zh-CN003"; or, if it is 3 bits for a specific application If the number indicates that the industry business code is sufficient, it can be further expressed as "27102zh-CN003"; further, if it is only for voice service, it can be expressed as "271zh-CN003"; if only the customer's demand expression is considered, care is not concerned. Express your own type letter Interest, even with "271".
- Example 2 TVTKT11200000000014047730305000000000001240003fr-CH000
- TKT Ticketing Ticketing (Sub-Industry)
- ⁇ 1120000000 Level 1 industry business area 1 (Airline ticket) Secondary industry business category 1 (Air ticket change) Level 3 industry business category 1 (Deferred) 0000000 (No more subdivision)
- the A language information corresponding to the Y language information is obtained by image recognition.
- the above-mentioned industry code and organization code can be preset as the system default value.
- the Y language information can be expressed as "11200000001240003fr-CH000"; if only the customer's demand expression is considered, and the type information of the expression itself is not concerned, only "112000000012400" can be used; if it is for a specific application 3
- the digits represent the industry business code, and the three digits represent the agency business code, which can only be represented by "112124".
- the natural expression from the customer often reflects the specific needs of the customer.
- the customer's A language information is automatically converted into X language information or language information without conversion (when A language information is text)
- the X language information or the text language information is then converted into a standard expression (Y language information) of the encoded form.
- the Y language information may include industry code, industry business code, agency code, agency business code, and expression information generation. code.
- the A language information may also include specific parameters (which may be referred to as "demand parameters") in the context of customer requirements, such as: “Transfer 5000 to Zhang San” (Example 1), "I want to see a The movie, called “Chinese Partner” (Example 2) and so on.
- a particular set of demand codes (eg, including one or more of the aforementioned industry code, industry business code, agency code, agency business code, and express information code) corresponds to a particular set of parameters.
- the demand code of "watching movie” is 123
- the corresponding parameter set may include parameters: movie name.
- the Y language information corresponding to this A language information is "123 ⁇ Chinese Partner".
- 123 is the requirement code
- the five Chinese characters in ⁇ > are the demand parameters.
- the foregoing process of converting a customer's A language information into a language-formed information that can be processed by a computer can be realized by a voice signal processing technology, a voice recognition technology, an image recognition technology, and a video processing technology. These technologies may be existing. Technology.
- the coding standard expression idea according to an embodiment of the present invention can also be applied to the recognition process of natural expression.
- Fig. 6 exemplarily shows the processing of voice information.
- the processing from A language to D language is realized.
- the correspondence between the "X language” information and the "A language” information in Fig. 6, and the correspondence between the "X language” information and the "B language” information only serve as a demonstration.
- the A language, sound wave is the physical layer data collected by a sonic collection device such as a microphone.
- the X language is the first logical layer data obtained after performing speech signal processing on the A language data, and is referred to as "X language" in the present invention.
- the X language is a language formed by various combinations of X elements.
- the X element is a system that uses a modeling tool such as the Gaussian Mixture Model (GMM) to automatically cut the sound waves into different columnar elements of different heights.
- GMM Gaussian Mixture Model
- Figure 7 shows an example of converting a collected sound wave (represented by a histogram) into an X element (represented by a vector quantization histogram) using a Gaussian mixture model.
- the number of X elements can be controlled within a certain range (for example, below 200).
- a combination of 2-bit ASCII characters is defined as an ID of an X element, as shown in FIG.
- the cut acoustic unit and the X element are - corresponding, Since the A language information can be regarded as a combination of sound wave units, and the X language information is a combination of X elements, the conversion (or "recognition") relationship from the A language to the X language in FIG. 6 is a "many-to-many" relationship.
- An example of an X element represented by an ASCII character is shown in FIG.
- B language is a language formed by various arrangement and combination of B elements, and is the second logical layer data in Fig. 6. All or part of the X elements are arranged in combination to form the B element, so it can be understood that the X language is converted into the B element, and the B element constitutes the B language. Therefore, the conversion relationship from X language to B language is also a "many-to-many" relationship.
- the B element can be a phoneme, and some of the B elements are combined to form a syllable.
- the "phoneme” and “syllable” here have the same meaning in terms of linguistics.
- An example of the B element is shown in Figure 6, and these examples are Chinese (Chinese) phonemes.
- the "C language” is a language formed by various combinations of C elements, and is the third logical layer data in Fig. 6. All or part of the B element is arranged in combination to form a C element, so it can be understood that the B language is converted into a C element, and the C element constitutes the C language. Therefore, the conversion relationship from B language to C language is also a "many-to-many" relationship. If the linguistic system of phonemes and syllables is used, the C element corresponds to the "word" in natural language. An example of the C element is shown in Figure 6, and these examples are Chinese words.
- the "D language” is a language formed by various combinations of D elements, which is the fourth logical layer data in Fig. 6. All or part of the C elements are arranged in combination to form a D element, so it can also be understood that the C language is converted into a D element, and the D element constitutes a D language. Therefore, the conversion relationship from C language to D language is also a "many-to-many" relationship. If the linguistic system of phonemes, syllables, and words is used, the D element corresponds to the "word” or "phrase” in natural language. An example of the D element is shown in Figure 6, and these examples are Chinese words.
- the "Y language” is the fifth logical layer data (as shown in FIG. 8), and refers to the language information that reflects the "meaning” or “meaning” obtained by understanding the original natural language information A.
- the "standard expression” of righteousness is a form of "Y language”. According to an embodiment of the present invention, for example: the banking industry can use the service code "21" to represent the meaning of "credit card loss”;
- Fig. 9 schematically shows a layer-by-layer conversion process from the collected sound waves (A language information) to the Y language information.
- Sound wave A language information
- X element X language information
- phoneme B language information
- word C language information
- D language information D To “word” (D language information D)
- meaning or “meaning” (Y language information)
- Y language information it is five conversions (translation) of information in six languages.
- the path information of the arrangement and combination of the elements of the five languages is selected to find or correspond to the information data of the sixth language, that is, the target language information. Y.
- the robot also has the ability to convert these five information languages.
- these five-step conversions can be divided into three phases. In these three phases, in order to train the voice robot, manual assisted recognition is required.
- the first stage From linguistic information (sound waves) to C language information (words).
- the two-step conversion from A language information (sound wave) to B language information (phoneme) is generally automatically performed by the robot more accurately due to the information extraction and conversion algorithm (such as the Gaussian mixture model described above) by means of the language information X.
- a conversion from B language information (phoneme) to C language information (word) may result in a higher error rate.
- the original language information input by the customer is "Pizza auction is finished”. Due to the customer's pronunciation or accent, "Table tennis” may be identified as "flat”.
- transfer transcription I
- a language information A language information
- C language information A language (sound wave) language for the robot (word) Conversion/translation relationship.
- the second stage from C language information (words) to D language information (words, phrases).
- words C language information
- words, phrases D language information
- the conversion from word to word is also ambiguous.
- the recognition from sound wave to word is accurate, the result of the order of "table tennis auction finished” is obtained, but it will still be converted into at least “table bat.” + Selling + finished “and “table tennis + auction + finished”
- the two results the meaning is obviously different.
- manual assisted identification can be used to correct it.
- Keyword Spotting The manual auxiliary recognition at this stage is called Keyword Spotting, and can also be referred to as "cut word”, which is the combination of “words” (C language information) that the word-cutting person will transfer to form “words (key Word)” (D language information), that is, the conversion/translation relationship of the C language (word) language (word) is defined for the robot. Whether the word is accurate or not depends on the degree of mastery of the business knowledge. For different fields, people who need to be familiar with the business content and terminology of the field will be able to perform word-cutting operations, and the cost will be improved compared with the transfer.
- the third stage From D language information to Y language information, that is, meaning understanding. Only words that are arranged in a certain order often do not accurately understand the true meaning of the customer. For example, if the customer says “My credit card is gone”, the robot can't recognize its meaning, and the technician puts “my”, “credit card”, and “disappeared” as new keywords into the database's grammar table; The customer said: “The sly brush is lost”, the robot can not recognize its meaning, the technician will use “ ⁇ ", “brush card” (meaning “credit card”), "lost” as the new keyword. Put it in the syntax table of the database. In this way, the meaning or needs of the customer are understood through manual assistance and summarized into the database.
- Keyword Pile-up This kind of artificial auxiliary identification is called Keyword Pile-up, which is simply referred to as "heap word”, which is the combination of accumulation of "words” and is included in the database according to its meaning.
- Heap word This kind of artificial auxiliary identification is called Keyword Pile-up, which is simply referred to as "heap word”, which is the combination of accumulation of "words” and is included in the database according to its meaning.
- the workload of this work is also enormous, and the professional knowledge of the trainers is also needed to aid understanding.
- the natural expression of the client is automatically converted to obtain the X language information, or the C language information is directly obtained without conversion (when the A language information is text) When the information is); then convert the X language information or C language information into Y language information.
- the random natural expression may be X language One of message information, B language information, C language information, and D language information. That is to say, the process of natural expression processing may be one of: A ⁇ X ⁇ Y, A ⁇ B ⁇ Y, A ⁇ C ⁇ Y, A ⁇ D ⁇ Y.
- the non-regular natural expression information such as text, voice, graphics, and video is first converted into X language information by using a modeling tool; then, the X language is used as the left language, and the Y language is used as the right language.
- the conversion of X language information to Y language information is realized by using machine translation (MT) technology.
- the A language is automatically converted/translated into X language (based on the current "speech signal processing” by using “Speech Signal Processing” technology. "Technology, AX conversion accuracy is generally as high as 95% or more, and the improved “voice signal processing” technology is better in noise reduction, which can increase the conversion accuracy of A ⁇ X to over 99%);
- the machine translation technology is then used to implement automatic machine translation of XY without the need for multi-layer conversion by XBCDY.
- a machine translation algorithm similar to statistical analysis based on an instance sample can be utilized to convert the transformed random natural expression (X language information) into a regularized standard expression (Y language information).
- This machine translation algorithm requires that the amount of data between the X and Y languages is large enough and accurate enough.
- the solution of the present invention provides a new artificial agent working mode of MAU (Mortal Aided Understanding), and realizes the corresponding data accumulation between the A language and the Y language through manual understanding and code input.
- MAU Total Aided Understanding
- the credit card is missing the "natural message" of "or” lost the card.
- This simple code input method turns the traditional "speaking agent” into a “non-speaking agent”, which makes the work of the agent more comfortable, greatly improves the work efficiency, and makes full use of the understanding of the highest value of human beings.
- the ability to accurately and quickly collect the corresponding data of A/X language and Y language provide the MT engine with loop iteration, self-learning A/XY conversion/translation rules, and form A/XY translation model.
- Machine translation is an artificial intelligence technology used to automatically translate two languages.
- language as used herein is not a narrow national language (for example: Chinese, English 7), but a generalized representation of information. As mentioned earlier, languages can be divided into four categories: text, voice, image, animation (or "video").
- a language is information formed by a combination of various elements in an element set.
- English text is a language formed by a combination of various one-dimensional (serial) arrangements of 128 ASCII characters (elements) in the ASCII character set (element set);
- the Chinese language is the national standard code. Thousands of Chinese characters in the field are combined with punctuation marks (the basic elements constituting Chinese information).
- RGB plane images are composed of three sub-pixels of red, green and blue, through various two-dimensional (long Wide) another language formed by arranging combinations.
- the data structure of the two tables is similar: the stored data is a pair of pairs, the left value is “left” “” (or “source language”), the right value is “right language” (or “target language”).
- training data table is a textbook that humans teach themselves to MT robots.
- test data table is a question that humans give to the MT robot to evaluate the robot's self-learning effect.
- the MT robot is an iterative loop that is arranged and combined in units of elements constituting the language.
- the 15 ASCII character elements of English “ May I have your” are found by training the two pairs of data in #3 and #4 in the data table (3 English letters "May” + 1 space + 1 English)
- the arrangement of the Chinese national "I ask you” is the arrangement of the three Chinese characters.
- the English "age” arrangement of the three ASCII character elements corresponds to the arrangement of the Chinese characters "age” in the two national standard codes.
- the robot can accurately translate the English "May I have your age?” in the test data sheet into Chinese "Is your age?”, then the robot has learned the Chinese-English translation of this sentence. If not, then prove that the robot has not learned. Then the robot needs to correct its own learning. Learning methods (for example, looking for another path to try to learn again), re-digesting the training data table once, this is another iteration; ... so repeating this "iteration correction", which makes the robot Translation accuracy continues to climb. When climbing to a certain level (for example, the translation accuracy rate is 70%), the translation accuracy of the robot may remain at this level, and it is difficult to go up again, that is, it encounters the bottleneck of "machine self-learning". Then you need to add MT training data table data to the robot.
- the data of the MT training data table can be imported from an external database or generated and added by "human assisted understanding".
- a new natural expression instance such as the above-mentioned natural expression "My credit card can be overdrawn too much", and its corresponding standard expression “271 "Add to the existing MT training data table to increase and update the MT training data table data.
- the MT training data table data can be efficiently added and updated, so that the data in the system MT training data table is richer and more accurate, and the translation (conversion) accuracy of the robot can be efficiently obtained. Upgrade.
- the MT robot needs to exhaustively list all the permutations and combinations of the #3 ASCII character elements of #3's lvalue "May I have your time”, and also need the right value for #3. "What time is it now?" All the permutations and combinations of the 10 national standard codes are exhausted. That is, the MT robot needs to exhaustively list all the permutations and combinations of the left and right sets of elements of each pair of data in the training data table. Through this exhaustion of elemental level, MT robots will find many repetitive permutations (such as "your", “May I have your” . “age” . "time” .
- the machine translation between the X language languages in the present invention is the same as the machine translation principle in Chinese and English, except that we changed the English to the X language, the Chinese language to the Y language, and the elements of the two languages.
- the set is different.
- machine translation techniques can be used to automatically translate one language into another.
- the technical principle is to perform basic element level analysis on the paired information (left language and right language) of the two languages collected, and iteratively compares various arrangement and combination of basic elements of a large number of language information pairs to find The translation/translation rules between the two languages form a translation model for the two languages.
- the technology of the present invention automatically converts the application of machine translation technology from the translation of languages between different countries to the automatic conversion of all non-regular multimedia natural expression information (text, voice, image, video, ie A language information) into
- the regularization standard information (Y language information) is described so that the business systems of various industries can process them, thereby realizing a practical and practical NLP (Natural Language Processing).
- the natural expression processing according to the embodiment of the present invention can be restricted to specific services of a specific industry organization.
- the size of the training data table required by the processing system can be greatly reduced, thereby improving While the robot understands the mature threshold, it reduces the cost of building and maintaining the training data table, and it can also effectively shorten the maturity cycle of the ⁇ / ⁇ ⁇ translation model.
- the natural expression processing system according to the embodiment of the present invention realizes the conversion from the natural expression to the standard expression of the encoding.
- the basis of the conversion is an MT training data table storing A/X language information and Y language information paired data, and a translation model of A/X ⁇ Y obtained based on the MT training data table.
- Fig. 1 schematically shows the flow of a natural expression processing method in accordance with one embodiment of the present invention.
- step S11 the system receives natural expression information (A language information) which, as previously described, may be text information, voice information, image information, video information, or the like.
- a language information which, as previously described, may be text information, voice information, image information, video information, or the like.
- step S21 it is judged whether the understanding ability of the robot is mature.
- the judgment of whether the robot understands maturity is based on the result that the robot converts the A language information into the X language information and then converts the X language information into the Y language information in a certain time interval (set according to specific application requirements).
- the same number of times Y1 and Y2 are divided by the total number of comparisons, and the percentage obtained is the robot understanding accuracy.
- the robot understands the accuracy according to the application needs, which we call "the robot understands the mature threshold".
- the system thinks that the robot is not mature enough to use the robot conversion result Yl, and continues to use the manual conversion result ⁇ 2 to ensure the system understands the linguistic information accurately. And stable.
- the system adds the A language information through the machine's automatically converted X language information (left language), and the manual conversion result Y2 (right language) to the MT training data table for the MT robot to self-train.
- the robot automatically converts the natural expression A directly into the standard expression Y in step S22; if the robot understands that it is not yet mature, the robot attempts to convert the natural expression A into the standard expression Y1 in step S23. At the same time, the natural expression ⁇ is converted to the standard expression Y2 by the MAU agent at step S24.
- step S32 if it is judged in step S21 that the understanding ability of the robot is mature, the result Y automatically converted by the robot is output; otherwise, the result Y2 of the manual conversion of the MAU agent is output.
- step S31 the following process is performed on the natural expression A, the result of the robot attempting to convert Y1, and the result Y2 of the MAU agent manual conversion: automatically converting A into X language information (Left language) Together with Y2 (right language), it is placed in the training data table as a pair of new pairing data; Y1 and ⁇ 2 are compared and used as statistics for "judge whether the robot understands maturity".
- the original data is retained, and when the future AX conversion technology is further developed (the conversion accuracy is higher), the left language data of the MT training data table is updated.
- Fig. 2 schematically shows the flow of a natural expression processing and response method in accordance with one embodiment of the present invention.
- step S31 the natural expression A is received at step S12. It is then judged at step S31 whether or not the natural expression A can be converted into the standard expression Y by machine conversion. This step is equivalent to step S21 in Fig. 1. Similar to the processing of Fig. 1, when it is judged at step S31 that the desired standard expression cannot be obtained by machine conversion, the manual conversion processing is performed at step S32.
- step S33 a response prompting the customer to re-enter is made in step S33, and then the process returns to step S12 to receive the client.
- the natural expression information A is input again.
- "Respond to the customer to re-enter the response” can be, for example, the voice prompt "Sorry, please tell us about your needs again", "Please speak slowly”; text prompt "Sorry, please write specific”; Or image prompts, etc.
- a standard expression of machine conversion or manual conversion is output at step S34.
- a standard response matching the standard expression is queried in step S35.
- the standard response can be fixed data pre-stored in the database, or the basic data of the standard response stored in the database in advance, and then run through the system to synthesize the basic data and the case variable parameters to generate a standard response.
- a standard response ID is set as the primary key of the response data
- a correspondence table of the standard expression (Y language information) requirement code and the standard response ID is set in the database, thereby expressing the standard (Y language information).
- the demand code is associated with the response data. Tables 1 to 3 below schematically show examples of the expression data table, the expression response relationship table, and the response data table, respectively.
- the standard expression and the standard response ID may be in a many-to-one relationship, as shown in Table 4.
- the demand code of the standard expression (Y language information) is itself encoded
- the requirement code of the standard expression (Y language information) can also be directly used as the primary key of the response data.
- standard expressions can include information related to natural forms, such as expression type, language type, dialect type, and so on.
- the natural expression from the customer is the voice "received”, and the standard response to the query is converted to the voice "good, know, thank you!; also for example, the natural expression from the customer is the image "transfer failure page” Screen capture", through the converted standard expression query, the standard response is the video "Transfer Error Correction tutorial”.
- step S36 If there is no standard response in the database that matches the standard expression, then the corresponding response can be manually matched in step S36.
- Manual matching can associate a standard expression with the standard response ID by entering or selecting a standard response ID, or directly associate the standard expression with the response data, and can also create new response data. The reason for not finding a standard response may be that the standard expression was newly added by hand, or it may be because there is no standard response matching the same type. Then, a machine matching or a manually matched response is output in step S37. The content of the response can be called or generated according to different types of information.
- the text message "Transfer 5000 to my mom” needs to be operated by the program to "transfer 5000 yuan to Ms. X", but the system may not pre-master the account information of "Ms. X".
- the account information needs to be manually added to achieve the conversion of the standard expression.
- the corresponding standard response may not be queried, and the response process needs to be manually performed.
- new response data (such as an operating program) is generated, and a new standard response ID is manually or automatically assigned to the response data, and the standard response ID is associated with the standard table of the above conversion.
- the natural expression processing and response method according to an embodiment of the present invention can be quickly expressed by using standard expression Pointing to the response, so customers don't have to spend a lot of time traversing the complex regular menus to find the self-service they need.
- manual operations are mainly limited to "decision" work in the background, including determining standard expression (Y language information) requirement code, selecting response (or response ID), or generating response operations, etc., but not required Communicate directly with the customer at the front desk by means of a call or text input (except for the input standard expression (Y language information) requirement parameter).
- This can save a lot of human resources and greatly improve work efficiency.
- the standardized response provided by the system to the customer is not affected by many factors such as emotion, voice, accent, business proficiency and other factors of the agent, as compared with the traditional free-form response provided by the agent directly to the client. Stability.
- the natural expression data in the database can also have the advantages of small granularity, narrow business scope, high data fidelity, etc., thereby reducing the difficulty of robot training and shortening the mature period of robot intelligence.
- Fig. 3 schematically shows an intelligent response system in accordance with an embodiment of the present invention.
- the intelligent response system includes an intelligent answering device 1 (equivalent to a server end), and a calling device 2 (equivalent to a client).
- the client 8 communicates with the smart answering device 1 through the calling device 2, and the MAU artificial seat 9 (System service personnel) Manually operate the smart answering device 1.
- the intelligent answering device 1 includes a conversation gateway 11, a central controller 12, a MAU workstation 13, and a robot 14.
- the smart answering device 1 further includes a trainer 15.
- Customer 8 refers to the object of the organization's remote sales and remote services.
- Remote sales usually refer to the initiative to contact customers in the form of "outgoing” through their own proprietary telephone or Internet channels, trying to sell their products and services.
- Remote service usually means that the organization's customers actively contact the organization in the form of "incoming call” through the organization's exclusive telephone or Internet channel, asking or using the organization's products and services.
- the calling device 2 may be a dedicated telephone channel or Internet channel established by the organization for remote sales (outbound service) to the customer 8 and remote service (incoming service) to the customer.
- Telephone channel call systems such as Automatic Call Distribution (ACD) (eg, Avaya's ACD), are automated business systems that are passed through the back office (eg, traditional IVR systems based on telephone button technology, or based on intelligent voice technology).
- ACD Automatic Call Distribution
- New VP Voice Portal
- a portal system and a human agent a dialogue channel that interacts with the client 8 in a voice form.
- Internet channel calling systems such as the Internet Call Center (ICC) based on Instant Messaging (IM) technology
- ICC Internet Call Center
- IM Instant Messaging
- customer self-service systems eg, Natural Language Processing (NLP)
- NLP Natural Language Processing
- the intelligent answering device 1 enables the organization to control the automatic business system and the artificial agent in the background, and the dialogue with the client 8 in the form of multimedia such as text, voice, image, video, etc., thereby realizing standardization and automatic interaction between the organization and the client. dialogue.
- the conversation gateway 11 plays the role of "pre-portal" in the intelligent answering device 1, and the main functions include: receiving the irregular natural expression (by text, voice, image, video) and regularization from the client 8 via the calling device 2
- the expression (such as in the form of a telephone keyboard button) is sent to the central controller 12 for subsequent processing; receiving instructions from the central controller 12 to respond to the expression of the client 8 (in the form of text, voice, images, video, programs, etc.) ).
- the conversation gateway 11 includes an expression receiver 111, a body ID 112, a response database 113, and a response generator 114.
- the expression receiver 111 receives the expression from the client 8 through the calling device 2.
- the expression may be the various irregular natural expressions and regularized unnatural expressions described above.
- an identity authenticator 112 is provided prior to expressing the receiver 111.
- the identity authenticator 112 can identify and verify the identity of the client 8 during the initial phase of the conversation.
- You can use the traditional "password input” technology such as: phone key to enter password, keyboard input website login password, etc.); also use the new "pass-phrase” + voice-print (Voice-print) identification "Technology; you can also mix and match the above two technologies.
- password authentication technology is inconvenient, it has been widely accepted and used by the market.
- the response database 113 stores response data for responding to the customer. Similar to the examples in the above table, the data can include the following types:
- Text Pre-written text, for example, the text answer in the online banking FAQ (Frequently Asked Questions).
- Image Prefabricated image, for example, Beijing subway network map. Also included are non-video animations, such as: Banks introduce customers to GIF files, FLASH files, etc. for international money transfer operations in online banking systems.
- Video Prefabricated videos, for example, electric iron suppliers show customers how to use their new products.
- Template Text, voice, image, program template that can be filled with variables.
- the response generator 114 receives the central controller 12 command to generate a response to the client 8 expression by invoking and/or running the data in the response database 113.
- the response ID may be queried according to the standard in the instruction
- the response data may be queried from the response database 113, or the text, image, or voice, video, or program may be played; or the template may be invoked in the database 113 according to the instruction.
- fill in the variable parameters transmitted in the instruction, or play the TTS speech synthesis generated in real time for example, "You have successfully repaid the credit card 5000 yuan.”", "5000 yuan" is the variable in the instruction or displays a paragraph of text, or Display a real-time generated picture or animation, or execute a program.
- central controller 12 may maintain and update data in response database 113, including response data, standard response IDs, and the like.
- the central controller 12 receives customer demand expression information from the expression receiver 111 (including: irregular natural expression and regularized unnatural expression), and cooperates with the robot 14 and the MAU workstation 13 via the MAU workstation 13 to thereby
- the irregular natural expression information is converted into a standard expression according to the foregoing method, and the standard response ID corresponding thereto is determined according to the standard expression,
- the standard response ID is then sent to the response generator 114.
- central controller 12 may update the data in the MT training data table.
- the robot 14 is an application robot that implements the above-described artificial intelligence technology.
- the robot 14 can perform conversion of natural expression (A-language information) such as text information, voice information, image information, and video information to obtain a standard expression (Y language information).
- A-language information such as text information, voice information, image information, and video information
- Y language information a standard expression
- the MT training data table can be set in the robot 14 or an external database, and the demand code of the standard expression data (right language) stored therein can be associated with the standard response ID.
- This database can be updated by the central controller 12.
- the database for text translation, speech recognition, image recognition, video processing, and the like may be an external database or may be provided in the robot 14.
- the MAU workstation 13 is an interface between the smart answering device 1 and the MAU human agent 9.
- the MAU workstation 13 presents the identified natural representation or customer original expression to the MAU artificial agent 9.
- the MAU artificial seat 9 inputs or selects a standard expression through the MAU workstation 13, and then the MAU workstation 13 sends the standard expression to the central controller 12.
- the MAU artificial agent 9 inputs or selects a response (or standard response ID) through the MAU workstation 13.
- a trainer 15 may also be included in the smart answering device 1.
- the trainer 15 is used to train the robot 14 to convert natural expressions into standard expressions.
- the trainer 15 uses the judgment result of the MAU artificial seat 9 to train the robot 11, and continuously improves the robot comprehension correctness rate of the robot 11 in various domains (for example, the aforementioned business scope and sub-service category, etc.).
- the trainer 15 compares the standard expression conversion result of the MAU artificial seat 9 with the standard expression conversion result of the robot 11, If the results are the same, the category "the number of robot judgments” and the “number of robot judgments” are increased accordingly; otherwise, the manual conversion result is added to the MT training data table as new robot training data.
- the trainer 15 can also indicate the robot. 14 Perform the aforementioned "self-learning”.
- the trainer 15 can also be used to train the robot 14 with artificial intelligence techniques such as text translation, speech recognition, image recognition, video processing, and the like.
- the trainer 15 can also perform dimensions on the MT training data table, the database for text translation, speech recognition, image recognition, and video processing. Protection and renewal.
- the trainer 15 can also be integrated with the central controller 12.
- the response generator 114 and the response database 113 may be independent of the conversation gateway 11, or may be integrated in the central controller 12.
- the intelligent response device 1 can implement the aforementioned natural expression processing and response method.
- the conversation gateway 11 receives the irregular natural expression information from the client 8 from the calling device 2 through the expression receiver 111 and transmits it to the central controller 12; the central controller 12 instructs the robot 11 to recognize the irregular natural expression information.
- Some form of language information and related expression information that can be processed by the computer, and then instructing the robot 11 to convert the language information and related expression information into a standard expression; if the understanding of the robot 11 is not mature enough or the corpus is not matched, If the conversion of the standard expression cannot be completed, the central controller 12 instructs the MAU workstation 13 to prompt the MAU artificial agent 9 to perform manual conversion of the standard expression; the MAU artificial agent 9 converts the language information and related expression information recognized by the robot 11 into a standard expression, and passes The MAU workstation 13 inputs and sends to the central controller 12.
- the MAU artificial agent 9 can directly convert the unrecognized irregular natural expression information into a standard expression; the central controller 12 queries the expression-response database, and retrieves the Standard expression matching standard response ID, if there is no matching result, then the MAU workstation 13 prompts the MAU artificial agent 9 to select the standard response and input the corresponding standard response ID. Alternatively, the MAU artificial agent 9 can directly associate the standard expression with the response data.
- the central controller 12 instructs the response generator 114 to invoke and/or run the data in the response database 113 to generate a response to the representation of the client 8; then, the conversation gateway 11 feeds the response back to the calling device 2 Client 8;
- the central controller 12 maintains and updates the MT training data table or response database, respectively, based on standard expressions or standard responses determined or added by the MAU human agent 9, and maintains and updates a response database accordingly.
- Fig. 5 schematically shows an example of an operation interface presented by the MAU workstation to the MAU human agent 9.
- the operation interface of the MAU workstation 13 includes: a customer expression display area 131, a conversation status display area 132, a navigation area 133, a category selection area 134, and a shortcut area 135.
- the customer expression display area 131 displays the natural expression of the customer, for example, in the form of text converted from text, images, and voice.
- the conversation status display area 132 displays the real-time status information of the conversation between the client 8 and the MAU artificial agent 9 or the robot 14, such as: the number of conversations, the total duration of the conversation, the customer information, and the like. This display area may not be set.
- the navigation area 133 shows the category of the MAU artificial seat 9 that has been selected so far.
- the left end of the area shows the text version of the current category path (as shown in the figure: bank credit card), and the right side displays the category pair code (as shown in the figure: "12", "represents the bank” category, "2" stands for The next level of the "bank” category “credit card”.
- "1" stands for "bank” category, and without “BNK", the two logos are the same) .
- Category selection area 134 for MAU artificial seats 9 select the next level category.
- MAU artificial agent 9 has entered the next level of "banking" category "credit card,,, and the "credit card” category has 7 sub-categories: “Activate new card”, “Apply new Card and application progress inquiry ", "payment” ....
- customer 8 is "My credit card can be overdrawn.”
- MAU artificial seat 9 Select “7” in the current category “Bank Credit Card”
- the navigation area will update and display "Bank Credit Card Adjustment Credit Limit 127” to enter the next level.
- MAU artificial agent 9 can also directly enter "127" on the keyboard after seeing the expression of customer 8, and reach the target category "bank credit card adjustment credit limit”.
- MAU artificial agent 9 can quickly help the customer directly start the "adjust credit card quota" processing As a result, the user experience becomes easy and convenient, and the self-service process utilization rate of the traditional IVR system will be greatly improved.
- the shortcut area 135 provides common shortcut keys for the MAU artificial seat 9, for example, "-" returns to the upper category, "0" transfers the artificial seat, "+, returns to the top level category (in this example, the root category "bank””
- the shortcut area 135 can also provide other shortcut keys for the MAU artificial seat 9.
- the shortcut area 135 can improve the processing speed of the MAU artificial seat 9.
- the shortcut area 135 is also an optional setting area.
- An intelligent answering device in accordance with an embodiment of the present invention can be implemented by one or more computers, mobile terminals or other data processing devices.
- the natural expression processing and response method, device and system according to an embodiment of the present invention can utilize the standard expression to quickly point to the response, so that the customer does not have to spend a long time traversing the complex conventional function menu to find the self-service required by himself.
- Standardized nature can be established through robotic learning, training and manual assisted understanding
- the expression of a standard expresses a standard response database, gradually realizing the system's automatic understanding and response.
- the natural expression information data in the database can also have the advantages of small granularity, narrow business scope, high fidelity, etc., thereby reducing the difficulty of robot training and shortening the maturity cycle of robot intelligence.
- manual operation is mainly limited to the "decision" work in the background, including determining the requirement code of the standard expression (Y language information), selecting the response (or response ID), or generating the response operation, etc., but does not need to pass in the foreground.
- Calls or text input except for the input parameters of the standard expression (Y language information)) to communicate directly with the customer. This can save a lot of human resources and improve work efficiency.
- the system's standardized response to customers is more than the traditional free agent's traditional free-form response to the customer, not affected by many factors such as mood, voice, accent, business proficiency, etc. The stability of the experience.
- the automatic learning, training and maturity evaluation of the robot can be realized in units of specific business categories (nodes), thereby realizing the intelligence of the overall system point by point.
- nodes business categories
- the "machine understanding of point-by-point maturity" mechanism is more easily recognized and accepted by the organization, because the risk is relatively low, the cost of the old system transformation is not high, and it will not have a negative impact on daily operations.
Abstract
Description
Claims
Priority Applications (11)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2016546460A JP6182272B2 (ja) | 2013-10-28 | 2014-06-16 | 自然表現の処理方法、処理及び応答方法、装置、及びシステム |
EP14856958.5A EP3062239A4 (en) | 2013-10-28 | 2014-06-16 | Natural expression processing method, processing and response method, device, and system |
CN201480059550.9A CN105723362B (zh) | 2013-10-28 | 2014-06-16 | 自然表达处理方法、处理及回应方法、设备及系统 |
KR1020167014285A KR20160077190A (ko) | 2013-10-28 | 2014-06-16 | 자연 표현 처리 방법, 처리 및 응답 방법, 디바이스 및 시스템 |
CA2929018A CA2929018C (en) | 2013-10-28 | 2014-06-16 | Natural expression processing method, processing and response method, device and system |
US15/032,603 US9760565B2 (en) | 2013-10-28 | 2014-06-16 | Natural expression processing method, processing and response method, device, and system |
RU2016120612A RU2672176C2 (ru) | 2013-10-28 | 2014-06-16 | Способ обработки естественного выражения, способ, устройство и система обработки и ответа |
IL245322A IL245322B (en) | 2013-10-28 | 2016-04-27 | A method for processing natural expression, a method, facility and system for processing and response |
US15/141,576 US9753914B2 (en) | 2013-10-28 | 2016-04-28 | Natural expression processing method, processing and response method, device, and system |
ZA2016/03638A ZA201603638B (en) | 2013-10-28 | 2016-05-27 | Natural expression processing method, processing and response method, device, and system |
HK16108004.9A HK1223164A1 (zh) | 2013-10-28 | 2016-08-09 | 自然表達處理方法、處理及回應方法、設備及系統 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310516340.5A CN103593340B (zh) | 2013-10-28 | 2013-10-28 | 自然表达信息处理方法、处理及回应方法、设备及系统 |
CN201310516340.5 | 2013-10-28 |
Related Child Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US15/032,603 A-371-Of-International US9760565B2 (en) | 2013-10-28 | 2014-06-16 | Natural expression processing method, processing and response method, device, and system |
US15/141,576 Continuation US9753914B2 (en) | 2013-10-28 | 2016-04-28 | Natural expression processing method, processing and response method, device, and system |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2015062284A1 true WO2015062284A1 (zh) | 2015-05-07 |
Family
ID=50083490
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2014/079945 WO2015062284A1 (zh) | 2013-10-28 | 2014-06-16 | 自然表达处理方法、处理及回应方法、设备及系统 |
Country Status (11)
Country | Link |
---|---|
US (2) | US9760565B2 (zh) |
EP (1) | EP3062239A4 (zh) |
JP (1) | JP6182272B2 (zh) |
KR (1) | KR20160077190A (zh) |
CN (2) | CN103593340B (zh) |
CA (2) | CA2929018C (zh) |
HK (1) | HK1223164A1 (zh) |
IL (1) | IL245322B (zh) |
RU (1) | RU2672176C2 (zh) |
WO (1) | WO2015062284A1 (zh) |
ZA (1) | ZA201603638B (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105721470A (zh) * | 2016-02-19 | 2016-06-29 | 广州盛华信息有限公司 | 一种实现呼叫中心可视化接入的方法 |
WO2019153533A1 (zh) * | 2018-02-12 | 2019-08-15 | 平安科技(深圳)有限公司 | 动态路由控制方法、装置、计算机设备及存储介质 |
JP2019536135A (ja) * | 2017-01-25 | 2019-12-12 | グーグル エルエルシー | 言語モデルを用いたメッセージ内受信画像に対する自動提案応答 |
Families Citing this family (63)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103593340B (zh) | 2013-10-28 | 2017-08-29 | 余自立 | 自然表达信息处理方法、处理及回应方法、设备及系统 |
US9535904B2 (en) * | 2014-03-26 | 2017-01-03 | Microsoft Technology Licensing, Llc | Temporal translation grammar for language translation |
AU2015305397A1 (en) * | 2014-08-21 | 2017-03-16 | Jobu Productions | Lexical dialect analysis system |
CN104486516B (zh) * | 2014-11-13 | 2018-01-09 | 国网浙江省电力公司电力科学研究院 | 基于95598大话务的ivr智能系统的机器人语音服务方法 |
CN105094315B (zh) * | 2015-06-25 | 2018-03-06 | 百度在线网络技术(北京)有限公司 | 基于人工智能的人机智能聊天的方法和装置 |
CN106372091A (zh) * | 2015-07-23 | 2017-02-01 | 中兴通讯股份有限公司 | 基于信息识别提供解决方案的方法及装置 |
US9682481B2 (en) * | 2015-10-26 | 2017-06-20 | X Development Llc | Communication of information regarding a robot using an optical identifier |
WO2017082447A1 (ko) * | 2015-11-11 | 2017-05-18 | 주식회사 엠글리쉬 | 외국어 독음 및 표시장치와 그 방법, 및 이를 이용한 외국어 리듬 동작 감지 센서 기반의 운동학습장치와 운동학습방법, 이를 기록한 전자매체 및 학습교재 |
CN107015983A (zh) * | 2016-01-27 | 2017-08-04 | 阿里巴巴集团控股有限公司 | 一种用于智能问答中提供知识信息的方法与设备 |
US10769517B2 (en) * | 2016-03-05 | 2020-09-08 | Fujitsu Limited | Neural network analysis |
CN105808501A (zh) * | 2016-03-09 | 2016-07-27 | 北京众星智联科技有限责任公司 | 一种人工智能学习的实现 |
US10394964B2 (en) * | 2016-04-04 | 2019-08-27 | Oslabs Pte. Ltd. | Gesture based system for translation and transliteration of input text and a method thereof |
CN105975622B (zh) * | 2016-05-28 | 2020-12-29 | 福州云之智网络科技有限公司 | 多角色智能聊天的方法及系统 |
US10929861B2 (en) * | 2016-06-23 | 2021-02-23 | Tata Consultancy Services Limited | Method and system for measuring a customer experience in an organization |
CN106297782A (zh) * | 2016-07-28 | 2017-01-04 | 北京智能管家科技有限公司 | 一种人机交互方法及系统 |
CN106409283B (zh) * | 2016-08-31 | 2020-01-10 | 上海交通大学 | 基于音频的人机混合交互系统及方法 |
US10984034B1 (en) * | 2016-10-05 | 2021-04-20 | Cyrano.ai, Inc. | Dialogue management system with hierarchical classification and progression |
CN108073804B (zh) * | 2016-11-14 | 2022-11-29 | 百度在线网络技术(北京)有限公司 | 一种风险识别方法和装置 |
US10909980B2 (en) * | 2017-02-27 | 2021-02-02 | SKAEL, Inc. | Machine-learning digital assistants |
KR102367778B1 (ko) * | 2017-03-15 | 2022-02-25 | 삼성전자주식회사 | 언어 정보를 처리하기 위한 방법 및 그 전자 장치 |
CN107331388A (zh) * | 2017-06-15 | 2017-11-07 | 重庆柚瓣科技有限公司 | 一种基于养老机器人的方言收集系统 |
TWI712975B (zh) * | 2017-06-29 | 2020-12-11 | 永豐商業銀行股份有限公司 | 一種基於智慧語音人機互動設備之金融服務系統 |
JP2019049604A (ja) * | 2017-09-08 | 2019-03-28 | 国立研究開発法人情報通信研究機構 | 命令文推定システムおよび命令文推定方法 |
CN109587358A (zh) * | 2017-09-29 | 2019-04-05 | 吴杰 | 人工智能客服转人工客服呼叫方法 |
CN107909995B (zh) * | 2017-11-16 | 2021-08-17 | 北京小米移动软件有限公司 | 语音交互方法和装置 |
US10762113B2 (en) * | 2018-01-31 | 2020-09-01 | Cisco Technology, Inc. | Conversational knowledge graph powered virtual assistant for application performance management |
JP7078429B2 (ja) * | 2018-03-20 | 2022-05-31 | 株式会社Screenホールディングス | テキストマイニング方法、テキストマイニングプログラム、および、テキストマイニング装置 |
US11568863B1 (en) * | 2018-03-23 | 2023-01-31 | Amazon Technologies, Inc. | Skill shortlister for natural language processing |
US20190295082A1 (en) * | 2018-03-23 | 2019-09-26 | Mastercard International Incorporated | Message Based Payment Card System, Apparatuses, and Method Thereof |
US10929601B1 (en) * | 2018-03-23 | 2021-02-23 | Amazon Technologies, Inc. | Question answering for a multi-modal system |
US10956462B1 (en) * | 2018-06-21 | 2021-03-23 | Amazon Technologies, Inc. | System answering of user inputs |
EP3613037B1 (en) * | 2018-06-27 | 2020-10-21 | Google LLC | Rendering responses to a spoken utterance of a user utilizing a local text-response map |
CN108806688A (zh) * | 2018-07-16 | 2018-11-13 | 深圳Tcl数字技术有限公司 | 智能电视的语音控制方法、智能电视、系统及存储介质 |
US10726058B2 (en) * | 2018-07-31 | 2020-07-28 | Market Advantage, Inc. | System, computer program product and method for generating embeddings of textual and quantitative data |
CN109397287B (zh) * | 2018-10-12 | 2021-07-27 | 珠海格力智能装备有限公司 | 机器人编码方法及装置、机器人、存储介质、处理器 |
CN109215654A (zh) * | 2018-10-22 | 2019-01-15 | 北京智合大方科技有限公司 | 实时语音识别和自然语言处理的移动端智能客服辅助系统 |
CN109582971B (zh) * | 2018-12-20 | 2022-11-25 | 广东小天才科技有限公司 | 一种基于句法分析的批改方法及批改系统 |
IT201900000526A1 (it) * | 2019-01-11 | 2020-07-11 | Userbot S R L | Sistema di intelligenza artificiale per processi aziendali |
CN110059168A (zh) * | 2019-01-23 | 2019-07-26 | 艾肯特公司 | 对基于自然智能的人机交互系统进行训练的方法 |
CN110019688A (zh) * | 2019-01-23 | 2019-07-16 | 艾肯特公司 | 对机器人进行训练的方法 |
CN110046232A (zh) * | 2019-01-23 | 2019-07-23 | 艾肯特公司 | 自然智能的自然表达处理方法、回应方法、设备及系统 |
WO2020151652A1 (zh) * | 2019-01-23 | 2020-07-30 | 艾肯特公司 | 基于自然智能的自然表达处理方法、回应方法、设备及系统,对机器人进行训练的方法,人机交互系统,对基于自然智能的人机交互系统进行训练的方法,端到端控制方法和控制系统 |
CN110008317A (zh) * | 2019-01-23 | 2019-07-12 | 艾肯特公司 | 自然智能的自然表达处理方法、回应方法、设备及系统 |
CN110059166A (zh) * | 2019-01-23 | 2019-07-26 | 艾肯特公司 | 自然智能的自然表达处理方法、回应方法、设备及系统 |
CN110059167A (zh) * | 2019-01-23 | 2019-07-26 | 艾肯特公司 | 自然智能的自然表达处理方法、回应方法、设备及系统 |
CN114938360B (zh) * | 2019-04-12 | 2023-04-18 | 腾讯科技(深圳)有限公司 | 一种基于即时通讯应用的数据处理方法和装置 |
CN110035187A (zh) * | 2019-04-16 | 2019-07-19 | 浙江百应科技有限公司 | 一种在电话中实现ai和人工坐席无缝切换的方法 |
CN112312181A (zh) * | 2019-07-26 | 2021-02-02 | 深圳Tcl新技术有限公司 | 一种智能电视语音识别方法、系统及可读存储介质 |
CN110428811B (zh) * | 2019-09-17 | 2021-09-07 | 北京声智科技有限公司 | 一种数据处理方法、装置及电子设备 |
CN110826345B (zh) * | 2019-11-14 | 2023-09-05 | 北京香侬慧语科技有限责任公司 | 一种机器翻译方法和装置 |
CN111078776A (zh) * | 2019-12-10 | 2020-04-28 | 北京明略软件系统有限公司 | 数据表的标准化方法、装置、设备及存储介质 |
TWI751504B (zh) * | 2020-02-27 | 2022-01-01 | 中華電信股份有限公司 | 人機協作對話系統與方法 |
CN111540353B (zh) * | 2020-04-16 | 2022-11-15 | 重庆农村商业银行股份有限公司 | 一种语义理解方法、装置、设备及存储介质 |
KR102340946B1 (ko) * | 2020-07-08 | 2021-12-20 | 황호진 | 렌털 상품을 제공하는 판매자의 쇼핑몰 시스템 및 그 방법 |
KR20220006426A (ko) * | 2020-07-08 | 2022-01-17 | 황호진 | 해외 직접 구매와 렌털 상품을 제공하는 판매자의 쇼핑몰 시스템 및 그 방법 |
CN112488133B (zh) * | 2020-12-18 | 2022-06-14 | 贵州大学 | 一种视频/图片-文本跨模态检索方法 |
CN113099306B (zh) * | 2021-03-23 | 2022-11-22 | 北京达佳互联信息技术有限公司 | 消息处理方法、装置、设备、服务器、介质和产品 |
US11875132B2 (en) * | 2021-05-13 | 2024-01-16 | Intrado Corporation | Validation of revised IVR prompt translation |
US11861322B2 (en) | 2021-05-13 | 2024-01-02 | West Technology Group, Llc | Automated management of revised IVR prompt translations |
US20230008868A1 (en) * | 2021-07-08 | 2023-01-12 | Nippon Telegraph And Telephone Corporation | User authentication device, user authentication method, and user authentication computer program |
CN113672206B (zh) * | 2021-09-02 | 2024-04-02 | 北京航空航天大学 | 一种x语言混合建模平台及建模方法 |
CN115329785B (zh) * | 2022-10-15 | 2023-01-20 | 小语智能信息科技(云南)有限公司 | 融入音素特征的英-泰-老多语言神经机器翻译方法及装置 |
CN117506926A (zh) * | 2023-12-19 | 2024-02-06 | 广州富唯智能科技有限公司 | 一种基于大工艺模型的机器人设备自动控制方法 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020178005A1 (en) * | 2001-04-18 | 2002-11-28 | Rutgers, The State University Of New Jersey | System and method for adaptive language understanding by computers |
CN1489086A (zh) * | 2002-10-10 | 2004-04-14 | 莎 刘 | 一种语义约定全文翻译系统和方法 |
US20070299824A1 (en) * | 2006-06-27 | 2007-12-27 | International Business Machines Corporation | Hybrid approach for query recommendation in conversation systems |
CN103593340A (zh) * | 2013-10-28 | 2014-02-19 | 茵鲁维夫有限公司 | 自然表达信息处理方法、处理及回应方法、设备及系统 |
Family Cites Families (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS58201175A (ja) * | 1982-05-20 | 1983-11-22 | Kokusai Denshin Denwa Co Ltd <Kdd> | 機械翻訳方式 |
US4849898A (en) * | 1988-05-18 | 1989-07-18 | Management Information Technologies, Inc. | Method and apparatus to identify the relation of meaning between words in text expressions |
SE466029B (sv) * | 1989-03-06 | 1991-12-02 | Ibm Svenska Ab | Anordning och foerfarande foer analys av naturligt spraak i ett datorbaserat informationsbehandlingssystem |
JPH03129469A (ja) * | 1989-10-14 | 1991-06-03 | Canon Inc | 自然言語処理装置 |
US5338976A (en) * | 1991-06-20 | 1994-08-16 | Ricoh Company, Ltd. | Interactive language conversion system |
US5278980A (en) * | 1991-08-16 | 1994-01-11 | Xerox Corporation | Iterative technique for phrase query formation and an information retrieval system employing same |
US5369575A (en) * | 1992-05-15 | 1994-11-29 | International Business Machines Corporation | Constrained natural language interface for a computer system |
CA2141221A1 (en) * | 1992-09-04 | 1994-03-17 | Jaime G. Carbonell | Integrated authoring and translation system |
JPH0744638A (ja) * | 1993-07-29 | 1995-02-14 | Nec Corp | 表データ検索装置 |
US5748841A (en) * | 1994-02-25 | 1998-05-05 | Morin; Philippe | Supervised contextual language acquisition system |
US5659765A (en) * | 1994-03-15 | 1997-08-19 | Toppan Printing Co., Ltd. | Machine translation system |
JP3350293B2 (ja) * | 1994-08-09 | 2002-11-25 | 株式会社東芝 | 対話処理装置及び対話処理方法 |
US5794177A (en) * | 1995-07-19 | 1998-08-11 | Inso Corporation | Method and apparatus for morphological analysis and generation of natural language text |
US5909678A (en) * | 1996-09-13 | 1999-06-01 | International Business Machines Corporation | Computer systems, method and program for constructing statements by dragging and dropping iconic representations of subcomponent statements onto a phrase template |
US6026410A (en) * | 1997-02-10 | 2000-02-15 | Actioneer, Inc. | Information organization and collaboration tool for processing notes and action requests in computer systems |
CA2391985A1 (en) * | 1999-11-25 | 2001-05-31 | Yeong Kuang Oon | A unitary language for problem solving resources for knowledge based services |
JP2002108859A (ja) * | 2000-09-29 | 2002-04-12 | Nippon Hoso Kyokai <Nhk> | 対訳表現自動抽出装置および記録媒体 |
JP2005241971A (ja) * | 2004-02-26 | 2005-09-08 | Seiko Epson Corp | プロジェクタシステム、マイク装置、プロジェクタ制御装置およびプロジェクタ |
JP2006024114A (ja) * | 2004-07-09 | 2006-01-26 | Advanced Telecommunication Research Institute International | 機械翻訳装置および機械翻訳コンピュータプログラム |
US7844465B2 (en) | 2004-11-30 | 2010-11-30 | Scansoft, Inc. | Random confirmation in speech based systems |
US8065286B2 (en) | 2006-01-23 | 2011-11-22 | Chacha Search, Inc. | Scalable search system using human searchers |
US8117196B2 (en) | 2006-01-23 | 2012-02-14 | Chacha Search, Inc. | Search tool providing optional use of human search guides |
US8266130B2 (en) | 2006-01-23 | 2012-09-11 | Chacha Search, Inc. | Search tool providing optional use of human search guides |
US7962466B2 (en) | 2006-01-23 | 2011-06-14 | Chacha Search, Inc | Automated tool for human assisted mining and capturing of precise results |
JP4131978B2 (ja) * | 2006-02-24 | 2008-08-13 | 本田技研工業株式会社 | 音声認識機器制御装置 |
US8280921B2 (en) | 2006-07-18 | 2012-10-02 | Chacha Search, Inc. | Anonymous search system using human searchers |
US7873532B2 (en) | 2006-07-19 | 2011-01-18 | Chacha Search, Inc. | Method, system, and computer readable medium useful in managing a computer-based system for servicing user initiated tasks |
JP2009545076A (ja) | 2006-07-24 | 2009-12-17 | チャチャ サーチ,インク. | 情報検索システムにおけるポッドキャスティング及びビデオトレーニングの為の方法、システム及びコンピュータ読込可能ストレージ |
US8239461B2 (en) | 2007-06-28 | 2012-08-07 | Chacha Search, Inc. | Method and system for accessing search services via messaging services |
WO2009067696A1 (en) | 2007-11-21 | 2009-05-28 | Chacha Search, Inc. | Method and system for improving utilization of human searchers |
US8336035B2 (en) * | 2008-12-16 | 2012-12-18 | Microsoft Corporation | Customizable dynamic language expression interpreter |
US8000454B1 (en) | 2010-09-10 | 2011-08-16 | Zvi Or-Bach | Systems and methods for visual presentation and selection of IVR menu |
AU2012207503A1 (en) | 2011-01-17 | 2013-09-05 | Chacha Search, Inc. | Method and system of selecting responders |
US9842299B2 (en) | 2011-01-25 | 2017-12-12 | Telepathy Labs, Inc. | Distributed, predictive, dichotomous decision engine for an electronic personal assistant |
JP5590431B1 (ja) * | 2012-12-27 | 2014-09-17 | パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ | 情報通信方法 |
US8818795B1 (en) * | 2013-03-14 | 2014-08-26 | Yahoo! Inc. | Method and system for using natural language techniques to process inputs |
US20150074524A1 (en) | 2013-09-10 | 2015-03-12 | Lenovo (Singapore) Pte. Ltd. | Management of virtual assistant action items |
-
2013
- 2013-10-28 CN CN201310516340.5A patent/CN103593340B/zh not_active Expired - Fee Related
-
2014
- 2014-06-16 CA CA2929018A patent/CA2929018C/en not_active Expired - Fee Related
- 2014-06-16 EP EP14856958.5A patent/EP3062239A4/en not_active Withdrawn
- 2014-06-16 CA CA3011397A patent/CA3011397A1/en not_active Abandoned
- 2014-06-16 KR KR1020167014285A patent/KR20160077190A/ko active IP Right Grant
- 2014-06-16 US US15/032,603 patent/US9760565B2/en not_active Expired - Fee Related
- 2014-06-16 WO PCT/CN2014/079945 patent/WO2015062284A1/zh active Application Filing
- 2014-06-16 JP JP2016546460A patent/JP6182272B2/ja not_active Expired - Fee Related
- 2014-06-16 RU RU2016120612A patent/RU2672176C2/ru active
- 2014-06-16 CN CN201480059550.9A patent/CN105723362B/zh not_active Expired - Fee Related
-
2016
- 2016-04-27 IL IL245322A patent/IL245322B/en active IP Right Grant
- 2016-04-28 US US15/141,576 patent/US9753914B2/en not_active Expired - Fee Related
- 2016-05-27 ZA ZA2016/03638A patent/ZA201603638B/en unknown
- 2016-08-09 HK HK16108004.9A patent/HK1223164A1/zh not_active IP Right Cessation
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020178005A1 (en) * | 2001-04-18 | 2002-11-28 | Rutgers, The State University Of New Jersey | System and method for adaptive language understanding by computers |
CN1489086A (zh) * | 2002-10-10 | 2004-04-14 | 莎 刘 | 一种语义约定全文翻译系统和方法 |
US20070299824A1 (en) * | 2006-06-27 | 2007-12-27 | International Business Machines Corporation | Hybrid approach for query recommendation in conversation systems |
CN103593340A (zh) * | 2013-10-28 | 2014-02-19 | 茵鲁维夫有限公司 | 自然表达信息处理方法、处理及回应方法、设备及系统 |
Non-Patent Citations (1)
Title |
---|
See also references of EP3062239A4 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105721470A (zh) * | 2016-02-19 | 2016-06-29 | 广州盛华信息有限公司 | 一种实现呼叫中心可视化接入的方法 |
JP2019536135A (ja) * | 2017-01-25 | 2019-12-12 | グーグル エルエルシー | 言語モデルを用いたメッセージ内受信画像に対する自動提案応答 |
WO2019153533A1 (zh) * | 2018-02-12 | 2019-08-15 | 平安科技(深圳)有限公司 | 动态路由控制方法、装置、计算机设备及存储介质 |
Also Published As
Publication number | Publication date |
---|---|
ZA201603638B (en) | 2017-09-27 |
US9753914B2 (en) | 2017-09-05 |
US9760565B2 (en) | 2017-09-12 |
EP3062239A1 (en) | 2016-08-31 |
RU2016120612A (ru) | 2017-12-05 |
RU2672176C2 (ru) | 2018-11-12 |
CN105723362B (zh) | 2018-10-02 |
HK1223164A1 (zh) | 2017-07-21 |
CN103593340A (zh) | 2014-02-19 |
CA2929018A1 (en) | 2015-05-07 |
CA2929018C (en) | 2018-08-28 |
KR20160077190A (ko) | 2016-07-01 |
IL245322B (en) | 2018-12-31 |
US20160275075A1 (en) | 2016-09-22 |
IL245322A0 (en) | 2016-06-30 |
CN105723362A (zh) | 2016-06-29 |
JP2017503282A (ja) | 2017-01-26 |
CN103593340B (zh) | 2017-08-29 |
CA3011397A1 (en) | 2015-05-07 |
JP6182272B2 (ja) | 2017-08-16 |
EP3062239A4 (en) | 2017-11-22 |
US20160253434A1 (en) | 2016-09-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2015062284A1 (zh) | 自然表达处理方法、处理及回应方法、设备及系统 | |
WO2022095380A1 (zh) | 基于ai的虚拟交互模型生成方法、装置、计算机设备及存储介质 | |
US20100217591A1 (en) | Vowel recognition system and method in speech to text applictions | |
CN109325091B (zh) | 兴趣点属性信息的更新方法、装置、设备及介质 | |
CN110019688A (zh) | 对机器人进行训练的方法 | |
CN107315742A (zh) | 具有人机对话功能的拟人化口语翻译方法及系统 | |
CN112214591B (zh) | 一种对话预测的方法及装置 | |
KR20200092499A (ko) | 대화형 인공지능 기술을 이용한 상담 지원 방법 및 그 장치 | |
CN109256133A (zh) | 一种语音交互方法、装置、设备及存储介质 | |
CN110610705A (zh) | 一种基于人工智能的语音交互提示器 | |
CN110059166A (zh) | 自然智能的自然表达处理方法、回应方法、设备及系统 | |
CN110046232A (zh) | 自然智能的自然表达处理方法、回应方法、设备及系统 | |
CN115643341A (zh) | 人工智能客服应答系统 | |
CN114064865A (zh) | 在远程交互中检测词汇技能级别并校正未对准 | |
CN110059168A (zh) | 对基于自然智能的人机交互系统进行训练的方法 | |
US20190012313A1 (en) | Speech converter | |
CN110059167A (zh) | 自然智能的自然表达处理方法、回应方法、设备及系统 | |
CN110008317A (zh) | 自然智能的自然表达处理方法、回应方法、设备及系统 | |
JP2673831B2 (ja) | 会話マナー教育システム | |
RU2755781C1 (ru) | Интеллектуальное рабочее место оператора и способ его взаимодействия для осуществления интерактивной поддержки сессии обслуживания клиента | |
WO2021159734A1 (zh) | 一种数据处理方法、装置、设备及介质 | |
US20050281395A1 (en) | Methods and apparatus for an interactive audio learning system | |
WO2023276088A1 (ja) | 対話装置、対話制御方法及び対話プログラム | |
WO2020151652A1 (zh) | 基于自然智能的自然表达处理方法、回应方法、设备及系统,对机器人进行训练的方法,人机交互系统,对基于自然智能的人机交互系统进行训练的方法,端到端控制方法和控制系统 | |
CN116741143B (zh) | 基于数字分身的个性化ai名片的交互方法及相关组件 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 14856958 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2016546460 Country of ref document: JP Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 15032603 Country of ref document: US Ref document number: 245322 Country of ref document: IL |
|
ENP | Entry into the national phase |
Ref document number: 2929018 Country of ref document: CA |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 20167014285 Country of ref document: KR Kind code of ref document: A |
|
REEP | Request for entry into the european phase |
Ref document number: 2014856958 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2014856958 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: 2016120612 Country of ref document: RU Kind code of ref document: A |