CA2929018A1 - Natural expression processing method, processing and response method, device and system - Google Patents

Natural expression processing method, processing and response method, device and system Download PDF

Info

Publication number
CA2929018A1
CA2929018A1 CA2929018A CA2929018A CA2929018A1 CA 2929018 A1 CA2929018 A1 CA 2929018A1 CA 2929018 A CA2929018 A CA 2929018A CA 2929018 A CA2929018 A CA 2929018A CA 2929018 A1 CA2929018 A1 CA 2929018A1
Authority
CA
Canada
Prior art keywords
expression
response
standard
natural
language information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA2929018A
Other languages
French (fr)
Other versions
CA2929018C (en
Inventor
Zili Yu
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Icontek Corp
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CA3011397A priority Critical patent/CA3011397A1/en
Publication of CA2929018A1 publication Critical patent/CA2929018A1/en
Application granted granted Critical
Publication of CA2929018C publication Critical patent/CA2929018C/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/9032Query formulation
    • G06F16/90332Natural language query formulation or dialogue systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/42Data-driven translation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/42Data-driven translation
    • G06F40/47Machine-assisted translation, e.g. using translation memory
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/58Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00Machine learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/004Artificial life, i.e. computing arrangements simulating life
    • G06N3/008Artificial life, i.e. computing arrangements simulating life based on physical entities controlled by simulated intelligence so as to replicate intelligent life forms, e.g. based on robots replicating pets or humans in their appearance or behaviour
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00Computing arrangements based on specific mathematical models
    • G06N7/01Probabilistic graphical models, e.g. probabilistic networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/22Interactive procedures; Man-machine interfaces
    • G10L17/24Interactive procedures; Man-machine interfaces the user being prompted to utter a password or a predefined phrase

Abstract

The present invention discloses a natural expression processing method, comprising: identifying a natural expression from a user and obtaining a certain form of language information which can be processed by a computer; and converting the identified and obtained language information to a standard expression in an encoded form. According to the natural expression processing method of the embodiments of the present invention, a natural expression is converted to an encoded standard expression; conversion to a standard expression is converting the semantics of a natural expression to encoding and parameters; precise verbatim translation is not necessary, thus the requirement for degree of accuracy of machine translation can be reduced; at the same time, the complexity of the database used for expression conversion (machine translation) is reduced, increasing data query and update speed and thus improving smart processing performance. Furthermore, the relatively simple encoded expression reduces the workload for manually-assisted interventions, increasing the efficiency of the work of manually-assisted interventions.

Description

LiuShen Docket No. C14W5787 NATURAL EXPRESSION PROCESSING METHOD, PROCESSING AND RESPONSE
METHOD, DEVICE AND SYSTEM
TECHNICAL FIELD
=
[0001] The present invention relates to an information processing method, and in particular, relates to a processing method of a natural expression from a human being, a processing and response method for the natural expression, and an information processing device and information processing system using the processing and response method. .=
BACKGROUND
[0002] Machine translation (MT) falls within the scope of computational linguistics, which uses computer programs to translate text or speech expressions from one natural language to another natural language. in a sense, glossary replacements between different natural languages are achieved. Further, with a corpus-based technique, more complex automatic translation can be achieved, thereby better processing different grammatical structures, glossary recognitions, correspondence of idiomatic expressions, etc.
[0003] The current machine translation tools can generally allow for the customization on a specific field or profession (such as weather forecast), with an objective of narrowing the translation on the glossary to a proper noun in the specific field, so as to improve the translation result. This technique is particularly effective for some fields that use more formal or more standardized presentation manners. For example, govermnent documents or law related documents are usually more formal and more standardized than other documents using an ordinary literal expression, and accordingly the result of the machine translation for such documents is often better than that of informal documents such as dialogues in daily life.
[0004] However, the quality of the machine translation usually depends on the differences between a source language and a target language in terms of glossary, grammar structure, linguistics, and even culture. For example, since both English and Dutch both belong to indogermanische Fmilie, the result of the machine translation between these two languages is often much beilier than the result of the mutual machine translation between LiuShen Docket No. C14W5787 English and Chinese.
[0005] Therefore, in order to improve the result of the machine translation, manual intervention is still very important. For example, in some machine translation systems, by manually defining or choosing more suitable words, the accuracy and quality of the machine translation can be dramatically improved.
[0006] Some existing translation tools, such as Alta Vista Babelfish, sometimes can obtain understandable translation results. However, if a more meaningful result is desired, it is often necessary to make an appropriate edition when inputting a sentence in order to facilitate the analysis by computer programs.
[0007] In general, the purpose of using machine translation by people may only be learning the essence of sentences or paragraphs in an original text, rather than obtaining an accurate translation. Generally speaking, the machine translation has not reached a quality level such that it can be substituted for professional (manual) translation, and still cannot become an official translation.
[0008] Natural Language Processing (NLP) is a sub-discipline of the field of artificial intelligence and linguistics. In this field, how to process and apply a natural language is discussed; and natural language cognition refers to that a computer is made to "understand"
the real meaning behind human languages.
[0009] A natural language generation system converts computer data to a natural language. A natural language understanding system converts a natural language to a form that can be more easily processed by computer programs.
[0010] In theory, the NLP is a very attractive way of human-computer interaction.
Early language processing systems, such as SHRDLU, when using a limited vocabulary for making sessions within a limited "blocks world", can work quite well. This makes the researchers fairly optimistic on this system. However, when the systems are developed to be located in an environment filled with real-world ambiguity and uncertainty, they quickly lost confidence. Since the understanding of a natural language requires for the extensive knowledge about the outside world and the ability to use or manipulate the knowledge, the natural language cognition is also regarded as an AI-Complete problem.
[0011] The statistic-based NLP utilizes probabilistic and statistical methods to solve LiuShen Docket No. C14W5787 the problems existing in the NLP based on grammar rates. Especially for long sentences prone to be highly ambiguous, when practical grammar is applied for analysis, thousands of possibilities may be produced. The disambiguation methods adopted for processing these highly ambiguous sentences often utilize corpora and Markov models. The statistic-based NLP technology is mainly developed by evolution from the sub-fields, namely Machine Learning and Data Mining, associated with learning behavior in the artificial intelligence technology.
[00121 However, for the statistic-based NLP method, a corpus of paired language corpora containing a large amount of data needs to be established for the learning and use of a computer, and for the corpus of a large amount of data, retrieving of a corresponding result of machine translation (understanding) from the corpus and feeding back the result also require for the support of a large amount of computing resources. In addition, even if this method is adopted, great difficulties still exist in dealing with the diversity and uncertainty of the practical natural language.
[0013] The NLP technology has been widely used in practice. For example, it is used in an interactive voice response system, an intemet call center system, and so on.
[0014] Interactive Voice Response (IVR) is a general term of telephone-based voice value-added services. Many institutions (such as banks, credit card centers, telecom operators, etc.) provide customers with a wide range of self-services through an Interactive Voice Response System (IVRS), in which a customer may dial a specified phone number to log into the system, and enter appropriate options or personal information according to the instruction of the system, so as to listen to the pre-recorded information, or combine data according to a preset program (Call Flow) through the computer system, and read out specific information (such as account balance, amount due, and so on) in the manner of speech, and may also input a transaction instruction through the system, so as to conduct a preset transaction (such as transfer, change of password, change of contact phone number, etc).
[0015] Despite the IVR system has been widely used over the past decade, but technically, the IVR system was born with a critical defect that is still troubling all institutions: an irreducible menu tree with multi-layer options. Most of the users, when using the IVR system to select the self-services, are impatient to take time to traverse a menu tree _ _ LiuShen Docket No. C14W5787 with multi-layer options, but directly turn to a manual customer service center by pressing leading to an insurmountable gap between the expectation of the institutions on the ability of the IVR system to "effectively improve the rate of using self-services by the customers and substantially replace the manual operations" and the reality.
[0016] An Internet Call Center System (ICCS) is a new type of call center system booming in recent years, which adopts a popular Instant Messaging (IM) Internet technique, for enabling the mainly text-based real-time communication to be performed by the institutions and customers thereof over the Internet, and is applied to the customer services and remote sales of the institutions. The manual agent employing the ICCS can communicate simultaneously with two or more customers.
[0017] So to speak, the text-based ICC system is a variant of the speech-based IVR
system. Both are necessary tools (either for customer services or for remote sales) for the communication between the institutions and the customers thereof, and both require for the high level of participation of the manual agent. Therefore, like the IVR
system, it is also difficult for the ICC system to meet the requirement of "effectively improving the rate of using self-services by the customers and substantially replacing the manual operations" of the institutions.
[0018] On the other hand, the traditional speech-identification technology, based on the speech identification result being lack of accuracy and stability, employs keyword search technology, and uses an "exhaustive method" to perform semantic analysis on the speech.
Although many companies majored in speech-identification technology spend a great deal of human efforts and money on two items of work, i.e., "transcription" and "keyword spotting", and persistently train a speech robot for a long time, but the actual effects are often far different from the ideal effects.
SUMMARY
[0019] According to one aspect of the present invention, a natural expression processing method is provided, which includes: identifying a natural expression from a user, to obtain a certain form of language information which can be processed by a computer; and converting the obtained language information to a standard expression in an encoded form.

LiuShen Docket No. C14W5787 [0020] In the natural expression processing method according to the embodiments of the present invention, optionally, the standard expression includes requirement codes embodying a user's requirements.
[0021] In the natural expression processing method according to the embodiments of the present invention, optionally, the requirement codes are expressed by digital codes.
[0022] In the natural expression processing method according to the embodiments of the present invention, optionally, the standard expressions further include requirement parameters further embodying the user's specific requirements.
[0023] In the natural expression processing method according to the embodiments of the present invention, optionally, the language information is constituted by language information units obtained through spotting and conversion performed on the natural expression in the form of speech by using a modeling tool.
[0024] In the natural expression processing method according to the embodiments of the present invention, optionally, the language information is constituted by one of phoneme, character, and phrase.
[0025] In the natural expression processing method according to the embodiments of the present invention, optionally, the conversion from the language information to the standard expression is implemented on the basis of an MT (Machine Translation) training dataset between the language information and the standard expression.
[0026] In the natural expression processing method according to the embodiments of the present invention, optionally, information associated with the natural expression is obtained during the identification of the natural expression, and the information is converted to a part of the standard expression.
[0027] According to another aspect of the present invention, a method for training a artificial intelligence robot is provided, which includes: establishing an MT
training dataset, wherein the MT training dataset contains: computer-processable language information obtained by converting a natural expression, an encoded standard expression, and a corresponding relationship between the language information and the standard expression;
and performing, by the artificial intelligence robot, an iterative comparison between various permutations and combinations of elements of the language information existing in the MT

LiltShen Docket No. C14W5787 training dataset and various permutations and combinations of elements of the standard expression, to find out a corresponding relationship between the permutations and combinations of the elements of the language information and the permutations and combinations of the elements of the standard expression.
[0028] In the method for training an artificial intelligence robot according to the embodiments of the present invention, optionally, the data in the MT training dataset may be imported from an external database, and may also be generated or added through the manual aided understanding.
[0029] According to another aspect of the present invention, a natural expression processing method is provided, which includes: inputting a natural expression;
identifying the natural expression, to obtain a certain form of language information which can be processed by a computer; determining whether the language information can be converted to an encoded standard expression through machine conversion; if determining that the desired standard expression cannot be obtained through the machine conversion, performing manual conversion processing; and outputting the standard expressions from the machine conversion or manual conversion.
[0030] In the natural expression processing method according to the embodiments of the present invention, optionally, the determining refers to determining whether the understanding of a robot is mature, wherein, the determining whether the understanding of the robot is mature is performed on the basis of evaluation on the accuracy rate of the understanding of the robot over a certain time interval.
[0031] According to still another aspect of the present invention, a natural expression processing and response method is provided, which includes: inputting a natural expression;
identifying the natural expression, to obtain a certain form of language information which can be processed by a computer and relevant expression type information;
determining whether the identified natural expression and the expression type information can be converted to an encoded standard expression through the machine conversion; if determining that the desired standard expression cannot be obtained through the machine conversion, performing manual conversion processing; invoking or generating a standard response matching with the standard expression obtained through the machine conversion and manual conversion; and LiuShen Docket No. C14W5787 outputting the generated standard response in the manner of corresponding to the expression type information.
[0032] In the natural expression processing and response method according to the embodiments of the present invention, optionally, the standard response is fixed data pre-stored in a database, or the standard response is generated on the basis of basic data of standard responses pre-stored in a database and variable parameters.
[0033] According to still another aspect of the present invention, a natural expression processing and response device is provided, which includes: a dialogue gateway, a central controller, an MAU workstation, a robot, an expression database, a response database, and a response generator, wherein, the dialogue gateway receives a natural expression from a user, transmits it to the central controller for subsequent processing, and transmits a response for the natural expression to the user; the central controller receives the natural expression from the dialogue gateway, and cooperates with the robot and the MAU workstation, to convert the natural expression to an encoded standard expression and instruct the response generator to generate a standard response corresponding to the standard expression according to the standard expression; the robot identifies the natural expression according to the instruction of the central controller, to obtain a certain form of language information which can be processed by a computer, and converts the language information to the standard expression using the expression database; the MAU workstation presents the identified natural expression or the natural expression from the user to an external MAU manual agent, the MAU manual agent inputs or selects the standard expression through the MAU
workstation, and then the MAU workstation transmits the standard expression to the central controller; and the expression database is configured to store expression-related data, including: the language information data associated with the natural expression, the standard expression data associated with the standard expression, and the data associated with the relationship between the language information and the standard expression; the response database stores response-related data, including standard response data for invocation and/or data for generating a response; and the response generator receives the instruction of the central controller, and generates a response for the natural expression from the user by invoking and/or running the data in the response database.

LiuShen Docket No. C14W5787 [0034] In the natural expression processing and response device according to the embodiments of the present invention, optionally, the central controller updates the expression database and/or the response database.
[0035] In the natural expression processing and response device according to the embodiments of the present invention, optionally, the device further includes a trainer, configured to train the robot to convert the natural expression to the standard expression.
100361 In the natural expression processing and response device according to the embodiments of the present invention, optionally, the dialogue gateway further includes an identity authenticator, configured to identify and verify a user's identity before receiving the natural expression information, wherein authentication methods for the user's identity at least include pass-phrase & voice-print identification.
[0037] According to still another aspect of the present invention, a natural expression processing and response system is provided, which includes: an intelligent response device and a calling device; wherein, a user communicates with the intelligent response device through the calling device, and a MAU manual agent operates the intelligent response device, wherein the intelligent response device includes: a dialogue gateway, a central controller, a MAU workstation, a robot, an expression database, a response database, and a response generator, wherein the dialogue gateway receives, from the calling device, a natural expression from the user, and transmits it to the central controller; the central controller instructs the robot to identify a certain form of language information which can be processed by a computer and related expression information from the natural expression, and then instructs the robot to convert the language information and the related expression information to a standard expression; if the understanding of the robot is not mature enough to complete the conversion to the standard expression, the central controller instructs the MAU
workstation to prompt the MAU manual agent to perform a manual conversion to the standard expression, the MAU manual agent converts the language information and the related expression information identified by the robot to the standard expression, and inputs and transmits it to the central controller through the MAU workstation; the central controller instructs the response generator to invoke and/or run the data in the response database on the basis of the standard expression so as to generate a response for the natural expression from LiuShen Docket No Cl 4W5787 the user; and the dialogue gateway feeds back the response to the user through the calling device.
[0038] In the natural expression processing method according to the embodiments of the present invention, the natural expression may be converted to an encoded standard expression; because the conversion to the standard expression is converting the semantics of the natural expression to codes and parameters, and precise verbatim translation is not required, the requirement of accuracy for machine translation can be reduced, and meanwhile the complexity of the database for expression conversion (machine translation) is reduced, increasing data query and updating speed, and thus improving the performance of intelligent processing. In addition, the relatively simple encoded expression reduces the workload of manually-assisted interventions, increasing the efficiency of the work of manually-assisted interventions.
[00391 In the natural expression processing and response method, device, and system according to the embodiments of the present invention, the standard expression can be used to quickly point to the response, such that the customer no longer needs to spend a lot of time traversing the complicated routine menu of functions to find out the desired self-service.
Moreover, a standardized natural expression¨standard expression¨standard response database can be established through the automatic learning, training, and manual aided understanding of the robot, so as to implement the automatic understanding and response of the system step by step. In addition, the database may also have the advantages including a small particle size, a narrow scope of knowledge, and a high data fidelity, so as to reduce the training difficulty of the robot, and shorten the maturation period of the robot's intelligence.
BRIEF DESCRIPTION OF THE DRAWINGS
[0040] In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the accompanying drawings used in the embodiments will be described briefly hereinafter. Apparently, the drawings in the following descriptions merely illustrate the embodiments of the present invention, and are not intended to limit the present invention.
[0041] FIG 1 schematically shows a flow diagram of a natural expression processing LiuShen Docket No. C14W5787 method according to an embodiment of the present invention;
[0042] FIG. 2 schematically shows a flow diagram of a natural expression processing and response method according to an embodiment of the present invention;
[0043] FIG 3 schematically shows an intelligent response system according to the embodiments of the present invention;
[0044] FIG. 4 further shows a part of an intelligent response device in the system of FIG. 3;
[0045] FIG. 5 schematically shows an example of an operation interface presented by an MAU workstation to a manual agent;
[0046] FIG. 6 shows an example of identification on speech information;
[0047] FIG 7 shows an example of converting a collected acoustic wave to X
elements by using a Gaussian mixture model;
[0048] FIG 8 shows an example of conversion from a collected acoustic wave (A
language information) to Y language information;
[0049] FIG 9 generally shows layer-by-layer conversion from a collected acoustic wave (A language information) to Y language information; and [0050] FIG. 10 is a schematic view of the principle of multi-layer perception.
DETAILED DESCRIPTION
[0051] To make the objectives, technical solutions, and advantages of the embodiments of the present invention more clear, the technical solutions of the embodiments of the present invention are described clearly and fully below with reference to the accompanying drawings of the embodiments of the present invention. Apparently, the described embodiments are merely a part of embodiments of the present invention, instead of all the embodiments. All other embodiments derived by a person of ordinary skill in the art based on the described embodiments of the present invention without creative efforts shall fall within the protection scope of the present invention.
[0052] Unless otherwise defined, the technical terms or scientific terms used herein shall have the general meanings that can be understood by a person of ordinary skill in the LiuShen Docket No C141415787 field of the present invention. The term "first" or "second" and other similar terms used in the description and claims of the present invention do not indicate any order, quantity, or importance, but are merely used to distinguish between the different components. Similarly, the term "a" or "an" and other similar terms do not indicate any quantitative restriction, but indicates that at least one is present.
[0053] The natural expression processing method according to the embodiments of the present invention can be applied in a customer service system such as the aforementioned Interactive Speech Response (fVR) or Internet call center system (ICCS) or other remote customer contact systems (such as a telephone sales system, a network sales system, and a VTM intelligent remote terminal). As stated above, in such applications, the requirement on the machine translation is not to make an exact word-by-word meaning, but to convert the natural expression of the customer into the information that can be understood by the system, thereby providing a response corresponding to the expression to the customer.
In other words, the machine translation here focuses on the understanding on the real meaning of the human language, so as to express the actual intent or requirement of the customer "understood" from the natural expression in the form that can be more easily processed by computer programs.
[0054] In the natural expression processing method according to the embodiments of the present invention, the natural expression from the user is firstly identified or converted, to obtain a certain form of language information which can be processed by a computer, and then the obtained language information is converted to a standard expression in a certain form.
[0055] Irregular natural expression information presented in physical data from the user, such as an acoustic wave, can be referred to as "language information on the physical layer", and is also referred to as "A language information" below for short.
Through a certain modeling tool, basic automatic identification or conversion is performed, to obtain language (hereafter referred to as "X language") information on the first logic layer presented in the form of pennutation.s and combinations of several basic elements (hereafter referred to as "X
element"). The standard expression in a certain form generated by converting the X language information obtained by identifying or converting the A language information is hereinafter referred to as "Y language information".
II

LiuShen Docket No. C14W5787 [0056] There are a variety of human natural expression methods. For example, the natural expression from the customer, namely the "A language information", may be divided into the following four categories: text information, speech information, image information, and animation information.
[0057] Among these, a text information expression may be as follows: the customer expresses himself by inputting text through a keyboard, for example, the customer enters "How much money is there in my saving account?" on a user interface of an Internet channel call center of one bank; an image information expression may be as follows:
the customer expresses himself through an image, for example, the customer expresses the problem encountered by means of an image taken for error information during the use of a certain software through a computer desktop screen capture tool; a speech information expression may be as follows: the customer expresses himself through talking, for example, the customer talks with a customer service personnel of a service hotline (the telephone channel call center) of one bank, and asks during the talking over the phone: "What exactly do you mean? I'm not quite sure about that"; and an animation (also referred to as "video") information expression may be as follows: the customer shakes his head in front of a camera to express his disagreement.
[0058] As stated above, the natural expression (the A language information) of the customer is automatically identified and converted, to obtain information in a certain language form. If the A language information is the speech information, acoustic waveform information may for example be collected by means of a modeling tool and be automatically identified or converted to a certain type (corresponding to the speech information) of X
language through a system (an intelligent robot); if the A language information is the graphic information, graphic pixel information may for example be collected by means of a modeling tool and be automatically identified or converted to an X language (corresponding to the image information) through a system (an intelligent robot); if the A language information is the animation information, graphic pixel information and image change speed information may for example be collected by means of a modeling tool and be automatically identified or converted to the X language (corresponding to the animation information) through a system (an intelligent robot); and if the A language information is the text information, no conversion
12 LiuShen Docket No. C141V5787 needs to be performed.
[0059] Then, the aforementioned X language information obtained by the automatic conversion from the A language information or the text information for which no conversion is needed is "translated" into a regularized standard expression (Y language information) that can be "understood" by a computer or other processing devices. The Y language information can be automatically processed by a computer business system.
[0060] According to the embodiments of the present invention, regularized codes can be used to implement the regularized standard expression (the Y language information). For example, the following coding modes are adopted, including an industry code, an industry business code, an institution code, an institution business code, and an expression information code.
[0061] (1) Industry code [0062] Primary industry (2 letters, up to 26 x 26 = 676 primary industries) [0063] Subordinate industry (3 letters, up to 26 x 26 x 26 =
17,576 subordinate industries per primary industry) [0064] (2) Industry business code [0065] Level-1 industry business category (1-digit number 0-9) [0066] Level-2 industry business category (1-digit number 0-9) [0067] Level-3 industry business category (1-digit number 0-9) [0068] Level-4 industry business category (1-digit number 0-9) [0069] Level-5 industry business category (1-digit number 0-9) 100701 Level-6 industry business category (1-digit number 0-9) [0071] Level-7 industry business category (1-digit number 0-9) [0072] Level-8 industry business category (1-digit number 0-9) [0073] Level-9 industry business category (1-digit number 0-9) [0074] Level-10 industry business category (1-digit number 0-9) [0075] (3) Institution code (UID) (24-digit number = 3-digit country code + 3-digit city number + 18-digit institution number) [0076] (4) Institution business code [0077] Level-1 institution business category (0-9)
13 LiuShen Docket No. C14W5787 [0078] Level-2 institution business category (0-9) [0079] Level-3 institution business category (0-9) [0080] Level-4 institution business category (0-9) [0081] Level-5 institution business category (0-9) [0082] (5) Expression information code [0083] Information type code (2-digit number 1-99) [0084] Language code (using an RFC3066 standard:
http://tools.ietforg/html1rfc3066, e.g., zh-CN represents for "Simplified Chinese") [0085] Dialect code (3-digit number 1-999) [0086] Herein, the industry code represents the industries to which the subject that provides services belongs as pointed to by the irregular natural expression (A
language information) from the customer. For example, it can be represented by 2 letters to cover 676 industries, and optionally, a subordinate industry code of 3 letters can be added to cover additional 17,576 subordinate industries per industry. In this way, the code may basically cover all the common industries; the industry business code represents for the service demand as pointed to by the A language information from the customer, and can also be represented by an Arabic numeral. For example, a 10-digit number is used for coding to cover a larger industry business category; the institution code represents the subject that provides services as pointed to by the A language information from the customer, and, for example, can mark the country and city where the institution is located. The institution business code represents for the internal personalized business division of the subject that provides services, for facilitating the personalized internal management of the institution. The expression information code represents identifying information of the A language information itself of the customer, which may include information type, language type, and the like, represented by numbers and letters.
[0087] The following shows two examples of the regularized standard expression (Y
language information) according to the above coding manner:
[0088] Example 1:
FSBNK27100000000860109558800000000000000000002zh-CNO03 [0089] wherein,
14 LiuShen Docket No. CI 4W5787 [0090] the industry code is [0091] = FS ¨ Financial Service (primary industry) 100921 = BNK Bank (subordinate industry) [0093] the industry business code is [0094] = 2710000000 = Level-1 industry business category ¨ 2 (credit card) Level-2 industry business category ¨ 7 (adjust the credit line) Level-3 industry business category ¨ 1 (raising the credit line) 0000000 (no more subdivision categories) [0095] The institution code is [0096] = 086010955880000000000000 = Country code 086 (China) 010 (Beijing) 955880000000000000 (Head office of the Industrial and Commercial Bank of China) [0097] The institution business code is [0098] = 00000 = no institution business category (in this Y
language information, there is no institution business category self-defined by the institution "Head office of the Industrial and Commercial Bank of China", which means that: the Y language information belongs entirely to the industry business category, which is universal in the bank industry.) [0099] The expression information code is [00100] = 02 = speech (the type of the A language information provided by the customer is "speech") [00101] = zh-CN ¨ Mainland Chinese [00102] = 003 -- Cantonese dialect [00103] In this example, the A language information corresponding to the Y language information may be, for example, "the credit line of my credit card is too low", "I want to raise my credit line", "I want to lower my credit line", "I need to adjust the credit line", and other speech information.
[00104] In some specific application circumstances, especially under the circumstance where the subject that provides services is determined, the above industry code, institution code, and institution business code can all be preset as default values of the system. In other words, the business code and the expression information code are obtained from the A
language information provided by the customer only, and in this case, the Y
language infwinfition can be represented as "271000000002zh-CNO03"; alternatively, if a 3-digit LiuShen Docket No. C14W5787 number is sufficient for representing the industry business code for a specific application, the Y language information can be further represented as "27102zh-CNO03"; further, if only for the speech service, it can be represented as "271zh-CNO03"; if only the requirement expression of the customer is taken into consideration, and the type information of the expression itself is not cared, the Y language information can even be represented by "271"
only. Example 2: TVTKT11200000000014047730305000000000001240003fr-CH000 [00105] = TV = Traveling Service (primary industry) [00106] = TKT = Ticketing (subordinate industry) [00107] = 1120000000 = Level-1 industry business category ¨ 1 (air ticket) Level-2 industry business category ¨ 1 (change the air ticket) Level-3 industry business category ¨2 (delay) 0000000 (no more subdivision categories) [00108] = 001404773030500000000000 = country code 001 (United States) (Atlanta, Georgia) 773030500000000000 (Delta Airlines of the United States) [00109] = 12400 ¨ Level-1 institution business category ¨ 1 (discount ticket) Level-2 institution business category ¨ 2 (off-season) Level-3 institution business category ¨4 (Asia-Pacific) 00 (no more subdivision categories) [00110] = 03 = image (the type of the A language information provided by the customer is "image", for example, when the customer performs an air ticket changing operation on the Delta official website, and encounters a system error reporting, the customer takes a screen shot as a natural expression for turning to the Delta customer service center for help.) [00111] = fr-CH = Switzerland French [00112] = 000 = No dialect [00113] In this example, the A language information corresponding to the Y
language information is obtained through image identification. Likewise, under the circumstance where the subject that provides services is determined, the above industry code and the institution code can both be preset as default values of the system. In this case, the Y
language information may be represented as "11200000001240003fr-CH000"; if only the requirement expression of the customer is taken into consideration, and the type information of the expression itself is not cared, the Y language information is represented by "112000000012400" only; and in the case where 3-digit number is applied specifically to se-LiuShen Docket No. C14W5787 represent the industry business code, and a 3-digit number is applied to represent the institution business code, the Y language information is represented by "112124" only.
[00114] The above are only examples of a regularized standard expression (the Y
language information) according to the embodiments of the present invention, different code digits and code arrangement sequences may be used, and different code expressions or coding manners may also be used.
[001151 The natural expression (the A language information) from the customer always reflects the specific requirements of the customer. As stated above, the A
language information of the customer is first automatically converted to the X language information or the language information for which no conversion needs to be performed (when the A
language infon-nation is the text information), and then the X language information or the text language information is converted to a standard expression in an encoded form (the Y
language information). In the above examples, the Y language information may include an industry code, an industry business code, an institution code, an institution business code, and an expression information code. Optionally, the A language information may also include specific parameters under the category that reflects the customer's requirements (which may be referred to as "requirement parameters"), for example: "Transfer 5000 yuan to a person"
(Example 1), "I want to watch a movie, called Chinese Partners" (Example 2), and so on. A
specific requirement code set (for example including one or more of the aforementioned industry code, industry business code, institution code, institution business code, and expression information code) corresponds to a specific parameter set. As in the above Example 2, if the requirement code of "watch a movie" is 123, the corresponding parameter set may also include a parameter: movie name. Then, the Y language information corresponding to the A language information is "123 <Chinese Partners>". The 123 is the requirement code, and five characters in the c are the requirement parameters.
There are many manners for dividing the requirement codes and the requirement parameters in the Y
language information, which may use a symbol such as "<>", may also be a blank space, or may be arranged in a specific sequence, or the like. The aforementioned process of converting the A language information of the customer into the information in a certain form of language that can be processed by the computer may be implemented through a speech LiuShen Docket No. C14W5787 signal processing technique, a speech identification technique, an image identification technique, and a video processing technique, and these techniques may be existing techniques.
In fact, the concept of an encoded standard expression according to the embodiments of the present invention may also be applied in the identification processing of the natural expression.
[00116] Hereinafter, the processing on the speech information is first taken as an example to introduce the identification processing of the natural expression, and to further illustrate the application of the technical concept of the present invention in the identification processing of the natural expression. FIG. 6 illustratively shows a process of processing the speech information. In the course of the processing, the processing from the A
language to a D language is implemented. It should be noted that, the corresponding relationship between the "X language" information and the "A language" information, and the corresponding relationship between the "X language" information and the "B-language"
information in FIG.
6 are only illustrated for demonstration.
[00117] The A language, namely an acoustic wave, is data on the physical layer collected by an acoustic wave collection device (e.g., a microphone).
[00118] The X language is data on the first logic layer obtained after the speech signal processing on the A language data, which is referred to herein as the "X
language". The X
language is a language formed by various permutations and combinations of X
elements. The X elements are several column elements having different heights formed by automatically spotting the acoustic wave through a certain modeling tool, such as a Gaussian Mixture Model (GMM). FIG 7 shows an example of converting a collected acoustic wave (shown by a histogram) to the X elements (shown by a vector quantization histogram) by using a Gaussian mixture model.
[00119] Based on different modeling tools being applied to different natural speech sets, the number of the X elements can be controlled within a certain range (for example, below 200). According to the embodiments of the present invention, the combination of 2-digit ASCII characters is defined an ID of the X elements, as shown in FIG.
8. In other words, the number of the X elements can be up to a maximum of 16,384 (128 x 128 =
16,384), which can meet the requirement for increasing the number of the X
elements due to DuShen Docket No C14W5787 further development of the acoustic wave modeling technique in the future.
After the spotting, acoustic wave units are one-to-one corresponding to the X elements. Because the A language information can be considered as a combination of the acoustic wave units, and the X
language information is a combination of X elements, the conversion (or referred to as "identification") relationship from the A language to the X language in FIG 6 is a "many-to-many" relationship. FIG 6 shows an example of the X elements represented by ASCII characters.
1001201 The "B language" is a language formed by various permutations and combinations of B elements, and is data on the second logic layer in FIG 6.
All or part of permutations and combinations of the X elements form the B elements, so it can also be understood as that the X language is converted to B elements, and the B
elements constitute the B language. Thus, the conversion relationship from the X language to the B
language is a "many-to-many" relationship. The B elements may be phonemes, and some permutations and combinations of the B elements constitute syllables. The "phoneme" and "syllable" herein have the same meanings as in the category of linguistics. FIG. 6 shows examples of the B
elements, and these examples are phonemes of Chinese (Mandarin).
[00121] The "C language" is a language formed by various permutations and combinations of C elements, and is data on the third logic layer in FIG 6. All or part of permutations and combinations of the B elements form the C elements, so it can also be understood as that the B language is converted to C elements, and the C
elements constitute the C language. Thus, the conversion relationship from the B language to the C
language is a "many-to-many" relationship. If a linguistics system of phonemes and syllables is further used, the C elements correspond to the "characters" in the natural language.
FIG. 6 shows examples of the C elements, and these examples are characters in Chinese.
[001221 The "D language" is a language formed by various permutations and combinations of D elements, and is data on the fourth logic layer in FIG 6.
All or part of permutations and combinations of the C elements form the D elements, so it can also be understood as that the C language is converted to D elements, and the D
elements constitute the D language. Thus, the conversion relationship from the C language to the D
language is a "many-to-many" relationship. If the linguistics system of phonemes and syllables is used, the LiuShen Docket No. C14W5787 D elements correspond to .the "words" or "phrases" in the natural language.
FIG 6 shows examples of the D elements, and these examples are words in Chinese.
[00123] The example of the "C language" and the example of the "D language" in FIG.
6 seem to have the same content, which both are composed of "a", "n", "10", "Ai", "
"7" in sequence, but those familiar with Chinese can know that, the understanding given only according to the C language may produce great ambiguity, but the expression, after being converted to the "D language", can have a definite meaning. For other languages, conversions on the characters words or phrases are also very important for semantics understanding, particularly in the case where the speech identification is implemented by an intelligent system (a speech robot). According to different natural languages, the "characters"
and "words", i.e., the C language information and the D language information, may also be classified as the same level of language information.
[00124] The "Y
language" is data on the fifth logic layer (as shown in FIG. 8), which refers to the language information embodying "meaning" or "meanings" obtained after the understanding of the original natural language information A. The "standard expression"
defined above in the present invention is a form of the "Y language".
According to the embodiments of the present invention, for example, the bank industry may use a business code "21" to represent the meaning of "reporting the loss of a credit card";
use a business code "252" to represent the meaning of "partial repayment of a credit card", and "252-5000"
(the requirement code = 252, and the requirement parameter -= 5000) to represent the meaning of "repayment of 5000 Yuan for a credit card"; the entertainment industry may use a code "24" to represent the meaning of "watch a movie", and "24-Chinese Partners"
(the requirement code = 24, and the requirement parameter = "Chinese Partners") to represent the meaning of "watch a movie called Chinese Partners". Thus, the conversion relationship from the D language to the Y language is also a "many-to-many" relationship.
[00125] FIG 9 schematically shows a process of converting from the collected acoustic wave (the A language information) to the Y language information layer by layer. It can be seen from FIG 9 that, five times of conversion (translation) are performed on six types of language information, from the "acoustic wave" (the A language information) to the "X
elements" (the X language information), then to the "phonemes" (the B language . ____________________________________________________________________________ fIN
LiuShen Docket No. Cl 4W5787 information), then to the "characters" (the C language information), then to the "words" (the D language information), and finally to the "meaning" or "meanings" (the Y
language information). From the perspective of the data structure of the database, it starts from the "acoustic wave" as the initial language information A, and selects the paths of the permutations and combinations of five language constituting elements, to find out or correspond to the sixth type of language information data, namely the target language information Y.
[00126] Because the aforementioned five times of language information conversion need to be performed, the robot is also required to have the ability to achieve the five types of information language conversion. In general, the five-step conversion can be divided into three stages. In the three stages, in order to train the speech robot, the manual aided identification is always required.
[00127] The first stage: from the A language information (acoustic wave) to the C
language information (characters). The two-step conversion from the A language information (acoustic wave) to the B language information (phonemes), with the help of the information extraction and conversion algorithms (such as the aforementioned Gaussian mixture model) of the language information X, generally can be done automatically by the robot more accurately. However, in the conversion from the B language information (phonemes) to the C
language information (characters), a higher error rate may occur. For example, in Chinese, as shown in the example of FIG. 6, the original language information input by the customer is"
I (The table tennis racket is sold out)", but probably because of the customer's pronunciation or accent problems, the "#--kg" may be identified as a "kit", and "b"
may be identified as "M"; as a result, this acoustic wave is eventually converted to seven characters, namely "T[C5fttA546' I". In order to improve the identification accuracy of the robot, especially with respect to the problems such as the aforementioned pronunciation or accent, the identification result of the robot needs to be corrected, usually by means of the manual aided identification. The manual aided identification at this stage is referred to as transcription. The so-called transcription refers to that, the transcription personnel, by the use of specific tools, performs accurate spotting on the "acoustic wave" (the A
language information), and converts the wave bands obtained by spotting to the corresponding LiuShen Docket No. C14W5787 "characters" (the C language information), thereby defining a conversion/translation relationship between the A language (acoustic wave) and the C language (characters) for the robot. The precision of the spotting mainly depends on the carefulness of the transcription personnel and the familiarity for the transcription tools; and whether it can be converted to corresponding "characters" accurately depends on whether the transcription personnel has accurately understood the language environment in which this acoustic wave is located and the context (other acoustic waves before and after this acoustic wave).
Particularly for the Chinese characters, there are many characters having the same pronunciation, which increases the difficulty in accurate operation for the transcription personnel.
[00128] The second stage: from the C language information (characters) to the D
language information (words, phrases). Conversion from characters to words is also open to different interpretations, as in the preceding example, even if the identification from the acoustic wave to the characters is accurate, and a result of sever characters T
" arranged in sequence is obtained, at least two conversion results may be generated, namely "P + 5--67" and".--E-r-,34Z + #it + which have obviously different meanings. Likewise, the manual aided identification may be adopted to make rectification.
The manual aided identification at this stage is referred to as keyword spotting, and may also be referred to as "word spotting" for short; that is, the word spotting personnel combines the "characters" (the C language information) obtained through transcription, to form "words (keywords)" (the D language information), thereby defining a conversion/translation relationship between the C language (characters) and the D language (words) for the robot.
Whether the word spotting is accurate often depends on the mastering degree of the word spotting personnel on the business knowledge. With respect to different fields, the personnel familiar with the business content and terminology in this field is needed to perform the word spotting operation, and the cost thereof is also higher than that of the transcription.
[00129] The third stage: from the D language information to the Y language information, i.e., the understanding of meanings. If merely a certain words arranged in sequence are obtained, a true meaning of the customer often still cannot be accurately understood. For example, the customer says "Rn IA+ IjiL I (My credit card is lost)", the LiuShen Docket No. C14IV.5787 robot cannot identify the meaning thereof, and the technician inputs "TZ n", "ii H", and"
7 "into a syntax table of the database as new keywords; and another customer says:
7", the robot cannot identify the meaning thereof again, and the technician inputs "Win", "Wei -r," (which means the ) f"), and " into the syntax table of the database as new keywords. In this way, by means of the manual aid, the meanings or requirements of the customer are understood, and incorporated into the database. This manual aided identification is referred to as keyword pile-up, or "word pile-up" for short, namely, the permutations and combinations of "words" are accumulated, and incorporated into the database in accordance with the meaning thereof. The workload of such a task is huge, and the expertise of the training personnel is also required to aid the understanding.
[00130]
As stated above, in the natural expression processing method according to the embodiments of the present invention, the natural expression of the customer (the A language information) is first automatically converted to obtain the X language information, or no conversion is needed to directly obtain the C language information (when the A
language information is the text information); and then the X language information or the C language information is converted to the Y language information. With reference to the preceding analysis, the irregular natural expression may be one of the X language information, the B
language information, the C language information, and the D language information. In other words, the process of the natural expression processing may be: one of A-->X*Y, A.-->C¨>Y, and A4D4Y.
[00131]
If, in accordance with the language information converting model shown in FIG. 9, a multiple-layer "many-to-many" relationship conversion on the aforementioned six types of languages A-->X-->BC)D- Y needs to be performed, it is academically referred to as Multi-Layer Perception (MLP), as shown in FIG 10. The disadvantage of the multiple-layer "many-to-many" relationship conversion is that: each time of conversion will cause distortion of the original information to a certain extent, and will add more processing load to the system, resulting in a further loss in performance. More times of conversion cause more serious distortion of the original information, so that the processing speed of the system is slower. Similarly, because the intervention of the manual aid is required in the robot training at all the aforementioned three stages, a very high workload and costs will be LiuShen Docket No. C14W5787 produced on the one hand, and on the other hand, many times of human intervention will also increase the probability of error. Therefore, if the conversion of A4X¨>Y can be implemented and the multiple-layer "many-to-many" conversion of X->B-C4D4Y is omitted, the accuracy and efficiency of the expression information conversion can be improved, and the workload and the error rate of the manual aided identification can also be reduced.
[00132] According to the technique of the present invention, firstly, the irregular natural expression information such as text, speech, image, or video is converted to the X
language information through a modeling tool; then with the X language as the language on the left side and the Y language as the language on the right side, the conversion from the X
language information to the Y language information is implemented through the use of the machine translation (MT) technique.
[00133] Specifically, for example in the processing of the irregular natural expression information such as speech, the "speech signal processing" technique is firstly utilized to automatically convert/translate into the X language (based on the current "speech signal processing" technique, the accuracy rate of A-->X conversion can generally reach above 95%, and the improved "speech signal processing" technique works better in noise reduction, and can increase the accuracy rate of the A--)X conversion to above 99%); and then the machine translation technique can be used to implement the X4Y automatic machine translation, without the need of the multiple-layer conversion of X4B--->C->Y.
[00134] A machine translation algorithm similar to the statistical analysis on instance samples can be used to convert the irregular natural expression (the X
language information) obtained through conversion to the regularized standard expression (the Y
language information). This machine translation algorithm requires for sufficient amount and sufficient accuracy of corresponding data between the X language and the Y language.
[00135] In the method according to the present invention, in consideration that the accurate automatic machine conversion of A4X can be implemented, in order to accumulate the corresponding data between the X language and the Y language, the corresponding data between the A language and the Y language is accumulated. Thus, the solution of the present invention provides a new working mode of a manual agent, namely manual aided 17.
LiuShen Docket No. C141475787 understanding (MAU), which implements accumulation of the corresponding data between the A language and the Y language by means of manual understanding in combination with code input. As in the preceding example, the requirement code "271" may be used to express the meaning of adjusting the credit line of the credit card, and similarly, "21" may also be !!
11.
used to express the meaning of reporting a loss of the credit card, and thus "21" may be used to correspond to the aforementioned natural expression information "R,tr,ift-A-k-T91L-r" or "ft In nil 7". Such a simple code inputting manner, the traditional "agents with talking" is turned into "agents without talking", such that the work of the agents becomes more comfortable, the understanding capability of the highest values of humankind is more fully utilized while the working efficiency is greatly improved, and a tremendous amount of the corresponding data between the A/X language and the Y language is rapidly and accurately collected; the data is provided to an MT engine for cyclic iteration, self-learning the A/X:->Y conversion/translation rule, and forming an A/X4Y translation model.
[00136]
Introduced below are the principles of a machine translation technique and a machine translation robot training technique according to the present invention.
[00137]
The machine translation is an artificial intelligence technique for automatically translating two languages. The "language" mentioned herein is not a narrowly defined national language (for example: Chinese, English ...), but is a generalized information .1 representation mode. As mentioned above, in respect of the representation mode, the language can be divided into four major categories: text, speech, image, animation (also referred to as "video").
i : I
[00138]
The language is the information formed by various permutations and combinations of the elements in an element set. For example: the English text is a language formed by 128 ASCII characters (the elements) in an ASCII character set (the element set) :1 ! , through various one-dimension (serial) permutations and combinations; the Chinese language !
is formed through infinite permutations and combinations of a thousand of characters in combination with punctuations in the international codes (the basic elements constituting the Chinese information); and for another example, an RGB planar image is another language formed by three sub-pixels including red, green, and blue through various two-dimensional (in length and width) permutations and combinations.

LiuShen Docket No. C14W5787 [00139] If a certain conversion/translation rule exists between any two languages, the automatic conversion/translation rule between the two languages can be found through analysis on the corresponding relationship between the permutations and combinations of the two language elements. It is firstly required to manually collect the corresponding data (or "translation samples") of the two languages, then to find out the automatic conversion/translation rule between the two languages through the cyclic iteration of the permutations and combinations of the two language elements, so as to form a translation model of the two languages.
[00140] Two datasets are required for making the machine translation: a "training dataset" and a "testing dataset".
[00141] The two datasets have a similar data structure: pairs of data are stored, in which the left value is a "left language" (or referred to as the "source language"), and the right value is a "right language" (or referred to as the "target language").
An analogy can be made vividly: the "training dataset" is a self-learning book given by the humankind to the MT
robot, and the "testing dataset" is a test question given by the humankind to the MT robot, for evaluating the self-learning effect of the robot.
[00142] The following is an example of the "training dataset" and the "testing dataset"
for the English- Chinese MT:
Training dataset English Chinese 1 How old are you? fclist7.?
2 What's your age?
3 May I have your time?fn] VIA it 14, HVE0 T?
4 May I have your name? ?
Sony, your age is not qualified.
Testing dataset LiuShen Docket No. CI 4W5787 English Chinese 1 May I have your age? i A 2?
[00143]
The MT robot performs the cyclic iteration on the permutations and combinations by taking the elements constituting the language as units. As in the above example, it is found through the two data pairs #3 and #4 in the training dataset that, the permutations and combinations of 15 ASCII character elements (3 English letters "May"-I- 1 space + 1 English letters "I" + 1 space + 4 English letters "have"+ 1 space +
4 English letters "your") of English "May I have your" are corresponding to the permutations and combinations of 3 Chinese characters "iiThe" of GB codes; and it is found through the two =.;
data pairs #2 and #5 in the training dataset that, the permutations and combinations of 3 ASCII character elements of English "age" are corresponding to the permutations and combinations of 2 Chinese characters "2" of GB codes.
[00144]
Therefore, if the robot can translate the English "May I have your age?" in the testing dataset into the Chinese "111-1 ? " accurately, it proves that the robot has learned this English-Chinese translation of this sentence; and otherwise, it proves that the robot has not learned it. Then the robot needs to make a revision on his own learning method (for example, to find another path to try learning again), for which the training dataset is digested again, and this is another iteration; ... if this "iterative amendment" is constantly repeated, the translation accuracy rate of the robot keeps climbing. When the translation accuracy rate climbs to a certain degree (for example, the translation accuracy rate is 70%), the translation accuracy rate of the robot may keep hovering around this level, and is difficult to go up; that is to say, it encounters the "self-learning of the robot"
bottleneck, and then, the data in the MT training dataset needs to be increased for the robot. The data in the MT
training dataset may be imported from an external database, and may also be generated or added through the "manual aided understanding".
[001451 For example, in the previous example of the credit card business, when it is assumed that the irregular natural expression obtained is "An,i,v1-,-.fibRiej.2y.1)7. (the DuShen Docket No. C14W5787 overdraft limit of my credit card is too low)", and when the understanding of the robot is not sufficiently mature, the "manual aided understanding" can intervene, such that the expression can be understood as "RNMAIRIMA -MAO want to raise the credit line of the credit card)" manually, and the corresponding Y language information is input.
Optionally, during the "manual aided understanding" processing, the understanding process and understanding result on the natural expression need not to be recorded, and only the corresponding standard expression (the Y language information) as the final processing result is recorded. In this way, the manual operation is simplified, and resources are saved. For example, the operator only needs to input "271" as the standard expression to complete the processing on the irregular natural expression "MMA--Kft3._Aili,JtJ.> T (the overdraft limit of my credit card is too low)". For example, the new natural expression instance, such as the aforementioned natural expression "R, -r-agiAri<Jl> (the overdraft limit of my credit card is too low)", and the corresponding standard expression "271" are added to the existing MT
training dataset, thereby increasing and updating the data in the MT training dataset.
Thus, through the "manual aided understanding", an accurate and stable conversion on the target natural expression (converted to a standard expression, namely the Y language information) can be achieved on one hand, and efficient adding and updating of data in the MT
training dataset can be achieved on the other hand, such that the data in the MT training dataset of the system becomes richer and more accurate, and the accuracy rate of the translation (conversion) of the robot may also be efficiently improved.
100146] In theory, the MT robot needs to exhaustively list all the permutations and combinations of the 20 ASCII character elements of the #3 left-value "May I
have your time", and also needs to exhaustively list all the permutations and combinations of the 10 GB code Chinese characters of the #3 right-value "IHVYILA11-.[Itn 7". That is, the MT
robot needs to exhaustively list all the permutations and combinations of the left and right groups of elements of each pair of data in the training dataset. Through the exhaustive listing at element level, the MT robot must be able to find a lot of repeated permutations and combinations (such as "your", "May I have your", "age", "time", " ", "14 JA
165 ", so as to find a certain corresponding relationship between the permutations and combinations of the left language elements and the permutations and combinations of the right language 61' LiuShen Docket No. C14W5787 elements which appear repeatedly, i.e., the translation model between two languages. In other words, a larger amount of the left and right language data pairs in the training dataset brings out a larger number of permutations and combinations of the left and right language elements appearing repeatedly as found by the MT robot, a larger number of corresponding relationships of the permutations and combinations of the left and right elements appearing repeatedly, and thus a larger number of conversion/translation rules of the left and right languages mastered by the MT robot, thereby providing a more mature translation model.
Therefore, with the "regularized standard expression" and "manual aided understanding"
according to the technical concept of the present invention, the data of the MT training dataset can be more efficiently accumulated, thus helping to achieve the self-learning and automatic machine translation of the robot.
[00147] In the present invention, the machine translation between the X language4Y
language has the same principle as that of the machine translation between Chinese and English, except that the English is changed into the X language and the Chinese is changed into the Y language, and accordingly the element sets of the left and right languages are different.
[00148] As stated above, the machine translation technique can be used to automatically translate one language into another language. The technical principle thereof is to make analysis at the basic element level on the collected pairing information of two languages (a language on the left side and a language on the right side), by performing an iterative comparison on various permutations and combinations of the basis elements of a large number of language information pairs, to find out the conversion/translation rule between the two languages, thereby forming a translation model of the two languages.
[00149] The present invention extends the application scope of the machine translation technique from automatic translation between different national languages to automatic convertion from all the irregular multimedia natural expression information (text, speech, image, or video, namely the A language information) to the regular standard information (the Y language information), such that they can be processed by business systems of various sectors, so as to realize practical natural language processing (NLP) in the true sense.
1001501 Because multi-layer linguistic analysis needed for the traditional machine LiuShen Docket No. C14W5787 translation is not required, with the analysis of the instances at the basic element level, the accuracy and speed of translation can be increased, and updating and expansion can also be easily achieved by adding the natural expression instances and the standard expression.
[00151] With respect to the natural expression processing according to the embodiments of the present invention, because only the conversion from the natural expression (the A language information) to the standard expression (the Y
speech information) is needed, in other words, it is only required to establish an AIX->Y.
translation model, without the need of processing a translation result of the text language, the modification processing needs not to be performed on the translation result.
1001521 In addition, the natural expression processing according to the embodiments of the present invention can be limited to the use in specific business of specific sectors and institutions, for example, in the above credit card business, such that the scale of the MT
training dataset required by the processing system can be greatly reduced.
Thus, the maturation threshold of the understanding of the robot is increased, the costs for constructing and maintaining the MT training dataset are reduced, and the maturation period of the A/X4Y translation model is effectively shortened.
[00153] As previously stated, the natural expression processing system according to the embodiments of the present invention achieves the conversion from a natural expression to an encoded standard expression. The conversion is based on the MT training dataset storing the pairing data of the A/X language and the Y language information, and the A/X4Y
translation model obtained on the basis of the MT training dataset. Therefore, it is required to collect a certain amount of accurate A/X language data and Y language data to generate the MT training dataset, and to form A/X4Y translation model through the self-learning (self-training) of the robot (the information processing system). The formation of the MT
training dataset may be conducted through the manual aided understanding.
[00154] FIG. 1 schematically shows a flow diagram of a natural expression processing method according to one embodiment of the present invention.
[00155] In step S11, a system receives natural expression information (A
language information), and as previously stated, the natural expression information may be text information, speech information, image information, video information, and so on.

LiitShen Docket No. C.141175787 [00156] In step S21, whether the understanding of a robot is mature is determined.
Herein, the basis of the determining whether the understanding of the robot is mature is that, within a certain time interval (set according to specific application requirements), a result Y I
obtained through converting, by the robot, the A language information to the X
language information and then converting the X language information to the Y language information, is compared with a result Y2 obtained through directly manually converting the A
language information to the Y language information, and the number of times when Y1 and Y2 are the same as each other is divided by the total number of times to obtain a percentage, which is an accuracy rate of the understanding of the robot. The accuracy rate of the understanding of the robot set according to the application requirements is referred to as "a maturation threshold of the understanding of the robot". If the accuracy rate of the understanding of the robot is lower than the maturation threshold of the understanding of the robot, the system considers that the understanding of the robot is not yet mature, and the manual conversion result Y2 is further adopted instead of the robot conversion result Y1, in order to ensure accuracy and stability of the understanding of the system on the A language information. At the same time, the system adds the X language information (language on the left side) obtained through automatic machine conversion performed on the A language information machine, and the manual conversion result Y2 (language on the right side) into the MT training dataset, for use in the self-training of the MT robot.
[00157] If the understanding of the robot is mature, in step S22, the robot automatically converts the natural expression A to the standard expression Y
directly; and if the understanding of the robot is not mature, in step S23, the robot attempts to convert the natural expression A to the standard expression Y1 , and at the same time, in step S24, the MAU agent converts the natural expression A to the standard expressionY2.
[001581 In step S32, if it is determined in step S21 that the understanding ability of the robot has already been mature, the result Y of the automatic conversion of the robot is output;
and otherwise, the result Y2 of the manual conversion of the MAU agent is output.
[001591 Optionally, in step S31, the subsequent processing is performed on the natural expression A, the result Y1 of the conversion attempted by the robot, and the result Y2 of the manual conversion of the MAU agent by placing the X language information (the language LiuShen Docket No. C14W5787 on the left side) automatically converted from A together with Y2 (the language on the right side) into the MT training dataset as a pair of new pairing data; and comparing Y1 with Y2, to serve as the statistic data for "determining whether the understanding of the robot is mature".
Optionally, the original data A is retained, and when the A-->X conversion technique is further developed to be mature (have a higher conversion accuracy rate) in the future, the data of the language on the left side of the MT training dataset is updated.
[00160] FIG 2 schematically shows a flow diagram of a natural expression processing and response method according to one embodiment of the present invention.
[00161] In the processing shown in FIG 2, as in FIG. 1, a natural expression A is firstly received in step S12. Then, whether the natural expression A can be converted to a standard expression Y through machine conversion is determined in step 531. This step is equivalent to step S21 in FIG 1. Similar to the processing in FIG 1, when it is determined in step S31 that the desired standard expression cannot be obtained through the machine conversion, manual conversion processing is performed in step S32.
1001621 In practical applications, there may exist cases where the identified natural expression or the requirement expressed by the customer cannot be understood even through the human processing, and at this time, a response for prompting the customer to re-input is made in step S33 and then the processing returns to step S12, where a natural expression information A re-input by the customer is received. The "response for prompting the customer to re-input" may be, for example, speech prompts "excuse me, could you please say what you need again", "could you speak slowly"; text prompts "excuse me, please write more specifically"; or image prompts.
[00163] In Step S34, the standard expression of the machine conversion or the manual conversion is output. In Step S35, a standard response matching the standard expression is queried. The standard response may be fixed data pre-stored in the database;
alternatively, basic data of the standard response is pre-stored in the database, and then by the system, the basic data is synthesized with the variable parameters of the individual case to generate the standard response. In one embodiment, a standard response ID is set as a primary key of the response data, and a corresponding relationship table between the requirement codes of the standard expression (the Y language information) and the standard response ID
is set in the LittShen Docket No C14W5787 database, such that the requirement codes of the standard expression (the Y
language information) are associated with the response data. Table 1 - Table 3 below schematically show examples of the expression data table, the expression response relationship table, and the response data table, respectively. Optionally, the standard expression and the standard response ID are in a many-to-one relationship, as shown in Table 4. In addition, in other embodiments, because the requirement codes of the standard expression (the Y
language information) are encoded themselves, the requirement codes of the standard expression (the Y
language information) may also be directly used as the primary key of the response data.
Expression Data Table Natural expression Type Standard expression =
Received Speech [expression 1]
Transfer 5000 yuan to my Mom Text [expression 2]
<Transfer failure page screenshot> Image [expression 3]
2-5-1000 Telephone key [expression 4]
Tablel Expression Response Corresponding Table 1 Requirement codes of standard expression Standard response ID
[expression 1] [response 3]
[expression 2] [response 1]
[expression 3] [response 4]
[expression 4] [response 2]
Table2 Response Data Table _ LiuShen Docket No. C14W5787 Standard response ID Type Response content [response 1] Program <Transfers 5000 yuan to MS X>
<Repayment of 1000 yuan to the credit [response 21 Program card>
[response 3] Speech OK, I know, thanks!
<Simple tutorial to error correction in [response 4] Video transfer>
Table3 Expression Response Corresponding Table 2 Standard response ID Standard expression [response 80] [expression 74]
[response 80] [expression 12]
[response 80] [expression 23]
[response 81] [expression 31]
[response 81] [expression 57]
Table 4 [00164] As previously stated, the standard expression may include natural expression-related information, for example, expression type, language type, dialect type, and so on. For example, the natural expression from the customer is the speech "received", the standard response obtained by querying the converted standard expression is the speech "OK, I know, thanks!" For another example, the natural expression from the customer is the image "Transfer failure page screenshot", the standard response obtained by querying the converted standard expression is the video "Simple tutorial to error correction in transfer".
[001651 If the standard response matching the standard expression does not exist in the database, the corresponding response can be matched manually in step S36. The manual LiuShen Docket No. C14W5787 matching can associate the standard expression with the standard response ID
by inputting or selecting the standard response 1D, or associate the standard expression with the response data directly, and may also establish new response data. The reason why the standard response is not found is probably that the standard expression is newly added manually, or is probably that the same type of the standard response is not matched. Then, the response of the machine matching or manual matching is output in step S37. The content of the response is invoked or generated according to different information types. For example, for the speech =
response, the playback of live recording may be conducted or the speech on which TTS (Text To Speech speech synthesis) has been performed is output; for a user's digital operation, such as a telephone key sequential combination "2-5-1000", the operation "Repayment of 1000 yuan to the credit card" is completed by running a program.
[00166] For the text information such as "Transfers 5000 yuan to my Mom", the operation "Transfers 5000 yuan to MS X" is performed by running a program, but the system may not master the account information "MS X" in advance, and thus, on the one hand, the account information may be manually added to achieve the conversion to the standard expression, and on the other hand, even if the conversion to the standard expression is implemented, the corresponding standard response may not be queried, and the response processing needs to be manually performed. At this time, new response data (such as an operational procedure) will be generated, a new standard response ID may also be manually or automatically assigned to the response data, and the standard response ID
is associated with the above converted standard expression. Thus, while the response for the natural expression of the customer is achieved, manual aided understanding and training can be achieved, and an expression-response database is updated.
[00167] In the natural expression processing and response method according to the embodiments of the present invention, the standard expression can be used to quickly point to the response, such that the customer no longer needs to spend a lot of time traversing the complicated routine menu of functions to find out the desired self-service.
[00168] On the other hand, different from the conventional response mode, the manual operation is mainly limited to the "decision" at the background, which includes determining the requirement codes of the standard expression (the Y language information), and selecting LiuShen Docket No. C14F175787 a response (or the response ID) or generating a response operation, without the need of direct communication with the customer at the foreground by phone or by means of text input (other than inputting the requirement parameters of the standard expression (the Y language information)). Thus, a large amount of human efforts can be saved, and the working efficiency can be greatly increased. In addition, as compared with the traditional free-style response provided by the manual agent to the customer directly, the standardized response provided by the system to the customer is not affected by many factors including the manual agent's emotion, gland, accent, and operational proficiency, thereby further guaranteeing the stability of the customer experience.
[00169] Moreover, a standardized natural expression - standard expression -standard response database can be established through the automatic learning, training, and manual aided understanding of the system (robot), so as to implement the automatic understanding and response of the system step by step. In addition, the natural expression data in the database may also have the advantages including a small particle size, a narrow scope of business, and a high data fidelity, so as to reduce the training difficulty of the robot, and shorten the maturation period of the robot intelligence.
[00170] FIG 3 schematically shows an intelligent response system according to the embodiments of the present invention. As shown in FIG 3, the intelligent response system includes an intelligent response device 1 (equivalent to the server side) and a calling device 2 (equivalent to the client side), a customer 8 communicates with the intelligent response device 1 through the calling device 2, and an MAU manual agent 9 (a system service personnel) performs manual operation on the intelligent response device 1.
Herein, the intelligent response device 1 includes a dialogue gateway 11, a central controller 12, an MAU
workstation 13, and a robot 14. Optionally, the intelligent response device 1 further includes a trainer 15.
[00171] The customer 8 refers to an object of remote sales and remote service of the institution. The remote sales usually refer to that the institution actively contacts the customer in the form of "calling out" through its dedicated telephone or Internet channels, and attempts to promote sales for their products and services. The remote services usually refer to that the customer of the institution actively contacts the institution in the form of "calling in" through LiuShen Docket No. C14W5787 the dedicated telephone or Internet channels of the institution, and inquires or uses the products and services of the institution.
[00172] The calling device 2 is a dedicated telephone or Internet channel established by the institution, for performing remote sales (calling-out service) on the customer 8 and providing remote services (calling-in services) to the customer. The telephone channel call system, for example an automatic call distribution (ACD) system (for example, ACD of the Avaya), is a conversation channel for the institution to interact with the customer 8 in the form of speech through an automatic business system (for example, a traditional IVR system based on the telephone key technique, or a novel voice portal (VP) system based on an intelligent speech technique) and a manual agent at the background.
[00173] The Internet channel call system, for example an Internet call center (ICC) system based on the instant messaging (IM) technique, is a conversation channel for the institution to interact with the customer 8 in the form of text, speech, image, video, or others through a customer self-service system (for example, a natural language processing (NLP) system) and a manual agent at the background.
[00174] The intelligent response device 1 enables the institution to control the automatic business system and the manual agent at the background, as well as the conversation with the customer 8 in the form of text, speech, image, video, or other multimedia forms, thus achieving standardized and automatized interactive conversation between the institution and the customer.
1001751 The dialogue gateway 11 plays a role of "preposed portal"
in the intelligent response device 1, and the main functions thereof include: receiving the irregular natural expression (in the form of text, speech, image, and video) and the regular non-natural expression (e.g., in the form of telephone keyboard keys) from the customer 8 via the calling device 2, and transmitting them to the central controller 12 for subsequent processing;
receiving the instructions from the central controller 12, thereby achieving the response to the expression of the customer 8 (in the form of text, speech, image, video, program, or other forms).
[00176] As shown in FIG 4, the dialogue gateway 11 includes an expression receiver 111, an identity authenticator 112, a response database 113, and a response generator 114.

DuShen Docket No. C14W5787 [00177] The expression receiver 111 receives an expression from the customer 8 via the calling device 2. The expression may be the aforementioned various irregular natural expressions and regular non-natural expression.
[00178] Optionally, the identity authenticator 112 is arranged before the expression receiver 111. The identity authenticator 112 may identify and verify the identity of the customer in the initial stage of the conversation. A traditional "password input" technique (such as: a telephone key input password, a keyboard input website login password, and so on) can be adopted; a novel "pass-phrase + voice-print identification" technique can also be adopted; and the above two techniques can be mixed for use. Although the traditional password authentication technique is not convenient, it has long been widely accepted and commonly used by the market, and can be taken as a primary customer's identity identification and verification means on a critical sensitive conversation node (for example, bank transfer); although the latter is much more convenient, but it has not been widely accepted and commonly used by the market, and can be used as a completely new customer's identity identification and verification means for greatly improving the customer's experience .
on a non-critical sensitive conversation node (for example, querying shopping points), and can also be used as an aided identification and verification means for enhancing the security of the former on the critical sensitive conversation node.
[00179] The identity authenticator 112 is set, and the "pass-phrase + voice-print identification" customer's identity identification and verification means is adopted, such that the customer's experience is improved, and the customer no longer needs to further remember multiple different passwords; the security risk that the password may be stolen in the "password input" traditional method is reduced; in addition, the "pass-phrase + voice-print identification" method is mixed with the "password input" traditional method for use, which can be widely accepted by the market, and can further enhance the security of customer's identity identification and verification.
[00180] The response database 113 stores the response data for responding to the customer. Similar to those listed in the above table as examples, the data may include many of the following types:
[00181] Text: pre-programmed text, for example, text answers in an online bank FAQ

LiuShen Docket No. C14W5787 (frequently asked questions).
[00182] Speech: pre-recorded live recording, or the TTS speech synthesis recording without variables, for example: "Hello, Here is the future Bank. Is there anything I can do for you?"
[00183] Image: pre-made image, for example, the Beijing subway network image.
Non-video animation is also included, for example: GIF files, FLASH files, and the like given by the bank for introducing to the customer how to perform the international remittance operation in an online bank system.
[00184] Video: pre-made video, for example, the one given by an electric iron supplier for demonstrating to the customer how to use its new products.
[00185] Programs: a series of re-programmed instructions, for example, when a customer speaks to express "I want to watch the China Partners", an iCloud smart TV
operates according to the requirements of the customer to respond to the customer: firstly turn on the TV, and download and cache the movie Chinese partners automatically for the iCloud server side, and finally start playing.
[00186] Template: filled with variable text, speech, image, program templates.
[00187] The response generator 114 receives instructions of the central controller 12, and generates a response to the expression of the customer 8 by invoking and/or running the data in the response database 113. Specifically, in accordance with the standard response ID
in the instructions, the response data is queried and invoked from the response database 113, or text and image are displayed, or speech and video are played, or a program is conducted;
alternatively, a template is invoked from the response database 113 according to the instructions and the variable parameters transmitted in the instructions are filled, or the TTS
speech synthesis generated in real time is played (for example, "You have successfully repaid 5000 Yuan to the credit card", wherein, the "5000" is a variable in the instructions), or a paragraph of text is displayed, or an image or animation is generated in real time is displayed, or a segment of program is executed.
[00188] Optionally, the central controller 12 may maintain and update the data in the response database 113, including response data, a standard response ID, and so on.
[00189] The central controller 12 receives the customer's requirement expression LiuShen Docket No. C14PV5787 information from the expression receiver 111 (including: irregular natural expression and regular non-natural expression), and cooperates with the robot 14, as well as an MAU manual agent 9 via an MAU workstation 13, to convert the irregular natural expression information of the customer in accordance with the aforementioned method to a standard expression, determines a corresponding standard response ID according to the standard expression, and then transmits the standard response ID to the response generator 114.
Optionally, the central controller 12 may update the data in the MT training dataset.
[00190] The robot 14 is an application robot for implementing the above artificial intelligence technique. The robot 14 may implement the conversion on text information, speech information, image information, video information, and other natural expressions (the language information), to obtain a standard expression (the Y language information). As previously stated, when the understanding ability of the robot 14 reaches a certain level, for example, when it is determined that the understanding ability is mature within a certain specific category, the conversion of A4X4Y may be performed independently, without any aid of the manual agent. The MT training dataset may be arranged in the robot 14, or may be an external database, and the requirement codes of the standard expression data stored therein (the language on the right side) may be associated with the standard response ID. The database may be updated by the central controller 12. In addition, the database for use in text translation, speech identification, image identification, video processing, and so on may be an external database, and may also be arranged in the robot 14.
[00191] The MAU
workstation 13 is an interface between the intelligent response device 1 and the MAU manual agent 9. The MAU workstation 13 presents the identified natural expression or the original expression of the customer to the MAU
manual agent 9.
The MAU manual agent 9 inputs or selects the standard expression through the MAU
workstation 13, and the MAU workstation 13 transmits the standard expression to the central controller 12. Optionally, if the response needs to be determined with manual aid, the MAU
manual agent 9 inputs or selects the response (or the standard response ID) through the MAU
workstation 13.
[00192]
Optionally, the intelligent response device 1 further includes a trainer 15.
The trainer 15 is configured to train the ability of the robot 14 to convert the natural expression LiuShen Docket No. C.14W5787 into the standard expression. For example, the trainer 15 trains the robot 11 by using the determination result of the MAU manual agent 9, thereby constantly enhancing the accuracy rate of the understanding of the robot 11 in various categories (for example, the aforementioned business category and secondary business category, etc.). For each category, in the case where the accuracy rate of the understanding of the robot cannot reach "a maturation threshold of the understanding of the robot", the trainer 15 performs comparison processing between the conversion result of the standard expression of the MAU
manual agent 9 and the conversion result of the standard expression of the standard expression of the robot 11, and if the two results are the same, the "number of times of accurate determination of the robot" and the "number of times of determination of the robot" within the category are correspondingly increased by 1; otherwise, the result of the manual conversion is added into the MT training dataset, as new robot training data. The trainer 15 may also instruct the robot 14 to conduct the aforementioned "self-learning".
[00193] In addition, the trainer 15 may also be configured to train the robot 14 in terms of text translation, speech identification, image identification, video processing, and other artificial intelligence techniques. The trainer 15 may also maintain or update the MT training dataset, and the database for use in text translation, speech identification, image identification, and video processing.
[00194] Optionally, the trainer 15 may also be integrated with the central controller 12.
[00195] Optionally, the response generator 114 and the response database 113 may be independent of the dialogue gateway 11, and may also be integrated in the central controller 12.
[00196] The intelligent response device 1 can implement the aforementioned natural expression processing and response method. For example, the dialogue gateway 11 receives, from the calling device 2, the irregular natural expression information from the customer 8 via the expression receiver 111, and transmits it to the central controller 12; the central controller 12 instructs the robot 11 to identify the irregular natural expression information as a certain form of language information which can be processed by a computer and related expression information, and then instructs the robot 11 to convert the language information and the related expression information to the standard expression; if the understanding of the _ LiuShen Docket No. C14W5787 robot 11 is not sufficiently mature or corpus matching is not matched, thereby failing to complete the conversion to the standard expression, the central controller 12 instructs the MAU workstation 13 to prompt the MAU manual agent 9 to conduct a manual conversion to the standard expression; the MAU manual agent 9 converts the language information and the related expression information identified by the robot 11 to the standard expression, which is input and transmitted to the central controller 12 via the MAU workstation 13.
Optionally, the MAU manual agent 9 may directly convert the non-identified irregular natural expression information into a standard expression; the central controller 12 queries an expression-response database, to retrieve a standard response ID matching the standard expression, and if there is no matching result, further prompts the MAU manual agent 9 via the MAU workstation 13 to select the standard response and input a corresponding standard response ID; optionally, the MAU manual agent 9 may also directly associate the standard expression with the response data, or establish new response data; the central controller 12 instructs the response generator 114 to invoke and/or run the data in the response database 113 to generate a response to the expression of the customer 8; then, the dialogue gateway 11 feeds back the response to the customer 8 via the calling device 2;
optionally, the central controller 12 respectively maintains and updates the MT training dataset or the response database according to the standard expression or the standard response determined or added by the MAU manual agent 9, and accordingly maintains and updates the expression-response database.
[001971 FIG 5 schematically shows an example of an operation interface presented by the MAU workstation to the MAU manual agent 9. As shown in FIG. 5, the operation interfaces of the MAU workstation 13 include: a customer's expression display region 131, a conversation state display region 132, a navigation region 133, a category selection region 134, and a shortcut region 135.
[00198] The customer's expression display region 131 shows the natural expression of the customer, and for example, is rendered as the forms such as text converted from text, image, or speech.
[00199] The conversation state display region 132 displays conversation real-time state information between the customer 8 and the MAU manual agent 9 or the robot 14, such as:

1,41-LiuShen Docket No. C14W5787 To and fro times of conversation, total conversation duration, customer information, and so on. The display region may also be not arranged.
[00200] The navigation region 133 shows the category that the MAU manual agent 9 currently selects to arrive at. The left side of the region displays the text version of the current category path (as shown in the drawings: Bank-Credit card), the right side displays the code corresponding to the category (as shown in the drawings: "12" and "1" stand for the category "Bank", "2" stands for the next level of category "Credit card" in the category "Bank". Unlike .õ.
the preceding examples, in this application, "1" stands for the category "Bank", rather than "BNK", which has the same identification function).
[00201] The category selection region 134 is provided for the MAU manual agent 9 to select the next level of category. As shown in the drawings: the MAU manual agent 9 has entered the next level of category "Credit card" of the category "Bank", and 7 subcategories are administered under this level of category "Credit card": "Activate a new card", "Apply for a new card and enquire application status", "Repayment" and so on. If the expression of the customer 8 is "The overdraft limit of my credit card is too low", the MAU
manual agent 9 selects "7" in the current category "bank-credit card", the navigation region updates to display "bank-credit card->Adjust the credit line... ... 127", and then enters the further next level of category. The MAU manual agent 9 may also directly input "127" on the keyboard after seeing the expression of the customer 8, to reach the target category "bank4credit card¨)Adjust the credit line". In this way, the customer 8 no longer needs to spend a lot of time traversing the complex functional menu tree to find out the desired self service, but simply speak out his demands, such that the MAU manual agent 9 can quickly help the customer to directly start the processing "Adjust the credit line of the credit card". Thus, the user's experience becomes easier and more convenient, and the self-service process utilization rate of the existing traditional IVR system will be increased significantly.
[00202] The shortcut region 135 provides commonly used shortcut keys for the MAU
manual agent 9, for example, "-" for returning to the previous level of category, "0" for transferring to the manual agent, and "+" for returning to the top level of category (which is the root category "Bank" in this case). The shortcut region 135 may also provide other shortcuts for the MAU manual agent 9. The shortcut region 135 may increase the processing LiuShen Docket No. C14W5787 speed of the MAU manual agent 9. The shortcut region 135 is also an optional arrangement region.
[00203] Here merely gives one example of the operation interface of the MAU
workstation 13, which is used for the conversion processing of the MAU manual agent 9 on the standard expression. Similar operation interfaces may also be used to conduct the manual processing on the response.
[00204] The intelligent response device according to the embodiments of the present invention may be implemented by one or more computers, a mobile terminal, or other data processing devices.
[00205] In the natural expression processing and response method, device, and system according to the embodiments of the present invention, the standard expression can be used to quickly point to the response, such that the customer no longer needs to spend a lot of time traversing the complicated routine menu of functions to find out the desired self-service.
[00206] A standardized natural expression ____________ information standard expression¨standard response database can be established through the automatic learning, training, and manual aided understanding of the robot, so as to implement the automatic understanding and response of the system step by step. In addition, the natural expression data in the database may also have the advantages including a small particle size, a narrow scope of business, and a high fidelity, so as to reduce the training difficulty of the robot, and shorten the maturation period of the robot intelligence.
[00207] Unlike the traditional response mode, the manual operation is mainly limited to the "decision" at the background, which includes determining the requirement codes of the standard expression (the Y language information), and selecting a response (or a response ID) or generating a response operation, without the need of direct communication with the customer at the foreground by phone or by means of text input (other than inputting the requirement parameters of the standard expression (the Y language information)). Thus, a large amount of human efforts can be saved, and the working efficiency can be increased. In addition, as compared with the traditional free-style response provided by the traditional manual agent to the customer directly, the standardized response provided by the system to the customer is not affected by many factors including the manual agent's emotion, gland, LiuShen Docket No. C14W5787 accent, and operational proficiency, thereby further guaranteeing the stability of the customer experience.
[00208] In addition, the self-learning, training, and mature degree evaluation can be implemented in each individual specific business category (node), so as to achieve the intelligence of the whole system point by point. In practical applications, the mechanism "the understanding of the robot becomes mature point by point" is more likely to be approved and accepted by the institutions, because the risk is relatively low, the cost for reconstructing the old system cost is not high, and no negative impact will be generated on the daily operations.
[00209] The above are only exemplary embodiments of the present invention, and not intended to limit the scope of protection of the present invention, which is defined by the appended claims.

Claims (20)

1. A natural expression processing method, comprising:
identifying a natural expression from a user, to obtain a certain form of language information which can be processed by a computer; and converting the obtained language information to a standard expression in an encoded form.
2. The natural expression processing method of claim 1, wherein, the standard expression includes requirement codes embodying a user's requirement.
3. The natural expression processing method of claim 2, wherein, the requirement codes are represented by digital codes.
4. The natural expression processing method of claim 2, wherein, the standard expression further includes requirement parameters further embodying the user's specific requirement.
5. The natural expression processing method of any one of claims 1-4, wherein, the language information is constituted by language information units obtained through spotting and conversion performed on the natural expression in the form of speech by using a modeling tool.
6. The natural expression processing method of any one of claims 1-4, wherein, the language information is constituted by one of phonemes, characters, and phrases.
7. The natural expression processing method of any of claims 1-6, wherein, conversion from the language information to the standard expression are implemented on the basis of an MT training dataset between the language information and the standard expression.
8. The natural expression processing method of any one of claims 1-7, wherein, information associated with the natural expression is obtained during the identification of the natural expression, and the information is converted to a part of the standard expression.
9. A method for training an artificial intelligence robot, comprising:
establishing an MT training dataset, wherein the MT training dataset comprises: the computer-processable language information obtained by converting the natural expression, the encoded standard expression, and a corresponding relationship between the language information and the standard expression; and performing, by the artificial intelligence robot, an iterative comparison between various permutations and combinations of elements of the language information existing in the MT
training dataset and various permutations and combinations of elements of the standard expression, to find out a corresponding relationship between the permutations and combinations of the elements of the language information and the permutations and combinations of the elements of the standard expression.
10. The method for training an artificial intelligence robot of claim 9, wherein, the elements of the language information are language information units obtained through spotting and conversion performed on the natural expression in the form of speech by using a modeling tool.
11. The method for training an artificial intelligence robot of claim 9 or 10, wherein, the data in the MT training dataset is imported from an external database, or generated and added through the manual aided understanding.
12. A natural expression processing method, comprising:
inputting a natural expression;
identifying the natural expression, to obtain a certain form of language information which can be processed by a computer;
determining whether the language information can be converted into an encoded standard expression through machine conversion if it is determined that the desired standard expression cannot be obtained through the machine conversion, performing manual conversion processing; and outputting the standard expression of the machine conversion or the manual conversion.
13. The natural expression processing method of claim 12, wherein, the determining refers to determining whether the understanding of the robot is mature, wherein, the determining whether the understanding of the robot is mature is performed on the basis of evaluation on the accuracy rate of the understanding of the robot over a certain time interval.
14. A natural expression processing and response method, comprising:
inputting a natural expression;
identifying the natural expression, to obtain a certain form of language information which can be processed by a computer and relevant expression type information;
and determining whether the identified natural expression and the expression type information can be converted into an encoded standard expression through machine conversion if it is determined that the desired standard expression cannot be obtained through the machine conversion, performing manual conversion processing;
invoking or generating a standard response matching with the standard expression of the machine conversion or the manual conversion; and outputting the generated standard response in the manner of corresponding to the expression type information.
15. The natural expression processing and response method of claim 14, wherein, the standard response is fixed data pre-stored in a database, or the standard response is generated on the basis of variable parameters and basic data of the standard response pre-stored in the database.
16. A natural expression processing and response device (1), comprising: a dialogue gateway (11), a central controller (12), an MAU workstation (13), a robot (14), an expression database, a response database (113), and a response generator (114), wherein, the dialogue gateway (11) receives a natural expression from a user (8), transmits it to the central controller (12) for subsequent processing, and transmits a response for the natural expression to the user (8);
the central controller (12) receives the natural expression from the dialogue gateway (11), and cooperates with the robot (14) and the MAU workstation (13), to convert the natural expression to an encoded standard expression, and instruct the response generator (114) according to the standard expression to generate a standard response corresponding to the standard expression;
the robot (14) identifies the natural expression according to the instruction of the central controller (12), to obtain a certain form of language information which can be processed by a computer, and convert the language information to the standard expression using the expression database;
the MAU workstation (13) presents the identified natural expression or the natural expression from the user (8) to an external MAU manual agent (9), the MAU
manual agent (9) inputs or selects the standard expression through the MAU workstation (13), and then the MAU workstation (13) transmits the standard expression to the central controller (12);
the expression database is configured to store expression-related data, which includes:
the language information data associated with the natural expression, the standard expression data associated with the standard expression, and the data associated with the relationship between the language information and the standard expression;
the response database (113) stores response-related data, including standard response data for invocation and/or data for generating the response; and the response generator (114) receives instructions of the central controller (12), and generates the response for the natural expression of the user (8) by invoking and/or miming the data in the response database (113).
17. The natural expression processing and response device (1) of claim 16, wherein, the central controller (12) updates the expression database and/or the response database (113).
18. The natural expression processing and response device (1) of claim 16 or 17, wherein, the device (1) further includes a trainer (15), configured to train the robot (14) to convert the natural expression into the standard expression.
19. The natural expression processing and response device (1) of any one of claims 16-18, wherein, the dialogue gateway (11) further comprises an identity authenticator (112), configured to identify and verify the user (8)'s identity before receiving the natural expression, wherein authentication methods for the user's identity at least include pass-phrase and voice-print identification.
20. A natural expression processing and response system, comprising: an intelligent response device (1) and a calling device (2), a user (8) communicating with the intelligent response device (1) through the calling device (2), the intelligent response device (1) being operated by an MAU manual agent (9), wherein, the intelligent response device (1) comprises a dialogue gateway (11), a central controller (12), an MAU workstation (13), a robot (14), an expression database, a response database (113), and a response generator (114), the dialogue gateway (11) receives a natural expression from the user (8) from the calling device (2) and transmits it to the central controller (12);
the central controller (12) instructs the robot (14) to identify a certain form of language information which can be processed by a computer and related expression information from the natural expression, and then instructs the robot (14) to convert the language information and related expression information to a standard expression;
if understanding of the robot (14) is not sufficiently mature to complete the conversion to a standard expression, the central controller (12) instructs the MAU
workstation (13) to prompt the MAU manual agent (9) to perform a manual conversion to a standard expression, and the MAU manual agent (9) converts the language information and related expression information identified by the robot (14) to the standard expression and input and transmit it to the central controller (12) through the MAU workstation (13);

the central controller (12) instructs, based on the standard expression, the response generator (114) to invoke and/or run data in the response database (113) to generate a response to the natural expression of the user (8); and the dialogue gateway (11) feeds back the response to the user (8) through the calling device (2).
CA2929018A 2013-10-28 2014-06-16 Natural expression processing method, processing and response method, device and system Expired - Fee Related CA2929018C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CA3011397A CA3011397A1 (en) 2013-10-28 2014-06-16 Natural expression processing method, processing and response method, device and system

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN201310516340.5A CN103593340B (en) 2013-10-28 2013-10-28 Natural expressing information processing method, processing and response method, equipment and system
CN201310516340.5 2013-10-28
PCT/CN2014/079945 WO2015062284A1 (en) 2013-10-28 2014-06-16 Natural expression processing method, processing and response method, device, and system

Related Child Applications (1)

Application Number Title Priority Date Filing Date
CA3011397A Division CA3011397A1 (en) 2013-10-28 2014-06-16 Natural expression processing method, processing and response method, device and system

Publications (2)

Publication Number Publication Date
CA2929018A1 true CA2929018A1 (en) 2015-05-07
CA2929018C CA2929018C (en) 2018-08-28

Family

ID=50083490

Family Applications (2)

Application Number Title Priority Date Filing Date
CA3011397A Abandoned CA3011397A1 (en) 2013-10-28 2014-06-16 Natural expression processing method, processing and response method, device and system
CA2929018A Expired - Fee Related CA2929018C (en) 2013-10-28 2014-06-16 Natural expression processing method, processing and response method, device and system

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CA3011397A Abandoned CA3011397A1 (en) 2013-10-28 2014-06-16 Natural expression processing method, processing and response method, device and system

Country Status (11)

Country Link
US (2) US9760565B2 (en)
EP (1) EP3062239A4 (en)
JP (1) JP6182272B2 (en)
KR (1) KR20160077190A (en)
CN (2) CN103593340B (en)
CA (2) CA3011397A1 (en)
HK (1) HK1223164A1 (en)
IL (1) IL245322B (en)
RU (1) RU2672176C2 (en)
WO (1) WO2015062284A1 (en)
ZA (1) ZA201603638B (en)

Families Citing this family (66)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103593340B (en) 2013-10-28 2017-08-29 余自立 Natural expressing information processing method, processing and response method, equipment and system
US9535904B2 (en) * 2014-03-26 2017-01-03 Microsoft Technology Licensing, Llc Temporal translation grammar for language translation
US20170154546A1 (en) * 2014-08-21 2017-06-01 Jobu Productions Lexical dialect analysis system
CN104486516B (en) * 2014-11-13 2018-01-09 国网浙江省电力公司电力科学研究院 The robot voice method of servicing of IVR intelligence systems based on 95598 big traffics
CN105094315B (en) * 2015-06-25 2018-03-06 百度在线网络技术(北京)有限公司 The method and apparatus of human-machine intelligence's chat based on artificial intelligence
CN106372091A (en) * 2015-07-23 2017-02-01 中兴通讯股份有限公司 Method and device for providing solution on the basis of information identification
US9682481B2 (en) * 2015-10-26 2017-06-20 X Development Llc Communication of information regarding a robot using an optical identifier
WO2017082447A1 (en) * 2015-11-11 2017-05-18 주식회사 엠글리쉬 Foreign language reading aloud and displaying device and method therefor, motor learning device and motor learning method based on foreign language rhythmic action detection sensor, using same, and electronic medium and studying material in which same is recorded
CN107015983A (en) * 2016-01-27 2017-08-04 阿里巴巴集团控股有限公司 A kind of method and apparatus for being used in intelligent answer provide knowledge information
CN105721470B (en) * 2016-02-19 2020-10-20 广州盛华信息有限公司 Method for realizing visual access of call center
US10769517B2 (en) * 2016-03-05 2020-09-08 Fujitsu Limited Neural network analysis
CN105808501A (en) * 2016-03-09 2016-07-27 北京众星智联科技有限责任公司 Implementation of artificial intelligence learning
US10394964B2 (en) * 2016-04-04 2019-08-27 Oslabs Pte. Ltd. Gesture based system for translation and transliteration of input text and a method thereof
CN105975622B (en) * 2016-05-28 2020-12-29 福州云之智网络科技有限公司 Multi-role intelligent chatting method and system
US10929861B2 (en) * 2016-06-23 2021-02-23 Tata Consultancy Services Limited Method and system for measuring a customer experience in an organization
CN106297782A (en) * 2016-07-28 2017-01-04 北京智能管家科技有限公司 A kind of man-machine interaction method and system
CN106409283B (en) * 2016-08-31 2020-01-10 上海交通大学 Man-machine mixed interaction system and method based on audio
US10984034B1 (en) * 2016-10-05 2021-04-20 Cyrano.ai, Inc. Dialogue management system with hierarchical classification and progression
CN108073804B (en) * 2016-11-14 2022-11-29 百度在线网络技术(北京)有限公司 Risk identification method and device
US10146768B2 (en) * 2017-01-25 2018-12-04 Google Llc Automatic suggested responses to images received in messages using language model
US10909980B2 (en) * 2017-02-27 2021-02-02 SKAEL, Inc. Machine-learning digital assistants
KR102367778B1 (en) * 2017-03-15 2022-02-25 삼성전자주식회사 Method for processing language information and electronic device thereof
CN107331388A (en) * 2017-06-15 2017-11-07 重庆柚瓣科技有限公司 A kind of dialect collection system based on endowment robot
TWI712975B (en) * 2017-06-29 2020-12-11 永豐商業銀行股份有限公司 A voice-activated system for financial information services
JP2019049604A (en) * 2017-09-08 2019-03-28 国立研究開発法人情報通信研究機構 Instruction statement estimation system and instruction statement estimation method
CN109587358A (en) * 2017-09-29 2019-04-05 吴杰 Artificial intelligence customer service turns artificial customer service call method
CN107909995B (en) * 2017-11-16 2021-08-17 北京小米移动软件有限公司 Voice interaction method and device
US10762113B2 (en) * 2018-01-31 2020-09-01 Cisco Technology, Inc. Conversational knowledge graph powered virtual assistant for application performance management
CN108418978B (en) * 2018-02-12 2020-06-19 平安科技(深圳)有限公司 Dynamic route control method, device, computer equipment and storage medium
JP7078429B2 (en) * 2018-03-20 2022-05-31 株式会社Screenホールディングス Text mining methods, text mining programs, and text mining equipment
US10929601B1 (en) * 2018-03-23 2021-02-23 Amazon Technologies, Inc. Question answering for a multi-modal system
US11568863B1 (en) * 2018-03-23 2023-01-31 Amazon Technologies, Inc. Skill shortlister for natural language processing
US20190295082A1 (en) * 2018-03-23 2019-09-26 Mastercard International Incorporated Message Based Payment Card System, Apparatuses, and Method Thereof
US10956462B1 (en) * 2018-06-21 2021-03-23 Amazon Technologies, Inc. System answering of user inputs
EP3800633B1 (en) * 2018-06-27 2023-10-11 Google LLC Rendering responses to a spoken utterance of a user utilizing a local text-response map
CN108806688A (en) * 2018-07-16 2018-11-13 深圳Tcl数字技术有限公司 Sound control method, smart television, system and the storage medium of smart television
US10726058B2 (en) * 2018-07-31 2020-07-28 Market Advantage, Inc. System, computer program product and method for generating embeddings of textual and quantitative data
CN109397287B (en) * 2018-10-12 2021-07-27 珠海格力智能装备有限公司 Robot encoding method and device, robot, storage medium and processor
CN109215654A (en) * 2018-10-22 2019-01-15 北京智合大方科技有限公司 The mobile terminal intelligent customer service auxiliary system of Real-time speech recognition and natural language processing
CN109582971B (en) * 2018-12-20 2022-11-25 广东小天才科技有限公司 Correction method and correction system based on syntactic analysis
IT201900000526A1 (en) * 2019-01-11 2020-07-11 Userbot S R L ARTIFICIAL INTELLIGENCE SYSTEM FOR BUSINESS PROCESSES
CN110059166A (en) * 2019-01-23 2019-07-26 艾肯特公司 Natural expression processing method, response method, equipment and the system of natural intelligence
CN110059167A (en) * 2019-01-23 2019-07-26 艾肯特公司 Natural expression processing method, response method, equipment and the system of natural intelligence
CN110019688A (en) * 2019-01-23 2019-07-16 艾肯特公司 The method that robot is trained
CN110008317A (en) * 2019-01-23 2019-07-12 艾肯特公司 Natural expression processing method, response method, equipment and the system of natural intelligence
CN110046232A (en) * 2019-01-23 2019-07-23 艾肯特公司 Natural expression processing method, response method, equipment and the system of natural intelligence
WO2020151652A1 (en) * 2019-01-23 2020-07-30 艾肯特公司 Naturalist intelligence-based natural expression processing method, and response method, device and system, method for training robot, human-computer interaction system, method for training naturalist intelligence-based human-computer interaction system, and end-to-end control method and control system
CN110059168A (en) * 2019-01-23 2019-07-26 艾肯特公司 The method that man-machine interactive system based on natural intelligence is trained
CN111817943B (en) * 2019-04-12 2022-06-14 腾讯科技(深圳)有限公司 Data processing method and device based on instant messaging application
CN110035187A (en) * 2019-04-16 2019-07-19 浙江百应科技有限公司 A method of realizing AI and operator attendance seamless switching in the phone
CN112312181A (en) * 2019-07-26 2021-02-02 深圳Tcl新技术有限公司 Smart television voice recognition method, system and readable storage medium
CN110428811B (en) * 2019-09-17 2021-09-07 北京声智科技有限公司 Data processing method and device and electronic equipment
CN110826345B (en) * 2019-11-14 2023-09-05 北京香侬慧语科技有限责任公司 Machine translation method and device
CN111078776A (en) * 2019-12-10 2020-04-28 北京明略软件系统有限公司 Data table standardization method, device, equipment and storage medium
TWI751504B (en) * 2020-02-27 2022-01-01 中華電信股份有限公司 Dialogue system and method for human-machine cooperation
CN111540353B (en) * 2020-04-16 2022-11-15 重庆农村商业银行股份有限公司 Semantic understanding method, device, equipment and storage medium
KR20220006426A (en) * 2020-07-08 2022-01-17 황호진 Seller shopping mall system to directly buy in overseas countries and provide lental goods and method thereof
KR102340946B1 (en) * 2020-07-08 2021-12-20 황호진 Seller shopping mall system to provide rental goods and method thereof
CN112488133B (en) * 2020-12-18 2022-06-14 贵州大学 Video/picture-text cross-modal retrieval method
CN113099306B (en) * 2021-03-23 2022-11-22 北京达佳互联信息技术有限公司 Message processing method, device, equipment, server, medium and product
US11875132B2 (en) * 2021-05-13 2024-01-16 Intrado Corporation Validation of revised IVR prompt translation
US11861322B2 (en) 2021-05-13 2024-01-02 West Technology Group, Llc Automated management of revised IVR prompt translations
US20230008868A1 (en) * 2021-07-08 2023-01-12 Nippon Telegraph And Telephone Corporation User authentication device, user authentication method, and user authentication computer program
CN113672206B (en) * 2021-09-02 2024-04-02 北京航空航天大学 X language hybrid modeling platform and modeling method
CN115329785B (en) * 2022-10-15 2023-01-20 小语智能信息科技(云南)有限公司 English-Tai-old multi-language neural machine translation method and device integrated with phoneme characteristics
CN117506926A (en) * 2023-12-19 2024-02-06 广州富唯智能科技有限公司 Robot equipment automatic control method based on large process model

Family Cites Families (41)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS58201175A (en) * 1982-05-20 1983-11-22 Kokusai Denshin Denwa Co Ltd <Kdd> Machine translation system
US4849898A (en) * 1988-05-18 1989-07-18 Management Information Technologies, Inc. Method and apparatus to identify the relation of meaning between words in text expressions
SE466029B (en) * 1989-03-06 1991-12-02 Ibm Svenska Ab DEVICE AND PROCEDURE FOR ANALYSIS OF NATURAL LANGUAGES IN A COMPUTER-BASED INFORMATION PROCESSING SYSTEM
JPH03129469A (en) * 1989-10-14 1991-06-03 Canon Inc Natural language processor
US5338976A (en) * 1991-06-20 1994-08-16 Ricoh Company, Ltd. Interactive language conversion system
US5278980A (en) * 1991-08-16 1994-01-11 Xerox Corporation Iterative technique for phrase query formation and an information retrieval system employing same
US5369575A (en) * 1992-05-15 1994-11-29 International Business Machines Corporation Constrained natural language interface for a computer system
ES2143509T3 (en) * 1992-09-04 2000-05-16 Caterpillar Inc INTEGRATED EDITION AND TRANSLATION SYSTEM.
JPH0744638A (en) * 1993-07-29 1995-02-14 Nec Corp Table data retrieving device
US5748841A (en) * 1994-02-25 1998-05-05 Morin; Philippe Supervised contextual language acquisition system
EP0672989A3 (en) * 1994-03-15 1998-10-28 Toppan Printing Co., Ltd. Machine translation system
JP3350293B2 (en) * 1994-08-09 2002-11-25 株式会社東芝 Dialogue processing device and dialogue processing method
US5794177A (en) * 1995-07-19 1998-08-11 Inso Corporation Method and apparatus for morphological analysis and generation of natural language text
US5909678A (en) * 1996-09-13 1999-06-01 International Business Machines Corporation Computer systems, method and program for constructing statements by dragging and dropping iconic representations of subcomponent statements onto a phrase template
US6026410A (en) * 1997-02-10 2000-02-15 Actioneer, Inc. Information organization and collaboration tool for processing notes and action requests in computer systems
EP1234254A1 (en) * 1999-11-25 2002-08-28 Yeong Kuang Oon A unitary language for problem solving resources for knowledge based services
JP2002108859A (en) 2000-09-29 2002-04-12 Nippon Hoso Kyokai <Nhk> Automatic translated expression extraction system and recording medium
WO2002086864A1 (en) * 2001-04-18 2002-10-31 Rutgers, The State University Of New Jersey System and method for adaptive language understanding by computers
CN1489086A (en) * 2002-10-10 2004-04-14 莎 刘 Semantic-stipulated text translation system and method
JP2005241971A (en) * 2004-02-26 2005-09-08 Seiko Epson Corp Projector system, microphone unit, projector controller, and projector
JP2006024114A (en) 2004-07-09 2006-01-26 Advanced Telecommunication Research Institute International Mechanical translation device and mechanical translation computer program
US7844465B2 (en) 2004-11-30 2010-11-30 Scansoft, Inc. Random confirmation in speech based systems
US7962466B2 (en) 2006-01-23 2011-06-14 Chacha Search, Inc Automated tool for human assisted mining and capturing of precise results
US8065286B2 (en) 2006-01-23 2011-11-22 Chacha Search, Inc. Scalable search system using human searchers
US8266130B2 (en) 2006-01-23 2012-09-11 Chacha Search, Inc. Search tool providing optional use of human search guides
US8117196B2 (en) 2006-01-23 2012-02-14 Chacha Search, Inc. Search tool providing optional use of human search guides
JP4131978B2 (en) 2006-02-24 2008-08-13 本田技研工業株式会社 Voice recognition device controller
US7533089B2 (en) * 2006-06-27 2009-05-12 International Business Machines Corporation Hybrid approach for query recommendation in conversation systems
WO2008011454A2 (en) 2006-07-18 2008-01-24 Chacha Search, Inc. Anonymous search system using human searchers
US7873532B2 (en) 2006-07-19 2011-01-18 Chacha Search, Inc. Method, system, and computer readable medium useful in managing a computer-based system for servicing user initiated tasks
EP2049970A4 (en) 2006-07-24 2014-12-31 Chacha Search Inc Method, system, and computer readable storage for podcasting and video training in an information search system
US8239461B2 (en) 2007-06-28 2012-08-07 Chacha Search, Inc. Method and system for accessing search services via messaging services
EP2223243A4 (en) 2007-11-21 2011-01-12 Chacha Search Inc Method and system for improving utilization of human searchers
US8336035B2 (en) * 2008-12-16 2012-12-18 Microsoft Corporation Customizable dynamic language expression interpreter
US8000454B1 (en) 2010-09-10 2011-08-16 Zvi Or-Bach Systems and methods for visual presentation and selection of IVR menu
EP2666101A4 (en) 2011-01-17 2014-07-30 Chacha Search Inc Method and system of selecting responders
US9842299B2 (en) 2011-01-25 2017-12-12 Telepathy Labs, Inc. Distributed, predictive, dichotomous decision engine for an electronic personal assistant
SG11201505027UA (en) * 2012-12-27 2015-07-30 Panasonic Ip Corp America Information communication method
US8818795B1 (en) * 2013-03-14 2014-08-26 Yahoo! Inc. Method and system for using natural language techniques to process inputs
US20150074524A1 (en) 2013-09-10 2015-03-12 Lenovo (Singapore) Pte. Ltd. Management of virtual assistant action items
CN103593340B (en) * 2013-10-28 2017-08-29 余自立 Natural expressing information processing method, processing and response method, equipment and system

Also Published As

Publication number Publication date
US9760565B2 (en) 2017-09-12
JP6182272B2 (en) 2017-08-16
CN103593340B (en) 2017-08-29
EP3062239A1 (en) 2016-08-31
WO2015062284A1 (en) 2015-05-07
US20160253434A1 (en) 2016-09-01
ZA201603638B (en) 2017-09-27
IL245322A0 (en) 2016-06-30
US9753914B2 (en) 2017-09-05
CN105723362B (en) 2018-10-02
JP2017503282A (en) 2017-01-26
US20160275075A1 (en) 2016-09-22
IL245322B (en) 2018-12-31
EP3062239A4 (en) 2017-11-22
CN103593340A (en) 2014-02-19
RU2016120612A (en) 2017-12-05
RU2672176C2 (en) 2018-11-12
HK1223164A1 (en) 2017-07-21
CN105723362A (en) 2016-06-29
KR20160077190A (en) 2016-07-01
CA3011397A1 (en) 2015-05-07
CA2929018C (en) 2018-08-28

Similar Documents

Publication Publication Date Title
CA2929018C (en) Natural expression processing method, processing and response method, device and system
US11645547B2 (en) Human-machine interactive method and device based on artificial intelligence
WO2022095380A1 (en) Ai-based virtual interaction model generation method and apparatus, computer device and storage medium
CN110019688A (en) The method that robot is trained
CN107924394A (en) For providing the natural language processor of the natural language signal in natural language output
KR102445519B1 (en) System and method for manufacturing conversational intelligence service providing chatbot
CN109325091B (en) Method, device, equipment and medium for updating attribute information of interest points
US8165887B2 (en) Data-driven voice user interface
CN113205817B (en) Speech semantic recognition method, system, device and medium
CN111666380A (en) Intelligent calling method, device, equipment and medium
WO2021218029A1 (en) Artificial intelligence-based interview method and apparatus, computer device, and storage medium
CN110059166A (en) Natural expression processing method, response method, equipment and the system of natural intelligence
WO2020124674A1 (en) Method and device for vectorizing translator&#39;s translation personality characteristics
CN110046232A (en) Natural expression processing method, response method, equipment and the system of natural intelligence
CN110059168A (en) The method that man-machine interactive system based on natural intelligence is trained
US20230394247A1 (en) Human-machine collaborative conversation interaction system and method
CN116235177A (en) Systems and methods related to robotic authoring by mining intent from dialogue data using known intent of an associated sample utterance
CN113284502A (en) Intelligent customer service voice interaction method and system
US20190012313A1 (en) Speech converter
CN110059167A (en) Natural expression processing method, response method, equipment and the system of natural intelligence
CN110008317A (en) Natural expression processing method, response method, equipment and the system of natural intelligence
CN113326367B (en) Task type dialogue method and system based on end-to-end text generation
WO2024066920A1 (en) Processing method and apparatus for dialogue in virtual scene, and electronic device, computer program product and computer storage medium
WO2020151652A1 (en) Naturalist intelligence-based natural expression processing method, and response method, device and system, method for training robot, human-computer interaction system, method for training naturalist intelligence-based human-computer interaction system, and end-to-end control method and control system
CN111460106A (en) Information interaction method, device and equipment

Legal Events

Date Code Title Description
EEER Examination request

Effective date: 20160428

MKLA Lapsed

Effective date: 20210616