WO2006115642A1 - Predictive conversion of user input - Google Patents

Predictive conversion of user input Download PDF

Info

Publication number
WO2006115642A1
WO2006115642A1 PCT/US2006/010355 US2006010355W WO2006115642A1 WO 2006115642 A1 WO2006115642 A1 WO 2006115642A1 US 2006010355 W US2006010355 W US 2006010355W WO 2006115642 A1 WO2006115642 A1 WO 2006115642A1
Authority
WO
WIPO (PCT)
Prior art keywords
character set
target language
user input
completions
converted
Prior art date
Application number
PCT/US2006/010355
Other languages
French (fr)
Inventor
Alexander M. Franz
Original Assignee
Google Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Google Inc. filed Critical Google Inc.
Priority to CN2006800225672A priority Critical patent/CN101203849B/en
Priority to KR1020077027118A priority patent/KR101249663B1/en
Priority to JP2008507675A priority patent/JP2008537260A/en
Publication of WO2006115642A1 publication Critical patent/WO2006115642A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/274Converting codes to words; Guess-ahead of partial word inputs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/126Character encoding
    • G06F40/129Handling non-Latin characters, e.g. kana-to-kanji conversion

Definitions

  • the present invention relates generally to conversion of user text input in one language or character set to another language or character set. More specifically, systems and methods for predictive conversion of user text input in one language or character set to another language or character set are disclosed. Description of Related Art
  • Non-target language input devices such as keyboards.
  • non-Roman based languages such as Chinese, Japanese, and Korean (CJK)
  • CJK Chinese, Japanese, and Korean
  • a separate conversion step is generally required in which a conversion engine converts the user's alphabetic string input to the target language.
  • a conversion engine converts the user's alphabetic string input to the target language.
  • one common method for inputting Chinese language text uses phonetic input, e.g., pinyin.
  • Pinyin uses alphabetic characters and has a vocabulary listed in the form of multiple syllable words.
  • a pinyin converter then converts the text written in pinyin to Chinese. For example, the user may enter a phonetic representation in pinyin of the desired Chinese text input using the roman letters on a keyboard.
  • a pinyin conversion engine can then convert the pinyin input to Chinese text.
  • the appropriate Chinese text can then be entered into an application, e.g., a word processing application and/or search engine.
  • Japanese text can be expressed using a variety of different character sets, including hiragana, katakana, and kanji, none of which are easily entered using a typical ASCII keyboard based on the Roman alphabet.
  • One method for inputting Japanese language text uses romaji, a phonetic, Roman-alphabet representation of Japanese.
  • the user can use a word processor such as Ichitaro, from JustSystem Corp. of Tokushima City, Japan, to convert romaji text input to katakana, hiragana, and/or kanji.
  • the Japanese text can then be entered into an application, e.g., a word processing application and/or search engine.
  • Another example of a commercially available conversion engine or program is Input Method Editors by Microsoft for various languages.
  • conversion programs generally require the user to enter the complete representation of the text input.
  • conversions programs are tedious for the user because there is often no one-to-one conversion between the phonetic Roman representation on the one hand and text in the native or target language on the other hand.
  • most Roman representation input entered by the user is ambiguous.
  • the conversion program thus may list multiple possible conversions, e.g., in order of probability, in a separate window or a drop-down list. The conversion engine and/or the user can then select the best or correct conversion.
  • such an input method can often be tedious and time consuming for the user due at least in part to the ambiguous Roman representation user input.
  • the method may generally include converting a partial user input in a non-target language or character set to a converted partial user input in a target language or character set, computing likely possible converted completions of the converted partial user input based on a likely completions module, the possible completion being in the target language or character set, and outputting one or more possible likely possible converted completions.
  • the converting may be based on a dictionary of non-target language or character set to target language or character set mappings.
  • the likely completions module may be a database of user inputs or a subset thereof or may be a model trained by performing statistical analysis on a database of user inputs or a subset thereof.
  • the subset of user inputs may be selected from those from a predetermined geographic location or time period and/or from the user or the current session.
  • the method may also include selecting the likely completions module from multiple likely completions modules.
  • a predictive conversion system generally includes a predictive conversion engine configured to generate a converted partial user input in a target language or character set from a partial user input in a non-target language or character set and to compute likely possible completions of the converted partial user input in the target language or character set based on a likely completions module and a dictionary of non- target language or character set to target language or character set mappings.
  • the dictionary facilitates the predictive conversion engine in converting the partial user input in the non- target language or character set to the converted partial user input in the target language or character set.
  • a computer program product for use in conjunction with a computer system includes a computer readable storage medium on which are stored instructions executable on a computer processor, the instructions generally including receiving the partial user input, the partial user input being in a non-target language or character set, converting the partial user input to a target language or character set to generate a converted partial user input, computing at least one likely possible completion of the converted partial user input based on a likely completions module, the possible completion being in the target language or character set, outputting at least one possible likely possible converted completion.
  • FIG. 1 is a block diagram of an illustrative information retrieval system in which a predictive user input conversion system may be implemented.
  • FIG. 2 is a block diagram of an illustrative computing system or device such as the client or server shown in FIG. 1, for implementing an exemplary predictive user input conversion system.
  • FIG. 3 is a block diagram illustrating the exemplary predictive user input conversion system in more detail.
  • FIG. 4 is a flowchart illustrating an exemplary process for performing predictive user input conversion.
  • FIG. 5 is a flowchart illustrating an exemplary process for training a likely completions module for the predictive user input converter.
  • Predictive conversion generally refers to both a predictive completion of the user text input and conversion from an input non-target language or character set to a target language or character set.
  • a user input A in an input non-target language or character set may be converted to X in the target language or character set and predictively completed to XYZ.
  • a user input A in an input non-target language or character set may be predictively completed to ABC and converted to XYZ in the target language or character set.
  • the systems and methods for predictive conversions of user entries in one language or form to another language or form facilitate users in entering text into a computer system using non-target language input methods or devices such as keyboards.
  • the predictive user entry conversion systems and methods may be adapted to facilitate users in entering text in non-Roman based languages such as Chinese, Japanese, and Korean (CJK) using conventional keyboards with Roman or alphabetic characters.
  • the user may enter a partial input, e.g., a partial representation of a word or character, in a first language or character set, e.g., a phonetic input such as pinyin or romaji, which the predictive user entry conversion system may predictively convert to the target language or character set, e.g., Chinese characters or katakana, hiragana, and/or kanji, and present one or more predictive suggested conversions to the user.
  • the predictive conversion systems and methods may be implemented in an information retrieval system or search engine to predictively convert the user's alphabetic string search input to the target language.
  • the predictively converted user input may then be used, for example, to perform a search for relevant documents.
  • the predictive conversion systems and methods may be implemented in various other suitable applications such as a word processing application or other applications requiring text entry.
  • FIG. 1 is a block diagram of an illustrative information retrieval system 100 in which a predictive user input conversion system may be may be implemented.
  • the system 100 may include multiple client devices 102 connected to multiple servers 104, 105 via a network 106.
  • the client devices 102 may include a browser 110 for accepting user input, and for displaying information that has been received from other systems 102, 104, 105 over the network 106.
  • the servers 104, 105 may include a search engine 112 for accepting user queries transmitted over the network 106, searching a database of documents, and returning results to the user.
  • the network 106 may comprise a local area network (LAN), a wide area network (WAN), a virtual private network (VPN), a telephone network, such as the Public Switched Telephone Network (PSTN), an intranet, the Internet, or a combination of networks.
  • LAN local area network
  • WAN wide area network
  • VPN virtual private network
  • PSTN Public Switched Telephone Network
  • intranet intranet
  • Internet Internet
  • the information retrieval system 100.shown in FIG. 1 includes three client devices 102 and two servers 104, 105 connected via the network 106.
  • client devices may also perform at least some functions of a server and some servers may also perform at least some functions of a client.
  • FIG. 2 is a block diagram of an illustrative computing system or device 200, such as the client 102 or server 104, 105 shown in FIG. 1, for implementing a predictive user input conversion system.
  • the system 200 includes a computing device such as a personal computer, laptop, mainframe, personal digital assistant, cellular telephone, and/or the like.
  • the system 200 may typically include a processor 202, memory 204, a user interface 206, an input/output port 207 for accepting removable storage media 208, a network interface 210, and a bus 212 for connecting the aforementioned elements.
  • the operation of system 200 may typically be controlled by the processor 202 operating under the guidance of programs stored in memory 204.
  • the memory 204 generally includes some combination of computer readable media, such as high-speed random-access memory (RAM) and non- volatile memory such as read-only memory (ROM), a magnetic disk, disk array, and/or tape array.
  • the input/output port 207 may include a disk drive or memory slot for accepting computer-readable media such as floppy diskettes, CD-ROMs, DVDs, memory cards, magnetic tapes, or the like.
  • the user interface 206 may, for example, include a keyboard, mouse, pen, or voice recognition mechanism for entering information, and one or more mechanisms such as a display, printer, speaker, and/or the like for presenting information to a user.
  • the network interface 210 is typically operable to provide a connection between the system 200 and other systems (and/or networks 220) via a wired, wireless, optical, and/or other connection.
  • the system 200 may be configured to perform predictive conversions of user text inputs in one language or character set to another language or character set. These operations may typically be performed in response to processor 202 executing software instructions contained on a computer readable medium such as memory 204.
  • the software instructions may be read into memory 204 from another computer-readable medium, such as data storage device 208, or from another device via communication interface 210 or I/O port 207.
  • the memory 204 may include a variety of programs or modules for controlling the operation of system 200 and performing the predictive conversion and optional search and retrieval techniques described in more detail below.
  • the memory 204 may include a predictive user input conversion application 229, a user input database 230, and a dictionary of source (non-target input) language to target language mappings 231.
  • the dictionary of source language to target language mappings 231 facilitates the predictive conversion application 229 in converting partial user inputs in the non-target language or character set to the target language or character set.
  • the memory 204 may further include a document database and a corresponding index 232.
  • the document database 232 may be a web corpus and the user input database 230 may be a query log.
  • the memory 204 may also include a query search engine 233 for searching the document database 232 based on a query received via the user interface 206 and/or a query received remotely from a user over network 220.
  • the memory 204 may also include one or more programs for predictively converting user query inputs using the techniques described in more detail below, and a user-interface application 234 for operating user interface 206 and/or for serving web pages to remote users over the network 220.
  • FIG. 2 illustrates a system that is primarily software- based, it will be appreciated that in other embodiments special-purpose circuitry may be used in place of, or in combination with, software instructions to implement processes consistent with the present invention. Thus, the present invention is not limited to any specific combination of hardware and software. It should be appreciated that the systems and methods of the present invention can be practiced with devices and/or architectures that lack some of the components shown in FIGS. 1 and 2 and/or that have other components that are not shown. Thus, it should be appreciated that FIGS. 1 and 2 are provided for purposes of illustration, and do not limit the scope of the invention.
  • system 200 is depicted as a single, general-purpose computing device such as a personal computer or a network server, in other embodiments the system 200 may include one or more such systems operating together using distributed computing techniques. In such embodiments, some or all of the components functionality depicted in FIG. 2 may be spread amongst multiple systems at multiple locations and/or operated by multiple parties.
  • a predictive user input conversion application 229 and its associated components including the user input database 230 and the dictionary of source (input) language to target language mappings 231 may be implemented on a system that is separate from the system on which the document database 232 is hosted.
  • the predictive query conversion application 231 may, in some embodiments, be executed on the client, rather than the server. It will be readily apparent that many similar variations could be made to the illustrations shown in FIGS. 1 and 2 without departing from the principles of the present invention.
  • FIG. 3 is a block diagram illustrating a predictive user input conversion system 250.
  • the predictive user input conversion system 250 includes the predictive user input conversion engine 229 in communication with the dictionary of source (non-target) language to target language mappings 231 and the user input database 230.
  • the predictive user input conversion engine 229 may include a module for likely completions 235.
  • the module for likely completions 235 may be a database of user inputs or a subset thereof.
  • the module may be a model trained using, for example, the user input database 230 (e.g., a query log), or a subset thereof.
  • the likely completions module 235 provides likely completed input strings in the target language or character set and their corresponding probabilities.
  • FIG. 4 is a flowchart illustrating a process 260 for performing predictive user input conversion.
  • the process 260 may be repeated for a series of user inputs in a given session, e.g., for each character, word, or phrase in the target language or character set.
  • a partial user input e.g., a prefix
  • the partial user input is in a first non-target language or character set, e.g., Roman or alphabetic characters, that is different from the target language or character set, e.g., Chinese characters.
  • the predictive conversion engine may then attempt to perform the predictive conversion using the partial user input at blocks 264 and 266.
  • the partial user input in the first non-target language or character set is used to generate possible conversions in the target language or character set.
  • the dictionary of the first non-target language or character set to the target language or character set mappings may be used in performing block 264.
  • possible completions of the possible conversions of the partial user input are generated.
  • the predictive conversion engine generally also ranks the completed predictions based on the probabilities of the predictive conversions. The probabilities may be determined based on the database of user inputs, e.g., a query log, as will be described in more detail below.
  • the likely completions module can be used in performing block 266.
  • the process first generates possible conversions of the partial user input to the target language or character set at block 264 and then generates possible completions of the converted partial user input at block 266.
  • block 266 may be performed prior to block 264.
  • the process 260 may generate possible completions of the partial user input in the non-target language or character set at block 266 and convert the possible completions to the target language or character set at block 264.
  • blocks 264 and 266 may be combined into a single integrated predictive conversion process.
  • the process 260 may utilize an integrated predictive conversion data structure or database containing mappings of partial user inputs in the non-target language or character set to predictive conversion in the target language or character set.
  • the database may also contain frequency or other statistical or probabilistic information for the mappings. In such a configuration, the dictionary of the non-target language or character set to the target language or character set mappings would be integrated in the predictive conversion database.
  • One or more parameters may be utilized to affect the operation of blocks 264 and 266 (or variations thereof as described above). For example, one parameter may indicate a preference for longer or shorter predictive completions. Each parameter may be set automatically by they system or process, set to a default value, and/or set by the user.
  • the predictive conversion engine may, in some embodiments, interactively monitor the extent of the partial user input in block 262 as the user enters the input so that the predictive conversion engine may perform the predictive conversion when the user has entered a partial input that is sufficient to generate one or more predictive conversions.
  • the user input is pinyin
  • the user may enter the pinyin for at least one Chinese character before the predictive conversion engine generates predictive conversions.
  • the determination of when the user input is sufficient to generate meaningful and/or useful predictive conversions may be based on the particular user input and the probabilities of the possible conversions of the partial user input and/or of the possible completions of the possible conversions, for example.
  • one or more predictive conversions in the target language or character set are output to the user.
  • a set of top N e.g., 5
  • predictive conversions based on the probabilities of the predictive conversions are output to the user.
  • only the predictive conversion with the highest probability is output to the user.
  • the number of predictive conversions output to the user may be dynamically determined based on the probabilities of the predictive conversions.
  • the predictive conversion engine may be configured to output only the most likely predictive conversion if, for example, the most likely predictive conversion has a probability that is greater than a predetermined threshold and/or is greater than a predetermined multiple of the next likely predictive conversion.
  • the process 260 continues to receive further user input at block 272 so that additional user input can be used by the predictive user input conversion engine to perform the predictive conversion. The process 260 then returns to block 264 to perform the predictive conversion using the additional user input.
  • the probabilities of predictive conversions may be computed by the predictive conversion engine using the likely completions module.
  • the module for likely completions may be manually generated or otherwise predetermined. More typically, the likely completions module may be a likely completions model trained using a database of user inputs and corresponding completed conversions. The database for likely completions may be tailored as appropriate for the application and/or user. For example, for a user entering text into a search engine, the probabilities of the predictive conversions may be computed using query logs for all users, or query logs for users in a same or similar geographic location and/or a same or similar time period.
  • FIG. 5 is a flowchart illustrating an exemplary process 280 for generating a module of likely completions for the predictive user input converter.
  • a log of user inputs in the target language or character set may be maintained to build the user input database.
  • the user input database may be query or search logs associated with a search engine. The database facilitates determining the likelihood of the predicted completed conversions.
  • the user database or a subset thereof may be used to generate the likely completions module.
  • a subset of the user database may be selected in order to generate a module with particular characteristics, e.g., to tailor the module as appropriate for the application and/or the user.
  • a subset of the user database may be selected based on inputs collected from a particular time period, from users in particular geographic location, from the same user, from the same user in the particular session, and/or various other suitable criteria.
  • a module of likely input strings and likely completions e.g., searches) is generated by performing statistical analysis on the database or subset thereof as selected in block 284.
  • the process for training the likely completions module may be performed periodically as the database is updated and/or may be repeated to generate various models so as to tailor each model as appropriate for the application and/or the user.
  • the predictive conversion engine may be integrated into a search engine.
  • the integration may be achieved in various ways.
  • the predictive conversion engine may be implemented as part of the search engine server so that a user only enters a prefix or other partial query, e.g., in roman representation, and the server converts the partial query to the target language or character set and generates one or more likely completions of the conversion (or, as noted above, generates likely completions of the partial query and converts the likely completions to the target language or character set).
  • the server may then return a list of the most likely completed conversions, e.g., a list of the 5 or 10 most likely completed conversions, from which the user may select to perform the search.
  • the server may perform the search based on the most likely completed conversion and return the search results to the user.
  • the predictive conversion engine may be similarly implemented on the client side such as in the user's client program, e.g. in a toolbar for the interface application such as a web browser.
  • the user may begin entering a partial input, such as a query in roman letters, and the client side predictive conversion engine may monitor the user's input and predictively compute one or more possible completed conversions when the user has provided a sufficient amount of information.
  • the user may enter the input and manually or otherwise cause the predictive conversion engine to predictively compute one or more possible completed conversions.
  • the output may be provided, for example, in a pop-up window or a drop-down list from which the user may select one of the possible completed conversions.
  • the predictive conversion engine or system either installed on a server side or a client side, facilitates user input by allowing the user to enter only a partial representation of the desired input in the non-target language.
  • the user only needs to enter a starting sequence, prefix, or other partial representation of the desired input in the non-target language and the predictive conversion system performs the conversion to the target language as well as the predictive completion of the converted partial input.

Abstract

Systems and methods for predictive conversion of user text input in one language or character set to another language or character set are disclosed. The method may generally include converting a partial user input in a non-target language or character set to a converted partial user input in a target language or character set and computing likely possible converted completions of the converted partial user input based on a likely completions module. The converting may be based on a dictionary of non-target language or character set to target language or character set mappings. The likely completions module may be a database or a model trained by performing statistical analysis on a database of user inputs or a subset thereof. The subset of user inputs may be selected from those from a particular geographic location or time period and/or from the user or the user's current session.

Description

PREDICTIVE CONVERSION QF USER INPUT
BACKGROUND OF THE INVENTION
Field of the Invention
The present invention relates generally to conversion of user text input in one language or character set to another language or character set. More specifically, systems and methods for predictive conversion of user text input in one language or character set to another language or character set are disclosed. Description of Related Art
It may be difficult or impossible for a user to enter text directly into a computer system using non-target language input devices such as keyboards. For example, it may be difficult for the user to enter text directly in non-Roman based languages such as Chinese, Japanese, and Korean (CJK) using a conventional keyboard with Roman or alphabetic characters. Instead, a separate conversion step is generally required in which a conversion engine converts the user's alphabetic string input to the target language. As an example, one common method for inputting Chinese language text uses phonetic input, e.g., pinyin.
Pinyin uses alphabetic characters and has a vocabulary listed in the form of multiple syllable words. A pinyin converter then converts the text written in pinyin to Chinese. For example, the user may enter a phonetic representation in pinyin of the desired Chinese text input using the roman letters on a keyboard. A pinyin conversion engine can then convert the pinyin input to Chinese text. The appropriate Chinese text can then be entered into an application, e.g., a word processing application and/or search engine.
As another example, Japanese text can be expressed using a variety of different character sets, including hiragana, katakana, and kanji, none of which are easily entered using a typical ASCII keyboard based on the Roman alphabet. One method for inputting Japanese language text uses romaji, a phonetic, Roman-alphabet representation of Japanese. For example, the user can use a word processor such as Ichitaro, from JustSystem Corp. of Tokushima City, Japan, to convert romaji text input to katakana, hiragana, and/or kanji. The Japanese text can then be entered into an application, e.g., a word processing application and/or search engine. Another example of a commercially available conversion engine or program is Input Method Editors by Microsoft for various languages.
However, conversion programs generally require the user to enter the complete representation of the text input. In addition, such conversions programs are tedious for the user because there is often no one-to-one conversion between the phonetic Roman representation on the one hand and text in the native or target language on the other hand. Instead, most Roman representation input entered by the user is ambiguous. In other words, for most Roman representation user inputs, there is more than one native language conversion that can be made by the conversion program. The conversion program thus may list multiple possible conversions, e.g., in order of probability, in a separate window or a drop-down list. The conversion engine and/or the user can then select the best or correct conversion. As is evident, such an input method can often be tedious and time consuming for the user due at least in part to the ambiguous Roman representation user input.
What is needed is a more efficient and user friendly method for entering input text using a non-target input language or character set and for converting the user input to a target language or character set.
SUMMARY OF THE INVENTION
Systems and methods for predictive conversion of user text input in one language or character set to. another language or character set are disclosed. It should be appreciated that the present invention can be implemented in numerous ways, including as a process, an apparatus, a system, a device, a method, or a computer readable medium such as a computer readable storage medium or a computer network wherein program instructions are sent over optical or electronic communication lines. Several inventive embodiments of the present invention are described below. The method may generally include converting a partial user input in a non-target language or character set to a converted partial user input in a target language or character set, computing likely possible converted completions of the converted partial user input based on a likely completions module, the possible completion being in the target language or character set, and outputting one or more possible likely possible converted completions. The converting may be based on a dictionary of non-target language or character set to target language or character set mappings. The likely completions module may be a database of user inputs or a subset thereof or may be a model trained by performing statistical analysis on a database of user inputs or a subset thereof. The subset of user inputs may be selected from those from a predetermined geographic location or time period and/or from the user or the current session. The method may also include selecting the likely completions module from multiple likely completions modules.
In another embodiment, a predictive conversion system generally includes a predictive conversion engine configured to generate a converted partial user input in a target language or character set from a partial user input in a non-target language or character set and to compute likely possible completions of the converted partial user input in the target language or character set based on a likely completions module and a dictionary of non- target language or character set to target language or character set mappings. The dictionary facilitates the predictive conversion engine in converting the partial user input in the non- target language or character set to the converted partial user input in the target language or character set.
In yet another embodiment, a computer program product for use in conjunction with a computer system includes a computer readable storage medium on which are stored instructions executable on a computer processor, the instructions generally including receiving the partial user input, the partial user input being in a non-target language or character set, converting the partial user input to a target language or character set to generate a converted partial user input, computing at least one likely possible completion of the converted partial user input based on a likely completions module, the possible completion being in the target language or character set, outputting at least one possible likely possible converted completion.
These and other features and advantages of the present invention will be presented in more detail in the following detailed description and the accompanying figures which illustrate by way of example principles of the invention. r
BRIEF DESCRIPTION OF THE DRAWINGS
The present invention will be readily understood by the following detailed description in conjunction with the accompanying drawings, wherein like reference numerals designate like structural elements.
FIG. 1 is a block diagram of an illustrative information retrieval system in which a predictive user input conversion system may be implemented.
FIG. 2 is a block diagram of an illustrative computing system or device such as the client or server shown in FIG. 1, for implementing an exemplary predictive user input conversion system.
FIG. 3 is a block diagram illustrating the exemplary predictive user input conversion system in more detail.
FIG. 4 is a flowchart illustrating an exemplary process for performing predictive user input conversion. FIG. 5 is a flowchart illustrating an exemplary process for training a likely completions module for the predictive user input converter.
DESCRIPTION OF SPECIFIC EMBODIMENTS
Systems and methods for predictive conversion of user text input in one language or character set to another language or character set are disclosed. Predictive conversion generally refers to both a predictive completion of the user text input and conversion from an input non-target language or character set to a target language or character set. For example, a user input A in an input non-target language or character set may be converted to X in the target language or character set and predictively completed to XYZ. Alternatively, a user input A in an input non-target language or character set may be predictively completed to ABC and converted to XYZ in the target language or character set. The following description is presented to enable any person skilled in the art to make and use the invention. Descriptions of specific embodiments and applications are provided only as examples and various modifications will be readily apparent to those skilled in the art. The general principles defined herein may be applied to other embodiments and applications without departing from the spirit and scope of the invention. Thus, the present invention is to be accorded the widest scope encompassing numerous alternatives, modifications and equivalents consistent with the principles and features disclosed herein. For purpose of clarity, details relating to technical material that is known in the technical fields related to the invention have not been described in detail so as not to unnecessarily obscure the present invention.
The systems and methods for predictive conversions of user entries in one language or form to another language or form facilitate users in entering text into a computer system using non-target language input methods or devices such as keyboards. For example, the predictive user entry conversion systems and methods may be adapted to facilitate users in entering text in non-Roman based languages such as Chinese, Japanese, and Korean (CJK) using conventional keyboards with Roman or alphabetic characters. The user may enter a partial input, e.g., a partial representation of a word or character, in a first language or character set, e.g., a phonetic input such as pinyin or romaji, which the predictive user entry conversion system may predictively convert to the target language or character set, e.g., Chinese characters or katakana, hiragana, and/or kanji, and present one or more predictive suggested conversions to the user. As described in the illustrative examples presented herein, the predictive conversion systems and methods may be implemented in an information retrieval system or search engine to predictively convert the user's alphabetic string search input to the target language. The predictively converted user input may then be used, for example, to perform a search for relevant documents. However, it is to be understood that the predictive conversion systems and methods may be implemented in various other suitable applications such as a word processing application or other applications requiring text entry.
FIG. 1 is a block diagram of an illustrative information retrieval system 100 in which a predictive user input conversion system may be may be implemented. The system 100 may include multiple client devices 102 connected to multiple servers 104, 105 via a network 106. The client devices 102 may include a browser 110 for accepting user input, and for displaying information that has been received from other systems 102, 104, 105 over the network 106. The servers 104, 105 may include a search engine 112 for accepting user queries transmitted over the network 106, searching a database of documents, and returning results to the user. The network 106 may comprise a local area network (LAN), a wide area network (WAN), a virtual private network (VPN), a telephone network, such as the Public Switched Telephone Network (PSTN), an intranet, the Internet, or a combination of networks. Merely for illustration, the information retrieval system 100.shown in FIG. 1 includes three client devices 102 and two servers 104, 105 connected via the network 106. However, it will be appreciated that in practice there may be more or fewer client devices, servers, and/or networks, and that some client devices may also perform at least some functions of a server and some servers may also perform at least some functions of a client. FIG. 2 is a block diagram of an illustrative computing system or device 200, such as the client 102 or server 104, 105 shown in FIG. 1, for implementing a predictive user input conversion system. In one embodiment, the system 200 includes a computing device such as a personal computer, laptop, mainframe, personal digital assistant, cellular telephone, and/or the like. The system 200 may typically include a processor 202, memory 204, a user interface 206, an input/output port 207 for accepting removable storage media 208, a network interface 210, and a bus 212 for connecting the aforementioned elements.
The operation of system 200 may typically be controlled by the processor 202 operating under the guidance of programs stored in memory 204. The memory 204 generally includes some combination of computer readable media, such as high-speed random-access memory (RAM) and non- volatile memory such as read-only memory (ROM), a magnetic disk, disk array, and/or tape array. The input/output port 207 may include a disk drive or memory slot for accepting computer-readable media such as floppy diskettes, CD-ROMs, DVDs, memory cards, magnetic tapes, or the like. The user interface 206 may, for example, include a keyboard, mouse, pen, or voice recognition mechanism for entering information, and one or more mechanisms such as a display, printer, speaker, and/or the like for presenting information to a user. The network interface 210 is typically operable to provide a connection between the system 200 and other systems (and/or networks 220) via a wired, wireless, optical, and/or other connection.
As described in more detail below, the system 200 may be configured to perform predictive conversions of user text inputs in one language or character set to another language or character set. These operations may typically be performed in response to processor 202 executing software instructions contained on a computer readable medium such as memory 204. The software instructions may be read into memory 204 from another computer-readable medium, such as data storage device 208, or from another device via communication interface 210 or I/O port 207. As shown in FIG. 2, the memory 204 may include a variety of programs or modules for controlling the operation of system 200 and performing the predictive conversion and optional search and retrieval techniques described in more detail below. For example, the memory 204 may include a predictive user input conversion application 229, a user input database 230, and a dictionary of source (non-target input) language to target language mappings 231. The dictionary of source language to target language mappings 231 facilitates the predictive conversion application 229 in converting partial user inputs in the non-target language or character set to the target language or character set.
If the system 200 is a server for a search engine, for example, such as the server 105 shown in FIG. 1, the memory 204 may further include a document database and a corresponding index 232. The document database 232 may be a web corpus and the user input database 230 may be a query log. The memory 204 may also include a query search engine 233 for searching the document database 232 based on a query received via the user interface 206 and/or a query received remotely from a user over network 220. As shown in FIG. 2, the memory 204 may also include one or more programs for predictively converting user query inputs using the techniques described in more detail below, and a user-interface application 234 for operating user interface 206 and/or for serving web pages to remote users over the network 220. Although FIG. 2 illustrates a system that is primarily software- based, it will be appreciated that in other embodiments special-purpose circuitry may be used in place of, or in combination with, software instructions to implement processes consistent with the present invention. Thus, the present invention is not limited to any specific combination of hardware and software. It should be appreciated that the systems and methods of the present invention can be practiced with devices and/or architectures that lack some of the components shown in FIGS. 1 and 2 and/or that have other components that are not shown. Thus, it should be appreciated that FIGS. 1 and 2 are provided for purposes of illustration, and do not limit the scope of the invention. For example, it should be appreciated that while, for purposes of illustration, the system 200 is depicted as a single, general-purpose computing device such as a personal computer or a network server, in other embodiments the system 200 may include one or more such systems operating together using distributed computing techniques. In such embodiments, some or all of the components functionality depicted in FIG. 2 may be spread amongst multiple systems at multiple locations and/or operated by multiple parties. For example, a predictive user input conversion application 229 and its associated components including the user input database 230 and the dictionary of source (input) language to target language mappings 231 may be implemented on a system that is separate from the system on which the document database 232 is hosted. The predictive query conversion application 231 may, in some embodiments, be executed on the client, rather than the server. It will be readily apparent that many similar variations could be made to the illustrations shown in FIGS. 1 and 2 without departing from the principles of the present invention.
As previously indicated, the systems shown in FIGS. 1 and 2 can be used to facilitate predictive conversion of partial user input. FIG. 3 is a block diagram illustrating a predictive user input conversion system 250. As shown, the predictive user input conversion system 250 includes the predictive user input conversion engine 229 in communication with the dictionary of source (non-target) language to target language mappings 231 and the user input database 230. The predictive user input conversion engine 229 may include a module for likely completions 235. In some implementations, the module for likely completions 235 may be a database of user inputs or a subset thereof. As another example, the module may be a model trained using, for example, the user input database 230 (e.g., a query log), or a subset thereof. The likely completions module 235 provides likely completed input strings in the target language or character set and their corresponding probabilities.
FIG. 4 is a flowchart illustrating a process 260 for performing predictive user input conversion. The process 260 may be repeated for a series of user inputs in a given session, e.g., for each character, word, or phrase in the target language or character set. At block 262, at least a partial user input, e.g., a prefix, in the non-target language or character set is received. As noted, the partial user input is in a first non-target language or character set, e.g., Roman or alphabetic characters, that is different from the target language or character set, e.g., Chinese characters. The predictive conversion engine may then attempt to perform the predictive conversion using the partial user input at blocks 264 and 266. Specifically, at block 264, the partial user input in the first non-target language or character set is used to generate possible conversions in the target language or character set. For example, the dictionary of the first non-target language or character set to the target language or character set mappings may be used in performing block 264. At block 266, possible completions of the possible conversions of the partial user input are generated. Note that at block 266, the predictive conversion engine generally also ranks the completed predictions based on the probabilities of the predictive conversions. The probabilities may be determined based on the database of user inputs, e.g., a query log, as will be described in more detail below. In particular, the likely completions module can be used in performing block 266.
In the example shown, the process first generates possible conversions of the partial user input to the target language or character set at block 264 and then generates possible completions of the converted partial user input at block 266. It is to be understood that block 266 may be performed prior to block 264. In particular, the process 260 may generate possible completions of the partial user input in the non-target language or character set at block 266 and convert the possible completions to the target language or character set at block 264. As yet another example, blocks 264 and 266 may be combined into a single integrated predictive conversion process. In particular, the process 260 may utilize an integrated predictive conversion data structure or database containing mappings of partial user inputs in the non-target language or character set to predictive conversion in the target language or character set. The database may also contain frequency or other statistical or probabilistic information for the mappings. In such a configuration, the dictionary of the non-target language or character set to the target language or character set mappings would be integrated in the predictive conversion database.
One or more parameters may be utilized to affect the operation of blocks 264 and 266 (or variations thereof as described above). For example, one parameter may indicate a preference for longer or shorter predictive completions. Each parameter may be set automatically by they system or process, set to a default value, and/or set by the user.
Note that the predictive conversion engine may, in some embodiments, interactively monitor the extent of the partial user input in block 262 as the user enters the input so that the predictive conversion engine may perform the predictive conversion when the user has entered a partial input that is sufficient to generate one or more predictive conversions. For example, where the user input is pinyin, the user may enter the pinyin for at least one Chinese character before the predictive conversion engine generates predictive conversions. The determination of when the user input is sufficient to generate meaningful and/or useful predictive conversions may be based on the particular user input and the probabilities of the possible conversions of the partial user input and/or of the possible completions of the possible conversions, for example.
Next, at block 268, one or more predictive conversions in the target language or character set are output to the user. In one embodiment, a set of top N (e.g., 5) predictive conversions based on the probabilities of the predictive conversions are output to the user. In another embodiment, only the predictive conversion with the highest probability is output to the user. The number of predictive conversions output to the user may be dynamically determined based on the probabilities of the predictive conversions. For example, the predictive conversion engine may be configured to output only the most likely predictive conversion if, for example, the most likely predictive conversion has a probability that is greater than a predetermined threshold and/or is greater than a predetermined multiple of the next likely predictive conversion. At decision block 270, if the user selects one of the predictive conversions output to the user, the selected predictive conversion may be adopted as the user's desired conversion and the process for predictive user input conversion 260 is complete. However, if the user does not make a selection from the predictive conversions output in block 268 as determined at decision block 270, then the process 260 continues to receive further user input at block 272 so that additional user input can be used by the predictive user input conversion engine to perform the predictive conversion. The process 260 then returns to block 264 to perform the predictive conversion using the additional user input.
As noted above, the probabilities of predictive conversions may be computed by the predictive conversion engine using the likely completions module. In some embodiments, the module for likely completions may be manually generated or otherwise predetermined. More typically, the likely completions module may be a likely completions model trained using a database of user inputs and corresponding completed conversions. The database for likely completions may be tailored as appropriate for the application and/or user. For example, for a user entering text into a search engine, the probabilities of the predictive conversions may be computed using query logs for all users, or query logs for users in a same or similar geographic location and/or a same or similar time period. As another example, the predictive conversion engine may be personalized by using only the user's own inputs or searches, e.g., the user's current search session or the user's past search sessions. FIG. 5 is a flowchart illustrating an exemplary process 280 for generating a module of likely completions for the predictive user input converter. At block 282, a log of user inputs in the target language or character set may be maintained to build the user input database. As noted, the user input database may be query or search logs associated with a search engine. The database facilitates determining the likelihood of the predicted completed conversions. At block 284, the user database or a subset thereof may be used to generate the likely completions module. In particular, a subset of the user database may be selected in order to generate a module with particular characteristics, e.g., to tailor the module as appropriate for the application and/or the user. For example, a subset of the user database may be selected based on inputs collected from a particular time period, from users in particular geographic location, from the same user, from the same user in the particular session, and/or various other suitable criteria. At block 286, a module of likely input strings and likely completions (e.g., searches) is generated by performing statistical analysis on the database or subset thereof as selected in block 284. The process for training the likely completions module may be performed periodically as the database is updated and/or may be repeated to generate various models so as to tailor each model as appropriate for the application and/or the user.
As noted, the predictive conversion engine may be integrated into a search engine. The integration may be achieved in various ways. For example, the predictive conversion engine may be implemented as part of the search engine server so that a user only enters a prefix or other partial query, e.g., in roman representation, and the server converts the partial query to the target language or character set and generates one or more likely completions of the conversion (or, as noted above, generates likely completions of the partial query and converts the likely completions to the target language or character set). The server may then return a list of the most likely completed conversions, e.g., a list of the 5 or 10 most likely completed conversions, from which the user may select to perform the search. Alternatively or additionally, the server may perform the search based on the most likely completed conversion and return the search results to the user.
Instead of providing the predictive conversion engine on the server side, the predictive conversion engine may be similarly implemented on the client side such as in the user's client program, e.g. in a toolbar for the interface application such as a web browser. In such a configuration, the user may begin entering a partial input, such as a query in roman letters, and the client side predictive conversion engine may monitor the user's input and predictively compute one or more possible completed conversions when the user has provided a sufficient amount of information. Alternatively, the user may enter the input and manually or otherwise cause the predictive conversion engine to predictively compute one or more possible completed conversions. The output may be provided, for example, in a pop-up window or a drop-down list from which the user may select one of the possible completed conversions. As is evident, the predictive conversion engine or system, either installed on a server side or a client side, facilitates user input by allowing the user to enter only a partial representation of the desired input in the non-target language. In other words, the user only needs to enter a starting sequence, prefix, or other partial representation of the desired input in the non-target language and the predictive conversion system performs the conversion to the target language as well as the predictive completion of the converted partial input. While the exemplary embodiments of the present invention are described and illustrated herein, it will be appreciated that they are merely illustrative and that modifications can be made to these embodiments without departing from the spirit and scope of the invention. Thus, the scope of the invention is intended to be defined only in terms of the following claims as may be amended, with each claim being expressly incorporated into this Description of Specific Embodiments as an embodiment of the invention.

Claims

CLAIMSWhat is claimed is:
1. A method for predictively converting a partial user input, comprising: receiving the partial user input from a user, the partial user input being in a non-target language or character set; performing predictive conversion of the partial user input based on a likely completions module to generate at least one possible converted completion of the partial user input, the at least one possible converted completion being in a target language or character set; and outputting one or more of the at least one possible converted completion.
2. The method of claim 1, wherein the performing includes: converting the partial user input to the target language or character set to generate a converted partial user input; and computing the at least one possible converted completion of the converted partial user input based on the likely completions module.
3. The method of claim 1 , wherein the performing includes: computing at least one possible partial user input completion in the non-target language or character set based on the likely completions module, the likely completions module being in the non-target language or character set; and converting the at least one possible partial user input completion to the target language or character set to generate the at least one possible converted completion.
4. The method of claim 1 , wherein the performing is further based on a dictionary of non-target language or character set to target language or character set mappings.
5. The method of claim 1 , further comprising: training the likely completions module based on a database of user inputs.
6. The method of claim 5, wherein the user inputs in the database are in one of the target language or character set and the non-target language or character set.
7. The method of claim 5, wherein the training includes performing statistical analysis on at least a portion of the user inputs in the database to determine likely completions of the partial user inputs.
8. The method of claim 5, wherein the training is based on a subset of the user inputs in the database, the subset of the user inputs being associated with at least one of: a geographic location, a time period, and the user from whom the partial user input is received.
9. The method of claim 1 , further comprising: selecting the likely completions module from a plurality of likely completions modules.
10. The method of claim 1 , wherein the partial user input is a partial user query to a search engine.
11. The method of claim 1 , further comprising: selecting a subset of possible converted completions based on probabilities of the possible converted completions, the outputting being outputting the subset of the possible converted completions.
12. A predictive conversion system, comprising: a predictive conversion engine configured to perform predictive conversion of a partial user input in a non-target language or character set based on a likely completions module to generate at least one possible converted completion of the partial user input, the at least one possible converted completion being in a target language or character set; and a dictionary of non-target language or character set to target language or character set mappings to facilitate the predictive conversion engine in converting from the non-target language or character set to the target language or character set.
13. The system of claim 12, wherein the predictive conversion engine is further configured to generate, based on the dictionary, converted partial user input in the target language or character set from the partial user input in the non-target language or character set.
14. The system of claim 12, wherein the predictive conversion engine is further configured to compute at least one possible partial user input completion in the non-target language or character set based on the likely completions module, the likely completions module being in the non-target language or character set, and to convert the at least one possible partial user input completion to the target language or character set, base on the dictionary, to generate the at least one possible converted completion.
15. The system of claim 12, further comprising: a training engine configured to train the likely completions module based on a database of user inputs.
16. The system of claim 15, wherein the user inputs in the database are in one of the target language or character set and the non-target language or character set.
17. The system of claim 15, wherein the training engine is further configured to perform statistical analysis on at least a portion of the user inputs in the database to determine likely completions of the partial user inputs.
18. The system of claim 15, wherein the training engine is further configured to train the likely completions module based on a subset of the user inputs in the database, the subset of the user inputs being associated with one or more of: a geographic location, a time period, and a user.
19. The system of claim 12, wherein the predictive conversion engine is further configured to select the likely completions module from a plurality of likely completions modules.
20. The system of claim 12, wherein the partial user input is a partial user query to a search engine.
21. The system of claim 12, wherein the predictive conversion engine is further configured to select a subset of possible converted completions based on probabilities of the possible converted completions as output.
22. A computer program product for use in conjunction with a computer system, the computer program product comprising a computer readable storage medium on which are stored instructions executable on a computer processor, the instructions including: receiving a partial user input, the partial user input being in a non-target language or character set; performing predictive conversion of the partial user input based on a likely completions module to generate at least one possible converted completion of the partial user input, the at least one possible converted completion being in a target language or character set; and outputting one or more of the at least one possible converted completion.
23. The computer program product of claim 22, wherein the performing includes converting the partial user input to the target language or character set to generate a converted partial user input and computing the at least one possible converted completion of the converted partial user input based on the likely completions module.
24. The computer program product of claim 22, wherein the performing includes: computing at least one possible partial user input completion in the non-target language or character set based on the likely completions module, the likely completions module being in the non-target language or character set; and converting the at least one possible partial user input completion to the target language or character set to generate the at least one possible converted completion.
25. The computer program product of claim 22, wherein the performing is further based on a dictionary of non-target language or character set to target language or character set mappings.
26. The computer program product of claim 22, the instructions further include: training the likely completions module based on a database of user inputs.
27. The computer program product of claim 26, wherein the user inputs in the database are in one of the target language or character set and the non-target language or character set.
28. The computer program product of claim 26, wherein the training includes performing statistical analysis on at least a portion of the user input in the database to determine likely completions of converted partial user inputs.
29. The computer program product of claim 26, wherein the training is based on a subset of the user inputs in the database, the subset of user inputs being associated with at least one of: a geographic location, a time period, and the user from whom the partial user input is received.
30. The computer program product of claim 22, the instructions further include: selecting the likely completions module from a plurality of likely completions modules.
31. The computer program product of claim 22, wherein the partial user input is a partial user query to a search engine.
32. The computer program product of claim 22, the instructions further include: selecting a subset of possible converted completions based on probabilities of the possible converted completions, the outputting being outputting the subset of the possible converted completions.
PCT/US2006/010355 2005-04-21 2006-03-22 Predictive conversion of user input WO2006115642A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CN2006800225672A CN101203849B (en) 2005-04-21 2006-03-22 Predictive conversion of user input
KR1020077027118A KR101249663B1 (en) 2005-04-21 2006-03-22 Predictive conversion of user input
JP2008507675A JP2008537260A (en) 2005-04-21 2006-03-22 Predictive conversion of user input

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/111,583 2005-04-21
US11/111,583 US7506254B2 (en) 2005-04-21 2005-04-21 Predictive conversion of user input

Publications (1)

Publication Number Publication Date
WO2006115642A1 true WO2006115642A1 (en) 2006-11-02

Family

ID=36829761

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2006/010355 WO2006115642A1 (en) 2005-04-21 2006-03-22 Predictive conversion of user input

Country Status (5)

Country Link
US (1) US7506254B2 (en)
JP (1) JP2008537260A (en)
KR (1) KR101249663B1 (en)
CN (1) CN101203849B (en)
WO (1) WO2006115642A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8205155B2 (en) 2007-02-28 2012-06-19 Author-It Labs Limited Text management software
CN105069064A (en) * 2015-07-29 2015-11-18 百度在线网络技术(北京)有限公司 Vocabulary acquisition method and apparatus and vocabulary push method and apparatus

Families Citing this family (48)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8972444B2 (en) 2004-06-25 2015-03-03 Google Inc. Nonstandard locality-based text entry
US8392453B2 (en) * 2004-06-25 2013-03-05 Google Inc. Nonstandard text entry
US7840399B2 (en) * 2005-04-07 2010-11-23 Nokia Corporation Method, device, and computer program product for multi-lingual speech recognition
US8904282B2 (en) * 2005-04-21 2014-12-02 Motorola Mobility Llc Electronic device having capability for interpreting user inputs and method therefor
US10735576B1 (en) * 2005-07-14 2020-08-04 Binj Laboratories, Inc. Systems and methods for detecting and controlling transmission devices
US7831913B2 (en) * 2005-07-29 2010-11-09 Microsoft Corporation Selection-based item tagging
US7861164B2 (en) * 2005-11-03 2010-12-28 Bin Qin Method to sequentially encode PINYIN of Chinese character with few symbols
JP2007193438A (en) * 2006-01-17 2007-08-02 Casio Comput Co Ltd Electronic dictionary device and search control program
US8762358B2 (en) * 2006-04-19 2014-06-24 Google Inc. Query language determination using query terms and interface language
US8442965B2 (en) 2006-04-19 2013-05-14 Google Inc. Query language identification
US20080211777A1 (en) * 2007-03-01 2008-09-04 Microsoft Corporation Stroke number input
US8316295B2 (en) * 2007-03-01 2012-11-20 Microsoft Corporation Shared language model
US8677237B2 (en) * 2007-03-01 2014-03-18 Microsoft Corporation Integrated pinyin and stroke input
US8413049B2 (en) * 2007-08-31 2013-04-02 Research In Motion Limited Handheld electronic device and associated method enabling the generation of a proposed character interpretation of a phonetic text input in a text disambiguation environment
US8010465B2 (en) * 2008-02-26 2011-08-30 Microsoft Corporation Predicting candidates using input scopes
US20090249198A1 (en) * 2008-04-01 2009-10-01 Yahoo! Inc. Techniques for input recogniton and completion
US20090292527A1 (en) * 2008-05-22 2009-11-26 Travelocity.Com Lp Methods, Apparatuses and Computer Program Products for Receiving and Utilizing Multidimensional Data Via A Phrase
US8745051B2 (en) * 2008-07-03 2014-06-03 Google Inc. Resource locator suggestions from input character sequence
JP5501581B2 (en) * 2008-07-25 2014-05-21 シャープ株式会社 Information processing apparatus and information processing method
JP4827950B2 (en) * 2008-07-31 2011-11-30 富士通株式会社 Server device
GB0905457D0 (en) 2009-03-30 2009-05-13 Touchtype Ltd System and method for inputting text into electronic devices
US9424246B2 (en) 2009-03-30 2016-08-23 Touchtype Ltd. System and method for inputting text into electronic devices
US10191654B2 (en) 2009-03-30 2019-01-29 Touchtype Limited System and method for inputting text into electronic devices
US20100325130A1 (en) * 2009-06-19 2010-12-23 Microsoft Corporation Media asset interactive search
WO2011003232A1 (en) * 2009-07-07 2011-01-13 Google Inc. Query parsing for map search
US8782556B2 (en) * 2010-02-12 2014-07-15 Microsoft Corporation User-centric soft keyboard predictive technologies
US8327326B2 (en) * 2010-06-15 2012-12-04 Microsoft Corporation Inserting closing code constructs
US9626456B2 (en) * 2010-10-08 2017-04-18 Warner Bros. Entertainment Inc. Crowd sourcing for file recognition
CN102253929A (en) * 2011-06-03 2011-11-23 北京搜狗科技发展有限公司 Method and device for prompting user to input characters
WO2013029239A1 (en) * 2011-08-30 2013-03-07 宇龙计算机通信科技(深圳)有限公司 Dictionary database update device, input system, input method, and terminal
US20130159919A1 (en) 2011-12-19 2013-06-20 Gabriel Leydon Systems and Methods for Identifying and Suggesting Emoticons
US8818791B2 (en) * 2012-04-30 2014-08-26 Google Inc. Techniques for assisting a user in the textual input of names of entities to a user device in multiple different languages
JP5805907B2 (en) * 2013-07-30 2015-11-10 京セラドキュメントソリューションズ株式会社 Display device and image forming apparatus
US9760624B1 (en) 2013-10-18 2017-09-12 Google Inc. Automatic selection of an input language
US20160283582A1 (en) * 2013-11-04 2016-09-29 Beijing Qihoo Technology Company Limited Device and method for detecting similar text, and application
CN105095191A (en) * 2014-04-22 2015-11-25 富士通株式会社 Method and device for assisted translation based on multi-word units
US9043196B1 (en) 2014-07-07 2015-05-26 Machine Zone, Inc. Systems and methods for identifying and suggesting emoticons
US9313219B1 (en) * 2014-09-03 2016-04-12 Trend Micro Incorporated Detection of repackaged mobile applications
US9762385B1 (en) 2015-07-20 2017-09-12 Trend Micro Incorporated Protection of program code of apps of mobile computing devices
CN105138498A (en) * 2015-08-03 2015-12-09 小米科技有限责任公司 Character information output method and apparatus
US9916448B1 (en) 2016-01-21 2018-03-13 Trend Micro Incorporated Detection of malicious mobile apps
GB201610984D0 (en) 2016-06-23 2016-08-10 Microsoft Technology Licensing Llc Suppression of input images
US10375576B1 (en) 2016-09-09 2019-08-06 Trend Micro Incorporated Detection of malware apps that hijack app user interfaces
CN107844199B (en) * 2016-09-18 2022-03-04 北京搜狗科技发展有限公司 Input method, system and device for inputting
CN108427508B (en) * 2017-02-15 2024-01-19 北京搜狗科技发展有限公司 Input method and device, and method and device for establishing local area network word stock
US11250221B2 (en) * 2019-03-14 2022-02-15 Sap Se Learning system for contextual interpretation of Japanese words
CN110737808A (en) * 2019-09-30 2020-01-31 支付宝(杭州)信息技术有限公司 Chinese character string statistical method and system based on block chain
JP2021144271A (en) * 2020-03-10 2021-09-24 オムロン株式会社 Character input apparatus, character input method, and character input program

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5953541A (en) * 1997-01-24 1999-09-14 Tegic Communications, Inc. Disambiguating system for disambiguating ambiguous input sequences by displaying objects associated with the generated input sequences in the order of decreasing frequency of use
US6377965B1 (en) * 1997-11-07 2002-04-23 Microsoft Corporation Automatic word completion system for partially entered data
US6564213B1 (en) * 2000-04-18 2003-05-13 Amazon.Com, Inc. Search query autocompletion
EP1359516A2 (en) * 2002-04-16 2003-11-05 Fujitsu Limited Processing character information
WO2006055120A2 (en) * 2004-11-12 2006-05-26 Google, Inc. Method and system for autocompletion for languages having ideographs and phonetic characters

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08212217A (en) * 1995-02-08 1996-08-20 Toshiba Corp Knowledge accumulation method and natural language processing system using knowledge accumulation method
JPH11338858A (en) * 1998-05-22 1999-12-10 Toshiba Corp Device and method for predicting input and storage medium recording input predictive program
US6356866B1 (en) * 1998-10-07 2002-03-12 Microsoft Corporation Method for converting a phonetic character string into the text of an Asian language
JP2001005807A (en) * 1999-06-24 2001-01-12 Makoto Iriuchijima Language for world word candidate and registration converting method for translation related word thereof
US7165019B1 (en) * 1999-11-05 2007-01-16 Microsoft Corporation Language input architecture for converting one text form to another text form with modeless entry
US6668085B1 (en) * 2000-08-01 2003-12-23 Xerox Corporation Character matching process for text converted from images
US7165022B2 (en) * 2001-06-13 2007-01-16 Fujitsu Limited Chinese language input system
CA2504423C (en) * 2002-10-31 2011-10-18 Arizan Corporation Methods and apparatus for summarizing document content for mobile communication devices
US8285537B2 (en) * 2003-01-31 2012-10-09 Comverse, Inc. Recognition of proper nouns using native-language pronunciation
US7395203B2 (en) * 2003-07-30 2008-07-01 Tegic Communications, Inc. System and method for disambiguating phonetic input
JP2005107638A (en) * 2003-09-29 2005-04-21 Casio Comput Co Ltd Communication terminal, and communication terminal processing program
JP4012143B2 (en) * 2003-12-16 2007-11-21 キヤノン株式会社 Information processing apparatus and data input method
WO2005106705A2 (en) * 2004-04-26 2005-11-10 John Francis Glosson Method, system, and software for embedding metadata objects concomitantly with linguistic content

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5953541A (en) * 1997-01-24 1999-09-14 Tegic Communications, Inc. Disambiguating system for disambiguating ambiguous input sequences by displaying objects associated with the generated input sequences in the order of decreasing frequency of use
US6377965B1 (en) * 1997-11-07 2002-04-23 Microsoft Corporation Automatic word completion system for partially entered data
US6564213B1 (en) * 2000-04-18 2003-05-13 Amazon.Com, Inc. Search query autocompletion
EP1359516A2 (en) * 2002-04-16 2003-11-05 Fujitsu Limited Processing character information
WO2006055120A2 (en) * 2004-11-12 2006-05-26 Google, Inc. Method and system for autocompletion for languages having ideographs and phonetic characters

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"METHOD OF MODELESS WORD COMPLETION", IBM TECHNICAL DISCLOSURE BULLETIN, IBM CORP. NEW YORK, US, vol. 39, no. 2, 1 February 1996 (1996-02-01), pages 155 - 156, XP000559855, ISSN: 0018-8689 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8205155B2 (en) 2007-02-28 2012-06-19 Author-It Labs Limited Text management software
CN105069064A (en) * 2015-07-29 2015-11-18 百度在线网络技术(北京)有限公司 Vocabulary acquisition method and apparatus and vocabulary push method and apparatus

Also Published As

Publication number Publication date
US7506254B2 (en) 2009-03-17
JP2008537260A (en) 2008-09-11
US20060241933A1 (en) 2006-10-26
KR20080000679A (en) 2008-01-02
KR101249663B1 (en) 2013-04-01
CN101203849B (en) 2010-06-16
CN101203849A (en) 2008-06-18

Similar Documents

Publication Publication Date Title
US7506254B2 (en) Predictive conversion of user input
US8745051B2 (en) Resource locator suggestions from input character sequence
JP5634860B2 (en) Input method editor with second language mode
CN102449579B (en) All-in-one chinese character input method
JP4717821B2 (en) Method for searching using a query written in a different character set and / or language than the target page
KR101465770B1 (en) Word probability determination
KR100656736B1 (en) System and method for disambiguating phonetic input
KR100891358B1 (en) System and its method for inputting character by predicting character sequence of user's next input
US20090144049A1 (en) Method and system for adaptive transliteration
WO2012000335A1 (en) Input method and device combined with application interfaces
US6985147B2 (en) Information access method, system and storage medium
US7212967B2 (en) Chinese phonetic transcription input system and method with comparison function for imperfect and fuzzy phonetic transcriptions
US20140067731A1 (en) Multi-dimensional information entry prediction
JP2000298667A (en) Kanji converting device by syntax information
WO2006122361A1 (en) A personal learning system
JP3664383B2 (en) Japanese input system, Japanese input method, and recording medium
JP2018101224A (en) Searching apparatus, searching method, and program
JP5817602B2 (en) Character input device, function execution method by character input, and character input program
JPH10247192A (en) Kana-to-kanji conversion device
JP2004164120A (en) Extracting device, character string inputting device and program
Chaware et al. EVALUATION OF PHONETIC MATCHING APPROACHES FOR HINDI AND MARATHI: INFORMATION RETRIEVAL
JP2012133754A (en) Character string conversion device, character string conversion method, program and recording medium
KR20040016715A (en) Chinese phonetic transcription input system and method with comparison function for imperfect and fuzzy phonetic transcriptions
JP2011129159A (en) Method and system providing function which supports japanese single character input
JPS63206860A (en) Information processor

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200680022567.2

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application
ENP Entry into the national phase

Ref document number: 2008507675

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

NENP Non-entry into the national phase

Ref country code: RU

WWE Wipo information: entry into national phase

Ref document number: 1020077027118

Country of ref document: KR

122 Ep: pct application non-entry in european phase

Ref document number: 06739234

Country of ref document: EP

Kind code of ref document: A1