US20100063968A1 - Structured natural language query and knowledge system - Google Patents

Structured natural language query and knowledge system Download PDF

Info

Publication number
US20100063968A1
US20100063968A1 US12/619,111 US61911109A US2010063968A1 US 20100063968 A1 US20100063968 A1 US 20100063968A1 US 61911109 A US61911109 A US 61911109A US 2010063968 A1 US2010063968 A1 US 2010063968A1
Authority
US
United States
Prior art keywords
relational
variable
user
query
argument
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US12/619,111
Inventor
Phillip Sheu
Atsushi Kitazawa
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Biomedical Objects Inc
Original Assignee
Biomedical Objects Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Biomedical Objects Inc filed Critical Biomedical Objects Inc
Priority to US12/619,111 priority Critical patent/US20100063968A1/en
Publication of US20100063968A1 publication Critical patent/US20100063968A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2452Query translation
    • G06F16/24522Translation of natural language queries to structured queries
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99932Access augmentation or optimizing
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • Y10S707/99934Query formulation, input preparation, or translation
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • Y10S707/99935Query augmenting and refining, e.g. inexact access

Definitions

  • the invention relates to the field of relational, object-oriented and object relational databases. More particularly, the invention relates to a structured natural language database query and knowledge system.
  • Relational databases have been used widely for many years.
  • a relational database organizes data into tables that include fields. Two tables that include a same field are related to each other. Compared to the “flat file” approach that stores all data into a single file, the relational approach of tables is more flexible.
  • SQL Structured Query Language
  • employee.salary > 100000 Display employee.name employee.salary. End Or: Select name, salary From employee Where employee.salary > 100000
  • “employee” is the table name
  • “name” and “salary” are fields of the “employee” table
  • “where employee.salary>100000” is a condition and also a qualification
  • “Display employee.name employee.salary” and “Select name, salary” are commands.
  • a qualification is a condition or a plurality of conditions connected by logical connectors such as “and”, “or”, “and not” or “or not”.
  • Object-oriented databases organize data into objects.
  • An object can have attributes, which can also be objects. The recursive nature of an object permits ease of manipulation. Objects can inherit characteristics from other objects, making it easier to create new objects based on existing ones.
  • An object can be associated with a set of procedures (methods) to manipulate its data.
  • ANSI SQL-99 extends the conventional SQL query language to allow tables and fields to be manipulated as objects.
  • a typical ANSI SQL-99 query is expressed as:
  • a condition is a method with the method's arguments applied to a variable.
  • An argument of the “select” command can be a variable, an attribute of a variable or a method with its arguments applied to a variable.
  • ANSI SQL-99 still has significant limitations. As described below in more detail, ANSI SQL-99 allows only “range” type variables and does not allow “set,” “bag,” or temporary variables. The scope of a variable is determined by its type (which is usually a relational table or a class of objects) and cannot be an arbitrary set of objects. Moreover, a condition under SQL-99 does not allow the values produced by a method, whether produced as its returned value or as its parameters, to be used in other conditions. In addition, a method that does not return a logical “true” or “false” value is not allowed to be a condition. Therefore, it would be desirable to introduce a more general and more powerful object relational query language.
  • relational algebra has significant limitations too. Under relational algebra, an expression is simply a relation such as a flat file or a set of records whose fields are primitive values such as integers, floating point numbers and text strings. Moreover, a condition of the “select” operator is restricted to:
  • a structured natural language query and knowledge system is provided to allow a user who lacks programming skills to specify a database query in the form of a structured natural language sentence.
  • the user can also specify a rule in the form of a structured natural language sentence.
  • An improved object relational query language and an object relational algebra are also introduced.
  • the structured natural language is defined by the object relational query language and the object relational algebra.
  • One aspect of the invention relates to a computer-implemented method of composing a structured natural language database query.
  • a user is prompted to select a command from a set of defined commands and to specify one or more arguments for the command.
  • the selected command and arguments are combined to form a verb phrase.
  • the user is also prompted to select zero, one or more conditions from a set of defined conditions and to specify zero, one or more parameters for each selected condition.
  • Each selected condition and its specified parameters are combined into an adjective phrase.
  • the verb phrase and the selected adjective phrases are combined into a structured natural language database query.
  • the structured natural language query is automatically translated into formal query text to be executed by a formal query processing module.
  • the structured natural language query is parsed to identify a verb phrase and zero, one or more adjective phrases. After finding a defined command query text that corresponds to the parsed verb phrase and finding a defined qualification query text that corresponds to each of the parsed adjective phrases, the found query texts are combined into a translated formal query text to be processed by the query processing module.
  • a first variable can be defined as a range variable and a second variable can be defined as a set variable.
  • a command and a qualification are also defined. The defined command, qualification and variables are combined to form an object relational query. Variables can also be defined as temporary variables or bag variables.
  • Still another aspect of the invention relates to a computer-implemented method of creating an object relational query for a database based on algebraic expressions.
  • a query is built upon one or more expressions.
  • An expression is a set of objects.
  • a general method that returns as output a set of objects is an expression.
  • a select operator that selects from a set of objects a subset based on a qualification is also an expression. Because an argument of a general method can be a set of objects, general methods can be nested to produce another expression.
  • Yet another aspect of the invention relates to a computer-implemented method of processing queries expressed in the improved object relational query language or the object relational algebra for a relational, object-relational or object-oriented database system.
  • Another aspect of the invention relates to a computer-implemented method of grouping structured natural language queries, structured natural language rules or both queries and rules into a macro that in turn can be used as a command or condition for other queries or rules.
  • FIG. 1 illustrates one embodiment of a structured natural language system.
  • FIG. 2 illustrates one embodiment of a process of defining a command.
  • FIG. 3 illustrates one embodiment of a process of defining a condition.
  • FIG. 4 illustrates one embodiment of a process of composing a structured natural language query.
  • FIG. 5 illustrates example user interface screens that prompts a user or a programmer to define an object variable.
  • FIG. 6A is an example user interface screen that prompts a user to select a command.
  • FIG. 6B is another example user interface screen that prompts a user to select a command.
  • FIG. 7 is an example screen that prompts the user to select command arguments and optionally name a result.
  • FIG. 8A is an example screen that prompts the user to select a condition.
  • FIG. 8B is another example screen that prompts the user to select a condition.
  • FIG. 9 is an example screen that prompts the user to specify parameters for the selected condition.
  • FIG. 10 is an example screen that displays a composed query.
  • FIG. 11 is an example screen that prompts the user to type a structured natural language sentence.
  • FIG. 12 is an example screen that displays a rule in structured natural language.
  • FIG. 1 illustrates one embodiment of a structured natural language query and knowledge system 100 .
  • the structured natural language system 100 allows a user without programming skills to create a structured natural language query to operate on a database, with the database operation preferably done in real time.
  • the system 100 can also allow a user to convert a relational database into an object-relational database.
  • a structured natural language query may include a verb phrase as a command and zero, one or more adjective phrases as a qualification.
  • a variable definition phrase may also be included in the structured natural language query.
  • the term “query” is broadly defined herein to include not only commands that select or display data, but also commands that create, update or delete data.
  • One embodiment of the system 100 is programmed in Java.
  • the system 100 can also be programmed in C, C++ or other languages, operating platforms or application packages.
  • the system 100 includes a verb phrase definition module 110 and an adjective phrase definition module 120 .
  • the verb phrase definition module 110 allows a programmer to define verb phrases to correspond to formal query texts.
  • the adjective phrase definition module 120 allows a programmer to define adjective phrases to correspond to formal query texts.
  • Formal query texts can be methods, functions, subroutines, procedures, SQL query texts, or combinations of the above.
  • the system 100 also includes a structured natural language composition module 130 that allows a user to compose a structured natural language query using the defined verb phrases and adjective phrases.
  • the system 100 further includes a translation module 140 to translate the structured natural language query into formal query text, and a formal query processing module 150 that processes the translated formal query text to return query results from the underlying database 160 , which can be a commercial database system.
  • the system 100 also includes an optional object schema definition module 101 .
  • the module 101 allows a user or a programmer to define an object class, including the number of attributes and the type of each attribute for an object class. If the underlying database 160 for the system 100 is a relational database, the object schema definition module 101 allows a user or programmer to associate an object class with a table in the relational database, thus allowing the user or programmer to define the relationships among the tables of the relational database in terms of objects.
  • a programmer or a user can define a class “polygon” to be associated with the table “POLYGON” in a relational database.
  • the table “POLYGON” includes a “VERTICES” column.
  • the class “polygon” is defined with an “ID” attribute of text string type, and a “vertices” attribute of “set of vertex” type.
  • Another class “vertex” is associated with the table “VERTEX” in the relational database.
  • the class “vertex” is defined with a “name” attribute of text string type, a “x coordinate” attribute of number type, and a “y coordinate” of number type.
  • the object schema definition module 101 thus stores the relationship that each element of the “VERTICES” column in the “POLYGON” table is the name of a row in the “VERTEX” table.
  • a group of “VERTEX” tuples and a “POLYGON” tuple can thus be treated together as a hierarchically structured “polygon” object class.
  • the programmer or user can then define verb phrases and adjective phrases related to the “polygon” object class. If there are no structural relationships among the tables, each table is then treated as an independent object class, with each relational tuple as a “flat” object without hierarchical structures. In any event, a user can still define verb phrases and adjective phrases in structured natural language.
  • the object schema definitions can be simply imported from the underlying database.
  • the query and knowledge system 100 can also include a structured natural language rule composition module 131 , a rule translation module 141 , and a formal rule processing module 151 .
  • the modules 131 , 141 and 151 are described below in connection with rules.
  • each of the modules 101 - 151 is preferably implemented with software executed by one or more general-purpose computers. Some or all of the modules could alternatively be implemented in whole or in part within application-specific hardware. As will be recognized, the modules need not run or reside on the same computer. For example, modules 110 - 141 could be integrated into a client-side component that runs on a user computer, while the formal processing modules 150 and 151 could run on a remote server that provides network-based access to a database. Further, the translation modules 140 and 141 could be implemented as a server-side component that translates natural language queries and rules received over a network from users. The modules of the system 100 can be implemented in the same or different computing languages, operating platforms or application packages.
  • FIG. 2 illustrates one embodiment of a computer-implemented process to define a verb phrase.
  • the system 100 prompts a programmer to define a command name.
  • a command name is a text string such as “create a group”, “combine two groups”, “find similar genes from two groups”, “show elements of a group”, “store”, and so forth.
  • the command is preferably defined with the appearance of a natural language phrase and thus is easy for users without programming skills to understand.
  • the programmer is prompted to define the number of arguments for the command. For example, a “create a group” command typically has 1 argument and a “find similar genes from two groups” command typically has 2 arguments.
  • the programmer may also define the selection range for each argument. For example, one argument for a patient name can be any user-entered text string, and another argument for the name of an authorized insurer must be a value from a pre-determined list of values.
  • the programmer is prompted to define the data type of each of the arguments of the command.
  • a data type can be a primitive data type such as integer, floating number, date or text string, or another data type defined by a user or a programmer.
  • a new data type can be defined as a set of objects of a given data type. For example, given the defined data type “vertex”, a new data type “Vertex Set” can be defined such that each object of the “Vertex Set” data type is a set of vertices. From block 230 , the process proceeds to block 240 , where the programmer is prompted to create a formal query text to correspond to the defined verb phrase.
  • formal query text is used broadly in the present application to include text in the form of a query language, the name of a collection of computer code such as a method, function, procedure or subroutine, or the name of a macro.
  • a method, function, procedure or subroutine can be “Var0.similar(Var1)”, with “Var0” and “Var1” representing the two parameters of the condition respectively.
  • a method, function, procedure or subroutine can be “find_similar_genes(argument1, argument2),” with “argument 1” and “argument2” representing the two arguments of the command respectively.
  • the collection of computer code is stored in a system library (not shown).
  • a macro is a logical unit for a group of queries, a group of rules, or a group of one or more queries and one or more rules.
  • the programmer is first prompted to create a formal query text for a verb phrase to be defined.
  • the system 100 parses the formal query text, and determines the number of arguments for the verb phrase and the data type of each of the arguments.
  • the programmer is then prompted to define a command name for the verb phrase.
  • FIG. 3 illustrates one embodiment of a computer-implemented process of defining a condition.
  • the system 100 prompts a programmer to define a condition name.
  • Example condition names include “who are diagnosed with disease” and “who respond to medication”.
  • the process then proceeds to block 320 , where the programmer is prompted to define the number of parameters for the condition.
  • the programmer is prompted to define the data type of each of the parameters, such as text, integer and so forth.
  • the programmer can also be prompted to specify if a parameter is an input parameter or an output parameter.
  • the programmer is prompted to create a formal query text to correspond to the defined condition, for example typing texts in a query language, specifying a macro name, or specifying the name of a computer code collection such as a method name, function name, procedure name or subroutine name.
  • the computer code collection is stored in the system library.
  • “ ⁇ argument1>” represents the command argument of the related verb phrase
  • “ ⁇ parameter>” represents the condition parameter value to be entered by the user.
  • the programmer is first prompted to create a formal query text for the condition to be defined.
  • the system 100 then parses the formal query text and determines the number of parameters for the condition and the data type of each of the parameters.
  • the programmer is then prompted to define a condition name.
  • the verb phrases and adjective phrases can be defined to correspond to complex queries.
  • the condition “who are diagnosed with all diseases” can be defined to include a set parameter that allows a list of diseases. A query with this condition finds patients who are diagnosed with all of the listed diseases.
  • Another condition “who are diagnosed with at least one of the diseases” can be defined to include a set parameter that allows a list of diseases. A query with this condition finds patients who are diagnosed with at least one of the listed diseases.
  • only the condition “who are diagnosed with disease” is defined and includes a parameter that allows only one entry of a disease name. If a user wants to find patients who have both diseases A and B, the user combines two such adjective phrases “who are diagnosed with disease A” and “who are diagnosed with disease B”.
  • verb phrases and adjective phrases can be defined and used interchangeably to some extent.
  • the more specific verb phrase “find patients who are diagnosed with disease” serves the same purpose as the combination of the more general verb phrase “find patients” and the adjective phrase “who are diagnosed with disease”. If a query is frequently invoked by users, it may be preferable to define the query as a verb phrase that does not require additional adjective phrases. It is also feasible to define both specific and general verb phrases, and allow users to use either the specific verb phrase or the general verb phrase combined with adjective phrases.
  • a structured natural language query can be created in the form of a question, such as “which patients are diagnosed with disease ‘mild AD’ and who respond to medication ‘aericept’?”. It is equivalent to the query “display patents who are diagnosed with disease ‘mild AD’ and who respond to medication ‘aericept’”. In the query in the form of a question, the portion “which patients” or “which patients are” serves as the verb phrase.
  • a structured natural language query may include a variable definition phrase.
  • a variable definition phrase For example, as described below in subsection B, if the improved object relational query language of subsection B is used in conjunction with the system 100 , an example variable definition phrase can be “range of ⁇ Variable> is polygon: abc”. It means that the values of ⁇ Variable> are obtained from a collection of objects named “abc”, whose elements are bound to the type “polygon”.
  • a variable definition phrase can include one or more variable definitions.
  • a user who is not a programmer may also define a verb phrase and a condition
  • the term “programmer” is used in connection with FIG. 2 and FIG. 3 because some programming skills may be need to create a formal query text that correspond to a defined verb phrase or condition.
  • FIG. 4 shows one embodiment of a computer-implemented process of composing a structured natural language query.
  • the query is composed with the improved object relational query language described in subsection B.
  • the process determines whether variables will be defined by a user. If variables are to be defined, the process proceeds to a block 420 , where the process prompts a user to specify a variable definition phrase.
  • variable definition phrases, command and conditions need not be specified in any particular order. In other words, variable definition phrases need not be specified before the command is specified, and the command need not be specified before conditions are specified.
  • FIG. 5 illustrates example screens that prompt a user or a programmer to specify a variable definition phrase.
  • a user or programmer is prompted to create an object name in the “Object Name” section 510 .
  • the user or programmer is also prompted in the “Type” section 520 to identify the variable type of the created object by selecting from a list of types such as “range”, “temp”, “set” and “bag”.
  • the user or programmer is further prompted to identify the data type of the created object by selecting an object from the “Object Lists” section 530 to inherit the data type of the selected object, or by selecting from the “Primitive Lists” section 540 of primitive data types.
  • the user or programmer is prompted to specify in the “Data Source Element” section 550 the data source element for the created object.
  • a data source element is a set of objects of the same type.
  • FIGS. 6A and 8A are presented as example screens that do not display variable definitions, while FIGS. 6B and 8B are presented as example screens that display variable definitions.
  • Variable definition phrases can also be automatically generated based on the selected commands and conditions.
  • the condition “contains” requires two polygons as inputs and returns a true value if the first polygon contains the second.
  • the system 100 stores the specification that the “contains” condition requires two input parameters of polygon class.
  • the condition “contains” is selected by the user, the system 100 automatically generates a variable definition phrase to define the two input parameters as range variables of polygon class.
  • the condition “intersection” requires two input parameters of polygon class and an output parameter as the intersection polygon of the two input polygons. The “intersection” condition returns a “true” logical value.
  • the system 100 stores the specification that the “intersection” condition requires two input parameters of polygon class and an output parameter of polygon class.
  • the system 100 automatically generates a variable definition phrase to define the two input parameters and the output parameter.
  • the two input parameters are preferably defined as range variables and the output parameter is preferably defined as a temporary variable. If the output parameter is used as an input parameter of another condition, then the input parameter of the other condition do not need to be defined again.
  • the user is prompted to select a command from a list of defined commands.
  • the user is prompted to select from a list of defined commands that are applicable to the objects defined in the variable definition phrase. For example, if the variable definition phrase defines a variable “polygon”, then the user can select from those commands that are applicable to the “polygon” variable, such as “enlarge”, “reduce”, “rotate”, “measure area”, and so forth.
  • FIG. 6A and FIG. 6B are example user interface screens that prompt the user to select a command from a list 610 of commands. Compared to FIG. 6A , FIG. 6B includes an additional section 620 that displays the defined variables. The defined variables are also displayed in the sentence section 630 .
  • the process proceeds to a block 450 , where the user is prompted to select one or more arguments for the selected command.
  • a block 460 in one embodiment that preferably uses the object relational algebra of subsection C, the user is prompted to optionally specify a result name of the selected command.
  • the command, arguments and optional result name form a verb phrase.
  • FIG. 7 is an example screen that prompts the user to select command arguments and optionally name a result. Depending on the number and type of defined arguments for the command, the user may be prompted to identify one or more arguments. As shown in FIG.
  • the user is prompted to select a data type from section 710 , select a data source element from section 720 , and optionally name a result of the command in section 730 .
  • FIG. 8 A is an example screen that prompts the user to select from a list of conditions.
  • a defined verb phrase is displayed in section 810 and as part of a structured natural language sentence in sentence section 830 .
  • FIG. 8B is another example screen that prompts the user to select from a list of conditions.
  • FIG. 8B includes an additional section 840 that displays the defined variables.
  • the defined variable definition phrase and verb phrase are displayed as parts of a sentence in the sentence section 830 .
  • FIG. 9 is an example screen that prompts the user to specify parameters for the selected condition.
  • the user can type in a disease name in section 910 , or select from a list of disease names in section 920 .
  • the selected condition and parameters form an adjective phrase.
  • the composition module 130 automatically puts quotation marks around the specified values of command arguments, named result and condition parameters. If the condition has one or more output parameters, the user can also specify an additional condition by defining the output parameters of the original condition as the input parameters of the additional condition. The additional condition thus evaluates a condition based on the output produced by the original condition.
  • the user is prompted to indicate whether more conditions will be entered. If more conditions will be entered, then the process returns to the block 480 . Otherwise the process proceeds to an end block 498 . If the process returns to the block 480 to specify another adjective phrase, the user is also prompted to specify a logical connector such as “and”, “or” or “and not” to join the two adjective phrases.
  • FIGS. 5-9 are example screens of one embodiment that prompts a user to select from lists of defined commands and conditions.
  • the user selects a command in FIG. 6A or FIG. 6B , selects arguments for the command in FIG. 7 , selects a condition in FIG. 8A or FIG. 8B , and specifies parameters for the condition in FIG. 9 .
  • the user or a programmer also defines variables in FIG. 5 .
  • FIG. 10 is an example screen that shows a composed structured natural language query sentence in section 1040 including a verb phrase in section 1020 and adjective phrases in section 1030 .
  • the query appears in a preferred form of an imperative sentence.
  • the query may also include a variable definition phrase, such as the phrase “range of range1 is Case:Case” shown in sections 1010 and 1040 of FIG. 10 .
  • the user can also be given the option to directly type a structured natural language query into the system 100 , as shown in the sentence section 1110 of FIG. 11 . Experienced users may prefer this option. Users can also use a combination of direct typing and screen selection, for example typing the conditions or commands they are familiar with and selecting other conditions or commands from the menus.
  • the composed query is then sent to the translation module 140 for translation.
  • the translation module 140 parses the composed query to identify the verb phrase, the adjective phrases and the optional variable definition phrase. In a preferred embodiment, the translation module 140 looks for keywords that indicate defined verb phrases or defined adjective phrases. For example, in one embodiment where all conditions start with the word “who” or “whose”, the translation module 140 searches for words “who” and “whose” and identifies any such words as signaling the start of an adjective phrase. However, if the word “who” or “whose” is enclosed in quotation marks in the composed query, then it is considered to represent an argument, named result or parameter value instead of the start of a condition. The translation module 140 also searches for logical connectors “and”, “or”, “or not” and “and not”. A logical connector indicates the end of one adjective phrase and the start of another adjective phrase. By searching for condition labels and logical connectors, the module 140 can identify an adjective phrase as separated from the rest of the query and separated from the other adjective phrases.
  • the translation module 140 also searches for keywords of the defined conditions to identify the conditions. For example, the translation module 140 searches for keywords such as “diagnosed” or “respond” as identifying the condition “who are diagnosed with the disease” or “who respond to medication”.
  • the translation module 140 also searches for keywords of the defined commands to identify the command. For example, the module 140 searches for keywords such as “create a group” or “add elements to a group” to identify the corresponding commands. The module 140 can also search for keyword “from” to signal a command argument and keyword “whose name is” to signal a named result.
  • the parsing process can be simplified. Because the system 100 associates each of the defined commands and conditions with its corresponding formal query text, the corresponding formal query text can be stored at the same time the user selects a command or condition. The stored corresponding formal query texts (or their identifiers) are combined by the translation module 140 to form a translated formal query text. On the other hand, if the user enters a structured natural language query by typing instead of selecting from lists, then the parsing process first identifies the verb phrase and adjective phrases, and then combines their corresponding formal query texts.
  • the translation module 140 detects errors in a query and automatically corrects errors or suggests corrections to errors.
  • Query errors may include misspellings (for example “diagnosed” misspelled as “diagnoised”), missing words (for example “who respond to medication” entered as “who respond to”), out-of-order structures (for example adjective phrases appearing before the verb phrase), or other grammar errors.
  • the translation module 140 corrects these errors and transforms the query to a correct query.
  • the translation module 140 returns the detected errors and a suggested corrected query to the user, and asks the user to confirm that the corrected query is what the user intended to compose.
  • the translation module 140 combines the corresponding formal query texts of the verb phrase and adjective phrases of the structured natural language query to form a translated formal query text.
  • the user-specified command argument values and condition parameter values are incorporated into the translated formal query text.
  • the formal query processing module 150 receives the translated formal query text and processes the text on the underlying database to produce results.
  • the underlying database 160 can be relational, object-oriented or object relational.
  • a relational underlying database if the translated query text is already in the form of one or more SQL queries, then the formal query processing module 150 directly executes the SQL queries on the underlying relational database. If the translated query text is not in SQL query form, then the formal query processing module 150 converts the translated query text into one or more converted SQL queries, and executes the converted queries on the underlying relational database. Likewise, for an object or object relational underlying database, the query processing module 150 directly executes the translated query text, or converts the translated query text into one or more converted queries that can be directly executed on the underlying database.
  • query processing module 150 If the query processing module 150 cannot directly execute the translated query text and cannot convert the translated query context, query processing module 150 still processes the translated query text to return query results from the underlying database 160 .
  • a first variable “t” is defined as a range variable from a first data source element of five polygon objects.
  • a second variable “s” is defined as a range variable from a second data source element of three polygon objects.
  • the translated query text “t.intersect(s)” cannot be directly executed on the relational underlying database.
  • the query processing module 150 enumerates the fifteen combinations of the polygon objects of the two data source elements to determine whether the qualification is true.
  • optimization methods for example rearranging the order of the conditions in a qualification, can also be employed.
  • the structured natural language system 100 works in conjunction with the improved object relational query language described in subsection B.
  • a user creates a variable definition phrase “range of range1 is case:case” in FIG. 10 , to define range1 as a range variable in accordance with the improved query language described in subsection B.
  • the first “case” in “case:case” represents a data type and the second “case” represents a data source element.
  • the system 100 prompts the user to define a variable by selecting from a list of data types, a list of data source elements, and as a range, temporary, set or bag variable.
  • the structured natural language system 100 can also work with conventional relational, object-oriented, and object relational databases.
  • the improved object relational query language or the object relational algebra would expand the scope of permitted queries, but is not a prerequisite for the structured natural language system 100 .
  • Rules can also be composed in structured natural language form and used in the system 100 for data integrity and other purposes. For example, a rule can be presented in the form “On ⁇ event> If ⁇ qualification> then ⁇ command>”, “On ⁇ event> If ⁇ qualification> then ⁇ command> otherwise ⁇ command2>”, or “On ⁇ event> If ⁇ qualification> then ⁇ qualifcation2>”. Using a computer system for medical use as an example, rules can be used to ensure that a patient's age entered into the system is greater than zero, that penicillin is not administered if the patient is known to be allergic to it, that two types of medications of negative interaction are not administered to the same patient, and so forth.
  • a rule's scope can be defined to apply to a single object, to apply to all objects of a class, or to all objects.
  • the structured natural language system 100 allows a user to define a rule by specifying a qualification and a command of the rule.
  • a qualification for example, patient's body temperature exceeds 102 degrees Fahrenheit
  • a command for example, send a warning to a nurse or physician
  • events and qualifications can be used interchangeably.
  • Structured natural language queries and structured natural language rules are typically used in different contexts.
  • the system 100 monitors the qualification and the optional event of a rule at all times, and executes the rule's command when the qualification and optional event are met.
  • a query is typically executed based on a user instruction.
  • a rule is typically displayed in the form of “if qualification . . . then command [else command2]” or “if qualification1 then qualification2” form, while a query is typically displayed in the “command [qualification]” form.
  • a structured natural language rule and a structured natural language query typically each includes a command and a qualification, it is thus feasible to use a rule and a query interchangeably.
  • a user can select a defined query as a rule, or select a defined rule as a query.
  • FIG. 12 shows an example screen displaying a rule in the form of a structured natural language sentence.
  • a rule includes an optional variable definition phrase, a verb phrase (command) and one or more adjective phrases (qualification).
  • a rule may also include one or more event phrases.
  • a rule can have a second verb phrase corresponding to the action when the qualification is false.
  • section 1220 displays the qualification “any of Patient.CurrentVisit treatments interact”
  • section 1230 displays the command “display message string ‘treatment check: interact’”.
  • Section 1210 displays the formal query text that corresponds to the rule.
  • the qualification and the command can be specified by a user, for example using the screens of FIGS. 6A-9 .
  • a set of rules can be grouped into a logical unit named a “macro.”
  • a user or a programmer can specify the relationship among the rules within the macro, and to use a “goto” statement followed by a label to invoke another rule.
  • the label identifies the other rule to be invoked.
  • a macro for approving or rejecting a loan application can include the following rules:
  • “goto(rule-house)” represents evaluating another rule identified by the label “rule-house”.
  • “Return(true)” represents returning a decision to approve the loan application. Instead of returning a “true” or “false” logical value, a rule can also return an object or data source element (DSE) as output.
  • DSE data source element
  • rules can be far more complex than the example above.
  • the macro and the “goto” statement allow a user or a programmer to specify relationships among the rules. The user or programmer is prompted to assign a macro name to the macro.
  • a rule or a macro can be used as a condition or command in a query or in another rule.
  • a plurality of queries, or a combination of one or more rules and one or more queries, can also be grouped into a macro. Queries can be identified by labels, and “goto” statements followed by labels can be used to invoke other queries or rules.
  • a macro can be associated with an object or a class of objects.
  • a user can compose rules in structured natural language form. Similar to a query sentence, a rule sentence has one or more adjective phrases to specify the qualification of the rule, and a verb phrase to specify the action to be taken when the qualification is true. A rule sentence can have a second verb phrase corresponding to the action to be taken when the qualification is false. A rule sentence can also have one or more optional event phrases, each corresponding to an event.
  • the rule composition module 131 prompts a user to compose a rule by selecting one or more adjective phrases as qualification, one or two verb phrases as commands, and optional adjective phrases as events.
  • the user can also specify a macro of rules and/or queries and specify the relationships among the rules and/or queries within the macro.
  • the rule translation module 141 translates a rule into a formal query text.
  • the rule translation module 141 can also directly translate a rule into a function, procedure or subroutine in a host programming language of the system 100 , such as Java, C, C++, and so forth.
  • the rule processing module 151 evaluates the qualification and subsequently executes the command corresponding to the qualification. In addition, the module 151 monitors the occurrence of the optional events of the rules.
  • a query or a rule under the improved object relational query language can be expressed in the following form:
  • variable specifier command [where qualification]
  • a qualification includes one or more conditions (each condition represented by a method or macro) and the conditions' parameters.
  • a command includes a method or macro and the command's arguments.
  • An argument can be a constant or a variable.
  • a method that returns a Boolean true or false value is a logical method.
  • a macro that returns a Boolean true or false value is a logical macro. Otherwise a method/macro is called a general method or general macro.
  • a method or macro can have one or more input parameters or output parameters.
  • a qualification consists of one method or macro with its parameters, and the qualification can be built recursively as follows:
  • variable specifier can be declared in one of the following forms:
  • Range of ⁇ variable-id> is ⁇ data-type>: ⁇ DSE>
  • Temp of ⁇ variable-id> is ⁇ data-type>
  • Set of ⁇ variable-id> is ⁇ data-type>: ⁇ DSE>
  • Bag of ⁇ variable-id> is ⁇ data-type>: ⁇ DSE>
  • Bag Temp of ⁇ variable-id> is ⁇ data-type>
  • Data-type in the forms above represents the format of the data values, such as integer, floating point, logical, text, date, and user-defined data formats.
  • DSE in the forms above represents “data source element.”
  • a data source element can be a user-defined set of objects, a table, a class, a spreadsheet file that includes a set of data rows, an attribute of an object whose value is a set, a method that returns a set of objects, a set variable as shown in form (c) above, a bag variable as shown in form (d) above, the result of a previous query, an object relational expression as described below in more detail in subsection C, and the like.
  • a DSE can be a set of structured data such as a tree, a directed asynchronous graph (DAG) or a semi-structured data set such as an extended markup language (XML) document.
  • a data type can be bound to a DSE in a variety of ways.
  • a subset of a DSE can be formed as another DSE that binds to a data type. For example, assume a tree DSE “T” with two subsets (nodes) called “Person” and “Organization”.
  • a binding Person:T may define the collection of Person nodes of “T” as a new DSE.
  • a data type can also be bound to an XML document by the use of tags.
  • binding a class A (a:integer, b:string) to the XML document below results in a DSE with an object A1 of type A whose value is (10, “America).
  • the tags ⁇ A1> and ⁇ /A1> may be ignored because the object A1 can be inferred from the attributes a and b.
  • ⁇ ROOT> ⁇ A> ⁇ A1> ⁇ a> 10 ⁇ /a> ⁇ b> America ⁇ /b> ⁇ /A1> ⁇ /A> ⁇ /ROOT>
  • a variable defined in form (a) above is a range variable.
  • a variable defined in form (b) is a temporary variable.
  • a variable defined in form (c) is a set variable.
  • a variable defined in form (d) is a bag variable.
  • a variable defined in form (e) is a temporary set variable.
  • a variable defined in form (f) above is a temporary bag variable.
  • a range variable obtains its possible values from its associated DSE.
  • a set of bag variable's value is a set identified by the associated DSE. Unlike a set that contains only distinct values such as ⁇ 1, 2, 3, 4 ⁇ , a bag can contain duplicate values such as ⁇ 1, 2, 3, 4, 1, 3 ⁇ .
  • Set variables and bag variables can be used to build the domains of range variables or temporary variables. Set and bag variables can also be used to aggregate objects in order to form an argument of a method that requires a set or a bag as an argument.
  • a temporary variable is typically used as an output parameter of a general method. The value of a temporary variable is computed at the time the corresponding method or macro for the query is evaluated.
  • a temporary set or bag variable's value is a set identified by the associated DSE and computed at the time the corresponding method or macro for the query is evaluated.
  • variable-id> datatype ⁇ data-type>
  • variabletype ⁇ variable-type>
  • dse ⁇ DSE> with ⁇ variable-type> indicating a variable type of range, temporary, set, bag, temporary set or temporary bag variable.
  • an example database is defined with two classes, “vertex” and “polygon”, with a polygon object defined by a set of vertices and a vertex object defined by two coordinates.
  • Class vertex (name: string, x: integer, y: integer) key: name
  • Class polygon name: string, vertices: set of vertex
  • intersect Associated with the class polygon, two conditions, “intersect” and “contain,” are defined.
  • the “intersect” condition takes two polygons as inputs and returns a true value if the pair of input polygons intersect with each other.
  • the “contain” condition takes two polygons as inputs and returns a true value if the first polygon contains the second.
  • Range of t is polygon: abc (query B1)
  • Range of t is polygon:abc (query B2)
  • Range of s is polygon:abc Retrieve (t.name, s.name) Where t.contains(s)
  • the following query shows all polygons from the data source element abc that are contained in polygon C, do not intersect with polygon E, and whose sizes are greater than 5:
  • Range of t is polygon:abc (query B3) Range of s is polygon:abc Range of r is polygon:abc Var u is float s.show( ) Where t.name.eq(“C”) and t.contains(s) and r.name.eq(“E”) and (not r.intersect(s)) and s.size(u) and u.gt(5)
  • the following query shows all polygons from the data source element abc that intersect with polygon E, in which the intersection is a square whose size is greater than 5, and whose vertices contain a square whose size is greater than 10:
  • Range of t is polygon:abc (query B4)
  • Range of s is polygon:abc Set u is vertex Set w is vertex:s.vertices
  • Var v is float
  • Var x is float s.show( ) Where t.name.eq(“E”) and s.intersection(t, u, v) and is-square(u) and v.gt(5) and .is-square(w) and w.size(x) and x.gt(10)
  • intersection is a general method associated with a polygon. It takes another polygon as the input and returns the intersection (a set of vertices) and the area of the intersection as the output.
  • the qualification “t.name.eq(“E”) and s.intersection(t, u, v) and is-square(u) and v.gt(5) and .is-square(w) and w.size(x) and x.gt(10)” includes several conditions joined by the logical connector “and”.
  • the condition “is-square” is applied to a set of vertices to determine if the collection of vertices form a square.
  • the improved object relational language is similar to ANSI SQL-99 in some aspects.
  • query B3 can be rewritten in SQL-99 form:
  • the improved object relational language has significant advantages over ANSI SQL-99.
  • query B4 cannot be rewritten in SQL-99.
  • ANSI SQL-99 allows only “range” type variables and does not allow set, bag, temporary, temporary set or temporary bag variables.
  • the scope of a variable is determined by its “range” type and cannot be an arbitrary data source element.
  • a condition under SQL-99 cannot produce any value other than a logical true or false value, so a general method cannot be used as a condition or argument.
  • the improved object relational language removes these limitations.
  • a rule in the improved object relational query language can be expressed in the form of:
  • a rule expressed in the second form “[On event] [if qualification1 then] qualification2” is equivalent to a rule in the first form “[On event] [if qualification then] command [else command2]”.
  • the second form may be more user-friendly in some situations. For example, the following rule in the second form:
  • the patient cannot have prescription y is equivalent to a rule in the first form: If the patient has prescription x and the patient has prescription y then report(“constraint violation”)
  • a rule can also include a label that identifies the rule, one or more input parameters and one or more output parameters.
  • An input parameter is a parameter whose value is used in the qualification of the rule.
  • An output parameter is a parameter whose value is returned as computed by a command or condition of the rule.
  • a macro in the improved object relational query language can be expressed in the form of:
  • Label1 represents the label that identifies a rule or a query “rule/query1”
  • label2 represents the label that identifies a rule or a query “rule/query2”.
  • An event may be the addition of a new object, the deletion or modification of an object, or a user-defined event.
  • a rule or a query identified by a label can be invoked by another rule or query with a “goto(label)” statement.
  • queries C1 to C4 can be combined into one query as:
  • the object relational algebra enables the manipulation of complex objects.
  • a data source element is simply a relation such as a flat file or a set of records whose fields are primitive values.
  • a condition of the “select” operator is restricted to simple comparisons of primitive values.
  • a method is not allowed to be an expression under relational algebra.
  • aggregation functions under conventional relational algebra are restricted to functions such as minimum, maximum, mean, variance, average, sum and count, which can be applied only to primitive values. The object relational algebra removes these limitations and provides significant advantages over conventional relational algebra.
  • modules of the structured natural language system 100 can be combined or separated into more or fewer modules.
  • Some of the actions illustrated in the flowcharts can be executed in parallel, in sequence or in different orders. Accordingly, the present invention is not to be limited by the description of the preferred embodiments, but is to be defined by reference to the appended claims.

Abstract

A structured natural language query and knowledge system is provided to allow a user who lacks programming skills to enter a database query or a rule in the form of a structured natural language sentence. The scope of the sentence is preferably defined by an improved object relational query language, an object relational algebra, or both. Command and conditions that appear in natural language form are defined with corresponding formal query texts. A user is prompted to compose a structured natural language sentence using the defined commands and conditions. The user-selected command and its arguments appear as the verb phrase of a structured natural language sentence. The user-selected conditions and their parameters appear as the adjective phrases of the sentence. The sentence is parsed and changed into a translated formal query text for formal database query and rule processing.

Description

    CLAIM OF PRIORITY
  • This application is a continuation application of U.S. patent application Ser. No. 11/846,428, filed Aug. 28, 2007, which issued on Nov. 17, 2009 as U.S. Pat. No. 7,620,542, which is a continuation application of U.S. patent application Ser. No. 10/286,506, filed Oct. 31, 2002, which issued on Aug. 28, 2007 as U.S. Pat. No. 7,263,517, each of which is incorporated by reference herein in its entirety.
  • BACKGROUND OF THE INVENTION
  • 1. Field of the Invention
  • The invention relates to the field of relational, object-oriented and object relational databases. More particularly, the invention relates to a structured natural language database query and knowledge system.
  • 2. Description of the Related Art
  • Relational databases have been used widely for many years. A relational database organizes data into tables that include fields. Two tables that include a same field are related to each other. Compared to the “flat file” approach that stores all data into a single file, the relational approach of tables is more flexible.
  • Most relational database systems conform to the Structured Query Language (SQL) standard. Commercial vendors produce SQL based database systems such as Oracle, Sybase, Informix, Progress and Microsoft Access. These systems use a SQL-type formal query language. The following are examples of a formal query used to display the names and salaries of those employees who make more than $100,000:
  • For each employee where employee.salary > 100000:
      Display employee.name employee.salary.
    End
    Or:
    Select name, salary
    From employee
    Where employee.salary > 100000
  • In the above queries, “employee” is the table name, “name” and “salary” are fields of the “employee” table, “where employee.salary>100000” is a condition and also a qualification, and “Display employee.name employee.salary” and “Select name, salary” are commands. A qualification is a condition or a plurality of conditions connected by logical connectors such as “and”, “or”, “and not” or “or not”.
  • Object-oriented databases organize data into objects. An object can have attributes, which can also be objects. The recursive nature of an object permits ease of manipulation. Objects can inherit characteristics from other objects, making it easier to create new objects based on existing ones. An object can be associated with a set of procedures (methods) to manipulate its data.
  • Attempts have been made to combine relational databases and object-oriented databases to create object relational databases. For example, American National Standards Institute (ANSI) SQL-99 extends the conventional SQL query language to allow tables and fields to be manipulated as objects. A typical ANSI SQL-99 query is expressed as:
  • Select arguments
  • From type var, . . . type var
  • Where condition and condition . . . and condition
  • A condition is a method with the method's arguments applied to a variable. An argument of the “select” command can be a variable, an attribute of a variable or a method with its arguments applied to a variable. With the addition of methods, the scope of SQL-99 becomes wider than the scope of conventional SQL.
  • Unfortunately, ANSI SQL-99 still has significant limitations. As described below in more detail, ANSI SQL-99 allows only “range” type variables and does not allow “set,” “bag,” or temporary variables. The scope of a variable is determined by its type (which is usually a relational table or a class of objects) and cannot be an arbitrary set of objects. Moreover, a condition under SQL-99 does not allow the values produced by a method, whether produced as its returned value or as its parameters, to be used in other conditions. In addition, a method that does not return a logical “true” or “false” value is not allowed to be a condition. Therefore, it would be desirable to introduce a more general and more powerful object relational query language.
  • Conventional relational algebra has significant limitations too. Under relational algebra, an expression is simply a relation such as a flat file or a set of records whose fields are primitive values such as integers, floating point numbers and text strings. Moreover, a condition of the “select” operator is restricted to:
      • 1. A comparison between two primitive values such as equal to, greater than, less than, greater than or equal to, less than or equal to, not equal to;
      • 2. A comparison between a primitive value and a set of primitive values produced by a sub-query such as “is a member of,” “greater than all members of,” “greater than any member of,” and so forth;
      • 3. A comparison between two sets of primitive values produced by a sub-query such as “is a superset of” or “is a subset of”; and
      • 4. A test on a set of primitive values such as “is an empty set.”
  • In addition, a method typically is not allowed to be an expression under relational algebra. Therefore, it would be desirable to introduce a more general and more powerful object relational algebra.
  • Conventional databases use data integrity constraints and event triggers to enforce rules on the databases. However, the scope of these rules is limited by the scope of the query language used by the conventional databases. What is desired is a knowledge system that permits more powerful and more flexible ways of specifying rules.
  • Finally, in a conventional relational or object relational database, a user who lacks programming skills typically cannot compose complex database queries, and must rely on programs written by programmers to search and display data. Therefore the user's options are frequently very limited. It would be desirable to allow such a user to write natural language type instructions to operate on a relational, object-oriented or object relational database in real time.
  • SUMMARY OF THE INVENTION
  • For purposes of summarizing the invention, certain aspects, advantages and novel features of the invention have been described herein. It should be understood that not necessarily all such aspects, advantages or features will be embodied in any particular embodiment of the invention.
  • A structured natural language query and knowledge system is provided to allow a user who lacks programming skills to specify a database query in the form of a structured natural language sentence. The user can also specify a rule in the form of a structured natural language sentence. An improved object relational query language and an object relational algebra are also introduced. In a preferred embodiment, the structured natural language is defined by the object relational query language and the object relational algebra.
  • One aspect of the invention relates to a computer-implemented method of composing a structured natural language database query. A user is prompted to select a command from a set of defined commands and to specify one or more arguments for the command. The selected command and arguments are combined to form a verb phrase. The user is also prompted to select zero, one or more conditions from a set of defined conditions and to specify zero, one or more parameters for each selected condition. Each selected condition and its specified parameters are combined into an adjective phrase. The verb phrase and the selected adjective phrases are combined into a structured natural language database query.
  • The structured natural language query is automatically translated into formal query text to be executed by a formal query processing module. The structured natural language query is parsed to identify a verb phrase and zero, one or more adjective phrases. After finding a defined command query text that corresponds to the parsed verb phrase and finding a defined qualification query text that corresponds to each of the parsed adjective phrases, the found query texts are combined into a translated formal query text to be processed by the query processing module.
  • Another aspect of the invention is related to a computer-implemented method of creating an object relational query for a database. A first variable can be defined as a range variable and a second variable can be defined as a set variable. A command and a qualification are also defined. The defined command, qualification and variables are combined to form an object relational query. Variables can also be defined as temporary variables or bag variables.
  • Still another aspect of the invention relates to a computer-implemented method of creating an object relational query for a database based on algebraic expressions. A query is built upon one or more expressions. An expression is a set of objects. Thus, a general method that returns as output a set of objects is an expression. A select operator that selects from a set of objects a subset based on a qualification is also an expression. Because an argument of a general method can be a set of objects, general methods can be nested to produce another expression.
  • Yet another aspect of the invention relates to a computer-implemented method of processing queries expressed in the improved object relational query language or the object relational algebra for a relational, object-relational or object-oriented database system.
  • Another aspect of the invention relates to a computer-implemented method of grouping structured natural language queries, structured natural language rules or both queries and rules into a macro that in turn can be used as a command or condition for other queries or rules.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The drawings and the associated descriptions are provided to illustrate embodiments of the invention and not to limit the scope of the invention.
  • FIG. 1 illustrates one embodiment of a structured natural language system.
  • FIG. 2 illustrates one embodiment of a process of defining a command.
  • FIG. 3 illustrates one embodiment of a process of defining a condition.
  • FIG. 4 illustrates one embodiment of a process of composing a structured natural language query.
  • FIG. 5 illustrates example user interface screens that prompts a user or a programmer to define an object variable.
  • FIG. 6A is an example user interface screen that prompts a user to select a command.
  • FIG. 6B is another example user interface screen that prompts a user to select a command.
  • FIG. 7 is an example screen that prompts the user to select command arguments and optionally name a result.
  • FIG. 8A is an example screen that prompts the user to select a condition.
  • FIG. 8B is another example screen that prompts the user to select a condition.
  • FIG. 9 is an example screen that prompts the user to specify parameters for the selected condition.
  • FIG. 10 is an example screen that displays a composed query.
  • FIG. 11 is an example screen that prompts the user to type a structured natural language sentence.
  • FIG. 12 is an example screen that displays a rule in structured natural language.
  • DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT
  • The following subsections describe a structured natural language query and knowledge system, an improved object relational query language, and an object relational algebra that embody various inventive features. As will be recognized, many of these features can be implemented within a given system without others. For example, the structured natural language query system can be implemented in conjunction with conventional database platforms, and need not be implemented with the improved object relational query language or with the object relational algebra. In addition, the various inventive features can be implemented differently than described herein. Thus, the following description is intended only to illustrate, and not limit, the scope of the present invention.
  • A. Structured Natural Language Query and Knowledge System
  • FIG. 1 illustrates one embodiment of a structured natural language query and knowledge system 100. The structured natural language system 100 allows a user without programming skills to create a structured natural language query to operate on a database, with the database operation preferably done in real time. The system 100 can also allow a user to convert a relational database into an object-relational database. A structured natural language query may include a verb phrase as a command and zero, one or more adjective phrases as a qualification. A variable definition phrase may also be included in the structured natural language query. The term “query” is broadly defined herein to include not only commands that select or display data, but also commands that create, update or delete data. One embodiment of the system 100 is programmed in Java. The system 100 can also be programmed in C, C++ or other languages, operating platforms or application packages.
  • As shown in FIG. 1, the system 100 includes a verb phrase definition module 110 and an adjective phrase definition module 120. The verb phrase definition module 110 allows a programmer to define verb phrases to correspond to formal query texts. The adjective phrase definition module 120 allows a programmer to define adjective phrases to correspond to formal query texts. Formal query texts can be methods, functions, subroutines, procedures, SQL query texts, or combinations of the above. The system 100 also includes a structured natural language composition module 130 that allows a user to compose a structured natural language query using the defined verb phrases and adjective phrases. The system 100 further includes a translation module 140 to translate the structured natural language query into formal query text, and a formal query processing module 150 that processes the translated formal query text to return query results from the underlying database 160, which can be a commercial database system.
  • The system 100 also includes an optional object schema definition module 101. The module 101 allows a user or a programmer to define an object class, including the number of attributes and the type of each attribute for an object class. If the underlying database 160 for the system 100 is a relational database, the object schema definition module 101 allows a user or programmer to associate an object class with a table in the relational database, thus allowing the user or programmer to define the relationships among the tables of the relational database in terms of objects.
  • For example, a programmer or a user can define a class “polygon” to be associated with the table “POLYGON” in a relational database. The table “POLYGON” includes a “VERTICES” column. The class “polygon” is defined with an “ID” attribute of text string type, and a “vertices” attribute of “set of vertex” type. Another class “vertex” is associated with the table “VERTEX” in the relational database. The class “vertex” is defined with a “name” attribute of text string type, a “x coordinate” attribute of number type, and a “y coordinate” of number type. The object schema definition module 101 thus stores the relationship that each element of the “VERTICES” column in the “POLYGON” table is the name of a row in the “VERTEX” table. A group of “VERTEX” tuples and a “POLYGON” tuple can thus be treated together as a hierarchically structured “polygon” object class. The programmer or user can then define verb phrases and adjective phrases related to the “polygon” object class. If there are no structural relationships among the tables, each table is then treated as an independent object class, with each relational tuple as a “flat” object without hierarchical structures. In any event, a user can still define verb phrases and adjective phrases in structured natural language.
  • If the underlying database for the system 100 is an object-relational or object-oriented database, then the object schema definitions can be simply imported from the underlying database.
  • Regarding structured natural language queries, an example sentence is:
      • Create a group from range1 whose name is “cc” (query A1)
      • Who are diagnosed to have disease “mild AD” and who respond to medication “aericept”.
  • In the example query above, “create a group from range1 whose name is ‘cc’” is a verb phrase of the query, “who are diagnosed to have disease ‘mild AD’” is an adjective phrase of the query, and “who respond to medication ‘aericept’” is another adjective phrase of the query. Within the verb phrase, “create a group” is a command, “from range 1” is an argument of the command, and “whose name is ‘cc’” is a result name of the command operation. Within each of the adjective phrases, “who are diagnosed to have disease” and “who respond to medication” are conditions, and “mild AD” and “aericept” are parameters of the conditions.
  • The query and knowledge system 100 can also include a structured natural language rule composition module 131, a rule translation module 141, and a formal rule processing module 151. The modules 131, 141 and 151 are described below in connection with rules.
  • In FIG. 1, each of the modules 101-151 is preferably implemented with software executed by one or more general-purpose computers. Some or all of the modules could alternatively be implemented in whole or in part within application-specific hardware. As will be recognized, the modules need not run or reside on the same computer. For example, modules 110-141 could be integrated into a client-side component that runs on a user computer, while the formal processing modules 150 and 151 could run on a remote server that provides network-based access to a database. Further, the translation modules 140 and 141 could be implemented as a server-side component that translates natural language queries and rules received over a network from users. The modules of the system 100 can be implemented in the same or different computing languages, operating platforms or application packages.
  • FIG. 2 illustrates one embodiment of a computer-implemented process to define a verb phrase. At block 210, the system 100 prompts a programmer to define a command name. A command name is a text string such as “create a group”, “combine two groups”, “find similar genes from two groups”, “show elements of a group”, “store”, and so forth. The command is preferably defined with the appearance of a natural language phrase and thus is easy for users without programming skills to understand.
  • At block 220, the programmer is prompted to define the number of arguments for the command. For example, a “create a group” command typically has 1 argument and a “find similar genes from two groups” command typically has 2 arguments. The programmer may also define the selection range for each argument. For example, one argument for a patient name can be any user-entered text string, and another argument for the name of an authorized insurer must be a value from a pre-determined list of values. At block 230, the programmer is prompted to define the data type of each of the arguments of the command. A data type can be a primitive data type such as integer, floating number, date or text string, or another data type defined by a user or a programmer. A new data type can be defined as a set of objects of a given data type. For example, given the defined data type “vertex”, a new data type “Vertex Set” can be defined such that each object of the “Vertex Set” data type is a set of vertices. From block 230, the process proceeds to block 240, where the programmer is prompted to create a formal query text to correspond to the defined verb phrase.
  • The term “formal query text” is used broadly in the present application to include text in the form of a query language, the name of a collection of computer code such as a method, function, procedure or subroutine, or the name of a macro. For example for the condition “(a gene) is similar to (another gene),” a method, function, procedure or subroutine can be “Var0.similar(Var1)”, with “Var0” and “Var1” representing the two parameters of the condition respectively. As another example, for the command “find similar genes from two groups,” a method, function, procedure or subroutine can be “find_similar_genes(argument1, argument2),” with “argument 1” and “argument2” representing the two arguments of the command respectively. The collection of computer code is stored in a system library (not shown). As described below, a macro is a logical unit for a group of queries, a group of rules, or a group of one or more queries and one or more rules.
  • The process illustrated in FIG. 2 can be implemented in other orders. For example, in one embodiment, the programmer is first prompted to create a formal query text for a verb phrase to be defined. The system 100 parses the formal query text, and determines the number of arguments for the verb phrase and the data type of each of the arguments. The programmer is then prompted to define a command name for the verb phrase.
  • FIG. 3 illustrates one embodiment of a computer-implemented process of defining a condition. At block 310, the system 100 prompts a programmer to define a condition name. Example condition names include “who are diagnosed with disease” and “who respond to medication”. The process then proceeds to block 320, where the programmer is prompted to define the number of parameters for the condition. At block 330, the programmer is prompted to define the data type of each of the parameters, such as text, integer and so forth. The programmer can also be prompted to specify if a parameter is an input parameter or an output parameter. At block 340, the programmer is prompted to create a formal query text to correspond to the defined condition, for example typing texts in a query language, specifying a macro name, or specifying the name of a computer code collection such as a method name, function name, procedure name or subroutine name. The computer code collection is stored in the system library.
  • For example, for the condition “who are diagnosed with disease”, an example formal query text can be “<argument1>.diagnosed(<parameter>)” or “<argument1>.diagnosed=<parameter>”. In this example, “<argument1>” represents the command argument of the related verb phrase, and “<parameter>” represents the condition parameter value to be entered by the user.
  • The process illustrated in FIG. 3 can be implemented in other orders. For example, in one embodiment, the programmer is first prompted to create a formal query text for the condition to be defined. The system 100 then parses the formal query text and determines the number of parameters for the condition and the data type of each of the parameters. The programmer is then prompted to define a condition name.
  • The verb phrases and adjective phrases can be defined to correspond to complex queries. For example, the condition “who are diagnosed with all diseases” can be defined to include a set parameter that allows a list of diseases. A query with this condition finds patients who are diagnosed with all of the listed diseases. Another condition “who are diagnosed with at least one of the diseases” can be defined to include a set parameter that allows a list of diseases. A query with this condition finds patients who are diagnosed with at least one of the listed diseases. In another arrangement, only the condition “who are diagnosed with disease” is defined and includes a parameter that allows only one entry of a disease name. If a user wants to find patients who have both diseases A and B, the user combines two such adjective phrases “who are diagnosed with disease A” and “who are diagnosed with disease B”.
  • It should be noted that verb phrases and adjective phrases can be defined and used interchangeably to some extent. For example, the more specific verb phrase “find patients who are diagnosed with disease” serves the same purpose as the combination of the more general verb phrase “find patients” and the adjective phrase “who are diagnosed with disease”. If a query is frequently invoked by users, it may be preferable to define the query as a verb phrase that does not require additional adjective phrases. It is also feasible to define both specific and general verb phrases, and allow users to use either the specific verb phrase or the general verb phrase combined with adjective phrases.
  • In one embodiment, a structured natural language query can be created in the form of a question, such as “which patients are diagnosed with disease ‘mild AD’ and who respond to medication ‘aericept’?”. It is equivalent to the query “display patents who are diagnosed with disease ‘mild AD’ and who respond to medication ‘aericept’”. In the query in the form of a question, the portion “which patients” or “which patients are” serves as the verb phrase.
  • A structured natural language query may include a variable definition phrase. For example, as described below in subsection B, if the improved object relational query language of subsection B is used in conjunction with the system 100, an example variable definition phrase can be “range of <Variable> is polygon: abc”. It means that the values of <Variable> are obtained from a collection of objects named “abc”, whose elements are bound to the type “polygon”. A variable definition phrase can include one or more variable definitions.
  • Although a user who is not a programmer may also define a verb phrase and a condition, the term “programmer” is used in connection with FIG. 2 and FIG. 3 because some programming skills may be need to create a formal query text that correspond to a defined verb phrase or condition.
  • FIG. 4 shows one embodiment of a computer-implemented process of composing a structured natural language query. In a preferred embodiment, the query is composed with the improved object relational query language described in subsection B. At a block 410, the process determines whether variables will be defined by a user. If variables are to be defined, the process proceeds to a block 420, where the process prompts a user to specify a variable definition phrase. It should also be understood that variable definition phrases, command and conditions need not be specified in any particular order. In other words, variable definition phrases need not be specified before the command is specified, and the command need not be specified before conditions are specified.
  • FIG. 5 illustrates example screens that prompt a user or a programmer to specify a variable definition phrase. As shown in FIG. 5, in an embodiment in accordance with an improved object relational query language described below in subsection B, a user or programmer is prompted to create an object name in the “Object Name” section 510. The user or programmer is also prompted in the “Type” section 520 to identify the variable type of the created object by selecting from a list of types such as “range”, “temp”, “set” and “bag”. The user or programmer is further prompted to identify the data type of the created object by selecting an object from the “Object Lists” section 530 to inherit the data type of the selected object, or by selecting from the “Primitive Lists” section 540 of primitive data types. The user or programmer is prompted to specify in the “Data Source Element” section 550 the data source element for the created object. A data source element is a set of objects of the same type.
  • In another embodiment in which the object-relational algebra of subsection C is used, the blocks 410-430 are omitted and the user is not prompted to define variables. Variables are pre-defined by a programmer, and the user directly proceeds to a block 440. In order to demonstrate both options, FIGS. 6A and 8A are presented as example screens that do not display variable definitions, while FIGS. 6B and 8B are presented as example screens that display variable definitions.
  • Variable definition phrases can also be automatically generated based on the selected commands and conditions. In the example described below in subsection B with two object classes “polygon” and “vertex”, the condition “contains” requires two polygons as inputs and returns a true value if the first polygon contains the second. The system 100 stores the specification that the “contains” condition requires two input parameters of polygon class. When the condition “contains” is selected by the user, the system 100 automatically generates a variable definition phrase to define the two input parameters as range variables of polygon class. In another example, the condition “intersection” requires two input parameters of polygon class and an output parameter as the intersection polygon of the two input polygons. The “intersection” condition returns a “true” logical value. The system 100 stores the specification that the “intersection” condition requires two input parameters of polygon class and an output parameter of polygon class. When the user selects the “intersection” condition, the system 100 automatically generates a variable definition phrase to define the two input parameters and the output parameter. The two input parameters are preferably defined as range variables and the output parameter is preferably defined as a temporary variable. If the output parameter is used as an input parameter of another condition, then the input parameter of the other condition do not need to be defined again.
  • At the block 440, the user is prompted to select a command from a list of defined commands. In one embodiment, the user is prompted to select from a list of defined commands that are applicable to the objects defined in the variable definition phrase. For example, if the variable definition phrase defines a variable “polygon”, then the user can select from those commands that are applicable to the “polygon” variable, such as “enlarge”, “reduce”, “rotate”, “measure area”, and so forth. FIG. 6A and FIG. 6B are example user interface screens that prompt the user to select a command from a list 610 of commands. Compared to FIG. 6A, FIG. 6B includes an additional section 620 that displays the defined variables. The defined variables are also displayed in the sentence section 630.
  • Referring back to FIG. 4, from the block 440, the process proceeds to a block 450, where the user is prompted to select one or more arguments for the selected command. At a block 460, in one embodiment that preferably uses the object relational algebra of subsection C, the user is prompted to optionally specify a result name of the selected command. The command, arguments and optional result name form a verb phrase. FIG. 7 is an example screen that prompts the user to select command arguments and optionally name a result. Depending on the number and type of defined arguments for the command, the user may be prompted to identify one or more arguments. As shown in FIG. 7, in one embodiment in accordance with an object relational algebra described below in subsection C, for the command “create a group”, the user is prompted to select a data type from section 710, select a data source element from section 720, and optionally name a result of the command in section 730.
  • Referring back to FIG. 4, if the user wishes to select a condition, then the process proceeds from a block 470 to a block 480, where the user is prompted to select a condition from a list of defined conditions. In one embodiment, the user is prompted to select from a list of defined conditions that are applicable to the objects defined in the variable definition phrase. For example, for a variable “polygon”, conditions such as “contains”, “intersect” and “is a square” are applicable to the “polygon” variable. FIG. 8A is an example screen that prompts the user to select from a list of conditions. A defined verb phrase is displayed in section 810 and as part of a structured natural language sentence in sentence section 830. The user selects a condition from a list of conditions in section 820. FIG. 8B is another example screen that prompts the user to select from a list of conditions. Compared to FIG. 8A, FIG. 8B includes an additional section 840 that displays the defined variables. The defined variable definition phrase and verb phrase are displayed as parts of a sentence in the sentence section 830.
  • Referring back to FIG. 4, at a block 490, the user is prompted to specify the parameters of the selected condition. In one embodiment, multiple conditions are allowed to share the same condition name but with different numbers of parameters. FIG. 9 is an example screen that prompts the user to specify parameters for the selected condition. As shown in FIG. 9, for the condition “Who are diagnosed to have”, the user can type in a disease name in section 910, or select from a list of disease names in section 920. The selected condition and parameters form an adjective phrase. In one embodiment, the composition module 130 automatically puts quotation marks around the specified values of command arguments, named result and condition parameters. If the condition has one or more output parameters, the user can also specify an additional condition by defining the output parameters of the original condition as the input parameters of the additional condition. The additional condition thus evaluates a condition based on the output produced by the original condition.
  • Referring back to FIG. 4, at a block 495, the user is prompted to indicate whether more conditions will be entered. If more conditions will be entered, then the process returns to the block 480. Otherwise the process proceeds to an end block 498. If the process returns to the block 480 to specify another adjective phrase, the user is also prompted to specify a logical connector such as “and”, “or” or “and not” to join the two adjective phrases.
  • As described above, FIGS. 5-9 are example screens of one embodiment that prompts a user to select from lists of defined commands and conditions. The user selects a command in FIG. 6A or FIG. 6B, selects arguments for the command in FIG. 7, selects a condition in FIG. 8A or FIG. 8B, and specifies parameters for the condition in FIG. 9. In one embodiment, the user or a programmer also defines variables in FIG. 5.
  • The specified verb phrase and adjective phrases form a composed structured natural language query. FIG. 10 is an example screen that shows a composed structured natural language query sentence in section 1040 including a verb phrase in section 1020 and adjective phrases in section 1030. In FIG. 10, the query appears in a preferred form of an imperative sentence. The query may also include a variable definition phrase, such as the phrase “range of range1 is Case:Case” shown in sections 1010 and 1040 of FIG. 10. The user can also be given the option to directly type a structured natural language query into the system 100, as shown in the sentence section 1110 of FIG. 11. Experienced users may prefer this option. Users can also use a combination of direct typing and screen selection, for example typing the conditions or commands they are familiar with and selecting other conditions or commands from the menus. The composed query is then sent to the translation module 140 for translation.
  • The translation module 140 parses the composed query to identify the verb phrase, the adjective phrases and the optional variable definition phrase. In a preferred embodiment, the translation module 140 looks for keywords that indicate defined verb phrases or defined adjective phrases. For example, in one embodiment where all conditions start with the word “who” or “whose”, the translation module 140 searches for words “who” and “whose” and identifies any such words as signaling the start of an adjective phrase. However, if the word “who” or “whose” is enclosed in quotation marks in the composed query, then it is considered to represent an argument, named result or parameter value instead of the start of a condition. The translation module 140 also searches for logical connectors “and”, “or”, “or not” and “and not”. A logical connector indicates the end of one adjective phrase and the start of another adjective phrase. By searching for condition labels and logical connectors, the module 140 can identify an adjective phrase as separated from the rest of the query and separated from the other adjective phrases.
  • The translation module 140 also searches for keywords of the defined conditions to identify the conditions. For example, the translation module 140 searches for keywords such as “diagnosed” or “respond” as identifying the condition “who are diagnosed with the disease” or “who respond to medication”.
  • The translation module 140 also searches for keywords of the defined commands to identify the command. For example, the module 140 searches for keywords such as “create a group” or “add elements to a group” to identify the corresponding commands. The module 140 can also search for keyword “from” to signal a command argument and keyword “whose name is” to signal a named result.
  • If the command and conditions are not typed by a user but selected from lists of defined commands and conditions, then the parsing process can be simplified. Because the system 100 associates each of the defined commands and conditions with its corresponding formal query text, the corresponding formal query text can be stored at the same time the user selects a command or condition. The stored corresponding formal query texts (or their identifiers) are combined by the translation module 140 to form a translated formal query text. On the other hand, if the user enters a structured natural language query by typing instead of selecting from lists, then the parsing process first identifies the verb phrase and adjective phrases, and then combines their corresponding formal query texts.
  • In a preferred embodiment, the translation module 140 detects errors in a query and automatically corrects errors or suggests corrections to errors. Query errors may include misspellings (for example “diagnosed” misspelled as “diagnoised”), missing words (for example “who respond to medication” entered as “who respond to”), out-of-order structures (for example adjective phrases appearing before the verb phrase), or other grammar errors. The translation module 140 corrects these errors and transforms the query to a correct query. In one arrangement, the translation module 140 returns the detected errors and a suggested corrected query to the user, and asks the user to confirm that the corrected query is what the user intended to compose.
  • The translation module 140 combines the corresponding formal query texts of the verb phrase and adjective phrases of the structured natural language query to form a translated formal query text. The user-specified command argument values and condition parameter values are incorporated into the translated formal query text. The formal query processing module 150 receives the translated formal query text and processes the text on the underlying database to produce results.
  • The underlying database 160 can be relational, object-oriented or object relational. For a relational underlying database, if the translated query text is already in the form of one or more SQL queries, then the formal query processing module 150 directly executes the SQL queries on the underlying relational database. If the translated query text is not in SQL query form, then the formal query processing module 150 converts the translated query text into one or more converted SQL queries, and executes the converted queries on the underlying relational database. Likewise, for an object or object relational underlying database, the query processing module 150 directly executes the translated query text, or converts the translated query text into one or more converted queries that can be directly executed on the underlying database.
  • If the query processing module 150 cannot directly execute the translated query text and cannot convert the translated query context, query processing module 150 still processes the translated query text to return query results from the underlying database 160. For example, in one embodiment with a relational underlying database, using the improved object relational query language described in subsection B, a first variable “t” is defined as a range variable from a first data source element of five polygon objects. A second variable “s” is defined as a range variable from a second data source element of three polygon objects. For a qualification “where t and s intersect,” the translated query text “t.intersect(s)” cannot be directly executed on the relational underlying database. The query processing module 150 enumerates the fifteen combinations of the polygon objects of the two data source elements to determine whether the qualification is true. In addition to the brute-force evaluation of all possible combinations, optimization methods, for example rearranging the order of the conditions in a qualification, can also be employed.
  • In one preferred embodiment, the structured natural language system 100 works in conjunction with the improved object relational query language described in subsection B. For example, a user creates a variable definition phrase “range of range1 is case:case” in FIG. 10, to define range1 as a range variable in accordance with the improved query language described in subsection B. The first “case” in “case:case” represents a data type and the second “case” represents a data source element. Alternatively, the system 100 prompts the user to define a variable by selecting from a list of data types, a list of data source elements, and as a range, temporary, set or bag variable. However, the structured natural language system 100 can also work with conventional relational, object-oriented, and object relational databases. The improved object relational query language or the object relational algebra would expand the scope of permitted queries, but is not a prerequisite for the structured natural language system 100.
  • Rules can also be composed in structured natural language form and used in the system 100 for data integrity and other purposes. For example, a rule can be presented in the form “On <event> If <qualification> then <command>”, “On <event> If <qualification> then <command> otherwise <command2>”, or “On <event> If <qualification> then <qualifcation2>”. Using a computer system for medical use as an example, rules can be used to ensure that a patient's age entered into the system is greater than zero, that penicillin is not administered if the patient is known to be allergic to it, that two types of medications of negative interaction are not administered to the same patient, and so forth. A rule's scope can be defined to apply to a single object, to apply to all objects of a class, or to all objects. In one embodiment, the structured natural language system 100 allows a user to define a rule by specifying a qualification and a command of the rule. When an event happens (for example, when a patient's temperature changes), if a qualification (for example, patient's body temperature exceeds 102 degrees Fahrenheit) is satisfied, then a command (for example, send a warning to a nurse or physician) is activated according to the rule. In some embodiments, events and qualifications can be used interchangeably.
  • Structured natural language queries and structured natural language rules are typically used in different contexts. For example, the system 100 monitors the qualification and the optional event of a rule at all times, and executes the rule's command when the qualification and optional event are met. On the other hand, a query is typically executed based on a user instruction. In addition, a rule is typically displayed in the form of “if qualification . . . then command [else command2]” or “if qualification1 then qualification2” form, while a query is typically displayed in the “command [qualification]” form. However, since a structured natural language rule and a structured natural language query typically each includes a command and a qualification, it is thus feasible to use a rule and a query interchangeably. In one embodiment, a user can select a defined query as a rule, or select a defined rule as a query.
  • FIG. 12 shows an example screen displaying a rule in the form of a structured natural language sentence. A rule includes an optional variable definition phrase, a verb phrase (command) and one or more adjective phrases (qualification). A rule may also include one or more event phrases. A rule can have a second verb phrase corresponding to the action when the qualification is false. In FIG. 12, section 1220 displays the qualification “any of Patient.CurrentVisit treatments interact”, and section 1230 displays the command “display message string ‘treatment check: interact’”. Section 1210 displays the formal query text that corresponds to the rule. The qualification and the command can be specified by a user, for example using the screens of FIGS. 6A-9.
  • A set of rules can be grouped into a logical unit named a “macro.” A user or a programmer can specify the relationship among the rules within the macro, and to use a “goto” statement followed by a label to invoke another rule. The label identifies the other rule to be invoked. For example, a macro for approving or rejecting a loan application can include the following rules:
  • if house-or-rent = “house” then goto(rule-house); else goto(rule-rent);
    rule-house: if house-value > 500,000 and income > 40,000 then
    return(true);
    else goto(rule-house2);
    rule-house2: if house-value > 5,000,000 then return(true); else
    return(false).
    rule-rent: if income > 60,000 then return(true); else return(false).
  • In the example above, “goto(rule-house)” represents evaluating another rule identified by the label “rule-house”. “Return(true)” represents returning a decision to approve the loan application. Instead of returning a “true” or “false” logical value, a rule can also return an object or data source element (DSE) as output.
  • In real world applications, rules can be far more complex than the example above. The macro and the “goto” statement allow a user or a programmer to specify relationships among the rules. The user or programmer is prompted to assign a macro name to the macro. In addition, a rule or a macro can be used as a condition or command in a query or in another rule.
  • A plurality of queries, or a combination of one or more rules and one or more queries, can also be grouped into a macro. Queries can be identified by labels, and “goto” statements followed by labels can be used to invoke other queries or rules. A macro can be associated with an object or a class of objects.
  • Similar to the manner of composing a structured natural language query, a user can compose rules in structured natural language form. Similar to a query sentence, a rule sentence has one or more adjective phrases to specify the qualification of the rule, and a verb phrase to specify the action to be taken when the qualification is true. A rule sentence can have a second verb phrase corresponding to the action to be taken when the qualification is false. A rule sentence can also have one or more optional event phrases, each corresponding to an event.
  • Referring to FIG. 1, the rule composition module 131 prompts a user to compose a rule by selecting one or more adjective phrases as qualification, one or two verb phrases as commands, and optional adjective phrases as events. The user can also specify a macro of rules and/or queries and specify the relationships among the rules and/or queries within the macro. The rule translation module 141 translates a rule into a formal query text. In one embodiment, the rule translation module 141 can also directly translate a rule into a function, procedure or subroutine in a host programming language of the system 100, such as Java, C, C++, and so forth. The rule processing module 151 evaluates the qualification and subsequently executes the command corresponding to the qualification. In addition, the module 151 monitors the occurrence of the optional events of the rules.
  • B. Improved Object Relational Query Language
  • A query or a rule under the improved object relational query language can be expressed in the following form:
  • Variable specifier
    ...
    variable specifier
    command
    [where qualification]
  • For each possible value of the variables defined in the variable specifiers, if it satisfies the qualification, then the command is executed. The bracket “[ ]” indicates a qualification is optional. A qualification includes one or more conditions (each condition represented by a method or macro) and the conditions' parameters. A command includes a method or macro and the command's arguments. An argument can be a constant or a variable. A method that returns a Boolean true or false value is a logical method. A macro that returns a Boolean true or false value is a logical macro. Otherwise a method/macro is called a general method or general macro. A method or macro can have one or more input parameters or output parameters. If a general method/macro is used as a condition, the condition is considered true once executed, regardless of the value returned by the method/macro. In the simplest case, a qualification consists of one method or macro with its parameters, and the qualification can be built recursively as follows:
  • If α and β are qualifications then (α and β) is a qualification.
  • If α and β are qualifications then (α or β) is a qualification.
  • If α is a qualification then (not α) is a qualification.
  • A variable specifier can be declared in one of the following forms:
  • (a) Range of <variable-id> is <data-type>:<DSE>
    (b) Temp of <variable-id> is <data-type>
    (c) Set of <variable-id> is <data-type>:<DSE>
    (d) Bag of <variable-id> is <data-type>:<DSE>
    (e) Set Temp of <variable-id> is <data-type>
    (f) Bag Temp of <variable-id> is <data-type>
  • “Data-type” in the forms above represents the format of the data values, such as integer, floating point, logical, text, date, and user-defined data formats. “DSE” in the forms above represents “data source element.” A data source element can be a user-defined set of objects, a table, a class, a spreadsheet file that includes a set of data rows, an attribute of an object whose value is a set, a method that returns a set of objects, a set variable as shown in form (c) above, a bag variable as shown in form (d) above, the result of a previous query, an object relational expression as described below in more detail in subsection C, and the like.
  • In addition, a DSE can be a set of structured data such as a tree, a directed asynchronous graph (DAG) or a semi-structured data set such as an extended markup language (XML) document. A data type can be bound to a DSE in a variety of ways. A subset of a DSE can be formed as another DSE that binds to a data type. For example, assume a tree DSE “T” with two subsets (nodes) called “Person” and “Organization”. A binding Person:T may define the collection of Person nodes of “T” as a new DSE. A data type can also be bound to an XML document by the use of tags. For example, binding a class A (a:integer, b:string) to the XML document below results in a DSE with an object A1 of type A whose value is (10, “America). The tags <A1> and </A1> may be ignored because the object A1 can be inferred from the attributes a and b.
  • <ROOT>:
     <A>
      <A1>
         <a> 10 </a>
        <b> America </b>
      </A1>
     </A>
    </ROOT>
  • Additionally, a data source element can be dynamically generated to contain the results from one or more previous queries. A variable defined in form (a) above is a range variable. A variable defined in form (b) is a temporary variable. A variable defined in form (c) is a set variable. A variable defined in form (d) is a bag variable. A variable defined in form (e) is a temporary set variable. A variable defined in form (f) above is a temporary bag variable.
  • A range variable obtains its possible values from its associated DSE. A set of bag variable's value is a set identified by the associated DSE. Unlike a set that contains only distinct values such as {1, 2, 3, 4}, a bag can contain duplicate values such as {1, 2, 3, 4, 1, 3}. Set variables and bag variables can be used to build the domains of range variables or temporary variables. Set and bag variables can also be used to aggregate objects in order to form an argument of a method that requires a set or a bag as an argument. A temporary variable is typically used as an output parameter of a general method. The value of a temporary variable is computed at the time the corresponding method or macro for the query is evaluated. A temporary set or bag variable's value is a set identified by the associated DSE and computed at the time the corresponding method or macro for the query is evaluated.
  • The above forms illustrate a preferred syntax of the improved object relational query language. Other syntax forms can be used without departing from the sprit and scope of the invention. For example, in another syntax form, variables are declared using the form:
  • <variable-id> datatype: <data-type>, variabletype: <variable-type>, dse:
    <DSE>
    with <variable-type> indicating a variable type of range, temporary, set,
    bag, temporary set or temporary bag variable.
  • Some examples are used below to more clearly explain the invention. Returning to the preferred syntax, an example database is defined with two classes, “vertex” and “polygon”, with a polygon object defined by a set of vertices and a vertex object defined by two coordinates.
  • Class vertex (name: string, x: integer, y: integer) key: name
  • Class polygon (name: string, vertices: set of vertex) key: name
  • Associated with the class polygon, two conditions, “intersect” and “contain,” are defined. The “intersect” condition takes two polygons as inputs and returns a true value if the pair of input polygons intersect with each other. The “contain” condition takes two polygons as inputs and returns a true value if the first polygon contains the second.
  • Some example queries using the improved object relational query language are presented below. The following query finds the names and the vertices of all polygons from the data source element abc:
  • Range of t is polygon: abc (query B1)
  • Retrieve (t. name, t. vertices)
  • The following query finds all pairs of polygons from the data source element abc where one polygon contains the other:
  • Range of t is polygon:abc (query B2)
    Range of s is polygon:abc
    Retrieve (t.name, s.name)
    Where t.contains(s)
  • In the query above, “where t.contains(s)” is a qualification. For this qualification, “s” is a parameter of the condition “contains”.
  • The following query shows all polygons from the data source element abc that are contained in polygon C, do not intersect with polygon E, and whose sizes are greater than 5:
  • Range of t is polygon:abc (query B3)
    Range of s is polygon:abc
    Range of r is polygon:abc
    Var u is float
    s.show( )
    Where t.name.eq(“C”) and t.contains(s) and r.name.eq(“E”)
    and (not r.intersect(s)) and s.size(u) and u.gt(5)
  • The following query shows all polygons from the data source element abc that intersect with polygon E, in which the intersection is a square whose size is greater than 5, and whose vertices contain a square whose size is greater than 10:
  • Range of t is polygon:abc (query B4)
    Range of s is polygon:abc
    Set u is vertex
    Set w is vertex:s.vertices
    Var v is float
    Var x is float
    s.show( )
    Where t.name.eq(“E”) and s.intersection(t, u, v) and
    is-square(u) and v.gt(5) and .is-square(w) and w.size(x) and
    x.gt(10)
  • In the query above, “intersection” is a general method associated with a polygon. It takes another polygon as the input and returns the intersection (a set of vertices) and the area of the intersection as the output. The qualification “t.name.eq(“E”) and s.intersection(t, u, v) and is-square(u) and v.gt(5) and .is-square(w) and w.size(x) and x.gt(10)” includes several conditions joined by the logical connector “and”. The condition “is-square” is applied to a set of vertices to determine if the collection of vertices form a square.
  • The improved object relational language is similar to ANSI SQL-99 in some aspects. For example, query B3 can be rewritten in SQL-99 form:
  • Select s.show( )
    From t abc, s abc, r abc
    Where t.name = “C” and r.name = “E” and
    t.contains(s) and not r.intersect(s) and s.size( ) >5
  • However, the improved object relational language has significant advantages over ANSI SQL-99. For example, query B4 cannot be rewritten in SQL-99. Compared to the improved object relational language, ANSI SQL-99 allows only “range” type variables and does not allow set, bag, temporary, temporary set or temporary bag variables. Under SQL-99, the scope of a variable is determined by its “range” type and cannot be an arbitrary data source element. Moreover, a condition under SQL-99 cannot produce any value other than a logical true or false value, so a general method cannot be used as a condition or argument. The improved object relational language removes these limitations.
  • A rule in the improved object relational query language can be expressed in the form of:
  • variable specifier
    ...
    variable specifier
    [On event] [if qualification then] command [else command2].
    Or
    variable specifier
    ...
    variable specifier
    [On event] [if qualification1 then] qualification2
  • A rule expressed in the second form “[On event] [if qualification1 then] qualification2” is equivalent to a rule in the first form “[On event] [if qualification then] command [else command2]”. However the second form may be more user-friendly in some situations. For example, the following rule in the second form:
  • If the patient has prescription x then the patient cannot have prescription y
    is equivalent to a rule in the first form:
    If the patient has prescription x and the patient has prescription y then
    report(“constraint violation”)
  • Optionally, a rule can also include a label that identifies the rule, one or more input parameters and one or more output parameters. An input parameter is a parameter whose value is used in the qualification of the rule. An output parameter is a parameter whose value is returned as computed by a command or condition of the rule. A macro in the improved object relational query language can be expressed in the form of:
  • [On event:] [Object Type] Macro Macro-Name ([parameter], . . . , [parameter])
  • global Variable specifier
    ...
    gobal variable specifier
    [label1:] ru/e/query1;
    .....
    [label2:] rule/query2;
  • In the example above, “Object type” refers to the data type of the optional return value. “Parameter” refers to an input or output parameter of the macro. “Label1” represents the label that identifies a rule or a query “rule/query1”, “label2” represents the label that identifies a rule or a query “rule/query2”. An event may be the addition of a new object, the deletion or modification of an object, or a user-defined event. A rule or a query identified by a label can be invoked by another rule or query with a “goto(label)” statement.
  • C. Object Relational Algebra
  • The object relational algebra is preferably characterized as follows:
      • 1. A DSE is an object relational expression (“expression” hereinafter in subsection C).
      • 2. An expression is “type bound” if all elements of the expression have the same non-primitive type. Otherwise it is “type free”. A type free expression represents a relation such as a flat file or a set of records whose fields are primitive values such as integers, floating point numbers and text strings.
      • 3. If E is a type bound expression, then σP(E) is a type-bound expression, where P is a qualification. The operator σ (called the “Select” operator) returns as result objects in E that satisfy P. P may include conditions associated with the type of the expression, conditions associated with the attributes of the objects of the expression, or both. If E is a type free expression, then σP(E) is a type free expression, where P is a qualification. The operator σ (called the “Select” operator) returns as result objects in E that satisfy P.
      • 4. If E is an expression, then ΠS(E) is a type free expression, where S is a set of attributes. The operator Π (called the “Project” operator and the projection from E) removes those objects of E that satisfy the attributes specified in S from each object of E, and returns as result the other objects of E that do not satisfy the attributes specified in S. In rare occasions, ΠS(E) can be a type bound expression.
      • 5. If E and F are expressions then:
        • E×F is an expression. The operator (Cartesian Product) computes the set {(x,y)|xεE, yεF};
        • E∪F is an expression. The operator (Union) computes the union of the two expressions; and
        • E−F is an expression. The operator (Difference) computes the difference between the two expressions.
      • 6. If E is an expression then SGA (E) is a type free expression. The operator G (called the “Aggregate” operator) groups the objects of E according to the set of attributes specified in S (in the form: attribute, attribute, . . . ), and for each group computes the set of aggregate functions in the form of aggregate-function(attribute) or aggregate-function(object). The aggregate functions can include primitive functions such as min, max, mean, variance, avg, sum, count that can be applied to a group of primitive values. The aggregate functions can also include general functions defined by users or programmers that can be applied to a group of non-primitive objects. A general function is a user or programmer defined non-primitive function operated on a set of non-primitive objects. For example, a user or programmer can define a “best image” general function to process a collection of visual image files and return as output the visual image file with the best signal-to-noise ratio. When the general function is called to operate, the general function typically calls a program, a procedure, a query, a collection of primitive functions, a collection of primitive and general functions, or a collection of other general functions. In rare occasions, SGA (E) can be a type bound expression.
      • 7. If f is a method or macro that returns a DSE as its result, then f(arguments) is an expression. Whether f(arguments) is type free or type bound is determined by the type of the elements in the DSE. If all elements of the DSE belong to the same type and the type is not primitive, the expression is type-bound; otherwise it is type-free.
  • Some example object relational expressions are shown below with an object class “patient” and a data source element “hospital” defined.
  • To retrieve those patients of hospital who are diagnosed to have mild AD and who respond to the medication “Aericept”, call the set S1:
      • S1<σP1 (hospital), where P1=diagnosed(“mild AD”) and respond_to(“Aericept”) (query C1)
  • To retrieve those patients of hospital who are diagnosed to have moderate AD and who respond to the medication “Aericept”, call the set S2:
      • S2←σP2(hospital), where P2=diagnosed(“moderate AD”) and respond to(“Aericept”) (query C2)
  • To compute the union of S1 and S2 and call the result S:
      • S←S1∪S2 (query C3)
  • To find the average age of the male and female patients in S:
      • genderGavg(age)(S) (query C4)
  • To extract the gene profiles of the patients in S1, call it G1:
      • G1←Πgene-profile(S1) (query C5)
  • To extract the gene profiles of the patients in S2, call it G2:
      • G2←Πgene-profile(S2) (query C6)
  • To compare G1 and G2, and plot the results:
  • Compare_profile_and_plot(G1,G2) (query C7)
  • The queries can be combined in a number of ways. For example, queries C1 to C4 can be combined into one query as:
      • genderGavg(age)P1(hospital)∪σP2(hospital)) (query C8)
  • Compared to conventional relational algebra, the object relational algebra enables the manipulation of complex objects. Under relational algebra, a data source element is simply a relation such as a flat file or a set of records whose fields are primitive values. Moreover, as described above in the Background section, a condition of the “select” operator is restricted to simple comparisons of primitive values. In addition, a method is not allowed to be an expression under relational algebra. Finally, aggregation functions under conventional relational algebra are restricted to functions such as minimum, maximum, mean, variance, average, sum and count, which can be applied only to primitive values. The object relational algebra removes these limitations and provides significant advantages over conventional relational algebra.
  • Although the present invention has been described in terms of certain preferred embodiments, other embodiments will be apparent to those of ordinary skill in the art from the disclosure herein. Additionally, other combinations, omissions, substitutions and modifications will be apparent to the skilled artisan in view of the disclosure herein. For example, the modules of the structured natural language system 100 can be combined or separated into more or fewer modules. Some of the actions illustrated in the flowcharts can be executed in parallel, in sequence or in different orders. Accordingly, the present invention is not to be limited by the description of the preferred embodiments, but is to be defined by reference to the appended claims.
  • The following tables list some sample methods included in one embodiment of the system. It should be understood that not all of the methods listed below need to be included in a system of the invention, and that additional methods can be defined.
  • System Provided Methods Sample Logical Methods
  • Method Return
    Name Argument Value Comment
    Gt (First Argument: single, Boolean Returns TRUE if single type and
    Second Argument: single) First Argument > Second
    Argument.
    Ge (First Argument: single, Boolean Returns TRUE if single type and
    Second Argument: single) First Argument >= Second
    Argument.
    Lt (First Argument: single, Boolean Returns TRUE if single type and
    Second Argument: single) First Argument < Second
    Argument.
    Le (First Argument: single, Boolean Returns TRUE if single type and
    Second Argument: single) First Argument <= Second
    Argument.
    Eq (First Argument: single, Boolean Returns TRUE if single type and
    Second Argument: single) First Argument == Second
    Argument.
    Ne (First Argument: single, Boolean Returns TRUE if single type and
    Second Argument: single) First Argument != Second
    Argument.
    Gt (First Argument: integer, Boolean Returns TRUE if integer type and
    Second Argument: integer) First Argument > Second
    Argument.
    Ge (First Argument: integer, Boolean Returns TRUE if integer type and
    Second Argument: integer) First Argument >= Second
    Argument.
    Lt (First Argument: integer, Boolean Returns TRUE if integer type and
    Second Argument: integer) First Argument < Second
    Argument.
    Le (First Argument: integer, Boolean Returns TRUE if integer type and
    Second Argument: integer) First Argument <= Second
    Argument.
    Eq (First Argument: integer, Boolean Returns TRUE if integer type and
    Second Argument: integer) First Argument == Second
    Argument.
    Ne (First Argument: integer, Boolean Returns TRUE if integer type and
    Second Argument: integer) First Argument != Second
    Argument.
    Gt (First Argument: string, Boolean Returns TRUE if the result of First
    Second Argument: string) Argument. Compare To (Second
    Argument) is greater than 0.
    Compare To is a method of the
    String class of Java.
    Ge (First Argument: string, Boolean Returns TRUE if the result of First
    Second Argument: string) Argument. Compare To (Second
    Argument) is greater than 0.
    Compare To is a method of the
    String class of Java.
    Lt (First Argument: string, Boolean Returns TRUE if the result of First
    Second Argument: string) Argument. Compare To (Second
    Argument) is greater than 0.
    Compare To is a method of the
    String class of Java.
    Le (First Argument: string, Boolean Returns TRUE if the result of First
    Second Argument: string) Argument. Compare To (Second
    Argument) is greater than 0.
    Compare To is a method of the
    String class of Java.
    Eq (First Argument: string, Boolean Returns TRUE if string type and
    Second Argument: string) First Argument and Second
    Argument are the same.
    Ne (First Argument: string, Boolean Returns TRUE if string type and
    Second Argument: string) First Argument and Second
    Argument are not the same.
  • Sample General Methods
  • Method Return
    Name Parameter Value Comment
    as (First Argument: single s, Boolean Returns TRUE if single type and
    Second Argument: single s) First Argument is assigned to
    Second Argument.
    add (First Argument: single s, Boolean For single type, adds First
    Second Argument: single s, Argument and Second Argument,
    Third Argument: single s) substitutes the value into Third
    Argument, and returns TRUE.
    sub (First Argument: single s, Boolean For single type, subtracts Second
    Second Argument: single s, Argument from First Argument,
    Third Argument: single s) substitutes the value into Third
    Argument, and returns TRUE.
    mult (First Argument: single s, Boolean For single type, multiplies First
    Second Argument: single s, Argument by Second Argument,
    Third Argument: single s) substitutes the value into Third
    Argument, and returns TRUE.
    div (First Argument: single s, Boolean For single type, divides First
    Second Argument: single s, Argument by Second Argument,
    Third Argument: single s) substitutes the value into Third
    Argument, and returns TRUE.
    as (First Argument: integer i, Boolean Returns TRUE if integer type and
    Second Argument: integer I) First Argument is assigned to
    Second Argument.
    add (First Argument: integer i, Boolean For integer type, adds First
    Second Argument: integer i, Argument and Second Argument,
    Third Argument: integer I) substitutes the value into Third
    Argument, and returns TRUE.
    sub (First Argument: integer I, Boolean For integertype, subtracts Second
    Second Argument: integer i, Argument from First Argument,
    Third Argument: integer I) substitutes the value into Third
    Argument, and returns TRUE.
    mult (First Argument: integer I, Boolean For integertype, multiplies First
    Second Argument: integer i, Argument by Second Argument,
    Third Argument: integer I) substitutes the value into Third
    Argument, and returns TRUE.
    div (First Argument: integer i, Boolean For integer type, divides First
    Second Argument: integer i, Argument by Second Argument,
    Third Argument: integer i) substitutes the value into Third
    Argument, and returns TRUE.
    as (First Argument: string s, Boolean Returns TRUE if string type and
    Second Argument: string s) First Argument is assigned to
    Second Argument.
    max (First Argument: Boolean For single type, First Argument
    DatasourceElement dse, DatasourceElement, and Second
    Second Argument: string Argument string (fieldname
    fieldName, Third Argument: specified), sets the maximum
    single result) value to result and returns TRUE.
    max (First Argument: Boolean For integer type, First Argument
    DatasourceElement dse, DatasourceElement, and Second
    Second Argument: string Argument string (fieldname
    fieldName, Third Argument: specified), sets the maximum
    integer result) value to result and returns TRUE.
    min (First Argument: Boolean For single type, First Argument
    DatasourceElement dse, DatasourceElement, and Second
    Second Argument: string Argument string (fieldname
    fieldName, Third Argument: specified), sets the minimum
    single result) value to result and returns TRUE.
    min (First Argument: Boolean For integer type, First Argument
    DatasourceElement dse, DatasourceElement, and Second
    Second Argument: string Argument string (fieldname
    fieldName, Third Argument: specified), sets the minimum
    integer result) value to result and returns TRUE.
    avg (First Argument: Boolean For single type, First Argument
    DatasourceElement dse, DatasourceElement, and Second
    Second Argument: string Argument string (fieldname
    fieldName, Third Argument: specified), sets the average value
    single result) to result and returns TRUE.
    count (First Argument: Boolean For integer type, First Argument
    DatasourceElement dse, DatasourceElement, and sets the
    Second Argument: integer count value to result and returns
    result) TRUE.
    union (First Argument: Datasource Uses First Argument
    DatasourceElement dse, Element DatasourceElement and Second
    Second Argument: Argument DatasourceElement to
    DatasourceElement dse) set the values and returns
    DatasourceElement.
    unionall (First Argument: Datasource Uses First Argument
    DatasourceElement dse, Element DatasourceElement and Second
    Second Argument: Argument DatasourceElement to
    DatasourceElement dse) set the values and returns
    DatasourceElement.
    dereference (First Argument: Datasource Uses First Argument
    DatasourceElement dse, Element DatasourceElement and Second
    Second Argument: Argument DatasourceElement to
    DatasourceElement dse) set the values and returns
    DatasourceElement.
  • Sample Logical Methods Related to Dates
  • Method Return
    Name Parameter Value Comment
    before (First Argument: date when) Boolean Returns TRUE only if the time
    point is before the time point of
    the when object.
    after (First Argument: date when) Boolean Returns TRUE only if the time
    point is after the time point of
    the when object.
  • Additional Sample Methods
  • Method Return
    Name Parameter Value Comment
    isNull None Boolean Returns TRUE if NULL.
    cast (First Argument: Boolean Returns the value as the
    (PrimitiveDataType dt) PrimitiveDataType value.
    gt (First Argument: integer or Boolean Returns TRUE if the value >
    [string, integer, shortint, First Argument.
    longint, single, real])
    ge (First Argument: integer or Boolean Returns TRUE if the value >=
    [string, integer, shortint, First Argument.
    longint, single, real])
    lt (First Argument: integer or Boolean Returns TRUE if the value <
    [string, integer, shortint, First Argument.
    longint, single, real])
    le (First Argument: integer or Boolean Returns TRUE if the value <=
    [string, integer, shortint, First Argument.
    longint, single, real])
    eq (First Argument: integer or Boolean Returns TRUE if the value ==
    [string, integer, shortint, First Argument.
    longint, single, real])
    ne (First Argument: integer or Boolean Returns TRUE if the value !=
    [string, integer, shortint, First Argument.
    longint, single, real])
    as (First Argument: integer or Boolean Assigns the value of the
    [string, integer, shortint, argument and returns TRUE.
    longint, single, real])
    add (First Argument: integer or Boolean Adds First Argument and
    [string, integer, shortint, Second Argument, substitutes
    longint, single, real], the value into Second
    Second Argument: integer Argument, and returns TRUE.
    or [string, integer, shortint,
    longint, single, real])
    sub (First Argument: integer or Boolean Subtracts the value from First
    [shortint, longint, single, Argument, substitutes the
    real], value into Second Argument,
    Second Argument: integer and returns TRUE.
    or [shortint, longint, single,
    real])
    mult (First Argument: integer or Boolean Multiplies the value by First
    [shortint, longint, single, Argument, substitutes the
    real], value into Second Argument,
    Second Argument: integer and returns TRUE.
    or [shortint, longint, single,
    real])
    div (First Argument: integer or Boolean Divides the value by First
    [shortint, longint, single, Argument, substitutes the
    real], value into Second Argument,
    Second Argument: integer and returns TRUE.
    or [shortint, longint, single,
    real])
    getAllMatches (First Argument: string Boolean Searches for the argument
    pattern) pattern as match pattern, and
    returns the results in an array
    of the REMatch type.
    getMatch (First Argument: string Boolean Searches for the argument
    pattern) pattern as pattern, and returns
    the first match.
    isMatch (First Argument: string Boolean Searches for the argument
    pattern) pattern as match pattern, and
    returns TRUE if matched.
    hasMoreElements (First Argument: string Boolean Regular expression pattern
    pattern)
    hasMoreMatches (First Argument: string Boolean Regular expression pattern
    pattern)
    endsWith (First Argument: String Boolean Determines if the string ends
    suffix) with the specified suffix.
    EqualsIgnoreCase (First Argument: String Boolean Compares the string with
    anotherString) another string.
    startsWith (First Argument: String Boolean Determines if the string starts
    prefix) with the specified prefix.

Claims (23)

1. A computer-implemented method of creating a query in an object-relational query language, the method comprising:
prompting, via a user interface provided by a computer system, a user to define one or more constants or variables comprising a range variable, a bag variable, a set variable, a temporary variable, a temporary bag variable, or a temporary set variable; and the domains of the one or more constants or variables are chosen from a list of defined object-relational data source elements;
prompting, via the user interface, a user to choose one or more previously defined object-relational methods;
prompting, via the user interface, the user to choose from a list of defined object oriented variables and constants as one or more arguments to the one or more object-relational methods selected;
prompting, via the user interface, the user to choose an optional one or more conditions from at least a set of defined conditions, and prompting the user to choose from a list of defined object oriented variables and constants as one or more parameters for the one or more selected conditions; and
using the responses received from the user and an object-relational query language to create an object-relational compatible query.
2. The method of claim 1, further comprising receiving from the user, via the user interface, in response to a prompt for an argument of one of the one or more object-relational methods or a parameter of one of the one or more conditions, a constant or a variable that is declared to be a range variable, a bag variable, a set variable, a temporary variable, a temporary bag variable, or a temporary set variable.
3. The method of claim 1, further comprising receiving from the user, via the user interface, in response to a prompt for an argument of one of the one or more object-relational methods or a parameter of one of the one or more conditions, a previously created constant or variable.
4. The method of claim 1, further comprising receiving from the user, via the user interface, in response to a prompt for an object-relational data source element, an object-relational query created recursively.
5. A computer system for creating a query in an object-relational query language, the system comprising:
a database comprising previously defined object-relational methods, object-relational algebra operators, defined object-relational data source elements, and defined conditions;
a user interface, provided by the computer system, adapted to prompt a user to choose one or more constants or variables comprising a range variable, a bag variable, a set variable, a temporary variable, a temporary bag variable, or a temporary set variable; and the domains of the one or more constants or variables are chosen from a list of defined object-relational data source elements;
the user interface further adapted to prompt a user to choose one or more previously defined object-relational methods;
the user interface further adapted to prompt the user to choose from a list of defined variables and constants as one or more arguments to the one or more object-relational methods selected or the one or more object-relational algebra operators;
the user interface further adapted to prompt the user to choose an optional one or more conditions from at least a set of defined conditions, and adapted to prompt the user to specify one or more defined variables and constants as one or more parameters for the one or more selected conditions; and
a query translation module of executable program code running on the computer system configured to use an object-relational language to create an object-relational query with the responses received from the user.
6. The system of claim 5, further comprising a formal query processing module in communication with the database, the formal query process module configured to search the object-relational database for data entries that satisfy a formal query text of the object-relational query.
7. The system of claim 6, wherein the user interface is further adapted to convey to the user the data entries from the object-relational database found by the formal query processing module to satisfy a formal query text of the object-relational query.
8. The system of claim 5, wherein the parameters of at least one of the one or more object-relational conditions comprise a constant or a variable that is declared to be a range variable, a bag variable, a set variable, a temporary variable, a temporary bag variable, or a temporary set variable.
9. The system of claim 5, wherein the object-relational data source elements comprise a previously created constant or variable.
10. The system of claim 5, wherein the parameters of at least one of the one or more object-relational conditions comprise a previously created constant or variable.
11. The system of claim 5, wherein the object-relational data source elements comprise an object-relational query created recursively.
12. A computer-implemented method of creating an object-relational rule, the method comprising:
prompting, via a user interface provided by a computer system, a user to define one or more constants or variables comprising a range variable, a bag variable, a set variable, a temporary variable, a temporary bag variable, or a temporary set variable; and the domains of the one or more constants or variables are chosen from a list of defined object-relational data source elements;
prompting, via the user interface, the user to choose an optional one or more conditions from at least a set of defined conditions, and prompting the user to choose from a list of defined object oriented variables and constants as one or more parameters for the one or more selected conditions;
prompting, via the user interface, the user to choose one or more previously defined object-relational methods;
prompting, via the user interface, the user to choose from a list of defined object oriented variables and constants as one or more arguments to the one or more methods selected; and
using the responses received from the user to create an object-relational rule comprising one or more variables, with the one or more conditions as a premise of the object-relational rule and the one or more object-relational methods as a consequence of the object-relational rule.
13. The method of claim 12, further comprising receiving from the user, via the user interface, in response to a prompt for an argument of one of the one or more object-relational methods or a parameter of one of the one or more conditions, a constant or a variable that is declared to be a range variable, a bag variable, a set variable, a temporary variable, a temporary bag variable, or a temporary set variable.
14. The method of claim 12, further comprising receiving from the user, via the user interface, in response to a prompt for an argument of one of the one or more object-relational methods or a parameter of one of the one or more conditions, a previously created constant or variable.
15. The method of claim 12, further comprising receiving from the user, via the user interface, in response to a prompt for an object-relational data source element, an object-relational query created recursively.
16. A computer system for creating an object-relational rule, the system comprising:
a database comprising previously defined object-relational methods, object-relational algebra operators, defined object-relational data source elements, and defined conditions;
a user interface, provided by the computer system, adapted to prompt a user to choose one or more constants or variables comprising a range variable, a bag variable, a set variable, a temporary variable, a temporary bag variable, or a temporary set variable; and the domains of the one or more constants or variables are chosen from a list of defined objected relational data source elements;
the user interface further adapted to prompt the user to choose an optional one or more conditions from at least a set of defined conditions, and prompting the user to specify one or more defined variables and constants as one or more parameters for the one or more selected conditions;
the user interface further adapted to prompt the user to choose one or more previously defined object-relational methods;
the user interface further adapted to prompt the user to choose from a list of defined variables and constants as one or more arguments to the one or more object-relational methods selected or the one or more object-relational algebra operators; and
a rule translation module of executable program code running on the computer system configured to use an object-relational language to create an object-relational rule with the responses received from the user.
17. The system of claim 16, further comprising a formal rule processing module in communication with the database, the formal rule processing module configured to search the object-relational database for data entries that satisfy a formal rule text of the object-relational rule.
18. The system of claim 17, wherein the user interface is further adapted to convey to the user the data entries from the object-relational database found by the formal rule processing module to satisfy the premise of the formal rule text of the object-relational rule.
19. The system of claim 17, wherein the formal rule processing module is further adapted to execute the one or more object-relational methods in a consequence part of the object-relational rule based on the data entries from the object-relational database found by the formal rule processing module that satisfy the premise of the object-relational rule.
20. The system of claim 16, wherein the parameters of at least one of the one or more object-relational conditions comprise a constant or a variable that is declared to be a range variable, a bag variable, a set variable, a temporary variable, a temporary bag variable, or a temporary set variable.
21. The system of claim 16, wherein the object-relational data source elements comprise a previously created constant or variable.
22. The system of claim 16, wherein the parameters of at least one of the one or more object-relational conditions comprise a previously created constant or variable.
23. The system of claim 16, wherein the object-relational data source elements comprise an object-relational query created recursively.
US12/619,111 2002-10-31 2009-11-16 Structured natural language query and knowledge system Abandoned US20100063968A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US12/619,111 US20100063968A1 (en) 2002-10-31 2009-11-16 Structured natural language query and knowledge system

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US10/286,506 US7263517B2 (en) 2002-10-31 2002-10-31 Structured natural language query and knowledge system
US11/846,428 US7620542B2 (en) 2002-10-31 2007-08-28 Structured natural language query and knowledge system
US12/619,111 US20100063968A1 (en) 2002-10-31 2009-11-16 Structured natural language query and knowledge system

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US11/846,428 Continuation US7620542B2 (en) 2002-10-31 2007-08-28 Structured natural language query and knowledge system

Publications (1)

Publication Number Publication Date
US20100063968A1 true US20100063968A1 (en) 2010-03-11

Family

ID=32175472

Family Applications (3)

Application Number Title Priority Date Filing Date
US10/286,506 Active 2024-11-16 US7263517B2 (en) 2002-10-31 2002-10-31 Structured natural language query and knowledge system
US11/846,428 Expired - Lifetime US7620542B2 (en) 2002-10-31 2007-08-28 Structured natural language query and knowledge system
US12/619,111 Abandoned US20100063968A1 (en) 2002-10-31 2009-11-16 Structured natural language query and knowledge system

Family Applications Before (2)

Application Number Title Priority Date Filing Date
US10/286,506 Active 2024-11-16 US7263517B2 (en) 2002-10-31 2002-10-31 Structured natural language query and knowledge system
US11/846,428 Expired - Lifetime US7620542B2 (en) 2002-10-31 2007-08-28 Structured natural language query and knowledge system

Country Status (2)

Country Link
US (3) US7263517B2 (en)
JP (3) JP4282411B2 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7788250B2 (en) 2006-08-04 2010-08-31 Mohammad Salman Flexible request and response communications interfaces
US20110125773A1 (en) * 2009-11-25 2011-05-26 International Business Machines Corporation Logical Object Search Framework and Application Programming Interface
US8572101B2 (en) 2011-01-10 2013-10-29 International Business Machines Corporation Faceted interaction interface to object relational data
WO2017031082A1 (en) * 2015-08-14 2017-02-23 California Institute Of Technology Algebraic query language (aql) database management system
WO2019023088A1 (en) * 2017-07-24 2019-01-31 Biomedical Objects, Inc. Structured natural language knowledge system
US10430407B2 (en) 2015-12-02 2019-10-01 International Business Machines Corporation Generating structured queries from natural language text
US11200227B1 (en) * 2019-07-31 2021-12-14 Thoughtspot, Inc. Lossless switching between search grammars
US11379453B2 (en) * 2017-06-02 2022-07-05 Palantir Technologies Inc. Systems and methods for retrieving and processing data

Families Citing this family (130)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2400161C (en) 2000-02-22 2015-11-24 Metacarta, Inc. Spatially coding and displaying information
US8244702B2 (en) * 2002-02-26 2012-08-14 International Business Machines Corporation Modification of a data repository based on an abstract data representation
US6996558B2 (en) 2002-02-26 2006-02-07 International Business Machines Corporation Application portability and extensibility through database schema and query abstraction
EP1567941A2 (en) 2002-11-28 2005-08-31 Koninklijke Philips Electronics N.V. Method to assign word class information
US11132183B2 (en) 2003-08-27 2021-09-28 Equifax Inc. Software development platform for testing and modifying decision algorithms
US7970698B2 (en) * 2003-08-27 2011-06-28 Equifax, Inc. Application processing and decision systems and processes
US20050120300A1 (en) * 2003-09-25 2005-06-02 Dictaphone Corporation Method, system, and apparatus for assembly, transport and display of clinical data
CA2447458A1 (en) * 2003-10-29 2005-04-29 Ibm Canada Limited - Ibm Canada Limitee System and method for managing query access to information
US7900133B2 (en) 2003-12-09 2011-03-01 International Business Machines Corporation Annotation structure type determination
US7822598B2 (en) * 2004-02-27 2010-10-26 Dictaphone Corporation System and method for normalization of a string of words
FR2868588A1 (en) * 2004-04-02 2005-10-07 France Telecom VOICE APPLICATION SYSTEM
US20050264554A1 (en) * 2004-05-25 2005-12-01 Deming James L Tile based rendering of smooth points using polygons
US7707490B2 (en) 2004-06-23 2010-04-27 Microsoft Corporation Systems and methods for flexible report designs including table, matrix and hybrid designs
US20060026498A1 (en) * 2004-07-30 2006-02-02 Microsoft Corporation Systems and methods for controlling report properties based on aggregate scope
US7559023B2 (en) 2004-08-27 2009-07-07 Microsoft Corporation Systems and methods for declaratively controlling the visual state of items in a report
US20060116999A1 (en) * 2004-11-30 2006-06-01 International Business Machines Corporation Sequential stepwise query condition building
US7624097B2 (en) * 2005-01-14 2009-11-24 International Business Machines Corporation Abstract records
US8122012B2 (en) 2005-01-14 2012-02-21 International Business Machines Corporation Abstract record timeline rendering/display
US7689555B2 (en) * 2005-01-14 2010-03-30 International Business Machines Corporation Context insensitive model entity searching
US8095553B2 (en) * 2005-03-17 2012-01-10 International Business Machines Corporation Sequence support operators for an abstract database
US8200676B2 (en) 2005-06-28 2012-06-12 Nokia Corporation User interface for geographic search
US8666928B2 (en) 2005-08-01 2014-03-04 Evi Technologies Limited Knowledge repository
US7747937B2 (en) * 2005-08-16 2010-06-29 Rojer Alan S Web bookmark manager
US7440945B2 (en) * 2005-11-10 2008-10-21 International Business Machines Corporation Dynamic discovery of abstract rule set required inputs
US7444332B2 (en) * 2005-11-10 2008-10-28 International Business Machines Corporation Strict validation of inference rule based on abstraction environment
US20070112827A1 (en) * 2005-11-10 2007-05-17 International Business Machines Corporation Abstract rule sets
US8166020B2 (en) * 2005-12-22 2012-04-24 Oracle International Corporation Query generator
US7979267B2 (en) * 2006-01-06 2011-07-12 Computer Associates Think, Inc. Specifying a subset of dynamic inter-related data
US7966172B2 (en) * 2006-01-06 2011-06-21 Computer Associates Think, Inc. Natural language tool for specifying a subset of dynamic inter-related data
WO2007095224A2 (en) 2006-02-10 2007-08-23 Metacarta, Inc. Systems and methods for spatial thumbnails and companion maps for media objects
US7676460B2 (en) * 2006-03-03 2010-03-09 International Business Machines Corporation Techniques for providing suggestions for creating a search query
US8015183B2 (en) 2006-06-12 2011-09-06 Nokia Corporation System and methods for providing statstically interesting geographical information based on queries to a geographic search engine
US9286404B2 (en) 2006-06-28 2016-03-15 Nokia Technologies Oy Methods of systems using geographic meta-metadata in information retrieval and document displays
US9721157B2 (en) 2006-08-04 2017-08-01 Nokia Technologies Oy Systems and methods for obtaining and using information from map images
US20080056538A1 (en) * 2006-08-04 2008-03-06 Metacarta, Inc. Systems and methods for obtaining and using information from map images
US8612283B1 (en) * 2006-06-30 2013-12-17 At&T Intellectual Property Ii, L.P. Method and apparatus for evaluating the cost of operating a network infrastructure
US8196039B2 (en) 2006-07-07 2012-06-05 International Business Machines Corporation Relevant term extraction and classification for Wiki content
US8560956B2 (en) 2006-07-07 2013-10-15 International Business Machines Corporation Processing model of an application wiki
US20080010387A1 (en) * 2006-07-07 2008-01-10 Bryce Allen Curtis Method for defining a Wiki page layout using a Wiki page
US20080010386A1 (en) * 2006-07-07 2008-01-10 Bryce Allen Curtis Method and apparatus for client wiring model
US8219900B2 (en) 2006-07-07 2012-07-10 International Business Machines Corporation Programmatically hiding and displaying Wiki page layout sections
US20080010388A1 (en) * 2006-07-07 2008-01-10 Bryce Allen Curtis Method and apparatus for server wiring model
US20080010345A1 (en) * 2006-07-07 2008-01-10 Bryce Allen Curtis Method and apparatus for data hub objects
US7954052B2 (en) * 2006-07-07 2011-05-31 International Business Machines Corporation Method for processing a web page for display in a wiki environment
US8775930B2 (en) 2006-07-07 2014-07-08 International Business Machines Corporation Generic frequency weighted visualization component
US20080065769A1 (en) * 2006-07-07 2008-03-13 Bryce Allen Curtis Method and apparatus for argument detection for event firing
US20080010338A1 (en) * 2006-07-07 2008-01-10 Bryce Allen Curtis Method and apparatus for client and server interaction
US20080016047A1 (en) * 2006-07-12 2008-01-17 Dettinger Richard D System and method for creating and populating dynamic, just in time, database tables
US20080016048A1 (en) * 2006-07-12 2008-01-17 Dettinger Richard D Intelligent condition pruning for size minimization of dynamic, just in time tables
US8589869B2 (en) 2006-09-07 2013-11-19 Wolfram Alpha Llc Methods and systems for determining a formula
US8321402B2 (en) * 2006-12-14 2012-11-27 Sap Ag Generic application interface for searching
US20080275860A1 (en) * 2007-05-03 2008-11-06 Alex Zakonov Seamless design
US8140557B2 (en) 2007-05-15 2012-03-20 International Business Machines Corporation Ontological translation of abstract rules
US8700597B2 (en) * 2007-08-07 2014-04-15 Equifax, Inc. Systems and methods for managing statistical expressions
US8200257B2 (en) * 2007-08-30 2012-06-12 Yahoo! Inc. Customizable mobile message services
US7984032B2 (en) * 2007-08-31 2011-07-19 Microsoft Corporation Iterators for applying term occurrence-level constraints in natural language searching
US7885943B1 (en) 2007-10-02 2011-02-08 Emc Corporation IT compliance rules
US8024320B1 (en) 2007-10-02 2011-09-20 Emc Corporation Query language
US8838659B2 (en) 2007-10-04 2014-09-16 Amazon Technologies, Inc. Enhanced knowledge repository
US8732155B2 (en) * 2007-11-16 2014-05-20 Iac Search & Media, Inc. Categorization in a system and method for conducting a search
US7809721B2 (en) * 2007-11-16 2010-10-05 Iac Search & Media, Inc. Ranking of objects using semantic and nonsemantic features in a system and method for conducting a search
US20090132927A1 (en) * 2007-11-16 2009-05-21 Iac Search & Media, Inc. User interface and method for making additions to a map
US20090132486A1 (en) * 2007-11-16 2009-05-21 Iac Search & Media, Inc. User interface and method in local search system with results that can be reproduced
US8090714B2 (en) * 2007-11-16 2012-01-03 Iac Search & Media, Inc. User interface and method in a local search system with location identification in a request
US20090132514A1 (en) * 2007-11-16 2009-05-21 Iac Search & Media, Inc. method and system for building text descriptions in a search database
US20090132572A1 (en) * 2007-11-16 2009-05-21 Iac Search & Media, Inc. User interface and method in a local search system with profile page
US20090132485A1 (en) * 2007-11-16 2009-05-21 Iac Search & Media, Inc. User interface and method in a local search system that calculates driving directions without losing search results
US20090132929A1 (en) * 2007-11-16 2009-05-21 Iac Search & Media, Inc. User interface and method for a boundary display on a map
US7921108B2 (en) 2007-11-16 2011-04-05 Iac Search & Media, Inc. User interface and method in a local search system with automatic expansion
US20090132512A1 (en) * 2007-11-16 2009-05-21 Iac Search & Media, Inc. Search system and method for conducting a local search
US20090132645A1 (en) * 2007-11-16 2009-05-21 Iac Search & Media, Inc. User interface and method in a local search system with multiple-field comparison
US20090132513A1 (en) * 2007-11-16 2009-05-21 Iac Search & Media, Inc. Correlation of data in a system and method for conducting a search
US20090132505A1 (en) * 2007-11-16 2009-05-21 Iac Search & Media, Inc. Transformation in a system and method for conducting a search
US20090132573A1 (en) * 2007-11-16 2009-05-21 Iac Search & Media, Inc. User interface and method in a local search system with search results restricted by drawn figure elements
US20090132236A1 (en) * 2007-11-16 2009-05-21 Iac Search & Media, Inc. Selection or reliable key words from unreliable sources in a system and method for conducting a search
US8145703B2 (en) * 2007-11-16 2012-03-27 Iac Search & Media, Inc. User interface and method in a local search system with related search results
JP5220483B2 (en) * 2008-06-06 2013-06-26 インターナショナル・ビジネス・マシーンズ・コーポレーション Computer system for performing aggregate calculation on tree-structured data, method and computer program therefor
US8417513B2 (en) * 2008-06-06 2013-04-09 Radiant Logic Inc. Representation of objects and relationships in databases, directories, web services, and applications as sentences as a method to represent context in structured data
US8375014B1 (en) * 2008-06-19 2013-02-12 BioFortis, Inc. Database query builder
US8316012B2 (en) * 2008-06-27 2012-11-20 SAP France S.A. Apparatus and method for facilitating continuous querying of multi-dimensional data streams
US8180771B2 (en) 2008-07-18 2012-05-15 Iac Search & Media, Inc. Search activity eraser
US9367608B1 (en) * 2009-01-07 2016-06-14 Guangsheng Zhang System and methods for searching objects and providing answers to queries using association data
US9201927B1 (en) * 2009-01-07 2015-12-01 Guangsheng Zhang System and methods for quantitative assessment of information in natural language contents and for determining relevance using association data
US20100185603A1 (en) * 2009-01-09 2010-07-22 Phibbs Paul H Techniques for using database rule results
US9805089B2 (en) 2009-02-10 2017-10-31 Amazon Technologies, Inc. Local business and product search system and method
US8150882B2 (en) * 2009-03-03 2012-04-03 Microsoft Corporation Mapping from objects to data model
US8433559B2 (en) * 2009-03-24 2013-04-30 Microsoft Corporation Text analysis using phrase definitions and containers
US8601015B1 (en) 2009-05-15 2013-12-03 Wolfram Alpha Llc Dynamic example generation for queries
US8788524B1 (en) 2009-05-15 2014-07-22 Wolfram Alpha Llc Method and system for responding to queries in an imprecise syntax
US8135730B2 (en) * 2009-06-09 2012-03-13 International Business Machines Corporation Ontology-based searching in database systems
US20110055268A1 (en) * 2009-08-27 2011-03-03 Chen-Yu Sheu Search system based on structured natural languages
US8689175B2 (en) * 2010-03-03 2014-04-01 Ebay Inc. Business rules management system
US8484015B1 (en) 2010-05-14 2013-07-09 Wolfram Alpha Llc Entity pages
US9110882B2 (en) 2010-05-14 2015-08-18 Amazon Technologies, Inc. Extracting structured knowledge from unstructured text
US8812298B1 (en) 2010-07-28 2014-08-19 Wolfram Alpha Llc Macro replacement of natural language input
US8666785B2 (en) * 2010-07-28 2014-03-04 Wairever Inc. Method and system for semantically coding data providing authoritative terminology with semantic document map
US8660836B2 (en) 2011-03-28 2014-02-25 International Business Machines Corporation Optimization of natural language processing system based on conditional output quality at risk
US9069814B2 (en) 2011-07-27 2015-06-30 Wolfram Alpha Llc Method and system for using natural language to generate widgets
US8510328B1 (en) * 2011-08-13 2013-08-13 Charles Malcolm Hatton Implementing symbolic word and synonym English language sentence processing on computers to improve user automation
US9501918B2 (en) 2011-08-24 2016-11-22 Safetyminded Holdings, Inc. Human safety indicator
US8601016B2 (en) * 2011-08-30 2013-12-03 International Business Machines Corporation Pre-generation of structured query language (SQL) from application programming interface (API) defined query systems
US9734252B2 (en) 2011-09-08 2017-08-15 Wolfram Alpha Llc Method and system for analyzing data using a query answering system
US9851950B2 (en) 2011-11-15 2017-12-26 Wolfram Alpha Llc Programming in a precise syntax using natural language
GB2503223A (en) * 2012-06-19 2013-12-25 Ibm Redrafting text strings using a vocabulary
US9405424B2 (en) 2012-08-29 2016-08-02 Wolfram Alpha, Llc Method and system for distributing and displaying graphical items
US9501585B1 (en) * 2013-06-13 2016-11-22 DataRPM Corporation Methods and system for providing real-time business intelligence using search-based analytics engine
US9092563B1 (en) * 2013-09-27 2015-07-28 Emc Corporation System for discovering bugs using interval algebra query language
US20150100516A1 (en) * 2013-10-07 2015-04-09 Kevin Hicks Method of analyzing and scoring cosmetic products
US10324964B2 (en) * 2014-01-16 2019-06-18 Massachusetts Institute Of Technology Method and systems for enhanced ontology assisted querying of data stores
US9785669B2 (en) * 2014-05-21 2017-10-10 International Business Machines Corporation Revising policy statements using hyperlinks
US11238101B1 (en) * 2014-09-05 2022-02-01 Soundhound, Inc. System and method for interpreting natural language commands with compound criteria
US11138205B1 (en) 2014-12-22 2021-10-05 Soundhound, Inc. Framework for identifying distinct questions in a composite natural language query
US10146751B1 (en) 2014-12-31 2018-12-04 Guangsheng Zhang Methods for information extraction, search, and structured representation of text data
US20170075953A1 (en) * 2015-09-11 2017-03-16 Google Inc. Handling failures in processing natural language queries
US10282444B2 (en) 2015-09-11 2019-05-07 Google Llc Disambiguating join paths for natural language queries
US11301502B1 (en) 2015-09-15 2022-04-12 Google Llc Parsing natural language queries without retraining
US9959311B2 (en) * 2015-09-18 2018-05-01 International Business Machines Corporation Natural language interface to databases
US10191734B1 (en) 2015-12-15 2019-01-29 Open Text Corporation Method and system for software application optimization using natural language-based queries
US11568129B2 (en) * 2017-02-16 2023-01-31 North Carolina State University Spreadsheet recalculation algorithm for directed acyclic graph processing
US10963433B2 (en) * 2018-08-17 2021-03-30 International Business Machines Corporation ASCII based instant object oriented schema generation
US11429627B2 (en) * 2018-09-28 2022-08-30 Splunk Inc. System monitoring driven by automatically determined operational parameters of dependency graph model with user interface
US11620300B2 (en) 2018-09-28 2023-04-04 Splunk Inc. Real-time measurement and system monitoring based on generated dependency graph models of system components
US10809892B2 (en) 2018-11-30 2020-10-20 Microsoft Technology Licensing, Llc User interface for optimizing digital page
US11048876B2 (en) * 2018-11-30 2021-06-29 Microsoft Technology Licensing, Llc Phrase extraction for optimizing digital page
US11031139B2 (en) * 2019-02-08 2021-06-08 General Electric Company Systems and methods for conversational flexible data presentation
US11600397B2 (en) * 2019-02-08 2023-03-07 General Electric Company Systems and methods for conversational flexible data presentation
US11048696B2 (en) * 2019-06-24 2021-06-29 Workiva Inc. Method and computing device for generating a search query for a graph database
US11604790B2 (en) 2020-08-31 2023-03-14 Unscrambl Inc Conversational interface for generating and executing controlled natural language queries on a relational database
US11775511B2 (en) 2021-08-03 2023-10-03 International Business Machines Corporation Feedback-updated data retrieval chatbot
CN116127047B (en) * 2023-04-04 2023-08-01 北京大学深圳研究生院 Method and device for establishing enterprise information base

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5696962A (en) * 1993-06-24 1997-12-09 Xerox Corporation Method for computerized information retrieval using shallow linguistic analysis
US5884302A (en) * 1996-12-02 1999-03-16 Ho; Chi Fai System and method to answer a question
US5924089A (en) * 1996-09-03 1999-07-13 International Business Machines Corporation Natural language translation of an SQL query
US6026388A (en) * 1995-08-16 2000-02-15 Textwise, Llc User interface and other enhancements for natural language information retrieval system and method
US6304864B1 (en) * 1999-04-20 2001-10-16 Textwise Llc System for retrieving multimedia information from the internet using multiple evolving intelligent agents
US20020156772A1 (en) * 1999-12-02 2002-10-24 International Business Machines Generating one or more XML documents from a single SQL query
US6601026B2 (en) * 1999-09-17 2003-07-29 Discern Communications, Inc. Information retrieval by natural language querying
US6615172B1 (en) * 1999-11-12 2003-09-02 Phoenix Solutions, Inc. Intelligent query engine for processing voice based queries
US6704747B1 (en) * 1999-03-16 2004-03-09 Joseph Shi-Piu Fong Method and system for providing internet-based database interoperability using a frame model for universal database
US20060036448A1 (en) * 2001-06-13 2006-02-16 Caminus Corporation System architecture and method for energy industry trading and transaction management
US7027974B1 (en) * 2000-10-27 2006-04-11 Science Applications International Corporation Ontology-based parser for natural language processing

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE9003004D0 (en) * 1990-09-21 1990-09-21 Ibm ICON BASED QUERY SYSTEM
JP3707133B2 (en) * 1996-07-08 2005-10-19 富士ゼロックス株式会社 Document database management apparatus and document database management method
JP2000099542A (en) * 1998-09-25 2000-04-07 Ricoh Co Ltd Inquiry of production result and inquiry device of database
JP2002532788A (en) * 1998-12-04 2002-10-02 テクノロジー エネイブリング カンパニー エルエルシー System and method for organizing data
JP2001014318A (en) * 1999-06-28 2001-01-19 Toshiba Corp Database retrieving device and storage medium storing program

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5696962A (en) * 1993-06-24 1997-12-09 Xerox Corporation Method for computerized information retrieval using shallow linguistic analysis
US6026388A (en) * 1995-08-16 2000-02-15 Textwise, Llc User interface and other enhancements for natural language information retrieval system and method
US5924089A (en) * 1996-09-03 1999-07-13 International Business Machines Corporation Natural language translation of an SQL query
US5884302A (en) * 1996-12-02 1999-03-16 Ho; Chi Fai System and method to answer a question
US6704747B1 (en) * 1999-03-16 2004-03-09 Joseph Shi-Piu Fong Method and system for providing internet-based database interoperability using a frame model for universal database
US6304864B1 (en) * 1999-04-20 2001-10-16 Textwise Llc System for retrieving multimedia information from the internet using multiple evolving intelligent agents
US6601026B2 (en) * 1999-09-17 2003-07-29 Discern Communications, Inc. Information retrieval by natural language querying
US6615172B1 (en) * 1999-11-12 2003-09-02 Phoenix Solutions, Inc. Intelligent query engine for processing voice based queries
US20020156772A1 (en) * 1999-12-02 2002-10-24 International Business Machines Generating one or more XML documents from a single SQL query
US20030014397A1 (en) * 1999-12-02 2003-01-16 International Business Machines Corporation Generating one or more XML documents from a relational database using XPath data model
US7027974B1 (en) * 2000-10-27 2006-04-11 Science Applications International Corporation Ontology-based parser for natural language processing
US20060036448A1 (en) * 2001-06-13 2006-02-16 Caminus Corporation System architecture and method for energy industry trading and transaction management

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7788250B2 (en) 2006-08-04 2010-08-31 Mohammad Salman Flexible request and response communications interfaces
US20110125773A1 (en) * 2009-11-25 2011-05-26 International Business Machines Corporation Logical Object Search Framework and Application Programming Interface
US9165043B2 (en) * 2009-11-25 2015-10-20 Maobing Jin Logical object search framework and application programming interface
US8572101B2 (en) 2011-01-10 2013-10-29 International Business Machines Corporation Faceted interaction interface to object relational data
WO2017031082A1 (en) * 2015-08-14 2017-02-23 California Institute Of Technology Algebraic query language (aql) database management system
US10915531B2 (en) 2015-08-14 2021-02-09 California Institute Of Technology Algebraic query language (AQL) database management system
US10430407B2 (en) 2015-12-02 2019-10-01 International Business Machines Corporation Generating structured queries from natural language text
US11068480B2 (en) 2015-12-02 2021-07-20 International Business Machines Corporation Generating structured queries from natural language text
US11379453B2 (en) * 2017-06-02 2022-07-05 Palantir Technologies Inc. Systems and methods for retrieving and processing data
WO2019023088A1 (en) * 2017-07-24 2019-01-31 Biomedical Objects, Inc. Structured natural language knowledge system
US11200227B1 (en) * 2019-07-31 2021-12-14 Thoughtspot, Inc. Lossless switching between search grammars
US11803543B2 (en) 2019-07-31 2023-10-31 Thoughtspot, Inc. Lossless switching between search grammars

Also Published As

Publication number Publication date
US20040088158A1 (en) 2004-05-06
US7263517B2 (en) 2007-08-28
US7620542B2 (en) 2009-11-17
JP4282411B2 (en) 2009-06-24
JP2004152259A (en) 2004-05-27
JP2009076086A (en) 2009-04-09
JP2009076085A (en) 2009-04-09
US20070294233A1 (en) 2007-12-20

Similar Documents

Publication Publication Date Title
US7620542B2 (en) Structured natural language query and knowledge system
US7865491B2 (en) Model entity operations in query results
US7152073B2 (en) Method and system for defining sets by querying relational data using a set definition language
US7912862B2 (en) Relational schema format
US6574623B1 (en) Query transformation and simplification for group by queries with rollup/grouping sets in relational database management systems
US8370375B2 (en) Method for presenting database query result sets using polymorphic output formats
EP0455447B1 (en) Apparatus and method for adding an associative query capability to a programming language
US7739223B2 (en) Mapping architecture for arbitrary data models
US6980976B2 (en) Combined database index of unstructured and structured columns
US6449609B1 (en) Using materialized view to process a related query containing a one to many lossless join
US6449606B1 (en) Using a materialized view to process a related query containing an antijoin
US20060116999A1 (en) Sequential stepwise query condition building
US20030163461A1 (en) Method and system for defining sets by querying relational data using a set definition language
US20030037039A1 (en) Schema for SQL statements
US20080270447A1 (en) Ruleset generation for multiple entities with multiple data values per attribute
JP2009530735A (en) Extended query language that supports rich data types
US20080172408A1 (en) Converting Recursive Hierarchical Data to Relational Data
JPH06290102A (en) Equipment and method for accessing information
US7836071B2 (en) Displaying relevant abstract database elements
US20080168042A1 (en) Generating summaries for query results based on field definitions
CN101493830A (en) Structured natural language query and knowledge system
US8090737B2 (en) User dictionary term criteria conditions
de Paula Braganholo et al. Updating relational databases through XML views
CN100485666C (en) Structured natural language inquiry and knowledge system
US20090119277A1 (en) Differentiation of field attributes as value constraining versus record set constraining

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION