US20150302050A1 - Generation of requests to a data processing system - Google Patents

Generation of requests to a data processing system Download PDF

Info

Publication number
US20150302050A1
US20150302050A1 US14/402,888 US201214402888A US2015302050A1 US 20150302050 A1 US20150302050 A1 US 20150302050A1 US 201214402888 A US201214402888 A US 201214402888A US 2015302050 A1 US2015302050 A1 US 2015302050A1
Authority
US
United States
Prior art keywords
data
request
description
lexeme
query language
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US14/402,888
Inventor
Joerg Wurzer
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Iqser Holding AG
Original Assignee
Iqser IP AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Iqser IP AG filed Critical Iqser IP AG
Assigned to IQSER IP AG reassignment IQSER IP AG ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: WURZER, JOERG, DR.
Publication of US20150302050A1 publication Critical patent/US20150302050A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • G06F17/30401
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • G06F16/243Natural language query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/242Query formulation
    • G06F16/2423Interactive query statement specification based on a database schema
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2452Query translation
    • G06F16/24522Translation of natural language queries to structured queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2453Query optimisation
    • G06F17/30442
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/268Morphological analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates

Definitions

  • the present invention relates to a method for an at least semiautomatic generation of a request to a data processing system with a data pool respectively data stock, wherein a data model, which is usable for the selection of data from the data pool using a query language, forms the basis of the data pool, wherein a user's request made in natural language, which takes place by a user input of terms (subsequently also referred to as user terms) captured as a series of characters (subsequently also referred to as character string), wherein the terms semantically describe the user intended results of the answer of the data processing system on the request that has to be carried out by the data processing system, is analyzed and/or interpreted, and converted into a request in the query language.
  • a data model which is usable for the selection of data from the data pool using a query language, forms the basis of the data pool
  • a user's request made in natural language which takes place by a user input of terms (subsequently also referred to as user terms) captured as a series of characters (subsequently also referred to as character string
  • so called Information-Retrieval-Systems for example so called search engines respectively Internet search engines
  • various methods for the retrieval of data from a data pool are known.
  • the basis for the data pool is thereby formed by a data model, which enables a request of data from the data pool using a query language of the data processing system.
  • the data of the data pool is structured regarding the respective data model, particularly into data classes according to a certain semantic classification concerning the content type and/or for data attributes concerning meta data.
  • the user respectively the operator of a data processing system does not know the data model on which the data pool is based on, thus the available data classes and/or those data attributes, it is difficult to place a precise request respectively search request to the data processing system.
  • the user respectively the operator would at first have to decide, if he wants to search for certain data classes, meaning a certain semantic classification regarding the content type, to search for data attributes, meaning meta data, or to search for conjunctions, which describe the desired information regarding its content, normally as a keyword. This is described in more detail using the following examples:
  • semantic search engine concepts are known, wherein it shall be achieved to capture the semantic of a user's respectively an operator's request in natural language in the form of text input and to convert it into one or more corresponding requests in the query language of the data processing system.
  • the concepts known so far do not fulfil the requirements regarding the degree of complexity of such requests and also regarding the delivered answers respectively results to such requests and further comprise several constraints, particularly language constraints with respect to the evaluation.
  • the present invention is based on the problem to enable user requests in natural language to a data processing system avoiding the previously described drawbacks.
  • the present invention proposes a method for an at least semiautomatic generation of a request to a data processing system with a data pool, wherein a data model, which is usable for the selection of data from the data pool using a query language, forms the basis of the data pool, wherein a user's request made in natural language, which takes place by a user input of terms (user terms) captured as a series of characters (character string), wherein the terms semantically describe the user intended results of the answer of the data processing system on the request that has to be carried out by the data processing system, is analyzed and/or interpreted, and converted into a request in the query language, which is characterized in that the request in natural language captured as a character string is resolved into lexemes, those lexemes are sequentially processed equivalent to their sequence, wherein a meaning of the particular lexeme is determined on the basis of the data model, and a request in the query language is created out of the determined meaning.
  • a data model which is usable for the selection of data from the data pool
  • a lexeme in terms of the present invention is a meaning unit of linguistic units which can be manifested in different forms of words and thereby does neither consider the precise form of the linguistic unit nor the precise syntactic function of the linguistic unit.
  • lexemes conform syntactic words for the greatest possible extent, with the difference that particularly also Asian languages and the like are considered for which no syntactical recognizable division of words within a sentence is known.
  • Syntactic words in terms of the present invention are units that can be shifted within a sentence, which can be swapped and/or divided by insertion of further words. Words in terms of the present invention are thereby the smallest moveable and replaceable units of a sentence concerning the syntactic aspect.
  • the usage of the term lexeme in terms of the present invention considers that a given word is reduced to a grammatical basic form in order to enable syntactic-grammatical variants to be balanced.
  • a request respectively a search request according to the invention is advantageously characterized in that the user respectively the operator intuitively describes the information he is looking for in an input field respectively text field with several words. This request is then divided into its lexical parts and translated into a request in the query language, which goes beyond the mere full text search by combination of key words.
  • An advantageous embodiment of the invention provides that the user input is done by speech control similar input assistance.
  • the data processing system advantageously comprises means for speech control and means for the conversion of verbal expressions into a series of characters.
  • An advantageous embodiment of the invention provides that for each particular lexeme to be used respectively to be processed a check is carried out if a subsequent lexeme in the order is existing, wherein it is checked in this case, if the lexeme currently to be processed is the description of a data source, the description of a data class, or the description of a data attribute of the data model. Inventively it is first checked if the user's request in natural language comprises multiple terms. If this is the case the first term is checked whether a data source, a data class, meaning a semantic classification regarding a content type, or a data attribute, meaning meta data, is existing with this term respectively with this term description.
  • a further advantageous embodiment of the invention provides that in case the description of a data source or the description of a data class is present, the description is transformed into a parameter of the query language for the selection of data from the data pool, wherein the parameter narrows down the selection of data on data with the description of the data source or the description of the data class.
  • the data source is only checked for the first lexeme in the sequence.
  • a data source detected in this way accordingly limits the search area for the request.
  • the term “Amazon” would for example only search in the data which is provided by the company Amazon. If the first term is a data class, the search would accordingly be limited to semantically classified contents regarding a content type. For example a search for the term “book” would only be performed in data which is classified as books from its content type.
  • the information for that is received by the data processing system by usage of the available data model or in an optional embodiment of the invention advantageously by an ontology, in which data classes representing semantic content types can contain respectively comprise different descriptions, so called literary figures, in different languages.
  • the data processing system can further comprise or use dictionaries, vocabularies or the like as an ontology-giving respectively—proving source of information.
  • a further advantageous embodiment of the invention provides that it is checked for the present lexeme to be processed if a subsequent lexeme in the sequence is existing, wherein it is checked in this case if the lexeme currently to be processed is the description of a data source, the description of a data class or a data attribute of the data model.
  • a further advantageous embodiment of the invention provides that it is checked if the lexeme currently to be used is a filler word, wherein the lexeme currently to be processed is ignored in this case and the subsequent lexeme in the sequence is used.
  • the second term of the user's request in natural language is thus analyzed respectively examined, wherein the second term can be a data class, if the first term of the user's request in natural language limits the request to a data source, or wherein the second term can be a search word, if the first term in the user's natural language conforms a data class representing a semantic content type, or wherein the second term can be a data attribute.
  • a further advantageous embodiment of the invention provides that in case the description of a data attribute is present, the description and the subsequent lexeme in the sequence are transformed into a parameter of the query language as a name-value-pair for the selection of data from the data pool, wherein the parameter narrows down the selection of data to data with the description of the data attribute.
  • the data processing system receives the information required for that by usage of the available data model or in an optional embodiment of the invention from an ontology, in which data attributes (meta data) can contain respectively comprise different descriptions, so called literary figures, in different languages.
  • the data processing system can further comprise or use dictionaries, vocabularies or the like as an ontology-giving respectively—proving source of information.
  • the request respectively the search to the data processing system is inventively narrowed down by amending the request with the attribute value respectively the meta date in this case. This narrowing-down is then used for the subsequent term in the user's request in natural language.
  • the description of a data class in case the description of a data class is present, it is checked if the description is also the description of a data attribute, wherein in this case the currently to be processed lexeme and the subsequent lexeme in the sequence are transformed into a parameter of the query language as a name-value-pair for the selection of data from the data pool, wherein the parameter alternatively narrows down the selection of data to data with the description of the data attribute.
  • Both parameters which on the one hand describe the data class and on the other hand the data attribute, are combined as an OR-conjunction for request of data.
  • a further advantageous embodiment of the invention is characterized in that in case no data source, no data class, no data attribute or no remaining lexeme in the sequence in the character string is described, the currently to be processed lexeme is transformed to a parameter of the query language for the selection of data from the data pool, wherein the parameter narrows down the selection of data to data with the currently to be processed lexeme as a free-text search.
  • the next term of the user's request in natural language can be a keyword which neither represents a data class nor a data attribute and thus advantageously is used for a limiting free text search respectively full text search. If the next term of the user's request in natural language is a data class again, meaning a semantically classified content type, this is inventively interpreted such, that the results search respectively intended with the request have to be linked to contents of this type, meaning this data class.
  • the data class which describes a link
  • the previous part of the request to be created in the query language is completed by limitation of this new data attribute and is further completed by adding a logical “OR” to the until then partially created request in the query language.
  • this includes using the term following on the description of the data attribute (attribute name) as the attribute value.
  • the complexity of the request in the query language is advantageously limited to a single relation.
  • the user input takes place using at least one input assistance, particularly for the semiautomatic creation of complex request.
  • the user respectively the operator is guided automatically in formulating a request.
  • a list of possible data sources, data classes (semantic content types) and/or data attributes for a selectable input, which can be selected by clicking with the mouse pointer or the like, is shown to the user during the user's input of characters of the character string, preferably during input of the first word.
  • the entered character string meaning the first word of the user's input
  • a data class meaning a semantic content type
  • a list of possible data attributes (meta data) for an input is shown to the user for the second word of the character string of the user's input.
  • the display of possible selectable inputs is advantageously carried out by visualization in terms of a so called “drop down”-list which is known from so called internet browsers.
  • the user's input is carried out for complex or more complex queries using a graphical user interface (GUI) by creation of a request-tree.
  • GUI graphical user interface
  • the user respectively the operator can compose a request-tree or can concatenate statements, which are usable for the request, in a list for the request in the form of subject—predicate—object.
  • Such a tree or graph representing “subject—predicate—object”—relations is known for example from WO 2009/030288 A1, whose disclosure is explicitly referenced here.
  • the user advantageously defines the root of the request-tree for the inventive creation of the request-tree. For that the user can chose the data class, meaning the semantic classification of a content type, from a list, which is provided by the data processing system respectively by the input assistance, preferably with a graphical user interface or with so called mouse pointers or the like.
  • the user can add any attributes.
  • the user can also select from a list of possible data attributes which exist for the previously selected data class as a semantic content type in the data model.
  • the data attributes are selectively concatenated by a logical “AND” or “OR.
  • For each of the attribute names respectively each attribute description advantageously multiple attribute values can be specified which are concatenated by a logical “OR”. If the user does not select an attribute name, a free text search respectively a full text search with the stored keywords is initiated.
  • the user can add limiting conjunctions to the object defined this way, comprising the data class which forms the root of the request-tree and/or the data attributes for this data class. For that the user can select those limiting conjunctions for the defined object from a list as well.
  • the user can limit the conjunction himself by selection of a predicate, meaning the reason for a conjunction.
  • the data processing system and/or the input assistance provides a selection list as well.
  • the user can save the request-tree as a bookmark in a database or as a new data class of a semantic content type in an ontology, wherein the ontology can subsequently be used by the data processing system for further requests and/or request-trees.
  • the request-tree created with the inventive input assistance is displayed by the graphical user interface (GUI).
  • GUI graphical user interface
  • a window with a list is shown to the user when selecting a node of the request-tree, preferably by double clicking the node, in which the data class and the data attributes as well as the attribute values can be defined.
  • a double click on an edge of the request-tree opens a selection list for justification of the conjunction, meaning the predicate.
  • Object of the present invention is further a data processing system respectively a data processing unit, comprising a processor, which is designed to respectively prepared to carry out an inventive method automatically respectively automated, particularly by loading a corresponding application program realizing the method from a storage means and execution of the application program by the processor.
  • FIG. 1 in a schematic diagram an embodiment for an inventive request to a data processing system
  • FIG. 2 in a flow chart ( FIGS. 2 a to 2 e ) an embodiment for a realization of an inventive request to a data processing system;
  • FIG. 3 in a flow chart an embodiment for a realization of an inventive input assistance for the semiautomatic creation of a request to a data processing system
  • FIG. 4 in a schematic diagram an embodiment for an inventive input assistance for the semiautomatic creation of a request to the data processing system
  • FIG. 5 in a flow chart an embodiment for a realization of a display of results of an inventive request to a data processing system.
  • FIG. 1 shows a data processing system 1 respectively 1 ′ with a data pool 2 , which is based on a data model that is usable respectively processible to select data from the data pool 2 using a query language of the data processing system 1 .
  • the data processing system 1 comprises an apparatus 5 for analysis and/or interpretation of users' 3 requests 6 in natural language.
  • a user's 3 request in natural language is captured by a user input 4 of user terms, wherein the user terms are captured as a series of characters.
  • the user terms thereby semantically describe the user 3 intended results to the answer of the data processing system 1 on the request that has to be carried out by the data processing system 1 .
  • the input 4 of user terms can be carried out by the user 3 manually using buttons of an input device (not explicitly shown in FIG. 1 ) of the user 3 or using voice control (not explicitly shown in FIG. 1 ).
  • a user's 3 request 6 in natural language is analyzed and/or interpreted by apparatus 5 for analysis and/or interpretation of requests in natural language. Therefore the apparatus 5 uses the data model 7 for analysis and/or interpretation of requests 6 in natural language.
  • the meta data thereby are part of the data processing system 1 (marked in FIG. 1 with reference sign 1 ′) or are used from the data processing system 1 by means of a communication connection (not explicitly shown in FIG. 1 ).
  • the apparatus 5 converts the user's 3 request 6 in natural language into a request 9 in the query language of the data processing system 1 and directs this to the index or the database of the data pool 2 of the data processing system 1 .
  • the result of request 9 in the query language to the index of the data pool 2 of data processing system 1 is then delivered to the user, particularly by means of a graphical playback by a display device (not explicitly shown in FIG. 1 ) of user 3
  • FIG. 2 shows in a flow chart (compare FIGS. 2 a to 2 e ) an embodiment for a realization of an inventive request to a data processing system using a query language of the data processing system—hereinafter called query—by an algorithm.
  • the elements highlighted in FIGS. 2 b and 2 e by hatching, are advantageously optional variants of the method and relate on the one hand to the option of a possible semantic double interpretation of a word in case of a conjunction and on the other hand to the option for a filter for filler words, which enables a more intuitive input for the user respectively operator according to a natural language, as well as a more robust interpretation and thus a better result.
  • Embodiment of the inventive method, for which the optional method variants do not apply comprise instead of the optional elements highlighted with the hatching corresponding references from respectively to each corresponding elements of the flow chart that are not highlighted by hatchings.
  • a simple search request is characterized in that the user describes the desired information in an input field (text field), meaning the terms are entered, which semantically describe the user intended results of the answer to the data processing system on the request that has to be carried out by the data processing system (compare FIG. 2 a , reference sign 11 ).
  • This request in natural language (search request 1 ) of the user is then divided into its lexical parts, at present lexemes (compare FIG. 2 a , reference sign 12 ), and translated respectively converted into a request in the query language of the data processing system (compare FIG. 2 e , reference sign 13 ), which goes beyond a mere full text search by combination of key words.
  • a detected data source (compare FIG. 2 a , reference sign 15 ) would narrow down the search area accordingly (compare FIG. 2 a , reference sign 16 ).
  • the term “Amazon” would only cause a search in data that is provided by Amazon.
  • a detected data class (semantic classification (content type)) (compare FIG. 2 b , reference sign 27 , wherein the content type in FIG. 2 is described as object type OT), would narrow down the search, meaning the request in the query language of the data processing system, to correspondingly (content type respectively object type OT) classified contents (compare FIG. 2 b , reference sign 18 respectively FIG. 2 c , reference sign 27 ), for example to the content type respectively the data class “book”.
  • the data processing system 1 fetches this information from the data model of the data processing system 1 . If the request has already been narrowed down by a data class (semantic classification (content type)), the requested contents are narrowed down using a relation R with data that match compare FIG. 2 b , reference sign 18 ) the data class (semantic classification (content type)).
  • the request Q has already been narrowed down by a data class (semantic classification (content type)) and already contains a relation R ( FIG. 2 b , reference sign 24 ), which contains this data class (semantic classification (content type)), the description of the data class (semantic classification (content type)) is used as a key word for a full text search for further limitation of relation R of request Q ( FIG. 2 b , reference sign 25 ).
  • the described method limits the complexity of request Q to a data processing system using a query language of the data processing system (also called query) to a single relation R.
  • a recognized data attribute respectively a recognized attribute name would limit the search to a corresponding attribute value (meta date) (compare FIG. 2 c , reference sign 20 respectively 28 ). If the relation R is not yet an element of request Q, a limitation of query Q is carried out (compare FIG. 2 c , reference sign 28 ).
  • next term of the user's request in natural language is a catchword, which does neither represent a data class (a content type) nor a data attribute (attribute name), this term is used for a limiting full text search (compare FIG. 2 d , reference sign 21 respectively FIG. 2 e , reference sign 29 ).
  • a recognition is carried out if the relation R of request Q already contains a parameter, which limits the request to the word W(n) that shall be used for a so called full text search. If this is not the case, a recognition is carried out such if the word W(n) is a so called stop word ( FIG. 2 e , reference sign 26 ). If this is not the case, the query Q is completed to a full text search with word W(n) ( FIG. 2 e , reference sign 29 ).
  • the flow chart according FIG. 3 shows an embodiment for a realization of an inventive input assistance for the semiautomatic creation of a request to a data processing system.
  • input assistances are provided additionally respectively optionally, which allow a virtual moderation of the search request (query) using the query language of the data processing system:
  • FIG. 3 shows how such an inventive moderation of the search request can be realized by an algorithm using input assistances for the suggestion of object types, attribute names and attribute values.
  • FIG. 4 shows a visualization of a corresponding input assistance (subsequently also called wizard) for the user.
  • the inventive input assistance subsequently also called wizard: the user creates a request-tree or concatenates statements in the form subject-predicate-object in a list. In doing so the following steps are realized:
  • a graphical solution of this input assistance is advantageously given by a graphical presentation of the request-tree created this way (compare the graph above window F in FIG. 4 ).
  • a window with a list is shown to the user (compare FIG. 4 , reference sign F), in which the data class (content type) and the attribute names as well as the attribute values can be defined.
  • reference signs P 11 , P 12 , P 21 opens and shows a selection list for the reason of the conjunction (predicate).
  • FIG. 5 shows in a flow chart an embodiment for a realization of a representation of results of an inventive request to the data processing system.
  • FIG. 2 for example describes an embodiment for an inventive realization of a complex request that was for example created with the input assistance (wizard).
  • the complex request thereby is translated into request parts which are combined with each other.

Abstract

The present invention relates to a method for an at least semiautomatic generation of a request to a data processing system with a data pool, wherein a data model, which is usable for the selection of data from the data pool using a query language, forms the basis of the data pool, wherein a user's request made in natural language, which takes place by a user input of terms (user terms) captured as a series of characters (character string), wherein the terms semantically describe the user intended results of the answer of the data processing system on the request that has to be carried out by the data processing system, is analyzed and/or interpreted, and converted into a request in the query language.

Description

    CROSS-REFERENCE TO RELATED APPLICATION
  • This is the U.S. National Phase Application under 35 U.S.C. §371 of International Patent Application No. PCT/EP2012/002227 filed May 24, 2012.
  • FIELD OF THE INVENTION
  • The present invention relates to a method for an at least semiautomatic generation of a request to a data processing system with a data pool respectively data stock, wherein a data model, which is usable for the selection of data from the data pool using a query language, forms the basis of the data pool, wherein a user's request made in natural language, which takes place by a user input of terms (subsequently also referred to as user terms) captured as a series of characters (subsequently also referred to as character string), wherein the terms semantically describe the user intended results of the answer of the data processing system on the request that has to be carried out by the data processing system, is analyzed and/or interpreted, and converted into a request in the query language.
  • BACKGROUND
  • In the area of so called Information-Retrieval-Systems, for example so called search engines respectively Internet search engines, various methods for the retrieval of data from a data pool are known. The basis for the data pool is thereby formed by a data model, which enables a request of data from the data pool using a query language of the data processing system. Normally the data of the data pool is structured regarding the respective data model, particularly into data classes according to a certain semantic classification concerning the content type and/or for data attributes concerning meta data.
  • If the user respectively the operator of a data processing system does not know the data model on which the data pool is based on, thus the available data classes and/or those data attributes, it is difficult to place a precise request respectively search request to the data processing system. The user respectively the operator would at first have to decide, if he wants to search for certain data classes, meaning a certain semantic classification regarding the content type, to search for data attributes, meaning meta data, or to search for conjunctions, which describe the desired information regarding its content, normally as a keyword. This is described in more detail using the following examples:
      • Example 1: “Artist” is an object, that contains the attribute “artist”, that can be of type “artist” or a person, that is related to art, and that with the reason respectively the description “manufacturer” or “author”.
      • Example 2: An object, that meets the description “Manufacturer of DVD players”, can contain the text “Manufacturer of DVD players”, can hold the attribute “products” with value “DVD player” or can the relation to products of type “DVD player” and can be of type “company” at the same time.
  • Furthermore problems exist for queries which concern ambiguity of contents. Further problems exist for a query respectively a search for contents of a data class, of which only subordinated data classes are considered in the data pool. This is described in more detail using the following examples:
      • Example 1: A search is performed for objects which have an attribute with the name respectively the description “job” and the value “artist”. But also objects have to be found that have an attribute in the German variant with the name respectively the description “Beruf” (English: profession/job) and the value “Künstler” (English: artist).
      • Example 2: A search is performed for objects of type “document”. This includes objects which are for example of type “Microsoft Word” or “Microsoft Excel”.
      • Example 3: A person can have the role employee or also consultant within a project. In this case consultant could be a subclass of the data class “employee”.
  • In the area of Internet search engines “semantic search engine” concepts are known, wherein it shall be achieved to capture the semantic of a user's respectively an operator's request in natural language in the form of text input and to convert it into one or more corresponding requests in the query language of the data processing system. The concepts known so far do not fulfil the requirements regarding the degree of complexity of such requests and also regarding the delivered answers respectively results to such requests and further comprise several constraints, particularly language constraints with respect to the evaluation.
  • Against the background of that the present invention is based on the problem to enable user requests in natural language to a data processing system avoiding the previously described drawbacks.
  • SUMMARY
  • For the technical solution of this problem the present invention proposes a method for an at least semiautomatic generation of a request to a data processing system with a data pool, wherein a data model, which is usable for the selection of data from the data pool using a query language, forms the basis of the data pool, wherein a user's request made in natural language, which takes place by a user input of terms (user terms) captured as a series of characters (character string), wherein the terms semantically describe the user intended results of the answer of the data processing system on the request that has to be carried out by the data processing system, is analyzed and/or interpreted, and converted into a request in the query language, which is characterized in that the request in natural language captured as a character string is resolved into lexemes, those lexemes are sequentially processed equivalent to their sequence, wherein a meaning of the particular lexeme is determined on the basis of the data model, and a request in the query language is created out of the determined meaning.
  • A lexeme in terms of the present invention is a meaning unit of linguistic units which can be manifested in different forms of words and thereby does neither consider the precise form of the linguistic unit nor the precise syntactic function of the linguistic unit. In terms of the present invention lexemes conform syntactic words for the greatest possible extent, with the difference that particularly also Asian languages and the like are considered for which no syntactical recognizable division of words within a sentence is known. Syntactic words in terms of the present invention are units that can be shifted within a sentence, which can be swapped and/or divided by insertion of further words. Words in terms of the present invention are thereby the smallest moveable and replaceable units of a sentence concerning the syntactic aspect. Furthermore the usage of the term lexeme in terms of the present invention considers that a given word is reduced to a grammatical basic form in order to enable syntactic-grammatical variants to be balanced.
  • A request respectively a search request according to the invention is advantageously characterized in that the user respectively the operator intuitively describes the information he is looking for in an input field respectively text field with several words. This request is then divided into its lexical parts and translated into a request in the query language, which goes beyond the mere full text search by combination of key words. An advantageous embodiment of the invention provides that the user input is done by speech control similar input assistance. For this the data processing system advantageously comprises means for speech control and means for the conversion of verbal expressions into a series of characters.
  • An advantageous embodiment of the invention provides that for each particular lexeme to be used respectively to be processed a check is carried out if a subsequent lexeme in the order is existing, wherein it is checked in this case, if the lexeme currently to be processed is the description of a data source, the description of a data class, or the description of a data attribute of the data model. Inventively it is first checked if the user's request in natural language comprises multiple terms. If this is the case the first term is checked whether a data source, a data class, meaning a semantic classification regarding a content type, or a data attribute, meaning meta data, is existing with this term respectively with this term description.
  • A further advantageous embodiment of the invention provides that in case the description of a data source or the description of a data class is present, the description is transformed into a parameter of the query language for the selection of data from the data pool, wherein the parameter narrows down the selection of data on data with the description of the data source or the description of the data class. Advantageously the data source is only checked for the first lexeme in the sequence. A data source detected in this way accordingly limits the search area for the request. The term “Amazon” would for example only search in the data which is provided by the company Amazon. If the first term is a data class, the search would accordingly be limited to semantically classified contents regarding a content type. For example a search for the term “book” would only be performed in data which is classified as books from its content type. The information for that is received by the data processing system by usage of the available data model or in an optional embodiment of the invention advantageously by an ontology, in which data classes representing semantic content types can contain respectively comprise different descriptions, so called literary figures, in different languages. Alternatively and/or additionally the data processing system can further comprise or use dictionaries, vocabularies or the like as an ontology-giving respectively—proving source of information.
  • A further advantageous embodiment of the invention provides that it is checked for the present lexeme to be processed if a subsequent lexeme in the sequence is existing, wherein it is checked in this case if the lexeme currently to be processed is the description of a data source, the description of a data class or a data attribute of the data model.
  • A further advantageous embodiment of the invention provides that it is checked if the lexeme currently to be used is a filler word, wherein the lexeme currently to be processed is ignored in this case and the subsequent lexeme in the sequence is used.
  • In a further advantageous embodiment of the invention it is provided that in case the description of a data class is present, it is checked if the request in the query language already contains a parameter which narrows down the selection of data to data with the description of the data class, wherein in this case the currently to be processed lexeme is transformed to a parameter of the query language for selection of data from the data pool, wherein the parameter describes a relation between data objects, which are described with the parameter already contained in the request using the query language, and data objects, which are described with parameters beginning with the description of the currently to be processed lexeme. Inventively the second term of the user's request in natural language is thus analyzed respectively examined, wherein the second term can be a data class, if the first term of the user's request in natural language limits the request to a data source, or wherein the second term can be a search word, if the first term in the user's natural language conforms a data class representing a semantic content type, or wherein the second term can be a data attribute.
  • A further advantageous embodiment of the invention provides that in case the description of a data attribute is present, the description and the subsequent lexeme in the sequence are transformed into a parameter of the query language as a name-value-pair for the selection of data from the data pool, wherein the parameter narrows down the selection of data to data with the description of the data attribute. The data processing system receives the information required for that by usage of the available data model or in an optional embodiment of the invention from an ontology, in which data attributes (meta data) can contain respectively comprise different descriptions, so called literary figures, in different languages. Alternatively and/or additionally the data processing system can further comprise or use dictionaries, vocabularies or the like as an ontology-giving respectively—proving source of information. If the term of the user's request in natural language is the description of a data attribute, the request respectively the search to the data processing system is inventively narrowed down by amending the request with the attribute value respectively the meta date in this case. This narrowing-down is then used for the subsequent term in the user's request in natural language.
  • According to another advantageous proposal of the invention in case the description of a data class is present, it is checked if the description is also the description of a data attribute, wherein in this case the currently to be processed lexeme and the subsequent lexeme in the sequence are transformed into a parameter of the query language as a name-value-pair for the selection of data from the data pool, wherein the parameter alternatively narrows down the selection of data to data with the description of the data attribute. Both parameters, which on the one hand describe the data class and on the other hand the data attribute, are combined as an OR-conjunction for request of data.
  • A further advantageous embodiment of the invention is characterized in that in case no data source, no data class, no data attribute or no remaining lexeme in the sequence in the character string is described, the currently to be processed lexeme is transformed to a parameter of the query language for the selection of data from the data pool, wherein the parameter narrows down the selection of data to data with the currently to be processed lexeme as a free-text search. Inventively the next term of the user's request in natural language can be a keyword which neither represents a data class nor a data attribute and thus advantageously is used for a limiting free text search respectively full text search. If the next term of the user's request in natural language is a data class again, meaning a semantically classified content type, this is inventively interpreted such, that the results search respectively intended with the request have to be linked to contents of this type, meaning this data class.
  • Advantageously it is checked at once if the data class, which describes a link, is also the description of a data attribute. If this is the case, the previous part of the request to be created in the query language is completed by limitation of this new data attribute and is further completed by adding a logical “OR” to the until then partially created request in the query language. Advantageously this includes using the term following on the description of the data attribute (attribute name) as the attribute value.
  • With the inventive method the complexity of the request in the query language is advantageously limited to a single relation.
  • According to a particularly advantageous embodiment of the invention the user input takes place using at least one input assistance, particularly for the semiautomatic creation of complex request. Advantageously the user respectively the operator is guided automatically in formulating a request. Advantageously a list of possible data sources, data classes (semantic content types) and/or data attributes for a selectable input, which can be selected by clicking with the mouse pointer or the like, is shown to the user during the user's input of characters of the character string, preferably during input of the first word. If the entered character string, meaning the first word of the user's input, is a data class, meaning a semantic content type, a list of possible data attributes (meta data) for an input, selectable by clicking with a mouse pointer or the like, is shown to the user for the second word of the character string of the user's input. The display of possible selectable inputs is advantageously carried out by visualization in terms of a so called “drop down”-list which is known from so called internet browsers.
  • According to another advantageous embodiment of the invention the user's input is carried out for complex or more complex queries using a graphical user interface (GUI) by creation of a request-tree. Thereby the user respectively the operator can compose a request-tree or can concatenate statements, which are usable for the request, in a list for the request in the form of subject—predicate—object. Such a tree or graph representing “subject—predicate—object”—relations is known for example from WO 2009/030288 A1, whose disclosure is explicitly referenced here. The user advantageously defines the root of the request-tree for the inventive creation of the request-tree. For that the user can chose the data class, meaning the semantic classification of a content type, from a list, which is provided by the data processing system respectively by the input assistance, preferably with a graphical user interface or with so called mouse pointers or the like.
  • At the same time or alternatively the user can add any attributes. In doing so the user can also select from a list of possible data attributes which exist for the previously selected data class as a semantic content type in the data model. The data attributes are selectively concatenated by a logical “AND” or “OR. For each of the attribute names respectively each attribute description advantageously multiple attribute values can be specified which are concatenated by a logical “OR”. If the user does not select an attribute name, a free text search respectively a full text search with the stored keywords is initiated.
  • Subsequently the user can add limiting conjunctions to the object defined this way, comprising the data class which forms the root of the request-tree and/or the data attributes for this data class. For that the user can select those limiting conjunctions for the defined object from a list as well.
  • If the object concatenated in this way is defined, the user can limit the conjunction himself by selection of a predicate, meaning the reason for a conjunction. Advantageously the data processing system and/or the input assistance provides a selection list as well.
  • If multiple limiting conjunctions exist, those can advantageously be concatenated by a logical “AND” or “OR” or “AND NOT”.
  • If the user wants to further refine his request respectively his search, he can advantageously limit the objects on the second level of his request-tree using conjunctions again.
  • In a further advantageous embodiment of the invention the user can save the request-tree as a bookmark in a database or as a new data class of a semantic content type in an ontology, wherein the ontology can subsequently be used by the data processing system for further requests and/or request-trees.
  • In an advantageous embodiment of the invention the request-tree created with the inventive input assistance is displayed by the graphical user interface (GUI). In a further advantageous embodiment of the invention a window with a list is shown to the user when selecting a node of the request-tree, preferably by double clicking the node, in which the data class and the data attributes as well as the attribute values can be defined. Advantageously a double click on an edge of the request-tree opens a selection list for justification of the conjunction, meaning the predicate.
  • Object of the present invention is further a data processing system respectively a data processing unit, comprising a processor, which is designed to respectively prepared to carry out an inventive method automatically respectively automated, particularly by loading a corresponding application program realizing the method from a storage means and execution of the application program by the processor.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • Further details, characteristics and advantages of the invention are described in the following using the figures in the drawings of the embodiment. Thereby show:
  • FIG. 1 in a schematic diagram an embodiment for an inventive request to a data processing system;
  • FIG. 2 in a flow chart (FIGS. 2 a to 2 e) an embodiment for a realization of an inventive request to a data processing system;
  • FIG. 3 in a flow chart an embodiment for a realization of an inventive input assistance for the semiautomatic creation of a request to a data processing system;
  • FIG. 4 in a schematic diagram an embodiment for an inventive input assistance for the semiautomatic creation of a request to the data processing system; and
  • FIG. 5 in a flow chart an embodiment for a realization of a display of results of an inventive request to a data processing system.
  • DETAILED DESCRIPTION
  • FIG. 1 shows a data processing system 1 respectively 1′ with a data pool 2, which is based on a data model that is usable respectively processible to select data from the data pool 2 using a query language of the data processing system 1. The data processing system 1 comprises an apparatus 5 for analysis and/or interpretation of users' 3 requests 6 in natural language. A user's 3 request in natural language is captured by a user input 4 of user terms, wherein the user terms are captured as a series of characters. The user terms thereby semantically describe the user 3 intended results to the answer of the data processing system 1 on the request that has to be carried out by the data processing system 1. The input 4 of user terms can be carried out by the user 3 manually using buttons of an input device (not explicitly shown in FIG. 1) of the user 3 or using voice control (not explicitly shown in FIG. 1).
  • A user's 3 request 6 in natural language is analyzed and/or interpreted by apparatus 5 for analysis and/or interpretation of requests in natural language. Therefore the apparatus 5 uses the data model 7 for analysis and/or interpretation of requests 6 in natural language. The meta data thereby are part of the data processing system 1 (marked in FIG. 1 with reference sign 1′) or are used from the data processing system 1 by means of a communication connection (not explicitly shown in FIG. 1). During the analysis and/or interpretation of the request 6 in natural language the apparatus 5 converts the user's 3 request 6 in natural language into a request 9 in the query language of the data processing system 1 and directs this to the index or the database of the data pool 2 of the data processing system 1. The result of request 9 in the query language to the index of the data pool 2 of data processing system 1 is then delivered to the user, particularly by means of a graphical playback by a display device (not explicitly shown in FIG. 1) of user 3
  • FIG. 2 shows in a flow chart (compare FIGS. 2 a to 2 e) an embodiment for a realization of an inventive request to a data processing system using a query language of the data processing system—hereinafter called query—by an algorithm. The elements highlighted in FIGS. 2 b and 2 e by hatching, are advantageously optional variants of the method and relate on the one hand to the option of a possible semantic double interpretation of a word in case of a conjunction and on the other hand to the option for a filter for filler words, which enables a more intuitive input for the user respectively operator according to a natural language, as well as a more robust interpretation and thus a better result. Embodiment of the inventive method, for which the optional method variants do not apply, comprise instead of the optional elements highlighted with the hatching corresponding references from respectively to each corresponding elements of the flow chart that are not highlighted by hatchings.
  • A simple search request is characterized in that the user describes the desired information in an input field (text field), meaning the terms are entered, which semantically describe the user intended results of the answer to the data processing system on the request that has to be carried out by the data processing system (compare FIG. 2 a, reference sign 11). This request in natural language (search request 1) of the user is then divided into its lexical parts, at present lexemes (compare FIG. 2 a, reference sign 12), and translated respectively converted into a request in the query language of the data processing system (compare FIG. 2 e, reference sign 13), which goes beyond a mere full text search by combination of key words.
  • In doing so the subsequent element (meaning the next term) of list L is analyzed until the difference of the amount of elements (meaning the terms) of list L and index N for the element (meaning the term) of the recent iteration is bigger than 1 (compare FIG. 2 a, reference sign 14).
  • Subsequently it is validated stepwise if the conditions marked in FIG. 2 a with reference sign 15, in FIG. 2 b with reference sign 17, and in FIG. 2 c with reference sign 19 are fulfilled.
  • A detected data source (compare FIG. 2 a, reference sign 15) would narrow down the search area accordingly (compare FIG. 2 a, reference sign 16). Thus the term “Amazon” would only cause a search in data that is provided by Amazon.
  • A detected data class (semantic classification (content type)) (compare FIG. 2 b, reference sign 27, wherein the content type in FIG. 2 is described as object type OT), would narrow down the search, meaning the request in the query language of the data processing system, to correspondingly (content type respectively object type OT) classified contents (compare FIG. 2 b, reference sign 18 respectively FIG. 2 c, reference sign 27), for example to the content type respectively the data class “book”. The data processing system 1 fetches this information from the data model of the data processing system 1. If the request has already been narrowed down by a data class (semantic classification (content type)), the requested contents are narrowed down using a relation R with data that match compare FIG. 2 b, reference sign 18) the data class (semantic classification (content type)).
  • In case of a detection of a data class (semantic content type) (compare FIG. 2 b, reference signs 17 and 18) for a relation R to the requested contents it is subsequently validated if the term is further a data attribute (compare FIG. 2 b, reference sign 22). If this is the case, the search is further limited to a corresponding attribute value (meta date) (compare FIG. 2 b, reference sign 23). For this the previous part of the search request in the query language of the data processing system is completed by limitation of this new data attribute and is further completed by adding a logical “OR” to the previously interpreted search request (compare FIG. 2 b, reference sign 23). The term following the attribute name thereby is used as the attribute value.
  • If the request Q has already been narrowed down by a data class (semantic classification (content type)) and already contains a relation R (FIG. 2 b, reference sign 24), which contains this data class (semantic classification (content type)), the description of the data class (semantic classification (content type)) is used as a key word for a full text search for further limitation of relation R of request Q (FIG. 2 b, reference sign 25).
  • The described method limits the complexity of request Q to a data processing system using a query language of the data processing system (also called query) to a single relation R.
  • A recognized data attribute respectively a recognized attribute name (compare FIG. 2 c, reference sign 19) would limit the search to a corresponding attribute value (meta date) (compare FIG. 2 c, reference sign 20 respectively 28). If the relation R is not yet an element of request Q, a limitation of query Q is carried out (compare FIG. 2 c, reference sign 28).
  • If the next term of the user's request in natural language is a catchword, which does neither represent a data class (a content type) nor a data attribute (attribute name), this term is used for a limiting full text search (compare FIG. 2 d, reference sign 21 respectively FIG. 2 e, reference sign 29). Thereby a recognition is carried out if the relation R of request Q already contains a parameter, which limits the request to the word W(n) that shall be used for a so called full text search. If this is not the case, a recognition is carried out such if the word W(n) is a so called stop word (FIG. 2 e, reference sign 26). If this is not the case, the query Q is completed to a full text search with word W(n) (FIG. 2 e, reference sign 29).
  • The flow chart according FIG. 3 shows an embodiment for a realization of an inventive input assistance for the semiautomatic creation of a request to a data processing system. For the input which has to be done by the user using an input field (text field), the following input assistances are provided additionally respectively optionally, which allow a virtual moderation of the search request (query) using the query language of the data processing system:
      • If the operator begins to type the first word, a list of possible data sources and/or data classes (semantic content types) appears below the input field.
      • If the first word is a data class (semantic content type), a selection of possible attribute names appears for the second word. The visualization can happen using a drop down list, for example in the way how it is already known from so called browsers for their search area.
  • FIG. 3 shows how such an inventive moderation of the search request can be realized by an algorithm using input assistances for the suggestion of object types, attribute names and attribute values. FIG. 4 shows a visualization of a corresponding input assistance (subsequently also called wizard) for the user.
  • The user is guided during the formulation of a search request by the inventive input assistance—subsequently also called wizard: the user creates a request-tree or concatenates statements in the form subject-predicate-object in a list. In doing so the following steps are realized:
      • 1. At first the user defines the root of the request-tree. For that he can chose the data class (semantic content type) from a list. In the embodiment according FIG. 4 this is for example email S1.
      • 2. At the same time or alternatively the user can add any data attributes. In doing so the user can also select from a list of possible data attributes which exist for the selected content type. The attributes are selectively concatenated by a logical “AND” or “OR”. For each of the attribute names multiple attribute values can be specified which are concatenated by a logical “OR”. If the user does not select an attribute name, a full text search is initiated for the selected content type. In the embodiment according FIG. 4 possible data attributes are presented in window F for the data class email S1.
      • 3. In a third step the user can add limiting conjunctions to the so defined object. The user does the definition of associated objects the same way as it is done for the root of the request-tree.
      • 4. If the object is defined, the user can limit the conjunction of a predicate, meaning the reason for a conjunction, by himself. For that a selection list is available for the user as well, which particularly is reflected in a window according window F of data class email S1. In FIG. 4 the corresponding predicates of data class email S1 are marked by edge P11 and P12.
      • 5. If multiple limiting conjunctions are existing, those can be concatenated by a logical “AND” or “OR”.
      • 6. If the user wants to further refine his search, he can limit the objects of the second level of his request-tree also by conjunctions. In FIG. 4 those are the project O11 and the product O12, wherein project O11 becomes subject S2 for further refinement, wherein the subject S2 is linked to the object customer O2 via the predicate forming the edge of the graph. The user can save this query-tree as a bookmark in a data base or as a new semantic content type, meaning a new data class, in an ontology which can subsequently be used by the data processing system.
  • A graphical solution of this input assistance (wizard) is advantageously given by a graphical presentation of the request-tree created this way (compare the graph above window F in FIG. 4). When performing a selection, for example by double clicking the nodes (compare FIG. 4, reference signs S1, O11, O12, S2, O2), a window with a list is shown to the user (compare FIG. 4, reference sign F), in which the data class (content type) and the attribute names as well as the attribute values can be defined. In contrast to that double clicking an edge (compare FIG. 4, reference signs P11, P12, P21) opens and shows a selection list for the reason of the conjunction (predicate).
  • FIG. 5 shows in a flow chart an embodiment for a realization of a representation of results of an inventive request to the data processing system.
  • The embodiments of the invention illustrated in the figures of the drawing and the embodiments of the invention described in this context only serve the explanation of the invention and are not restrictive for the same. FIG. 2 for example describes an embodiment for an inventive realization of a complex request that was for example created with the input assistance (wizard). The complex request thereby is translated into request parts which are combined with each other. This means that the complex request, which represents a graph, is divided into relations (tuples), that are linked to each other.
  • REFERENCE LIST
    • 1, 1′ Data processing system
    • 2 Data pool
    • 3 User (operator)
    • 4 Input of user terms as a series of characters
    • 5 Apparatus for analysis and/or interpretation of requests (6) in natural language
    • 6 Requests in natural language
    • 7 Data model/meta data (literates, dictionaries, vocabularies)
    • 8 Request/usage data model/meta data (7)
    • 9 Request in query language
    • 10 Answer to request in query language
    • 11 Request in natural language
    • 12 Division of request in natural language into lexemes/terms
    • 13 Output of request in query language of data processing system
    • 14 Validation if multiple lexemes/terms exist
    • 15 Recognition data source
    • 16 Completion of request in query language by limitation to data source
    • 17 Recognition data class (semantic content type/object type)
    • 18 Completion of request in query language by limitation to data class (semantic content type/object type)
    • 19 Recognition of data attribute
    • 20 Completion of request in query language by limitation to data attribute
    • 21 Completion of request in query language by full text search
    • 22 Recognition of data attribute
    • 23 Completion of request in query language by limitation to data attribute
    • 24 Recognition of data class (semantic content type/object type) that is already element of the relation of the request in query language
    • 25 Completion of request in query language by full text search
    • 26 Recognition of stop word
    • 27 Completion of request in query language by limitation to data class (semantic content type/object type)
    • 28 Completion of request in query language by limitation to data attribute
    • 29 Completion of request in query language by full text search
    • W Window
    • O11 Object
    • O12 Object
    • O2 Object
    • P11 Predicate
    • P12 Predicate
    • P21 Predicate
    • S1 Subject
    • S2 Subject

Claims (9)

What is claimed is:
1. The method for an at least semiautomatic generation of a request to a data processing system with a data pool,
wherein a data model, which is usable for the selection of data from the data pool using a query language, forms the basis of the data pool,
wherein a user's request made in natural language,
which takes place by a user input of terms captured as a series of characters (character string),
wherein the terms semantically describe the user intended results of the answer of the data processing system on the request that has to be carried out by the data processing system, is analyzed and/or interpreted, and converted into a request (9) in the query language, characterized in that the request in natural language captured as a character string is resolved into lexemes, those lexemes are sequentially processed equivalent to their sequence,
wherein a meaning of the particular lexeme is determined on the basis of the data model, and a request in the query language is created out of the determined meaning.
2. The method according to claim 1, wherein for each particular lexeme to be processed a check is performed if a subsequent lexeme in the sequence is existing, wherein it is checked in this case, if the lexeme currently to be processed is the description of a data source, the description of a data class, or the description of a data attribute of the data model.
3. The method according to claim 2, wherein in case the description of a data source or the description of a data class is present, the description is transformed into a parameter of the query language for the selection of data from the data pool, wherein the parameter narrows down the selection of data on data with the description of the data source or the description of the data class.
4. The method according to claim 2, wherein the data source is checked only for the first lexeme in the sequence.
5. The method according to claim 1, wherein it is checked if the lexeme currently to be used is a filler word, wherein the lexeme currently to be processed is ignored in this case and the subsequent lexeme in the sequence is used.
6. The method according to claim 2, wherein in case the description of a data class is present, it is checked if the request in the query language contains already a parameter which narrows down the selection of data to data with the description of the data class, wherein in this case the currently to be processed lexeme is transformed to a parameter of the query language for selection of data from the data pool, wherein the parameter describes a relation between data objects, which are described with the parameter already contained in the request using the query language, and data objects, which are described with parameters beginning with the description of the currently to be processed lexeme.
7. The method according to claim 2, wherein in case the description of a data attribute is present, the description and the subsequent lexeme in the sequence are transformed into a parameter of the query language as a name-value-pair for the selection of data from the data pool, wherein the parameter narrows down the selection of data to data with the description of the data attribute.
8. The method according to claim 6, wherein in case the description of a data class is present, it is checked if the description is also the description of a data attribute, wherein in this case the currently to be processed lexeme and the subsequent lexeme in the sequence are transformed into a parameter of the query language as a name-value-pair for the selection of data from the data pool, wherein the parameter alternatively narrows down the selection of data to data with the description of the data attribute, meaning that in the request the OR-conjunction is combined with the request of data of a data class.
9. The method according to claim 2, wherein in case no data source, no data class, no data attribute or no remaining lexeme in the sequence in the character string is described, the currently to be processed lexeme is transformed to a parameter of the query language for the selection of data from the data pool, wherein the parameter narrows down the selection of data to data with the currently to be processed lexeme as a free-text search.
US14/402,888 2012-05-24 2012-05-24 Generation of requests to a data processing system Abandoned US20150302050A1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/EP2012/002227 WO2013174407A1 (en) 2012-05-24 2012-05-24 Generation of queries to a data processing system

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2012/002227 A-371-Of-International WO2013174407A1 (en) 2012-05-24 2012-05-24 Generation of queries to a data processing system

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US16/210,190 Continuation US11934391B2 (en) 2012-05-24 2018-12-05 Generation of requests to a processing system

Publications (1)

Publication Number Publication Date
US20150302050A1 true US20150302050A1 (en) 2015-10-22

Family

ID=46275762

Family Applications (2)

Application Number Title Priority Date Filing Date
US14/402,888 Abandoned US20150302050A1 (en) 2012-05-24 2012-05-24 Generation of requests to a data processing system
US16/210,190 Active US11934391B2 (en) 2012-05-24 2018-12-05 Generation of requests to a processing system

Family Applications After (1)

Application Number Title Priority Date Filing Date
US16/210,190 Active US11934391B2 (en) 2012-05-24 2018-12-05 Generation of requests to a processing system

Country Status (3)

Country Link
US (2) US20150302050A1 (en)
EP (1) EP2856344A1 (en)
WO (1) WO2013174407A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190034555A1 (en) * 2017-07-31 2019-01-31 Splunk Inc. Translating a natural language request to a domain specific language request based on multiple interpretation algorithms
US10901811B2 (en) 2017-07-31 2021-01-26 Splunk Inc. Creating alerts associated with a data storage system based on natural language requests
US11494395B2 (en) 2017-07-31 2022-11-08 Splunk Inc. Creating dashboards for viewing data in a data storage system based on natural language requests

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050267871A1 (en) * 2001-08-14 2005-12-01 Insightful Corporation Method and system for extending keyword searching to syntactically and semantically annotated data
US20060059153A1 (en) * 2004-09-10 2006-03-16 France Telecom Computer constructions of a lexical tree
US7181438B1 (en) * 1999-07-21 2007-02-20 Alberti Anemometer, Llc Database access system
US20080040345A1 (en) * 2006-08-07 2008-02-14 International Characters, Inc. Method and Apparatus for String Search Using Parallel Bit Streams
US20080208837A1 (en) * 2007-02-27 2008-08-28 The University Court Of The University Of Edinburgh Methods and apparatus for term normalization
US20090292687A1 (en) * 2008-05-23 2009-11-26 International Business Machines Corporation System and method for providing question and answers with deferred type evaluation
US20130006954A1 (en) * 2011-06-30 2013-01-03 Xerox Corporation Translation system adapted for query translation via a reranking framework
US8516008B1 (en) * 2012-05-18 2013-08-20 Splunk Inc. Flexible schema column store

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE466029B (en) * 1989-03-06 1991-12-02 Ibm Svenska Ab DEVICE AND PROCEDURE FOR ANALYSIS OF NATURAL LANGUAGES IN A COMPUTER-BASED INFORMATION PROCESSING SYSTEM
US6101490A (en) * 1991-07-19 2000-08-08 Hatton; Charles Malcolm Computer system program for creating new ideas and solving problems
CA2091658A1 (en) * 1993-03-15 1994-09-16 Matthew Lennig Method and apparatus for automation of directory assistance using speech recognition
SE517496C2 (en) * 2000-06-22 2002-06-11 Hapax Information Systems Ab Method and system for information extraction
US7085708B2 (en) * 2000-09-23 2006-08-01 Ravenflow, Inc. Computer system with natural language to machine language translator
WO2002041169A1 (en) * 2000-11-17 2002-05-23 Invention Machine Corporation, Inc. Semantic answering system and method
US7403938B2 (en) * 2001-09-24 2008-07-22 Iac Search & Media, Inc. Natural language query processing
GB0209563D0 (en) * 2002-04-26 2002-06-05 Univ Edinburgh Text processing method and system
US7840072B2 (en) * 2003-03-13 2010-11-23 Hewlett-Packard Development Company, L.P. Method and system for pattern matching
WO2005103951A1 (en) * 2004-04-23 2005-11-03 Novauris Technologies Limited Tree index based method for accessing automatic directory
JP4314221B2 (en) * 2005-07-28 2009-08-12 株式会社東芝 Structured document storage device, structured document search device, structured document system, method and program
US7774198B2 (en) * 2006-10-06 2010-08-10 Xerox Corporation Navigation system for text
US9892111B2 (en) * 2006-10-10 2018-02-13 Abbyy Production Llc Method and device to estimate similarity between documents having multiple segments
JP5245255B2 (en) * 2007-02-15 2013-07-24 富士通株式会社 Specific expression extraction program, specific expression extraction method, and specific expression extraction apparatus
US7890318B2 (en) * 2007-05-23 2011-02-15 Xerox Corporation Informing troubleshooting sessions with device data
WO2009030246A1 (en) 2007-09-03 2009-03-12 Iqser Ip Ag Detecting correlations between data representing information
US8738360B2 (en) * 2008-06-06 2014-05-27 Apple Inc. Data detection of a character sequence having multiple possible data types
US20100088674A1 (en) * 2008-10-06 2010-04-08 Microsoft Corporation System and method for recognizing structure in text
EP2306333A1 (en) * 2009-09-04 2011-04-06 Jentro Technologies GmbH Offline software library
US20110099052A1 (en) * 2009-10-28 2011-04-28 Xerox Corporation Automatic checking of expectation-fulfillment schemes
US8831947B2 (en) * 2010-11-07 2014-09-09 Nice Systems Ltd. Method and apparatus for large vocabulary continuous speech recognition using a hybrid phoneme-word lattice
US9110883B2 (en) * 2011-04-01 2015-08-18 Rima Ghannam System for natural language understanding
US9418151B2 (en) * 2012-06-12 2016-08-16 Raytheon Company Lexical enrichment of structured and semi-structured data
US9626358B2 (en) * 2014-11-26 2017-04-18 Abbyy Infopoisk Llc Creating ontologies by analyzing natural language texts
WO2018039644A1 (en) * 2016-08-25 2018-03-01 Purdue Research Foundation System and method for controlling a self-guided vehicle
US10978053B1 (en) * 2020-03-03 2021-04-13 Sas Institute Inc. System for determining user intent from text

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7181438B1 (en) * 1999-07-21 2007-02-20 Alberti Anemometer, Llc Database access system
US20050267871A1 (en) * 2001-08-14 2005-12-01 Insightful Corporation Method and system for extending keyword searching to syntactically and semantically annotated data
US20060059153A1 (en) * 2004-09-10 2006-03-16 France Telecom Computer constructions of a lexical tree
US20080040345A1 (en) * 2006-08-07 2008-02-14 International Characters, Inc. Method and Apparatus for String Search Using Parallel Bit Streams
US20080208837A1 (en) * 2007-02-27 2008-08-28 The University Court Of The University Of Edinburgh Methods and apparatus for term normalization
US20090292687A1 (en) * 2008-05-23 2009-11-26 International Business Machines Corporation System and method for providing question and answers with deferred type evaluation
US20130006954A1 (en) * 2011-06-30 2013-01-03 Xerox Corporation Translation system adapted for query translation via a reranking framework
US8516008B1 (en) * 2012-05-18 2013-08-20 Splunk Inc. Flexible schema column store

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190034555A1 (en) * 2017-07-31 2019-01-31 Splunk Inc. Translating a natural language request to a domain specific language request based on multiple interpretation algorithms
US10901811B2 (en) 2017-07-31 2021-01-26 Splunk Inc. Creating alerts associated with a data storage system based on natural language requests
US11494395B2 (en) 2017-07-31 2022-11-08 Splunk Inc. Creating dashboards for viewing data in a data storage system based on natural language requests

Also Published As

Publication number Publication date
US20190179811A1 (en) 2019-06-13
WO2013174407A1 (en) 2013-11-28
US11934391B2 (en) 2024-03-19
EP2856344A1 (en) 2015-04-08

Similar Documents

Publication Publication Date Title
US8346795B2 (en) System and method for guiding entity-based searching
EP3142028A2 (en) Handling failures in processing natural language queries through user interactions
US10140333B2 (en) Trusted query system and method
US9448995B2 (en) Method and device for performing natural language searches
US9477782B2 (en) User interface mechanisms for query refinement
US8341167B1 (en) Context based interactive search
US9799040B2 (en) Method and apparatus for computer assisted innovation
WO2018146492A1 (en) Computer-implemented method of querying a dataset
US20160012021A1 (en) Document assembly systems and methods
KR101524889B1 (en) Identification of semantic relationships within reported speech
US20180004838A1 (en) System and method for language sensitive contextual searching
US11934391B2 (en) Generation of requests to a processing system
WO2013134871A1 (en) System and method for conversation-based information search
WO2006051297A1 (en) System and method for formulating and refining queries on structured data
US20080154853A1 (en) English-language translation of exact interpretations of keyword queries
Borsje et al. Graphical query composition and natural language processing in an RDF visualization interface
US11694033B2 (en) Transparent iterative multi-concept semantic search
Iqbal et al. A Negation Query Engine for Complex Query Transformations
US11017172B2 (en) Proposition identification in natural language and usage thereof for search and retrieval
JP2015095180A (en) Content search program for clearly presenting selection candidates for query word in refined search, device, and method
RU2571407C1 (en) Method to generate map of connections of converted structured data array components
da Purificação et al. A Controlled Natural Language Editor for Semantic of Business Vocabulary and Rules
RU2571406C1 (en) Method of double-level search of information in previously converted structured data array
Dinşoreanu et al. Integrated System for Developing Semantically-Enhanced Archive Econtent
Sathick et al. Natural Language to SQL Generation for Semantic Knowledge Extraction in Social Web Sources

Legal Events

Date Code Title Description
AS Assignment

Owner name: IQSER IP AG, GERMANY

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:WURZER, JOERG, DR.;REEL/FRAME:035563/0870

Effective date: 20150427

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION