WO2015175313A1 - Disambiguation of queries implicit to multiple entities - Google Patents

Disambiguation of queries implicit to multiple entities Download PDF

Info

Publication number
WO2015175313A1
WO2015175313A1 PCT/US2015/029696 US2015029696W WO2015175313A1 WO 2015175313 A1 WO2015175313 A1 WO 2015175313A1 US 2015029696 W US2015029696 W US 2015029696W WO 2015175313 A1 WO2015175313 A1 WO 2015175313A1
Authority
WO
WIPO (PCT)
Prior art keywords
entity
entities
query
user device
user
Prior art date
Application number
PCT/US2015/029696
Other languages
French (fr)
Inventor
Nils GRIMSMO
Behshad Behzadi
Original Assignee
Google Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Google Inc. filed Critical Google Inc.
Priority to EP15722417.1A priority Critical patent/EP3143516A1/en
Priority to CN201580024802.9A priority patent/CN106462603B/en
Publication of WO2015175313A1 publication Critical patent/WO2015175313A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9537Spatial or temporal dependent retrieval, e.g. spatiotemporal queries

Definitions

  • the Internet provides access to a wide variety of resources, such as image files, audio files, video files, and web pages.
  • a search system can identify resources in response to queries submitted by users and provide information about the resources in a manner that is useful to the users. The users can navigate through, e.g., click on, search results to acquire information of interest.
  • This specification relates to interpreting user queries based on a location of a user device.
  • Implementations of the present disclosure are generally directed to rewriting queries based on one or more implied entities. More particularly, implementations of the present disclosure are directed to identifying a set of entities based on an approximate location of a user device that submits a query, determining that the query is implicitly directed to multiple entities of the set of entities, selectively providing an interface for disambiguation of the multiple entities, and rewriting the query to explicitly reference an entity of the set of entities. In some examples, the entity is identified by the user using the disambiguation interface.
  • innovative aspects of the subject matter described in this specification can be embodied in methods that include actions of receiving a first query provided from a user device, and determining that the first query is implicitly about some entity, and in response: obtaining a first approximate location of the user device when the user device provided the first query, obtaining a first set of entities including a plurality of entities, each entity in the first set of entities being associated with the first approximate location, determining that the first query is implicitly about at least two entities in the first set of entities, and in response providing a first selection interface for display to the user, the first selection interface including graphical representations, each graphical representation representing a respective entity of the at least two entities of the first set of entities, receiving user input indicating user selection of a first entity, and providing a first revised query based on the first query and the first entity, the first revised query explicitly referencing the first entity.
  • Other implementations of this aspect include corresponding systems, apparatus, and computer programs, configured to perform the actions of the methods, encoded on computer storage devices.
  • a graphical representation includes one or more of a name of a respective entity, a type of the respective entity, an approximate distance between the user device and the respective entity, one or more images of the respective entity, a rating of the respective entity, an address of the respective entity, a telephone number of the respective entity, and one or more results that would be displayed, if the respective entity is selected; actions further include: receiving a second query provided from the user device, and determining that the second query is implicitly about some entity, and in response: obtaining a second approximate location of the user device when the user device provided the second query, obtaining a second set of entities including a plurality of entities, each entity in the second set of entities being associated with the second approximate location, determining that the second query is implicitly about at least two entities in the second set of entities, automatically, without displaying a second selection interface, selecting a second entity, and providing a second revised query based on the second query and the second entity, the second revised query explicitly
  • a disambiguation interface can be selectively provided to the user. For example, previous information can be processed to determine whether to provide the disambiguation interface, or to automatically select an entity without user input. In this manner, interaction with the user can be streamlined.
  • the user submitting the query does not need to know the name of the entity that is the subject of the query. For example, the user can be standing near a monument and can submit the query [what is this monument], without having to first determine the name of the monument. In some examples, the user does not need to know how to properly pronounce and/or spell the name of the entity.
  • implementations of the present disclosure enable users to more conveniently and naturally interact with a search system (e.g., submitting the query [show me lunch specials] instead of the query [Fino Ristorante & Bar lunch specials].
  • search system e.g., submitting the query [show me lunch specials] instead of the query [Fino Ristorante & Bar lunch specials].
  • FIG. 1 depicts an example environment in which a search system provides search results based on interpreted user queries.
  • FIG. 2 depicts an example use cases in accordance with implementations of the present disclosure.
  • FIG. 3 depicts an example disambiguation interface based on the example use case of FIG. 2.
  • FIG. 4 depicts an example process that can be executed in accordance with implementations of the present disclosure.
  • FIG. 5 depicts an example process that can be executed in accordance with implementations of the present disclosure.
  • Implementations of the present disclosure are generally directed to rewriting queries based on one or more implied entities. More particularly, implementations of the present disclosure are directed to determining one or more entities that a received query is implicitly about, selecting an entity of the one or more entities, and rewriting the query to explicitly reference the selected entity.
  • a set of entities is identified based on an approximate location of a user device that submits a query, each entity in the set of entities being determined to be sufficiently close to the approximate location.
  • each entity it is determined whether one or more n-grams of the query correlate to a type of the entity, and, if a correlation is provided, the query is deemed to be potentially about the respective entity, and the entity is added to a sub-set of entities.
  • an entity is selected from the sub-set of entities, and the query is rewritten to explicitly reference the selected entity.
  • search results are provided based on the revised query.
  • FIG. 1 depicts an example environment 100 in which a search system provides search results based on interpreted user queries.
  • the example environment 100 enables users to interact with one or more computer-implemented services.
  • Example computer-implemented services can include a search service, an electronic mail service, a chat service, a document sharing service, a calendar sharing service, a photo sharing service, a video sharing service, blogging service, a micro- blogging service, a social networking service, a location (location-aware) service, a check-in service and a ratings and review service.
  • a search system 120 is depicted, which provides a search service, as described in further detail herein.
  • the example environment 100 includes a network 102, e.g., a local area network (LAN), wide area network (WAN), the Internet, or a combination thereof, connects web sites 104, user devices 106, and the search system 120.
  • the network 102 can be accessed over a wired and/or a wireless communications link.
  • mobile computing devices such as smartphones can utilize a cellular network to access the network 102.
  • the environment 100 may include millions of web sites 104 and user devices 106.
  • a web site 104 is provided as one or more resources 105 associated with a domain name and hosted by one or more servers.
  • An example web site is a collection of web pages formatted in an appropriate machine-readable language, e.g., hypertext markup language (HTML), that can contain text, images, multimedia content, and programming elements, e.g., scripts.
  • HTML hypertext markup language
  • Each web site 104 is maintained by a publisher, e.g., an entity that manages and/or owns the web site.
  • a resource 105 is data provided over the network 102 and that is associated with a resource address, e.g., a uniform resource locator (URL).
  • resources 105 that can be provided by a web site 104 include web pages, word processing documents, and portable document format (PDF) documents, images, video, and feed sources, among other appropriate digital content.
  • PDF portable document format
  • the resources 105 can include content, e.g., words, phrases, images and sounds and may include embedded information, e.g., meta information and hyperlinks, and/or embedded instructions, e.g., scripts.
  • a user device 106 is an electronic device that is capable of requesting and receiving resources 105 over the network 102.
  • Example user devices 106 include personal computers, laptop computers, and mobile computing devices, e.g., smartphones and/or tablet computing devices, that can send and receive data over the network 102.
  • mobile computing device refers to a user device that is configured to communicate over a mobile communications network.
  • a smartphone e.g., a phone that is enabled to communicate over the Internet, is an example of a mobile device.
  • a user device 106 can execute a user application, e.g., a web browser, to facilitate the sending and receiving of data over the network 102.
  • the search system 120 identifies the resources 105 by crawling and indexing the resources 105 provided on web sites 104. Data about the resources 105 can be indexed based on the resource to which the data corresponds. The indexed and, optionally, cached copies of the resources 105 are stored in a search index 122.
  • a user device 106 submit search queries 109 to the search system 120.
  • a user device 106 can include one or more input modalities.
  • Example modalities can include a keyboard, a touchscreen and/or a microphone.
  • a user can use a keyboard and/or touchscreen to type in a search query.
  • a user can speak a search query, the user speech being captured through a microphone, and being processed through speech recognition to provide the search query.
  • a search results display can include one or more web pages, e.g., one or more search results pages.
  • a web page can be provided based on a web document that can be written in any appropriate machine- readable language. It is contemplated, however, that implementations of the present disclosure can include other appropriate display types.
  • search results can be provided in a display generated by an application that is executed on a computing device, and/or a display generated by an operating system, e.g., mobile operating system.
  • search results can be provided based on any appropriate form, e.g., Javascript-html, plaintext.
  • a search result 112 is data generated by the search system 120 that identifies a resource 105 that is responsive to a particular search query, and includes a link to the resource 105.
  • An example search result 1 12 can include a web page title, a snippet of text or a portion of an image extracted from the web page, and the URL of the web page.
  • data provided in the search results 1 12 can be retrieved from a resource data store.
  • the search system 120 can provide the search results display 1 11, which displays the search results 1 12.
  • the search results display 11 1 can be populated with information, e.g., a web page title, a snippet of text or a portion of an image extracted from the web page, that is provided from the resource data store.
  • data for the search queries 109 submitted during user sessions are stored in a data store, such as the historical data store 124.
  • the search system 120 can store received search queries in the historical data store 124.
  • selection data specifying actions taken in response to search results 1 12 provided in response to each search query 109 are also stored in the historical data store 124, for example, by the search system 120. These actions can include whether a search result 112 was selected, e.g., clicked or hovered over with a pointer.
  • the selection data can also include, for each selection of a search result 1 12, data identifying the search query 109 for which the search result 1 12 was provided.
  • the example environment 100 also includes a query interpretation system 130 communicably coupled to the search system 120, e.g., directly coupled or coupled over a network such as network 102. Although the search system 120 and the query interpretation system 120 are depicted as separate systems in FIG.
  • the search system 120 can include the query interpretation system 130.
  • the query interpretation system 130 provides a revised query based on a query received from a user device 106.
  • the revised query is provided based on an approximate location of the user device 106 that provided the query, and one or more entities that are determined to be proximate to the user device 106.
  • the search results 112 provided by the search system 120 are responsive to the revised query.
  • a plurality of entities and information associated therewith can be stored as structured data in a knowledge graph.
  • a knowledge graph includes a plurality of nodes and edges between nodes.
  • a node represents an entity and an edge represents a relationship between entities.
  • the knowledge graph can be provided based on an example schema that structures data based on domains, types and properties.
  • a domain includes one or more types that share a namespace.
  • a namespace is provided as a directory of uniquely named objects, where each object in the namespace has a unique name, e.g., identifier.
  • a type denotes an "is a" relationship about a topic, and is used to hold a collection of properties.
  • a topic represents an entity, such as a person, place or thing.
  • each topic can have one or more types associated therewith.
  • a property is associated with a topic and defines a "has a" relationship between the topic and a value of the property.
  • the value of the property can include another topic.
  • a plurality of entities can be provided in one or more databases.
  • a plurality of entities can be provided in a table that can provide data associated with each entity.
  • Example data can include a name of the entity, a location of the entity, one or more types, described in further detail below, assigned to the entity, one or more ratings associated with the entity, and any other appropriate information that can be provided for the entity.
  • Implementations of the present disclosure are generally directed to rewriting queries based on one or more implied entities. More particularly, implementations of the present disclosure are directed to identifying a set of entities based on an approximate location of a user device that submits a query, determining that the query is implicitly directed to multiple entities of the set of entities, selectively providing an interface for disambiguation of the multiple entities, and rewriting the query to explicitly reference an entity of the set of entities. In some examples, the entity is identified by the user using the disambiguation interface.
  • FIG. 2 depicts an example use cases in accordance with implementations of the present disclosure.
  • an area 200 is provided, in which a user device 202 is located.
  • Multiple entities are also located within the area 200.
  • Example entities can include a restaurant, a bar, a hotel, a theater, a school, a university, a music venue, a tourist attraction, and a park. It is appreciated that implementations of the present disclosure are not limited to entities specifically identified herein.
  • an entity "Great Hotel” 204, an entity “Not-so-Great Hotel” 206, an entity “Horrible Hotel” 208, an entity “Acceptable Hotel” 210, and an entity “Awesome Pizza” 212 are provided in the area 200.
  • a query is received from a user device.
  • the user device 202 can provide a query to a search system, e.g., the search system 120 of FIG. 1.
  • a search system e.g., the search system 120 of FIG. 1.
  • it can be determined that the query is implicitly about an entity.
  • the query is absent an explicit reference to an entity, it can be determined that the query is implicitly about an entity.
  • the query includes a pronoun, it can be determined that the query is implicitly about an entity.
  • the received query can be processed and can be compared to one or more query patterns to determine whether the query is implicitly referencing an entity.
  • the query is processed to remove one or more stop words and/or conversational terms to provide a cleaned query. For example, the example query [can you please show me ratings] can be processed to provide
  • the cleaned query is compared to one or more query patterns, each query pattern indicating an implicit reference to some entity. In some examples, if the cleaned query matches a query pattern, it is determined that the query is implicitly about some entity. In some examples, if the cleaned query does not match a query pattern, it is determined that the query is not implicitly about some entity. For example, the clean query [ratings], from the example above, can match a query pattern [ratings]. Consequently, it can be determined that the query [can you please show me ratings] is implicitly about some entity.
  • the example query [can you please show me ratings for Awesome Pizza] can be processed to provide the cleaned query [ratings Awesome Pizza].
  • the cleaned query [ratings Awesome Pizza] can be compared to one or more query patterns, and it can be determined that the cleaned query does not match any of the one or more query patterns.
  • the term "Awesome Pizza" is specific to an entity, and is thus, not included in any of the query patterns. Consequently, it is determined that the query [can you please show me ratings for Awesome Pizza] is not implicitly about some entity. Instead, and in this example, the query [can you please show me ratings for Awesome Pizza] is explicitly about an entity, i.e., Awesome Pizza.
  • an approximate location of the user device that submitted the query can be determined.
  • the approximate location can be determined based on one or more location-related signals.
  • Example location related-signals can include cellular signals, global-positioning system (GPS) signals, wireless signals (WiFi), and the like.
  • location data that is representative of the one or more location-related signals is provided with the query.
  • the location data is provided to a location service, which processes the location data and provides the approximate location of the user device.
  • the approximate location can be provided as geographic coordinates, e.g., longitude, latitude.
  • a time can be associated with the approximate location.
  • the location service can provide the approximate location and a time associated therewith.
  • the time indicates an approximate time, at which the user device is determined to be or have been at the approximate location.
  • a time difference can be determined between a current time and the time associated with the approximate location, and can be compared to a threshold difference. In some examples, if the time difference exceeds the threshold difference, the approximate location is determined to not be sufficiently fresh. In some examples, the current time is provided as the time, at which the query was sent from the user device, or received by the search system.
  • a set of entities is provided based on the approximate location. For example, if the confidence score exceeds the threshold confidence score and/or the approximate location is sufficiently fresh, the set of entities is provided, as described below.
  • the set of entities includes one or more entities.
  • the set of entities includes one or more entities that are determined to be sufficiently close to the approximate location, such that the query could be potentially about the one or more entities.
  • the one or more entities are selected for inclusion in the set of entities based on a location of a respective entity relative to the approximate location.
  • entities that are within a threshold distance, e.g., threshold radius, of the location can be identified, and are included in the set of entities.
  • a geographic location can be divided into a grid that includes a plurality of cells, e.g., square or rectangular cells, each cell accounting for an area of the geographic location.
  • each cell is associated with a set of entities that are located in the respective area.
  • a cell, within which the approximate location lies can be identified, and can be provided as a central cell.
  • a plurality of surrounding cells that surround the central cell can be identified.
  • any cell that is at least partially within the threshold distance can be provided as a surrounding cell and can be included in the plurality of surrounding cells.
  • the set of entities includes all entities associated with the central cell and each of the surrounding cells in the plurality of surrounding cells.
  • a threshold radius 220 is provided, which defines a circular area 222.
  • the example area 222 is circular, it is contemplated that the area can include any appropriate geometric shape.
  • entities that lie within the area 222 are included in the set of entities. Consequently, in the example of FIG. 2, the entities "Great Hotel” 204, "Not-so-Great Hotel” 206, "Horrible Hotel” 208 and “Acceptable Hotel” 210 are included in the set of entities.
  • the approximate location can be provided to an entity service, which processes the approximate location and provides the set of entities.
  • each entity is associated with a respective type.
  • the type corresponds to a category associated with the entity.
  • the entity "Great Hotel” 204 can be provided as a type [hotel].
  • the entity "Awesome Pizza” 212 can be provided as a type [restaurant].
  • the entity service provides the respective type for each entity in the set of entities.
  • the type of an entity is determined from the knowledge graph, or database, which store information about respective entities, as described above. In accordance with implementations of the present disclosure, for each entity in the set of entities, it is determined whether the query is implicitly referencing the entity. In some examples, the type of the entity is cross-referenced to an index.
  • the index associates types to respective n-grams of queries.
  • an n-gram includes one or more terms of a query. If it is determined, based on the index, that one or more n-grams of the query correlate to the type, the query is deemed to be potentially about the respective entity, and the entity is added to a sub-set of entities.
  • the index is provided based on analysis of one or more search logs.
  • the search logs provide a record of received queries.
  • the search logs can be processed to provide the index.
  • the search logs can be processed to determine that queries including the term [menu] correlate to the type [restaurant].
  • the search logs can be processed to determine that queries including the term [reviews] correlate to the types [restaurant], [hotel], [movie], [play], [store] and [service provider].
  • the index can be provided to correlate the term [menu] to the type [restaurant], and to correlate the terms [reviews] to the types [restaurant], [hotel], [movie], [play], [store], and [service provider].
  • a revised query is provided based on the query and at least one entity in the sub-set of entities.
  • the revised query is provided by rewriting the query to explicitly reference the at least one entity.
  • an entity is selected from the set of entities.
  • an entity of the subset of entities can be selected.
  • the entity is selected based on user input.
  • a selection interface is displayed to the user in response to multiple entities being included in the sub-set of entities.
  • the selection interface includes a graphical representation of each entity in the sub-set of entities.
  • a list of entities is represented in the selection interface.
  • the list of entities includes an ordered list that has the entities ordered based on one or more attributes associated with the respective entities, and/or results that would be displayed based on respective revised queries that would be provided based on the respective entities.
  • Example attributes include an approximate distance between the user and the entity, a rating of the entity, a popularity of the entity, and whether the user has previously submitted a query about the respective entity.
  • Example attributes also include one or more user interactions.
  • Example user interactions include locations physically visited by the user, a web page previously visited by the user.
  • respective results can be provided for each entity, which can include results that would be provided, if the query were to be rewritten to explicitly reference the respective entity.
  • the order of the entities can be at least partially based on the respective results. For example, entities associated with results of relatively higher quality can be displayed higher in the order than entities associated with results of relatively low quality.
  • each graphical representation of an entity provides information associated with the entity.
  • Example information includes a name of the entity, a type of the entity, an approximate distance between the user and the entity, one or more images of the entity, a rating of the entity, an address of the entity, and a telephone number of the entity.
  • a graphical representation of an entity can include one or more results that would be displayed, if the query were to be rewritten to explicitly reference the respective entity.
  • a graphical representation can be provided for each restaurant, each graphical representation including a result that is representative of a menu for the respective restaurant, e.g., a hyperlink to the menu, a snippet of the menu, a thumbnail image of the menu.
  • the query is rewritten to provide a revised query that explicitly references the entity selected by the user.
  • the revised query is provided to a search service.
  • the search service provides search results that are responsive to the revised query, which search results are displayed to the user.
  • an entity can be automatically selected without user input, e.g., without displaying a selection interface to the user.
  • whether to automatically select an entity can be determined based on time, queries, and/or location.
  • it can be determined that the user recently selected an entity using a selection interface, as described herein. For example, it can be determined that the user selected an entity a threshold time ago, e.g., X minute(s) ago, where X is a positive value.
  • the entity can be selected.
  • the user selected an entity a threshold number of queries ago e.g., X quer(y/ies) ago, where X is a positive value.
  • the entity can be selected.
  • the entity can be selected.
  • the selection interface is not displayed, if the approximate location is determined not to be sufficiently fresh and/or not to be sufficiently accurate.
  • a time difference can be determined between a current time and the time associated with the approximate location, and can be compared to a threshold difference. In some examples, if the time difference exceeds the threshold difference, the approximate location is determined to not be sufficiently fresh. Consequently, a selection interface is not displayed.
  • an accuracy of the user's location can be provided and can be compared to an accuracy threshold. For example, the accuracy of the user's location can be provided as 50 ft., meaning that the approximate location is accurate within 50 ft. An example accuracy threshold can be provided as 20 ft. If, for example, the accuracy exceeds the accuracy threshold, the approximate location is not sufficiently accurate. Consequently, a selection interface is not displayed
  • the query [show me room rates] is received, and it is determined that the entity "Great Hotel” 204 having the type [hotel], the entity “Not-so-Great Hotel” 206 having the type [hotel], the entity “Horrible Hotel” 208 having the type [hotel], and the entity “Awesome Pizza” 212 having the type [restaurant] are within the threshold distance of the approximate location of the user device 202. Consequently, the entities "Great Hotel” 204, “Not-so-Great Hotel” 206, “Horrible Hotel” 208 and “Awesome Pizza” 212 are included in a set of entities. In this example, it is determined that one or more of the n-grams [room], [rate] and [room rate] correlate to the type [hotel].
  • the entities "Great Hotel” 204, "Not-so-Great Hotel” 206 and “Horrible Hotel” 208 are included in a sub-set of entities. Also in this example, it is determined that none of the n-grams [room], [rate] and [room rate] correlate to the type [restaurant]. Consequently, the entity “Awesome Pizza” 212 is not included in the sub-set of entities. It is determined that the sub-set of entities includes multiple entities, e.g., "Great Hotel,” “Not-so-Great Hotel,” and “Horrible Hotel.” In response, a selection interface can be displayed to the user, the selection interface including respective graphical elements, etc.
  • FIG. 3 depicts an example selection interface 300 based on the example use case of FIG. 2.
  • the selection interface 300 includes graphical representations 302, 304, 306, respectively representing the entities "Great Hotel,” “Not- so-Great Hotel” and "Horrible Hotel.”
  • each graphical representation 302, 304, 306 includes data associated with the respective entity.
  • each graphical representation 302, 304, 306 provides a name 310 of the entity, an image 312 associated with the entity, contact information 314 for the entity, an approximate distance 316 between the user device 202 and the entity, and respective representations of search results 318, which include search results that would be provided, if the respective entity is selected.
  • the selection interface also includes a selection icon 328 that enables the user to search for other entities that are not graphically represented in the selection interface 300.
  • the user can select the selection icon 328, and graphical representations of additional entities that are within an enlarged threshold distance from the approximate location of the user device 202 can be displayed.
  • the graphical representation 302 of the entity "Great Hotel” can be displayed more prominently than the graphical representation 304 of the entity "Not-so-Great Hotel,” and the graphical representation 306 of the entity "Horrible Hotel.”
  • the user can select an entity using the selection interface 300.
  • the user can tap-on a graphical representation 302, 304, 306 to select an entity.
  • the user selects "Great Hotel” using the selection interface 300.
  • the query can be rewritten to explicitly reference the entity "Great Hotel” instead of the entity "Not-so-Great Hotel” or the entity "Horrible Hotel.”
  • the query [show me room rates] can be rewritten to provide the revised query [room rates Great Hotel].
  • the revised query can be provided to a search service, and search results that are responsive to the revised query can be received.
  • the search query [show me room rates] was received, the entities "Great Hotel,” “Not-so-Great Hotel” and “Horrible Hotel” were included in the sub-set of entities, a selection interface was displayed to the user, the selection interface including respective graphical representations of "Great Hotel,” “Not- so-Great Hotel,” and “Horrible Hotel,” the user selected "Great Hotel,” and the query was rewritten to explicitly reference "Great Hotel.”
  • the search query [show me room images] is next received. It is determined that the entity "Great Hotel” 204 having the type [hotel], the entity “Not-so-Great Hotel” 206 having the type [hotel], the entity “Horrible Hotel” 208 having the type [hotel], and the entity “Awesome Pizza” 212 having the type [restaurant] are within the threshold distance of the approximate location of the user device 202. Consequently, the entities "Great Hotel” 204, "Not-so- Great Hotel” 206, “Horrible Hotel” 208 and “Awesome Pizza” 212 are included in a set of entities.
  • n-grams [room], [images] and [room images] correlate to the type [hotel]. Consequently, the entities "Great Hotel” 204, "Not-so-Great Hotel” 206 and "Horrible Hotel” are included in a subset of entities. Also in this example, it is determined that the n-grams [room] and [room images] do not correlate to the type [restaurant]. Consequently, the entity "Awesome Pizza" 212 is not included in the sub-set of entities.
  • the sub-set of entities includes multiple entities, e.g., "Great Hotel,” “Not-so-Great Hotel” and “Horrible Hotel.” It is also determined that the user device 202 is within a threshold distance of when the previous query [show me room rates] was received, and that the user previously selected "Great Hotel” from a previously displayed selection interface, e.g., the selection interface 300 of FIG. 3. In response, the entity "Great Hotel” is automatically selected, without displaying a selection interface to the user, and the query was rewritten to explicitly reference "Great Hotel.”
  • the revised query can be provided to a search service, and search results that are responsive to the revised query can be received.
  • the search query [show me room images] is received, and it is determined that the entity "Great Hotel” 204 having the type [hotel], the entity “Not-so- Great Hotel” 206 having the type [hotel], and the entity “Horrible Hotel” 208 having the type [hotel] are within the threshold distance of the approximate location of the user device 202. Consequently, the entities "Great Hotel” 204, “Not-so-Great Hotel” 206 and “Horrible Hotel” 208 are included in a set of entities. In this example, it is determined that one or more of the n-grams [room], [images] and [room images] correlate to the type [hotel]. Consequently, the entities "Great Hotel,” “Not-so-Great Hotel” and “Horrible Hotel” are included in a sub-set of entities.
  • the sub-set of entities includes multiple entities, e.g., "Great Hotel,” “Not-so-Great Hotel,” and “Horrible Hotel,” and that the user previously selected "Great Hotel” from a previously displayed selection interface that was displayed in response to the query [show me room rates], e.g., the selection interface 300 of FIG. 3.
  • the user device 202 is approximately Y feet away from the approximate location, from which the user device 202 had previously submitted the query [show me room rates], and that Y is greater than a threshold distance, e.g., X.
  • a selection interface is displayed to the user, because the user device 202 is greater than the threshold distance away from where the user had submitted the query [show me room rates].
  • the selection interface can include respective graphical representations of "Great Hotel,” “Not-so-Great Hotel,” and “Horrible Hotel,” and the user selects "Horrible Hotel” using the selection interface.
  • the query can be rewritten to explicitly reference the entity “Horrible Hotel” instead of the entities "Great Hotel” and/or "Not-so-Great Hotel.”
  • the query [show me room images] can be rewritten to provide the revised query [room images Horrible Hotel].
  • the revised query can be provided to a search service, and search results that are responsive to the revised query can be received.
  • FIG. 4 depicts an example process 400 that can be executed in accordance with implementations of the present disclosure.
  • the example process 400 can be implemented, for example, by the example environment 100 of FIG. 1, e.g., the search system 120 and/or the query interpretation system 130.
  • the example process 400 can be provided by one or more computer-executable programs executed using one or more computing devices.
  • a query Q is received (402).
  • the search system 120 and/or the query interpretation system 130 receives a query from a user device 106, 202.
  • Location data is received (404).
  • the search system 120 and/or the query interpretation system 130 receives a location data from the user device 106, 202.
  • the location data can be provided with the query.
  • a set of entities is received based on the location data (406).
  • entities e within a threshold distance of the user device 106, 202 are determined and are included in the set of entities.
  • the set of entities includes m entities, where m is greater than or equal to 1.
  • a counter is set equal to 1 (408).
  • the query Q is determined whether the query Q is implicitly about an entity a (410). In some examples, and as described herein, if one or more n-grams of the query Q are associated with a type of the entity ei, the query Q is determined to be implicitly about the entity e If the query Q is determined to be implicitly about the entity ei, the entity ei is included in a sub-set of entities (412). If the query Q is not determined to be implicitly about the entity ei, the entity ei is not included in a sub-set of entities, and it is determined whether the counter is equal to m (414). If the counter is not equal to m, all entities in the set of entities have not been considered. Consequently, the counter is incremented (416), and the example process 400 loops back.
  • the query Q is rewritten based on the entity to provide the revised query (420). If the sub-set of entities includes multiple entities, it is determined whether an entity is to be automatically selected from the sub-set of entities (422). In some examples, and as described herein, whether to automatically select an entity can be determined based on time, queries and/or location. If an entity is to be automatically selected from the sub-set of entities, an entity is selected (424) and the query Q is rewritten based on the entity to provide the revised query (420).
  • a selection interface is provided for display to the user (426).
  • the search system 120 and/or the query interpretation system 130 provides instructions to the user device 106, 202 to display the selection interface.
  • a user selection is received (428).
  • the user can selected a graphical representation of an entity from the selection interface, and data indicating the user selection, e.g., indicating the selected entity, is received by the search system 120 and/or the query interpretation system 130.
  • the query Q is rewritten based on the entity to provide the revised query (420). As described herein, the revised query is used, e.g., by the search service 120, to provide search results responsive to the query Q.
  • FIG. 5 depicts an example process 500 that can be executed in accordance with implementations of the present disclosure.
  • the example process 500 can be implemented, for example, by the example environment 100 of FIG. 1, e.g., the search system 120 and/or the query interpretation system 130.
  • the example process 500 can be provided by one or more computer-executable programs executed using one or more computing devices.
  • a query is received (502).
  • the search system 120 and/or the query interpretation system 130 receives a query from a user device 106, 202. It is determined whether the query is implicitly about some entity (504). For example, it can be determined that the query is absent an explicit reference to an entity. Consequently, it can be determined that the query is implicitly about some entity. As another example, it can be determined that the query includes a pronoun. Consequently, it can be determined that the query is implicitly about an entity. If it is determined that the query is not implicitly about some entity, search results are provided based on the query (506). For example, the search system 120 can receive search results that are responsive to the query, and can provide the search results to the user device 106, 202.
  • an approximate location of the user device is obtained (508).
  • the search system 120 and/or the query interpretation system 130 receives location data from the user device 106, 202.
  • the location data can be provided with the query.
  • a set of entities is obtained (510).
  • the set of entities includes one or more entities that are each a threshold distance from the approximate location of the user device 106, 202.
  • the query is implicitly about an entity in the set of entities (514).
  • the entity is selected, based on user input to a selection interface that is displayed on the user device. In some examples, and as described herein, the selection interface in response to determining that the query is implicitly about at least two entities in the set of entities.
  • a revised query is provided (516).
  • the query is rewritten to explicitly reference the entity.
  • Search results are provided based on the revised query (518).
  • the search system 120 can receive search results that are responsive to the revised query, and can provide the search results to the user device 106, 202.
  • Implementations of the subject matter and the operations described in this specification can be realized in digital electronic circuitry, or in computer software, firmware, or hardware, including the structures disclosed in this specification and their structural equivalents, or in combinations of one or more of them. Implementations of the subject matter described in this specification can be realized using one or more computer programs, i.e., one or more modules of computer program instructions, encoded on computer storage medium for execution by, or to control the operation of, data processing apparatus. Alternatively or in addition, the program instructions can be encoded on an artificially-generated propagated signal, e.g., a machine-generated electrical, optical, or electromagnetic signal that is generated to encode information for transmission to suitable receiver apparatus for execution by a data processing apparatus.
  • an artificially-generated propagated signal e.g., a machine-generated electrical, optical, or electromagnetic signal that is generated to encode information for transmission to suitable receiver apparatus for execution by a data processing apparatus.
  • a computer storage medium can be, or be included in, a computer-readable storage device, a computer- readable storage substrate, a random or serial access memory array or device, or a combination of one or more of them. Moreover, while a computer storage medium is not a propagated signal, a computer storage medium can be a source or destination of computer program instructions encoded in an artificially-generated propagated signal. The computer storage medium can also be, or be included in, one or more separate physical components or media (e.g., multiple CDs, disks, or other storage devices).
  • the operations described in this specification can be implemented as operations performed by a data processing apparatus on data stored on one or more computer- readable storage devices or received from other sources.
  • the term "data processing apparatus” encompasses all kinds of apparatus, devices, and machines for processing data, including by way of example a programmable processor, a computer, a system on a chip, or multiple ones, or combinations, of the foregoing
  • the apparatus can include special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application-specific integrated circuit).
  • the apparatus can also include, in addition to hardware, code that creates an execution environment for the computer program in question, e.g., code that constitutes processor firmware, a protocol stack, a database management system, an operating system, a cross- platform runtime environment, a virtual machine, or a combination of one or more of them.
  • the apparatus and execution environment can realize various different computing model infrastructures, such as web services, distributed computing and grid computing infrastructures.
  • a computer program (also known as a program, software, software application, script, or code) can be written in any form of programming language, including compiled or interpreted languages, declarative or procedural languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, object, or other unit suitable for use in a computing environment.
  • a computer program may, but need not, correspond to a file in a file system.
  • a program can be stored in a portion of a file that holds other programs or data (e.g., one or more scripts stored in a markup language document), in a single file dedicated to the program in question, or in multiple coordinated files (e.g., files that store one or more modules, sub-programs, or portions of code).
  • a computer program can be deployed to be executed on one computer or on multiple computers that are located at one site or distributed across multiple sites and interconnected by a communication network.
  • the processes and logic flows described in this specification can be performed by one or more programmable processors executing one or more computer programs to perform actions by operating on input data and generating output.
  • the processes and logic flows can also be performed by, and apparatus can also be implemented as, special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application-specific integrated circuit).
  • processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital computer.
  • a processor will receive instructions and data from a read-only memory or a random access memory or both.
  • Elements of a computer can include a processor for performing actions in accordance with instructions and one or more memory devices for storing instructions and data.
  • a computer will also include, or be operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto-optical disks, or optical disks.
  • mass storage devices for storing data
  • a computer need not have such devices.
  • a computer can be embedded in another device, e.g., a mobile telephone, a personal digital assistant (PDA), a mobile audio or video player, a game console, a Global Positioning System (GPS) receiver, or a portable storage device (e.g., a universal serial bus (USB) flash drive), to name just a few.
  • PDA personal digital assistant
  • GPS Global Positioning System
  • USB universal serial bus
  • Devices suitable for storing computer program instructions and data include all forms of non-volatile memory, media and memory devices, including by way of example semiconductor memory devices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto-optical disks; and CD-ROM and DVD-ROM disks.
  • semiconductor memory devices e.g., EPROM, EEPROM, and flash memory devices
  • magnetic disks e.g., internal hard disks or removable disks
  • magneto-optical disks e.g., CD-ROM and DVD-ROM disks.
  • the processor and the memory can be supplemented by, or incorporated in, special purpose logic circuitry.
  • implementations of the subject matter described in this specification can be implemented on a computer having a display device, e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor, for displaying information to the user and a keyboard and a pointing device, e.g., a mouse or a trackball, by which the user can provide input to the computer.
  • a display device e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor
  • keyboard and a pointing device e.g., a mouse or a trackball
  • Other kinds of devices can be used to provide for interaction with a user as well; for example, feedback provided to the user can be any form of sensory feedback, e.g., visual feedback, auditory feedback, or tactile feedback; and input from the user can be received in any form, including acoustic, speech, or tactile input.
  • a computer can interact with a user by sending documents to and receiving documents from a device that is used
  • Implementations of the subject matter described in this specification can be implemented in a computing system that includes a back-end component, e.g., as a data server, or that includes a middleware component, e.g., an application server, or that includes a front-end component, e.g., a client computer having a graphical user interface or a Web browser through which a user can interact with an implementation of the subject matter described in this specification, or any combination of one or more such back-end, middleware, or front-end components.
  • the components of the system can be
  • Examples of communication networks include a local area network (“LAN”) and a wide area network (“WAN”), an inter-network (e.g., the
  • peer-to-peer networks e.g., ad hoc peer-to-peer networks.
  • the computing system can include clients and servers.
  • a client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
  • a server transmits data (e.g., an HTML page) to a client device (e.g., for purposes of displaying data to and receiving user input from a user interacting with the client device).
  • client device e.g., for purposes of displaying data to and receiving user input from a user interacting with the client device.
  • Data generated at the client device e.g., a result of the user interaction

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving a first query provided from a user device, and determining that the first query is implicitly about some entity, and in response: obtaining a first approximate location of the user device when the user device provided the first query, obtaining a first set of entities including a plurality of entities, each entity in the first set of entities being associated with the first approximate location, determining that the first query is implicitly about at least two entities in the first set of entities, and in response providing a first selection interface for display to the user, the first selection interface including graphical representations, each graphical representation representing a respective entity of the at least two entities of the first set of entities, receiving user input indicating user selection of a first entity, and providing a first revised query based on the first query and the first entity, the first revised query explicitly referencing the first entity.

Description

DISAMBIGUATION OF QUERIES IMPLICIT
TO MULTIPLE ENTITIES
BACKGROUND
The Internet provides access to a wide variety of resources, such as image files, audio files, video files, and web pages. A search system can identify resources in response to queries submitted by users and provide information about the resources in a manner that is useful to the users. The users can navigate through, e.g., click on, search results to acquire information of interest.
SUMMARY
This specification relates to interpreting user queries based on a location of a user device.
Implementations of the present disclosure are generally directed to rewriting queries based on one or more implied entities. More particularly, implementations of the present disclosure are directed to identifying a set of entities based on an approximate location of a user device that submits a query, determining that the query is implicitly directed to multiple entities of the set of entities, selectively providing an interface for disambiguation of the multiple entities, and rewriting the query to explicitly reference an entity of the set of entities. In some examples, the entity is identified by the user using the disambiguation interface.
In general, innovative aspects of the subject matter described in this specification can be embodied in methods that include actions of receiving a first query provided from a user device, and determining that the first query is implicitly about some entity, and in response: obtaining a first approximate location of the user device when the user device provided the first query, obtaining a first set of entities including a plurality of entities, each entity in the first set of entities being associated with the first approximate location, determining that the first query is implicitly about at least two entities in the first set of entities, and in response providing a first selection interface for display to the user, the first selection interface including graphical representations, each graphical representation representing a respective entity of the at least two entities of the first set of entities, receiving user input indicating user selection of a first entity, and providing a first revised query based on the first query and the first entity, the first revised query explicitly referencing the first entity. Other implementations of this aspect include corresponding systems, apparatus, and computer programs, configured to perform the actions of the methods, encoded on computer storage devices.
These and other implementations can each optionally include one or more of the following features: a graphical representation includes one or more of a name of a respective entity, a type of the respective entity, an approximate distance between the user device and the respective entity, one or more images of the respective entity, a rating of the respective entity, an address of the respective entity, a telephone number of the respective entity, and one or more results that would be displayed, if the respective entity is selected; actions further include: receiving a second query provided from the user device, and determining that the second query is implicitly about some entity, and in response: obtaining a second approximate location of the user device when the user device provided the second query, obtaining a second set of entities including a plurality of entities, each entity in the second set of entities being associated with the second approximate location, determining that the second query is implicitly about at least two entities in the second set of entities, automatically, without displaying a second selection interface, selecting a second entity, and providing a second revised query based on the second query and the second entity, the second revised query explicitly referencing the second entity; automatically, without displaying a second selection interface, selecting a second entity is performed based on one or more of time, queries and location; the first entity and the second entity are the same entity; determining that the first query is implicitly about the at least two entities includes providing a sub-set of entities based on comparing one or more types of one or more entities of the first set of entities to one or more types associated with the first query, the at least two entities being included in the sub-set of entities; the one or more types associated with the first query are provided from an index that associates types to respective n-grams of queries; and, if a location of a respective entity is within a threshold distance of the approximate location, the respective location is included in the first set of entities.
Particular implementations of the subject matter described in this specification can be implemented so as to realize one or more of the following advantages. In some examples, a disambiguation interface can be selectively provided to the user. For example, previous information can be processed to determine whether to provide the disambiguation interface, or to automatically select an entity without user input. In this manner, interaction with the user can be streamlined. In some examples, the user submitting the query does not need to know the name of the entity that is the subject of the query. For example, the user can be standing near a monument and can submit the query [what is this monument], without having to first determine the name of the monument. In some examples, the user does not need to know how to properly pronounce and/or spell the name of the entity. For example, a user that does not speak German can be on vacation in Zurich, Switzerland and can submit the query [opening hours], while standing near a restaurant called "Zeughauskeller," which may be difficult to pronounce and/or spell for the user. As another example, implementations of the present disclosure enable users to more conveniently and naturally interact with a search system (e.g., submitting the query [show me lunch specials] instead of the query [Fino Ristorante & Bar lunch specials]. These examples highlight simplification of information retrieval and increased information accessibility to users that are provided by implementations of the present disclosure. For example, users do not have to first submit queries to determine the name of an entity, before submitting queries to retrieve other information about the entity. This also provides the advantage of reducing the overall number of queries that might be submitted to a search system, thereby reducing the bandwidth, computational power, and/or memory required by the search system.
The details of one or more implementations of the subject matter described in this specification are set forth in the accompanying drawings and the description below.
Other features, aspects, and advantages of the subject matter will become apparent from the description, the drawings, and the claims.
BRIEF DESCRIPTION OF THE DRAWINGS FIG. 1 depicts an example environment in which a search system provides search results based on interpreted user queries.
FIG. 2 depicts an example use cases in accordance with implementations of the present disclosure.
FIG. 3 depicts an example disambiguation interface based on the example use case of FIG. 2.
FIG. 4 depicts an example process that can be executed in accordance with implementations of the present disclosure.
FIG. 5 depicts an example process that can be executed in accordance with implementations of the present disclosure.
Like reference numbers and designations in the various drawings indicate like elements. DETAILED DESCRIPTION
Implementations of the present disclosure are generally directed to rewriting queries based on one or more implied entities. More particularly, implementations of the present disclosure are directed to determining one or more entities that a received query is implicitly about, selecting an entity of the one or more entities, and rewriting the query to explicitly reference the selected entity. In some implementations, a set of entities is identified based on an approximate location of a user device that submits a query, each entity in the set of entities being determined to be sufficiently close to the approximate location. In some implementations, and as described in further detail herein, for each entity in the set of entities, it is determined whether the query may be implicitly referencing the entity. In some examples, for each entity, it is determined whether one or more n-grams of the query correlate to a type of the entity, and, if a correlation is provided, the query is deemed to be potentially about the respective entity, and the entity is added to a sub-set of entities. In some examples, an entity is selected from the sub-set of entities, and the query is rewritten to explicitly reference the selected entity. In some examples, search results are provided based on the revised query.
FIG. 1 depicts an example environment 100 in which a search system provides search results based on interpreted user queries. In some examples, the example environment 100 enables users to interact with one or more computer-implemented services. Example computer-implemented services can include a search service, an electronic mail service, a chat service, a document sharing service, a calendar sharing service, a photo sharing service, a video sharing service, blogging service, a micro- blogging service, a social networking service, a location (location-aware) service, a check-in service and a ratings and review service. In the example of FIG. 1, a search system 120 is depicted, which provides a search service, as described in further detail herein.
With continued reference to FIG. l,the example environment 100 includes a network 102, e.g., a local area network (LAN), wide area network (WAN), the Internet, or a combination thereof, connects web sites 104, user devices 106, and the search system 120. In some examples, the network 102 can be accessed over a wired and/or a wireless communications link. For example, mobile computing devices, such as smartphones can utilize a cellular network to access the network 102. The environment 100 may include millions of web sites 104 and user devices 106.
In some examples, a web site 104 is provided as one or more resources 105 associated with a domain name and hosted by one or more servers. An example web site is a collection of web pages formatted in an appropriate machine-readable language, e.g., hypertext markup language (HTML), that can contain text, images, multimedia content, and programming elements, e.g., scripts. Each web site 104 is maintained by a publisher, e.g., an entity that manages and/or owns the web site.
In some examples, a resource 105 is data provided over the network 102 and that is associated with a resource address, e.g., a uniform resource locator (URL). In some examples, resources 105 that can be provided by a web site 104 include web pages, word processing documents, and portable document format (PDF) documents, images, video, and feed sources, among other appropriate digital content. The resources 105 can include content, e.g., words, phrases, images and sounds and may include embedded information, e.g., meta information and hyperlinks, and/or embedded instructions, e.g., scripts.
In some examples, a user device 106 is an electronic device that is capable of requesting and receiving resources 105 over the network 102. Example user devices 106 include personal computers, laptop computers, and mobile computing devices, e.g., smartphones and/or tablet computing devices, that can send and receive data over the network 102. As used throughout this document, the term mobile computing device ("mobile device") refers to a user device that is configured to communicate over a mobile communications network. A smartphone, e.g., a phone that is enabled to communicate over the Internet, is an example of a mobile device. A user device 106 can execute a user application, e.g., a web browser, to facilitate the sending and receiving of data over the network 102.
In some examples, to facilitate searching of resources 105, the search system 120 identifies the resources 105 by crawling and indexing the resources 105 provided on web sites 104. Data about the resources 105 can be indexed based on the resource to which the data corresponds. The indexed and, optionally, cached copies of the resources 105 are stored in a search index 122.
The user devices 106 submit search queries 109 to the search system 120. In some examples, a user device 106 can include one or more input modalities. Example modalities can include a keyboard, a touchscreen and/or a microphone. For example, a user can use a keyboard and/or touchscreen to type in a search query. As another example, a user can speak a search query, the user speech being captured through a microphone, and being processed through speech recognition to provide the search query.
In response to receiving a search query 109, the search system 120 accesses the search index 122 to identify resources 105 that are relevant to, e.g., have at least a minimum specified relevance score for, the search query 109. The search system 120 identifies the resources 105, generates a search results display 111 that includes search results 1 12 identifying resources 105, and returns the search results display 1 11 to the user devices 106. In an example context, a search results display can include one or more web pages, e.g., one or more search results pages. In some examples, a web page can be provided based on a web document that can be written in any appropriate machine- readable language. It is contemplated, however, that implementations of the present disclosure can include other appropriate display types. For example, the search results can be provided in a display generated by an application that is executed on a computing device, and/or a display generated by an operating system, e.g., mobile operating system. In some examples, search results can be provided based on any appropriate form, e.g., Javascript-html, plaintext.
A search result 112 is data generated by the search system 120 that identifies a resource 105 that is responsive to a particular search query, and includes a link to the resource 105. An example search result 1 12 can include a web page title, a snippet of text or a portion of an image extracted from the web page, and the URL of the web page. In some examples, data provided in the search results 1 12 can be retrieved from a resource data store. For example, the search system 120 can provide the search results display 1 11, which displays the search results 1 12. In some examples, the search results display 11 1 can be populated with information, e.g., a web page title, a snippet of text or a portion of an image extracted from the web page, that is provided from the resource data store.
In some examples, data for the search queries 109 submitted during user sessions are stored in a data store, such as the historical data store 124. For example, the search system 120 can store received search queries in the historical data store 124.
In some examples, selection data specifying actions taken in response to search results 1 12 provided in response to each search query 109 are also stored in the historical data store 124, for example, by the search system 120. These actions can include whether a search result 112 was selected, e.g., clicked or hovered over with a pointer. The selection data can also include, for each selection of a search result 1 12, data identifying the search query 109 for which the search result 1 12 was provided. In accordance with implementations of the present disclosure, the example environment 100 also includes a query interpretation system 130 communicably coupled to the search system 120, e.g., directly coupled or coupled over a network such as network 102. Although the search system 120 and the query interpretation system 120 are depicted as separate systems in FIG. 1, it is contemplated that the search system 120 can include the query interpretation system 130. In some implementations, the query interpretation system 130 provides a revised query based on a query received from a user device 106. In some examples, and as described in further detail herein, the revised query is provided based on an approximate location of the user device 106 that provided the query, and one or more entities that are determined to be proximate to the user device 106. In some examples, the search results 112 provided by the search system 120 are responsive to the revised query.
In some implementations, a plurality of entities and information associated therewith can be stored as structured data in a knowledge graph. In some examples, a knowledge graph includes a plurality of nodes and edges between nodes. In some examples, a node represents an entity and an edge represents a relationship between entities. In some examples, the knowledge graph can be provided based on an example schema that structures data based on domains, types and properties. In some examples, a domain includes one or more types that share a namespace. In some examples, a namespace is provided as a directory of uniquely named objects, where each object in the namespace has a unique name, e.g., identifier. In some examples, a type denotes an "is a" relationship about a topic, and is used to hold a collection of properties. In some examples, a topic represents an entity, such as a person, place or thing. In some examples, each topic can have one or more types associated therewith. In some examples, a property is associated with a topic and defines a "has a" relationship between the topic and a value of the property. In some examples, the value of the property can include another topic.
In some implementations, a plurality of entities can be provided in one or more databases. For example, a plurality of entities can be provided in a table that can provide data associated with each entity. Example data can include a name of the entity, a location of the entity, one or more types, described in further detail below, assigned to the entity, one or more ratings associated with the entity, and any other appropriate information that can be provided for the entity.
Implementations of the present disclosure are generally directed to rewriting queries based on one or more implied entities. More particularly, implementations of the present disclosure are directed to identifying a set of entities based on an approximate location of a user device that submits a query, determining that the query is implicitly directed to multiple entities of the set of entities, selectively providing an interface for disambiguation of the multiple entities, and rewriting the query to explicitly reference an entity of the set of entities. In some examples, the entity is identified by the user using the disambiguation interface.
Implementations of the present disclosure will be described in further detail herein with reference to example use cases. FIG. 2 depicts an example use cases in accordance with implementations of the present disclosure. In the example of FIG. 2, an area 200 is provided, in which a user device 202 is located. Multiple entities are also located within the area 200. Example entities can include a restaurant, a bar, a hotel, a theater, a school, a university, a music venue, a tourist attraction, and a park. It is appreciated that implementations of the present disclosure are not limited to entities specifically identified herein. In the depicted example, an entity "Great Hotel" 204, an entity "Not-so-Great Hotel" 206, an entity "Horrible Hotel" 208, an entity "Acceptable Hotel" 210, and an entity "Awesome Pizza" 212 are provided in the area 200.
In accordance with implementations of the present disclosure, a query is received from a user device. For example, the user device 202 can provide a query to a search system, e.g., the search system 120 of FIG. 1. In some examples, it can be determined that the query is implicitly about an entity. In some examples, if the query is absent an explicit reference to an entity, it can be determined that the query is implicitly about an entity. In some examples, if the query includes a pronoun, it can be determined that the query is implicitly about an entity. In some implementations, the received query can be processed and can be compared to one or more query patterns to determine whether the query is implicitly referencing an entity. In some examples, the query is processed to remove one or more stop words and/or conversational terms to provide a cleaned query. For example, the example query [can you please show me ratings] can be processed to provide
[ratings]. In some examples, the cleaned query is compared to one or more query patterns, each query pattern indicating an implicit reference to some entity. In some examples, if the cleaned query matches a query pattern, it is determined that the query is implicitly about some entity. In some examples, if the cleaned query does not match a query pattern, it is determined that the query is not implicitly about some entity. For example, the clean query [ratings], from the example above, can match a query pattern [ratings]. Consequently, it can be determined that the query [can you please show me ratings] is implicitly about some entity.
As another example, the example query [can you please show me ratings for Awesome Pizza] can be processed to provide the cleaned query [ratings Awesome Pizza]. The cleaned query [ratings Awesome Pizza] can be compared to one or more query patterns, and it can be determined that the cleaned query does not match any of the one or more query patterns. For example, the term "Awesome Pizza" is specific to an entity, and is thus, not included in any of the query patterns. Consequently, it is determined that the query [can you please show me ratings for Awesome Pizza] is not implicitly about some entity. Instead, and in this example, the query [can you please show me ratings for Awesome Pizza] is explicitly about an entity, i.e., Awesome Pizza.
In some implementations, in response to determining that the query is implicitly about an entity, an approximate location of the user device that submitted the query can be determined. In some examples, the approximate location can be determined based on one or more location-related signals. Example location related-signals can include cellular signals, global-positioning system (GPS) signals, wireless signals (WiFi), and the like. In some examples, location data that is representative of the one or more location-related signals is provided with the query. In some examples, the location data is provided to a location service, which processes the location data and provides the approximate location of the user device. In some examples, the approximate location can be provided as geographic coordinates, e.g., longitude, latitude.
In some examples, a time can be associated with the approximate location. For example, the location service can provide the approximate location and a time associated therewith. In some examples, the time indicates an approximate time, at which the user device is determined to be or have been at the approximate location. In some
implementations, if the approximate location is not determined to be sufficiently fresh, the query is not interpreted. In some examples, a time difference can be determined between a current time and the time associated with the approximate location, and can be compared to a threshold difference. In some examples, if the time difference exceeds the threshold difference, the approximate location is determined to not be sufficiently fresh. In some examples, the current time is provided as the time, at which the query was sent from the user device, or received by the search system.
In some implementations, a set of entities is provided based on the approximate location. For example, if the confidence score exceeds the threshold confidence score and/or the approximate location is sufficiently fresh, the set of entities is provided, as described below. In some examples, the set of entities includes one or more entities. In some implementations, the set of entities includes one or more entities that are determined to be sufficiently close to the approximate location, such that the query could be potentially about the one or more entities.
In some implementations, the one or more entities are selected for inclusion in the set of entities based on a location of a respective entity relative to the approximate location. In some examples, entities that are within a threshold distance, e.g., threshold radius, of the location can be identified, and are included in the set of entities. For example, a geographic location can be divided into a grid that includes a plurality of cells, e.g., square or rectangular cells, each cell accounting for an area of the geographic location. In some examples, each cell is associated with a set of entities that are located in the respective area. In some examples, a cell, within which the approximate location lies can be identified, and can be provided as a central cell. In some examples, a plurality of surrounding cells that surround the central cell can be identified. For example, any cell that is at least partially within the threshold distance can be provided as a surrounding cell and can be included in the plurality of surrounding cells. In some examples, the set of entities includes all entities associated with the central cell and each of the surrounding cells in the plurality of surrounding cells.
In the example of FIG. 2, a threshold radius 220 is provided, which defines a circular area 222. Although the example area 222 is circular, it is contemplated that the area can include any appropriate geometric shape. In some examples, entities that lie within the area 222 are included in the set of entities. Consequently, in the example of FIG. 2, the entities "Great Hotel" 204, "Not-so-Great Hotel" 206, "Horrible Hotel" 208 and "Acceptable Hotel" 210 are included in the set of entities. In some examples, the approximate location can be provided to an entity service, which processes the approximate location and provides the set of entities.
In some implementations, each entity is associated with a respective type. In some examples, the type corresponds to a category associated with the entity. For example, the entity "Great Hotel" 204 can be provided as a type [hotel]. As another example, the entity "Awesome Pizza" 212 can be provided as a type [restaurant]. In some examples, the entity service provides the respective type for each entity in the set of entities. In some examples, the type of an entity is determined from the knowledge graph, or database, which store information about respective entities, as described above. In accordance with implementations of the present disclosure, for each entity in the set of entities, it is determined whether the query is implicitly referencing the entity. In some examples, the type of the entity is cross-referenced to an index. In some examples, the index associates types to respective n-grams of queries. In some examples, an n-gram includes one or more terms of a query. If it is determined, based on the index, that one or more n-grams of the query correlate to the type, the query is deemed to be potentially about the respective entity, and the entity is added to a sub-set of entities.
In some implementations, the index is provided based on analysis of one or more search logs. In some examples, the search logs provide a record of received queries. In some examples, the search logs can be processed to provide the index. For example, the search logs can be processed to determine that queries including the term [menu] correlate to the type [restaurant]. As another example, the search logs can be processed to determine that queries including the term [reviews] correlate to the types [restaurant], [hotel], [movie], [play], [store] and [service provider]. In some examples, the index can be provided to correlate the term [menu] to the type [restaurant], and to correlate the terms [reviews] to the types [restaurant], [hotel], [movie], [play], [store], and [service provider].
In some implementations, a revised query is provided based on the query and at least one entity in the sub-set of entities. In some examples, the revised query is provided by rewriting the query to explicitly reference the at least one entity. In some examples, if the sub-set of entities includes a plurality of entities, an entity is selected from the set of entities.
In accordance with implementations of the present disclosure, an entity of the subset of entities can be selected. In some examples, the entity is selected based on user input. In some implementations, a selection interface is displayed to the user in response to multiple entities being included in the sub-set of entities. In some examples, the selection interface includes a graphical representation of each entity in the sub-set of entities.
In some implementations, a list of entities is represented in the selection interface. In some examples, the list of entities includes an ordered list that has the entities ordered based on one or more attributes associated with the respective entities, and/or results that would be displayed based on respective revised queries that would be provided based on the respective entities. Example attributes include an approximate distance between the user and the entity, a rating of the entity, a popularity of the entity, and whether the user has previously submitted a query about the respective entity. Example attributes also include one or more user interactions. Example user interactions include locations physically visited by the user, a web page previously visited by the user. In some examples, respective results can be provided for each entity, which can include results that would be provided, if the query were to be rewritten to explicitly reference the respective entity. In some examples, the order of the entities can be at least partially based on the respective results. For example, entities associated with results of relatively higher quality can be displayed higher in the order than entities associated with results of relatively low quality.
In some implementations, each graphical representation of an entity provides information associated with the entity. Example information includes a name of the entity, a type of the entity, an approximate distance between the user and the entity, one or more images of the entity, a rating of the entity, an address of the entity, and a telephone number of the entity. In some examples, a graphical representation of an entity can include one or more results that would be displayed, if the query were to be rewritten to explicitly reference the respective entity. For example, if the user provides the query [show me the menu], and the query implicates two restaurants, a graphical representation can be provided for each restaurant, each graphical representation including a result that is representative of a menu for the respective restaurant, e.g., a hyperlink to the menu, a snippet of the menu, a thumbnail image of the menu.
In response to user input to the selection interface, e.g., user selection of an entity, the query is rewritten to provide a revised query that explicitly references the entity selected by the user. In some implementations, the revised query is provided to a search service. In some examples, the search service provides search results that are responsive to the revised query, which search results are displayed to the user.
In some implementations, although multiple entities are included in the sub-set of entities, an entity can be automatically selected without user input, e.g., without displaying a selection interface to the user. In some implementations, whether to automatically select an entity can be determined based on time, queries, and/or location. In some examples, it can be determined that the user recently selected an entity using a selection interface, as described herein. For example, it can be determined that the user selected an entity a threshold time ago, e.g., X minute(s) ago, where X is a positive value. In some examples, if an entity was selected within the threshold time, and the entity is included in the sub-set of entities, the entity can be selected. As another example, it can be determined that the user selected an entity a threshold number of queries ago, e.g., X quer(y/ies) ago, where X is a positive value. In some examples, if an entity was selected within the threshold number of queries, and the entity is included in the sub-set of entities, the entity can be selected. As another example, it can be determined that the user selected an entity within X feet of a current location of the user, where X is a positive value. In some examples, if an entity was previously selected within the threshold distance, and the entity is included in the sub-set of entities, the entity can be selected. As another example, it can be determined that the user recently submitted a query that explicitly included an entity. Consequently, the entity can be automatically selected from the sub-set of entities.
In some implementations, the selection interface is not displayed, if the approximate location is determined not to be sufficiently fresh and/or not to be sufficiently accurate. For example, and as described above, a time difference can be determined between a current time and the time associated with the approximate location, and can be compared to a threshold difference. In some examples, if the time difference exceeds the threshold difference, the approximate location is determined to not be sufficiently fresh. Consequently, a selection interface is not displayed. As another example, an accuracy of the user's location can be provided and can be compared to an accuracy threshold. For example, the accuracy of the user's location can be provided as 50 ft., meaning that the approximate location is accurate within 50 ft. An example accuracy threshold can be provided as 20 ft. If, for example, the accuracy exceeds the accuracy threshold, the approximate location is not sufficiently accurate. Consequently, a selection interface is not displayed
Implementations of the present disclosure are described in further detail with reference to examples based on the example use case of FIG. 2.
In one example, the query [show me room rates] is received, and it is determined that the entity "Great Hotel" 204 having the type [hotel], the entity "Not-so-Great Hotel" 206 having the type [hotel], the entity "Horrible Hotel" 208 having the type [hotel], and the entity "Awesome Pizza" 212 having the type [restaurant] are within the threshold distance of the approximate location of the user device 202. Consequently, the entities "Great Hotel" 204, "Not-so-Great Hotel" 206, "Horrible Hotel" 208 and "Awesome Pizza" 212 are included in a set of entities. In this example, it is determined that one or more of the n-grams [room], [rate] and [room rate] correlate to the type [hotel].
Consequently, the entities "Great Hotel" 204, "Not-so-Great Hotel" 206 and "Horrible Hotel" 208 are included in a sub-set of entities. Also in this example, it is determined that none of the n-grams [room], [rate] and [room rate] correlate to the type [restaurant]. Consequently, the entity "Awesome Pizza" 212 is not included in the sub-set of entities. It is determined that the sub-set of entities includes multiple entities, e.g., "Great Hotel," "Not-so-Great Hotel," and "Horrible Hotel." In response, a selection interface can be displayed to the user, the selection interface including respective graphical
representations of "Great Hotel," "Not-so-Great Hotel" and "Horrible Hotel."
FIG. 3 depicts an example selection interface 300 based on the example use case of FIG. 2. In the example of FIG. 3, the selection interface 300 includes graphical representations 302, 304, 306, respectively representing the entities "Great Hotel," "Not- so-Great Hotel" and "Horrible Hotel." In some examples, each graphical representation 302, 304, 306 includes data associated with the respective entity. In the depicted example, each graphical representation 302, 304, 306 provides a name 310 of the entity, an image 312 associated with the entity, contact information 314 for the entity, an approximate distance 316 between the user device 202 and the entity, and respective representations of search results 318, which include search results that would be provided, if the respective entity is selected. In the depicted example, the selection interface also includes a selection icon 328 that enables the user to search for other entities that are not graphically represented in the selection interface 300. For example, the user can select the selection icon 328, and graphical representations of additional entities that are within an enlarged threshold distance from the approximate location of the user device 202 can be displayed.
In this example, it can be determined that the entity "Great Hotel" has a better rating than the entity "Not-so-Great Hotel," and that the entity "Not-so-Great Hotel" has a better rating than the entity "Horrible Hotel." Consequently, the graphical representation 302 of the entity "Great Hotel" can be displayed more prominently than the graphical representation 304 of the entity "Not-so-Great Hotel," and the graphical representation 306 of the entity "Horrible Hotel." The user can select an entity using the selection interface 300. For example, the user can tap-on a graphical representation 302, 304, 306 to select an entity. For example, the user selects "Great Hotel" using the selection interface 300. In response, the query can be rewritten to explicitly reference the entity "Great Hotel" instead of the entity "Not-so-Great Hotel" or the entity "Horrible Hotel." For example, the query [show me room rates] can be rewritten to provide the revised query [room rates Great Hotel]. The revised query can be provided to a search service, and search results that are responsive to the revised query can be received. To summarize the above example, the search query [show me room rates] was received, the entities "Great Hotel," "Not-so-Great Hotel" and "Horrible Hotel" were included in the sub-set of entities, a selection interface was displayed to the user, the selection interface including respective graphical representations of "Great Hotel," "Not- so-Great Hotel," and "Horrible Hotel," the user selected "Great Hotel," and the query was rewritten to explicitly reference "Great Hotel."
In another example, which continues from the above example, the search query [show me room images] is next received. It is determined that the entity "Great Hotel" 204 having the type [hotel], the entity "Not-so-Great Hotel" 206 having the type [hotel], the entity "Horrible Hotel" 208 having the type [hotel], and the entity "Awesome Pizza" 212 having the type [restaurant] are within the threshold distance of the approximate location of the user device 202. Consequently, the entities "Great Hotel" 204, "Not-so- Great Hotel" 206, "Horrible Hotel" 208 and "Awesome Pizza" 212 are included in a set of entities. In this example, it is determined that one or more of the n-grams [room], [images] and [room images] correlate to the type [hotel]. Consequently, the entities "Great Hotel" 204, "Not-so-Great Hotel" 206 and "Horrible Hotel" are included in a subset of entities. Also in this example, it is determined that the n-grams [room] and [room images] do not correlate to the type [restaurant]. Consequently, the entity "Awesome Pizza" 212 is not included in the sub-set of entities.
It is determined that the sub-set of entities includes multiple entities, e.g., "Great Hotel," "Not-so-Great Hotel" and "Horrible Hotel." It is also determined that the user device 202 is within a threshold distance of when the previous query [show me room rates] was received, and that the user previously selected "Great Hotel" from a previously displayed selection interface, e.g., the selection interface 300 of FIG. 3. In response, the entity "Great Hotel" is automatically selected, without displaying a selection interface to the user, and the query was rewritten to explicitly reference "Great Hotel." The revised query can be provided to a search service, and search results that are responsive to the revised query can be received.
In another example, the search query [show me room images] is received, and it is determined that the entity "Great Hotel" 204 having the type [hotel], the entity "Not-so- Great Hotel" 206 having the type [hotel], and the entity "Horrible Hotel" 208 having the type [hotel] are within the threshold distance of the approximate location of the user device 202. Consequently, the entities "Great Hotel" 204, "Not-so-Great Hotel" 206 and "Horrible Hotel" 208 are included in a set of entities. In this example, it is determined that one or more of the n-grams [room], [images] and [room images] correlate to the type [hotel]. Consequently, the entities "Great Hotel," "Not-so-Great Hotel" and "Horrible Hotel" are included in a sub-set of entities.
It is determined that the sub-set of entities includes multiple entities, e.g., "Great Hotel," "Not-so-Great Hotel," and "Horrible Hotel," and that the user previously selected "Great Hotel" from a previously displayed selection interface that was displayed in response to the query [show me room rates], e.g., the selection interface 300 of FIG. 3. In this example, it is also determined that the user device 202 is approximately Y feet away from the approximate location, from which the user device 202 had previously submitted the query [show me room rates], and that Y is greater than a threshold distance, e.g., X. In this example, although the entity "Great Hotel" had been previously selected by the user, a selection interface is displayed to the user, because the user device 202 is greater than the threshold distance away from where the user had submitted the query [show me room rates]. In this example, the selection interface can include respective graphical representations of "Great Hotel," "Not-so-Great Hotel," and "Horrible Hotel," and the user selects "Horrible Hotel" using the selection interface. In response, the query can be rewritten to explicitly reference the entity "Horrible Hotel" instead of the entities "Great Hotel" and/or "Not-so-Great Hotel." For example, the query [show me room images] can be rewritten to provide the revised query [room images Horrible Hotel]. The revised query can be provided to a search service, and search results that are responsive to the revised query can be received.
FIG. 4 depicts an example process 400 that can be executed in accordance with implementations of the present disclosure. The example process 400 can be implemented, for example, by the example environment 100 of FIG. 1, e.g., the search system 120 and/or the query interpretation system 130. In some examples, the example process 400 can be provided by one or more computer-executable programs executed using one or more computing devices.
A query Q is received (402). For example, the search system 120 and/or the query interpretation system 130 receives a query from a user device 106, 202. Location data is received (404). For example, the search system 120 and/or the query interpretation system 130 receives a location data from the user device 106, 202. In some examples, the location data can be provided with the query. A set of entities is received based on the location data (406). In some examples, entities e within a threshold distance of the user device 106, 202 are determined and are included in the set of entities. In some examples, the set of entities includes m entities, where m is greater than or equal to 1. A counter is set equal to 1 (408).
It is determined whether the query Q is implicitly about an entity a (410). In some examples, and as described herein, if one or more n-grams of the query Q are associated with a type of the entity ei, the query Q is determined to be implicitly about the entity e If the query Q is determined to be implicitly about the entity ei, the entity ei is included in a sub-set of entities (412). If the query Q is not determined to be implicitly about the entity ei, the entity ei is not included in a sub-set of entities, and it is determined whether the counter is equal to m (414). If the counter is not equal to m, all entities in the set of entities have not been considered. Consequently, the counter is incremented (416), and the example process 400 loops back.
If the counter is equal to m, all entities in the set of entities have been considered, and it is determined whether the sub-set of entities includes multiple entities (418). If the sub-set of entities does not include multiple entities, the query Q is rewritten based on the entity to provide the revised query (420). If the sub-set of entities includes multiple entities, it is determined whether an entity is to be automatically selected from the sub-set of entities (422). In some examples, and as described herein, whether to automatically select an entity can be determined based on time, queries and/or location. If an entity is to be automatically selected from the sub-set of entities, an entity is selected (424) and the query Q is rewritten based on the entity to provide the revised query (420). If an entity is not to be automatically selected from the sub-set of entities, a selection interface is provided for display to the user (426). In some examples, the search system 120 and/or the query interpretation system 130 provides instructions to the user device 106, 202 to display the selection interface. A user selection is received (428). For example, the user can selected a graphical representation of an entity from the selection interface, and data indicating the user selection, e.g., indicating the selected entity, is received by the search system 120 and/or the query interpretation system 130. The query Q is rewritten based on the entity to provide the revised query (420). As described herein, the revised query is used, e.g., by the search service 120, to provide search results responsive to the query Q.
FIG. 5 depicts an example process 500 that can be executed in accordance with implementations of the present disclosure. The example process 500 can be implemented, for example, by the example environment 100 of FIG. 1, e.g., the search system 120 and/or the query interpretation system 130. In some examples, the example process 500 can be provided by one or more computer-executable programs executed using one or more computing devices.
A query is received (502). For example, the search system 120 and/or the query interpretation system 130 receives a query from a user device 106, 202. It is determined whether the query is implicitly about some entity (504). For example, it can be determined that the query is absent an explicit reference to an entity. Consequently, it can be determined that the query is implicitly about some entity. As another example, it can be determined that the query includes a pronoun. Consequently, it can be determined that the query is implicitly about an entity. If it is determined that the query is not implicitly about some entity, search results are provided based on the query (506). For example, the search system 120 can receive search results that are responsive to the query, and can provide the search results to the user device 106, 202.
If it is determined that the query is implicitly about some entity, an approximate location of the user device is obtained (508). For example, the search system 120 and/or the query interpretation system 130 receives location data from the user device 106, 202. In some examples, the location data can be provided with the query. A set of entities is obtained (510). For example, the set of entities includes one or more entities that are each a threshold distance from the approximate location of the user device 106, 202. It is determined that the query is implicitly about an entity in the set of entities (514). For example, the entity is selected, based on user input to a selection interface that is displayed on the user device. In some examples, and as described herein, the selection interface in response to determining that the query is implicitly about at least two entities in the set of entities. A revised query is provided (516). For example, the query is rewritten to explicitly reference the entity. Search results are provided based on the revised query (518). For example, the search system 120 can receive search results that are responsive to the revised query, and can provide the search results to the user device 106, 202.
Implementations of the subject matter and the operations described in this specification can be realized in digital electronic circuitry, or in computer software, firmware, or hardware, including the structures disclosed in this specification and their structural equivalents, or in combinations of one or more of them. Implementations of the subject matter described in this specification can be realized using one or more computer programs, i.e., one or more modules of computer program instructions, encoded on computer storage medium for execution by, or to control the operation of, data processing apparatus. Alternatively or in addition, the program instructions can be encoded on an artificially-generated propagated signal, e.g., a machine-generated electrical, optical, or electromagnetic signal that is generated to encode information for transmission to suitable receiver apparatus for execution by a data processing apparatus. A computer storage medium can be, or be included in, a computer-readable storage device, a computer- readable storage substrate, a random or serial access memory array or device, or a combination of one or more of them. Moreover, while a computer storage medium is not a propagated signal, a computer storage medium can be a source or destination of computer program instructions encoded in an artificially-generated propagated signal. The computer storage medium can also be, or be included in, one or more separate physical components or media (e.g., multiple CDs, disks, or other storage devices).
The operations described in this specification can be implemented as operations performed by a data processing apparatus on data stored on one or more computer- readable storage devices or received from other sources.
The term "data processing apparatus" encompasses all kinds of apparatus, devices, and machines for processing data, including by way of example a programmable processor, a computer, a system on a chip, or multiple ones, or combinations, of the foregoing The apparatus can include special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application-specific integrated circuit). The apparatus can also include, in addition to hardware, code that creates an execution environment for the computer program in question, e.g., code that constitutes processor firmware, a protocol stack, a database management system, an operating system, a cross- platform runtime environment, a virtual machine, or a combination of one or more of them. The apparatus and execution environment can realize various different computing model infrastructures, such as web services, distributed computing and grid computing infrastructures.
A computer program (also known as a program, software, software application, script, or code) can be written in any form of programming language, including compiled or interpreted languages, declarative or procedural languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, object, or other unit suitable for use in a computing environment. A computer program may, but need not, correspond to a file in a file system. A program can be stored in a portion of a file that holds other programs or data (e.g., one or more scripts stored in a markup language document), in a single file dedicated to the program in question, or in multiple coordinated files (e.g., files that store one or more modules, sub-programs, or portions of code). A computer program can be deployed to be executed on one computer or on multiple computers that are located at one site or distributed across multiple sites and interconnected by a communication network.
The processes and logic flows described in this specification can be performed by one or more programmable processors executing one or more computer programs to perform actions by operating on input data and generating output. The processes and logic flows can also be performed by, and apparatus can also be implemented as, special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application-specific integrated circuit).
Processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital computer. Generally, a processor will receive instructions and data from a read-only memory or a random access memory or both. Elements of a computer can include a processor for performing actions in accordance with instructions and one or more memory devices for storing instructions and data.
Generally, a computer will also include, or be operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto-optical disks, or optical disks. However, a computer need not have such devices. Moreover, a computer can be embedded in another device, e.g., a mobile telephone, a personal digital assistant (PDA), a mobile audio or video player, a game console, a Global Positioning System (GPS) receiver, or a portable storage device (e.g., a universal serial bus (USB) flash drive), to name just a few. Devices suitable for storing computer program instructions and data include all forms of non-volatile memory, media and memory devices, including by way of example semiconductor memory devices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto-optical disks; and CD-ROM and DVD-ROM disks. The processor and the memory can be supplemented by, or incorporated in, special purpose logic circuitry.
To provide for interaction with a user, implementations of the subject matter described in this specification can be implemented on a computer having a display device, e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor, for displaying information to the user and a keyboard and a pointing device, e.g., a mouse or a trackball, by which the user can provide input to the computer. Other kinds of devices can be used to provide for interaction with a user as well; for example, feedback provided to the user can be any form of sensory feedback, e.g., visual feedback, auditory feedback, or tactile feedback; and input from the user can be received in any form, including acoustic, speech, or tactile input. In addition, a computer can interact with a user by sending documents to and receiving documents from a device that is used by the user; for example, by sending web pages to a web browser on a user's client device in response to requests received from the web browser.
Implementations of the subject matter described in this specification can be implemented in a computing system that includes a back-end component, e.g., as a data server, or that includes a middleware component, e.g., an application server, or that includes a front-end component, e.g., a client computer having a graphical user interface or a Web browser through which a user can interact with an implementation of the subject matter described in this specification, or any combination of one or more such back-end, middleware, or front-end components. The components of the system can be
interconnected by any form or medium of digital data communication, e.g., a
communication network. Examples of communication networks include a local area network ("LAN") and a wide area network ("WAN"), an inter-network (e.g., the
Internet), and peer-to-peer networks (e.g., ad hoc peer-to-peer networks).
The computing system can include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other. In some implementations, a server transmits data (e.g., an HTML page) to a client device (e.g., for purposes of displaying data to and receiving user input from a user interacting with the client device). Data generated at the client device (e.g., a result of the user interaction) can be received from the client device at the server.
While this specification contains many specific implementation details, these should not be construed as limitations on the scope of any implementation of the present disclosure or of what may be claimed, but rather as descriptions of features specific to example implementations. Certain features that are described in this specification in the context of separate implementations can also be implemented in combination in a single implementation. Conversely, various features that are described in the context of a single implementation can also be implemented in multiple implementations separately or in any suitable sub-combination. Moreover, although features may be described above as acting in certain combinations and even initially claimed as such, one or more features from a claimed combination can in some cases be excised from the combination, and the claimed combination may be directed to a sub-combination or variation of a sub-combination.
Similarly, while operations are depicted in the drawings in a particular order, this should not be understood as requiring that such operations be performed in the particular order shown or in sequential order, or that all illustrated operations be performed, to achieve desirable results. In certain circumstances, multitasking and parallel processing may be advantageous. Moreover, the separation of various system components in the implementations described above should not be understood as requiring such separation in all implementations, and it should be understood that the described program components and systems can generally be integrated together in a single software product or packaged into multiple software products.
Thus, particular implementations of the subject matter have been described. Other implementations are within the scope of the following claims. In some cases, the actions recited in the claims can be performed in a different order and still achieve desirable results. In addition, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In certain implementations, multitasking and parallel processing may be advantageous.

Claims

1. A computer-implemented method executed by one or more processors, the method comprising:
receiving, by the one or more processors, a first query provided from a user device; and
determining, by the one or more processors, that the first query is implicitly about some entity, and in response:
obtaining, by the one or more processors, a first approximate location of the user device when the user device provided the first query,
obtaining, by the one or more processors, a first set of entities comprising a plurality of entities, each entity in the first set of entities being associated with the first approximate location,
determining, by the one or more processors, that the first query is implicitly about at least two entities in the first set of entities, and in response providing a first selection interface for display to the user, the first selection interface comprising graphical representations, each graphical representation representing a respective entity of the at least two entities of the first set of entities,
receiving, by the one or more processors, user input indicating user selection of a first entity, and
providing, by the one or more processors, a first revised query based on the first query and the first entity, the first revised query explicitly referencing the first entity.
2. The method of claim 1, wherein a graphical representation comprises one or more of a name of a respective entity, a type of the respective entity, an approximate distance between the user device and the respective entity, one or more images of the respective entity, a rating of the respective entity, an address of the respective entity, a telephone number of the respective entity, and one or more results that would be displayed, if the respective entity is selected.
3. The method of claim 1, further comprising:
receiving a second query provided from the user device; and
determining, by the one or more processors, that the second query is implicitly about some entity, and in response:
obtaining a second approximate location of the user device when the user device provided the second query,
obtaining a second set of entities comprising a plurality of entities, each entity in the second set of entities being associated with the second approximate location,
determining that the second query is implicitly about at least two entities in the second set of entities,
automatically, without displaying a second selection interface, selecting a second entity, and
providing a second revised query based on the second query and the second entity, the second revised query explicitly referencing the second entity.
4. The method of claim 3, wherein automatically, without displaying a second selection interface, selecting a second entity is performed based on one or more of time, queries and location.
5. The method of claim 3, wherein the first entity and the second entity are the same entity.
6. The method of claim 1, wherein determining that the first query is implicitly about the at least two entities comprises providing a sub-set of entities based on comparing one or more types of one or more entities of the first set of entities to one or more types associated with the first query, the at least two entities being included in the sub-set of entities.
7. The method of claim 6, wherein the one or more types associated with the first query are provided from an index that associates types to respective n-grams of queries.
8. The method of claim 1, wherein, if a location of a respective entity is within a threshold distance of the approximate location, the respective location is included in the first set of entities.
9. A system comprising:
a data store for storing data; and
one or more processors configured to interact with the data store, the one or more processors being further configured to perform operations comprising:
receiving a first query provided from a user device; and
determining that the first query is implicitly about some entity, and in response:
obtaining a first approximate location of the user device when the user device provided the first query,
obtaining a first set of entities comprising a plurality of entities, each entity in the first set of entities being associated with the first approximate location,
determining that the first query is implicitly about at least two entities in the first set of entities, and in response providing a first selection interface for display to the user, the first selection interface comprising graphical representations, each graphical representation representing a respective entity of the at least two entities of the first set of entities,
receiving user input indicating user selection of a first entity, and providing a first revised query based on the first query and the first entity, the first revised query explicitly referencing the first entity.
10. The system of claim 9, wherein a graphical representation comprises one or more of a name of a respective entity, a type of the respective entity, an approximate distance between the user device and the respective entity, one or more images of the respective entity, a rating of the respective entity, an address of the respective entity, a telephone number of the respective entity, and one or more results that would be displayed, if the respective entity is selected.
1 1. The system of claim 9, wherein operations further comprise:
receiving a second query provided from the user device; and
determining that the second query is implicitly about some entity, and in response: obtaining a second approximate location of the user device when the user device provided the second query,
obtaining a second set of entities comprising a plurality of entities, each entity in the second set of entities being associated with the second approximate location,
determining that the second query is implicitly about at least two entities in the second set of entities,
automatically, without displaying a second selection interface, selecting a second entity, and
providing a second revised query based on the second query and the second entity, the second revised query explicitly referencing the second entity.
12. The system of claim 11, wherein automatically, without displaying a second selection interface, selecting a second entity is performed based on one or more of time, queries and location.
13. The system of claim 1 1, wherein the first entity and the second entity are the same entity.
14. The system of claim 9, wherein determining that the first query is implicitly about the at least two entities comprises providing a sub-set of entities based on comparing one or more types of one or more entities of the first set of entities to one or more types associated with the first query, the at least two entities being included in the sub-set of entities.
15. The system of claim 14, wherein the one or more types associated with the first query are provided from an index that associates types to respective n-grams of queries.
16. The system of claim 9, wherein, if a location of a respective entity is within a threshold distance of the approximate location, the respective location is included in the first set of entities.
17. A computer readable medium storing instructions that, when executed by one or more processors, cause the one or more processors to perform operations comprising: receiving a first query provided from a user device; and
determining that the first query is implicitly about some entity, and in response: obtaining a first approximate location of the user device when the user device provided the first query,
obtaining a first set of entities comprising a plurality of entities, each entity in the first set of entities being associated with the first approximate location, determining that the first query is implicitly about at least two entities in the first set of entities, and in response providing a first selection interface for display to the user, the first selection interface comprising graphical representations, each graphical representation representing a respective entity of the at least two entities of the first set of entities,
receiving user input indicating user selection of a first entity, and providing a first revised query based on the first query and the first entity, the first revised query explicitly referencing the first entity.
18. The computer readable medium of claim 17, wherein a graphical representation comprises one or more of a name of a respective entity, a type of the respective entity, an approximate distance between the user device and the respective entity, one or more images of the respective entity, a rating of the respective entity, an address of the respective entity, a telephone number of the respective entity, and one or more results that would be displayed, if the respective entity is selected.
19. The computer readable medium of claim 17, wherein operations further comprise: receiving a second query provided from the user device; and
determining that the second query is implicitly about some entity, and in response: obtaining a second approximate location of the user device when the user device provided the second query, obtaining a second set of entities comprising a plurality of entities, each entity in the second set of entities being associated with the second approximate location,
determining that the second query is implicitly about at least two entities in the second set of entities,
automatically, without displaying a second selection interface, selecting a second entity, and
providing a second revised query based on the second query and the second entity, the second revised query explicitly referencing the second entity.
20. The computer readable medium of claim 19, wherein automatically, without displaying a second selection interface, selecting a second entity is performed based on one or more of time, queries and location.
21. The computer readable medium of claim 19, wherein the first entity and the second entity are the same entity.
22. The computer readable medium of claim 17, wherein determining that the first query is implicitly about the at least two entities comprises providing a sub-set of entities based on comparing one or more types of one or more entities of the first set of entities to one or more types associated with the first query, the at least two entities being included in the sub-set of entities.
23. The computer readable medium of claim 22, wherein the one or more types associated with the first query are provided from an index that associates types to respective n-grams of queries.
24. The computer readable medium of claim 17, wherein, if a location of a respective entity is within a threshold distance of the approximate location, the respective location is included in the first set of entities.
PCT/US2015/029696 2014-05-12 2015-05-07 Disambiguation of queries implicit to multiple entities WO2015175313A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP15722417.1A EP3143516A1 (en) 2014-05-12 2015-05-07 Disambiguation of queries implicit to multiple entities
CN201580024802.9A CN106462603B (en) 2014-05-12 2015-05-07 Disambiguation of queries implied by multiple entities

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201461991950P 2014-05-12 2014-05-12
US61/991,950 2014-05-12

Publications (1)

Publication Number Publication Date
WO2015175313A1 true WO2015175313A1 (en) 2015-11-19

Family

ID=53177398

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2015/029696 WO2015175313A1 (en) 2014-05-12 2015-05-07 Disambiguation of queries implicit to multiple entities

Country Status (3)

Country Link
EP (1) EP3143516A1 (en)
CN (1) CN106462603B (en)
WO (1) WO2015175313A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110059164A (en) * 2018-01-11 2019-07-26 国际商业机器公司 Semantic expressiveness and realization for conversational system
US11797609B2 (en) 2018-01-11 2023-10-24 Intenrational Business Machines Corporation Semantic representation and realization for conversational systems

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110555208B (en) * 2018-06-04 2021-11-19 北京三快在线科技有限公司 Ambiguity elimination method and device in information query and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080097966A1 (en) * 2006-10-18 2008-04-24 Yahoo! Inc. A Delaware Corporation Apparatus and Method for Providing Regional Information Based on Location
US20100318551A1 (en) * 2009-06-15 2010-12-16 Jenny Lai Method and system for search string entry and refinement on a mobile device
US20110313657A1 (en) * 2010-06-18 2011-12-22 Google Inc. Context-Sensitive Point of Interest Retrieval
US20120143859A1 (en) * 2010-12-01 2012-06-07 Microsoft Corporation Real-time personalized recommendation of location-related entities

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130006914A1 (en) * 2011-06-28 2013-01-03 Microsoft Corporation Exposing search history by category

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080097966A1 (en) * 2006-10-18 2008-04-24 Yahoo! Inc. A Delaware Corporation Apparatus and Method for Providing Regional Information Based on Location
US20100318551A1 (en) * 2009-06-15 2010-12-16 Jenny Lai Method and system for search string entry and refinement on a mobile device
US20110313657A1 (en) * 2010-06-18 2011-12-22 Google Inc. Context-Sensitive Point of Interest Retrieval
US20120143859A1 (en) * 2010-12-01 2012-06-07 Microsoft Corporation Real-time personalized recommendation of location-related entities

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110059164A (en) * 2018-01-11 2019-07-26 国际商业机器公司 Semantic expressiveness and realization for conversational system
CN110059164B (en) * 2018-01-11 2023-06-06 国际商业机器公司 Method and system for presenting a user interface of a dialog system
US11797609B2 (en) 2018-01-11 2023-10-24 Intenrational Business Machines Corporation Semantic representation and realization for conversational systems

Also Published As

Publication number Publication date
CN106462603A (en) 2017-02-22
CN106462603B (en) 2020-03-27
EP3143516A1 (en) 2017-03-22

Similar Documents

Publication Publication Date Title
US11397784B2 (en) Structured user graph to support querying and predictions
US11709829B2 (en) Retrieving context from previous sessions
US10354647B2 (en) Correcting voice recognition using selective re-speak
US10474671B2 (en) Interpreting user queries based on nearby locations
US9275147B2 (en) Providing query suggestions
US20170193064A1 (en) Surfacing live events in search results
US20230244657A1 (en) Query composition system
US10922321B2 (en) Interpreting user queries based on device orientation
CN106462603B (en) Disambiguation of queries implied by multiple entities
US20170277702A1 (en) Interpreting user queries based on nearby locations
US10528564B2 (en) Identifying teachable moments for contextual search

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15722417

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

REEP Request for entry into the european phase

Ref document number: 2015722417

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2015722417

Country of ref document: EP