WO2010062737A3 - Retrieval using a generalized sentence collocation - Google Patents
Retrieval using a generalized sentence collocation Download PDFInfo
- Publication number
- WO2010062737A3 WO2010062737A3 PCT/US2009/063057 US2009063057W WO2010062737A3 WO 2010062737 A3 WO2010062737 A3 WO 2010062737A3 US 2009063057 W US2009063057 W US 2009063057W WO 2010062737 A3 WO2010062737 A3 WO 2010062737A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- retrieval
- speech
- word
- retrieval system
- generalized sentence
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/334—Query execution
- G06F16/3344—Query execution using natural language analysis
Abstract
A method and system for identifying documents relevant to a query that specifies a part of speech is provided. A retrieval system receives from a user an input query that includes a word and a part of speech. Upon receiving an input query that includes a word and a part of speech, the retrieval system identifies documents with a sentence that includes that word collocated with a word that is used as that part of speech. The retrieval system displays to the user an indication of the identified documents.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP09829670.0A EP2347354B1 (en) | 2008-11-03 | 2009-11-03 | Retrieval using a generalized sentence collocation |
CN200980143730.4A CN102203774B (en) | 2008-11-03 | 2009-11-03 | Retrieval using a generalized sentence collocation |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11089208P | 2008-11-03 | 2008-11-03 | |
US61/110,892 | 2008-11-03 | ||
US12/362,428 | 2009-01-29 | ||
US12/362,428 US8484014B2 (en) | 2008-11-03 | 2009-01-29 | Retrieval using a generalized sentence collocation |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2010062737A2 WO2010062737A2 (en) | 2010-06-03 |
WO2010062737A3 true WO2010062737A3 (en) | 2010-07-22 |
Family
ID=42132516
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2009/063057 WO2010062737A2 (en) | 2008-11-03 | 2009-11-03 | Retrieval using a generalized sentence collocation |
Country Status (4)
Country | Link |
---|---|
US (1) | US8484014B2 (en) |
EP (1) | EP2347354B1 (en) |
CN (1) | CN102203774B (en) |
WO (1) | WO2010062737A2 (en) |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8290780B2 (en) * | 2009-06-24 | 2012-10-16 | International Business Machines Corporation | Dynamically extending the speech prompts of a multimodal application |
JP5824829B2 (en) * | 2011-03-15 | 2015-12-02 | 富士通株式会社 | Speech recognition apparatus, speech recognition method, and speech recognition program |
US9858343B2 (en) | 2011-03-31 | 2018-01-02 | Microsoft Technology Licensing Llc | Personalization of queries, conversations, and searches |
US9842168B2 (en) | 2011-03-31 | 2017-12-12 | Microsoft Technology Licensing, Llc | Task driven user intents |
US8892555B2 (en) * | 2011-03-31 | 2014-11-18 | Samsung Electronics Co., Ltd. | Apparatus and method for generating story according to user information |
US9760566B2 (en) | 2011-03-31 | 2017-09-12 | Microsoft Technology Licensing, Llc | Augmented conversational understanding agent to identify conversation context between two humans and taking an agent action thereof |
US9244984B2 (en) | 2011-03-31 | 2016-01-26 | Microsoft Technology Licensing, Llc | Location based conversational understanding |
US10642934B2 (en) | 2011-03-31 | 2020-05-05 | Microsoft Technology Licensing, Llc | Augmented conversational understanding architecture |
US9064006B2 (en) | 2012-08-23 | 2015-06-23 | Microsoft Technology Licensing, Llc | Translating natural language utterances to keyword search queries |
US9454962B2 (en) * | 2011-05-12 | 2016-09-27 | Microsoft Technology Licensing, Llc | Sentence simplification for spoken language understanding |
CN102521220B (en) * | 2011-11-29 | 2014-01-08 | 华中师范大学 | Method for recognizing network suicide note |
US20140006012A1 (en) * | 2012-07-02 | 2014-01-02 | Microsoft Corporation | Learning-Based Processing of Natural Language Questions |
US9547640B2 (en) * | 2013-10-16 | 2017-01-17 | International Business Machines Corporation | Ontology-driven annotation confidence levels for natural language processing |
KR20160056548A (en) * | 2014-11-12 | 2016-05-20 | 삼성전자주식회사 | Apparatus and method for qusetion-answering |
US10241716B2 (en) | 2017-06-30 | 2019-03-26 | Microsoft Technology Licensing, Llc | Global occupancy aggregator for global garbage collection scheduling |
US11010180B2 (en) * | 2018-05-29 | 2021-05-18 | Wipro Limited | Method and system for providing real-time guidance to users during troubleshooting of devices |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070010992A1 (en) * | 2005-07-08 | 2007-01-11 | Microsoft Corporation | Processing collocation mistakes in documents |
Family Cites Families (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0756933A (en) * | 1993-06-24 | 1995-03-03 | Xerox Corp | Method for retrieval of document |
US5331556A (en) * | 1993-06-28 | 1994-07-19 | General Electric Company | Method for natural language data processing using morphological and part-of-speech information |
US5873660A (en) * | 1995-06-19 | 1999-02-23 | Microsoft Corporation | Morphological search and replace |
US5963940A (en) * | 1995-08-16 | 1999-10-05 | Syracuse University | Natural language information retrieval system and method |
US5721902A (en) | 1995-09-15 | 1998-02-24 | Infonautics Corporation | Restricted expansion of query terms using part of speech tagging |
US20020123994A1 (en) * | 2000-04-26 | 2002-09-05 | Yves Schabes | System for fulfilling an information need using extended matching techniques |
AU2002220219A1 (en) * | 2000-12-05 | 2002-06-18 | Global Information Research And Technologies, Llc | System for fulfilling an information need using extended matching techniques |
US7269545B2 (en) * | 2001-03-30 | 2007-09-11 | Nec Laboratories America, Inc. | Method for retrieving answers from an information retrieval system |
US7031911B2 (en) * | 2002-06-28 | 2006-04-18 | Microsoft Corporation | System and method for automatic detection of collocation mistakes in documents |
US7293015B2 (en) * | 2002-09-19 | 2007-11-06 | Microsoft Corporation | Method and system for detecting user intentions in retrieval of hint sentences |
US7171351B2 (en) * | 2002-09-19 | 2007-01-30 | Microsoft Corporation | Method and system for retrieving hint sentences using expanded queries |
US7194455B2 (en) * | 2002-09-19 | 2007-03-20 | Microsoft Corporation | Method and system for retrieving confirming sentences |
WO2004066271A1 (en) * | 2003-01-20 | 2004-08-05 | Fujitsu Limited | Speech synthesizing apparatus, speech synthesizing method, and speech synthesizing system |
US7689412B2 (en) * | 2003-12-05 | 2010-03-30 | Microsoft Corporation | Synonymous collocation extraction using translation information |
US7260568B2 (en) * | 2004-04-15 | 2007-08-21 | Microsoft Corporation | Verifying relevance between keywords and web site contents |
CN100530171C (en) * | 2005-01-31 | 2009-08-19 | 日电(中国)有限公司 | Dictionary learning method and devcie |
US7277029B2 (en) * | 2005-06-23 | 2007-10-02 | Microsoft Corporation | Using language models to expand wildcards |
CN100578539C (en) * | 2006-02-28 | 2010-01-06 | 腾讯科技(深圳)有限公司 | Automatic question-answering method and system |
US9020804B2 (en) * | 2006-05-10 | 2015-04-28 | Xerox Corporation | Method for aligning sentences at the word level enforcing selective contiguity constraints |
US7698328B2 (en) * | 2006-08-11 | 2010-04-13 | Apple Inc. | User-directed search refinement |
CN100416570C (en) * | 2006-09-22 | 2008-09-03 | 浙江大学 | FAQ based Chinese natural language ask and answer method |
-
2009
- 2009-01-29 US US12/362,428 patent/US8484014B2/en active Active
- 2009-11-03 WO PCT/US2009/063057 patent/WO2010062737A2/en active Application Filing
- 2009-11-03 EP EP09829670.0A patent/EP2347354B1/en active Active
- 2009-11-03 CN CN200980143730.4A patent/CN102203774B/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070010992A1 (en) * | 2005-07-08 | 2007-01-11 | Microsoft Corporation | Processing collocation mistakes in documents |
Non-Patent Citations (3)
Title |
---|
BARR C. ET AL: "The Linguistic Structure of English Web-Search Queries", EMPIRICAL METHOD IN NATURAL LANGUAGE PROCESSING 2008, October 2008 (2008-10-01), pages 1021 - 1030, XP008145689 * |
NTOULAS A. ET AL: "The Infocious Web Search Engine: Improving Web Searching Through Linguistic Analysis", INTERNATIONAL WORLD WIDE WEB CONFERENCE, - 2005, pages 840 - 849, XP008145687 * |
ZUKERMAN I. ET AL: "Lexical Query Paraphrasing for Document Retrieval", THE 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL LINGUISTICS, 2002, - 2002, XP008145688 * |
Also Published As
Publication number | Publication date |
---|---|
EP2347354B1 (en) | 2018-08-01 |
CN102203774B (en) | 2014-12-17 |
WO2010062737A2 (en) | 2010-06-03 |
EP2347354A2 (en) | 2011-07-27 |
EP2347354A4 (en) | 2016-09-21 |
US8484014B2 (en) | 2013-07-09 |
CN102203774A (en) | 2011-09-28 |
US20100114574A1 (en) | 2010-05-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2010062737A3 (en) | Retrieval using a generalized sentence collocation | |
WO2007005120A3 (en) | Searching for content using voice search queries | |
WO2012048306A3 (en) | Structured searching of dynamic structured document corpuses | |
WO2008051750A3 (en) | Associating geographic-related information with objects | |
MX2019001576A (en) | Systems and methods for contextual retrieval of electronic records. | |
WO2008014499A3 (en) | Information nervous system | |
WO2007115079A3 (en) | Expanded snippets | |
WO2006057741A3 (en) | Interactive system for collecting metadata | |
WO2007008798A3 (en) | System and method for searching for network-based content in a multi-modal system using spoken keywords | |
SG154439A1 (en) | Searching and naming items based on metadata | |
WO2007047971A3 (en) | Real time query trends with multi-document summarization | |
WO2008070877A3 (en) | Online computer-aided translation | |
WO2007016628A3 (en) | Definition extraction | |
WO2008033665A3 (en) | Media systems with integrated content searching | |
WO2007027596A3 (en) | System, device, and method for conveying information using a rapid serial presentation technique | |
WO2008083215A3 (en) | System and method for related information search and presentation from user interface content | |
WO2007103583A3 (en) | Method and system for media navigation | |
WO2010068068A3 (en) | Information search method and information provision method based on user's intention | |
WO2007005536A3 (en) | Information retrieving and displaying method and computer-readable medium | |
WO2012082886A3 (en) | Sender-based ranking of person profiles and multi-person automatic suggestions | |
WO2006124952A3 (en) | The information nervous system | |
WO2008028029A3 (en) | Method and system for providing an automated web transcription service | |
WO2008045981A3 (en) | Virtual network of real-world entities | |
WO2010045549A3 (en) | Textual disambiguation using social connections | |
MY153405A (en) | Context-sensitive searches and functionality for instant messaging applications |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 200980143730.4 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 09829670 Country of ref document: EP Kind code of ref document: A2 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2009829670 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |