WO2010062737A3 - Retrieval using a generalized sentence collocation - Google Patents

Retrieval using a generalized sentence collocation Download PDF

Info

Publication number
WO2010062737A3
WO2010062737A3 PCT/US2009/063057 US2009063057W WO2010062737A3 WO 2010062737 A3 WO2010062737 A3 WO 2010062737A3 US 2009063057 W US2009063057 W US 2009063057W WO 2010062737 A3 WO2010062737 A3 WO 2010062737A3
Authority
WO
WIPO (PCT)
Prior art keywords
retrieval
speech
word
retrieval system
generalized sentence
Prior art date
Application number
PCT/US2009/063057
Other languages
French (fr)
Other versions
WO2010062737A2 (en
Inventor
Xiaohua Liu
Ming Zhou
Hao Wei
Jing Zhao
Matthew R. Scott
Long Jiang
Gang Chen
Original Assignee
Microsoft Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corporation filed Critical Microsoft Corporation
Priority to EP09829670.0A priority Critical patent/EP2347354B1/en
Priority to CN200980143730.4A priority patent/CN102203774B/en
Publication of WO2010062737A2 publication Critical patent/WO2010062737A2/en
Publication of WO2010062737A3 publication Critical patent/WO2010062737A3/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis

Abstract

A method and system for identifying documents relevant to a query that specifies a part of speech is provided. A retrieval system receives from a user an input query that includes a word and a part of speech. Upon receiving an input query that includes a word and a part of speech, the retrieval system identifies documents with a sentence that includes that word collocated with a word that is used as that part of speech. The retrieval system displays to the user an indication of the identified documents.
PCT/US2009/063057 2008-11-03 2009-11-03 Retrieval using a generalized sentence collocation WO2010062737A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP09829670.0A EP2347354B1 (en) 2008-11-03 2009-11-03 Retrieval using a generalized sentence collocation
CN200980143730.4A CN102203774B (en) 2008-11-03 2009-11-03 Retrieval using a generalized sentence collocation

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US11089208P 2008-11-03 2008-11-03
US61/110,892 2008-11-03
US12/362,428 2009-01-29
US12/362,428 US8484014B2 (en) 2008-11-03 2009-01-29 Retrieval using a generalized sentence collocation

Publications (2)

Publication Number Publication Date
WO2010062737A2 WO2010062737A2 (en) 2010-06-03
WO2010062737A3 true WO2010062737A3 (en) 2010-07-22

Family

ID=42132516

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2009/063057 WO2010062737A2 (en) 2008-11-03 2009-11-03 Retrieval using a generalized sentence collocation

Country Status (4)

Country Link
US (1) US8484014B2 (en)
EP (1) EP2347354B1 (en)
CN (1) CN102203774B (en)
WO (1) WO2010062737A2 (en)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8290780B2 (en) * 2009-06-24 2012-10-16 International Business Machines Corporation Dynamically extending the speech prompts of a multimodal application
JP5824829B2 (en) * 2011-03-15 2015-12-02 富士通株式会社 Speech recognition apparatus, speech recognition method, and speech recognition program
US9858343B2 (en) 2011-03-31 2018-01-02 Microsoft Technology Licensing Llc Personalization of queries, conversations, and searches
US9842168B2 (en) 2011-03-31 2017-12-12 Microsoft Technology Licensing, Llc Task driven user intents
US8892555B2 (en) * 2011-03-31 2014-11-18 Samsung Electronics Co., Ltd. Apparatus and method for generating story according to user information
US9760566B2 (en) 2011-03-31 2017-09-12 Microsoft Technology Licensing, Llc Augmented conversational understanding agent to identify conversation context between two humans and taking an agent action thereof
US9244984B2 (en) 2011-03-31 2016-01-26 Microsoft Technology Licensing, Llc Location based conversational understanding
US10642934B2 (en) 2011-03-31 2020-05-05 Microsoft Technology Licensing, Llc Augmented conversational understanding architecture
US9064006B2 (en) 2012-08-23 2015-06-23 Microsoft Technology Licensing, Llc Translating natural language utterances to keyword search queries
US9454962B2 (en) * 2011-05-12 2016-09-27 Microsoft Technology Licensing, Llc Sentence simplification for spoken language understanding
CN102521220B (en) * 2011-11-29 2014-01-08 华中师范大学 Method for recognizing network suicide note
US20140006012A1 (en) * 2012-07-02 2014-01-02 Microsoft Corporation Learning-Based Processing of Natural Language Questions
US9547640B2 (en) * 2013-10-16 2017-01-17 International Business Machines Corporation Ontology-driven annotation confidence levels for natural language processing
KR20160056548A (en) * 2014-11-12 2016-05-20 삼성전자주식회사 Apparatus and method for qusetion-answering
US10241716B2 (en) 2017-06-30 2019-03-26 Microsoft Technology Licensing, Llc Global occupancy aggregator for global garbage collection scheduling
US11010180B2 (en) * 2018-05-29 2021-05-18 Wipro Limited Method and system for providing real-time guidance to users during troubleshooting of devices

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070010992A1 (en) * 2005-07-08 2007-01-11 Microsoft Corporation Processing collocation mistakes in documents

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0756933A (en) * 1993-06-24 1995-03-03 Xerox Corp Method for retrieval of document
US5331556A (en) * 1993-06-28 1994-07-19 General Electric Company Method for natural language data processing using morphological and part-of-speech information
US5873660A (en) * 1995-06-19 1999-02-23 Microsoft Corporation Morphological search and replace
US5963940A (en) * 1995-08-16 1999-10-05 Syracuse University Natural language information retrieval system and method
US5721902A (en) 1995-09-15 1998-02-24 Infonautics Corporation Restricted expansion of query terms using part of speech tagging
US20020123994A1 (en) * 2000-04-26 2002-09-05 Yves Schabes System for fulfilling an information need using extended matching techniques
AU2002220219A1 (en) * 2000-12-05 2002-06-18 Global Information Research And Technologies, Llc System for fulfilling an information need using extended matching techniques
US7269545B2 (en) * 2001-03-30 2007-09-11 Nec Laboratories America, Inc. Method for retrieving answers from an information retrieval system
US7031911B2 (en) * 2002-06-28 2006-04-18 Microsoft Corporation System and method for automatic detection of collocation mistakes in documents
US7293015B2 (en) * 2002-09-19 2007-11-06 Microsoft Corporation Method and system for detecting user intentions in retrieval of hint sentences
US7171351B2 (en) * 2002-09-19 2007-01-30 Microsoft Corporation Method and system for retrieving hint sentences using expanded queries
US7194455B2 (en) * 2002-09-19 2007-03-20 Microsoft Corporation Method and system for retrieving confirming sentences
WO2004066271A1 (en) * 2003-01-20 2004-08-05 Fujitsu Limited Speech synthesizing apparatus, speech synthesizing method, and speech synthesizing system
US7689412B2 (en) * 2003-12-05 2010-03-30 Microsoft Corporation Synonymous collocation extraction using translation information
US7260568B2 (en) * 2004-04-15 2007-08-21 Microsoft Corporation Verifying relevance between keywords and web site contents
CN100530171C (en) * 2005-01-31 2009-08-19 日电(中国)有限公司 Dictionary learning method and devcie
US7277029B2 (en) * 2005-06-23 2007-10-02 Microsoft Corporation Using language models to expand wildcards
CN100578539C (en) * 2006-02-28 2010-01-06 腾讯科技(深圳)有限公司 Automatic question-answering method and system
US9020804B2 (en) * 2006-05-10 2015-04-28 Xerox Corporation Method for aligning sentences at the word level enforcing selective contiguity constraints
US7698328B2 (en) * 2006-08-11 2010-04-13 Apple Inc. User-directed search refinement
CN100416570C (en) * 2006-09-22 2008-09-03 浙江大学 FAQ based Chinese natural language ask and answer method

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070010992A1 (en) * 2005-07-08 2007-01-11 Microsoft Corporation Processing collocation mistakes in documents

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
BARR C. ET AL: "The Linguistic Structure of English Web-Search Queries", EMPIRICAL METHOD IN NATURAL LANGUAGE PROCESSING 2008, October 2008 (2008-10-01), pages 1021 - 1030, XP008145689 *
NTOULAS A. ET AL: "The Infocious Web Search Engine: Improving Web Searching Through Linguistic Analysis", INTERNATIONAL WORLD WIDE WEB CONFERENCE, - 2005, pages 840 - 849, XP008145687 *
ZUKERMAN I. ET AL: "Lexical Query Paraphrasing for Document Retrieval", THE 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL LINGUISTICS, 2002, - 2002, XP008145688 *

Also Published As

Publication number Publication date
EP2347354B1 (en) 2018-08-01
CN102203774B (en) 2014-12-17
WO2010062737A2 (en) 2010-06-03
EP2347354A2 (en) 2011-07-27
EP2347354A4 (en) 2016-09-21
US8484014B2 (en) 2013-07-09
CN102203774A (en) 2011-09-28
US20100114574A1 (en) 2010-05-06

Similar Documents

Publication Publication Date Title
WO2010062737A3 (en) Retrieval using a generalized sentence collocation
WO2007005120A3 (en) Searching for content using voice search queries
WO2012048306A3 (en) Structured searching of dynamic structured document corpuses
WO2008051750A3 (en) Associating geographic-related information with objects
MX2019001576A (en) Systems and methods for contextual retrieval of electronic records.
WO2008014499A3 (en) Information nervous system
WO2007115079A3 (en) Expanded snippets
WO2006057741A3 (en) Interactive system for collecting metadata
WO2007008798A3 (en) System and method for searching for network-based content in a multi-modal system using spoken keywords
SG154439A1 (en) Searching and naming items based on metadata
WO2007047971A3 (en) Real time query trends with multi-document summarization
WO2008070877A3 (en) Online computer-aided translation
WO2007016628A3 (en) Definition extraction
WO2008033665A3 (en) Media systems with integrated content searching
WO2007027596A3 (en) System, device, and method for conveying information using a rapid serial presentation technique
WO2008083215A3 (en) System and method for related information search and presentation from user interface content
WO2007103583A3 (en) Method and system for media navigation
WO2010068068A3 (en) Information search method and information provision method based on user's intention
WO2007005536A3 (en) Information retrieving and displaying method and computer-readable medium
WO2012082886A3 (en) Sender-based ranking of person profiles and multi-person automatic suggestions
WO2006124952A3 (en) The information nervous system
WO2008028029A3 (en) Method and system for providing an automated web transcription service
WO2008045981A3 (en) Virtual network of real-world entities
WO2010045549A3 (en) Textual disambiguation using social connections
MY153405A (en) Context-sensitive searches and functionality for instant messaging applications

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200980143730.4

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09829670

Country of ref document: EP

Kind code of ref document: A2

WWE Wipo information: entry into national phase

Ref document number: 2009829670

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE