WO2007082845A3 - Query evaluation using ancestor information - Google Patents

Query evaluation using ancestor information

Info

Publication number
WO2007082845A3
Authority
WO
WIPO (PCT)
Prior art keywords
query
document
extraction
step instance
match candidate
Prior art date
Application number
PCT/EP2007/050292
Other languages
French (fr)
Other versions
WO2007082845A2 (en)
Inventor
Vanja Josifovski
Edison Lao Ting
Original Assignee
IBM
IBM UK
Vanja Josifovski
Edison Lao Ting
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by IBM, IBM UK, Vanja Josifovski, Edison Lao Ting
Publication of WO2007082845A2
Publication of WO2007082845A3

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/205 Parsing
    • G06F40/221 Parsing markup language streams
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2452 Query translation
    • G06F16/24524 Access plan code generation and invalidation; Reuse of access plans
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/80 Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
    • G06F16/83 Querying
    • G06F16/835 Query processing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/80 Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
    • G06F16/83 Querying
    • G06F16/835 Query processing
    • G06F16/8373 Query execution
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10 TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10S TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00 Data processing: database and file management or data structures
    • Y10S707/99931 Database or file accessing
    • Y10S707/99933 Query processing, i.e. searching

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Provided are techniques for processing a query. A query is received, wherein the query is formed by one or more paths, and wherein each path includes one or more steps. A hierarchical document including one or more document nodes is received. While processing the query and traversing the hierarchical document, one or more extraction entries are constructed, wherein each extraction entry includes a step instance match candidate identifying a document node and a step instance ancestor path for the document node, and one or more tuples are constructed using the one or more extraction entries by associating the step instance match candidate from one of the one or more extraction entries with the step instance match candidate from at least one of the one or more other extraction entries.
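
The flow described in the abstract can be illustrated with a minimal Python sketch. This is not the patented method, only one simplified reading of it: the names `ExtractionEntry`, `build_entries`, `build_tuples`, the integer node ids, and the sample document are all hypothetical, and the sketch assumes that two match candidates belong in the same tuple only when their ancestor paths are identical (that is, when they sit under the same parent node).

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class ExtractionEntry:
    step: str             # query path that matched, e.g. "/lib/book/title"
    candidate: int        # match candidate: id of the matching document node
    text: str             # text content of the matching node
    ancestor_path: tuple  # ids of the nodes above the candidate, root first


def build_entries(root, paths):
    """Traverse the document once; emit one entry per node matching a path."""
    entries = []
    counter = 0

    def visit(node, names, ancestor_ids):
        nonlocal counter
        counter += 1
        node_id = counter  # ids are assigned in document order
        here = names + (node.tag,)
        path = "/" + "/".join(here)
        if path in paths:
            text = (node.text or "").strip()
            entries.append(ExtractionEntry(path, node_id, text, ancestor_ids))
        for child in node:
            visit(child, here, ancestor_ids + (node_id,))

    visit(root, (), ())
    return entries


def build_tuples(entries, paths):
    """Associate candidates from different entries sharing an ancestor path."""
    groups = {}
    for entry in entries:
        by_step = groups.setdefault(entry.ancestor_path, {})
        by_step.setdefault(entry.step, []).append(entry)
    tuples = []
    for by_step in groups.values():
        # Simplification: only emit tuples when every path has a candidate.
        if all(p in by_step for p in paths):
            for combo in product(*(by_step[p] for p in paths)):
                tuples.append(tuple(e.text for e in combo))
    return tuples


DOC = """<lib>
  <book><title>A</title><author>X</author><author>Y</author></book>
  <book><title>B</title><author>Z</author></book>
</lib>"""
PATHS = ["/lib/book/title", "/lib/book/author"]

entries = build_entries(ET.fromstring(DOC), PATHS)
result = build_tuples(entries, PATHS)
print(result)
```

In this sketch the ancestor path is what keeps title `B` from being paired with authors `X` or `Y`: their entries record a different `book` node id, so they fall into different groups. A real streaming evaluator would also have to handle descendant axes, predicates, and partial ancestor overlap, none of which are modeled here.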
PCT/EP2007/050292 2006-01-20 2007-01-12 Query evaluation using ancestor information WO2007082845A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/336,140 2006-01-20
US11/336,140 US7596548B2 (en) 2006-01-20 2006-01-20 Query evaluation using ancestor information

Publications (2)

Publication Number Publication Date
WO2007082845A2 WO2007082845A2 (en) 2007-07-26
WO2007082845A3 WO2007082845A3 (en) 2007-11-01

Family

ID=37890551

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2007/050292 WO2007082845A2 (en) 2006-01-20 2007-01-12 Query evaluation using ancestor information

Country Status (2)

Country Link
US (5) US7596548B2 (en)
WO (1) WO2007082845A2 (en)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7596548B2 (en) 2006-01-20 2009-09-29 International Business Machines Corporation Query evaluation using ancestor information
US8122087B2 (en) * 2006-03-21 2012-02-21 Aol Inc. Matching engine for comparing data feeds with user profile criteria
US7698260B2 (en) * 2007-03-09 2010-04-13 International Business Machines Corporation Apparatus and method for handling a LET binding
US7698295B2 (en) * 2007-03-09 2010-04-13 International Business Machines Corporation Method and apparatus for handling a LET binding
US9842090B2 (en) 2007-12-05 2017-12-12 Oracle International Corporation Efficient streaming evaluation of XPaths on binary-encoded XML schema-based documents
US7895232B2 (en) * 2007-12-25 2011-02-22 International Business Machines Corporation Object-oriented twig query evaluation
US7925656B2 (en) * 2008-03-07 2011-04-12 International Business Machines Corporation Node level hash join for evaluating a query
US8024325B2 (en) * 2008-06-25 2011-09-20 Oracle International Corporation Estimating the cost of XML operators for binary XML storage
US20100030727A1 (en) * 2008-07-29 2010-02-04 Sivasankaran Chandrasekar Technique For Using Occurrence Constraints To Optimize XML Index Access
US9298846B2 (en) * 2009-12-23 2016-03-29 Citrix Systems, Inc. Systems and methods for efficient Xpath processing
US9734237B2 (en) * 2012-10-08 2017-08-15 Bmc Software, Inc. Progressive analysis for big data
US10372694B2 (en) * 2014-10-08 2019-08-06 Adobe Inc. Structured information differentiation in naming
US11461323B2 (en) * 2019-06-28 2022-10-04 Visa International Service Association Techniques for efficient query processing
CN112115302B (en) * 2020-09-29 2022-04-12 集美大学 Parallel XPath query method of XML document, terminal equipment and storage medium

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040205082A1 (en) * 2003-04-14 2004-10-14 International Business Machines Corporation System and method for querying XML streams

Family Cites Families (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7167853B2 (en) 1999-05-20 2007-01-23 International Business Machines Corporation Matching and compensation tests for optimizing correlated subqueries within query using automatic summary tables
EP1207461A1 (en) * 2000-11-17 2002-05-22 Siemens Aktiengesellschaft Method and device for performing a query on a markup document
JP4039484B2 (en) 2002-02-28 2008-01-30 インターナショナル・ビジネス・マシーンズ・コーポレーション XPath evaluation method, XML document processing system and program using the same
KR100484138B1 (en) 2002-05-08 2005-04-18 삼성전자주식회사 XML indexing method for regular path expression queries in relational database and data structure thereof.
WO2003107222A1 (en) 2002-06-13 2003-12-24 Cerisent Corporation Parent-child query indexing for xml databases
AUPS300402A0 (en) 2002-06-17 2002-07-11 Canon Kabushiki Kaisha Indexing and querying structured documents
US7162485B2 (en) * 2002-06-19 2007-01-09 Georg Gottlob Efficient processing of XPath queries
US7721202B2 (en) * 2002-08-16 2010-05-18 Open Invention Network, Llc XML streaming transformer
US7171407B2 (en) 2002-10-03 2007-01-30 International Business Machines Corporation Method for streaming XPath processing with forward and backward axes
KR100493882B1 (en) * 2002-10-23 2005-06-10 삼성전자주식회사 Query process method for searching xml data
US7124137B2 (en) 2002-12-19 2006-10-17 International Business Machines Corporation Method, system, and program for optimizing processing of nested functions
US7062507B2 (en) 2003-02-24 2006-06-13 The Boeing Company Indexing profile for efficient and scalable XML based publish and subscribe system
US7653636B2 (en) * 2003-02-28 2010-01-26 Bea Systems, Inc. Systems and methods for streaming XPath query
JP3982623B2 (en) 2003-03-25 2007-09-26 インターナショナル・ビジネス・マシーンズ・コーポレーション Information processing apparatus, database search system, and program
US7383255B2 (en) 2003-06-23 2008-06-03 Microsoft Corporation Common query runtime system and application programming interface
US9152735B2 (en) 2003-07-24 2015-10-06 Alcatel Lucent Method and apparatus for composing XSL transformations with XML publishing views
US7313563B2 (en) 2003-07-30 2007-12-25 International Business Machines Corporation Method, system and recording medium for maintaining the order of nodes in a heirarchical document
US7174328B2 (en) 2003-09-02 2007-02-06 International Business Machines Corp. Selective path signatures for query processing over a hierarchical tagged data structure
US7013311B2 (en) 2003-09-05 2006-03-14 International Business Machines Corporation Providing XML cursor support on an XML repository built on top of a relational database system
US7315852B2 (en) 2003-10-31 2008-01-01 International Business Machines Corporation XPath containment for index and materialized view matching
US7165063B2 (en) * 2003-11-19 2007-01-16 International Business Machines Corporation Context quantifier transformation in XML query rewrite
US20050257201A1 (en) * 2004-05-17 2005-11-17 International Business Machines Corporation Optimization of XPath expressions for evaluation upon streaming XML data
US7346609B2 (en) * 2004-11-16 2008-03-18 International Business Machines Corporation Streaming XPath algorithm for XPath value index key generation
US7548926B2 (en) * 2005-10-05 2009-06-16 Microsoft Corporation High performance navigator for parsing inputs of a message
US20070174241A1 (en) * 2006-01-20 2007-07-26 Beyer Kevin S Match graphs for query evaluation
US7596548B2 (en) * 2006-01-20 2009-09-29 International Business Machines Corporation Query evaluation using ancestor information

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
BAR-YOSSEF ZIV ET AL: "Buffering in query evaluation over XML streams", PROCEEDINGS OF THE TWENTY-FOURTH ACM SIGMOD-SIGACT-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, PO, 2005, pages 216 - 227, XP002428835 *
D. FLORESCU ET AL.: "The BEA streaming XQuery processor", VLDB JOURNAL, 2004, pages 294 - 315, XP002428852 *
D. FLORESCU ET AL.: "The BEA/XQRL Streaming XQuery Processor", PROCEEDINGS OF VLDB 2003, 2003, XP002428851 *
VANJA JOSIFOVSKI ET AL: "Querying XML streams", THE VLDB JOURNAL ; THE INTERNATIONAL JOURNAL ON VERY LARGE DATA BASES, SPRINGER-VERLAG, BE, vol. 14, no. 2, 1 April 2005 (2005-04-01), pages 197 - 210, XP019351977, ISSN: 0949-877X *
YANLEI DIAO ET AL: "Implementing Memoization in a Streaming XQuery Processor", PROCEEDINGS OF THE DATABASE AND XML TECHNOLOGIES, SECOND INTERNATIONAL XML DATABASE SYMPOSIUM, XSYM 2004, 2004, pages 35 - 50, XP019009590 *

Also Published As

Publication number Publication date
US20140164428A1 (en) 2014-06-12
US8688721B2 (en) 2014-04-01
US9087139B2 (en) 2015-07-21
WO2007082845A2 (en) 2007-07-26
US9659001B2 (en) 2017-05-23
US20150261815A1 (en) 2015-09-17
US20090287700A1 (en) 2009-11-19
US20070174242A1 (en) 2007-07-26
US7979423B2 (en) 2011-07-12
US7596548B2 (en) 2009-09-29
US20110225144A1 (en) 2011-09-15

Similar Documents

Publication Publication Date Title
WO2007082845A3 (en) Query evaluation using ancestor information
WO2008051750A3 (en) Associating geographic-related information with objects
WO2004042493A3 (en) Method and system for discovering knowledge from text documents
WO2006099621A3 (en) Topic specific language models built from large numbers of documents
WO2009114714A3 (en) User contributed knowledge database
WO2007078723A3 (en) Method for displaying search results and contextually related items
WO2007041351A3 (en) Actively characterizing a network
WO2007002412A3 (en) Systems and methods for retrieving data
WO2005098592A3 (en) Systems and methods for weighting a search query result
WO2008039542A3 (en) System and method of ad-hoc analysis of data
WO2008156473A3 (en) Using relevance feedback in face recognition
WO2007024642A3 (en) Ranking and choosing suggested query candidates
WO2007131225A3 (en) Identifying a route configured to travel through multiple points of interest
WO2006028953A3 (en) Query-based document composition
WO2009026193A3 (en) System and method for search
WO2009152007A3 (en) Collecting and scoring online references
WO2008088721A3 (en) Querying data and an associated ontology in a database management system
WO2007025130A3 (en) Search system and method
RU2011140008A (en) ENSURING ACCESS TO THE DATA ELEMENT USING ACCESS GRAPHS
WO2006004946A3 (en) Accelerated schema-based validation
WO2006116537A3 (en) Method and apparatus for in-built searching and aggregating functionality
WO2007140226A3 (en) Method of providing a digital athlete profile
WO2014052609A3 (en) Evaluating xml full text search
WO2010141799A3 (en) Feature engineering and user behavior analysis
WO2011008889A3 (en) Methods and apparatus for efficiently processing multiple keyword queries on a distributed network

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application
NENP Non-entry into the national phase
  Ref country code: DE
122 EP: PCT application non-entry in European phase
  Ref document number: 07703832
  Country of ref document: EP
  Kind code of ref document: A2