WO2007082845A3 - Query evaluation using ancestor information - Google Patents
Query evaluation using ancestor information Download PDFInfo
- Publication number
- WO2007082845A3 WO2007082845A3 PCT/EP2007/050292 EP2007050292W WO2007082845A3 WO 2007082845 A3 WO2007082845 A3 WO 2007082845A3 EP 2007050292 W EP2007050292 W EP 2007050292W WO 2007082845 A3 WO2007082845 A3 WO 2007082845A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- query
- document
- extraction
- step instance
- match candidate
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/221—Parsing markup language streams
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2452—Query translation
- G06F16/24524—Access plan code generation and invalidation; Reuse of access plans
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/80—Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
- G06F16/83—Querying
- G06F16/835—Query processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/80—Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
- G06F16/83—Querying
- G06F16/835—Query processing
- G06F16/8373—Query execution
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10—TECHNICAL SUBJECTS COVERED BY FORMER USPC
- Y10S—TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y10S707/00—Data processing: database and file management or data structures
- Y10S707/99931—Database or file accessing
- Y10S707/99933—Query processing, i.e. searching
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Physics & Mathematics (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Provided are techniques for processing a query. A query is received, wherein the query is formed by one or more paths, and wherein each path includes one or more steps. A hierarchical document including one or more document nodes is received. While processing the query and traversing the hierarchical document, one or more extraction entries are constructed, wherein each extraction entry includes a step instance match candidate identifying a document node and a step instance ancestor path for the document node, and one or more tuples are constructed using the one or more extraction entries by associating the step instance match candidate from one of the one or more extraction entries with the step instance match candidate from at least one of the one or more other extraction entries.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/336,140 | 2006-01-20 | ||
US11/336,140 US7596548B2 (en) | 2006-01-20 | 2006-01-20 | Query evaluation using ancestor information |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2007082845A2 WO2007082845A2 (en) | 2007-07-26 |
WO2007082845A3 true WO2007082845A3 (en) | 2007-11-01 |
Family
ID=37890551
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP2007/050292 WO2007082845A2 (en) | 2006-01-20 | 2007-01-12 | Query evaluation using ancestor information |
Country Status (2)
Country | Link |
---|---|
US (5) | US7596548B2 (en) |
WO (1) | WO2007082845A2 (en) |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7596548B2 (en) | 2006-01-20 | 2009-09-29 | International Business Machines Corporation | Query evaluation using ancestor information |
US8122087B2 (en) * | 2006-03-21 | 2012-02-21 | Aol Inc. | Matching engine for comparing data feeds with user profile criteria |
US7698260B2 (en) * | 2007-03-09 | 2010-04-13 | International Business Machines Corporation | Apparatus and method for handling a LET binding |
US7698295B2 (en) * | 2007-03-09 | 2010-04-13 | International Business Machines Corporation | Method and apparatus for handling a LET binding |
US9842090B2 (en) | 2007-12-05 | 2017-12-12 | Oracle International Corporation | Efficient streaming evaluation of XPaths on binary-encoded XML schema-based documents |
US7895232B2 (en) * | 2007-12-25 | 2011-02-22 | International Business Machines Corporation | Object-oriented twig query evaluation |
US7925656B2 (en) * | 2008-03-07 | 2011-04-12 | International Business Machines Corporation | Node level hash join for evaluating a query |
US8024325B2 (en) * | 2008-06-25 | 2011-09-20 | Oracle International Corporation | Estimating the cost of XML operators for binary XML storage |
US20100030727A1 (en) * | 2008-07-29 | 2010-02-04 | Sivasankaran Chandrasekar | Technique For Using Occurrence Constraints To Optimize XML Index Access |
US9298846B2 (en) * | 2009-12-23 | 2016-03-29 | Citrix Systems, Inc. | Systems and methods for efficient Xpath processing |
US9734237B2 (en) * | 2012-10-08 | 2017-08-15 | Bmc Software, Inc. | Progressive analysis for big data |
US10372694B2 (en) * | 2014-10-08 | 2019-08-06 | Adobe Inc. | Structured information differentiation in naming |
US11461323B2 (en) * | 2019-06-28 | 2022-10-04 | Visa International Service Association | Techniques for efficient query processing |
CN112115302B (en) * | 2020-09-29 | 2022-04-12 | 集美大学 | Parallel XPath query method of XML document, terminal equipment and storage medium |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040205082A1 (en) * | 2003-04-14 | 2004-10-14 | International Business Machines Corporation | System and method for querying XML streams |
Family Cites Families (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7167853B2 (en) | 1999-05-20 | 2007-01-23 | International Business Machines Corporation | Matching and compensation tests for optimizing correlated subqueries within query using automatic summary tables |
EP1207461A1 (en) * | 2000-11-17 | 2002-05-22 | Siemens Aktiengesellschaft | Method and device for performing a query on a markup document |
JP4039484B2 (en) | 2002-02-28 | 2008-01-30 | インターナショナル・ビジネス・マシーンズ・コーポレーション | XPath evaluation method, XML document processing system and program using the same |
KR100484138B1 (en) | 2002-05-08 | 2005-04-18 | 삼성전자주식회사 | XML indexing method for regular path expression queries in relational database and data structure thereof. |
WO2003107222A1 (en) | 2002-06-13 | 2003-12-24 | Cerisent Corporation | Parent-child query indexing for xml databases |
AUPS300402A0 (en) | 2002-06-17 | 2002-07-11 | Canon Kabushiki Kaisha | Indexing and querying structured documents |
US7162485B2 (en) * | 2002-06-19 | 2007-01-09 | Georg Gottlob | Efficient processing of XPath queries |
US7721202B2 (en) * | 2002-08-16 | 2010-05-18 | Open Invention Network, Llc | XML streaming transformer |
US7171407B2 (en) | 2002-10-03 | 2007-01-30 | International Business Machines Corporation | Method for streaming XPath processing with forward and backward axes |
KR100493882B1 (en) * | 2002-10-23 | 2005-06-10 | 삼성전자주식회사 | Query process method for searching xml data |
US7124137B2 (en) | 2002-12-19 | 2006-10-17 | International Business Machines Corporation | Method, system, and program for optimizing processing of nested functions |
US7062507B2 (en) | 2003-02-24 | 2006-06-13 | The Boeing Company | Indexing profile for efficient and scalable XML based publish and subscribe system |
US7653636B2 (en) * | 2003-02-28 | 2010-01-26 | Bea Systems, Inc. | Systems and methods for streaming XPath query |
JP3982623B2 (en) | 2003-03-25 | 2007-09-26 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Information processing apparatus, database search system, and program |
US7383255B2 (en) | 2003-06-23 | 2008-06-03 | Microsoft Corporation | Common query runtime system and application programming interface |
US9152735B2 (en) | 2003-07-24 | 2015-10-06 | Alcatel Lucent | Method and apparatus for composing XSL transformations with XML publishing views |
US7313563B2 (en) | 2003-07-30 | 2007-12-25 | International Business Machines Corporation | Method, system and recording medium for maintaining the order of nodes in a heirarchical document |
US7174328B2 (en) | 2003-09-02 | 2007-02-06 | International Business Machines Corp. | Selective path signatures for query processing over a hierarchical tagged data structure |
US7013311B2 (en) | 2003-09-05 | 2006-03-14 | International Business Machines Corporation | Providing XML cursor support on an XML repository built on top of a relational database system |
US7315852B2 (en) | 2003-10-31 | 2008-01-01 | International Business Machines Corporation | XPath containment for index and materialized view matching |
US7165063B2 (en) * | 2003-11-19 | 2007-01-16 | International Business Machines Corporation | Context quantifier transformation in XML query rewrite |
US20050257201A1 (en) * | 2004-05-17 | 2005-11-17 | International Business Machines Corporation | Optimization of XPath expressions for evaluation upon streaming XML data |
US7346609B2 (en) * | 2004-11-16 | 2008-03-18 | International Business Machines Corporation | Streaming XPath algorithm for XPath value index key generation |
US7548926B2 (en) * | 2005-10-05 | 2009-06-16 | Microsoft Corporation | High performance navigator for parsing inputs of a message |
US20070174241A1 (en) * | 2006-01-20 | 2007-07-26 | Beyer Kevin S | Match graphs for query evaluation |
US7596548B2 (en) * | 2006-01-20 | 2009-09-29 | International Business Machines Corporation | Query evaluation using ancestor information |
-
2006
- 2006-01-20 US US11/336,140 patent/US7596548B2/en not_active Expired - Fee Related
-
2007
- 2007-01-12 WO PCT/EP2007/050292 patent/WO2007082845A2/en active Application Filing
-
2009
- 2009-08-07 US US12/537,983 patent/US7979423B2/en not_active Expired - Fee Related
-
2011
- 2011-05-23 US US13/113,657 patent/US8688721B2/en not_active Expired - Fee Related
-
2014
- 2014-02-12 US US14/179,176 patent/US9087139B2/en not_active Expired - Fee Related
-
2015
- 2015-06-02 US US14/728,220 patent/US9659001B2/en not_active Expired - Fee Related
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040205082A1 (en) * | 2003-04-14 | 2004-10-14 | International Business Machines Corporation | System and method for querying XML streams |
Non-Patent Citations (5)
Title |
---|
BAR-YOSSEF ZIV ET AL: "Buffering in query evaluation over XML streams", PROC ACM SIGACT SIGMOD SIGART SYMP PRINC DATABASE SYST; PROCEEDINGS OF THE ACM SIGACT-SIGMOD-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS; PROCEEDINGS OF THE TWENTY-FOURTH ACM SIGMOD-SIGACT-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, PO, 2005, pages 216 - 227, XP002428835 * |
D. FLORESCU ET AL.: "The BEA streaming XQuery processor", VLDB JOURNAL, 2004, pages 294 - 315, XP002428852 * |
D. FLORESCU ET AL.: "The BEA/XQRL Streaming XQuery Processor", PROCEEDINGS OF VLDB 2003, 2003, XP002428851 * |
VANJA JOSIFOVSKI ET AL: "Querying XML streams", THE VLDB JOURNAL ; THE INTERNATIONAL JOURNAL ON VERY LARGE DATA BASES, SPRINGER-VERLAG, BE, vol. 14, no. 2, 1 April 2005 (2005-04-01), pages 197 - 210, XP019351977, ISSN: 0949-877X * |
YANLEI DIAO ET AL: "Implementing Memoization in a Streaming XQuery Processor", PROCEEDINGS OF THE DATABASE AND XML TECHNOLOGIES, SECOND INTERNATIONAL XML DATABASE SYMPOSIUM, XSYM 2004, 2004, pages 35 - 50, XP019009590 * |
Also Published As
Publication number | Publication date |
---|---|
US20140164428A1 (en) | 2014-06-12 |
US8688721B2 (en) | 2014-04-01 |
US9087139B2 (en) | 2015-07-21 |
WO2007082845A2 (en) | 2007-07-26 |
US9659001B2 (en) | 2017-05-23 |
US20150261815A1 (en) | 2015-09-17 |
US20090287700A1 (en) | 2009-11-19 |
US20070174242A1 (en) | 2007-07-26 |
US7979423B2 (en) | 2011-07-12 |
US7596548B2 (en) | 2009-09-29 |
US20110225144A1 (en) | 2011-09-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2007082845A3 (en) | Query evaluation using ancestor information | |
WO2008051750A3 (en) | Associating geographic-related information with objects | |
WO2004042493A3 (en) | Method and system for discovering knowledge from text documents | |
WO2006099621A3 (en) | Topic specific language models built from large numbers of documents | |
WO2009114714A3 (en) | User contributed knowledge database | |
WO2007078723A3 (en) | Method for displaying search results and contextually related items | |
WO2007041351A3 (en) | Actively characterizing a network | |
WO2007002412A3 (en) | Systems and methods for retrieving data | |
WO2005098592A3 (en) | Systems and methods for weighting a search query result | |
WO2008039542A3 (en) | System and method of ad-hoc analysis of data | |
WO2008156473A3 (en) | Using relevance feedback in face recognition | |
WO2007024642A3 (en) | Ranking and choosing suggested query candidates | |
WO2007131225A3 (en) | Identifying a route configured to travel through multiple points of interest | |
WO2006028953A3 (en) | Query-based document composition | |
WO2009026193A3 (en) | System and method for search | |
WO2009152007A3 (en) | Collecting and scoring online references | |
WO2008088721A3 (en) | Querying data and an associated ontology in a database management system | |
WO2007025130A3 (en) | Search system and method | |
RU2011140008A (en) | ENSURING ACCESS TO THE DATA ELEMENT USING ACCESS GRAPHS | |
WO2006004946A3 (en) | Accelerated schema-based validation | |
WO2006116537A3 (en) | Method and apparatus for in-built searching and aggregating functionality | |
WO2007140226A3 (en) | Method of providing a digital athlete profile | |
WO2014052609A3 (en) | Evaluating xml full text search | |
WO2010141799A3 (en) | Feature engineering and user behavior analysis | |
WO2011008889A3 (en) | Methods and apparatus for efficiently processing multiple keyword queries on a distributed network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 07703832 Country of ref document: EP Kind code of ref document: A2 |