CA2242158A1 - Method and apparatus for searching and displaying structured document - Google Patents

Method and apparatus for searching and displaying structured document

Info

Publication number
CA2242158A1
Authority
CA
Canada
Prior art keywords
document
search
structured document
information
matching
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA002242158A
Other languages
French (fr)
Other versions
CA2242158C (en)
Inventor
Takuya Okamoto
Toru Takahashi
Yuki Aoyama
Noriyuki Yamasaki
Eiko Murata
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hitachi Ltd
Original Assignee
Hitachi Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1997-07-01
Filing date
1998-06-29
Publication date
1999-01-01
Application filed by Hitachi Ltd
Publication of CA2242158A1
Application granted
Publication of CA2242158C
Anticipated expiration
Expired - Fee Related (current legal status)

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/80 Information retrieval; Database structures therefor; File system structures therefor of semi-structured data, e.g. markup language structured data such as SGML, XML or HTML
    • G06F16/81 Indexing, e.g. XML tags; Data structures therefor; Storage structures

Abstract

A method and an apparatus for searching and displaying a structured document are disclosed. The document registration process takes a structured document file as input, generates an analyzed structured document and document search information, and stores each in its own database. A query entered from an input/output unit is analyzed, a document search index is read, and a search process is executed. The search outputs identifier information for each matching document and position information for the matching strings. In the display process, a document read process reads the corresponding analyzed structured document from the database based on the matched document identifier information. The document display process then embeds the matching information in the structured document based on the matching-string position information, and generates and displays a structured document for display with highlight information added. In this way, the search runs against a document from which the element information that would obstruct the search has been removed, and the search result is displayed with highlight information added to the original structured document.
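The register, search, and display flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that the "element information" is HTML-like tags, that highlighting uses a `<mark>` element, that plain dictionaries stand in for the databases, and that a linear scan stands in for the search index. The function names `register`, `search`, and `display` are hypothetical.

```python
import re

TAG = re.compile(r"<[^>]*>")  # assumption: element information is HTML-like tags


def register(doc_id, markup, store, index):
    """Store the original markup, plus tag-free text and a position map."""
    chars, offsets = [], []
    pos = 0
    for tag in TAG.finditer(markup):
        # Copy the text between tags, remembering each character's original offset.
        for i in range(pos, tag.start()):
            chars.append(markup[i])
            offsets.append(i)
        pos = tag.end()
    for i in range(pos, len(markup)):
        chars.append(markup[i])
        offsets.append(i)
    store[doc_id] = markup
    index[doc_id] = ("".join(chars), offsets)


def search(query, index):
    """Return (document id, [(start, end), ...]) for each matching document."""
    results = []
    for doc_id, (text, _) in index.items():
        hits = [(m.start(), m.end()) for m in re.finditer(re.escape(query), text)]
        if hits:
            results.append((doc_id, hits))
    return results


def display(doc_id, hits, store, index):
    """Embed highlight tags in the original markup using the position map."""
    markup = store[doc_id]
    _, offsets = index[doc_id]
    marked = set()
    for start, end in hits:
        marked.update(offsets[start:end])
    out, inside = [], False
    for i, ch in enumerate(markup):
        if i in marked and not inside:
            out.append("<mark>")
            inside = True
        elif i not in marked and inside:
            out.append("</mark>")
            inside = False
        out.append(ch)
    if inside:
        out.append("</mark>")
    return "".join(out)


if __name__ == "__main__":
    store, index = {}, {}
    register("d1", "<p>struc<b>tured</b> document</p>", store, index)
    for doc_id, hits in search("structured", index):
        print(display(doc_id, hits, store, index))
```

In this sketch the query "structured" matches even though a `<b>` tag splits the word in the original, because matching runs on the tag-free text. The highlight is closed and reopened around the tag so the displayed markup stays well-formed.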
CA002242158A 1997-07-01 1998-06-29 Method and apparatus for searching and displaying structured document Expired - Fee Related CA2242158C (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
JP19071697 1997-07-01
JP09-190716 1997-07-01
JP19540897 1997-07-22
JP09-195408 1997-07-22

Publications (2)

Publication Number Publication Date
CA2242158A1 (en) 1999-01-01
CA2242158C (en) 2004-06-01

Family

ID=29422287

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002242158A Expired - Fee Related CA2242158C (en) 1997-07-01 1998-06-29 Method and apparatus for searching and displaying structured document

Country Status (4)

Country Link
US (1) US7707139B2 (en)
KR (1) KR100324456B1 (en)
CN (1) CN1170240C (en)
CA (1) CA2242158C (en)

Families Citing this family (90)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7954056B2 (en) * 1997-12-22 2011-05-31 Ricoh Company, Ltd. Television-based visualization and navigation interface
US7596755B2 (en) * 1997-12-22 2009-09-29 Ricoh Company, Ltd. Multimedia visualization and integration environment
JP4183311B2 (en) 1997-12-22 2008-11-19 株式会社リコー Document annotation method, annotation device, and recording medium
US7257589B1 (en) 1997-12-22 2007-08-14 Ricoh Company, Ltd. Techniques for targeting information to users
US7124093B1 (en) 1997-12-22 2006-10-17 Ricoh Company, Ltd. Method, system and computer code for content based web advertising
US20080028292A1 (en) * 1997-12-22 2008-01-31 Ricoh Company, Ltd. Techniques to facilitate reading of a document
US6582475B2 (en) * 1998-09-09 2003-06-24 Ricoh Company Limited Automatic adaptive document printing help system
JP2000339312A (en) * 1999-05-31 2000-12-08 Toshiba Corp System for editing document and method for generating tag information management table
US7228492B1 (en) * 1999-07-06 2007-06-05 Ricoh Company, Ltd. 2D graph displaying document locations of user-specified concept of interest
JP2001028717A (en) * 1999-07-12 2001-01-30 Sony Corp Information display device, information receiver and their methods
JP4091726B2 (en) * 2000-02-23 2008-05-28 インターナショナル・ビジネス・マシーンズ・コーポレーション Method for generating display rule of structured document, medium on which system and program are recorded, method for changing structured document and its document type definition, medium on which system and program are recorded
US8578266B2 (en) * 2000-06-26 2013-11-05 Vertical Computer Systems, Inc. Method and system for providing a framework for processing markup language documents
CN1167027C (en) * 2001-08-03 2004-09-15 富士通株式会社 Format file information extracting device and method
US8635531B2 (en) * 2002-02-21 2014-01-21 Ricoh Company, Ltd. Techniques for displaying information stored in multiple multimedia documents
JP2003196270A (en) * 2001-12-27 2003-07-11 Sharp Corp Document information processing method, document information processor, communication system, computer program and recording medium
JP2004038512A (en) * 2002-07-03 2004-02-05 Nec Corp Information processing terminal, and designated tag position moving method and program used therefor
US20040205514A1 (en) * 2002-06-28 2004-10-14 Microsoft Corporation Hyperlink preview utility and method
US20040064826A1 (en) * 2002-09-30 2004-04-01 Timothy Lim Method and system for object system interoperability
US7149752B2 (en) * 2002-12-03 2006-12-12 Jp Morgan Chase Bank Method for simplifying databinding in application programs
US7401156B2 (en) * 2003-02-03 2008-07-15 Jp Morgan Chase Bank Method using control interface to suspend software network environment running on network devices for loading and executing another software network environment
JP3981729B2 (en) * 2003-03-12 2007-09-26 独立行政法人情報通信研究機構 Keyword emphasis device and program
US7379998B2 (en) * 2003-03-31 2008-05-27 Jp Morgan Chase Bank System and method for multi-platform queue queries
US20040230602A1 (en) * 2003-05-14 2004-11-18 Andrew Doddington System and method for decoupling data presentation layer and data gathering and storage layer in a distributed data processing system
US7356528B1 (en) * 2003-05-15 2008-04-08 At&T Corp. Phrase matching in documents having nested-structure arbitrary (document-specific) markup
US7366722B2 (en) * 2003-05-15 2008-04-29 Jp Morgan Chase Bank System and method for specifying application services and distributing them across multiple processors using XML
US8095659B2 (en) 2003-05-16 2012-01-10 Jp Morgan Chase Bank Service interface
US20040236724A1 (en) * 2003-05-19 2004-11-25 Shu-Yao Chien Searching element-based document descriptions in a database
US20050144174A1 (en) * 2003-12-31 2005-06-30 Leonid Pesenson Framework for providing remote processing of a graphical user interface
JP4435582B2 (en) * 2004-01-08 2010-03-17 株式会社リコー Image processing apparatus, data search method, and data search program
JP2005234837A (en) * 2004-02-19 2005-09-02 Fujitsu Ltd Structured document processing method, structured document processing system and its program
US20050222990A1 (en) * 2004-04-06 2005-10-06 Milne Kenneth T Methods and systems for using script files to obtain, format and disseminate database information
CA2563354C (en) * 2004-04-26 2010-08-17 Jp Morgan Chase Bank System and method for routing messages
US7860874B2 (en) 2004-06-08 2010-12-28 Siemens Industry, Inc. Method for searching across a PLC network
JP4309818B2 (en) * 2004-07-15 2009-08-05 株式会社東芝 Structured document management device, search device, storage method, search method, and program
JP2006127235A (en) * 2004-10-29 2006-05-18 Toshiba Corp Structured document management system, structured document management method and program
CN100462961C (en) * 2004-11-09 2009-02-18 国际商业机器公司 Method for organizing multi-file and equipment for displaying multi-file
JP2006185408A (en) * 2004-11-30 2006-07-13 Matsushita Electric Ind Co Ltd Database construction device, database retrieval device, and database device
US20060136391A1 (en) * 2004-12-21 2006-06-22 Morris Robert P System and method for generating a search index and executing a context-sensitive search
JP4900640B2 (en) * 2005-03-30 2012-03-21 京セラ株式会社 Portable terminal device and document display control method thereof
US8239394B1 (en) 2005-03-31 2012-08-07 Google Inc. Bloom filters for query simulation
US7587387B2 (en) 2005-03-31 2009-09-08 Google Inc. User interface for facts query engine with snippets from information sources that include query terms and answer terms
US7953720B1 (en) 2005-03-31 2011-05-31 Google Inc. Selecting the best answer to a fact query from among a set of potential answers
US7631007B2 (en) * 2005-04-12 2009-12-08 Scenera Technologies, Llc System and method for tracking user activity related to network resources using a browser
US7587395B2 (en) * 2005-07-27 2009-09-08 John Harney System and method for providing profile matching with an unstructured document
US20070185870A1 (en) 2006-01-27 2007-08-09 Hogue Andrew W Data object visualization using graphs
US7925676B2 (en) 2006-01-27 2011-04-12 Google Inc. Data object visualization using maps
US8954426B2 (en) * 2006-02-17 2015-02-10 Google Inc. Query language
US8055674B2 (en) * 2006-02-17 2011-11-08 Google Inc. Annotation framework
JP4489029B2 (en) * 2006-02-01 2010-06-23 株式会社東芝 Structured document search system and structured document search method
JP2007241888A (en) * 2006-03-10 2007-09-20 Sony Corp Information processor, processing method, and program
US8725729B2 (en) 2006-04-03 2014-05-13 Steven G. Lisa System, methods and applications for embedded internet searching and result display
US7610172B2 (en) * 2006-06-16 2009-10-27 Jpmorgan Chase Bank, N.A. Method and system for monitoring non-occurring events
CN101110073A (en) * 2006-07-20 2008-01-23 朗迅科技公司 Method and system for highlighting and adding commentary to network web page content
US8954412B1 (en) 2006-09-28 2015-02-10 Google Inc. Corroborating facts in electronic documents
US7636712B2 (en) * 2006-11-14 2009-12-22 Microsoft Corporation Batching document identifiers for result trimming
US8347202B1 (en) 2007-03-14 2013-01-01 Google Inc. Determining geographic locations for place names in a fact repository
US8161369B2 (en) * 2007-03-16 2012-04-17 Branchfire, Llc System and method of providing a two-part graphic design and interactive document application
US8239751B1 (en) 2007-05-16 2012-08-07 Google Inc. Data from web documents in a spreadsheet
US8321557B2 (en) * 2007-10-10 2012-11-27 Sony Mobile Communications Ab Web feeds over SIP
JP5429165B2 (en) * 2008-06-18 2014-02-26 日本電気株式会社 Retrieval expression generation system, retrieval expression generation method, retrieval expression generation program, and recording medium
US9135277B2 (en) 2009-08-07 2015-09-15 Google Inc. Architecture for responding to a visual query
US9087059B2 (en) * 2009-08-07 2015-07-21 Google Inc. User interface for presenting search results for multiple regions of a visual query
US20120150861A1 (en) * 2010-12-10 2012-06-14 Microsoft Corporation Highlighting known answers in search results
CN102567421B (en) * 2010-12-27 2014-04-02 北大方正集团有限公司 Document retrieval method and device
US8745022B2 (en) * 2011-11-22 2014-06-03 Navteq B.V. Full text search based on interwoven string tokens
US8738595B2 (en) 2011-11-22 2014-05-27 Navteq B.V. Location based full text search
US20130174029A1 (en) * 2012-01-04 2013-07-04 Freedom Solutions Group, LLC d/b/a Microsystems Method and apparatus for analyzing a document
US8700661B2 (en) 2012-04-12 2014-04-15 Navteq B.V. Full text search using R-trees
US10679160B1 (en) 2012-05-24 2020-06-09 Jpmorgan Chase Bank Enterprise fulfillment system with dynamic prefetching capabilities, secured data access capabilities and system monitoring
US9697524B1 (en) 2012-05-24 2017-07-04 Jpmorgan Chase Bank, N.A. Enterprise fulfillment system with dynamic prefetching capabilities
US9990636B1 (en) 2012-05-24 2018-06-05 Jpmorgan Chase Bank, N.A. Enterprise fulfillment system with dynamic prefetching, secured data access, system monitoring, and performance optimization capabilities
WO2013179348A1 (en) * 2012-05-31 2013-12-05 富士通株式会社 Index generating program and search program
US9171069B2 (en) 2012-07-31 2015-10-27 Freedom Solutions Group, Llc Method and apparatus for analyzing a document
US9619445B1 (en) * 2012-08-23 2017-04-11 Inkling Systems, Inc. Conversion of content to formats suitable for digital distributions thereof
US8839202B2 (en) 2012-10-12 2014-09-16 Vmware, Inc. Test environment managed within tests
US9069902B2 (en) 2012-10-12 2015-06-30 Vmware, Inc. Software test automation
US8949794B2 (en) * 2012-10-12 2015-02-03 Vmware, Inc. Binding a software item to a plain english control name
US9684587B2 (en) 2012-10-12 2017-06-20 Vmware, Inc. Test creation with execution
US10067858B2 (en) 2012-10-12 2018-09-04 Vmware, Inc. Cloud-based software testing
US9292422B2 (en) 2012-10-12 2016-03-22 Vmware, Inc. Scheduled software item testing
US10387294B2 (en) 2012-10-12 2019-08-20 Vmware, Inc. Altering a test
US8839201B2 (en) 2012-10-12 2014-09-16 Vmware, Inc. Capturing test data associated with error conditions in software item testing
US9292416B2 (en) 2012-10-12 2016-03-22 Vmware, Inc. Software development kit testing
US10878492B2 (en) * 2015-05-08 2020-12-29 Teachers Insurance & Annuity Association Of America Providing search-directed user interface for online banking applications
US10289719B2 (en) * 2015-07-10 2019-05-14 Mitsubishi Electric Corporation Data acquisition device, data acquisition method and computer readable medium
US11062129B2 (en) * 2015-12-30 2021-07-13 Veritas Technologies Llc Systems and methods for enabling search services to highlight documents
CN110636181A (en) * 2016-03-01 2019-12-31 京瓷办公信息系统株式会社 Information processing apparatus
JP6740803B2 (en) * 2016-08-22 2020-08-19 富士ゼロックス株式会社 Information processing device, information processing system, program
CN112579937A (en) * 2019-09-30 2021-03-30 北京国双科技有限公司 Character highlight display method and device
CN111523019B (en) * 2020-04-23 2023-05-09 北京百度网讯科技有限公司 Method, apparatus, device and storage medium for outputting information

Family Cites Families (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5185698A (en) * 1989-02-24 1993-02-09 International Business Machines Corporation Technique for contracting element marks in a structured document
US5276616A (en) * 1989-10-16 1994-01-04 Sharp Kabushiki Kaisha Apparatus for automatically generating index
CA2048039A1 (en) * 1991-07-19 1993-01-20 Steven Derose Data processing system and method for generating a representation for and random access rendering of electronic documents
JPH0830620A (en) * 1994-07-19 1996-02-02 Fuji Xerox Co Ltd Structure retrieving device
US5583762A (en) * 1994-08-22 1996-12-10 Oclc Online Library Center, Incorporated Generation and reduction of an SGML defined grammer
US5694594A (en) * 1994-11-14 1997-12-02 Chang; Daniel System for linking hypermedia data objects in accordance with associations of source and destination data objects and similarity threshold without using keywords or link-difining terms
JP3063555B2 (en) * 1995-01-06 2000-07-12 富士ゼロックス株式会社 Document database management apparatus and method
JPH08212230A (en) 1995-01-31 1996-08-20 Toshiba Corp Document retrieval method and device therefor
JP2896634B2 (en) * 1995-03-02 1999-05-31 富士ゼロックス株式会社 Full-text registered word search device and full-text registered word search method
JP3724847B2 (en) * 1995-06-05 2005-12-07 株式会社日立製作所 Structured document difference extraction method and apparatus
JPH08339369A (en) 1995-06-14 1996-12-24 Fuji Xerox Co Ltd Method and device for document display
JPH0969101A (en) * 1995-08-31 1997-03-11 Hitachi Ltd Method and device for generating structured document
JP3566457B2 (en) 1996-05-31 2004-09-15 株式会社日立製作所 Structured document version management method and apparatus

Also Published As

Publication number Publication date
CA2242158C (en) 2004-06-01
CN1170240C (en) 2004-10-06
CN1206883A (en) 1999-02-03
US7707139B2 (en) 2010-04-27
KR100324456B1 (en) 2002-04-17
KR19990013482A (en) 1999-02-25
US20020065814A1 (en) 2002-05-30

Similar Documents

Publication Publication Date Title
CA2242158A1 (en) Method and apparatus for searching and displaying structured document
DE69810657D1 (en) SYSTEM AND METHOD FOR FINDING, ORGANIZING AND USING NETWORKED DATA
WO1999019817A3 (en) A system and method for processing a memory map to provide listing information representing data within a database
WO2004090755A3 (en) System and method for providing preferred language ordering of search results
EP1457898A3 (en) Data search system and method
DK1107136T3 (en) Content-based image search system and method
ATE459936T1 (en) METHOD AND DEVICE FOR SEARCHING BIOMETRIC IMAGE DATA
DE69926305D1 (en) Database method and apparatus with hierarchical bit vector based index structure
RU2006133549A (en) SYSTEM AND METHOD OF INTELLECTUAL SEARCH AND SAMPLE
US20060047732A1 (en) Document processing apparatus for searching documents, control method therefor, program for implementing the method, and storage medium storing the program
WO2001082113A3 (en) System and method for proximity searching position information using a proximity parameter
WO2001061571A3 (en) Attribute tagging and matching system and method for database management
KR20030066064A (en) Internet searching service system for displaying a search result to different user interface depending on query and searching method thereof
WO2006031466A3 (en) Functionality and system for converting data from a first to a second form
JPH05324719A (en) Document retrieval system
KR970049752A (en) Korean Natural Language Query Information Retrieval Using Verb Information
JPH07296005A (en) Japanese text registration/retrieval device
JP2601139B2 (en) String search device
JPH05158984A (en) Device for extracting character string
KR960030014A (en) Method of searching simultaneous connection of drawing and part data in vehicle parts search system
JP2581376B2 (en) Document search device
CN115982320A (en) Semantic similarity retrieval method based on medical equipment manual description
JPS62143180A (en) Retrieving device for image information
JPH0844767A (en) Data processing method
JP2001331496A (en) Domain term dictionary preparation system and method

Legal Events

Date Code Title Description
EEER Examination request
MKLA Lapsed