WO2004040446A3 - Hardware parser accelerator - Google Patents

Hardware parser accelerator

Info

Publication number
WO2004040446A3
Authority
WO
WIPO (PCT)
Prior art keywords
state table
palette
documents
dedicated
parsing
Prior art date
Application number
PCT/US2003/031314
Other languages
French (fr)
Other versions
WO2004040446A2 (en)
Inventor
Michael C Dapp
Eric C Lett
Original Assignee
Lockheed Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2002-10-29
Filing date
2003-10-03
Publication date
2004-09-02
Priority claimed from US10/331,315 (US20040083466A1)
Application filed by Lockheed Corp
Priority to AU2003277249A (AU2003277249A1)
Priority to JP2004548349A (JP2006505043A)
Priority to EP03809942A (EP1579320A2)
Priority to CA002504652A (CA2504652A1)
Publication of WO2004040446A2
Publication of WO2004040446A3

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00 Arrangements for software engineering
    • G06F8/40 Transformation of program code
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/205 Parsing

Abstract

Dedicated hardware is employed to parse documents, such as XML documents, in much reduced time while removing a substantial processing burden from the host CPU. The conventional state table is divided into a character palette, a state table in abbreviated form, and a next state palette. The palettes may be implemented in dedicated high-speed memory, and a cache arrangement may be used to accelerate accesses to the abbreviated state table. Processing is performed in parallel pipelines, which may be partially concurrent. Dedicated registers may be updated in parallel as well, and strings of special characters of arbitrary length may be accommodated by a character palette skip feature, under control of a flag bit, to further accelerate parsing of a document.
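The three-way split of the state table described in the abstract can be illustrated in software. The following Python sketch is only one possible reading, not the patented design: it assumes the character palette maps each byte to a small class code, the abbreviated state table holds small indices, and the next state palette expands an index into a full entry. The table contents, the names (`CHAR_PALETTE`, `STATE_TABLE`, `NEXT_STATE_PALETTE`, `run`), and the exact behaviour of the skip flag are hypothetical. Pipelines, caches, and dedicated registers are not modelled.

```python
# Illustrative sketch only: table contents and names are assumptions,
# not taken from the patent text.

# Character classes (assumed): a few small codes instead of 256 columns.
OTHER, LT, GT, SPACE = 0, 1, 2, 3

# Character palette: one class code per possible byte value.
CHAR_PALETTE = [OTHER] * 256
CHAR_PALETTE[ord("<")] = LT
CHAR_PALETTE[ord(">")] = GT
for ch in " \t\r\n":
    CHAR_PALETTE[ord(ch)] = SPACE

# Parser states (assumed): outside a tag, or inside a tag.
TEXT, TAG = 0, 1

# Next state palette: each distinct full entry is stored once.
# An entry is (next_state, skip_flag). When skip_flag is set, a following
# run of SPACE characters is consumed without any state table lookup.
NEXT_STATE_PALETTE = [(TEXT, False), (TAG, True), (TAG, False)]

# Abbreviated state table: STATE_TABLE[state][char_class] is a small
# index into NEXT_STATE_PALETTE rather than a full-width entry.
STATE_TABLE = [
    # OTHER, LT, GT, SPACE
    [0, 1, 0, 0],  # TEXT
    [2, 2, 0, 2],  # TAG
]


def run(data: bytes):
    """Scan data; return (final_state, table_lookups, states_entered)."""
    state, i, lookups = TEXT, 0, 0
    trace = []
    while i < len(data):
        char_class = CHAR_PALETTE[data[i]]        # character palette
        index = STATE_TABLE[state][char_class]    # abbreviated state table
        lookups += 1
        state, skip = NEXT_STATE_PALETTE[index]   # next state palette
        trace.append(state)
        i += 1
        if skip:
            # Skip feature: advance over a run of SPACE characters of any
            # length using only the character palette.
            while i < len(data) and CHAR_PALETTE[data[i]] == SPACE:
                i += 1
    return state, lookups, trace


if __name__ == "__main__":
    print(run(b"<   a>b"))
```

In this sketch the seven-byte input `b"<   a>b"` needs only four state table lookups, because the three spaces after `<` are consumed by the skip loop. The storage saving comes from the same split: the table has one column per character class rather than one per byte value, and each cell holds a short index instead of a full entry.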
PCT/US2003/031314 2002-10-29 2003-10-03 Hardware parser accelerator WO2004040446A2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
AU2003277249A AU2003277249A1 (en) 2002-10-29 2003-10-03 Hardware parser accelerator
JP2004548349A JP2006505043A (en) 2002-10-29 2003-10-03 Hardware parser accelerator
EP03809942A EP1579320A2 (en) 2002-10-29 2003-10-03 Hardware parser accelerator
CA002504652A CA2504652A1 (en) 2002-10-29 2003-10-03 Hardware parser accelerator

Applications Claiming Priority (8)

Application Number Priority Date Filing Date Title
US42177402P 2002-10-29 2002-10-29
US42177502P 2002-10-29 2002-10-29
US42177302P 2002-10-29 2002-10-29
US60/421,775 2002-10-29
US60/421,773 2002-10-29
US60/421,774 2002-10-29
US10/331,315 2002-12-31
US10/331,315 US20040083466A1 (en) 2002-10-29 2002-12-31 Hardware parser accelerator

Publications (2)

Publication Number Publication Date
WO2004040446A2 WO2004040446A2 (en) 2004-05-13
WO2004040446A3 WO2004040446A3 (en) 2004-09-02

Family

ID=32234359

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2003/031314 WO2004040446A2 (en) 2002-10-29 2003-10-03 Hardware parser accelerator

Country Status (6)

Country Link
EP (1) EP1579320A2 (en)
JP (1) JP2006505043A (en)
KR (1) KR20050072128A (en)
AU (1) AU2003277249A1 (en)
CA (1) CA2504652A1 (en)
WO (1) WO2004040446A2 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7716577B2 (en) 2005-11-14 2010-05-11 Oracle America, Inc. Method and apparatus for hardware XML acceleration
US7596745B2 (en) 2005-11-14 2009-09-29 Sun Microsystems, Inc. Programmable hardware finite state machine for facilitating tokenization of an XML document
US7665016B2 (en) 2005-11-14 2010-02-16 Sun Microsystems, Inc. Method and apparatus for virtualized XML parsing
US7665015B2 (en) 2005-11-14 2010-02-16 Sun Microsystems, Inc. Hardware unit for parsing an XML document
KR101231054B1 (en) * 2008-12-19 2013-02-07 한국전자통신연구원 Parser Accelerator having hardware engine and method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5193192A (en) * 1989-12-29 1993-03-09 Supercomputer Systems Limited Partnership Vectorized LR parsing of computer programs
US5414833A (en) * 1993-10-27 1995-05-09 International Business Machines Corporation Network security system and method using a parallel finite state machine adaptive active monitor and responder
US5511213A (en) * 1992-05-08 1996-04-23 Correa; Nelson Associative memory processor architecture for the efficient execution of parsing algorithms for natural language processing and pattern recognition
US5995963A (en) * 1996-06-27 1999-11-30 Fujitsu Limited Apparatus and method of multi-string matching based on sparse state transition list

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS60105039A (en) * 1983-11-12 1985-06-10 Nippon Telegr & Teleph Corp <Ntt> Collation system of character string
JPH0746362B2 (en) * 1986-02-03 1995-05-17 日本電信電話株式会社 String matching method
JP2713962B2 (en) * 1988-03-16 1998-02-16 株式会社日立製作所 Character string search method and apparatus
JPH04348469A (en) * 1990-07-23 1992-12-03 Hitachi Ltd Character string retrieving device and its method
JP3062119B2 (en) * 1997-06-20 2000-07-10 日本電気アイシーマイコンシステム株式会社 Character string search table, method for creating the same, and character string search method
JP3368883B2 (en) * 2000-02-04 2003-01-20 インターナショナル・ビジネス・マシーンズ・コーポレーション Data compression device, database system, data communication system, data compression method, storage medium, and program transmission device

Also Published As

Publication number Publication date
JP2006505043A (en) 2006-02-09
KR20050072128A (en) 2005-07-08
CA2504652A1 (en) 2004-05-13
EP1579320A2 (en) 2005-09-28
AU2003277249A1 (en) 2004-05-25
WO2004040446A2 (en) 2004-05-13

Similar Documents

Publication Publication Date Title
US20040083466A1 (en) Hardware parser accelerator
US7421386B2 (en) Full-form lexicon with tagged data and methods of constructing and using the same
Hsu et al. Space-efficient data structures for top-k completion
KR940024591A (en) Computer devices
EP1283486A3 (en) Storage of row-column data
BR0312120A (en) Method for entering text on an electronic device, and, electronic device
KR970705078A (en) Two-Tier Prefetch Buffer Structure and Method with Bypass
Skibiński et al. Revisiting dictionary‐based compression
CA2366485A1 (en) System and method for parsing a document
JPS5762460A (en) Inputting method for sentence to be translated by electronic translating machine
US20040225497A1 (en) Compressed yet quickly searchable digital textual data format
WO2004040446A3 (en) Hardware parser accelerator
Kärkkäinen et al. Lazy lempel-ziv factorization algorithms
WO2002095950A8 (en) Character table implemented data compression method and apparatus
KR910005155A (en) Microprocessor
WO2002027466A3 (en) Method for accessing a storage unit during the search for substrings, and a corresponding storage unit
Ferrada et al. Relative Lempel-Ziv with constant-time random access
Puglisi et al. Document retrieval hacks
Matsumoto et al. Can general-purpose compression schemes really compress DNA sequences
Samanta et al. An enhanced CAM architecture to accelerate LZW compression algorithm
CN1177271C (en) Four-stroke number code input method for characters and words and without duplication code and its keyboard
WORD CERNY’Sconjecture
Moffat et al. Reducing space requirements for disk resident suffix arrays
Nikolova et al. A Compression/Decompression Scheme for Embedded Systems Code
Grabowski et al. Language-independent word-based text compression with fast decompression

Legal Events

Date Code Title Description
AK Designated states
    Kind code of ref document: A2
    Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG UZ VC VN YU ZA ZM ZW
AL Designated countries for regional patents
    Kind code of ref document: A2
    Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG
121 EP: the EPO has been informed by WIPO that EP was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed before 20040101)
WWE WIPO information: entry into national phase
    Ref document number: 2004548349
    Country of ref document: JP
    Ref document number: 2003277249
    Country of ref document: AU
WWE WIPO information: entry into national phase
    Ref document number: 1020057007620
    Country of ref document: KR
    Ref document number: 2504652
    Country of ref document: CA
WWE WIPO information: entry into national phase
    Ref document number: 813/KOLNP/2005
    Country of ref document: IN
WWE WIPO information: entry into national phase
    Ref document number: 2003809942
    Country of ref document: EP
WWE WIPO information: entry into national phase
    Ref document number: 20038A61657
    Country of ref document: CN
WWP WIPO information: published in national office
    Ref document number: 1020057007620
    Country of ref document: KR
WWP WIPO information: published in national office
    Ref document number: 2003809942
    Country of ref document: EP
DPE2 Request for preliminary examination filed before expiration of 19th month from priority date (PCT application filed from 20040101)