WO2007129288A3 - Electronic document reformatting - Google Patents

Electronic document reformatting Download PDF

Info

Publication number
WO2007129288A3
WO2007129288A3 PCT/IE2007/000030 IE2007000030W WO2007129288A3 WO 2007129288 A3 WO2007129288 A3 WO 2007129288A3 IE 2007000030 W IE2007000030 W IE 2007000030W WO 2007129288 A3 WO2007129288 A3 WO 2007129288A3
Authority
WO
WIPO (PCT)
Prior art keywords
data
electronic document
image data
document
alphanumerical
Prior art date
Application number
PCT/IE2007/000030
Other languages
French (fr)
Other versions
WO2007129288A2 (en
Inventor
Seamus Mcgrenery
Brian Mcgrath
Kevin Clarke
Original Assignee
Big River Ltd
Seamus Mcgrenery
Brian Mcgrath
Kevin Clarke
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Big River Ltd, Seamus Mcgrenery, Brian Mcgrath, Kevin Clarke filed Critical Big River Ltd
Publication of WO2007129288A2 publication Critical patent/WO2007129288A2/en
Publication of WO2007129288A3 publication Critical patent/WO2007129288A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/151Transformation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition

Abstract

An apparatus and methods are provided for converting electronic documents formatted for printing into corresponding electronic documents formatted for display. The conversion involves the comparison the document layout defined by the print format against at least one document layout template, the mapping of alphanumerical data in the electronic document to corresponding ASCII character data, the optical recognition of alphanumerical data in the electronic document, the comparison of the optically-recognized alphanumerical data against the mapped alphanumerical data, the identification of image data in the electronic document, the optional rescaling of the image data if the image data exceeds an image data parameter, and the output of a converted electronic document including the optionally rescaled image data and compared ASCII character data, the output electronic document being formatted for display according to the document layout template.
PCT/IE2007/000030 2006-05-05 2007-03-06 Electronic document reformatting WO2007129288A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
IES2006/0361 2006-05-05
IE20060361A IES20060361A2 (en) 2006-05-05 2006-05-05 Electronic document conversion

Publications (2)

Publication Number Publication Date
WO2007129288A2 WO2007129288A2 (en) 2007-11-15
WO2007129288A3 true WO2007129288A3 (en) 2008-05-29

Family

ID=38573300

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IE2007/000030 WO2007129288A2 (en) 2006-05-05 2007-03-06 Electronic document reformatting

Country Status (2)

Country Link
IE (1) IES20060361A2 (en)
WO (1) WO2007129288A2 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101883248A (en) * 2009-05-04 2010-11-10 沈阳爱国者网络科技有限公司 Method for acquiring video file from network
HRP20130700B1 (en) * 2013-07-23 2016-03-11 Microblink D.O.O. System for adaptive detection and extraction of structures from machine-generated documents
CN109635729B (en) * 2018-12-12 2022-02-08 厦门商集网络科技有限责任公司 Form identification method and terminal
AU2022335597A1 (en) 2021-08-27 2024-04-04 Rock Cube Holdings LLC Systems and methods for structure-based automated hyperlinking

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1343095A2 (en) * 2002-03-01 2003-09-10 Xerox Corporation Method and system for document image layout deconstruction and redisplay
US20050193327A1 (en) * 2004-02-27 2005-09-01 Hui Chao Method for determining logical components of a document

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1343095A2 (en) * 2002-03-01 2003-09-10 Xerox Corporation Method and system for document image layout deconstruction and redisplay
US20040205568A1 (en) * 2002-03-01 2004-10-14 Breuel Thomas M. Method and system for document image layout deconstruction and redisplay system
US20050193327A1 (en) * 2004-02-27 2005-09-01 Hui Chao Method for determining logical components of a document

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
CHAO ET AL.: "PDF Document Study with Page Elements and Bounding Boxes", DOCUMENT LAYOUT INTERPRETATION AND ITS APPLICATIONS (DLIA2001), 9 September 2001 (2001-09-09), Seattle, WA, US, pages 1 - 3, XP002249458 *
LOVEGROVE W S ET AL: "Document analysis of PDF files: methods, results and implications", ELECTRONIC PUBLISHING, WILEY, CHICHESTER, GB, vol. 82, no. 2-3, June 1995 (1995-06-01), pages 207 - 220, XP002357644, ISSN: 0894-3982 *

Also Published As

Publication number Publication date
WO2007129288A2 (en) 2007-11-15
IES20060361A2 (en) 2007-10-31

Similar Documents

Publication Publication Date Title
EP2278787A3 (en) Information processing apparatus and computer readable medium
EP2053522A3 (en) Conversion of a Collection of Data to a Structured, Printable and Navigable Format
EP1630688A3 (en) Document processing apparatus and method
WO2018071403A1 (en) Systems and methods for optical charater recognition for low-resolution ducuments
EP1905603A3 (en) Two-dimensional code printing apparatus and method and tangible medium
EP1197917A3 (en) Apparatus, method and computer program product for providing output image adjustment for image files
WO2009075061A1 (en) Information input device, information processing device, information input system, information processing system, two-dimensional format information server, information input method, control program, and recording medium
EP2306301A3 (en) Image processing system, image processing method and image processing program
WO2006078738A3 (en) Method and apparatus for adding signature information to electronic documents
EP2124143A3 (en) Information processing apparatus, preview method, and storage medium
EP2230593A3 (en) Job management apparatus, control method, and program
CN101174350A (en) Bill processing equipment and method
EP2302504A3 (en) Method for printing document of mobile terminal through printer, and mobile terminal therefor
WO2010014491A3 (en) Verifying an electronic document
EP1688853A3 (en) Document processing apparatus, document processing method and program
EP2075712A3 (en) Persistent selection marks
WO2010123242A3 (en) Electronic template converting method, apparatus, and recording medium
EP2345956A3 (en) Information processing apparatus, information processing apparatus control method, and storage medium
WO2007129288A3 (en) Electronic document reformatting
EP2107452A3 (en) Print controlling system
EP2107795A3 (en) Print data generating device, method to generate print data, and computer usable medium therefor
TW200717338A (en) Character recognition apparatus, character recognition method, and character data
US10686963B1 (en) Encoding and decoding digital signals in conductive ink structures
EP2657034A4 (en) Bi-color duplex printing method and device
WO2007008343A3 (en) Image element alignment for printed matter and associated methods

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07713249

Country of ref document: EP

Kind code of ref document: A2

DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)
NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 07713249

Country of ref document: EP

Kind code of ref document: A2