WO2004023330A2 - System and method for identifying line breaks - Google Patents

System and method for identifying line breaks Download PDF

Info

Publication number
WO2004023330A2
WO2004023330A2 PCT/US2003/026996 US0326996W WO2004023330A2 WO 2004023330 A2 WO2004023330 A2 WO 2004023330A2 US 0326996 W US0326996 W US 0326996W WO 2004023330 A2 WO2004023330 A2 WO 2004023330A2
Authority
WO
WIPO (PCT)
Prior art keywords
character
string
browser
document
segment
Prior art date
Application number
PCT/US2003/026996
Other languages
French (fr)
Other versions
WO2004023330A3 (en
Inventor
Anatoliy V. Tsykora
Original Assignee
Vistaprint Technologies Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Vistaprint Technologies Limited filed Critical Vistaprint Technologies Limited
Priority to EP03794524A priority Critical patent/EP1543440A2/en
Priority to AU2003262959A priority patent/AU2003262959A1/en
Publication of WO2004023330A2 publication Critical patent/WO2004023330A2/en
Publication of WO2004023330A3 publication Critical patent/WO2004023330A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • G06F40/106Display of layout of documents; Previewing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents

Definitions

  • This invention relates generally to processing electronic documents and more specifically to converting multi-line markup language documents into a prepress format.
  • Web browsers Two of the most popular of such applications are Internet Explorer from Microsoft Corporation and Netscape Navigator from Netscape Communications Corporation. These browsers are capable of displaying documents written in the common standard markup languages in use today.
  • HTML Hypertext Markup Language
  • HTML Hypertext Markup Language
  • Microsoft and Netscape browsers have incorporated support for the idea of DHTML (Dynamic HTML) into their browsers.
  • DHTML is not actually a language, but is the combination and interaction of several Web-related standards, including HTML, CSS (Cascading Style Sheets), DOM (Document Object Model), and scripting.
  • XML extensible Markup Language
  • XML is also not a language, but is a metalanguage or set of rules that allow the creation of other markup languages.
  • XML allows developers to create markup languages that are more versatile and powerful than HTML.
  • One of the standard XML languages that are now in common use is XHTML (extensible HTML).
  • the XHTML language standard was developed by adapting the HTML standard to meet the requirements of XML.
  • the DHTML standard also now accommodates XHTML. Specifications and descriptions for HTML, XML, XHTML and other Web-related open technology standards are available on the Web from the World Wide Web Consortium at http://www.w3c.org.
  • Web-based design and editing offers the potential to substantially improve the speed and efficiency of the print job preparation process, which has traditionally involved either not having the opportunity to review the actual appearance of the printed product in advance or the time consuming steps of designing the layout, creating the proofs, reviewing and correcting the proofs, incorporating corrections or modifications, re- reviewing the proofs, purchase order completion, and eventually transmission to the printing facility.
  • U.S. Patent 6,247,011 entitled “Computerized Prepress Authoring for Document Creation”.
  • U.S. Patent 6,247,011 discloses a system wherein an HTML document editing tool is downloaded to the user browser. User information is entered in a document preparation template and is displayed to the user on the user's display monitor. The HTML version of the document is uploaded to the server where it is converted to an image representing the appearance of the final printed document. The image is then downloaded back to the user's system.
  • the HTML version is for general layout and design of the document.
  • the converted image version of the document which would typically not be identical to the HTML version, is the version used for user review and approval.
  • WYSIWYG functionality is highly important to many document developers and has for some time been available to computer users in modern word processing, desktop publishing and other applications.
  • a primary reason for this difficulty with longer text strings is that the XHTML standard does not require, and the browser does not implement, any tag or other indicia showing when or where a string of characters has reached the end of a line and been wrapped by the browser to the next line.
  • the text wrapping decision and control resides in the browser, which dynamically breaks a lengthy string of text according to the width of the text area made available to the browser. If the precise location of all line breaks in the document as it was viewed and approved by the user are not known, a WYSWIG result cannot be guaranteed when the XHTML document is converted into a prepress format.
  • the invention addresses the above-identified shortcoming by providing a system and method for identifying browser-imposed line breaks in multi-line text elements.
  • a markup language document is received by a server that supplies the document to a browser that is a duplicate of the client browser used to create the document.
  • Each text element is reviewed once to identify user-inserted line breaks and a second time to identify browser-imposed line breaks.
  • the invention takes advantage of the browser's ability to provide positional information for segments of the text string.
  • the location of line breaks inserted by the browser are identified by comparing positional information for segments of the text string.
  • FIG. 1 is a block diagram of a system embodying the invention.
  • Fig. 2A shows a flow chart of a first portion of a method for identifying line breaks.
  • Fig. 2B shows a flow chart of a second portion of a method for identifying line breaks.
  • Fig. 3 is a depiction of an example text element containing wrapped text.
  • Fig. 4 is a flow chart of an alternate method for identifying browser imposed line breaks.
  • Fig. 5 is a flow chart of another alternate method for identifying browser imposed line breaks.
  • Fig. 6 is a flow chart of yet another alternate method for identifying browser imposed line breaks.
  • Fig. 1 is a diagram of a system for employing the invention.
  • Server 100 is a Web server having a universal resource locator and being adapted to permit computers having access to the Web, such as Client 120, to access server 100 and download Web pages and other materials. While shown in Fig. 1 as a single unit, it will be understood that server 100 may in fact be comprised of a plurality of individual processors or separate computers, which may be either in the same or in different geographical locations, operating cooperatively so as to provide computational and informational support to Web users. In the preferred embodiment shown in Fig.
  • Client computer 120 is a PC or similar computer having a processor, a memory, a display, and input devices, such as a keyboard and a mouse, however, it will be understood that the invention is applicable to other devices capable of displaying XHTML documents, such as palmtop computers, tablet computers, Web-enabled telephones, and personal digital assistants (PDAs).
  • PDAs personal digital assistants
  • XHTML is referred to throughout, the invention could also be usefully employed with any other markup languages where a text string does not have embedded indications of text wrapping.
  • Fig. 1 depicts server 100 and client 120 communicating via network 110, it will be understood that the invention is not limited to network applications, but can be employed with other situations where a WYSIWYG relationship between the HTML document and the final printed product is desired.
  • Server 100 includes a downloadable document creation program 101 for downloading to and execution on Client 120 in Browser 121.
  • Browser 121 is Microsoft Explorer.
  • Server 100 may also make available a plurality of downloadable document templates and images 102 to assist client 120 in creating document 122.
  • the human operator of client 120 will be controlling the detailed creation of document 122 using the keyboard and/or mouse of Client 120 and observing the state of the document on the display of client 120.
  • the operator transmits document 122 over network 110 to server 100.
  • Server 100 will typically store document 122 and related customer and system information in data storage 120, for example one or more disk drives or a disk array, for a period of time until all pre-printing conditions are met, such as completion of the ordering information, approval or clearance of the customer's form of payment, and scheduling of the printing system.
  • data storage 120 for example one or more disk drives or a disk array
  • Document 122 Prior to printing, document 122 will be retrieved from Data Storage 120 and processed to insure that the text arrangement as viewed and approved by the user is reproduced faithfully on the printed product. Because the design and operation of commercially available browsers varies between browser vendors and between different browser versions from the same vendor, Server 100 maintains access to a copy of every browser version with which Document Creation 101 is compatible. These browsers are collectively identified as Browsers 105. As will be described below, Conversion 104 reviews the text elements of the XHTML document and identifies the location of all line breaks prior to supplying the document to Rendering 103 for conversion to a prepress format. After conversion to a prepress format by Rendering 103, Server 100 can forward the converted document to Printing Facility 130 over network 110, either alone or in combination with other print jobs.
  • text in the document will be contained in one or more text elements.
  • Each element has associated parameters and attributes that define the structure and content of the element. These include both the physical features of the element, such as the height and width of the text area and the absolute or relative spatial location of the element within the document, as well as the specification of any text content of the element, such as font type, font size, font color, and any font attributes such as holding and italics.
  • a document might contain only a single text element or might contain a large number of such elements.
  • Some individual text elements in a document may be empty while other text elements may include combinations of visible characters, spaces characters, and new line control characters in any sequence.
  • a string of characters in a text element may be relatively short, occupying only a single line, or may be quite lengthy, requiring that the browser wrap the text onto multiple lines.
  • the invention described herein is particularly concerned with identifying browser-generated line breaks in multi-line text strings.
  • the browser creates and maintains an invisible bounding rectangle that conceptually surrounds the lines of text in an element.
  • This rectangle represents the area within which the characters are rendered.
  • the rectangle is sized to accommodate the characters contained in the text, including all possible character effects and decoration such as subscripting, superscripting, and underlining.
  • the browser can supply various properties describing the bounding rectangle, such as the rectangle's height and width.
  • the browser can also provide these properties for the bounding rectangle of any subset of the text in a text element. In a preferred embodiment, values for these properties are obtained using the TextRange object.
  • FIG. 2A A preferred method for implementing the invention is shown in Figs. 2A and 2B.
  • the method is implemented in two steps: scanning the text elements in a document to locate all user controlled line breaks and scanning the text elements to locate all browser controlled line breaks.
  • the information designating the location of both types of line breaks is maintained in a separate data structure.
  • Fig. 2A depicts a method for identifying user-inserted lines breaks.
  • the document description and information describing the specific browser in which the document was created are retrieved at step 201. If elements are available at step 202, an element is selected at step 203. All characters in the element are sequentially checked at steps 204-206.
  • the next element in the document is selected until all elements in the document have been examined.
  • the next character in the text string is flagged as being the start of a new line.
  • the Fig. 2A method is performed at the time the XHTML document is parsed.
  • the method just described in connection with Fig. 2A takes the contents of a text element and identifies the location of any user-inserted line breaks. When these breaks, if any, have been identified, the user-inserted break information can be used to divide all of the text content of the element into strings of text characters that are between the user-inserted line breaks. Since one or all of these text strings may be long enough to require the browser to wrap the characters onto multiple lines, the method of Fig. 2B is directed to identifying any browser- imposed line breaks in these strings.
  • the document is selected at step 210 and an element from the document is selected at step 213.
  • the element contains a text string at step 214
  • the TextRange boundingHeight value for the bounding rectangle containing only the first character of the text string is initially requested.
  • the boundingHeight value received at step 215 is saved at step 216.
  • that character is added to the previous character and the TextRange boundingHeight value for that combined text segment is requested.
  • the new height value is then compared to the saved height value at step 219. Steps 217-219 will be executed for each sequential character in the text string. As each character is added to the preceding characters, the boundingHeight value for the new text segment is obtained at step 218 and compared at step 219 to the saved value.
  • the height value obtained at step 218 will not change if the newly added character was rendered by the browser on the same line as the preceding character.
  • the height value will change, however, when a character displayed on another line is added to the test segment. Therefore, when the new height is unequal to the saved height at step 219, it indicates that the character that was just added is at the beginning of a new line. In that event, the character is flagged at step 220 as starting a new line and the new height is saved at step 216.
  • This new height will now be used to check for the next new line, if any, in the text string. Processing will continue in this fashion until the entire text string has been reviewed to identify all browser- imposed line breaks. Similarly, processing will continue until all text lines in all elements in the document have been processed. At the conclusion of the method, all line breaks, both user controlled and browser controlled, have been identified and the content of all character text lines is known.
  • Fig. 3 shows an example of a text element as viewed by the user.
  • the method can be applied to a document containing any number of text elements that contain any number and combination of characters, words, sentences and paragraphs.
  • the user has entered text into a text area that has a width indicated by w.
  • the user has entered the words "one two three four five six" into this area and, since there was insufficient horizontal space available to accommodate all of these words on a single line, the browser has wrapped the text onto three lines as shown.
  • XHTML document description as received at the server, however, contains the text string and specifies the width w, but does not indicate exactly where the text was divided.
  • the document is retrieved at step 210 and the element is selected at step 213.
  • the TextRange boundingHeight value of the text segment containing the first character, in this case the letter "o" is requested from the browser at step 215.
  • the value returned, hi is saved at step 216.
  • the boundingHeight value for the text segment "on” is requested and compared at step 219 with the saved value. The value will again be equal to hi, so the next character is added to the text segment and the boundingHeight value for "one” is requested. This process will continue until the boundingHeight for the text string segment "one two t" is requested.
  • the boundingHeight value returned by the browser for this segment will be h2.
  • the unequal comparison at step 219 will cause the "t" character that was just added to be flagged as the beginning of a new line.
  • the value of h2 will be stored as the new saved height at step 216.
  • the height values will again remain the same until the height of the segment "one two three four f" is requested. This will return a height of h3, indicating that another line has started.
  • the "f ' in "five” will be flagged as the start of a new line. Processing will continue through the last character and, since no other elements are available in this example, processing will terminate. [0034] All lines of characters in the document and the location of all user controlled and browser controlled line breaks have now been identified.
  • the XHTML document and the information about line breaks can now be provided to Rendering 103 for converting the document to a prepress format.
  • knowing the location of the line breaks makes it possible to obtain additional useful information about the lines of text. For example, since the content of each line is known, the width of each line of text could be determined by submitting the text segment to the browser and requesting the TextRange bounding Width value for that segment. Knowing the widths of the individual lines of text could then be used, if desired, to adjust intercharacter spacing on one or more lines of text in the document prior to printing to enhance the appearance of the final printed product.
  • the document and the identification of all line breaks are then forwarded to Rendering Program 103 for subsequent processing into a suitable prepress format.
  • subsequent processing includes rendering the text portion of the document with a commercial word processing program, such as Microsoft Word from Microsoft Corporation.
  • the results from Microsoft Word are forwarded, via the "print" command, to a commercially available program, such as Acrobat Distiller from Adobe Systems Inc., for converting the word processing format into a high-resolution output format such as PDF.
  • a commercially available product such as
  • Fig. 2B could be modified to work in the other direction. That is, referring to Fig. 4, the TextRange height of the entire text string would be obtained at step 406 and saved at step 407. Characters would then be individually removed at step 408. After each character is removed, the height value for the remaining segment could be compared at step 411 with the height value before the character was removed. When the values become unequal, it indicates that the last character that was removed was at the beginning of a line. That character would be flagged at step 412. [0037] As another alternate implementation, another property of the TextRange object could be used to accomplish a similar result. Fig.
  • Fig. 5 shows a method using the TextRange boundingLeft property, which returns the value of the left coordinate of the bounding rectangle.
  • Fig. 5 is directed at a language such as English that has a normal direction of character progression from right to left. It will be understood that by changing step 510, the method of Fig 5 can be readily adapted for use with those languages that have a direction of character progression from right to left.
  • the boundingLeft value for the first character in the string is requested and saved at step 507. If the string contains more than a single character, the boundingLeft value for the next sequential character is obtained at step 509 and compared at step 510. Each succeeding character rendered by the browser on the same line will have a larger boundingLeft value than the preceding character. However, when a character in the string is encountered that has been wrapped to another line, the new boundingLeft value for that character will be less than the preceding character's value. In this event, the new character is flagged at step 511 as the beginning of a new line.
  • Fig. 6 shows a method that is similar to Fig. 5, but operates in the reverse direction by comparing the TextRange boundingLeft property from the final character in the string back to the first character.
  • the boundingLeft value will decrease for characters rendered on the same line until the first character is compared with the last character on the preceding line.
  • the saved value of boundingLeft which represents the value for the first character on one line, will be less than the value for the preceding character, which is the last character on the preceding line. This will cause the prior character to be flagged at step 611 as the start of a new line.
  • the method of Fig. 6 can be readily adapted to languages having a direction of character progression from right to left.

Abstract

Method, system, and computer code for preparing markup language documents containing multiline text elements for WYSIWYG printing. The document is rendered in a prepress server system by a duplicate of the browser that was used to prepare the document in the client system. User-imposed line breaks are identified by reviewing text elements for break characters. Browser-imposed line breaks are identified by comparing spatial location information from the browser for each sequential pair of characters in a text element. The collective line break information is used to ensure that the line breaks viewed by the user are maintained when the document is converted to a prepress system.

Description

SYSTEM AND METHOD FOR IDENTIFYING LINE BREAKS
Field of the Invention
[0001] This invention relates generally to processing electronic documents and more specifically to converting multi-line markup language documents into a prepress format.
Background of the Invention
[0002] Software applications designed to locate and display World Wide Web ("Web") pages are in widespread use. Commonly referred to as Web browsers, two of the most popular of such applications are Internet Explorer from Microsoft Corporation and Netscape Navigator from Netscape Communications Corporation. These browsers are capable of displaying documents written in the common standard markup languages in use today.
[0003] HTML (Hypertext Markup Language) remains a very common and popular authoring and presentation language for creating Web materials. The HTML standard, however, was originally based on the idea that an HTML document would remain static after being rendered in the browser. To allow for elements in an HTML page to be controllable after the page is rendered, Microsoft and Netscape browsers have incorporated support for the idea of DHTML (Dynamic HTML) into their browsers. DHTML is not actually a language, but is the combination and interaction of several Web-related standards, including HTML, CSS (Cascading Style Sheets), DOM (Document Object Model), and scripting.
[0004] Another development was the creation of the XML (extensible Markup Language) standard. XML is also not a language, but is a metalanguage or set of rules that allow the creation of other markup languages. XML allows developers to create markup languages that are more versatile and powerful than HTML. One of the standard XML languages that are now in common use is XHTML (extensible HTML). The XHTML language standard was developed by adapting the HTML standard to meet the requirements of XML. The DHTML standard also now accommodates XHTML. Specifications and descriptions for HTML, XML, XHTML and other Web-related open technology standards are available on the Web from the World Wide Web Consortium at http://www.w3c.org. [0005] Many enterprises have recognized the commercial opportunities presented by the ability to create and edit documents in the browser and have undertaken to capitalize on it by developing products and services supported or enabled by software applications running in the browser. To facilitate and promote the development of browser-compatible software applications, browser vendors typically implement and make readily available an API
(application program interface) of standard routines, protocols and tools for use by application developers.
[0006] One of the many applications that has emerged for this technology is Web-based document preparation allowing a user, using the user's browser, to design, edit and proof a customized WYSIWYG (what you see is what you get) document, prepare an order for printing of the document, and transmit the document to a remote server for printing on an appropriate printing system. The term "document" as used herein refers to an electronic file intended for eventual printing by any known printing process on any printable medium, including, but not limited to, paper, cloth, glass, plastic, rubber, or wood.
[0007] For many types of products, Web-based design and editing offers the potential to substantially improve the speed and efficiency of the print job preparation process, which has traditionally involved either not having the opportunity to review the actual appearance of the printed product in advance or the time consuming steps of designing the layout, creating the proofs, reviewing and correcting the proofs, incorporating corrections or modifications, re- reviewing the proofs, purchase order completion, and eventually transmission to the printing facility.
[0008] One prior art system for performing browser-based document creation and editing is disclosed in U.S. Patent 6,247,011 entitled "Computerized Prepress Authoring for Document Creation". U.S. Patent 6,247,011 discloses a system wherein an HTML document editing tool is downloaded to the user browser. User information is entered in a document preparation template and is displayed to the user on the user's display monitor. The HTML version of the document is uploaded to the server where it is converted to an image representing the appearance of the final printed document. The image is then downloaded back to the user's system. In the 6,247,011 system, the HTML version is for general layout and design of the document. The converted image version of the document, which would typically not be identical to the HTML version, is the version used for user review and approval. [0009] Another system for browser-based document creation is described in co-pending and commonly owned application Serial No. 09/557,571 entitled "Managing Print Jobs", which is hereby incorporated by reference. Application No. 09/557,571 discloses a document preparation system comprising a downloadable editing application that allows a user to design and proof WYSIWYG documents in XHTML in the user's browser and server-side applications that convert the XHTML version of the document received from the browser to a prepress version in preparation for printing on a high-resolution printing device. This application does not expressly discuss the problems related to generating a WYSIWYG prepress version of a markup language document contaimng one or more multi-line text areas.
[0010] WYSIWYG functionality is highly important to many document developers and has for some time been available to computer users in modern word processing, desktop publishing and other applications. The successful application of browser-based XHTML document editing to the preparation of WYSIWYG materials containing multi-line text fields, however, has proved to be difficult. This has made browser-based editing tools generally unsuitable for the WYSIWYG creation of printed products having text lines longer than a few words.
[0011] A primary reason for this difficulty with longer text strings is that the XHTML standard does not require, and the browser does not implement, any tag or other indicia showing when or where a string of characters has reached the end of a line and been wrapped by the browser to the next line. The text wrapping decision and control resides in the browser, which dynamically breaks a lengthy string of text according to the width of the text area made available to the browser. If the precise location of all line breaks in the document as it was viewed and approved by the user are not known, a WYSWIG result cannot be guaranteed when the XHTML document is converted into a prepress format.
[0012] Therefore, in the prior art, the typical approaches taken in connection with proofing documents created in the browser were to either upload the document to a server for conversion to a bitmap image which was then downloaded back to the user for review or to simply notify the user that the final printed version might differ from the version viewed on the user's screen. The former approach can generate repetitive network traffic to and from the user's computer and server conversion activity while the document is being prepared. The latter is unsatisfactory to the user because the user cannot verify the appearance of the finished product. Both approaches can lead to user frustration and dissatisfaction.
Summary of the Invention
[0013] The invention addresses the above-identified shortcoming by providing a system and method for identifying browser-imposed line breaks in multi-line text elements. A markup language document is received by a server that supplies the document to a browser that is a duplicate of the client browser used to create the document. Each text element is reviewed once to identify user-inserted line breaks and a second time to identify browser-imposed line breaks.
[0014] The invention takes advantage of the browser's ability to provide positional information for segments of the text string. The location of line breaks inserted by the browser are identified by comparing positional information for segments of the text string.
Brief Description of the Drawings
[0015] Fig. 1 is a block diagram of a system embodying the invention.
[0016] Fig. 2A shows a flow chart of a first portion of a method for identifying line breaks.
[0017] Fig. 2B shows a flow chart of a second portion of a method for identifying line breaks.
[0018] Fig. 3 is a depiction of an example text element containing wrapped text.
[0019] Fig. 4 is a flow chart of an alternate method for identifying browser imposed line breaks.
[0020] Fig. 5 is a flow chart of another alternate method for identifying browser imposed line breaks.
[0021] Fig. 6 is a flow chart of yet another alternate method for identifying browser imposed line breaks.
Detailed Description of the Preferred Embodiment
[0022] Fig. 1 is a diagram of a system for employing the invention. Server 100 is a Web server having a universal resource locator and being adapted to permit computers having access to the Web, such as Client 120, to access server 100 and download Web pages and other materials. While shown in Fig. 1 as a single unit, it will be understood that server 100 may in fact be comprised of a plurality of individual processors or separate computers, which may be either in the same or in different geographical locations, operating cooperatively so as to provide computational and informational support to Web users. In the preferred embodiment shown in Fig. 1, Client computer 120 is a PC or similar computer having a processor, a memory, a display, and input devices, such as a keyboard and a mouse, however, it will be understood that the invention is applicable to other devices capable of displaying XHTML documents, such as palmtop computers, tablet computers, Web-enabled telephones, and personal digital assistants (PDAs). Similarly, while XHTML is referred to throughout, the invention could also be usefully employed with any other markup languages where a text string does not have embedded indications of text wrapping. While Fig. 1 depicts server 100 and client 120 communicating via network 110, it will be understood that the invention is not limited to network applications, but can be employed with other situations where a WYSIWYG relationship between the HTML document and the final printed product is desired.
[0023] Server 100 includes a downloadable document creation program 101 for downloading to and execution on Client 120 in Browser 121. In a preferred embodiment, Browser 121 is Microsoft Explorer. Server 100 may also make available a plurality of downloadable document templates and images 102 to assist client 120 in creating document 122. The human operator of client 120 will be controlling the detailed creation of document 122 using the keyboard and/or mouse of Client 120 and observing the state of the document on the display of client 120. When document 122 is satisfactory to the operator, the operator transmits document 122 over network 110 to server 100. Server 100 will typically store document 122 and related customer and system information in data storage 120, for example one or more disk drives or a disk array, for a period of time until all pre-printing conditions are met, such as completion of the ordering information, approval or clearance of the customer's form of payment, and scheduling of the printing system.
[0024] Prior to printing, document 122 will be retrieved from Data Storage 120 and processed to insure that the text arrangement as viewed and approved by the user is reproduced faithfully on the printed product. Because the design and operation of commercially available browsers varies between browser vendors and between different browser versions from the same vendor, Server 100 maintains access to a copy of every browser version with which Document Creation 101 is compatible. These browsers are collectively identified as Browsers 105. As will be described below, Conversion 104 reviews the text elements of the XHTML document and identifies the location of all line breaks prior to supplying the document to Rendering 103 for conversion to a prepress format. After conversion to a prepress format by Rendering 103, Server 100 can forward the converted document to Printing Facility 130 over network 110, either alone or in combination with other print jobs.
[0025] In accordance with XHTML standards, text in the document will be contained in one or more text elements. Each element has associated parameters and attributes that define the structure and content of the element. These include both the physical features of the element, such as the height and width of the text area and the absolute or relative spatial location of the element within the document, as well as the specification of any text content of the element, such as font type, font size, font color, and any font attributes such as holding and italics. A document might contain only a single text element or might contain a large number of such elements. Some individual text elements in a document may be empty while other text elements may include combinations of visible characters, spaces characters, and new line control characters in any sequence. A string of characters in a text element may be relatively short, occupying only a single line, or may be quite lengthy, requiring that the browser wrap the text onto multiple lines. The invention described herein is particularly concerned with identifying browser-generated line breaks in multi-line text strings.
[0026] The browser creates and maintains an invisible bounding rectangle that conceptually surrounds the lines of text in an element. This rectangle represents the area within which the characters are rendered. The rectangle is sized to accommodate the characters contained in the text, including all possible character effects and decoration such as subscripting, superscripting, and underlining. The browser can supply various properties describing the bounding rectangle, such as the rectangle's height and width. The browser can also provide these properties for the bounding rectangle of any subset of the text in a text element. In a preferred embodiment, values for these properties are obtained using the TextRange object.
[0027] A preferred method for implementing the invention is shown in Figs. 2A and 2B. In a preferred embodiment, the method is implemented in two steps: scanning the text elements in a document to locate all user controlled line breaks and scanning the text elements to locate all browser controlled line breaks. In a preferred embodiment, the information designating the location of both types of line breaks is maintained in a separate data structure. [0028] Fig. 2A depicts a method for identifying user-inserted lines breaks. At step 201, the document description and information describing the specific browser in which the document was created are retrieved at step 201. If elements are available at step 202, an element is selected at step 203. All characters in the element are sequentially checked at steps 204-206. When no additional characters are available at step 204, the next element in the document is selected until all elements in the document have been examined. Whenever a break or paragraph character is detected at step 205, the next character in the text string is flagged as being the start of a new line. In a preferred implementation, the Fig. 2A method is performed at the time the XHTML document is parsed.
[0029] The method just described in connection with Fig. 2A takes the contents of a text element and identifies the location of any user-inserted line breaks. When these breaks, if any, have been identified, the user-inserted break information can be used to divide all of the text content of the element into strings of text characters that are between the user-inserted line breaks. Since one or all of these text strings may be long enough to require the browser to wrap the characters onto multiple lines, the method of Fig. 2B is directed to identifying any browser- imposed line breaks in these strings.
[0030] Referring to Fig. 2B, the document is selected at step 210 and an element from the document is selected at step 213. If the element contains a text string at step 214, the TextRange boundingHeight value for the bounding rectangle containing only the first character of the text string is initially requested. The boundingHeight value received at step 215 is saved at step 216. If there is another character in the text string at step 217, that character is added to the previous character and the TextRange boundingHeight value for that combined text segment is requested. The new height value is then compared to the saved height value at step 219. Steps 217-219 will be executed for each sequential character in the text string. As each character is added to the preceding characters, the boundingHeight value for the new text segment is obtained at step 218 and compared at step 219 to the saved value.
[0031] The height value obtained at step 218 will not change if the newly added character was rendered by the browser on the same line as the preceding character. The height value will change, however, when a character displayed on another line is added to the test segment. Therefore, when the new height is unequal to the saved height at step 219, it indicates that the character that was just added is at the beginning of a new line. In that event, the character is flagged at step 220 as starting a new line and the new height is saved at step 216. This new height will now be used to check for the next new line, if any, in the text string. Processing will continue in this fashion until the entire text string has been reviewed to identify all browser- imposed line breaks. Similarly, processing will continue until all text lines in all elements in the document have been processed. At the conclusion of the method, all line breaks, both user controlled and browser controlled, have been identified and the content of all character text lines is known.
[0032] To illustrate the method of Fig. 2B, Fig. 3 shows an example of a text element as viewed by the user. As can be readily understood, the method can be applied to a document containing any number of text elements that contain any number and combination of characters, words, sentences and paragraphs. In this example, the user has entered text into a text area that has a width indicated by w. The user has entered the words "one two three four five six" into this area and, since there was insufficient horizontal space available to accommodate all of these words on a single line, the browser has wrapped the text onto three lines as shown. The
XHTML document description as received at the server, however, contains the text string and specifies the width w, but does not indicate exactly where the text was divided.
[0033] Referring again to Fig. 2B, as applied to the example of Fig. 3, the document is retrieved at step 210 and the element is selected at step 213. The TextRange boundingHeight value of the text segment containing the first character, in this case the letter "o", is requested from the browser at step 215. The value returned, hi, is saved at step 216. At step 218, the boundingHeight value for the text segment "on" is requested and compared at step 219 with the saved value. The value will again be equal to hi, so the next character is added to the text segment and the boundingHeight value for "one" is requested. This process will continue until the boundingHeight for the text string segment "one two t" is requested. The boundingHeight value returned by the browser for this segment will be h2. The unequal comparison at step 219 will cause the "t" character that was just added to be flagged as the beginning of a new line. The value of h2 will be stored as the new saved height at step 216. The height values will again remain the same until the height of the segment "one two three four f" is requested. This will return a height of h3, indicating that another line has started. The "f ' in "five" will be flagged as the start of a new line. Processing will continue through the last character and, since no other elements are available in this example, processing will terminate. [0034] All lines of characters in the document and the location of all user controlled and browser controlled line breaks have now been identified. The XHTML document and the information about line breaks can now be provided to Rendering 103 for converting the document to a prepress format. In addition, knowing the location of the line breaks makes it possible to obtain additional useful information about the lines of text. For example, since the content of each line is known, the width of each line of text could be determined by submitting the text segment to the browser and requesting the TextRange bounding Width value for that segment. Knowing the widths of the individual lines of text could then be used, if desired, to adjust intercharacter spacing on one or more lines of text in the document prior to printing to enhance the appearance of the final printed product.
[0035] The document and the identification of all line breaks are then forwarded to Rendering Program 103 for subsequent processing into a suitable prepress format. In a preferred embodiment for preparing a document to be printed on paper on a high-resolution printing press, subsequent processing includes rendering the text portion of the document with a commercial word processing program, such as Microsoft Word from Microsoft Corporation. The results from Microsoft Word are forwarded, via the "print" command, to a commercially available program, such as Acrobat Distiller from Adobe Systems Inc., for converting the word processing format into a high-resolution output format such as PDF. As a final step the non-text elements of the document and the text PDF are merged, using a commercially available product such as
PDFlib from PDFlib GmbH, into a prepress document format. As will be appreciated by those of skill in the art, other commercially available software tools could also be used to create the PDF file and the specific prepress processing steps would vary depending on the specific medium onto which the document will be printed.
[0036] Various alternate implementations of the invention can be employed within the spirit and scope of the invention. For example, the method shown in Fig. 2B could be modified to work in the other direction. That is, referring to Fig. 4, the TextRange height of the entire text string would be obtained at step 406 and saved at step 407. Characters would then be individually removed at step 408. After each character is removed, the height value for the remaining segment could be compared at step 411 with the height value before the character was removed. When the values become unequal, it indicates that the last character that was removed was at the beginning of a line. That character would be flagged at step 412. [0037] As another alternate implementation, another property of the TextRange object could be used to accomplish a similar result. Fig. 5 shows a method using the TextRange boundingLeft property, which returns the value of the left coordinate of the bounding rectangle. Fig. 5 is directed at a language such as English that has a normal direction of character progression from right to left. It will be understood that by changing step 510, the method of Fig 5 can be readily adapted for use with those languages that have a direction of character progression from right to left.
[0038] At step 506 the boundingLeft value for the first character in the string is requested and saved at step 507. If the string contains more than a single character, the boundingLeft value for the next sequential character is obtained at step 509 and compared at step 510. Each succeeding character rendered by the browser on the same line will have a larger boundingLeft value than the preceding character. However, when a character in the string is encountered that has been wrapped to another line, the new boundingLeft value for that character will be less than the preceding character's value. In this event, the new character is flagged at step 511 as the beginning of a new line.
[0039] As yet another alternate implementation, Fig. 6 shows a method that is similar to Fig. 5, but operates in the reverse direction by comparing the TextRange boundingLeft property from the final character in the string back to the first character. With this approach, the boundingLeft value will decrease for characters rendered on the same line until the first character is compared with the last character on the preceding line. In this case, at step 610 the saved value of boundingLeft, which represents the value for the first character on one line, will be less than the value for the preceding character, which is the last character on the preceding line. This will cause the prior character to be flagged at step 611 as the start of a new line. As with Fig. 5 above, the method of Fig. 6 can be readily adapted to languages having a direction of character progression from right to left.
[0040] While various embodiments of the invention have been shown and described, the description is to be considered as illustrative rather than restrictive. The scope of the invention is as indicated in the following claims and all equivalent methods and apparatus.

Claims

What is claimed is:
1) A method for identifying browser-imposed line breaks in a string of sequential characters in a markup language text element, the method comprising the steps of: a) for each character in the string, obtaining a value from the browser indicating the vertical position of that character as rendered by the browser, b) for each pair of sequential characters in the string, comparing the relative vertical position of the two characters in the pair, and c) if the vertical positions of the two characters are not equal, identifying the character in the pair having the relatively lower vertical position as starting a new line.
2) A method for identifying browser-imposed line breaks in a string of sequential characters in a text element, the method comprising the steps of: a) for each character in the string, obtaining a value from the browser indicating the horizontal position of that character as rendered by the browser, b) for each pair of sequential characters in the string, comparing the relative horizontal position of the two characters in the pair, and c) if the horizontal position of the second character in the pair is not farther in the direction of normal character progression than the horizontal position of the first character in the pair, identifying the second character in the pair as starting a new line.
3) A method for preparing a markup language document containing multi-line text elements for printing, the method comprising the steps of
Conducting a first review of each text element in the document to identify line break characters inserted by the document preparer,
Conducting a second review of each text element in the document to identify the location of line breaks imposed by the browser,
Forwarding the document and the identified line break information from the first and second reviews to a rendering program for conversion from the markup language to a prepress format.
4) A markup language document conversion system comprising
A server configured to receive a markup language document from a remote client, A browser program running on the server, the browser program being substantially a duplicate of the browser program used to create the document, A conversion program, running on the server and communicating with the browser program, for identifying all user-inserted line breaks and all browser imposed line breaks in the document, and
A rendering program, running on the server and communicating with the conversion program, for receiving the document and the line break information from the conversion program and converting the document to a prepress format.
5) A method for identifying browser imposed line breaks in a character string containing a plurality of characters, the method comprising the steps of a) defining a first string segment that contains only the initial character in the text string, b) defining a second string segment that contains the first string segment plus the next character in the string, c) comparing the height of the bounding rectangle of the first string segment with the height of the bounding rectangle of the second string segment, d) if the heights are unequal, storing an indication that the character added in step b is at the beginning of a new line, e) setting the first segment equal to the second segment repeating steps b-e until the first segment is set equal to the entire text string.
6) A method for identifying browser imposed line breaks in a character string containing a plurality of characters, the method comprising the steps of: a) defining a first string segment that contains the text string, b) defining a second string segment that contains the first string segment minus the final character in the first string segment, c) comparing the height of the bounding rectangle of the first string segment with the height of the bounding rectangle of the second string segment, d) if the heights are unequal, storing an indication that the character dropped in step b is at the beginning of a new line, e) setting the first segment equal to the second segment repeating steps b-e until the first segment contains only a single character.
PCT/US2003/026996 2002-09-05 2003-08-26 System and method for identifying line breaks WO2004023330A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP03794524A EP1543440A2 (en) 2002-09-05 2003-08-26 System and method for identifying line breaks
AU2003262959A AU2003262959A1 (en) 2002-09-05 2003-08-26 System and method for identifying line breaks

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/235,084 US7020838B2 (en) 2002-09-05 2002-09-05 System and method for identifying line breaks
US10/235,084 2002-09-05

Publications (2)

Publication Number Publication Date
WO2004023330A2 true WO2004023330A2 (en) 2004-03-18
WO2004023330A3 WO2004023330A3 (en) 2004-06-24

Family

ID=31977507

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2003/026996 WO2004023330A2 (en) 2002-09-05 2003-08-26 System and method for identifying line breaks

Country Status (4)

Country Link
US (3) US7020838B2 (en)
EP (2) EP2482198B1 (en)
AU (1) AU2003262959A1 (en)
WO (1) WO2004023330A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2164000A1 (en) * 2008-09-15 2010-03-17 Deutsche Post AG Method for converting text information into a document in pdf format
WO2014041365A1 (en) * 2012-09-15 2014-03-20 Purple Secure Systems Ltd Improving readability of text

Families Citing this family (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6633666B2 (en) 1998-08-28 2003-10-14 Quark, Inc. Process and system for defining and visually depicting colors from the components of arbitrary color models
US7116843B1 (en) * 2000-07-24 2006-10-03 Quark, Inc. Method and system using non-uniform image blocks for rapid interactive viewing of digital images over a network
US20060212805A1 (en) * 2002-04-10 2006-09-21 Quark, Inc. Systems and methods for remote access media production
US20070150358A1 (en) * 2004-08-05 2007-06-28 Quark, Inc. Systems and methods for distributing media production
US20070157080A1 (en) * 2004-08-05 2007-07-05 Quark, Inc. Systems and methods for re-purposing content objects for media production
US20070139661A1 (en) * 2004-08-05 2007-06-21 Quark, Inc. Systems and methods for producing media products
US20070143750A1 (en) * 2004-08-05 2007-06-21 Quark, Inc. Systems and methods for multi-format media production
US20070094636A1 (en) * 2004-08-05 2007-04-26 Quark, Inc. Systems and methods for facilitating media production
US20040193520A1 (en) * 2003-03-27 2004-09-30 Lacomb Christina Automated understanding and decomposition of table-structured electronic documents
US20040252333A1 (en) * 2003-06-16 2004-12-16 Blume Leo Robert Mobile communication device printing
US8223355B2 (en) 2003-06-16 2012-07-17 Hewlett-Packard Development Company, L.P. Cellular telephone protocol adaptive printing
US20060033961A1 (en) * 2004-08-13 2006-02-16 Quark, Inc. Systems and methods for small element trapping
US20060033971A1 (en) * 2004-08-13 2006-02-16 Quark, Inc. Automated trapping system for desktop publishing
US20060087697A1 (en) * 2004-08-13 2006-04-27 Quark, Inc. Systems and methods for recursive trapping
US20060033960A1 (en) * 2004-08-13 2006-02-16 Quark, Inc. Systems and methods for ink selection in the trapping zone
US20060087698A1 (en) * 2004-08-13 2006-04-27 Quark, Inc. Systems and methods for variable trapping
US20060236231A1 (en) * 2004-11-02 2006-10-19 Quark, Inc. Systems and methods for developing dynamic media productions
JP4759378B2 (en) * 2004-12-17 2011-08-31 キヤノン株式会社 Information processing apparatus, information processing method, and control program
US7770111B2 (en) * 2004-12-20 2010-08-03 Microsoft Corporation Method and computer readable medium for optimized paragraph layout
US20070089624A1 (en) * 2005-03-30 2007-04-26 Quark, Inc. Systems and methods for integrated extended process media productions
US20060227347A1 (en) * 2005-03-30 2006-10-12 Quark, Inc. Systems and methods for importing color environment information
WO2007021254A2 (en) * 2005-08-09 2007-02-22 Quark, Inc. Systems and methods for integrating from data sources to data target locations
US7596752B2 (en) * 2005-08-15 2009-09-29 Microsoft Corporation Delaying optimal paragraph layout during editing
US7624343B2 (en) * 2005-09-16 2009-11-24 Microsoft Corporation Performance optimization for text layout processing
US7779427B2 (en) * 2006-01-18 2010-08-17 Microsoft Corporation Automated application configuration using device-provided data
US7743318B2 (en) * 2006-02-27 2010-06-22 Microsoft Corporation Order independent batched updates on a text buffer
KR100856405B1 (en) * 2006-04-13 2008-09-04 삼성전자주식회사 Method and system for outputting personal information management data, and a device thereby
US20080077555A1 (en) * 2006-09-22 2008-03-27 Miller Frank W Variable data workflow system and method
US8234571B1 (en) * 2006-12-15 2012-07-31 Adobe Systems Incorporated Predictive text composition
US7949951B1 (en) * 2007-01-17 2011-05-24 Adobe Systems Incorporated Selectively limiting paragraph recomposition
WO2011034524A1 (en) * 2009-09-15 2011-03-24 Hewlett-Packard Development Company, Lp Method for locating line breaks in text
KR20110088235A (en) * 2010-01-28 2011-08-03 삼성전자주식회사 Text display method and apparatus
US20130159889A1 (en) * 2010-07-07 2013-06-20 Li-Wei Zheng Obtaining Rendering Co-ordinates Of Visible Text Elements
JP2012048457A (en) * 2010-08-26 2012-03-08 Canon Inc Print server device, printer, information processing method, and program
US9800941B2 (en) * 2011-01-03 2017-10-24 Curt Evans Text-synchronized media utilization and manipulation for transcripts
US20130334300A1 (en) 2011-01-03 2013-12-19 Curt Evans Text-synchronized media utilization and manipulation based on an embedded barcode
JP2013089130A (en) * 2011-10-20 2013-05-13 Sony Corp Information processing apparatus, information processing method, program, and recording medium
JP6066108B2 (en) * 2014-04-16 2017-01-25 コニカミノルタ株式会社 Electronic document generation system and program
JP6805265B2 (en) 2016-03-09 2020-12-23 ピーティーアイ マーケティング テクノロジーズ インコーポレイテッド Garnish imposition sorting system
US10579707B2 (en) * 2017-12-29 2020-03-03 Konica Minolta Laboratory U.S.A., Inc. Method for inferring blocks of text in electronic documents

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6247011B1 (en) * 1997-12-02 2001-06-12 Digital-Net, Inc. Computerized prepress authoring for document creation

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5778403A (en) * 1994-09-01 1998-07-07 Microsoft Corporation Method for displaying text on a rendering device to accurately represent the text as if displayed on a target device
US6003048A (en) * 1995-04-27 1999-12-14 International Business Machines Corporation System and method for converting a coordinate based document to a markup language (ML) based document
US5897644A (en) * 1996-09-25 1999-04-27 Sun Microsystems, Inc. Methods and apparatus for fixed canvas presentations detecting canvas specifications including aspect ratio specifications within HTML data streams
US5978819A (en) * 1997-08-12 1999-11-02 International Business Machines Corporation Automatically converting preformatted text into reflowable text for TV viewing
US6247018B1 (en) * 1998-04-16 2001-06-12 Platinum Technology Ip, Inc. Method for processing a file to generate a database
US6567849B2 (en) * 1998-08-17 2003-05-20 International Business Machines Corporation System and method for configuring and administering multiple instances of web servers
US6493749B2 (en) * 1998-08-17 2002-12-10 International Business Machines Corporation System and method for an administration server
US6112246A (en) * 1998-10-22 2000-08-29 Horbal; Mark T. System and method for accessing information from a remote device and providing the information to a client workstation
US6510441B1 (en) * 1998-12-11 2003-01-21 Adobe Systems Incorporated Optimal line break determination
US6766495B1 (en) * 1999-09-27 2004-07-20 International Business Machines Corporation Apparatus and method for improving line-to-line word positioning of text for easier reading
US6556217B1 (en) * 2000-06-01 2003-04-29 Nokia Corporation System and method for content adaptation and pagination based on terminal capabilities
US20030009694A1 (en) * 2001-02-25 2003-01-09 Storymail, Inc. Hardware architecture, operating system and network transport neutral system, method and computer program product for secure communications and messaging
FR2814562A1 (en) * 2000-09-22 2002-03-29 Cytale METHOD FOR DISPLAYING A DIGITAL DOCUMENT, ELECTRONIC DEVICE, SOFTWARE, DIGITAL PUBLICATION, DATA MEDIUM, AND DOWNLOADING METHOD
AU2002305392A1 (en) * 2001-05-02 2002-11-11 Bitstream, Inc. Methods, systems, and programming for producing and displaying subpixel-optimized images and digital content including such images
US20030014445A1 (en) * 2001-07-13 2003-01-16 Dave Formanek Document reflowing technique
US20040205568A1 (en) * 2002-03-01 2004-10-14 Breuel Thomas M. Method and system for document image layout deconstruction and redisplay system
US20040105568A1 (en) * 2002-12-03 2004-06-03 Po-Hsiung Lee Speaker with enhanced magnetic flux

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6247011B1 (en) * 1997-12-02 2001-06-12 Digital-Net, Inc. Computerized prepress authoring for document creation

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP1543440A2 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2164000A1 (en) * 2008-09-15 2010-03-17 Deutsche Post AG Method for converting text information into a document in pdf format
WO2014041365A1 (en) * 2012-09-15 2014-03-20 Purple Secure Systems Ltd Improving readability of text

Also Published As

Publication number Publication date
US20040049735A1 (en) 2004-03-11
EP1543440A2 (en) 2005-06-22
AU2003262959A1 (en) 2004-03-29
WO2004023330A3 (en) 2004-06-24
US7020838B2 (en) 2006-03-28
US20120089900A1 (en) 2012-04-12
EP2482198A1 (en) 2012-08-01
EP2482198B1 (en) 2013-06-26
US20060129923A1 (en) 2006-06-15
US7949942B2 (en) 2011-05-24

Similar Documents

Publication Publication Date Title
US7949942B2 (en) System and method for identifying line breaks
JP4344693B2 (en) System and method for browser document editing
EP1597680B1 (en) Markup language cut-and-paste
US6560621B2 (en) World wide web formatting for program output through print function
US20050235202A1 (en) Automatic graphical layout printing system utilizing parsing and merging of data
US7246305B2 (en) Method and system for previewing and printing customized forms
JP4497432B2 (en) How to draw glyphs using layout service library
US6993209B1 (en) Low resolution-to-high resolution image correlation
US20030160819A1 (en) Interactive print system and method
US20020111963A1 (en) Method, system, and program for preprocessing a document to render on an output device
US20040253568A1 (en) Method of improving reading of a text
US20100318898A1 (en) Rendering definitions
US20060242571A1 (en) Systems and methods for processing derivative featurees in input files
JP2003132078A (en) Database construction device, method therefor, program thereof and recording medium
JP4508264B2 (en) Database construction apparatus, database construction method, database construction program, recording medium
JP4147763B2 (en) Database construction apparatus, database construction method, database construction program, recording medium
JP2001306550A (en) Display information processor
JP4192457B2 (en) Database construction apparatus, database construction method, database construction program, recording medium
King A format design case study
Repole Exporting SAS/GRAPH® Output for Inclusion in Web Pages and Other Software Applications
US20040057078A1 (en) Method and system for printing
Foltinek An overview of cross-platform document technology
Wicker et al. POSTING MATH TO THE WEB USING TOOLS ON THE WAKE FOREST STANDARD LOAD
JP2003132077A (en) Database construction device, database construction method, database construction program, recording medium
JP2008257739A (en) Database construction device, database construction method, database construction program, recording medium

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2003794524

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2003794524

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP