CN103377197A - Rich format document processing method and rich format document processing device - Google Patents

Info

Publication number: CN103377197A
Authority: CN (China)
Prior art keywords: content, document, directory, conditioned, directory content
Legal status: Pending (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Application number: CN2012101105232A
Other languages: Chinese (zh)
Inventor: 付培
Current Assignee: Hanwang Technology Co Ltd (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Original Assignee: Hanwang Technology Co Ltd
Priority date: 2012-04-13 (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date: 2012-04-13
Publication date: 2013-10-30
Application filed by Hanwang Technology Co Ltd
Priority to CN2012101105232A
Publication of CN103377197A

Abstract

An embodiment of the invention discloses a rich format document processing method and a rich format document processing device, and relates to the field of computer technology. The invention avoids the tedious operation of manually turning pages to find target content and improves the user's reading experience. The method includes: acquiring, item by item from a to-be-processed document, document content that satisfies preset conditions to serve as directory content, and recording the position information of each item of directory content in the to-be-processed document; and displaying the directory content automatically or when triggered by a user, so that after an item of directory content is clicked, the part of the to-be-processed document corresponding to the clicked item is located and displayed according to the position information. The method and the device are mainly used for generating directories for rich format documents.

Description

Rich format document processing method and device
Technical field
The present invention relates to the field of computer technology, and in particular to a rich format document processing method and device.
Background technology
RTF (Rich Text Format) is a cross-platform document format. Like the format of Word (a word-processing application), it has good compatibility: most word processors can read and save RTF documents, and WordPad, found under Accessories in Windows, can open and edit them. RTF is a popular file structure that many text editors support.
Take a Word document (a document with the suffix .doc or .docx) as an example. When a user reads a Word document that has many pages and provides no hyperlinked table of contents, jumping to a chapter of interest requires turning pages continuously until the target position is reached, which the user finds very inconvenient.
Summary of the invention
Embodiments of the invention provide a rich format document processing method and device that avoid the tedious operation of manually turning pages to find target content and improve the user's reading experience.
To achieve the above object, embodiments of the invention adopt the following technical solutions:
A rich format document processing method, comprising:
acquiring, item by item from a to-be-processed document, document content that satisfies a preset condition as directory content, and recording the position information of each item of directory content in the to-be-processed document;
displaying the directory content when triggered by a user or automatically, so that after an item of directory content is clicked, the part of the to-be-processed document corresponding to the clicked directory content is located and displayed according to the position information.
A rich format document processing device, comprising:
an acquiring unit, configured to acquire, item by item from a to-be-processed document, document content that satisfies a preset condition as directory content, and to record the position information of each item of directory content in the to-be-processed document;
a display unit, configured to display the directory content when triggered by a user or automatically, so that after an item of directory content is clicked, the part of the to-be-processed document corresponding to the clicked directory content is located and displayed according to the position information;
an updating unit, configured to update the preset condition according to the user's settings, the preset condition comprising a preset format condition;
a parsing unit, configured to parse and record, when triggered by a user or automatically, the format information of the document content in the to-be-processed document.
With the rich format document processing method and device provided by the embodiments of the invention, when a user reads a rich format document that has many pages and provides no directory, the directory of the document can be extracted quickly. The user learns the main content of the document from the directory and jumps quickly from the directory to the part of interest, which avoids the tedious operation of manually turning pages to find target content and improves the user's reading experience.
Description of drawings
To illustrate the technical solutions in the embodiments of the invention more clearly, the drawings used in describing the embodiments are briefly introduced below. The drawings described below are evidently only some embodiments of the invention; a person of ordinary skill in the art can obtain other drawings from them without creative effort.
Fig. 1 is a flowchart of a rich format document processing method provided by an embodiment of the invention;
Fig. 2 is a flowchart of another rich format document processing method provided by an embodiment of the invention;
Fig. 3 is a structural diagram of a rich format document processing device provided by an embodiment of the invention;
Fig. 4 is a structural diagram of another rich format document processing device provided by an embodiment of the invention.
Embodiment
The technical solutions in the embodiments of the invention are described clearly and completely below with reference to the drawings. The described embodiments are evidently only some of the embodiments of the invention, not all of them. All other embodiments that a person of ordinary skill in the art obtains from the embodiments of the invention without creative effort fall within the scope of protection of the invention.
An embodiment of the invention provides a rich format document processing method which, as shown in Fig. 1, comprises:
101. Acquire, item by item from a to-be-processed document, document content that satisfies a preset condition as directory content, and record the position information of each item of directory content in the to-be-processed document.
102. Display the directory content when triggered by a user or automatically, so that after an item of directory content is clicked, the part of the to-be-processed document corresponding to the clicked directory content is located and displayed according to the position information.
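Steps 101 and 102 can be illustrated with a minimal Python sketch. The names `DirectoryEntry`, `extract_directory` and `locate`, the use of a paragraph index as position information, and the length-based condition are all assumptions made for illustration; the patent does not prescribe any of them.

```python
from dataclasses import dataclass

@dataclass
class DirectoryEntry:
    text: str      # directory content shown to the user
    position: int  # assumed form of position information: paragraph index

def extract_directory(paragraphs, condition):
    """Step 101: collect items that meet the preset condition, with positions."""
    return [DirectoryEntry(p, i) for i, p in enumerate(paragraphs) if condition(p)]

def locate(paragraphs, entry):
    """Step 102: return the document content starting at the clicked entry."""
    return paragraphs[entry.position:]

doc = [
    "Technical field",
    "The invention relates to computer technology.",
    "Background",
    "RTF is a cross-platform document format.",
]
# Hypothetical preset condition, for illustration only: short paragraphs.
toc = extract_directory(doc, lambda p: len(p) <= 20)
```

Clicking the second entry would then correspond to calling `locate(doc, toc[1])`, which returns the document content beginning at "Background".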
With the rich format document processing method provided by this embodiment, when a user reads a rich format document that has many pages and provides no directory, the directory of the document can be extracted quickly. The user learns the main content of the document from the directory and jumps quickly from the directory to the part of interest, which avoids the tedious operation of manually turning pages to find target content and improves the user's reading experience.
As an improvement on this embodiment, an embodiment of the invention provides another rich format document processing method which, as shown in Fig. 2, comprises:
201. Determine whether directory content already exists in the to-be-processed document.
After receiving a directory acquisition request triggered by the user, the rich format document processing device automatically determines whether directory content already exists in the to-be-processed document.
If no directory content exists in the to-be-processed document, step 202 is executed; otherwise step 204 is executed.
202. Parse and record, when triggered by a user or automatically, the format information of the document content in the to-be-processed document.
When the directory of a rich text format document is generated, the format information of the document content can be used to find document content whose format differs from that of most of the content and to use it as directory content.
A rich text format document carries abundant format information, such as various bullets and numbering, fonts, and font sizes, and its content is usually divided by content with special format information. Take the description part of a patent application as an example: the description comprises five major parts, "technical field", "background technology", "summary of the invention", "description of drawings", and "embodiment". For ease of reading, the titles of these five parts usually use a format different from the body text, such as bold type or a larger font size, so that the boundaries between the five parts are clear at a glance. Adding such format information as matching conditions can therefore greatly improve the accuracy of the extracted directory.
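The idea of finding content whose format differs from most of the document can be sketched as follows. This is only an illustration under stated assumptions: the `(text, font_size, bold)` records stand in for whatever a real parser would record in step 202, and "most of the content" is approximated by the most frequent format.

```python
from collections import Counter

# Hypothetical parsed format records: (text, font_size, bold).
paragraphs = [
    ("Technical field", 16, True),
    ("The invention relates to computer technology.", 12, False),
    ("Background technology", 16, True),
    ("RTF is a cross-platform document format.", 12, False),
    ("Most word processors can read and save it.", 12, False),
]

def dominant_format(records):
    """Return the (font_size, bold) pair used by the largest number of paragraphs."""
    counts = Counter((size, bold) for _, size, bold in records)
    return counts.most_common(1)[0][0]

def format_outliers(records):
    """Return the texts whose format differs from the dominant body format."""
    body = dominant_format(records)
    return [text for text, size, bold in records if (size, bold) != body]

headings = format_outliers(paragraphs)
```

Here the body format is 12 pt non-bold, so the two 16 pt bold lines are reported as heading candidates.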
203. Acquire, item by item from the to-be-processed document, document content that satisfies the preset condition as directory content, and record the position information of each item of directory content in the to-be-processed document.
The preset condition may comprise a preset format condition and a preset content condition. Preferably, step 203 comprises:
203a. Acquire, item by item from the to-be-processed document, formatted document content whose format satisfies the preset format condition;
203b. From the formatted document content, acquire document content whose content satisfies the preset content condition as directory content.
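The two sub-steps 203a and 203b can be sketched as a two-stage filter. The specific conditions below (bold text as the format condition, a leading section number as the content condition) and the dictionary layout of each item are assumptions chosen for illustration.

```python
import re

def meets_format_condition(item):
    # Assumed preset format condition: the text is bold.
    return item["bold"]

def meets_content_condition(item):
    # Assumed preset content condition: text starts with a number such as "1" or "2.3".
    return re.match(r"\d+(\.\d+)*\s", item["text"]) is not None

def extract(items):
    # 203a: keep items whose format meets the format condition, with their positions.
    formatted = [(i, it) for i, it in enumerate(items) if meets_format_condition(it)]
    # 203b: of those, keep items whose content meets the content condition.
    return [(it["text"], i) for i, it in formatted if meets_content_condition(it)]

items = [
    {"text": "1 Introduction", "bold": True},
    {"text": "Note: bold but not a heading", "bold": True},
    {"text": "2 plain numbered sentence", "bold": False},
    {"text": "2.1 Method", "bold": True},
]
result = extract(items)
```

Each returned pair holds the directory text and its index in the document, which plays the role of the recorded position information.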
When the directory content is displayed, a hyperlink must be set for each item of directory content to link it to the corresponding part of the to-be-processed document; the position information of each item of directory content in the to-be-processed document therefore needs to be recorded.
Specifically, the preset condition may be one of the following conditions or a combination of several of them:
1. the font size is larger than that of most of the text;
2. the font differs from that of the surrounding text, for example bold or Heiti (black-face) type;
3. the paragraph carries a bullet or a number;
4. the text length of the paragraph does not exceed a certain threshold, or does not exceed one line;
5. the text contains index-type characters such as "chapter", "section", "episode", or numerals.
The set of directory names finally required may be obtained by matching against several conditions in a single pass. Alternatively, some of the conditions may first be used for prescreening to obtain candidate directory text, which is then screened again with the other conditions to obtain an accurate chapter directory.
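The conditions listed above, and the two ways of applying them (a single pass over several conditions, or prescreening followed by refinement), can be sketched as composable predicates. The thresholds `BODY_SIZE` and `MAX_LENGTH`, the regular expressions, and the function names are assumptions for illustration, not values given in the source.

```python
import re

BODY_SIZE = 12   # assumed font size of most of the text
MAX_LENGTH = 30  # assumed length threshold for condition 4

conditions = {
    "larger_font": lambda p: p["size"] > BODY_SIZE,                       # condition 1
    "bold": lambda p: p["bold"],                                          # condition 2
    "numbered": lambda p: re.match(r"\d+[.)]?\s", p["text"]) is not None,  # condition 3
    "short": lambda p: len(p["text"]) <= MAX_LENGTH,                      # condition 4
    "index_word": lambda p: re.search(r"chapter|section|\d", p["text"], re.I) is not None,  # condition 5
}

def match_all(paragraphs, names):
    """Single pass: keep paragraphs that satisfy every named condition."""
    return [p for p in paragraphs if all(conditions[n](p) for n in names)]

def prescreen_then_refine(paragraphs, first, rest):
    """Two passes: prescreen with some conditions, then screen the candidates again."""
    candidates = match_all(paragraphs, first)
    return match_all(candidates, rest)

paragraphs = [
    {"text": "Chapter 1 Overview", "size": 16, "bold": True},
    {"text": "Important warning", "size": 12, "bold": True},
    {"text": "This body paragraph is long enough to exceed the threshold.", "size": 12, "bold": False},
]
one_pass = match_all(paragraphs, ["bold", "short", "index_word"])
two_pass = prescreen_then_refine(paragraphs, ["bold"], ["short", "index_word"])
```

Both strategies select the same entries when the same conditions are used; the two-pass form mainly reduces the work done by the later, possibly more expensive, checks.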
204. Display the directory content when triggered by a user or automatically, so that after an item of directory content is clicked, the part of the to-be-processed document corresponding to the clicked directory content is located and displayed according to the position information.
For the user's convenience, when the directory content is displayed, the items are displayed one after another in the order in which they appear in the to-be-processed document.
Specifically, the chapter directory may be saved as an XML (Extensible Markup Language) file, but is not limited to this format; a user-defined format may also be used. Entries are stored in the order in which the directory text appears in the document. To read the directory, the previously saved chapter directory XML file is first read and parsed, and the attribute values holding the chapter names are read out and displayed. To display the directory, a small window is popped up on the screen, or a pane is split off within the current window, to serve as the display window, and the results read from the directory are listed in order.
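Saving and reading the chapter directory as XML can be sketched with Python's standard `xml.etree.ElementTree` module. The element name `chapter` and the attribute names `name` and `position` are hypothetical; the source only states that chapter names are stored as attribute values, in document order.

```python
import xml.etree.ElementTree as ET

def save_directory(entries):
    """Serialize (name, position) pairs to XML, in document order."""
    root = ET.Element("directory")
    for name, position in entries:
        # Hypothetical schema: one <chapter> element per directory entry.
        ET.SubElement(root, "chapter", name=name, position=str(position))
    return ET.tostring(root, encoding="unicode")

def read_chapter_names(xml_text):
    """Parse the saved XML and read out the chapter-name attribute values."""
    root = ET.fromstring(xml_text)
    return [chapter.get("name") for chapter in root.findall("chapter")]

xml_text = save_directory([("Technical field", 0), ("Background technology", 2)])
names = read_chapter_names(xml_text)
```

A display window would then list `names` in order and use each entry's `position` attribute to jump to the corresponding place in the document.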
205. Update the preset condition according to the user's settings.
In this embodiment the preset condition may be set by the user. An input window for setting the preset condition is provided; after the user submits a preset condition through the input window, the submitted condition is recorded, and the most recently recorded preset condition is used the next time directory content is generated.
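Recording user-submitted conditions and using the latest one on the next generation can be sketched as a small store. The class name `ConditionStore` and the dictionary representation of a condition are assumptions for illustration.

```python
class ConditionStore:
    """Keeps every submitted preset condition and exposes the most recent one."""

    def __init__(self, default):
        self.history = [dict(default)]

    def submit(self, new_condition):
        # Called when the user submits a condition through the input window.
        self.history.append(dict(new_condition))

    def latest(self):
        # Used the next time directory content is generated.
        return self.history[-1]

store = ConditionStore({"bold": True})
store.submit({"bold": True, "max_length": 30})
current = store.latest()
```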
Even when step 201 determines that directory content already exists, steps 202-204 of this embodiment may be executed at the user's explicit request, or may be executed to regenerate the directory content when it is determined that the user has set a new preset condition. The execution order of step 205 may be adjusted according to actual conditions.
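The control flow around step 201, including forced regeneration and regeneration after a condition change, might look like the sketch below. The function name `get_directory`, the `force` and `conditions_changed` flags, and the dictionary layout of the document are hypothetical.

```python
def get_directory(document, extract, force=False, conditions_changed=False):
    """Return the directory, regenerating it only when needed."""
    # Step 201: check whether directory content already exists.
    missing = document.get("directory") is None
    if missing or force or conditions_changed:
        # Steps 202-203: parse formats and extract directory content.
        document["directory"] = extract(document["paragraphs"])
    # Step 204: the caller displays the returned directory.
    return document["directory"]

calls = []

def counting_extract(paragraphs):
    # Stand-in extractor that records how often extraction runs.
    calls.append(1)
    return list(paragraphs)

doc = {"paragraphs": ["Technical field"], "directory": None}
get_directory(doc, counting_extract)              # no directory yet: extracts
get_directory(doc, counting_extract)              # directory exists: reused
get_directory(doc, counting_extract, force=True)  # user forces regeneration
```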
With the rich format document processing method provided by this embodiment, when a user reads a rich format document that has many pages and provides no directory, the directory of the document can be extracted quickly. The user learns the main content of the document from the directory and jumps quickly from the directory to the part of interest, which avoids the tedious operation of manually turning pages to find target content and improves the user's reading experience.
An embodiment of the invention provides a rich format document processing device which, as shown in Fig. 3, comprises: an acquiring unit 31, a display unit 32, an updating unit 33, and a parsing unit 34.
The acquiring unit 31 is configured to acquire, item by item from a to-be-processed document, document content that satisfies a preset condition as directory content, and to record the position information of each item of directory content in the to-be-processed document.
The display unit 32 is configured to display the directory content when triggered by a user or automatically, so that after an item of directory content is clicked, the part of the to-be-processed document corresponding to the clicked directory content is located and displayed according to the position information.
The updating unit 33 is configured to update the preset condition according to the user's settings; the preset condition comprises a preset format condition.
The parsing unit 34 is configured to parse and record, when triggered by a user or automatically, the format information of the document content in the to-be-processed document.
With the rich format document processing device provided by this embodiment, when a user reads a rich format document that has many pages and provides no directory, the directory of the document can be extracted quickly. The user learns the main content of the document from the directory and jumps quickly from the directory to the part of interest, which avoids the tedious operation of manually turning pages to find target content and improves the user's reading experience.
As an improvement on this embodiment, an embodiment of the invention provides another rich format document processing device which, as shown in Fig. 4, comprises: a judging unit 41, a parsing unit 42, an acquiring unit 43, a display unit 44, and an updating unit 45.
The acquiring unit 43 comprises a format acquiring module 431 and a content acquiring module 432.
The judging unit 41 is configured to determine whether the directory content already exists in the to-be-processed document.
The parsing unit 42 is configured to parse and record, when triggered by a user or automatically, the format information of the document content in the to-be-processed document.
The acquiring unit 43 is configured to, if the directory content does not exist in the to-be-processed document, acquire item by item from the to-be-processed document the document content that satisfies the preset condition as directory content, and record the position information of each item of directory content in the to-be-processed document.
The format acquiring module 431 is configured to acquire, item by item from the to-be-processed document, formatted document content whose format satisfies the preset format condition.
The content acquiring module 432 is configured to acquire, from the formatted document content, document content whose content satisfies the preset content condition as directory content.
The display unit 44 is configured to display the directory content when triggered by a user or automatically, so that after an item of directory content is clicked, the part of the to-be-processed document corresponding to the clicked directory content is located and displayed according to the position information.
Specifically, the display unit displays, when triggered by a user or automatically, the items of directory content one after another in the order in which they appear in the to-be-processed document.
The updating unit 45 is configured to update the preset condition according to the user's settings; the preset condition comprises a preset format condition.
With the rich format document processing device provided by this embodiment, when a user reads a rich format document that has many pages and provides no directory, the directory of the document can be extracted quickly. The user learns the main content of the document from the directory and jumps quickly from the directory to the part of interest, which avoids the tedious operation of manually turning pages to find target content and improves the user's reading experience.
The rich format document processing method and device provided by the embodiments of the invention are equally applicable to documents other than rich format documents that carry rich format information, such as Word documents.
From the above description of the embodiments, a person skilled in the art can clearly understand that the invention may be implemented by software plus the necessary general-purpose hardware, or of course by hardware alone, although the former is the better implementation in many cases. On this understanding, the technical solution of the invention in essence, or the part of it that contributes to the prior art, may be embodied in the form of a software product. The computer software product is stored in a readable storage medium, such as a computer floppy disk, hard disk, or optical disc, and comprises a number of instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to execute the methods described in the embodiments of the invention.
The above is only a specific implementation of the invention, but the scope of protection of the invention is not limited to it. Any change or replacement that a person skilled in the art can readily conceive within the technical scope disclosed by the invention shall fall within the scope of protection of the invention. The scope of protection of the invention shall therefore be determined by the scope of protection of the claims.

Claims (10)

1. A rich format document processing method, characterized by comprising:
acquiring, item by item from a to-be-processed document, document content that satisfies a preset condition as directory content, and recording the position information of each item of directory content in the to-be-processed document;
displaying the directory content when triggered by a user or automatically, so that after an item of directory content is clicked, the part of the to-be-processed document corresponding to the clicked directory content is located and displayed according to the position information.
2. The method according to claim 1, characterized by further comprising:
updating the preset condition according to the user's settings.
3. The method according to claim 2, characterized in that the preset condition comprises a preset format condition, and before acquiring, item by item from the to-be-processed document, the document content whose format satisfies the preset condition as directory content, the method further comprises:
parsing and recording, when triggered by a user or automatically, the format information of the document content in the to-be-processed document.
4. The method according to claim 3, characterized in that the preset condition further comprises a preset content condition, and acquiring, item by item from the to-be-processed document, the document content that satisfies the preset condition as directory content comprises:
acquiring, item by item from the to-be-processed document, formatted document content whose format satisfies the preset format condition;
acquiring, from the formatted document content, document content whose content satisfies the preset content condition as directory content.
5. The method according to claim 4, characterized in that before acquiring, item by item from the to-be-processed document, the document content that satisfies the preset condition as directory content, the method further comprises:
determining whether the directory content already exists in the to-be-processed document;
wherein acquiring, item by item from the to-be-processed document, the document content whose format satisfies the preset condition as directory content is: if the directory content does not exist in the to-be-processed document, acquiring, item by item from the to-be-processed document, the document content whose format satisfies the preset condition as directory content.
6. The method according to any one of claims 1 to 5, characterized in that displaying the directory content when triggered by a user or automatically is: displaying, when triggered by a user or automatically, the items of directory content one after another in the order in which they appear in the to-be-processed document.
7. A rich format document processing device, characterized by comprising:
an acquiring unit, configured to acquire, item by item from a to-be-processed document, document content that satisfies a preset condition as directory content, and to record the position information of each item of directory content in the to-be-processed document;
a display unit, configured to display the directory content when triggered by a user or automatically, so that after an item of directory content is clicked, the part of the to-be-processed document corresponding to the clicked directory content is located and displayed according to the position information;
an updating unit, configured to update the preset condition according to the user's settings, the preset condition comprising a preset format condition;
a parsing unit, configured to parse and record, when triggered by a user or automatically, the format information of the document content in the to-be-processed document.
8. The device according to claim 7, characterized in that the preset condition further comprises a preset content condition, and the acquiring unit comprises:
a format acquiring module, configured to acquire, item by item from the to-be-processed document, formatted document content whose format satisfies the preset format condition;
a content acquiring module, configured to acquire, from the formatted document content, document content whose content satisfies the preset content condition as directory content.
9. The device according to claim 8, characterized by further comprising:
a judging unit, configured to determine whether the directory content already exists in the to-be-processed document;
wherein the acquiring unit acquiring, item by item from the to-be-processed document, the document content whose format satisfies the preset condition as directory content is: if the directory content does not exist in the to-be-processed document, the acquiring unit acquires, item by item from the to-be-processed document, the document content whose format satisfies the preset condition as directory content.
10. The device according to any one of claims 7 to 9, characterized in that the display unit displaying the directory content when triggered by a user or automatically is: the display unit displays, when triggered by a user or automatically, the items of directory content one after another in the order in which they appear in the to-be-processed document.
CN2012101105232A 2012-04-13 2012-04-13 Rich format document processing method and rich format document processing device Pending CN103377197A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2012101105232A CN103377197A (en) 2012-04-13 2012-04-13 Rich format document processing method and rich format document processing device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2012101105232A CN103377197A (en) 2012-04-13 2012-04-13 Rich format document processing method and rich format document processing device

Publications (1)

Publication Number Publication Date
CN103377197A true CN103377197A (en) 2013-10-30

Family

ID=49462327

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2012101105232A Pending CN103377197A (en) 2012-04-13 2012-04-13 Rich format document processing method and rich format document processing device

Country Status (1)

Country Link
CN (1) CN103377197A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030042319A1 (en) * 2001-08-31 2003-03-06 Xerox Corporation Automatic and semi-automatic index generation for raster documents
CN101442639A (en) * 2007-11-23 2009-05-27 佛山普立华科技有限公司 System and method for establishing catalog
CN101533393A (en) * 2008-03-11 2009-09-16 深圳市乐天科技有限公司 Method for quickly classifying and retrieving sentences in article by using electronic device

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107301184A (en) * 2016-04-14 2017-10-27 珠海金山办公软件有限公司 It is a kind of to recognize the method and device that word or file generates catalogue
WO2020000835A1 (en) * 2018-06-29 2020-01-02 天津字节跳动科技有限公司 Method and device for automatically displaying document directory
US11347930B2 (en) 2018-06-29 2022-05-31 Tianjin Bytedance Technology Co., Ltd. Method and apparatus for automatically displaying directory of document
CN111353296A (en) * 2020-02-27 2020-06-30 北京字节跳动网络技术有限公司 Article processing method and device, electronic equipment and computer-readable storage medium
CN111353296B (en) * 2020-02-27 2023-07-18 抖音视界有限公司 Article processing method, apparatus, electronic device and computer readable storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20131030