CN103377197A - Rich format document processing method and rich format document processing device - Google Patents
Rich format document processing method and rich format document processing device
- Publication number
- CN103377197A
- Authority
- CN
- China
- Prior art keywords
- content
- document
- directory
- conditioned
- directory content
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Abstract
An embodiment of the invention discloses a rich format document processing method and a rich format document processing device, and relates to the field of computer technology. The invention avoids the cumbersome operation of manually turning pages to find target content and improves the user's reading experience. The method includes: acquiring, item by item, document content that satisfies a preset condition from a document to be processed as directory content, and recording the position information of each item of directory content in the document to be processed; and outputting and displaying the directory content automatically or when triggered by a user, so that after an item of directory content is clicked, the part of the document to be processed that corresponds to the clicked directory content is located and displayed according to the position information. The method and the device are mainly used for generating directories for rich format documents.
Description
Technical field
The present invention relates to the field of computer technology, and in particular to a rich format document processing method and device.
Background art
RTF (Rich Text Format) is a cross-platform document format. Like the format used by Word (a word-processing application), it has good compatibility: most word processors can read and save RTF documents, and WordPad in the Windows Accessories is enough to open and edit an RTF document. RTF is a popular file structure that many text editors support.
Take a Word document (a document with the suffix doc or docx) as an example. When a user reads a Word document that has many pages and provides no hyperlinked directory, and wants to jump to a chapter of interest, the user must turn pages continuously to reach the target position, which is very inconvenient.
Summary of the invention
Embodiments of the present invention provide a rich format document processing method and device that avoid the cumbersome operation of manually turning pages to find target content and improve the user's reading experience.
To achieve the above object, the embodiments of the present invention adopt the following technical solutions:
A rich format document processing method comprises:
Acquiring, item by item, document content that satisfies a preset condition from a document to be processed as directory content, and recording the position information of each item of directory content in the document to be processed;
Outputting and displaying the directory content when triggered by a user or automatically, so that after an item of directory content is clicked, the part of the document to be processed that corresponds to the clicked directory content is located and displayed according to the position information.
A rich format document processing device comprises:
An acquiring unit, configured to acquire, item by item, document content that satisfies a preset condition from a document to be processed as directory content, and to record the position information of each item of directory content in the document to be processed;
A display unit, configured to output and display the directory content when triggered by a user or automatically, so that after an item of directory content is clicked, the part of the document to be processed that corresponds to the clicked directory content is located and displayed according to the position information;
An updating unit, configured to update the preset condition according to a setting made by the user, the preset condition comprising a preset format condition;
A parsing unit, configured to parse and record, when triggered by the user or automatically, the format information of the document content in the document to be processed.
With the rich format document processing method and device provided by the embodiments of the present invention, when a user reads a rich format document that has many pages and provides no directory, the directory of the document can be extracted quickly. The user can learn the main content of the document from the directory and jump quickly from the directory to the part of interest, which avoids the cumbersome operation of manually turning pages to find target content and improves the user's reading experience.
Description of drawings
To describe the technical solutions in the embodiments of the present invention more clearly, the accompanying drawings needed for describing the embodiments are briefly introduced below. Obviously, the accompanying drawings described below show only some embodiments of the present invention, and those of ordinary skill in the art can obtain other drawings from them without creative effort.
Fig. 1 is a flowchart of a rich format document processing method provided by an embodiment of the present invention;
Fig. 2 is a flowchart of another rich format document processing method provided by an embodiment of the present invention;
Fig. 3 is a structural diagram of a rich format document processing device provided by an embodiment of the present invention;
Fig. 4 is a structural diagram of another rich format document processing device provided by an embodiment of the present invention.
Detailed description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are only some, rather than all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the scope of protection of the present invention.
An embodiment of the present invention provides a rich format document processing method which, as shown in Fig. 1, comprises:
101. Acquire, item by item, document content that satisfies a preset condition from a document to be processed as directory content, and record the position information of each item of directory content in the document to be processed.
102. Output and display the directory content when triggered by a user or automatically, so that after an item of directory content is clicked, the part of the document to be processed that corresponds to the clicked directory content is located and displayed according to the position information.
With the rich format document processing method provided by this embodiment, when a user reads a rich format document that has many pages and provides no directory, the directory of the document can be extracted quickly. The user can learn the main content of the document from the directory and jump quickly from the directory to the part of interest, which avoids the cumbersome operation of manually turning pages to find target content and improves the user's reading experience.
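Steps 101 and 102 can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `DirectoryEntry`, `extract_directory`, and `locate` are hypothetical, the position information is assumed to be a paragraph index, and the preset condition is an arbitrary toy predicate.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class DirectoryEntry:
    text: str       # the directory content shown to the user
    position: int   # assumed form of position information: a paragraph index


def extract_directory(paragraphs: List[str],
                      preset_condition: Callable[[str], bool]) -> List[DirectoryEntry]:
    """Step 101: scan item by item, keep matches, record where each was found."""
    return [DirectoryEntry(text=p, position=i)
            for i, p in enumerate(paragraphs)
            if preset_condition(p)]


def locate(paragraphs: List[str], entry: DirectoryEntry) -> List[str]:
    """Step 102: on click, return the document from the recorded position onward."""
    return paragraphs[entry.position:]


paragraphs = ["Technical field", "This invention relates to ...",
              "Background", "RTF is a cross-platform format ..."]
# Toy preset condition (an assumption): short lines that contain no full stop.
entries = extract_directory(paragraphs, lambda p: len(p) < 20 and "." not in p)
```

Clicking the second entry would then correspond to `locate(paragraphs, entries[1])`, which starts at the "Background" paragraph.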
As an improvement of this embodiment, an embodiment of the present invention provides another rich format document processing method which, as shown in Fig. 2, comprises:
201. Determine whether directory content already exists in the document to be processed.
After receiving a directory acquisition request triggered by the user, the rich format document processing device automatically determines whether directory content already exists in the document to be processed.
If no directory content exists in the document to be processed, step 202 is executed; otherwise, step 204 is executed.
202. Parse and record, when triggered by the user or automatically, the format information of the document content in the document to be processed.
When the directory of a rich text format document is generated, the format information of the document content can be used to find document content whose format differs from that of most of the content, and to take it as directory content.
A rich text format document carries abundant format information, such as various bullets, typefaces, and font sizes, and the content of a document is usually divided by content that has special format information. Take the description part of a patent application as an example. The description comprises five major parts: "technical field", "background art", "summary of the invention", "description of drawings", and "detailed description". For the reader's convenience, the headings of these five parts usually use a format different from that of the body text, such as bold type or a larger font size, so that the boundaries between the five parts are clear at a glance. Adding such format information as matching conditions can therefore greatly improve the accuracy of the extracted directory.
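One way to read "format different from most of the content" is to treat the format that covers the most characters as the body format and flag every paragraph that deviates from it. The sketch below makes that assumption; the `Format` and `Paragraph` types are hypothetical stand-ins for the format information recorded in step 202, which the source does not specify in detail.

```python
from collections import Counter
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Format:
    font_size: float
    bold: bool


@dataclass
class Paragraph:
    text: str
    fmt: Format


def dominant_format(paragraphs: List[Paragraph]) -> Format:
    """Return the format that covers the most characters (assumed body format)."""
    weight: Counter = Counter()
    for p in paragraphs:
        weight[p.fmt] += len(p.text)
    return weight.most_common(1)[0][0]


def differing_paragraphs(paragraphs: List[Paragraph]) -> List[Tuple[int, str]]:
    """Return (position, text) for paragraphs whose format is not the body format."""
    body = dominant_format(paragraphs)
    return [(i, p.text) for i, p in enumerate(paragraphs) if p.fmt != body]


heading = Format(font_size=16.0, bold=True)
body = Format(font_size=12.0, bold=False)
doc = [
    Paragraph("Technical field", heading),
    Paragraph("The present invention relates to the field of computer technology.", body),
    Paragraph("Background art", heading),
    Paragraph("RTF is a cross-platform document format with good compatibility.", body),
]
candidates = differing_paragraphs(doc)
```

Weighting by character count rather than paragraph count is a design choice of this sketch: it keeps a document with many short headings from having the heading format mistaken for the body format.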
203. Acquire, item by item, document content that satisfies a preset condition from the document to be processed as directory content, and record the position information of each item of directory content in the document to be processed.
The preset condition may comprise a preset format condition and a preset content condition. Preferably, step 203 comprises:
203a. Acquire, item by item, from the document to be processed, formatted document content whose format satisfies the preset format condition;
203b. Acquire, from the formatted document content, document content whose content satisfies the preset content condition as directory content.
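Steps 203a and 203b amount to a two-stage filter. The following sketch shows that shape only; the format condition (bold text) and the content condition (text starting with "Chapter" or a numeral) are assumptions chosen for illustration, and `format_stage` and `content_stage` are hypothetical names.

```python
import re
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Paragraph:
    text: str
    bold: bool


def format_stage(paragraphs: List[Paragraph]) -> List[Tuple[int, Paragraph]]:
    """Step 203a: keep (position, paragraph) pairs that meet the format condition."""
    return [(i, p) for i, p in enumerate(paragraphs) if p.bold]


# Assumed content condition: the text starts with "Chapter" or with a numeral.
CONTENT_PATTERN = re.compile(r"^(Chapter\b|\d+[.\s])")


def content_stage(candidates: List[Tuple[int, Paragraph]]) -> List[Tuple[int, str]]:
    """Step 203b: keep candidates whose text meets the content condition."""
    return [(i, p.text) for i, p in candidates if CONTENT_PATTERN.search(p.text)]


doc = [
    Paragraph("Chapter 1 Overview", bold=True),
    Paragraph("Note: read carefully", bold=True),          # bold, but not a heading
    Paragraph("1. plain numbered body line", bold=False),  # numbered, but not bold
    Paragraph("2 Methods", bold=True),
]
directory = content_stage(format_stage(doc))
```

The positions survive both stages, so each resulting entry still carries the position information that step 203 requires.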
When the directory content is displayed, a hyperlink to the corresponding part of the document to be processed needs to be set for each item of directory content. The position information of each item of directory content in the document to be processed therefore needs to be recorded.
Specifically, the preset condition may be one of the following conditions or a combination of several of them:
1. the font size is larger than that of most of the text;
2. the typeface differs from that of the surrounding text, for example bold or a Hei (sans-serif) typeface;
3. the paragraph carries a bullet or numbering;
4. the text length of the paragraph does not exceed a certain threshold or does not exceed one line;
5. the text contains index-type characters such as "chapter", "section", "hui" (回, the chapter marker in traditional Chinese novels), or numerals.
The set of directory names finally needed may be obtained by matching against several conditions in a single pass. Alternatively, some of the conditions may first be used for prescreening to obtain candidate directory text, which is then screened again with the other conditions to finally obtain an accurate chapter directory.
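The five conditions can be expressed as independent predicates that are combined on demand. The sketch below is illustrative only: the thresholds `BODY_FONT_SIZE` and `MAX_LENGTH`, the `INDEX_PATTERN` expression, and the condition names are all assumptions, and condition 2 is simplified to a bold flag.

```python
import re
from dataclasses import dataclass
from typing import Iterable


@dataclass
class Paragraph:
    text: str
    font_size: float
    bold: bool
    numbered: bool   # True if the paragraph carries a bullet or numbering


BODY_FONT_SIZE = 12.0   # assumed font size of most of the text
MAX_LENGTH = 40         # assumed length threshold, in characters
INDEX_PATTERN = re.compile(r"chapter|section|\d", re.IGNORECASE)

# One predicate per condition in the list above.
CONDITIONS = {
    "larger_font": lambda p: p.font_size > BODY_FONT_SIZE,        # condition 1
    "different_face": lambda p: p.bold,                           # condition 2
    "numbered": lambda p: p.numbered,                             # condition 3
    "short": lambda p: len(p.text) <= MAX_LENGTH,                 # condition 4
    "index_chars": lambda p: bool(INDEX_PATTERN.search(p.text)),  # condition 5
}


def matches(p: Paragraph, enabled: Iterable[str]) -> bool:
    """Single-pass match: the paragraph must satisfy every enabled condition."""
    return all(CONDITIONS[name](p) for name in enabled)


heading = Paragraph("Chapter 2 Design", 16.0, bold=True, numbered=False)
body = Paragraph(
    "Section 3 of the earlier report describes the design in much more detail than here.",
    12.0, bold=False, numbered=False)
```

Here `matches(body, ["index_chars"])` is true because the body sentence mentions "Section 3", which shows why a content condition alone is a weak signal and why combining it with a length or format condition helps.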
204. Output and display the directory content when triggered by the user or automatically, so that after an item of directory content is clicked, the part of the document to be processed that corresponds to the clicked directory content is located and displayed according to the position information.
For ease of viewing, when the directory content is displayed, the items of directory content are output and displayed one after another in the order in which they appear in the document to be processed.
Specifically, the chapter directory may be saved in an XML (Extensible Markup Language) file, but is not limited to this format; it may also be saved in a user-defined format. The entries are stored in the order in which the directory text appears in the document. To read the directory, the previously saved chapter directory XML file is first read and parsed, and the attribute values that hold the chapter names are then read out and displayed. To display the directory, a small window is popped up on the screen, or a window is split off from the current window, to serve as the display window, and the entries are arranged in it in the order in which they were read.
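The save and read steps can be sketched with Python's standard `xml.etree.ElementTree` module. The source does not give a schema, so the element names `directory` and `chapter` and the attributes `name` and `position` are assumptions made for this example.

```python
import xml.etree.ElementTree as ET
from typing import List, Tuple


def save_directory(entries: List[Tuple[str, int]]) -> str:
    """Serialize (chapter name, position) pairs in document order."""
    root = ET.Element("directory")
    for name, position in entries:
        ET.SubElement(root, "chapter", name=name, position=str(position))
    return ET.tostring(root, encoding="unicode")


def load_directory(xml_text: str) -> List[Tuple[str, int]]:
    """Parse the saved XML and read the chapter-name attributes back, in order."""
    root = ET.fromstring(xml_text)
    return [(c.get("name"), int(c.get("position")))
            for c in root.findall("chapter")]


entries = [("Technical field", 0), ("Background art", 2), ("Summary of the invention", 5)]
xml_text = save_directory(entries)
restored = load_directory(xml_text)
```

Because child elements are written and read in sequence, the stored order matches the order in which the headings appear in the document, which is the order the display window needs.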
205. Update the preset condition according to a setting made by the user.
In this embodiment, the preset condition may be set by the user. An input window for the preset condition is provided to the user. After the user submits a preset condition through the input window, the submitted preset condition is recorded, and the most recently recorded preset condition is used the next time directory content is generated.
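The record-then-reuse behaviour of step 205 can be sketched as a small store that always hands the latest submission to the generator. `ConditionStore`, its default values, and the condition keys `min_font_size` and `max_length` are hypothetical; the source does not say how the preset condition is represented.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class ConditionStore:
    # Assumed default preset condition.
    current: Dict[str, float] = field(
        default_factory=lambda: {"min_font_size": 14, "max_length": 40})
    history: List[Dict[str, float]] = field(default_factory=list)

    def submit(self, new_condition: Dict[str, float]) -> None:
        """Record the user's submission; it becomes the condition used next time."""
        self.history.append(self.current)
        self.current = dict(new_condition)


def generate_directory(paragraphs: List[Tuple[str, float]],
                       store: ConditionStore) -> List[str]:
    """Generate directory content with the most recently recorded condition."""
    c = store.current
    return [text for text, size in paragraphs
            if size >= c["min_font_size"] and len(text) <= c["max_length"]]


paragraphs = [("Overview", 16), ("Details", 13), ("Plain body text of the document", 12)]
store = ConditionStore()
before = generate_directory(paragraphs, store)
store.submit({"min_font_size": 13, "max_length": 40})
after = generate_directory(paragraphs, store)
```

Lowering the assumed font-size threshold from 14 to 13 brings the "Details" heading into the directory on the next generation, without touching the generator itself.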
Steps 202 to 204 in this embodiment may also be executed when step 201 determines that directory content already exists, if the user forces the operation; directory content may likewise be regenerated when it is determined that the user has set a new preset condition. The execution order of step 205 may be adjusted according to the actual situation.
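The branching around step 201, including the forced and regenerate cases, reduces to a single decision. The function below is a hedged sketch of that logic; `should_generate` and its parameter names are hypothetical, and how the device detects an existing directory is left open by the source.

```python
def should_generate(has_directory: bool,
                    forced_by_user: bool = False,
                    condition_changed: bool = False) -> bool:
    """Return True when steps 202 to 204 should run (or run again)."""
    if not has_directory:
        return True   # normal path out of step 201: no directory yet
    # A directory exists: regenerate only on a forced operation
    # or after the user has set a new preset condition.
    return forced_by_user or condition_changed
```

With the defaults, an existing directory is simply displayed (step 204) without being rebuilt.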
With the rich format document processing method provided by this embodiment, when a user reads a rich format document that has many pages and provides no directory, the directory of the document can be extracted quickly. The user can learn the main content of the document from the directory and jump quickly from the directory to the part of interest, which avoids the cumbersome operation of manually turning pages to find target content and improves the user's reading experience.
An embodiment of the present invention provides a rich format document processing device which, as shown in Fig. 3, comprises: an acquiring unit 31, a display unit 32, an updating unit 33, and a parsing unit 34.
The acquiring unit 31 is configured to acquire, item by item, document content that satisfies a preset condition from a document to be processed as directory content, and to record the position information of each item of directory content in the document to be processed;
The updating unit 33 is configured to update the preset condition according to a setting made by the user; the preset condition comprises a preset format condition;
With the rich format document processing device provided by this embodiment, when a user reads a rich format document that has many pages and provides no directory, the directory of the document can be extracted quickly. The user can learn the main content of the document from the directory and jump quickly from the directory to the part of interest, which avoids the cumbersome operation of manually turning pages to find target content and improves the user's reading experience.
As an improvement of this embodiment, an embodiment of the present invention provides another rich format document processing device which, as shown in Fig. 4, comprises: a judging unit 41, a parsing unit 42, an acquiring unit 43, a display unit 44, and an updating unit 45.
The acquiring unit 43 comprises a format acquisition module 431 and a content acquisition module 432.
The judging unit 41 is configured to determine whether the directory content already exists in the document to be processed;
The acquiring unit 43 is configured to, if the directory content does not exist in the document to be processed, acquire, item by item, document content that satisfies a preset condition from the document to be processed as directory content, and to record the position information of each item of directory content in the document to be processed;
The format acquisition module 431 is configured to acquire, item by item, from the document to be processed, formatted document content whose format satisfies the preset format condition;
The content acquisition module 432 is configured to acquire, from the formatted document content, document content whose content satisfies the preset content condition as directory content.
Specifically, the display unit outputs and displays the items of directory content one after another, when triggered by the user or automatically, in the order in which they appear in the document to be processed.
The updating unit 45 is configured to update the preset condition according to a setting made by the user, the preset condition comprising a preset format condition.
With the rich format document processing device provided by this embodiment, when a user reads a rich format document that has many pages and provides no directory, the directory of the document can be extracted quickly. The user can learn the main content of the document from the directory and jump quickly from the directory to the part of interest, which avoids the cumbersome operation of manually turning pages to find target content and improves the user's reading experience.
The rich format document processing method and device provided by the embodiments of the present invention are equally applicable to documents other than rich format documents that carry rich format information, such as Word documents.
From the above description of the embodiments, those skilled in the art can clearly understand that the present invention may be implemented by software plus the necessary general-purpose hardware, or of course by hardware alone, although the former is the better implementation in many cases. Based on this understanding, the technical solution of the present invention in essence, or the part of it that contributes over the prior art, may be embodied in the form of a software product. The computer software product is stored in a readable storage medium, such as a computer floppy disk, hard disk, or optical disc, and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform the methods described in the embodiments of the present invention.
The above are only specific embodiments of the present invention, but the scope of protection of the present invention is not limited to them. Any change or replacement that a person skilled in the art can readily conceive within the technical scope disclosed by the present invention shall fall within the scope of protection of the present invention. The scope of protection of the present invention shall therefore be defined by the scope of protection of the claims.
Claims (10)
1. A rich format document processing method, characterized by comprising:
Acquiring, item by item, document content that satisfies a preset condition from a document to be processed as directory content, and recording the position information of each item of directory content in the document to be processed;
Outputting and displaying the directory content when triggered by a user or automatically, so that after an item of directory content is clicked, the part of the document to be processed that corresponds to the clicked directory content is located and displayed according to the position information.
2. The method according to claim 1, characterized by further comprising:
Updating the preset condition according to a setting made by the user.
3. The method according to claim 2, characterized in that the preset condition comprises a preset format condition, and before the acquiring, item by item, from the document to be processed, of document content whose format satisfies the preset condition as directory content, the method further comprises:
Parsing and recording, when triggered by the user or automatically, the format information of the document content in the document to be processed.
4. The method according to claim 3, characterized in that the preset condition further comprises a preset content condition, and the acquiring, item by item, of document content that satisfies the preset condition from the document to be processed as directory content comprises:
Acquiring, item by item, from the document to be processed, formatted document content whose format satisfies the preset format condition;
Acquiring, from the formatted document content, document content whose content satisfies the preset content condition as directory content.
5. The method according to claim 4, characterized in that before the acquiring, item by item, of document content that satisfies the preset condition from the document to be processed as directory content, the method further comprises:
Determining whether the directory content already exists in the document to be processed;
The acquiring, item by item, from the document to be processed, of document content whose format satisfies the preset condition as directory content is: if the directory content does not exist in the document to be processed, acquiring, item by item, from the document to be processed, document content whose format satisfies the preset condition as directory content.
6. The method according to any one of claims 1 to 5, characterized in that the outputting and displaying of the directory content when triggered by the user or automatically is: outputting and displaying the items of directory content one after another, when triggered by the user or automatically, in the order in which they appear in the document to be processed.
7. A rich format document processing device, characterized by comprising:
An acquiring unit, configured to acquire, item by item, document content that satisfies a preset condition from a document to be processed as directory content, and to record the position information of each item of directory content in the document to be processed;
A display unit, configured to output and display the directory content when triggered by a user or automatically, so that after an item of directory content is clicked, the part of the document to be processed that corresponds to the clicked directory content is located and displayed according to the position information;
An updating unit, configured to update the preset condition according to a setting made by the user, the preset condition comprising a preset format condition;
A parsing unit, configured to parse and record, when triggered by the user or automatically, the format information of the document content in the document to be processed.
8. The device according to claim 7, characterized in that the preset condition further comprises a preset content condition, and the acquiring unit comprises:
A format acquisition module, configured to acquire, item by item, from the document to be processed, formatted document content whose format satisfies the preset format condition;
A content acquisition module, configured to acquire, from the formatted document content, document content whose content satisfies the preset content condition as directory content.
9. The device according to claim 8, characterized by further comprising:
A judging unit, configured to determine whether the directory content already exists in the document to be processed;
The acquiring unit acquiring, item by item, from the document to be processed, document content whose format satisfies the preset condition as directory content is: if the directory content does not exist in the document to be processed, the acquiring unit acquires, item by item, from the document to be processed, document content whose format satisfies the preset condition as directory content.
10. The device according to any one of claims 7 to 9, characterized in that the display unit outputting and displaying the directory content when triggered by the user or automatically is: the display unit outputs and displays the items of directory content one after another, when triggered by the user or automatically, in the order in which they appear in the document to be processed.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2012101105232A CN103377197A (en) | 2012-04-13 | 2012-04-13 | Rich format document processing method and rich format document processing device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2012101105232A CN103377197A (en) | 2012-04-13 | 2012-04-13 | Rich format document processing method and rich format document processing device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103377197A true CN103377197A (en) | 2013-10-30 |
Family
ID=49462327
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2012101105232A Pending CN103377197A (en) | 2012-04-13 | 2012-04-13 | Rich format document processing method and rich format document processing device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103377197A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030042319A1 (en) * | 2001-08-31 | 2003-03-06 | Xerox Corporation | Automatic and semi-automatic index generation for raster documents |
CN101442639A (en) * | 2007-11-23 | 2009-05-27 | 佛山普立华科技有限公司 | System and method for establishing catalog |
CN101533393A (en) * | 2008-03-11 | 2009-09-16 | 深圳市乐天科技有限公司 | Method for quickly classifying and retrieving sentences in article by using electronic device |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107301184A (en) * | 2016-04-14 | 2017-10-27 | 珠海金山办公软件有限公司 | It is a kind of to recognize the method and device that word or file generates catalogue |
WO2020000835A1 (en) * | 2018-06-29 | 2020-01-02 | 天津字节跳动科技有限公司 | Method and device for automatically displaying document directory |
US11347930B2 (en) | 2018-06-29 | 2022-05-31 | Tianjin Bytedance Technology Co., Ltd. | Method and apparatus for automatically displaying directory of document |
CN111353296A (en) * | 2020-02-27 | 2020-06-30 | 北京字节跳动网络技术有限公司 | Article processing method and device, electronic equipment and computer-readable storage medium |
CN111353296B (en) * | 2020-02-27 | 2023-07-18 | 抖音视界有限公司 | Article processing method, apparatus, electronic device and computer readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20131030 |