CN103310130B - Text document embedding algorithm and extracting method - Google Patents
Text document embedding algorithm and extracting method Download PDFInfo
- Publication number
- CN103310130B CN103310130B CN201310256732.2A CN201310256732A CN103310130B CN 103310130 B CN103310130 B CN 103310130B CN 201310256732 A CN201310256732 A CN 201310256732A CN 103310130 B CN103310130 B CN 103310130B
- Authority
- CN
- China
- Prior art keywords
- watermark
- text document
- matrix
- information
- text
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Abstract
The present invention is a kind of text document embedding algorithm and extracting method.Wherein watermark embedment raw information is made a bianry image, and with the programming software such as matlab read with its one to one two values matrix as watermark information, again the fontlib of " 0 " and " 1 " two two-value code character write TrueType EUDC Editor is produced key, then using " ISN " input method of windows to transfer two-value code character is inserted in text document by above-mentioned two values matrix, finally use " hidden text " function of word processor to hide the two values matrix of embedding again, namely complete watermark embedment.By to the conversion of raw information and the use of key, make raw information have good hiding, meanwhile, make again interpolater be difficult to discover the existence of " watermark " to hiding of watermark information, anti-losing one's temper is high.Therefore, the confidentiality of raw information is greatly strengthened.
Description
Technical field
The present invention relates to a kind of digital watermark embedding and extracting method, particularly a kind of text document embedding algorithm and extracting method.It is mainly used in the copyright protection of text document, anti-tamper and secret communication etc.
Background technology
In prior art, Chinese patent 201110396024.X provides " carrying out the method for secret communication by word document ", and wherein the first step of watermark embedment is using the fontlib of raw information as watermark information write TrueType EUDC Editor, and produces key; Second step is embedded in word document by watermark information, then stacked on normal character by each watermark character in watermark information.This patent is provided with twice defence line to watermark information: first is key, as long as namely do not get key, watermark character just can not show, and plays conclusive preventing effect; Second is that character stacks, and that is, has both made to take key by watermark Charactes Display out, and what see is also situation together with watermark character stacks with normal character, is difficult to identification, thus gives and extract raw information and establish one obstacle again.
Deficiency existing for above-mentioned patent is: although it is provided with twice defence line, but in fact just key one defence line, that is, once key is broken, second defence line is just easy to be broken, and almost can not play preventing effect, because once key is broken, just directly can see the existence of " watermark character " during opening document, and launch to stack to realize in this word processor, be easy to so launch it.Moreover due to stacking of font, can occur the operations such as " not easily delete " or " insertions " abnormal when editing, this just makes guilty party very easily to discover " having watermark " existence, thus reduces anti-losing one's temper; Meanwhile, watermark information is exactly raw information, as long as launching to stack just directly can find out raw information, therefore, confidentiality is not high enough.
Summary of the invention
The object of the invention is the deficiency for existing in prior art, a kind of text document embedding algorithm and extracting method are provided, to strengthen the confidentiality of raw information.
For achieving the above object, technical scheme of the present invention is as follows:
(1) make watermark:
The software development of raw information general drafting tools is become a bianry image, then reads this bianry image with matlab, C, C Plus Plus programming software, obtain one and this bianry image one_to_one corresponding " 01 " matrix, as watermark information;
(2) produce watermark keys: the fontlib " 0 " and " 1 " two two-value code characters being write respectively TrueType EUDC Editor, and produce key;
(3) watermark embedment: open a text document, windows input method is switched to " ISN " input method, (1) " 01 " matrix walking gained is inserted in this text document opened, use " hidden text " function in word processor to hide " 01 " matrix of insertion again, namely obtain the text document of an embed watermark information;
(4) extract watermark: by the key copy (2) walking gained to extract watermark information use computer C: window in fonts catalogue, open the text document (3) walking and obtain, cancel word by " hidden text " function in word processor to hide, namely then hidden " 01 " matrix presents, recover to be somebody's turn to do the bianry image corresponding to " 01 " matrix with matlab, C, C Plus Plus programming software again, namely obtain raw information.
The present invention's feature is compared with the prior art as follows:
One, raw information is changed into the two values matrix of bianry image corresponding with it by the first step of the present invention, with this two values matrix as in watermark information embedded text document, gives good hiding to raw information, the security of raw information is improved greatly.
Two, second step of the present invention still borrows TrueType EUDC Editor to produce key, and only need by the fontlib of two two-value code character write TrueType EUDC Editor, not only make watermark information have invisibility, security is high, meanwhile, the character write than existing patent is few, also simply too much.
Three, the two values matrix of embedding stashes by the 3rd step word processor " hidden text " function of the present invention, any vestige can not be stayed during this editor of being hidden in, that is, abnormal situation is not had to occur when editing the text document of embed watermark, interpolater is difficult to the existence discovering " watermark ", and the anti-losing one's temper hidden than existing patent " stacking " is high.
Four, the three lines of defence that arranges of the present invention, first is the invisibility of key, second is watermark information hides, and the 3rd road is the conversion of watermark information.First key is difficult to break through, and key, once be broken, directly can not see watermark information, has both made to see watermark information, directly can not see raw information, and therefore, three lines of defence is all linked with one another, all plays good preventing effect, the confidentiality that greatly strengthen.
Accompanying drawing explanation
Fig. 1 is a bianry image in the embodiment of the present invention.
Fig. 2 is the two values matrix corresponding with Fig. 1 bianry image.
Fig. 3 is common this paper document.
Fig. 4 is this paper document after embed watermark information.
Embodiment
Concrete steps of the present invention are described with reference to the accompanying drawings below.
Concrete steps of the present invention are as follows:
(1) make watermark:
Raw information general drafting tools software (drawing board etc. as in photoshop, windows annex) is made into a bianry image, as shown in Figure 1 (this bianry image can be the types such as bianry image, bitmap images or GRB), this bianry image is read again with matlab, C, C Plus Plus programming software, obtain one and this bianry image " 01 " matrix one to one, as shown in Figure 2, " 01 " matrix is somebody's turn to do as watermark information.
(2) produce watermark keys: the fontlib " 0 " and " 1 " two two-value code characters being write respectively TrueType EUDC Editor, and produce key; The concrete operations of this step can see the first step in Chinese patent ZL201110396024.X.
(3) watermark embedment: (text document can be word document or WPS document etc. to open a text document as shown in Figure 3, and can with picture in selected text document, form and formula etc.), windows input method is switched to " ISN " input method, (1) " 01 " matrix walking gained is inserted in this text document opened, as shown in Figure 4, every a line " 01 " matrix is inserted in each sentence by this example successively, be convenient to like this restore " 01 " matrix, and then " 01 " matrix of insertion is hidden by " hidden text " function in word processor, namely the text document of an embed watermark information is obtained, on the surface, the document of this embed watermark information is identical with the urtext document shown in Fig. 3,
The concrete operations of described hidden text are: for word document, select the word that will hide, then click the Font menu, then click " hidden text ", determine.
(4) extract watermark: by the key copy (2) walking gained to extract watermark information use computer C: window in fonts catalogue, open the text document (3) walking and obtain, cancel word by " hidden text " function in word processor and hide (i.e. full selection word, eliminate again " hidden text "), namely then hidden " 01 " matrix presents, recover to be somebody's turn to do the bianry image corresponding to " 01 " matrix with matlab, C, C Plus Plus programming software again, namely obtain raw information.
Claims (1)
1. text document embedding algorithm and an extracting method, its step is as follows:
(1) make watermark:
The software development of raw information general drafting tools is become a bianry image, then reads this bianry image with matlab, C, C Plus Plus programming software, obtain one and this bianry image one_to_one corresponding " 01 " matrix, as watermark information;
(2) produce watermark keys: the fontlib " 0 " and " 1 " two two-value code characters being write respectively TrueType EUDC Editor, and produce key;
(3) watermark embedment: open a text document, windows input method is switched to " ISN " input method, (1) " 01 " matrix walking gained is inserted in the text document opened, use " hidden text " function in word processor to hide " 01 " matrix of insertion again, namely obtain the text document of an embed watermark information;
(4) extract watermark: by the key copy (2) walking gained to extract watermark information use computer C: window in fonts catalogue, open the text document (3) walking and obtain, cancel word by " hidden text " function in word processor to hide, namely then hidden " 01 " matrix presents, recover to be somebody's turn to do the bianry image corresponding to " 01 " matrix with matlab, C, C Plus Plus programming software again, namely obtain raw information.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310256732.2A CN103310130B (en) | 2013-06-25 | 2013-06-25 | Text document embedding algorithm and extracting method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310256732.2A CN103310130B (en) | 2013-06-25 | 2013-06-25 | Text document embedding algorithm and extracting method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103310130A CN103310130A (en) | 2013-09-18 |
CN103310130B true CN103310130B (en) | 2016-03-30 |
Family
ID=49135339
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310256732.2A Expired - Fee Related CN103310130B (en) | 2013-06-25 | 2013-06-25 | Text document embedding algorithm and extracting method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103310130B (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104850765A (en) * | 2014-02-19 | 2015-08-19 | 中国移动通信集团福建有限公司 | Watermark processing method, device and system |
CN110874456B (en) * | 2018-08-31 | 2022-04-26 | 浙江大学 | Watermark embedding method, watermark extracting method, watermark embedding device, watermark extracting device and data processing method |
CN110457874A (en) * | 2019-08-02 | 2019-11-15 | 中国工商银行股份有限公司 | A kind of information write-in method, information-reading method and device |
CN116956168B (en) * | 2023-09-18 | 2024-01-02 | 云粒智慧科技有限公司 | Text watermark processing method and device, electronic equipment and storage medium |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7760903B2 (en) * | 2005-08-30 | 2010-07-20 | Microsoft Corporation | Tamper-resistant text stream watermarking |
US8014557B2 (en) * | 2003-06-23 | 2011-09-06 | Digimarc Corporation | Watermarking electronic text documents |
CN102385680A (en) * | 2011-12-05 | 2012-03-21 | 西安科技大学 | Tamperproof protective method for Word file |
CN102393892A (en) * | 2011-12-05 | 2012-03-28 | 西安科技大学 | Word document copyright protection method |
CN102495991A (en) * | 2011-12-05 | 2012-06-13 | 西安科技大学 | Method for carrying out secret communication by using word document |
-
2013
- 2013-06-25 CN CN201310256732.2A patent/CN103310130B/en not_active Expired - Fee Related
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8014557B2 (en) * | 2003-06-23 | 2011-09-06 | Digimarc Corporation | Watermarking electronic text documents |
US7760903B2 (en) * | 2005-08-30 | 2010-07-20 | Microsoft Corporation | Tamper-resistant text stream watermarking |
CN102385680A (en) * | 2011-12-05 | 2012-03-21 | 西安科技大学 | Tamperproof protective method for Word file |
CN102393892A (en) * | 2011-12-05 | 2012-03-28 | 西安科技大学 | Word document copyright protection method |
CN102495991A (en) * | 2011-12-05 | 2012-06-13 | 西安科技大学 | Method for carrying out secret communication by using word document |
Non-Patent Citations (1)
Title |
---|
基于Word文本文档的信息隐藏方法研究;李向辉;《中国优秀硕士学位论文全文数据库 信息科技辑》;20061215;第9-10、40-41页 * |
Also Published As
Publication number | Publication date |
---|---|
CN103310130A (en) | 2013-09-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7240209B2 (en) | Methods of invisibly embedding and hiding data into soft-copy text documents | |
Roy et al. | A novel approach to format based text steganography | |
CN100447812C (en) | Document data waterprint embedded method | |
Singh et al. | A novel approach of text steganography based on null spaces | |
CN102393892B (en) | Word document copyright protection method | |
CN103310130B (en) | Text document embedding algorithm and extracting method | |
CN109785222B (en) | Method for quickly embedding and extracting information of webpage | |
Majeed et al. | AN IMPROVED LSB IMAGE STEGANOGRAPHY TECHNIQUE USING BIT-INVERSE IN 24 BIT COLOUR IMAGE. | |
CN104517045A (en) | Method for creating protected digital file | |
CN103745479A (en) | Digital steganography and steganalysis method for color image | |
CN103559251A (en) | Data security protection method based on data hiding | |
Rafat et al. | Secure digital steganography for ASCII text documents | |
Kaur et al. | Encryption based LSB steganography technique for digital images and text data | |
CN102385680A (en) | Tamperproof protective method for Word file | |
Malalla et al. | A novel approach for Arabic text steganography based on the “BloodGroup” text hiding method | |
Kavitha et al. | Lossless steganography on avi file using swapping algorithm | |
Sharma et al. | A study of steganography based data hiding techniques | |
Cheng et al. | A robust text digital watermarking algorithm based on fragments regrouping strategy | |
CN102495991B (en) | Method for carrying out secret communication by using word document | |
Saber et al. | Steganography in MS excel document using unicode system characteristics | |
Reddy et al. | A Novel Approach for Hiding Encrypted Data in Image, Audio and Video using Steganography | |
Khan et al. | Polish text steganography method using letter points and extension | |
CN105208462A (en) | Video digital watermarking processing method and video digital watermarking system executing same | |
Pascaline et al. | Using photomosaic and steganographic techniques for hiding information inside image mosaics | |
Bajpai et al. | Techniques of steganography for securing information: A survey |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20160330 Termination date: 20180625 |