CN103310130A - Text document digital watermark embedding and extracting method - Google Patents
- Publication number
- CN103310130A
- Authority
- CN
- China
- Prior art keywords
- watermark
- matrix
- text document
- information
- text
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Abstract
The invention provides a method for embedding and extracting a digital watermark in a text document. To embed the watermark, the original information is made into a binary image; programming software such as MATLAB reads out the binary matrix that corresponds one-to-one with the binary image, and this matrix serves as the watermark information; the two binary code characters '0' and '1' are written into the font library of the TrueType EUDC Editor (the Windows private character editor) to generate a key; the Windows 'internal code' input method is used to call the binary code characters and insert the binary matrix into the text document; finally, the hidden-text function of the word processing software hides the embedded binary matrix, which completes the embedding. Converting the original information and applying the key conceal the original information better; in addition, because the watermark information is hidden, a tamperer cannot easily discover the watermark, which improves resistance to detection. The security of the original information is thereby greatly enhanced.
Description
Technical field
The present invention relates to a method for embedding and extracting digital watermarks, and in particular to a method for embedding and extracting a digital watermark in a text document. It is mainly used for copyright protection of text documents, tamper resistance, and secret communication.
Background art
In the prior art, Chinese patent 201110396024.X provides a "method for carrying out secret communication by using a Word document". Its first watermark-embedding step writes the original information, as the watermark information, into the font library of the TrueType EUDC Editor and generates a key; its second step embeds the watermark information in a Word document and then stacks each watermark character on a normal character. That patent sets up two lines of defence for the watermark information. The first is the key: without the key the watermark characters cannot be displayed, so the key provides the decisive protection. The second is character stacking: even if the key is obtained and the watermark characters are displayed, they appear stacked on normal characters and are almost illegible, which places a further obstacle in the way of extracting the original information.
The deficiency of that patent is that, although it sets up two lines of defence, in practice only the key provides protection. Once the key is broken, the second line is easily broken as well and offers almost no protection: the "watermark characters" are directly visible when the document is opened, and because the stacking is done within the word processor, it is easy to unstack them there. Moreover, the stacked characters cause abnormal behaviour during editing, such as characters that are hard to delete or insert, so an intruder can easily notice that a watermark is present, which reduces resistance to detection. In addition, the watermark information is the original information itself, so once the stacking is undone the original information can be read directly; confidentiality is therefore not high enough.
Summary of the invention
The object of the invention is to address the deficiencies of the prior art by providing a method for embedding and extracting a digital watermark in a text document, so as to strengthen the confidentiality of the original information.
To achieve this object, the technical scheme of the invention is as follows:
⑴ Make the watermark:
The original information is made into a binary image with general-purpose drawing software; the binary image is then read with programming software such as MATLAB, C, or C++ to obtain a "01" matrix in one-to-one correspondence with the image, which serves as the watermark information;
⑵ Generate the watermark key: the two binary code characters "0" and "1" are each written into the font library of the TrueType EUDC Editor, and a key is generated;
⑶ Embed the watermark: open a text document, switch the Windows input method to the "internal code" input method, and insert the "01" matrix obtained in step ⑴ into the open document; then hide the inserted "01" matrix with the "hidden text" function of the word processor, which yields a text document with embedded watermark information;
⑷ Extract the watermark: copy the key obtained in step ⑵ into the C:\Windows\Fonts directory of the computer used for extraction, open the text document obtained in step ⑶, and cancel the hiding with the word processor's "hidden text" function so that the hidden "01" matrix is displayed; then restore the binary image corresponding to the "01" matrix with programming software such as MATLAB, C, or C++ to obtain the original information.
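The conversion between a binary image and its "01" matrix, used in steps ⑴ and ⑷, can be sketched as follows. This is a minimal illustration, not the patent's implementation: the patent names MATLAB, C, and C++ but gives no code, so Python is used here, and the plain-text PBM (P1) file format, the function names, and the sample image are all assumptions chosen to keep the example self-contained.

```python
def pbm_to_matrix(pbm_text):
    """Parse a plain-text PBM (P1) bilevel image into a list of 0/1 rows."""
    # Drop comments (from '#' to end of line), then split into tokens.
    tokens = []
    for line in pbm_text.splitlines():
        tokens.extend(line.split("#", 1)[0].split())
    if not tokens or tokens[0] != "P1":
        raise ValueError("not a plain PBM (P1) image")
    width, height = int(tokens[1]), int(tokens[2])
    # Pixels may be written with or without whitespace between them.
    pixels = "".join(tokens[3:])
    if any(ch not in "01" for ch in pixels):
        raise ValueError("pixels must be 0 or 1")
    if len(pixels) != width * height:
        raise ValueError("pixel count does not match header")
    bits = [int(ch) for ch in pixels]
    return [bits[r * width:(r + 1) * width] for r in range(height)]


def matrix_to_pbm(matrix):
    """Rebuild a plain-text PBM (P1) image from a 0/1 matrix."""
    height = len(matrix)
    width = len(matrix[0]) if matrix else 0
    lines = ["P1", f"{width} {height}"]
    lines += [" ".join(str(b) for b in row) for row in matrix]
    return "\n".join(lines) + "\n"


def matrix_to_rows(matrix):
    """Render each matrix row as a '01' string, the form typed into the document."""
    return ["".join(str(b) for b in row) for row in matrix]


# Hypothetical 5x3 sample image (illustrative only, not taken from the patent).
SAMPLE_PBM = """P1
# tiny test pattern
5 3
0 1 1 1 0
0 0 1 0 0
0 0 1 0 0
"""

watermark = pbm_to_matrix(SAMPLE_PBM)    # step (1): image -> "01" matrix
rows = matrix_to_rows(watermark)         # rows ready for insertion
restored = pbm_to_matrix(matrix_to_pbm(watermark))  # step (4): matrix -> image
```

Because the matrix corresponds one-to-one with the image, converting the matrix back to an image and parsing it again returns the same matrix; the key and the hidden-text steps are outside the scope of this sketch.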
Compared with the prior art, the invention has the following characteristics:
1. The first step converts the original information into the binary matrix of its corresponding binary image and embeds that matrix in the text document as the watermark information. This conceals the original information well and greatly improves its security.
2. The second step still uses the TrueType EUDC Editor to generate the key, but only the two binary code characters need to be written into its font library. This keeps the watermark information invisible and secure, and it requires far fewer characters than the existing patent, which makes it much simpler.
3. The third step hides the embedded binary matrix with the word processor's "hidden text" function. This hiding leaves no trace during editing: nothing abnormal occurs when the watermarked document is edited, so a tamperer is unlikely to notice the watermark, and resistance to detection is higher than with the "stacking" used in the existing patent.
4. The invention sets up three lines of defence: the first is the invisibility provided by the key, the second is the hiding of the watermark information, and the third is the conversion of the watermark information. The key is hard to break in the first place; if the key is broken, the watermark information still cannot be seen directly; and even if the watermark information is seen, the original information cannot be read directly. The three lines of defence are thus interlinked, each provides good protection, and confidentiality is greatly strengthened.
Description of drawings
Fig. 1 is a binary image in an embodiment of the invention.
Fig. 2 is the binary matrix corresponding to the binary image of Fig. 1.
Fig. 3 is an ordinary text document.
Fig. 4 is the text document after the watermark information has been embedded.
Embodiment
The specific steps of the invention are described below with reference to the accompanying drawings:
⑴ Make the watermark:
The original information is made into a binary image with general-purpose drawing software (such as Photoshop or the Paint program in the Windows accessories), as shown in Fig. 1 (the image can be of a type such as binary image, bitmap image, or RGB). The binary image is then read with programming software such as MATLAB, C, or C++ to obtain a "01" matrix in one-to-one correspondence with the image, as shown in Fig. 2. This "01" matrix serves as the watermark information.
⑵ Generate the watermark key: the two binary code characters "0" and "1" are each written into the font library of the TrueType EUDC Editor, and a key is generated. For the specific operations of this step, see the first step of Chinese patent ZL201110396024.X.
⑶ Embed the watermark: open a text document as shown in Fig. 3 (the text document can be a Word document, a WPS document, or similar, and may contain pictures, tables, formulas, and so on), switch the Windows input method to the "internal code" input method, and insert the "01" matrix obtained in step ⑴ into the open document, as shown in Fig. 4. In this example each row of the "01" matrix is inserted in turn into successive sentences, which makes the matrix easy to restore. The inserted "01" matrix is then hidden with the "hidden text" function of the word processor, which yields a text document with embedded watermark information. On the surface, the watermarked document is completely identical to the original text document shown in Fig. 3.
The hidden-text operation is as follows, taking a Word document as an example: select the text to be hidden, click the Font menu, click "hidden text", and confirm.
⑷ Extract the watermark: copy the key obtained in step ⑵ into the C:\Windows\Fonts directory of the computer used for extraction, open the text document obtained in step ⑶, and cancel the hiding with the word processor's "hidden text" function (that is, select all the text and then clear "hidden text") so that the hidden "01" matrix is displayed; then restore the binary image corresponding to the "01" matrix with programming software such as MATLAB, C, or C++ to obtain the original information.
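The row-per-sentence embedding and its reversal can be modelled in a few lines. The sketch below is only an illustration under stated assumptions: it does not touch a real Word or WPS file, it represents the document as a list of `(text, hidden)` runs, and it appends each matrix row after a sentence, whereas the patent says only that each row is inserted into a sentence. The names `embed_rows`, `visible_text`, and `extract_rows` are hypothetical.

```python
def embed_rows(sentences, rows):
    """Return (text, hidden) runs with one '01' row appended per sentence."""
    if len(rows) > len(sentences):
        raise ValueError("need at least one sentence per matrix row")
    runs = []
    for i, sentence in enumerate(sentences):
        runs.append((sentence, False))      # ordinary visible text
        if i < len(rows):
            runs.append((rows[i], True))    # watermark row, marked hidden
    return runs


def visible_text(runs, show_hidden=False):
    """Text as displayed; hidden runs appear only when hiding is cancelled."""
    return "".join(text for text, hidden in runs if show_hidden or not hidden)


def extract_rows(runs):
    """Collect the hidden runs, in order, back into a 0/1 matrix."""
    return [[int(ch) for ch in text] for text, hidden in runs if hidden]


# Hypothetical cover text and watermark rows (illustrative only).
sentences = ["First sentence. ", "Second sentence. ", "Third sentence."]
rows = ["01110", "00100"]

runs = embed_rows(sentences, rows)
shown = visible_text(runs)                    # what a casual reader sees
revealed = visible_text(runs, show_hidden=True)
matrix = extract_rows(runs)
```

With hiding in effect the displayed text equals the original sentences, which mirrors the statement that the watermarked document looks identical to the original; keeping one row per sentence preserves the row boundaries, so the matrix can be rebuilt without knowing its width in advance.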
Claims (1)
1. A method for embedding and extracting a digital watermark in a text document, comprising the following steps:
⑴ Make the watermark:
The original information is made into a binary image with general-purpose drawing software; the binary image is then read with programming software such as MATLAB, C, or C++ to obtain a "01" matrix in one-to-one correspondence with the image, which serves as the watermark information;
⑵ Generate the watermark key: the two binary code characters "0" and "1" are each written into the font library of the TrueType EUDC Editor, and a key is generated;
⑶ Embed the watermark: open a text document, switch the Windows input method to the "internal code" input method, and insert the "01" matrix obtained in step ⑴ into the open document; then hide the inserted "01" matrix with the "hidden text" function of the word processor, which yields a text document with embedded watermark information;
⑷ Extract the watermark: copy the key obtained in step ⑵ into the C:\Windows\Fonts directory of the computer used for extraction, open the text document obtained in step ⑶, and cancel the hiding with the word processor's "hidden text" function so that the hidden "01" matrix is displayed; then restore the binary image corresponding to the "01" matrix with programming software such as MATLAB, C, or C++ to obtain the original information.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310256732.2A CN103310130B (en) | 2013-06-25 | 2013-06-25 | Text document embedding algorithm and extracting method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310256732.2A CN103310130B (en) | 2013-06-25 | 2013-06-25 | Text document embedding algorithm and extracting method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103310130A true CN103310130A (en) | 2013-09-18 |
CN103310130B CN103310130B (en) | 2016-03-30 |
Family
ID=49135339
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310256732.2A Expired - Fee Related CN103310130B (en) | 2013-06-25 | 2013-06-25 | Text document embedding algorithm and extracting method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103310130B (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104850765A (en) * | 2014-02-19 | 2015-08-19 | 中国移动通信集团福建有限公司 | Watermark processing method, device and system |
CN110457874A (en) * | 2019-08-02 | 2019-11-15 | 中国工商银行股份有限公司 | A kind of information write-in method, information-reading method and device |
CN110874456A (en) * | 2018-08-31 | 2020-03-10 | 浙江大学 | Watermark embedding method, watermark extracting method, watermark embedding device, watermark extracting device and data processing method |
CN116956168A (en) * | 2023-09-18 | 2023-10-27 | 云粒智慧科技有限公司 | Text watermark processing method and device, electronic equipment and storage medium |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7760903B2 (en) * | 2005-08-30 | 2010-07-20 | Microsoft Corporation | Tamper-resistant text stream watermarking |
US8014557B2 (en) * | 2003-06-23 | 2011-09-06 | Digimarc Corporation | Watermarking electronic text documents |
CN102385680A (en) * | 2011-12-05 | 2012-03-21 | 西安科技大学 | Tamperproof protective method for Word file |
CN102393892A (en) * | 2011-12-05 | 2012-03-28 | 西安科技大学 | Word document copyright protection method |
CN102495991A (en) * | 2011-12-05 | 2012-06-13 | 西安科技大学 | Method for carrying out secret communication by using word document |
Non-Patent Citations (1)
Title |
---|
李向辉: "基于Word文本文档的信息隐藏方法研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》, 15 December 2006 (2006-12-15) * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104850765A (en) * | 2014-02-19 | 2015-08-19 | 中国移动通信集团福建有限公司 | Watermark processing method, device and system |
CN110874456A (en) * | 2018-08-31 | 2020-03-10 | 浙江大学 | Watermark embedding method, watermark extracting method, watermark embedding device, watermark extracting device and data processing method |
CN110457874A (en) * | 2019-08-02 | 2019-11-15 | 中国工商银行股份有限公司 | A kind of information write-in method, information-reading method and device |
CN116956168A (en) * | 2023-09-18 | 2023-10-27 | 云粒智慧科技有限公司 | Text watermark processing method and device, electronic equipment and storage medium |
CN116956168B (en) * | 2023-09-18 | 2024-01-02 | 云粒智慧科技有限公司 | Text watermark processing method and device, electronic equipment and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN103310130B (en) | 2016-03-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Roy et al. | A novel approach to format based text steganography | |
CN100447812C (en) | Document data waterprint embedded method | |
CN102393892B (en) | Word document copyright protection method | |
CN109785222B (en) | Method for quickly embedding and extracting information of webpage | |
CN102881034B (en) | A kind of system and method inserting watermark in profile | |
CN103310130A (en) | Text document digital watermark embedding and extracting method | |
CN102385680B (en) | Tamperproof protective method for Word file | |
CN103745479A (en) | Digital steganography and steganalysis method for color image | |
CN110874456B (en) | Watermark embedding method, watermark extracting method, watermark embedding device, watermark extracting device and data processing method | |
Fei et al. | A reversible watermark scheme for 2D vector map based on reversible contrast mapping | |
CN102968596A (en) | Delete marker-based office open xml (OOX) document digital watermarking method | |
El_Rahman | A comprehensive image steganography tool using LSB scheme | |
Jaiswal et al. | Implementation of a new technique for web document protection using unicode | |
Sasaki et al. | Constructing Digital Watermark Based on Orthogonal Functions | |
Malalla et al. | A novel approach for Arabic text steganography based on the “BloodGroup” text hiding method | |
Shah et al. | Text steganography using character spacing after normalization | |
Rui et al. | A multiple watermarking algorithm for texts mixed Chinese and English | |
Cheng et al. | A robust text digital watermarking algorithm based on fragments regrouping strategy | |
Zhang et al. | Chinese text watermarking based on occlusive components | |
Kumar et al. | A Multilayered architecture for hiding executable files in 3D images | |
CN102495991B (en) | Method for carrying out secret communication by using word document | |
Reddy et al. | A Novel Approach for Hiding Encrypted Data in Image, Audio and Video using Steganography | |
Saber et al. | Steganography in MS excel document using unicode system characteristics | |
Zhang et al. | Hiding new words in a PDF document | |
GB2339656A (en) | Electronic text watermarking |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date: 20160330; Termination date: 20180625 |