CN103310130A - Text document digital watermark embedding and extracting method - Google Patents

Text document digital watermark embedding and extracting method Download PDF

Info

Publication number
CN103310130A
CN103310130A CN2013102567322A CN201310256732A CN103310130A CN 103310130 A CN103310130 A CN 103310130A CN 2013102567322 A CN2013102567322 A CN 2013102567322A CN 201310256732 A CN201310256732 A CN 201310256732A CN 103310130 A CN103310130 A CN 103310130A
Authority
CN
China
Prior art keywords
watermark
matrix
text document
information
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2013102567322A
Other languages
Chinese (zh)
Other versions
CN103310130B (en
Inventor
张释如
孟晓春
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xian University of Science and Technology
Original Assignee
Xian University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xian University of Science and Technology filed Critical Xian University of Science and Technology
Priority to CN201310256732.2A priority Critical patent/CN103310130B/en
Publication of CN103310130A publication Critical patent/CN103310130A/en
Application granted granted Critical
Publication of CN103310130B publication Critical patent/CN103310130B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention provides a text document digital watermark embedding and extracting method. According to the invention, the watermark embedding is characterized in that raw information is made into a binary image; matlab and other programming softwares are adopted for reading out binary matrixes in one-to-one correspondence with the binary image to serve as the watermark information; '0' and '1', the two binary code characters, are written into a font object library with a TrueType coinage program to generate a secret key; a windows 'internal code' input method is adopted to call binary code characters, and the binary matrix is inserted into the text document; finally a hidden text function of word processing software is adopted for hiding the imbedded binary matrix, so that the watermark embedding is completed. According to the invention, through raw information conversion and application of the secret key, the raw information is better hidden; meanwhile, due to hiding of the watermark information, an interpolator can not easily find the watermark, so that the preventive performance is improved. Therefore, the security of the raw information is greatly enhanced.

Description

Text document embedding algorithm and extracting method
Technical field
The present invention relates to a kind of digital watermarking embedding and extracting method, particularly a kind of text document embedding algorithm and extracting method.It is mainly used in the copyright protection of text document, anti-tamper and secret communication etc.
Background technology
In prior art, Chinese patent 201110396024.X provides " carrying out the method for secret communication with the word document ", and wherein the first step of watermark embedding is the fontlib that raw information is write the TrueType EUDC Editor as watermark information, and produces key; Second step is that watermark information is embedded in the word document, then each the watermark character in the watermark information is stacked on normal character.This patent is provided with the twice defence line to watermark information: first is key, as long as namely do not get key, the watermark character just can not show, and plays conclusive preventing effect; Second is that character stacks, and that is to say, has both made to take key the watermark character is shown, see also to be the watermark character stack together situation with normal character, almost illegible extracts raw information and establishes obstacle one again thereby give.
The existing deficiency of above-mentioned patent is: although it is provided with the twice defence line, but in fact be one defence line of key, that is to say, in case key is broken, the second defence line just is easy to be broken, and almost can not play the preventing effect, because in case key is broken, just can directly see the existence of " watermark character " during opening document, and launch to stack in this word processor to realize, be easy to so launch it.Moreover, because the stacking of font, " being difficult for deletion " or " insertions " can when editor, occur undesiredly to wait operation, this is with regard to so that come the guilty party very easily to discover " watermark is arranged " existence, thereby has reduced anti-losing one's temper; Simultaneously, watermark information is exactly raw information, as long as launch just to stack and can directly find out raw information, therefore, confidentiality is not high enough.
Summary of the invention
The objective of the invention is for the deficiency that exists in the prior art, a kind of text document embedding algorithm and extracting method are provided, to strengthen the confidentiality of raw information.
For achieving the above object, technical scheme of the present invention is as follows:
⑴ make watermark:
Raw information is made into a value image with the UNIVERSAL DRAWING tool software, reads this bianry image with matlab, C, C Plus Plus programming software again, obtain one with this bianry image corresponding " 01 " matrix one by one, as watermark information;
⑵ produce watermark keys: " 0 " and " 1 " two two-value code characters are write respectively the fontlib of TrueType EUDC Editor, and produce key;
⑶ watermark embeds: open a text document, the windows input method is switched to " ISN " input method, ⑴ " 01 " matrix that is gone on foot gained is inserted in this text document of opening, use again " hidden text " function in the word processor to hide " 01 " matrix that inserts, namely obtain the text document of an embed watermark information;
⑷ extract watermark: the key that ⑵ is gone on foot gained copy to extract computer that watermark information uses C: window in the fonts catalogue, open the text document that the ⑶ step obtains, hide with the cancellation of " hidden text " function in word processor literal, " 01 " matrix of then being hidden namely presents, recover to be somebody's turn to do the corresponding bianry image of " 01 " matrix with matlab, C, C Plus Plus programming software again, namely obtain raw information.
The present invention's characteristics compared with the prior art are as follows:
One, the first step of the present invention changes into raw information the two values matrix of corresponding with it bianry image, as in the watermark information embedded text document, gives good hiding to raw information with this two values matrix, and the security of raw information is improved greatly.
Two, second step of the present invention still borrows the TrueType EUDC Editor to produce key, and only need two two-value code characters are write the fontlib of TrueType EUDC Editor, and not only make watermark information have invisibility, safe, simultaneously, the character that writes than existing patent lacks, also simply too much.
Three, the 3rd step of the present invention is to stash with " hidden text " function of word processor two values matrix with embedding, this being hidden in when editing can not be stayed any vestige, that is to say, there is not when the text document of embed watermark is edited deviant circumstance to occur, the interpolater is difficult to discover the existence of " watermark ", and the anti-losing one's temper of hiding than existing patent " stacking " is high.
Four, the three lines of defence that arranges of the present invention, first are the hiding of watermark information of invisibility, the second of key, and the 3rd road is the conversion of watermark information.At first key is difficult to break through, and in a single day key is broken, and can not directly see watermark information, has both made and has seen watermark information, can not directly see raw information, therefore, the three lines of defence is all linked with one another, all plays preferably preventing effect, the confidentiality that has greatly strengthened.
Description of drawings
Fig. 1 is a bianry image in the embodiment of the invention.
Fig. 2 is the two values matrix corresponding with Fig. 1 bianry image.
Fig. 3 is common this paper document.
Fig. 4 is this paper document behind the embed watermark information.
Embodiment
The below illustrates concrete steps of the present invention with reference to the accompanying drawings.
Concrete steps of the present invention are as follows:
⑴ make watermark:
Raw information is made into a value image with UNIVERSAL DRAWING tool software (such as drawing board in photoshop, the windows annex etc.), as shown in Figure 1 (this bianry image can be the types such as bianry image, bitmap images or GRB), read this bianry image with matlab, C, C Plus Plus programming software again, obtain one with this bianry image " 01 " matrix one to one, as shown in Figure 2, be somebody's turn to do " 01 " matrix as watermark information.
⑵ produce watermark keys: " 0 " and " 1 " two two-value code characters are write respectively the fontlib of TrueType EUDC Editor, and produce key; The concrete operations of this step can be referring to the first step among the Chinese patent ZL201110396024.X.
⑶ watermark embeds: (text document can be word document or WPS document etc. to open a text document as shown in Figure 3, and can be with picture in the selected text document, form and formula etc.), the windows input method is switched to " ISN " input method, ⑴ " 01 " matrix that is gone on foot gained is inserted in this text document of opening, as shown in Figure 4, this example is inserted in every delegation " 01 " matrix in each sentence successively, be convenient to like this restore " 01 " matrix, and then " 01 " matrix that hide to insert with " hidden text " function in the word processor, namely obtain the text document of an embed watermark information, on the surface, the document of this embed watermark information is finished identical with urtext document shown in Figure 3;
The concrete operations of described hidden text are: take the word document as example, then the literal that selection will be hidden clicks the Font menu, clicks " hidden text " again, determines to get final product.
⑷ extract watermark: the key that ⑵ is gone on foot gained copy to extract computer that watermark information uses C: window in the fonts catalogue, open the text document that the ⑶ step obtains, hiding with the cancellation of " hidden text " function in word processor literal (is full selection word, eliminate again " hidden text "), " 01 " matrix of then being hidden namely presents, recover to be somebody's turn to do the corresponding bianry image of " 01 " matrix with matlab, C, C Plus Plus programming software again, namely obtain raw information.

Claims (1)

1. a text document embedding algorithm and extracting method, its step is as follows:
⑴ make watermark:
Raw information is made into a value image with the UNIVERSAL DRAWING tool software, reads this bianry image with matlab, C, C Plus Plus programming software again, obtain one with this bianry image corresponding " 01 " matrix one by one, as watermark information;
⑵ produce watermark keys: " 0 " and " 1 " two two-value code characters are write respectively the fontlib of TrueType EUDC Editor, and produce key;
⑶ watermark embeds: open a text document, the windows input method is switched to " ISN " input method, ⑴ " 01 " matrix that is gone on foot gained is inserted in the text document of opening, use again " hidden text " function in the word processor to hide " 01 " matrix that inserts, namely obtain the text document of an embed watermark information;
⑷ extract watermark: the key that ⑵ is gone on foot gained copy to extract computer that watermark information uses C: window in the fonts catalogue, open the text document that the ⑶ step obtains, hide with the cancellation of " hidden text " function in word processor literal, " 01 " matrix of then being hidden namely presents, recover to be somebody's turn to do the corresponding bianry image of " 01 " matrix with matlab, C, C Plus Plus programming software again, namely obtain raw information.
CN201310256732.2A 2013-06-25 2013-06-25 Text document embedding algorithm and extracting method Expired - Fee Related CN103310130B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310256732.2A CN103310130B (en) 2013-06-25 2013-06-25 Text document embedding algorithm and extracting method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310256732.2A CN103310130B (en) 2013-06-25 2013-06-25 Text document embedding algorithm and extracting method

Publications (2)

Publication Number Publication Date
CN103310130A true CN103310130A (en) 2013-09-18
CN103310130B CN103310130B (en) 2016-03-30

Family

ID=49135339

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310256732.2A Expired - Fee Related CN103310130B (en) 2013-06-25 2013-06-25 Text document embedding algorithm and extracting method

Country Status (1)

Country Link
CN (1) CN103310130B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104850765A (en) * 2014-02-19 2015-08-19 中国移动通信集团福建有限公司 Watermark processing method, device and system
CN110457874A (en) * 2019-08-02 2019-11-15 中国工商银行股份有限公司 A kind of information write-in method, information-reading method and device
CN110874456A (en) * 2018-08-31 2020-03-10 浙江大学 Watermark embedding method, watermark extracting method, watermark embedding device, watermark extracting device and data processing method
CN116956168A (en) * 2023-09-18 2023-10-27 云粒智慧科技有限公司 Text watermark processing method and device, electronic equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7760903B2 (en) * 2005-08-30 2010-07-20 Microsoft Corporation Tamper-resistant text stream watermarking
US8014557B2 (en) * 2003-06-23 2011-09-06 Digimarc Corporation Watermarking electronic text documents
CN102385680A (en) * 2011-12-05 2012-03-21 西安科技大学 Tamperproof protective method for Word file
CN102393892A (en) * 2011-12-05 2012-03-28 西安科技大学 Word document copyright protection method
CN102495991A (en) * 2011-12-05 2012-06-13 西安科技大学 Method for carrying out secret communication by using word document

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8014557B2 (en) * 2003-06-23 2011-09-06 Digimarc Corporation Watermarking electronic text documents
US7760903B2 (en) * 2005-08-30 2010-07-20 Microsoft Corporation Tamper-resistant text stream watermarking
CN102385680A (en) * 2011-12-05 2012-03-21 西安科技大学 Tamperproof protective method for Word file
CN102393892A (en) * 2011-12-05 2012-03-28 西安科技大学 Word document copyright protection method
CN102495991A (en) * 2011-12-05 2012-06-13 西安科技大学 Method for carrying out secret communication by using word document

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
李向辉: "基于Word文本文档的信息隐藏方法研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》, 15 December 2006 (2006-12-15) *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104850765A (en) * 2014-02-19 2015-08-19 中国移动通信集团福建有限公司 Watermark processing method, device and system
CN110874456A (en) * 2018-08-31 2020-03-10 浙江大学 Watermark embedding method, watermark extracting method, watermark embedding device, watermark extracting device and data processing method
CN110457874A (en) * 2019-08-02 2019-11-15 中国工商银行股份有限公司 A kind of information write-in method, information-reading method and device
CN116956168A (en) * 2023-09-18 2023-10-27 云粒智慧科技有限公司 Text watermark processing method and device, electronic equipment and storage medium
CN116956168B (en) * 2023-09-18 2024-01-02 云粒智慧科技有限公司 Text watermark processing method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN103310130B (en) 2016-03-30

Similar Documents

Publication Publication Date Title
Roy et al. A novel approach to format based text steganography
CN100447812C (en) Document data waterprint embedded method
CN102393892B (en) Word document copyright protection method
CN109785222B (en) Method for quickly embedding and extracting information of webpage
CN102881034B (en) A kind of system and method inserting watermark in profile
CN103310130A (en) Text document digital watermark embedding and extracting method
CN102385680B (en) Tamperproof protective method for Word file
CN103745479A (en) Digital steganography and steganalysis method for color image
CN110874456B (en) Watermark embedding method, watermark extracting method, watermark embedding device, watermark extracting device and data processing method
Fei et al. A reversible watermark scheme for 2D vector map based on reversible contrast mapping
CN102968596A (en) Delete marker-based office open xml (OOX) document digital watermarking method
El_Rahman A comprehensive image steganography tool using LSB scheme
Jaiswal et al. Implementation of a new technique for web document protection using unicode
Sasaki et al. Constructing Digital Watermark Based on Orthogonal Functions
Malalla et al. A novel approach for Arabic text steganography based on the “BloodGroup” text hiding method
Shah et al. Text steganography using character spacing after normalization
Rui et al. A multiple watermarking algorithm for texts mixed Chinese and English
Cheng et al. A robust text digital watermarking algorithm based on fragments regrouping strategy
Zhang et al. Chinese text watermarking based on occlusive components
Kumar et al. A Multilayered architecture for hiding executable files in 3D images
CN102495991B (en) Method for carrying out secret communication by using word document
Reddy et al. A Novel Approach for Hiding Encrypted Data in Image, Audio and Video using Steganography
Saber et al. Steganography in MS excel document using unicode system characteristics
Zhang et al. Hiding new words in a PDF document
GB2339656A (en) Electronic text watermarking

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160330

Termination date: 20180625