CN103310130B - Text document embedding algorithm and extracting method - Google Patents

Text document embedding algorithm and extracting method Download PDF

Info

Publication number
CN103310130B
CN103310130B CN201310256732.2A CN201310256732A CN103310130B CN 103310130 B CN103310130 B CN 103310130B CN 201310256732 A CN201310256732 A CN 201310256732A CN 103310130 B CN103310130 B CN 103310130B
Authority
CN
China
Prior art keywords
watermark
text document
matrix
information
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201310256732.2A
Other languages
Chinese (zh)
Other versions
CN103310130A (en
Inventor
张释如
孟晓春
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xian University of Science and Technology
Original Assignee
Xian University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xian University of Science and Technology filed Critical Xian University of Science and Technology
Priority to CN201310256732.2A priority Critical patent/CN103310130B/en
Publication of CN103310130A publication Critical patent/CN103310130A/en
Application granted granted Critical
Publication of CN103310130B publication Critical patent/CN103310130B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The present invention is a kind of text document embedding algorithm and extracting method.Wherein watermark embedment raw information is made a bianry image, and with the programming software such as matlab read with its one to one two values matrix as watermark information, again the fontlib of " 0 " and " 1 " two two-value code character write TrueType EUDC Editor is produced key, then using " ISN " input method of windows to transfer two-value code character is inserted in text document by above-mentioned two values matrix, finally use " hidden text " function of word processor to hide the two values matrix of embedding again, namely complete watermark embedment.By to the conversion of raw information and the use of key, make raw information have good hiding, meanwhile, make again interpolater be difficult to discover the existence of " watermark " to hiding of watermark information, anti-losing one's temper is high.Therefore, the confidentiality of raw information is greatly strengthened.

Description

Text document embedding algorithm and extracting method
Technical field
The present invention relates to a kind of digital watermark embedding and extracting method, particularly a kind of text document embedding algorithm and extracting method.It is mainly used in the copyright protection of text document, anti-tamper and secret communication etc.
Background technology
In prior art, Chinese patent 201110396024.X provides " carrying out the method for secret communication by word document ", and wherein the first step of watermark embedment is using the fontlib of raw information as watermark information write TrueType EUDC Editor, and produces key; Second step is embedded in word document by watermark information, then stacked on normal character by each watermark character in watermark information.This patent is provided with twice defence line to watermark information: first is key, as long as namely do not get key, watermark character just can not show, and plays conclusive preventing effect; Second is that character stacks, and that is, has both made to take key by watermark Charactes Display out, and what see is also situation together with watermark character stacks with normal character, is difficult to identification, thus gives and extract raw information and establish one obstacle again.
Deficiency existing for above-mentioned patent is: although it is provided with twice defence line, but in fact just key one defence line, that is, once key is broken, second defence line is just easy to be broken, and almost can not play preventing effect, because once key is broken, just directly can see the existence of " watermark character " during opening document, and launch to stack to realize in this word processor, be easy to so launch it.Moreover due to stacking of font, can occur the operations such as " not easily delete " or " insertions " abnormal when editing, this just makes guilty party very easily to discover " having watermark " existence, thus reduces anti-losing one's temper; Meanwhile, watermark information is exactly raw information, as long as launching to stack just directly can find out raw information, therefore, confidentiality is not high enough.
Summary of the invention
The object of the invention is the deficiency for existing in prior art, a kind of text document embedding algorithm and extracting method are provided, to strengthen the confidentiality of raw information.
For achieving the above object, technical scheme of the present invention is as follows:
(1) make watermark:
The software development of raw information general drafting tools is become a bianry image, then reads this bianry image with matlab, C, C Plus Plus programming software, obtain one and this bianry image one_to_one corresponding " 01 " matrix, as watermark information;
(2) produce watermark keys: the fontlib " 0 " and " 1 " two two-value code characters being write respectively TrueType EUDC Editor, and produce key;
(3) watermark embedment: open a text document, windows input method is switched to " ISN " input method, (1) " 01 " matrix walking gained is inserted in this text document opened, use " hidden text " function in word processor to hide " 01 " matrix of insertion again, namely obtain the text document of an embed watermark information;
(4) extract watermark: by the key copy (2) walking gained to extract watermark information use computer C: window in fonts catalogue, open the text document (3) walking and obtain, cancel word by " hidden text " function in word processor to hide, namely then hidden " 01 " matrix presents, recover to be somebody's turn to do the bianry image corresponding to " 01 " matrix with matlab, C, C Plus Plus programming software again, namely obtain raw information.
The present invention's feature is compared with the prior art as follows:
One, raw information is changed into the two values matrix of bianry image corresponding with it by the first step of the present invention, with this two values matrix as in watermark information embedded text document, gives good hiding to raw information, the security of raw information is improved greatly.
Two, second step of the present invention still borrows TrueType EUDC Editor to produce key, and only need by the fontlib of two two-value code character write TrueType EUDC Editor, not only make watermark information have invisibility, security is high, meanwhile, the character write than existing patent is few, also simply too much.
Three, the two values matrix of embedding stashes by the 3rd step word processor " hidden text " function of the present invention, any vestige can not be stayed during this editor of being hidden in, that is, abnormal situation is not had to occur when editing the text document of embed watermark, interpolater is difficult to the existence discovering " watermark ", and the anti-losing one's temper hidden than existing patent " stacking " is high.
Four, the three lines of defence that arranges of the present invention, first is the invisibility of key, second is watermark information hides, and the 3rd road is the conversion of watermark information.First key is difficult to break through, and key, once be broken, directly can not see watermark information, has both made to see watermark information, directly can not see raw information, and therefore, three lines of defence is all linked with one another, all plays good preventing effect, the confidentiality that greatly strengthen.
Accompanying drawing explanation
Fig. 1 is a bianry image in the embodiment of the present invention.
Fig. 2 is the two values matrix corresponding with Fig. 1 bianry image.
Fig. 3 is common this paper document.
Fig. 4 is this paper document after embed watermark information.
Embodiment
Concrete steps of the present invention are described with reference to the accompanying drawings below.
Concrete steps of the present invention are as follows:
(1) make watermark:
Raw information general drafting tools software (drawing board etc. as in photoshop, windows annex) is made into a bianry image, as shown in Figure 1 (this bianry image can be the types such as bianry image, bitmap images or GRB), this bianry image is read again with matlab, C, C Plus Plus programming software, obtain one and this bianry image " 01 " matrix one to one, as shown in Figure 2, " 01 " matrix is somebody's turn to do as watermark information.
(2) produce watermark keys: the fontlib " 0 " and " 1 " two two-value code characters being write respectively TrueType EUDC Editor, and produce key; The concrete operations of this step can see the first step in Chinese patent ZL201110396024.X.
(3) watermark embedment: (text document can be word document or WPS document etc. to open a text document as shown in Figure 3, and can with picture in selected text document, form and formula etc.), windows input method is switched to " ISN " input method, (1) " 01 " matrix walking gained is inserted in this text document opened, as shown in Figure 4, every a line " 01 " matrix is inserted in each sentence by this example successively, be convenient to like this restore " 01 " matrix, and then " 01 " matrix of insertion is hidden by " hidden text " function in word processor, namely the text document of an embed watermark information is obtained, on the surface, the document of this embed watermark information is identical with the urtext document shown in Fig. 3,
The concrete operations of described hidden text are: for word document, select the word that will hide, then click the Font menu, then click " hidden text ", determine.
(4) extract watermark: by the key copy (2) walking gained to extract watermark information use computer C: window in fonts catalogue, open the text document (3) walking and obtain, cancel word by " hidden text " function in word processor and hide (i.e. full selection word, eliminate again " hidden text "), namely then hidden " 01 " matrix presents, recover to be somebody's turn to do the bianry image corresponding to " 01 " matrix with matlab, C, C Plus Plus programming software again, namely obtain raw information.

Claims (1)

1. text document embedding algorithm and an extracting method, its step is as follows:
(1) make watermark:
The software development of raw information general drafting tools is become a bianry image, then reads this bianry image with matlab, C, C Plus Plus programming software, obtain one and this bianry image one_to_one corresponding " 01 " matrix, as watermark information;
(2) produce watermark keys: the fontlib " 0 " and " 1 " two two-value code characters being write respectively TrueType EUDC Editor, and produce key;
(3) watermark embedment: open a text document, windows input method is switched to " ISN " input method, (1) " 01 " matrix walking gained is inserted in the text document opened, use " hidden text " function in word processor to hide " 01 " matrix of insertion again, namely obtain the text document of an embed watermark information;
(4) extract watermark: by the key copy (2) walking gained to extract watermark information use computer C: window in fonts catalogue, open the text document (3) walking and obtain, cancel word by " hidden text " function in word processor to hide, namely then hidden " 01 " matrix presents, recover to be somebody's turn to do the bianry image corresponding to " 01 " matrix with matlab, C, C Plus Plus programming software again, namely obtain raw information.
CN201310256732.2A 2013-06-25 2013-06-25 Text document embedding algorithm and extracting method Expired - Fee Related CN103310130B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310256732.2A CN103310130B (en) 2013-06-25 2013-06-25 Text document embedding algorithm and extracting method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310256732.2A CN103310130B (en) 2013-06-25 2013-06-25 Text document embedding algorithm and extracting method

Publications (2)

Publication Number Publication Date
CN103310130A CN103310130A (en) 2013-09-18
CN103310130B true CN103310130B (en) 2016-03-30

Family

ID=49135339

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310256732.2A Expired - Fee Related CN103310130B (en) 2013-06-25 2013-06-25 Text document embedding algorithm and extracting method

Country Status (1)

Country Link
CN (1) CN103310130B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104850765A (en) * 2014-02-19 2015-08-19 中国移动通信集团福建有限公司 Watermark processing method, device and system
CN110874456B (en) * 2018-08-31 2022-04-26 浙江大学 Watermark embedding method, watermark extracting method, watermark embedding device, watermark extracting device and data processing method
CN110457874A (en) * 2019-08-02 2019-11-15 中国工商银行股份有限公司 A kind of information write-in method, information-reading method and device
CN116956168B (en) * 2023-09-18 2024-01-02 云粒智慧科技有限公司 Text watermark processing method and device, electronic equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7760903B2 (en) * 2005-08-30 2010-07-20 Microsoft Corporation Tamper-resistant text stream watermarking
US8014557B2 (en) * 2003-06-23 2011-09-06 Digimarc Corporation Watermarking electronic text documents
CN102385680A (en) * 2011-12-05 2012-03-21 西安科技大学 Tamperproof protective method for Word file
CN102393892A (en) * 2011-12-05 2012-03-28 西安科技大学 Word document copyright protection method
CN102495991A (en) * 2011-12-05 2012-06-13 西安科技大学 Method for carrying out secret communication by using word document

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8014557B2 (en) * 2003-06-23 2011-09-06 Digimarc Corporation Watermarking electronic text documents
US7760903B2 (en) * 2005-08-30 2010-07-20 Microsoft Corporation Tamper-resistant text stream watermarking
CN102385680A (en) * 2011-12-05 2012-03-21 西安科技大学 Tamperproof protective method for Word file
CN102393892A (en) * 2011-12-05 2012-03-28 西安科技大学 Word document copyright protection method
CN102495991A (en) * 2011-12-05 2012-06-13 西安科技大学 Method for carrying out secret communication by using word document

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
基于Word文本文档的信息隐藏方法研究;李向辉;《中国优秀硕士学位论文全文数据库 信息科技辑》;20061215;第9-10、40-41页 *

Also Published As

Publication number Publication date
CN103310130A (en) 2013-09-18

Similar Documents

Publication Publication Date Title
US7240209B2 (en) Methods of invisibly embedding and hiding data into soft-copy text documents
Roy et al. A novel approach to format based text steganography
CN100447812C (en) Document data waterprint embedded method
Singh et al. A novel approach of text steganography based on null spaces
CN102393892B (en) Word document copyright protection method
CN103310130B (en) Text document embedding algorithm and extracting method
CN109785222B (en) Method for quickly embedding and extracting information of webpage
Majeed et al. AN IMPROVED LSB IMAGE STEGANOGRAPHY TECHNIQUE USING BIT-INVERSE IN 24 BIT COLOUR IMAGE.
CN104517045A (en) Method for creating protected digital file
CN103745479A (en) Digital steganography and steganalysis method for color image
CN103559251A (en) Data security protection method based on data hiding
Rafat et al. Secure digital steganography for ASCII text documents
Kaur et al. Encryption based LSB steganography technique for digital images and text data
CN102385680A (en) Tamperproof protective method for Word file
Malalla et al. A novel approach for Arabic text steganography based on the “BloodGroup” text hiding method
Kavitha et al. Lossless steganography on avi file using swapping algorithm
Sharma et al. A study of steganography based data hiding techniques
Cheng et al. A robust text digital watermarking algorithm based on fragments regrouping strategy
CN102495991B (en) Method for carrying out secret communication by using word document
Saber et al. Steganography in MS excel document using unicode system characteristics
Reddy et al. A Novel Approach for Hiding Encrypted Data in Image, Audio and Video using Steganography
Khan et al. Polish text steganography method using letter points and extension
CN105208462A (en) Video digital watermarking processing method and video digital watermarking system executing same
Pascaline et al. Using photomosaic and steganographic techniques for hiding information inside image mosaics
Bajpai et al. Techniques of steganography for securing information: A survey

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160330

Termination date: 20180625