WO2002015586A9 - Method and system of watermarking digital data using scaled bin encoding and maximum likelihood decoding - Google Patents

Method and system of watermarking digital data using scaled bin encoding and maximum likelihood decoding

Info

Publication number
WO2002015586A9
Authority
WO
WIPO (PCT)
Prior art keywords
watermark
sub
data
coefficients
image
Prior art date
Application number
PCT/IB2001/001447
Other languages
French (fr)
Other versions
WO2002015586A3 (en)
WO2002015586A2 (en)
Inventor
Avraham Levy
Neri Merhav
Original Assignee
Hewlett Packard Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hewlett Packard Co
Priority to AU2001276620A1
Priority to JP2002520566A (JP4226897B2)
Priority to KR10-2003-7002275A (KR20030024880A)
Priority to EP01954279A (EP1310098A2)
Publication of WO2002015586A2
Publication of WO2002015586A3
Publication of WO2002015586A9

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06T - IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T 1/00 - General purpose image data processing
    • G06T 1/0021 - Image watermarking
    • G06T 1/005 - Robust watermarking, e.g. average attack or collusion attack resistant
    • H - ELECTRICITY
    • H04 - ELECTRIC COMMUNICATION TECHNIQUE
    • H04N - PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N 19/46 - Embedding additional information in the video signal during the compression process
    • H04N 19/467 - Embedding additional information in the video signal during the compression process characterised by the embedded information being invisible, e.g. watermarking
    • H - ELECTRICITY
    • H04 - ELECTRIC COMMUNICATION TECHNIQUE
    • H04N - PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N 19/48 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using compressed domain processing techniques other than decoding, e.g. modification of transform coefficients, variable length coding [VLC] data or run-length data
    • H - ELECTRICITY
    • H04 - ELECTRIC COMMUNICATION TECHNIQUE
    • H04N - PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N 19/90 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H - ELECTRICITY
    • H04 - ELECTRIC COMMUNICATION TECHNIQUE
    • H04N - PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 21/00 - Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N 21/20 - Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N 21/23 - Processing of content or additional data; Elementary server operations; Server middleware
    • H04N 21/238 - Interfacing the downstream path of the transmission network, e.g. adapting the transmission rate of a video stream to network bandwidth; Processing of multiplex streams
    • H04N 21/2389 - Multiplex stream processing, e.g. multiplex stream encrypting
    • H04N 21/23892 - Multiplex stream processing, e.g. multiplex stream encrypting involving embedding information at multiplex stream level, e.g. embedding a watermark at packet level
    • H - ELECTRICITY
    • H04 - ELECTRIC COMMUNICATION TECHNIQUE
    • H04N - PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 21/00 - Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N 21/80 - Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N 21/83 - Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N 21/835 - Generation of protective data, e.g. certificates
    • H04N 21/8358 - Generation of protective data, e.g. certificates involving watermark
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06T - IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T 2201/00 - General purpose image data processing
    • G06T 2201/005 - Image watermarking
    • G06T 2201/0052 - Embedding of the watermark in the frequency domain
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06T - IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T 2201/00 - General purpose image data processing
    • G06T 2201/005 - Image watermarking
    • G06T 2201/0065 - Extraction of an embedded watermark; Reliable detection
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06T - IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T 2201/00 - General purpose image data processing
    • G06T 2201/005 - Image watermarking
    • G06T 2201/0083 - Image watermarking whereby only watermarked image required at decoder, e.g. source-based, blind, oblivious

Definitions

  • For each off-diagonal, a pair of uniform discrete mappings (q0, q1) is computed (step 64).
  • round(·) returns the closest integer value to a real input.
  • The message is encoded using Hadamard matrix rows as error correcting code words, which accelerates the probability computations during the ML decoding process (see the sketch following this list).
  • N = nR, where R is an integer.
  • The scaling parameter α is defined as above in terms of β, where β is the ratio between the expected (i.e., worst case) noise power and the covertext signal power.
  • The ratio parameter β is determined experimentally by measuring its value for many images and various noise types; the stronger the noise, the higher the value of β. The maximum noise level is chosen according to how robust the watermarking scheme needs to be, and the corresponding value of β is determined. It should be noted that in another embodiment the parameter β may depend on the index i and hence is not fixed.
  • The scaled coefficients are then mapped using the discrete mappings (q0, q1) determined in step 64, as defined above for scaled bin encoding.
  • The coefficient sequence s in the DCT representation t is replaced with the watermarked coefficient sequence x, and the resulting watermarked transform representation is denoted t' (step 67).
  • The decoder receives the watermarked image i' after it has been exposed to some form of intentional and/or unintentional noise, resulting in a distorted or noisy version of the image denoted i*.
  • The coefficients are ordered according to off-diagonals, starting from the top left corner (no. 1) and ending at the bottom right corner (no. M).
  • A subset (y1, ..., yN) of N transform coefficients is selected (step 71) corresponding to the original sub-set s (i.e., off-diagonals indexed from L_l to L_h, where 1 ≤ L_l < L_h ≤ M).
  • The distorted image i* is visually similar to the original image i, so the statistical parameters of the original sub-set can be estimated by determining the statistical parameters (e.g., the variance estimates Qi*) of the noisy coefficients y.
  • The statistical parameter Qi* is determined (step 72) for groups (i.e., off-diagonals) within the sub-set y in the same manner as for the sub-set s.
  • A pair of uniform discrete mappings q0* and q1* is then computed for each off-diagonal in the same manner as in the scaled bin encoding process (step 73).
  • The encoded watermark data b* is extracted using Maximum Likelihood decoding (step 74). Specifically, for each observation yi, the probabilities P0(yi | j) and P1(yi | j) are estimated.
  • A set of scores is computed for all of the Hadamard code words.
  • The code word b that maximizes the score function is the estimated watermark code word b*.
  • The watermark code word b* is decoded using the indices of the Hadamard matrix rows to obtain the estimated decoded message m* (step 75). It should be noted that in the method shown in Figs. 6A and 6B, coefficients are ordered and selected according to an off-diagonal ordering to simplify the determination of Sc(b) using Hadamard decoding; however, the watermarking technique can be optimized similarly using other transform orderings paired with other decoding schemes in order to minimize the probability computations performed during ML decoding.
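As referenced above, a minimal sketch of using Hadamard matrix rows as error correcting code words is given here. Mapping a message index to a matrix row and decoding by correlating against every row are assumptions of this sketch; in the patent the Hadamard structure is used to speed up the computation of the ML score Sc(b) rather than as a stand-alone matched-filter decoder.

```python
import numpy as np
from scipy.linalg import hadamard

def hadamard_encode(msg_index, n):
    # Code word = row `msg_index` of the n x n Hadamard matrix, with entries
    # mapped from {+1, -1} to bits {0, 1}; n must be a power of two.
    H = hadamard(n)
    return ((1 - H[msg_index]) // 2).astype(int)

def hadamard_decode(bits):
    # Correlate against every row and pick the best match (nearest code word).
    n = len(bits)
    H = hadamard(n)
    signs = 1 - 2 * np.asarray(bits)          # bits back to +/-1
    return int(np.argmax(H @ signs))

b = hadamard_encode(5, 16)
noisy = b.copy()
noisy[[1, 7, 11]] ^= 1                        # flip a few bits
print(hadamard_decode(noisy))                 # still recovers 5
```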

Abstract

A transform domain watermarking technique based on a new encoding scheme, referred to as scaled bin encoding, which encodes a message in a set of transform coefficients by modifying their values in a way that preserves high image quality (i.e., low distortion levels) and adapts to the expected noise level. The watermark message is recaptured via a maximum likelihood decoding procedure based on the statistics of the transform coefficients and a worst-case statistical model of the noise introduced to these coefficients by image processing operations or attack noise.

Description

METHOD AND SYSTEM OF WATERMARKING DIGITAL DATA USING SCALED BIN ENCODING AND MAXIMUM LIKELIHOOD DECODING
FIELD OF THE INVENTION
The present invention relates to watermarking of digital data, and in particular to the technique of embedding watermark data into digital image data and recapturing the embedded watermark data from a noisy version of the watermarked digital image without the use of the original digital image data.
BACKGROUND OF THE INVENTION
The fast development of digital information technology, combined with the simplicity of duplication and distribution of digital data across communication networks like the Internet (using, e.g., publicly accessible web sites), has exposed content providers to a real challenge of how to protect their electronic data (e.g., images, video, sound, text). This has stimulated many research efforts towards the design and study of sophisticated watermarking and information hiding methodologies for copyright protection.
Watermarking techniques are used to embed secret information into, for instance, an image in such a way that this information cannot be recovered or deciphered without access to a secret key. On the other hand, to maintain quality, the watermarked image needs to be perceptually identical to the original image once the watermark is embedded.
The main application of watermarking is for proving ownership of the data and for protection against forgers. It is therefore crucial to ensure that no malicious attacker is able to remove the watermark without damaging the image to the degree that it becomes useless. In addition, watermarking techniques need to be robust to standard operations performed on images, such as printing, scanning, lossy compression (e.g., JPEG), filtering, and so on. Hence, the tradeoffs to be considered in the design of a watermarking technique involve three principal parameters: robustness to attacks and image processing operations, image quality, and the relative amount of secret information to be embedded, i.e., the coding rate.
Prior art image watermarking techniques can be divided into two groups according to the type of feature set the watermark is embedded in. Specifically, the watermark is embedded into either the intensity value of the luminance in the spatial domain representation of the image or the transform coefficients in the transform domain representation (e.g. DCT, DWT) of the image. The algorithms that are used to detect the watermark signal can also be placed under two categories based on whether or not they use the original image during the watermark detection process. In general, watermarking techniques that embed the watermark in the coefficients of a transform domain and detect it without resorting to the original image enjoy many advantages in their robustness to standard image processing operations and to hostile attacks.
The basic technique for embedding a watermark message or data into an image I by encoding the watermark data within the transform domain is performed by initially transforming the image I to obtain a representation of the image in a transform domain (e.g., DCT, DFT, etc.). Next, a sub-set of the transform coefficients is selected and the watermark message is encoded by slightly changing the values of this sub-set of transform coefficients, thus producing a watermarked transform coefficient set. The watermarked coefficients are then combined with the non-watermarked coefficients to generate a watermark embedded transform representation of the image. The watermarked transform representation is then inverse-transformed to produce a watermarked image I'. The coefficient modification is subtle enough so that I' is perceptually indistinguishable from the original image I.
In order to recapture the watermark from the watermarked image I', a decoder receives a distorted or noisy version of I' that may have suffered certain image processing operations or hostile attacks. It transforms the received version into the appropriate transform domain representation and selects the same sub-set of coefficients in which the watermark signal has been encoded. The decoder then extracts the watermark message from this sub-set of coefficients using a decoding procedure.
Recently, two methods for watermarking in a transform domain have been suggested for decoding without resorting to the original image: coefficient perturbation and dithering modulation.
In one prior art method using coefficient perturbation, the watermark data is added to the original image by perturbing the values of significant DCT (i.e., transform) coefficients. For example, if s is a DCT coefficient, a "zero" bit of the watermark data is encoded by changing the coefficient to s + ε and a "one" bit of the watermark data is encoded by changing the coefficient to s - ε, where ε is a small constant. Recapturing the watermark message from the watermarked image I' is accomplished by correlating the appropriate DCT coefficients with the watermark message. Variations of this watermarking technique have been suggested, such as the use of different transform domains such as DFT or DWT and/or the use of different perturbation schemes (e.g., s(1 ± ε) or s ± ε|s|).
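To make the prior art coefficient perturbation scheme concrete, the following minimal Python sketch (not taken from the patent; the standalone coefficient vector, the value of ε and the noise level are illustrative assumptions) embeds a ±ε pattern and detects it blindly by correlating the received coefficients with the expected pattern:

```python
import numpy as np

def embed_perturbation(coeffs, bits, eps=2.0):
    # "zero" bit -> s + eps, "one" bit -> s - eps
    signs = 1.0 - 2.0 * np.asarray(bits, dtype=float)
    return coeffs + eps * signs

def correlate_detect(received, bits):
    # Blind detection: correlate the received coefficients with the expected
    # +/-eps pattern; the roughly zero-mean host coefficients average out.
    signs = 1.0 - 2.0 * np.asarray(bits, dtype=float)
    return float(np.dot(received, signs)) / len(received)

rng = np.random.default_rng(0)
s = rng.normal(0.0, 20.0, size=1024)      # stand-in for significant DCT coefficients
b = rng.integers(0, 2, size=1024)         # watermark pattern bits
x = embed_perturbation(s, b, eps=2.0)
y = x + rng.normal(0.0, 1.0, size=1024)   # mild channel noise
print(correlate_detect(y, b))             # close to +eps when the watermark is present
```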
Dithering modulation is based on quantizing the transform domain coefficients. In this case, embedding a watermark message in a selected sub-set of coefficients is based on replacing these coefficients with their quantized values in a way that depends on the watermark message. For example, a "zero" bit is encoded by quantizing the coefficients with a quantizer q0 and a "one" bit is encoded by quantizing the coefficients with a different quantizer q1.
In a variation of the dither modulation watermarking scheme, "zero" bit and "one" bit values are encoded using two "self-noise suppression" mappings f0 and f1, which are based on a small modification of the quantizer mappings q0 and q1 used in the dither modulation scheme.
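A minimal sketch of basic dither modulation (the base scheme, not the self-noise suppression variant) is shown below; the uniform quantizers, the step size and the noise level are illustrative assumptions rather than values prescribed by any particular prior art scheme:

```python
import numpy as np

STEP = 8.0  # quantization step; illustrative value

def q0(s):
    # quantizer for a "zero" bit: lattice of points at multiples of STEP
    return STEP * np.round(s / STEP)

def q1(s):
    # quantizer for a "one" bit: the same lattice shifted by half a step
    return STEP * np.round((s - STEP / 2) / STEP) + STEP / 2

def embed_dither(coeffs, bits):
    return np.where(np.asarray(bits) == 0, q0(coeffs), q1(coeffs))

def decode_dither(received):
    # per coefficient, pick whichever quantizer lattice lies closer to the received value
    return (np.abs(received - q1(received)) < np.abs(received - q0(received))).astype(int)

rng = np.random.default_rng(1)
s = rng.normal(0.0, 20.0, size=16)
b = rng.integers(0, 2, size=16)
x = embed_dither(s, b)
y = x + rng.normal(0.0, 0.5, size=16)     # noise well below STEP/4, so the bits survive
print((decode_dither(y) == b).all())
```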
To date, the prior art watermarking schemes including coefficient perturbation and dither modulation are not explicitly designed to adapt to a predefined distortion level or noise criteria. As a result, they provide a relatively low rate of information embedding or suffer from high error rate under changing noise conditions. More importantly, since attempts by malicious attackers take the form of unknown noise conditions on the watermark embedded image data, these prior art schemes are less able to protect against attacks upon the security of the data - one of the main purposes of watermarking.
The present invention presents a watermarking technique which is based on a new encoding scheme, referred to as scaled bin encoding, which encodes watermark data into image data by modifying image values in a way that preserves high image quality (i.e., low distortion levels) and adapts to the expected (i.e., worst case) noise level. Recapturing of the watermark data from the watermark embedded image after it has been exposed to unintentional and/or intentional noise is performed via a decoding method using a probability based procedure (e.g., maximum likelihood decoding), based on estimated statistics of the original image values and an expected statistical model of the noise introduced to the image by image processing operations or attack noise, thereby providing a robust and high quality watermarking system and method.
SUMMARY OF THE INVENTION
The present invention provides a method and system of embedding digital watermark data into digital image data and recapturing the embedded watermark from the watermarked digital data without the use of the original digital image data and despite noise introduced into the watermarked digital image data. The system and method of the present invention is performed by two processes: the process of embedding the watermark into the original digital data to obtain watermarked digital data and the process of decoding the watermarked digital data to obtain the original watermark data. Watermark data is embedded into the digital image data using an encoding method referred to as scaled bin encoding. The embedded watermark data is recaptured from the watermarked data using a probability based decoding scheme. In one embodiment, the probability based decoding scheme is the Maximum Likelihood (ML) decoding scheme. In one embodiment, embedding of the watermark data is achieved by transforming the original digital image data into first transform coefficients; encoding the watermark data using an error correcting code; embedding the encoded watermark data into a first sub-set of the first transform coefficients to generate a sub-set of embedded transform coefficients using scaled bin encoding; and then inversely transforming the watermark embedded first sub-set of coefficients along with the remaining non-watermark embedded first transform coefficients to generate watermark embedded digital image data which includes the original digital image data and the watermark data.
According to this embodiment, scaled bin encoding is performed by scaling each coefficient of the sub-set of transform coefficients with a predetermined scaling parameter which is representative of an expected noise level and an allowed distortion model; mapping each scaled coefficient to one of a pair of skewed discrete mappings dependent on the logic state of the corresponding encoded watermark data bit to be embedded into each scaled coefficient; obtaining a difference between each scaled coefficient and its corresponding mapped, scaled coefficient; and adding the difference to its corresponding original (i.e., unscaled and unmapped) coefficient to obtain each watermark embedded transform coefficient.
In accordance with another embodiment of the system and method of the present invention, the watermark data is recaptured from the watermark embedded digital image data by transforming the watermark embedded image into second transform coefficients using the same transformation as used during watermark encoding; selecting a second sub-set of watermarked transform coefficients which correspond to the first sub-set of transform coefficients; estimating statistical parameters of the first sub-set of transform coefficients using image statistics of the second, watermarked sub-set of coefficients; extracting the embedded watermark data from the second sub-set of coefficients with a probability based decoding scheme which uses the predetermined scaling parameter, known aspects of the scaling and mapping steps, the expected noise and allowed distortion model, and the estimated statistical parameters; and decoding the extracted watermark data using an error correcting decoder based on the error correcting code used during watermark encoding to obtain an estimate of the original watermark. In one embodiment the ML decoding scheme is used as the probability based decoding scheme.
For embedding the watermark into the digital image data, one embodiment of the system of watermarking includes an image transformer for transforming the digital image data into a transform domain representation; an encoder for encoding watermark data using an error correcting code to generate encoded watermark data; a watermark encoder for embedding the encoded watermark data into a first sub-set of the first transform coefficients to generate a sub-set of embedded transform coefficients by using scaled bin encoding; an inverse image transformer for inversely transforming the watermark embedded first sub-set of coefficients along with the remaining non-watermark embedded first transform coefficients to generate watermarked digital image data which includes the digital image data and the watermark data.
For recapturing the watermark data from the watermarked image, one embodiment of the system of watermarking includes an image transformer for transforming the watermarked image data into the transform domain representation; a means for determining the statistical parameters of the second sub-set of coefficients; a probability based watermark decoder for extracting embedded watermark data from the second sub-set of coefficients with a probability based decoding scheme which uses the predetermined scaling parameter, known aspects of the scaling and mapping steps, the expected noise and allowed distortion model, and the estimated statistical parameters; a decoder for decoding the extracted watermark data using an error correcting decoder based on the original error correcting code to generate the watermark data.
BRIEF DESCRIPTION OF THE DRAWINGS
The objects, features, and advantages of the present invention will be apparent to one skilled in the art, in view of the following detailed description in which:
Fig. 1 illustrates basic stages of the watermarking process;
Figs. 2A and 2B illustrate a first embodiment of a system and method of watermarking using scaled bin encoding in accordance with the present invention;
Fig. 3 illustrates a first embodiment of a method of scaled bin encoding in accordance with the present invention;
Figs. 4A and 4B illustrate a first embodiment of a system and method of watermarking using a probability based decoding scheme;
Fig. 5 illustrates an embodiment of a watermarking method in which watermark data is decoded using a probability based decoding scheme;
Figs. 6A and 6B illustrate an embodiment of watermarking using scaled bin encoding and the Maximum Likelihood decoding; and
Fig. 6C illustrates ordering of DCT coefficients for selection of a sub-set of transform coefficients in accordance with the present invention.
DETAILED DESCRIPTION OF THE INVENTION
The watermarking process can be represented as having three stages: a watermark encoding stage 10, an attack channel stage 11, and a watermark decoding stage 12.
The encoding stage 10 receives two inputs: m, the watermark data or message to be embedded, and i, a covertext image into which the watermark message is to be embedded. Covertext image i can include digital data comprising text data, video data, audio data or combinations of these types of data. The output of the encoder is a watermarked image i' that is perceptually similar to the original image i. The attack channel 11 represents a composition of expected and/or non-malicious signal processing operations (e.g. compression, filtering, etc.) and/or a deliberate attack on the watermarked image. The watermark embedded image output i* of the channel is a distorted version of the input watermark embedded image i', which is assumed to result from a noise signal r. Finally, the decoder 12 receives the image i* as an input and estimates the embedded message m without resorting to the original covertext image i.
Embedding using Scaled Bin Encoding:
Fig. 2A illustrates one embodiment of a system of watermarking for embedding a watermark message into a covertext image. Embedding of the watermark message m into image data i is performed in a transform domain. The image i is initially transformed into a set of transform coefficients t=F(i) by image transform block 20. The image transform block 20 can be embodied in numerous manners as is well known in the field of digital signal processing. For instance, the image transform block may be embodied so as to convert the image into a Discrete Cosine Transform (DCT) representation or a Discrete Fourier Transform (DFT) representation. Also prior to embedding, the watermark message m is encoded into an error correcting code format b by error correcting code block 21. A sub-set of coefficients s is selected from the image coefficients t into which the watermark data will be embedded. The scaled bin encoding block 22 embeds the message b into the selected image data transform coefficients s to generate watermark embedded coefficients x using a coding method in which each coefficient of the sub-set of transform coefficients is scaled with a predetermined scaling parameter which is a function of an expected noise model and an allowed distortion model; each scaled coefficient is mapped to one of a pair of skewed discrete mappings dependent on the logic state of the corresponding encoded watermark data bit to be embedded into each scaled coefficient; a difference between each scaled coefficient and its corresponding mapped, scaled coefficient is obtained; and the difference is added to its corresponding original (i.e., unscaled and unmapped) coefficient to obtain each watermark embedded transform coefficient.
The watermark embedded coefficients x replace the corresponding non-watermark embedded coefficients s and are combined with the remaining non-embedded coefficients. The resulting combined coefficients are inversely transformed by inverse image transform block 23 to generate a watermark embedded image i'.
It should be understood that bin coding is generally defined as a technique in which code words are chosen for encoding a given message depending on the message itself and on the image into which it is being encoded. Scaled bin encoding, according to the system and method of the present invention, is generally defined as a technique in which an appropriate scaling parameter is introduced into a bin-coding scheme to adapt the coding rate (i.e., number of bins) to the level of noise that is expected to disrupt the transform coefficients and to an allowed distortion. Thus, the scaled bin encoding based watermarking system and method in accordance with the present invention is robust to noise introduced by image processing operations and hostile attacks.
The system shown in Fig. 2A, including an image transformer 20, an encoder 21, a scaled bin encoder 22 and an inverse image transformer 23, can be embodied in either a software implementation performed by a computing or processing system, or in a dedicated hardware implementation such as an ASIC, or any combination of a software and hardware implementation. Fig. 2B illustrates an embodiment of a method of watermarking using scaled bin encoding corresponding to the system shown in Fig. 2A. The method shown in Fig. 2B includes the steps of transforming image data i into transform coefficients (step 24); encoding watermark data m into an error correcting code format to generate encoded watermark data b (step 25); embedding the encoded watermark data b into a sub-set of the transform coefficients s using scaled bin encoding (step 26); and inversely transforming the watermark embedded coefficients x and the remaining non-watermark embedded coefficients to obtain a watermark embedded image i' (step 27).
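For orientation, a skeleton of this embedding flow is sketched below in Python. It is only a sketch: the helper callables encode_ecc and scaled_bin_encode are hypothetical placeholders (the scaled bin encoder itself is sketched after the discussion of Fig. 3 below), and a full-frame 2-D DCT is assumed where an implementation might use another transform:

```python
import numpy as np
from scipy.fft import dctn, idctn

def embed_watermark(image, wm_bits, coeff_index, encode_ecc, scaled_bin_encode):
    """Skeleton of the Fig. 2A flow; the helper callables are placeholders.

    encode_ecc        : watermark bits m -> code word b        (block 21)
    scaled_bin_encode : (s, b) -> watermarked coefficients x   (block 22)
    coeff_index       : flat indices of the selected sub-set s in the DCT array
    """
    t = dctn(np.asarray(image, dtype=float), norm='ortho')    # block 20: i -> t = F(i)
    b = encode_ecc(wm_bits)
    s = t.flat[coeff_index]                                   # select the sub-set s
    t_marked = t.copy()
    t_marked.flat[coeff_index] = scaled_bin_encode(s, b)      # replace s with x
    return idctn(t_marked, norm='ortho')                      # block 23: t' -> i'
```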
It should be noted that scaled bin encoding of watermark data into an image can be performed within the spatial (i.e. pixel) domain of the image instead of a given transform domain of the image. Hence, in an alternative embodiment of the system of watermarking, image transform block 20 and inverse image transform block 23 (Fig. 2A) are not required, and in an alternative embodiment of the method of watermarking, the transforming step 24 and the inversely transforming step 27 are not required.
In one embodiment, embedding using scaled bin encoding is performed so as to ensure that the watermarked image is perceptually similar to the original image by adhering to the following allowed distortion model and expected noise model criteria:
1. A pair of scalar discrete mappings, q0 and q1, both having a scalar distortion level |q0(s) - s| < D and |q1(s) - s| < D that is compatible with the distortion constraint d(x,s) < D.
2. A scalar parameter α, 0 < α < 1, whose value depends on the relative power of the expected noise in the attack channel.
Hence, in accordance with another embodiment of the system and method of watermarking in which a sequence b = (b1, ..., bN) of N bits is embedded into a sequence s = (s1, ..., sN) of N transform coefficients, embedding is performed using scaled bin encoding by scaling the sequence s by α and then mapping it using a discrete mapping q defined component-wise by the sequence b, such that the i-th component is mapped by q0 or by q1 according to the value of bi. More formally: q(αs) = (q^b1(αs1), ..., q^bN(αsN)).
Next, the difference between αs and its mapped version is added to the original sequence s to obtain the watermarked signal x:
x = s + (q(αs) - αs)
Fig. 3 shows the steps of performing scaled bin encoding using the scaling parameter α, which is a function of an expected noise level, and using discrete mappings which are a function of an allowed distortion model as defined above. According to this method, each coefficient si of the sub-set s is scaled with the predetermined scaling parameter α (step 30) and is then mapped to one of a pair of skewed discrete mappings (q0, q1) dependent on the logic state of the corresponding encoded watermark bit bi to be embedded into the scaled coefficient si (step 31). A difference between each scaled coefficient and its corresponding mapped, scaled coefficient is determined (step 32) and the difference is added to its corresponding original (i.e., unscaled, unmapped) coefficient to obtain the digital data representation of each watermark embedded transform coefficient xi (step 33).
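A minimal sketch of steps 30-33 follows. The uniform example mappings with step D, the value of α and the random test data are illustrative assumptions, not values prescribed by the patent:

```python
import numpy as np

def scaled_bin_encode(s, b, alpha, q0, q1):
    """One reading of steps 30-33: x_i = s_i + (q^{b_i}(alpha * s_i) - alpha * s_i)."""
    s = np.asarray(s, dtype=float)
    scaled = alpha * s                                              # step 30: scale by alpha
    mapped = np.where(np.asarray(b) == 0, q0(scaled), q1(scaled))   # step 31: map by bit value
    return s + (mapped - scaled)                                    # steps 32-33: add the difference

# Illustrative uniform mappings with step D (assumed, not the patent's choice):
D = 8.0
q0 = lambda v: D * np.round(v / D)
q1 = lambda v: D * np.round((v - D / 2) / D) + D / 2

rng = np.random.default_rng(2)
s = rng.normal(0.0, 25.0, size=8)
b = rng.integers(0, 2, size=8)
x = scaled_bin_encode(s, b, alpha=0.7, q0=q0, q1=q1)
print(np.max(np.abs(x - s)))   # per-coefficient change stays bounded by the mapping distortion
```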
Decoding using Probability Based Decoding:
In accordance with another embodiment of the system and method of watermarking, the watermark decoding stage 12 (Fig. 1) is implemented using a probability based decoding scheme which includes parametric statistical estimation procedures for estimating statistical information relating to the transform coefficients of the original image and an expected (i.e., worst case) model for the attack channel noise. The probability based decoding scheme exploits the fact that the original image and the watermark embedded image have statistical similarities, thereby allowing the watermark embedded image to be used to obtain estimated statistical data of the original image.
In one embodiment of the watermarking system using a probability based decoding scheme (Fig. 4A), an image transform block 40 receives the noisy image i* and transforms it into a transform representation. The type of transform performed by transform block 40 is the same type as that performed during the watermark encoding process. The image transform block 40 provides a sub-set of watermark embedded transformed coefficients y to the probability based watermark decoder 41. The sub-set of watermark embedded transformed coefficients y corresponds to the original sub-set of coefficients used during the watermark encoding process (i.e., s). The watermark decoder 41 extracts the embedded watermark data b* from the sub-set using a probability based decoding scheme. The extracted watermark data b* is provided to the error correcting code (ECC) decoder 42, which decodes the watermark data b* and outputs estimated watermark data m*.
Fig. 4B shows an embodiment of the watermarking method using probability based decoding in which the encoded watermark message m is estimated from the distorted or noisy version of the image i*. Initially, the noisy watermark embedded image i* is transformed into a transform domain representation F(i*) (step 43) and a second sub-set of transform coefficients is selected (step 44) having the same location in the transform domain as the first sub-set s = (s1, ..., sN) used in the watermark encoding process. These coefficients are denoted y = (y1, ..., yN). Next, the statistical parameters of the original sub-set of coefficients s = (s1, ..., sN) are estimated by determining (step 45) the statistics of the distorted coefficients y = (y1, ..., yN). The encoded watermark data word b* is extracted from y using a probability based decoding scheme which utilizes known aspects of the scaled bin encoding process, including the scaling parameter, the expected noise and allowed distortion model, and the estimated statistical parameters (step 46). Once the watermark data b* is determined, it is decoded to obtain the estimated decoded message m* (step 47).
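A corresponding decoder skeleton is sketched below. The ml_decode and decode_ecc callables are hypothetical placeholders, and a single pooled variance estimate stands in for the per-off-diagonal statistics of the DCT embodiment (Figs. 6A and 6B):

```python
import numpy as np
from scipy.fft import dctn

def decode_watermark(noisy_image, coeff_index, ml_decode, decode_ecc):
    """Skeleton of the Fig. 4A/4B flow (steps 43-47); the helper callables are placeholders.

    ml_decode  : (y, estimated statistics) -> extracted code word b*   (block 41)
    decode_ecc : b* -> estimated message m*                            (block 42)
    """
    t_star = dctn(np.asarray(noisy_image, dtype=float), norm='ortho')  # step 43: i* -> F(i*)
    y = t_star.flat[coeff_index]                  # step 44: same sub-set locations as s
    variance_est = float(np.var(y))               # step 45: statistics of s estimated from y
    b_star = ml_decode(y, variance_est)           # step 46: probability based extraction
    return decode_ecc(b_star)                     # step 47: error correcting decode
```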
The system shown in Fig. 4A, including the image transform block 40, watermark decoder 41 and ECC decoder 42, can be embodied either in a software implementation performed by a computing or processing system, in a dedicated hardware implementation such as an ASIC, or in any combination of software and hardware.
It should be understood that the "location" of coefficients within the transform domain refers to the location within the array of coefficients generated from the transformation. It should further be noted that the location can be one-, two-, three- or higher-dimensional, depending on the dimensions of the array of coefficients generated from the transformation.
In one embodiment in which the probability based decoding scheme is the Maximum Likelihood (ML) decoding scheme, the same parameters used by the scaled bin encoding procedure (α, q0 and q1), the expected noise and allowed distortion model, and the estimated statistical parameters of the original sub-set s = (s_1, ..., s_N) are used. Fig. 5 shows one embodiment of the ML decoding procedure. Given the scaled bin encoding parameter 51 (i.e., α), known aspects of the original mappings 52, the expected noise and allowed distortion model 53, and the estimated statistical parameters 54, the following probabilities are determined (steps 55 and 56):
• the probability P0(j) that the point q0(j) was used in encoding the corresponding original coefficient;
• the conditional probability P0(y_i | j) of observing the distorted coefficient value y_i, given that the point q0(j) was used in encoding the corresponding original coefficient;
• the probability P1(j) that the point q1(j) was used in encoding the corresponding original coefficient; and
• the conditional probability P1(y_i | j) of observing the distorted coefficient value y_i, given that the point q1(j) was used in encoding the corresponding original coefficient
(where j is an integer index of the discrete mapping points which ranges over a finite interval {−J, ..., J}). Using these probabilities, for each possible code word b (where b ranges over the 2^k possible error correcting code words) the following score is determined (step 57):
Sc(b) = ∏_i Σ_{j=−J}^{J} P_{b_i}(y_i | j) · P_{b_i}(j),

where the product runs over the coefficients carrying the code word b, and P_{b_i} denotes P0 or P1 according to the value of the bit b_i.
The scores are then evaluated and the code word b that maximizes the score Sc(b) is selected and corresponds to the estimated encoded watermark data word b* (step 58).
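A minimal sketch of steps 55-58 follows, assuming the probabilities have already been tabulated: the conditional probabilities as arrays of shape (N, 2J+1) and the a-priori probabilities as length-(2J+1) vectors (this table layout is an assumption made for illustration). The score product is evaluated in the log domain for numerical stability.

```python
import numpy as np

def ml_decode(P0_cond, P1_cond, P0_prior, P1_prior, codewords):
    """Steps 55-58: score every candidate code word and keep the maximizer.

    P0_cond, P1_cond   : arrays of shape (N, 2J+1) holding P_b(y_i | j)
    P0_prior, P1_prior : arrays of shape (2J+1,) holding P_b(j)
    codewords          : iterable of candidate code words b (0/1 arrays of length N)
    """
    # Per-coefficient likelihood of each bit value: sum over the mapping index j.
    like0 = P0_cond @ P0_prior          # shape (N,): sum_j P0(y_i | j) P0(j)
    like1 = P1_cond @ P1_prior
    best_b, best_score = None, -np.inf
    for b in codewords:
        b = np.asarray(b)
        # Sc(b) = prod_i like_{b_i}(y_i); compare in the log domain.
        score = np.sum(np.log(np.where(b == 0, like0, like1) + 1e-300))
        if score > best_score:
            best_b, best_score = b, score
    return best_b                        # estimated encoded watermark data word b*
```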
Figs. 6A and 6B show an embodiment of the method of watermarking using scaled bin encoding (Fig. 6A) and ML decoding (Fig. 6B) in which encoding of the watermark data is performed so as to simplify decoding when extracting the watermark data from the noisy image i*. As shown in Fig. 6A, initially the image i to be watermarked is transformed into its DCT transform representation t = DCT(i) (step 60).
Next, the transform coefficients are ordered (step 61) in the DCT domain according to the order of the off-diagonals, starting from the top left corner (no. 1) and ending at the bottom right corner (no. M) as shown in Fig. 6C. It should be noted that the DCT domain corresponds to a frequency domain representation of the image. Referring to Fig. 6C, coefficients are ordered diagonally from the lowest frequency (i.e., DC) situated at the top left corner 1 to the highest frequency at the bottom right corner M. Each off-diagonal (e.g., L_l, L_h) represents a range of frequencies and groups together the coefficients within that range.
A subset s of N transform coefficients is then selected (step 62) from the ordered coefficients, corresponding to off-diagonals indexed from L_l to L_h where 1 < L_l < L_h < M. These coefficients are used as the covertext signal s = (s_1, ..., s_N) into which the watermark message or data is to be encoded. For each off-diagonal with index in the range [L_l, L_h], a statistical parameter is computed (step 63) for the coefficients corresponding to that off-diagonal. In this case, the statistical parameter is the estimated variance Q_i of the set of coefficients located on the off-diagonal of the i-th coefficient (where i is the index of the coefficient). Coefficients corresponding to the same off-diagonal share the same estimated variance.
The formula for the estimated variance of a set of numbers {a_i}_{i=1}^{n} is well known in the field of statistics and is given by:

V = (1/n) Σ_{i=1}^{n} (a_i − ā)², where ā = (1/n) Σ_{i=1}^{n} a_i.
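A sketch of the ordering and variance estimation of steps 61-63 is given below, assuming a square DCT array whose anti-diagonals (u + v = constant) play the role of the off-diagonals of Fig. 6C; np.var implements the sample-variance formula quoted above.

```python
import numpy as np

def offdiagonal_groups(t):
    """Step 61: group DCT coefficients by anti-diagonal (u + v = const)."""
    M = t.shape[0]
    return [np.array([t[u, d - u]
                      for u in range(max(0, d - M + 1), min(d, M - 1) + 1)])
            for d in range(2 * M - 1)]

def diagonal_variances(groups, lo, hi):
    """Step 63: estimated variance Q for each off-diagonal with index in [lo, hi]."""
    return {d: np.var(groups[d]) for d in range(lo, hi + 1)}
```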
For each off-diagonal, a pair of uniform discrete mappings is computed (step 64). The discrete mappings depend on an allowed scalar distortion level. This distortion level, for the coefficient s_i, is defined by D_i = ε·√Q_i, where ε is a global parameter that controls the visual distortion level. This distortion term is motivated by models of the human visual system and may be replaced by other distortion functions. It should be noted that all the coefficients corresponding to the same off-diagonal have a common distortion level D_i and hence share the same discrete mappings, thereby significantly reducing the mapping computation. Consequently, for each distortion level parameter D, a pair of uniform scalar discrete mappings q0_D and q1_D can be determined as follows:
q0_D(x) = 2D · round(x / (2D))  and  q1_D(x) = 2D · round((x − D) / (2D)) + D,

where round(·) returns the closest integer value to a real input.

The mappings q0_D and q1_D take values in the discrete sets {2jD}_{j=−∞}^{∞} and {(2j − 1)D}_{j=−∞}^{∞}, respectively. We denote by q0_i(j) and q1_i(j), where j is an integer index, the corresponding elements of these discrete sets for the i-th coefficient.
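Transcribed directly from the reconstruction above, the two uniform scalar mappings can be sketched as follows (D > 0 is the per-off-diagonal distortion level):

```python
import numpy as np

def q0(x, D):
    """Map x to the nearest point of {2jD}: even multiples of D."""
    return 2.0 * D * np.round(x / (2.0 * D))

def q1(x, D):
    """Map x to the nearest point of {(2j-1)D}: odd multiples of D."""
    return 2.0 * D * np.round((x - D) / (2.0 * D)) + D
```

For example, with D = 1 the value 2.7 is mapped to 2 by q0 (nearest even multiple of D) and to 3 by q1 (nearest odd multiple of D).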
The watermark message m is encoded as a sequence of N bits denoted b = (b_1, ..., b_N), using a code selected dependent on the type of transform ordering and chosen to optimize the subsequent decoding steps (step 65). In this case, the message is encoded using Hadamard matrix rows as error correcting code words, which accelerates the probability computations during the ML decoding process. Each block of k message bits is mapped into the appropriate row of length n = 2^k of the Hadamard matrix of order k, where k may change as a function of the off-diagonal and of the expected noise power. In this embodiment it is assumed that N = nR, where R is an integer. The binary sequence b = (b_1, ..., b_n) is embedded in the coefficient sub-set s = (s_1, ..., s_n) using the scaled bin encoding scheme (step 66) to obtain the watermarked coefficient sequence x. The scaled bin encoding is performed for each block of n coefficients until the sub-set s = (s_1, ..., s_N) is exhausted.
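A sketch of the Hadamard encoding of step 65 follows, under the assumption that "Hadamard matrix of order k" refers to the 2^k × 2^k Sylvester-type matrix and that its ±1 rows are relabelled as 0/1 code bits; the block-to-row indexing convention is likewise an assumption made for illustration.

```python
import numpy as np
from scipy.linalg import hadamard

def hadamard_encode(message_bits, k):
    """Step 65: map each k-bit message block to a Hadamard row of length 2**k.

    Assumes the Sylvester construction, a +1/-1 -> 0/1 relabelling, and that
    len(message_bits) is a multiple of k; this is one concrete reading of the
    text, not necessarily the patented code.
    """
    H = (1 - hadamard(2 ** k)) // 2            # rows of 0/1 code bits
    bits = np.asarray(message_bits).reshape(-1, k)
    rows = bits.dot(1 << np.arange(k)[::-1])   # interpret each block as a row index
    return H[rows].ravel()                     # concatenated code words b = (b_1, ..., b_N)
```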
Initially, the coefficients are scaled using the scaling parameter α defined by the formula α = ε / (ε + η), where ε is defined as above and η is the ratio between the expected (i.e., worst case) noise power and the covertext signal power. The ratio parameter η is determined experimentally by measuring its value for many images and various noise types. The stronger the noise, the higher the value of η. The maximum level of noise is chosen by determining how robust the watermarking scheme needs to be, and the corresponding value of η is determined accordingly. It should be noted that in another embodiment, the parameter α may depend on the index i and hence need not be fixed. The scaled coefficients are then mapped using the vector discrete mapping (determined in step 64) defined by the following:
q(αs) = (q^{b_1}_{D_1}(αs_1), ..., q^{b_n}_{D_n}(αs_n)),

where q^{b_i}_{D_i} denotes q0_{D_i} if b_i = 0 and q1_{D_i} if b_i = 1.
Next, the difference between the mapped version q(αs) and αs is added to the original sequence s to obtain the watermarked signal x:

x = s + (q(αs) − αs)
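Combining steps 64-66, a minimal sketch of the per-block embedding is given below, assuming each selected coefficient s_i carries its own distortion level D_i and encoded bit b_i, and using the reconstructed scaling parameter α = ε/(ε + η) and the mappings q0, q1 defined above.

```python
import numpy as np

def embed_block(s, b, D, eps, eta):
    """Scaled bin encoding of one block (step 66): x = s + (q(alpha*s) - alpha*s).

    s        : selected DCT coefficients (1-D array)
    b        : encoded watermark bits for this block (0/1 array, same length)
    D        : per-coefficient allowed distortion levels D_i (same length)
    eps, eta : distortion and worst-case noise-to-signal parameters
    """
    s = np.asarray(s, dtype=float)
    D = np.asarray(D, dtype=float)
    b = np.asarray(b)
    alpha = eps / (eps + eta)                        # scaling parameter
    z = alpha * s
    q0 = 2 * D * np.round(z / (2 * D))               # even multiples of D_i
    q1 = 2 * D * np.round((z - D) / (2 * D)) + D     # odd multiples of D_i
    q = np.where(b == 0, q0, q1)                     # bit-dependent mapping
    return s + (q - z)                               # watermarked coefficients x
```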
The coefficient sequence s in the DCT representation t is replaced with the watermarked coefficient sequence x, and the resulting watermarked transform representation is denoted t′ (step 67). The watermarked DCT domain representation is then inversely transformed (step 68) to obtain the watermarked image i′ = DCT⁻¹(t′).
Referring to Fig. 6B, the decoder receives the watermarked image i′ after it has been exposed to some form of intentional and/or unintentional noise, resulting in a distorted or noisy version of the image denoted i*. The decoder extracts by estimation the embedded watermark data from the noisy image i* using Maximum Likelihood decoding. Initially, the DCT transform of the noisy image i* is determined and denoted t* = DCT(i*) (step 69). Next, the transform coefficients t* of the noisy image are ordered in the same manner as in the scaled bin encoding process (step 70). In this case, the coefficients are ordered according to off-diagonals starting from the top left corner (no. 1) and ending at the bottom right corner (no. M). Once the coefficients are ordered, a subset y = (y_1, ..., y_N) of N transform coefficients is selected (step 71) corresponding to the original sub-set s (i.e., off-diagonals indexed from L_l to L_h where 1 < L_l < L_h < M).
In accordance with the method of the present invention, it is assumed that the distorted image i* is visually similar to the original image i, such that the statistical parameters of the original sub-set s can be estimated by determining the statistical parameters (e.g., the variance estimates Q_i*) of the noisy version coefficients y. The statistical parameter Q_i* is determined (step 72) for groups (i.e., off-diagonals) within the sub-set y in the same manner as for the sub-set s.
A pair of uniform discrete mappings q0* and q1* is then computed for each off-diagonal in the same manner as in the scaled bin encoding process (step 73).
Once the statistical parameters of the watermarked sub-set are determined, the encoded watermark data b* is extracted using Maximum Likelihood decoding (step 74). Specifically, for each observation y_i, the probabilities P0(y_i | j) and P1(y_i | j) of observing the value y_i, given that the points q0_i*(j) and q1_i*(j), respectively, were used to encode the corresponding original coefficient s_i, are estimated. Using an independent Gaussian model for the joint distribution of the DCT coefficients, the allowed watermark distortion and the expected noise in the attack channel, these probabilities are given by Gaussian densities in y_i whose parameters are determined by β = 1 + ε + η and σ_i² = Q_i*·(1 + ε + η − β²(α² + ε)) / (α + ε + η). The a-priori probabilities P0(j) and P1(j) of using the corresponding points q0_i*(j) and q1_i*(j) in encoding the corresponding original coefficient s_i are independent of i and can be estimated from the Gaussian statistics of the scaled host coefficients.
For each Hadamard code word b (where b ranges over the 2^k possible error correcting code words) the following score is computed:

Sc(b) = P{y | b} = P(y_1, ..., y_n | b) = ∏_{i=1}^{n} Σ_{j=−J}^{J} P_{b_i}(y_i | j) · P_{b_i}(j),

where J is chosen in such a way that the difference between the full (infinite) sum over j and the truncated sum Σ_{j=−J}^{J} P_b(y_i | j) P_b(j) is negligible for every i and for b ∈ {0, 1}. It should be noted that the above defined score function is efficiently computed using a fast Hadamard transform; hence, watermark decoding is improved by ordering and selecting off-diagonal coefficients during watermark encoding so as to facilitate the probability computations using a fast Hadamard transform.
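One way a fast Hadamard transform can enter the score computation is sketched below: with the ±1 to 0/1 row convention assumed in the encoding sketch, maximizing the per-block log-score over the 2^k Hadamard code words reduces to minimizing the Walsh-Hadamard transform of the per-coefficient log-likelihood differences. This is an illustrative reading of the remark above, not necessarily the patented computation.

```python
import numpy as np

def fwht(a):
    """Iterative fast Walsh-Hadamard transform (Sylvester ordering).

    len(a) must be a power of two; returns H_n @ a in O(n log n) operations.
    """
    a = np.asarray(a, dtype=float).copy()
    h = 1
    while h < len(a):
        for i in range(0, len(a), 2 * h):
            x, y = a[i:i + h].copy(), a[i + h:i + 2 * h].copy()
            a[i:i + h], a[i + h:i + 2 * h] = x + y, x - y
        h *= 2
    return a

def best_hadamard_row(log_like0, log_like1):
    """Select the Hadamard code word maximizing the block log-score.

    For a +/-1 row h with code bits b = (1 - h)/2, maximizing
    sum_i b_i * (L1_i - L0_i) over rows is equivalent to minimizing the
    correlation of h with delta = L1 - L0; all 2**k correlations are the
    entries of the Walsh-Hadamard transform of delta.
    """
    delta = np.asarray(log_like1, float) - np.asarray(log_like0, float)
    return int(np.argmin(fwht(delta)))     # row index of the best code word
```

Under the indexing convention of the encoding sketch, the returned row index is the decoded k-bit message block.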
A set of scores is computed for all of the Hadamard code words. The code word b that maximizes the score function is the estimated watermark code word b*. The watermark code word b* is decoded using the indices of the Hadamard matrix rows to obtain the estimated decoded message m* (step 75). It should be noted that, according to the method shown in Figs. 6A and 6B, coefficients are ordered and selected according to an off-diagonal ordering to simplify the determination of Sc(b) using Hadamard decoding; however, the watermarking technique can be optimized similarly using other transform orderings paired with other decoding schemes in order to minimize the probability computations performed during ML decoding.

In the preceding description, numerous specific details are set forth, such as specific transform, encoding, decoding, and ordering types, in order to provide a thorough understanding of the present invention. It will be apparent, however, to one skilled in the art that these specific details need not be employed to practice the present invention. In other instances, well-known image processing steps have not been described in detail in order to avoid unnecessarily obscuring the present invention.
In addition, although elements of the present invention have been described in conjunction with a certain embodiment, it is appreciated that the invention can be implemented in a variety of other ways. Consequently, it is to be understood that the particular embodiment shown and described by way of illustration is in no way intended to be considered limiting. Reference to the details of this embodiment is not intended to limit the scope of the claims, which themselves recite only those features regarded as essential to the invention.

Claims

We claim:
1. A method of watermarking digital image data having an associated domain representation comprising the steps of:
encoding watermark data using an error correcting code to generate encoded watermark data;
embedding the encoded watermark data into a first sub-set of elements of a domain representation of the image to generate an embedded sub-set of the domain representation using scaled bin encoding;
combining the embedded sub-set of elements with the remaining unembedded portion of elements of the domain representation of the image to generate a watermark embedded domain representation of the image.
2. The method as described in Claim 1 further comprising the steps of:
transforming the digital image data from a spatial domain representation into a transform domain representation prior to embedding the encoded watermark data into the first sub-set of elements of the domain representation of the image; and
inversely transforming the watermark embedded domain representation of the image to generate a spatial domain representation of the watermarked digital image which includes the digital image data and the watermark data.
3. The method of watermarking as described in Claim 2 wherein scaled bin encoding comprises the steps of:
scaling each element of the sub-set of the domain representation of the image with a scaling parameter which is representative of the statistics of the image data, an expected noise level and an allowed distortion model;
mapping each scaled element to one of a pair of skewed discrete mappings dependent on the logic state of a corresponding encoded watermark data bit to be embedded into each scaled element;
obtaining a difference between each mapped scaled element and its corresponding scaled element; and
adding the difference to its corresponding original element to obtain each watermark embedded element.
4. The method of watermarking as described in Claim 2 further comprising the step of recapturing the watermark from a noisy version of the watermark embedded spatial domain representation of the image by:
transforming the spatial domain representation of the noisy watermark embedded image into the transform domain representation;
selecting a second sub-set of elements from the noisy watermarked embedded transform domain representation of the image which corresponds to the first sub-set of elements;
determining statistical parameters of the second sub-set of elements;
extracting embedded watermark coded data from the second sub-set of elements with a probability based decoding scheme which uses the statistical parameters, an expected noise and allowed distortion model and a scaling parameter associated with the step of scaled bin encoding;
decoding the extracted watermark coded data using an error correcting decoder based on the error correcting code to generate the watermark data.
5. The method of watermarking as described in Claim 4 wherein the step of extracting embedded watermark data further comprises the steps of:
1) for each of the second sub-set of elements and for each discrete mapping determining: a) a first probability that a given mapping value was used to encode a corresponding original element within the first sub-set of elements;
b) a second probability of observing each element of the second sub-set of elements given the first probability that the given mapping value was used;
2) determining a score value for each possible watermark data code word determined from the first and second probabilities;
3) selecting the watermark data code word that maximizes the score value and thereby provides an estimation of the watermark data.
6. The method of watermarking as described in Claim 5 wherein the transform domain representation corresponds to DCT transform domain having associated coefficients.
7. The method of watermarking as described in Claim 6 further comprising the step of ordering the first sub-set of transform coefficients in the DCT domain representation according to the order of off-diagonals and selecting the first sub-set of transform coefficients corresponding to off-diagonals wherein groups of the first sub-set of transform coefficients have common distortion and variance characteristics.
8. The method as described in Claim 7 further comprising the step of ordering the transform domain representation coefficients prior to selection of the second sub-set of coefficients wherein the second sub-set of coefficients have the same transform domain location as the first sub-set of coefficients.
9. The method as described in Claim 8 wherein the step of extracting the watermark coded data is performed using Hadamard decoding.
10. A system of watermarking digital image data comprising:
a first digital image processor for: transforming the digital image data into a transform domain representation to generate first transform coefficients; encoding watermark data using an error correcting code to generate encoded watermark data; embedding the encoded watermark data into a first sub-set of the first transform coefficients to generate a sub-set of embedded transform coefficients by using scaled bin encoding; and inversely transforming the watermark embedded first sub-set of coefficients along with non-watermark embedded first transform coefficients to generate watermarked digital image data which includes the digital image data and the watermark data; and
a second digital image processor for: transforming the watermarked image data into the transform domain representation of second transform coefficients; selecting a second sub-set of the second transform coefficients which correspond to the first sub-set of transform coefficients; estimating statistical parameters of the first sub-set of transform coefficients using image statistics of the second sub-set of coefficients; extracting embedded watermark data from the second sub-set of coefficients with a probability based decoding scheme which uses the predetermined scaling parameter, known aspects of the scaling and mapping steps, the noise and distortion model, and the estimated statistical parameters; and decoding the extracted watermark data using an error correcting decoder based on the error correcting code to generate the watermark data.
PCT/IB2001/001447 2000-08-18 2001-08-14 Method and system of watermarking digital data using scaled bin encoding and maximum likelihood decoding WO2002015586A2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
AU2001276620A AU2001276620A1 (en) 2000-08-18 2001-08-14 Method and system of watermarking digital data using scaled bin encoding and maximum likelihood decoding
JP2002520566A JP4226897B2 (en) 2000-08-18 2001-08-14 How to embed a digital watermark in digital image data
KR10-2003-7002275A KR20030024880A (en) 2000-08-18 2001-08-14 Method and system of watermarking digital data using scaled bin encoding and maximum likelihood decoding
EP01954279A EP1310098A2 (en) 2000-08-18 2001-08-14 Method and system of watermarking digital data using scaled bin encoding and maximum likelihood decoding

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/642,223 2000-08-18
US09/642,223 US6721439B1 (en) 2000-08-18 2000-08-18 Method and system of watermarking digital data using scaled bin encoding and maximum likelihood decoding

Publications (3)

Publication Number Publication Date
WO2002015586A2 WO2002015586A2 (en) 2002-02-21
WO2002015586A3 WO2002015586A3 (en) 2002-06-13
WO2002015586A9 true WO2002015586A9 (en) 2003-05-08

Family

ID=24575709

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2001/001447 WO2002015586A2 (en) 2000-08-18 2001-08-14 Method and system of watermarking digital data using scaled bin encoding and maximum likelihood decoding

Country Status (6)

Country Link
US (1) US6721439B1 (en)
EP (1) EP1310098A2 (en)
JP (1) JP4226897B2 (en)
KR (1) KR20030024880A (en)
AU (1) AU2001276620A1 (en)
WO (1) WO2002015586A2 (en)

Families Citing this family (55)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7644282B2 (en) 1998-05-28 2010-01-05 Verance Corporation Pre-processed information embedding system
US6737957B1 (en) 2000-02-16 2004-05-18 Verance Corporation Remote control signaling using audio watermarks
JP4005780B2 (en) * 2001-07-12 2007-11-14 興和株式会社 Digital watermark embedding and detection
JP3485911B2 (en) * 2001-12-17 2004-01-13 シャープ株式会社 Data usage restriction setting method, data usage restriction setting device, data usage restriction setting program, and recording medium recording the program
US7188248B2 (en) * 2002-07-09 2007-03-06 Kaleidescope, Inc. Recovering from de-synchronization attacks against watermarking and fingerprinting
US7702101B2 (en) * 2002-07-09 2010-04-20 Kaleidescape, Inc. Secure presentation of media streams in response to encrypted digital content
AU2003282763A1 (en) 2002-10-15 2004-05-04 Verance Corporation Media monitoring, management and information system
US7460684B2 (en) 2003-06-13 2008-12-02 Nielsen Media Research, Inc. Method and apparatus for embedding watermarks
US20060239501A1 (en) 2005-04-26 2006-10-26 Verance Corporation Security enhancements of digital watermarks for multi-media content
WO2005076985A2 (en) * 2004-02-04 2005-08-25 Digimarc Corporation Digital watermarking image signals on-chip and photographic travel logs through digital watermarking
KR101087588B1 (en) 2004-07-02 2011-11-29 닐슨 미디어 리서치 인코퍼레이티드 Methods And Apparatus For Mixing Compressed Digital Bit Streams
US8020004B2 (en) 2005-07-01 2011-09-13 Verance Corporation Forensic marking using a common customization function
KR100685974B1 (en) * 2005-07-04 2007-02-26 엘지전자 주식회사 Apparatus and method for watermark insertion/detection
US8781967B2 (en) 2005-07-07 2014-07-15 Verance Corporation Watermarking in an encrypted domain
FR2894759A1 (en) * 2005-12-12 2007-06-15 Nextamp Sa METHOD AND DEVICE FOR FLOW TATTOO
US8078301B2 (en) 2006-10-11 2011-12-13 The Nielsen Company (Us), Llc Methods and apparatus for embedding codes in compressed audio data streams
WO2008107511A1 (en) * 2007-03-08 2008-09-12 Nokia Corporation Interoperability of digital broadband broadcasting and cellular communication systems
US9349153B2 (en) 2007-04-25 2016-05-24 Digimarc Corporation Correcting image capture distortion
US20090060257A1 (en) * 2007-08-29 2009-03-05 Korea Advanced Institute Of Science And Technology Watermarking method resistant to geometric attack in wavelet transform domain
US7974437B2 (en) * 2007-11-19 2011-07-05 Seiko Epson Corporation Identifying steganographic data in an image
US8081823B2 (en) * 2007-11-20 2011-12-20 Seiko Epson Corporation Segmenting a string using similarity values
US8031905B2 (en) * 2007-11-21 2011-10-04 Seiko Epson Corporation Extracting data from images
US8243981B2 (en) * 2007-11-26 2012-08-14 Seiko Epson Corporation Identifying embedded data in an image
US8009862B2 (en) * 2007-11-27 2011-08-30 Seiko Epson Corporation Embedding data in images
JP2011151776A (en) * 2009-12-25 2011-08-04 Canon Inc Information processing apparatus, verification apparatus, and methods of controlling the same
US8838977B2 (en) 2010-09-16 2014-09-16 Verance Corporation Watermark extraction and content screening in a networked environment
CN103229213B (en) * 2010-11-29 2016-08-10 汤姆逊许可公司 The method and apparatus rebuilding the self similarity texture region of image
US9396509B1 (en) 2011-10-30 2016-07-19 Digimarc Corporation Closed form non-iterative watermark embedding
US8615104B2 (en) * 2011-11-03 2013-12-24 Verance Corporation Watermark extraction based on tentative watermarks
US8923548B2 (en) 2011-11-03 2014-12-30 Verance Corporation Extraction of embedded watermarks from a host content using a plurality of tentative watermarks
US8682026B2 (en) 2011-11-03 2014-03-25 Verance Corporation Efficient extraction of embedded watermarks in the presence of host content distortions
US8745403B2 (en) 2011-11-23 2014-06-03 Verance Corporation Enhanced content management based on watermark extraction records
US9323902B2 (en) 2011-12-13 2016-04-26 Verance Corporation Conditional access using embedded watermarks
US9401001B2 (en) 2014-01-02 2016-07-26 Digimarc Corporation Full-color visibility model using CSF which varies spatially with local luminance
US9449357B1 (en) 2012-08-24 2016-09-20 Digimarc Corporation Geometric enumerated watermark embedding for spot colors
US11263829B2 (en) 2012-01-02 2022-03-01 Digimarc Corporation Using a predicted color for both visibility evaluation and signal robustness evaluation
US9380186B2 (en) 2012-08-24 2016-06-28 Digimarc Corporation Data hiding for spot colors in product packaging
US11810378B2 (en) 2012-08-24 2023-11-07 Digimarc Corporation Data hiding through optimization of color error and modulation error
US9571606B2 (en) 2012-08-31 2017-02-14 Verance Corporation Social media viewing system
US9106964B2 (en) 2012-09-13 2015-08-11 Verance Corporation Enhanced content distribution using advertisements
US8869222B2 (en) 2012-09-13 2014-10-21 Verance Corporation Second screen content
US9262793B2 (en) 2013-03-14 2016-02-16 Verance Corporation Transactional video marking system
US9251549B2 (en) 2013-07-23 2016-02-02 Verance Corporation Watermark extractor enhancements based on payload ranking
US9208334B2 (en) 2013-10-25 2015-12-08 Verance Corporation Content management using multiple abstraction layers
US9565335B2 (en) 2014-01-02 2017-02-07 Digimarc Corporation Full color visibility model using CSF which varies spatially with local luminance
WO2015138798A1 (en) 2014-03-13 2015-09-17 Verance Corporation Interactive content acquisition using embedded codes
US10504200B2 (en) 2014-03-13 2019-12-10 Verance Corporation Metadata acquisition using embedded watermarks
JP6705596B2 (en) * 2014-08-12 2020-06-03 ディジマーク コーポレイション Data hiding for spot colors in product packaging
US9667829B2 (en) 2014-08-12 2017-05-30 Digimarc Corporation System and methods for encoding information for printed articles
KR20170043627A (en) 2014-08-20 2017-04-21 베란스 코오포레이션 Watermark detection using a multiplicity of predicted patterns
US9942602B2 (en) 2014-11-25 2018-04-10 Verance Corporation Watermark detection and metadata delivery associated with a primary content
EP3225034A4 (en) * 2014-11-25 2018-05-02 Verance Corporation Enhanced metadata and content delivery using watermarks
US9602891B2 (en) 2014-12-18 2017-03-21 Verance Corporation Service signaling recovery for multimedia content using embedded watermarks
JP2017177684A (en) 2016-03-31 2017-10-05 ブラザー工業株式会社 Control apparatus and computer program
JP2017182597A (en) 2016-03-31 2017-10-05 ブラザー工業株式会社 Control device and computer program

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4939515A (en) * 1988-09-30 1990-07-03 General Electric Company Digital signal encoding and decoding apparatus
US5646997A (en) * 1994-12-14 1997-07-08 Barton; James M. Method and apparatus for embedding authentication information within digital data
US5848155A (en) * 1996-09-04 1998-12-08 Nec Research Institute, Inc. Spread spectrum watermark for embedded signalling
US5915027A (en) * 1996-11-05 1999-06-22 Nec Research Institute Digital watermarking
JP3154325B2 (en) 1996-11-28 2001-04-09 日本アイ・ビー・エム株式会社 System for hiding authentication information in images and image authentication system
US6208735B1 (en) * 1997-09-10 2001-03-27 Nec Research Institute, Inc. Secure spread spectrum watermarking for multimedia data
US6332030B1 (en) * 1998-01-15 2001-12-18 The Regents Of The University Of California Method for embedding and extracting digital data in images and video

Also Published As

Publication number Publication date
AU2001276620A1 (en) 2002-02-25
US6721439B1 (en) 2004-04-13
JP2004507177A (en) 2004-03-04
WO2002015586A3 (en) 2002-06-13
EP1310098A2 (en) 2003-05-14
WO2002015586A2 (en) 2002-02-21
JP4226897B2 (en) 2009-02-18
KR20030024880A (en) 2003-03-26

Similar Documents

Publication Publication Date Title
US6721439B1 (en) Method and system of watermarking digital data using scaled bin encoding and maximum likelihood decoding
US8370635B2 (en) Synchronization of digital watermarks
JP4266677B2 (en) Digital watermark embedding method and encoding device and decoding device capable of using the method
Caldelli et al. Reversible watermarking techniques: An overview and a classification
US7360093B2 (en) System and method for authentication of JPEG image data
US6895101B2 (en) System and method for embedding information in digital signals
Qiao et al. Watermarking methods for MPEG encoded video: Towards resolving rightful ownership
Liu et al. High-performance JPEG steganography using complementary embedding strategy
Tagliasacchi et al. Hash-based identification of sparse image tampering
Chandramouli et al. Digital watermarking
US20040013268A1 (en) Method for authentication of JPEG image data
US7451317B2 (en) Apparatus for and method of embedding watermark into original information, transmitting watermarked information, and reconstructing the watermark
Hosam et al. Adaptive block‐based pixel value differencing steganography
Kumar et al. A reversible high capacity data hiding scheme using combinatorial strategy
US7493489B2 (en) System and method for authentication of JPEG image data
Mandal et al. A genetic algorithm based steganography in frequency domain (GASFD)
Ramanjaneyulu et al. An oblivious and robust multiple image watermarking scheme using genetic algorithm
US7886151B2 (en) Temporal synchronization of video and audio signals
US7627761B2 (en) System for authentication of JPEG image data
Korus et al. A scheme for censorship of sensitive image content with high-quality reconstruction ability
Liu et al. Watermarking for digital images
Kumar et al. An optimally robust digital watermarking algorithm for stereo image coding
Caragata et al. Fragile watermarking using chaotic sequences
Pei et al. Robustness enhancement for noncentric quantization-based image watermarking
Wöhnert et al. A study on the use of perceptual hashing to detect manipulation of embedded messages in images

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2002520566

Country of ref document: JP

Ref document number: 1020037002275

Country of ref document: KR

WWE Wipo information: entry into national phase

Ref document number: 2001954279

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1020037002275

Country of ref document: KR

COP Corrected version of pamphlet

Free format text: PAGES 1-20, DESCRIPTION, REPLACED BY NEW PAGES 1-19; PAGES 21-24, CLAIMS, REPLACED BY NEW PAGES 20-23; PAGES 1/9-9/9, DRAWINGS, REPLACED BY NEW PAGES 1/9-9/9

WWP Wipo information: published in national office

Ref document number: 2001954279

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWW Wipo information: withdrawn in national office

Ref document number: 2001954279

Country of ref document: EP