DIGITAL WATERMARKING AND BANKNOTES
Field of the Invention The present application relates to the use of digital watermarking in connection with banknotes and other security documents.
Background and Summary of the Invention The problem of casual counterfeiting of banknotes first arose two decades ago, with the introduction of color photocopiers. A number of techniques were proposed to address the problem. U.S. Patent 5,659,628 (assigned to Ricoh) is one of several patents noting that photocopiers can be equipped to recognize banknotes and prevent their photocopying. The Ricoh patent particularly proposed that the red seal printed on Japanese yen notes is a pattern well suited for machine recognition. U.S. Patents 5,845,008 (assigned to Omron), and 5,724,154 and 5,731,880 (both assigned to Canon) show other photocopiers that sense the presence of the seal emblem on banknotes, and disable a photocopier in response. U.S. Patent 5,678,155 (assigned to Sharp) notes that a great number of other features visible on banknotes (e.g. logos, stamps, signets, face values, symbols including $ and YEN, etc.) can similarly be used as the basis for banknote recognition.
Other technologies proposed that counterfeiting might be deterred by uniquely marking the printed output from each color photocopier, so that copies could be traced back to the originating machine. U.S. Patent 5,568,268, for example, discloses the addition of essentially imperceptible patterns of yellow dots to printed output; the pattern is unique to the machine. U.S. Patent 5,557,742 discloses a related arrangement in which the photocopier's serial number is printed on output documents, again in essentially imperceptible form (small yellow lettering). U.S. Patent 5,661,574 shows an arrangement in which bits comprising the photocopier's serial number are represented in the photocopier's printed output by incrementing, or decrementing, pixel values (e.g. of yellow pixels) at known locations by fixed amounts (e.g. +/-30), depending on whether the corresponding serial number bit is a "1" or a "0."
Recent advances in personal computers, and color scanning and printing technologies, have greatly increased the level of casual counterfeiting. High quality scanners are now readily available to many computer users, with 300 dpi scanners available for under $100, and 600 dpi scanners available for marginally more. Similarly, photographic quality color ink-jet printers are commonly available from Hewlett-Packard Co., Epson, etc. for under $300.
These tools pose new threats. For example, a banknote can be doctored (e.g. by white-out, scissors, or less crude techniques) to remove/obliterate the visible patterns on which prior art banknote detection techniques relied to prevent counterfeiting. Such a doctored document can then be freely scanned or copied, even on photocopiers designed to prevent processing of banknote
images. The removed pattern(s) can then be added back in, e.g. by use of digital image editing tools, permitting free reproduction of the banknote.
Digital watermarking technologies provide novel ways to address the problem of counterfeiting — overcoming many of the newly-emerging threats and providing other advantages not previously available.
Digital watermarking (sometimes simply "watermarking") is a quickly growing field of endeavor, with several different approaches. The present assignee's work is reflected in U.S. Patents 5,841,978, 5,768,426, 5,748,783, 5,748,763, 5,745,604, 5,710,834, 5,636,292, 5,721,788, and in laid-open PCT applications WO 95/14289, WO 96/36163, and W097/43736. Applicant's laid-open application WO 95/14289 discloses a system for redundantly embedding plural bit codes in various electronic and physical media using noise-like signals, and notes that the invention may be applied to paper currency. One embodiment alters the micro- topology of the surface of a medium to effect encoding. Another embodiment is a scanner that embeds identification codes in its scanned output data. Applicant's laid-open application WO 96/36163 discloses a reproduction kiosk for use in retail photography stores, permitting consumers to reproduce photographs. The kiosk examines the consumer's photograph for embedded watermark data (e.g. indicating that the photograph is copyrighted), and interrupts reproduction of the photograph if such watermark data is detected. This application also discloses that information can be encoded in vector graphics and very-low-order bit- mapped graphics by slightly changing a line's contour, e.g. slightly up, down, left or right.
Other digital watermarking work is illustrated by U.S. Patents 5,734,752, 5,646,997, 5,659,726, 5,664,018, 5,671,277, 5,687,191, 5,687,236, 5,689,587, 5,568,570, 5,572,247, 5,574,962, 5,579,124, 5,581,500, 5,613,004, 5,629,770, 5,461,426, 5,743,631, 5,488,664, 5,530,759,5,539,735, 4,943,973, 5,337,361, 5,404,160, 5,404,377, 5,315,098, 5,319,735, 5,337,362, 4,972,471, 5,161,210, 5,243,423, 5,091,966, 5,113,437, 4,939,515, 5,374,976, 4,855,827, 4,876,617, 4,939,515, 4,963,998, 4,969,041, and published applications WO 98/02864, EP 822,550, WO 97/39410, WO 96/36163, GB 2,196,167, EP 777,197, EP 736,860, EP 705,025, EP 766,468, EP 782,322, WO 95/20291 , WO 96/26494, WO 96/36935, WO 96/42151, WO 97/22206, WO 97/26733. Some of the foregoing patents relate to visible watermarking techniques. Other visible watermarking techniques (e.g. data glyphs) are described in U.S. Patents 5,706,364, 5,689,620, 5,684,885, 5,680,223, 5,668,636,
5,640,647, 5,594,809. (More typically, digital watermarks are steganographic - that is, they are essentially imperceptible to human vision, serving to hide the embedded data before the viewer's eyes.)
Most of the work in watermarking, however, is not in the patent literature but rather in published research. In addition to the patentees of the foregoing patents, some of the other workers in this field (whose watermark-related writings can by found by an author search in the INSPEC database) include I. Pitas, Eckhard Koch, Jian Zhao, Norishige Morimoto, Laurence Boney, Kineo
Matsui, A.Z. Tirkel, Fred Mintzer, B. Macq, Ahmed H. Tewfik, Frederic Jordan, Naohisa Komatsu, and Lawrence O'Gorman.
In Szepanski, A Signal Theoretic Method for Creating Forgery-Proof Documents for Automatic Verification, 1979 Carnahan Conference on Crime Countermeasures, the author noted that overprinting of documents (e.g. money bills) with complicated line patterns is a deterrent against forgery, but does not convey meaningful information usable in an automatic verification procedure. He thus proposed superimposing a pattern representing plural bit data onto a document instead. The document is partitioned into as many square regions as there are data bits to be represented, and a carrier pattern is chosen. In regions corresponding to a "1" bit, the carrier pattern is added to the underlying document samples; in regions corresponding to a "0" bit, the pattern is subtracted. An automatic verification procedure can then be employed to scan the document, correlate the scan data with the carrier pattern, and extract the digital data. The author particularly considers passports in which the passport photograph is so processed to convey the owner's name and date of birth. The artisan is assumed to be familiar with the foregoing prior art. In the present disclosure it should be understood that references to watermarking encompass not only the assignee's watermarking technology, but can likewise be practiced with any other watermarking technology, such as those indicated above.
Watermarking can be applied to myriad forms of information. The present disclosure focuses on its applications to banknotes, travelers checks, passports, stock certificates, and the like (hereafter collectively referred to as "security documents"), which are generally characterized by use of line art imagery. However, it should be recognized that the principles discussed below can also be applied outside this particular field.
Most of the prior art in image watermarking has focused on pixelated imagery (e.g. bitmapped images, JPEG/MPEG imagery, VGA/SVGA display devices, etc.). In most watermarking techniques, the luminance or color values of component pixels are slightly changed to effect subliminal encoding of binary data through the image. (This encoding can be done directly in the pixel domain, or in another domain, such as the DCT domain.) In some systems, isolated pixels are changed in accordance with one or more bits of the binary data; in others, plural domain-related groupings of pixels (e.g. locally adjoining, or corresponding to a given DCT component) are so changed. In all cases, however, pixels have served as the ultimate carriers of the embedded data. While pixelated imagery is a relatively recent development, line art goes back centuries. One familiar example is U.S. paper currency. On the one dollar banknote, for example, line art is used in several different ways. One is to form intricate webbing patterns around the margin of the note (generally comprised of light lines on dark background). Another is to form grayscale imagery, such as the portrait of George Washington (generally comprised of dark lines on a light background). There are two basic ways to simulate grayscales in line art. One is to change the relative spacings of the lines to effect a lightening or darkening of an image region. Fig. 1A shows such an
arrangement; area B looks darker than area A due to the closer spacings of the component lines. The other technique is to change the widths of the component lines — wider lines resulting in darker areas and narrower lines resulting in lighter areas. Fig. IB shows such an arrangement. Again, area B looks darker than area A, this time due to the greater widths of the component lines. These techniques are often used together.
In accordance with one aspect of the invention, security of a banknote (or other security document) is enhanced by marking its face with machine readable, generally imperceptible digital data by slightly altering the distribution of ink on the face of the banknote. The digital data can comprise plural bits, and may be encoded redundantly across the banknote (rather than marking the banknote at a single, localized region only.) The encoding can make use of a code signal or a discrete cosine transform. Two watermarks can advantageously be used in some embodiments, in which case these may be encoded with different robustness, or in accordance with different code signals. Some such banknotes may be provided with holograms as well, to further increase security. In accordance with another aspect of the invention, a banknote (or other security document) is encoded with plural-bit digital data so as to facilitate its later machine identification. This may be accomplished by receiving initial banknote artwork together with the plural-bit digital data. A pattern of data is next produced, in which the digital data is encoded. The initial artwork is then adjusted in accordance with this data, and the banknote is printed using the adjusted artwork. Again, the arrangement desirably effects a spatial spreading of the plural bits throughout the banknote. One such embodiment posits a virtual grid of points imposed on a line art image, with the points spaced at regular intervals in vertical and horizontal directions. (The horizontal and vertical intervals need not be equal.) The virtual points may be imposed over some or all of the bill at equal vertical and horizontal spacings of 250 μm. In regions of the banknote having line art, the component lines of the line art snake in and amongst these virtual grid points. Each grid point is considered to be the center of a corresponding region. The luminance of the region is a function of the proximity of any line(s) within the boundary of the region to the region's center point, and the thickness of the line(s).
To change the luminance of the region, the contour of the line(s) is changed slightly within the region. In particular, the line is made slightly thicker to decrease luminance; or thinner to increase luminance. (In this example, dark lines on light backgrounds are presumed.) The ability to effect these slight changes is then employed, in accordance with known pixelation-based watermarking techniques, to encode binary data in the line art. If such a banknote is thereafter scanned by a scanner, the values of the pixel data produced by the scanner will reflect the foregoing alterations in luminance values, permitting embedded watermark data to be decoded. In an alternative embodiment, the line widths are not changed. Instead, the positions of the lines are shifted slightly towards or away from certain virtual grid points to effect an increase or decrease in the corresponding area's luminosity, with the same effect.
In accordance with another aspect of the present invention, reproduction of a banknote (or other security document) is deterred by encoding the banknote with plural bit digital data. Sampled image data corresponding to such a banknote is thereafter provided, and analyzed to detect the presence of the digital data therein. If such data is detected, an appropriate intervention action is taken.
In one such embodiment, the sampled image data is also analyzed to detect the presence of a visible structure characteristic of a banknote. The intervention action can be triggered on the detection on both, or either, of the visible structure and/or the digital data.
The interventions can include preventing reproduction of the image data, sending a message to a remote location to report processing of banknote data, or inserting forensic tracing data in the image data.
In accordance with another aspect of the invention, reproduction of a banknote (or other security document) is deterred by examining suspect image data corresponding to the sampled optical luminance of an object, and determining whether the suspect image data corresponds to a banknote. Both of these operations are performed at a first site. If the suspect image data corresponds to a banknote, then a second, remote site is contacted and a report is sent regarding detection of banknote- related data.
In accordance with another aspect of the invention, output from the printer of a personal computer system is marked to permit its later identification. This method includes receiving (or intercepting) printer data being sent from the personal computer to the printer; and digitally watermarking this data with a machine readable, generally imperceptible, digital watermark (desirably spread throughout the printer data). This watermark serves to mark the printer data as having been printed by that particular printer, permitting the resultant document to be matched with that printer. In accordance with another aspect of the invention, an apparatus for processing (e.g. recognizing or validating) a banknote (or other security document) includes an optical scanner for producing image data corresponding to a note input to the apparatus, and a processor for detecting plural-bit digital watermark data within the image data. The detection of such watermark data indicates that the note is a banknote. If the watermark data is detected, a control unit responds in a first manner to change or limit the apparatus's operation (e.g. by halting reproduction, inserting forensic tracer information in output data, alerting a remote site or service, etc.). If the watermark data is not detected, the control unit responds in a second, different, manner. Scanners, photocopiers, and printers are examples of apparatuses that can employ such banknote-aware arrangements.
In accordance with another aspect of the invention, a cash processing apparatus includes an input for receiving plural banknote bills, and a feed mechanism for conveying bills from the input past an optical detector. The optical detector produces digital image data corresponding to the conveyed bills. This image data is then processed by a processor, which detects plural-bit digital
watermark data steganographically encoded in artwork on at least certain of said banknotes. A control unit responds in a manner dependent on the detected watermark data.
The foregoing and other features and advantages of the invention will be more readily apparent from the following detailed description, which proceeds with reference to the accompanying drawings.
Brief Description of the Drawings Figs. 1 A and IB show prior art techniques for achieving grayscale effects using line art. Fig. 2 shows a virtual array of grid points that can be imposed on an image according to one embodiment of the present invention.
Fig. 3 shows a virtual array of regions that can be imposed on an image according to the Fig. 2 embodiment.
Fig. 4 shows an excerpt of Fig. 3 with a line from a line art image passing therethrough. Fig. 5 shows changes to the width of the line of Fig. 3 to effect watermark encoding according to one embodiment of the present invention.
Fig. 6 shows changes to the position of the line of Fig. 3 to effect watermark encoding according to another embodiment of the present invention.
Fig. 7 is a block diagram of a photocopier according to another embodiment of the invention. Fig. 8 shows part of an automatic teller machine employing principles of the present invention.
Fig. 9 shows part of a device (e.g. a photocopier, scanner, or printer) employing principles of the present invention.
Fig. 10 shows part of a device (e.g. a photocopier, scanner, or printer) employing principles of the present invention.
Detailed Description Referring to Fig. 2, an illustrative form of the invention employs a grid 10 of imaginary reference points arrayed over a line art image. The spacing between points is 250 μm in the illustrated arrangement, but greater or lesser spacings can of course be used.
Associated with each grid point is a surrounding region 12, shown in Fig. 3. As described below, the luminosity (or reflectance) of each of these regions 12 is slightly changed to effect the subliminal encoding of binary data.
Region 12 can take various shapes; the illustrated rounded-rectangular shape is representative only. (The illustrated shape has the advantage of encompassing a fairly large area while introducing fewer visual artifacts than, e.g., square regions.) In other embodiments, squares, rectangles, circles, ellipses, etc., can alternatively be employed.
Fig. 4 is a magnified view of an excerpt of Fig. 3, showing a line 14 passing through the grid of points. The width of the line, of course, depends on the particular image of which it is a part. The illustrated line is about 25 μm in width; greater or lesser widths can naturally be used.
In a first embodiment of the invention, shown in Fig. 5, the width of the line is controllably varied so as to change the luminosity of the regions through which it passes. To increase the luminosity (or reflectance), the line is made narrower (i.e. less ink in the region). To decrease the luminosity, the line is made wider (i.e. more ink).
Whether the luminance in a given region should be increased or decreased depends on the particular watermarking algorithm used. Any algorithm can be used, by changing the luminosity of regions 12 as the algorithm would otherwise change the luminance or colors of pixels in a pixelated image. (Some watermarking algorithms effect their changes in a transformed domain, such as DCT, wavelet, or Fourier. However, such changes are ultimately manifested as changes in luminance or color.)
In an exemplary algorithm, the binary data is represented as a sequence of -Is and Is, instead of Os and Is. (The binary data can comprise a single datum, but more typically comprises several. In an illustrative embodiment, the data comprises 100 bits - some of which are error- correcting or -detecting bits.)
Each element of the binary data sequence is then multiplied by a corresponding element of a pseudo-random number sequence, comprised of -Is and Is, to yield an intermediate data signal. Each element of this intermediate data signal corresponds to a respective sub-part of the image, such as a region 12. Collectively, these elements form a pattern of binary data. (Typically each element of the intermediate data signal is mapped to several of the regions.)
The image in (and optionally around) each such region is analyzed to determine its relative capability to conceal embedded data, and a corresponding scale factor is produced. Exemplary scale factors may range from 0 to 3. The scale factor for the region is then multiplied by the element of the intermediate data signal corresponding to (i.e. mapped to) the region in order to yield a "tweak" (or "bias") value for the region. In the illustrated case, the resulting tweaks can range from -3 to 3. Collectively, these elements form a pattern of gray-scaled data.
The luminosity of the region on the banknote is then adjusted in accordance with the corresponding tweak value. A tweak value of -3 may correspond to a -5% change in luminosity; -2 may correspond to -2% change; -1 may correspond to -1% change; 0 may correspond to no change; 1 may correspond to +1% change; 2 may correspond to +2% change, and 3 may correspond to +5% change. (This example follows the basic techniques described in the Real Time Encoder embodiment disclosed in patent 5,710,834.) In Fig. 5, the watermarking algorithm determined that the luminance of region A should be reduced by a certain percentage, while the luminance of regions C and D should be increased by certain percentages.
In region A, the luminance is reduced by increasing the line width. In region D, the luminance is increased by reducing the line width; similarly in region C (but to a lesser extent).
No line passes through region B, so there is no opportunity to change the region's luminance. This is not fatal to the method, however, since the exemplary watermarking algorithm redundantly encodes each bit of data in sub-parts spaced throughout the line art image.
The changes to line widths in regions A and D of Fig. 5 are exaggerated for purposes of illustration. While the illustrated variance is possible, most implementations will typically modulate the line width 3 - 50% (increase or decrease).
(Many watermarking algorithms routinely operate within a signal margin of about +/- 1% changes in luminosity to effect encoding. That is, the "noise" added by the encoding amounts to just 1% or so of the underlying signal. Lines typically don't occupy the full area of a region, so a 10% change to line width may only effect a 1% change to region luminosity, etc. Banknotes are different from photographs in that the artwork generally need not convey photorealism. Thus, banknotes can be encoded with higher energy than is used in watermarking photographs, provided the result is still aesthetically satisfactory. To illustrate, localized luminance changes on the order of 10% are possible in banknotes, while such a level of watermark energy in photographs would generally be considered unacceptable. In some contexts, localized luminance changes of 20, 30, 50 or even 100% are acceptable.)
In the illustrated embodiment, the change to line width is a function solely of the tweak to be applied to a single region. Thus, if a line passes through any part of a region to which a tweak of 2% is to be applied, the line width in that region is changed to effect the 2% luminance difference. In variant embodiments, the change in line width is a function of the line's position in the region. In particular, the change in line width is a function of the distance between the region's center grid point and the line's closest approach to that point. If the line passes through the grid point, the full 2% change is effected. At successively greater distances, successively less change is applied. The manner in which the magnitude of the tweak changes as a function of line position within the region can be determined by applying one of various interpolation algorithms, such as the bi-linear, bicubic, cubic splines, custom curve, etc.
In other variant embodiments, the change in line width in a given region is a weighted function of the tweaks for adjoining or surrounding regions. Thus, the line width in one region may be increased or decreased in accordance with a tweak value corresponding to one or more adjoining regions.
Combinations of the foregoing embodiments can also be employed. In the foregoing embodiments, it is sometimes necessary to trade-off the tweak values of adjoining regions. For example, a line may pass along a border between regions, or pass through the point equidistant from four grid points ("equidistant zones"). In such cases, the line may be subject to conflicting tweak values — one region may want to increase the line width, while another may
want to decrease the line width. (Or both may want to increase the line width, but differing amounts.) Similarly in cases where the line does not pass through an equidistant zone, but the change in line width is a function of a neighborhood of regions whose tweaks are of different values. Again, known interpolation functions can be employed to determine the weight to be given the tweak from each region in determining what change is to be made to the line width in any given region. In the exemplary watermarking algorithm, the average change in luminosity across the image is zero, so no generalized lightening or darkening of the image is apparent. The localized changes in luminosity are so minute in magnitude, and localized in position, that they are essentially invisible (e.g. inconspicuous/subliminal) to human viewers. An alternative embodiment is shown in Fig. 6, in which line position is changed rather than line width.
In Fig. 6 the original position of the line is shown in dashed form, and the changed position of the line is shown in solid form. To decrease a region's luminosity, the line is moved slightly closer to the center of the grid point; to increase a region's luminosity, the line is moved slightly away. Thus, in region A, the line is moved towards the center grid point, while in region D it is moved away.
It will be noted that the line on the left edge of region A does not return to its nominal (dashed) position as it exits the region. This is because the region to the left of region A also is to have decreased luminosity. Where possible, it is generally preferable not to return a line to its nominal position, but instead permit shifted lines to remain shifted as they enter adjoining regions. So doing permits a greater net line movement within a region, increasing the embedded signal level.
Again, the line shifts in Fig. 6 are somewhat exaggerated. More typical line shifts are on the order of 3 - 50 μm.
One way to think of the Fig. 6 embodiment is to employ a magnetism analogy. The grid point in the center of each region can be thought of as a magnet. It either attracts or repels lines. A tweak value of -3, for example, may correspond to a strong-valued attraction force; a tweak value of +2 may correspond to a middle-valued repulsion force, etc. In Fig. 6, the grid point in region A exhibits an attraction force (i.e. a negative tweak value), and the grid point in region D exhibits a repulsion force (e.g. a positive tweak value). The magnetic analogy is useful because the magnetic effect exerted on a line depends on the distance between the line and the grid point. Thus, a line passing near a grid point is shifted more in position than a line near the periphery of the region.
(Actually, the magnetism analogy can serve as more than a conceptual tool. Instead, magnetic effects can be modeled in a computer program and serve to synthesize a desired placement of the lines relative to the grid points. Arbitrarily customized magnetic fields can be used.)
Each of the variants discussed above in connection with Fig. 5 is likewise applicable to Fig. 6.
10
Combinations of the embodiments of Figs. 5 and 6 can of course be used, resulting in increased watermark energy, better signal-to-noise ratio and, in many cases, less noticeable changes. In still a further embodiment, the luminance in each region is changed while leaving the line unchanged. This can be effected by sprinkling tiny dots of ink in otherwise- vacant parts of the region. In high quality printing, of the type used with banknotes, droplets on the order of 3μm in diameter can be deposited. (Still larger droplets are still beyond the perception threshold for most viewers.) Speckling a region with such droplets (either in a regular array, or random, or according to a desired profile such as Gaussian), can readily effect a 1% or so change in luminosity. (Usually dark droplets are added to a region, effecting a decrease in luminosity. Increases in luminosity can be effected by speckling with a light colored ink, or by forming light voids in line art otherwise present in a region.) (Actually, production realities often mean that many such microdots will not print, but statistically some will.)
In a variant of the speckling technique, very thin mesh lines can be inserted in the artwork - again to slightly change the luminance of one or more regions (so-called "background tinting"). To decode watermark data, the encoded line art image must be converted into electronic form for analysis. This conversion is typically performed by a scanner.
Scanners are well known, so a detailed description is not provided here. Suffice it to say that scanners conventionally employ a line of closely spaced photodetector cells that produce signals related to the amount of the light reflected from successive swaths of the image. Most inexpensive consumer scanners have a resolution of 300 dots per inch (dpi), or a center to center spacing of component photodetectors of about 84 μm. Higher quality scanners of the sort found in most professional imaging equipment and photocopiers have resolutions of 600 dpi (42μm), 1200 dpi (21 μm), or better.
Taking the example of a 300 dpi scanner (84 μm photodetector spacing), each 250 μm region 12 on the banknote will correspond to about a 3 x 3 array of photodetector samples.
Naturally, only in rare instances will a given region be physically registered with the scanner so that nine photodetector samples capture the luminance in that region, and nothing else. More commonly, the line art is skewed with respect to the scanner photodetectors, or is longitudinally misaligned (i.e. some photodetectors image sub-parts of two adjoining regions). However, since the scanner oversamples the regions, the luminance of each region can unambiguously be determined.
In one embodiment, the scanned data from the line art is collected in a two dimensional array and processed — according to one of the techniques disclosed in my prior patents and applications ~ to analyze the statistics of the data (using the techniques disclosed in my prior writings) to extract the bits of the embedded data. (Again, the reference to my earlier watermark decoding techniques is exemplary only. Once scanning begins and the data is available in pixel form, it is straightforward to apply any other watermark decoding technique to extract a correspondingly-encoded watermark.)
11
In a variant embodiment, the scanned data is not assembled in a complete array prior to the processing. Instead, it is processed in real-time, as it is generated, in order to detect embedded watermark data without delay. (Depending on the parameters of the scanner, it may be necessary to scan a half-inch or so of the line art image before the statistics of the resulting data unambiguously indicate the presence of a watermark.)
In accordance with another aspect of the invention, various hardware devices are provided with the capability to recognize embedded watermark data in any line art images they process, and to respond accordingly.
One example is a color photocopier. Such devices employ a color scanner to generate sampled (pixel) data corresponding to an input media (e.g. a dollar bill). If watermark data associated with a banknote is detected, the photocopier can take one or more steps.
One option is simply to interrupt copying, and display a message reminding the operator that it is illegal to reproduce currency.
Another option is to contact a remote service or site, and report the attempted reproduction of a banknote. Photocopiers with telephone dial-out capabilities are known in the art (e.g. patent 5,305,199) and are readily adapted to this purpose. Similarly, personal computers with modems or network communication capabilities can send a message to a remote location alerting of the attempted reproduction (e.g. over the internet). The remote service can be an independent service, or can be a government agency. Yet another option is to permit the copying, but to insert forensic tracer data in the resultant copy. This tracer data can take various forms. Steganographically encoded binary data is one example; the data embedding shown in patent 5,568,268 can also be used. The tracer data can memorialize the serial number of the machine that made the copy and/or the date and time the copy was made. To address privacy concerns, such tracer data is not normally inserted in photocopied output, but is so inserted only when the subject being photocopied is detected as being a banknote. (Such an arrangement is shown in Fig. 7.)
Desirably, the scan data is analyzed on a line-by-line basis in order to identify illicit photocopying with a minimum of delay. If a banknote is scanned, one or more lines of scanner output data may be provided to the photocopier's reprographic unit before the banknote detection decision has been made. In this case the photocopy will have two regions: a first region that is not tracer-marked, and a second, subsequent region in which the tracer data has been inserted.
Another hardware device that can employ the foregoing principles is a standalone scanner. A programmed processor (or dedicated hardware) inside the scanner analyzes the data being generated by the device, and responds accordingly. Yet another hardware device that can employ the foregoing principles is a printer. A processor inside the device analyzes graphical image data to be printed, looking for watermarks associated with banknotes.
12
For both the scanner and printer devices, response strategies can include disabling operation, or inserting tracer information. (Such devices typically do not have dial-out capabilities or network connections, although such capabilities of a conventional computer can be invoked by a scanner or printer device using known techniques, if desired.) Again, it is desirable to process the scanner or printer data as it becomes available, so as to detect any banknote processing with a minimum of delay. Again, there will be some lag time before a detection decision is made. Accordingly, the scanner or printer output will be comprised of two parts, one without the tracer data, and another with the tracer data.
In another embodiment (Fig. 10) an imperceptible watermark having a universal ID (UID) is inserted into all documents printed with a printer, scanned with a scanner, or reproduced by a photocopier. The UID is associated with the particular printer/photocopier/scanner in a registry database maintained by the products' manufacturers. The manufacturer can also enter in this database the name of the distributor to whom the product was initially shipped. Still further, the owner's name and address can be added to the database when the machine is registered for warranty service. While not preventing use of such machines in counterfeiting, the embedded UID facilitates identifying the machine that generated a counterfeit banknote. (This is an application in which a private watermark might best be used.)
Another advantageous use of digital watermarks is in high-confidence authentication of banknotes for, e.g., automatic teller machines that both accept — as well as dispense ~ cash. Referring to Fig. 8, such a machine (1 1) is provided with a known optical scanner (13) to produce digital data (15) corresponding to the face(s) of the bill (16). This image set (14) is then analyzed (16) to extract embedded watermark data. In watermarking technologies that require knowledge of a code signal (20) for decoding (e.g. noise modulation signal, crypto key, spreading signal, etc.), a bill may be watermarked in accordance with several such codes. Some of these codes are public - permitting their reading by conventional machines. Others are private, and are reserved for use by government agencies and the like. (C.f public and private codes in the present assignee's issued patents.)
As noted, banknotes presently include certain visible structures, or markings (e.g., the seal emblem noted in the earlier-cited patents), which can be used as aids to note authentication (either by visual inspection or by machine detection). Desirably, a note is examined by an integrated detection system (24), and confirms the presence of both the visible structures (22) mid the digitally watermarked data before concluding that a banknote is genuine.
Visible banknote structures can be sensed using known pattern recognition techniques. Examples of such techniques are disclosed in U.S. Patents 5,321,773, 5,390,259, 5,533,144, 5,539,841, 5,583,614, 5,633,952, 4,723,149 and 5,424,807 and laid-open foreign application EP 766,449.
13
Referring to Fig. 9, a photocopier (30), for example, can sense the presence of either a visible structure (32) or embedded banknote watermark data (34), and disable copying if either is present (36). Scanners and printers can be equipped with a similar capability - analyzing the data scanned or to be printed for either of these banknote hallmarks. If either is detected, the software (or hardware) disables further operation.
Identification of banknotes by watermark data provides an important advantage not otherwise available. As noted, an original bill can be doctored (e.g. by white-out, scissors, or less crude techniques) to remove/obliterate the visible structures. Such a document can then be freely copied on either a visible structure-sensing photocopier or scanner/printer installation. The removed visible structure can then be added in via a second printing/photocopying operation. If the printer is not equipped with banknote-disabling capabilities, image-editing tools can be used to insert visible structures back into image data sets scanned from such doctored bills, and the complete bill freely printed. By additionally including embedded watermark data in the banknote, and sensing same, such ruses will not succeed. (A similar ruse is to scan a banknote image on a non-banknote-sensing scanner. The resulting image set can then be edited by conventional image editing tools to remove/obliterate the visible structures. Such a data set can then be printed - even on a printer/photocopier that examines such data for the presence of visible structures. Again, the missing visible structures can be inserted by a subsequent printing/photocopying operation.) Desirably, the visible structure detector and the watermark detector are integrated together as a single hardware and/or software tool. This arrangement provides various economies, e.g., in interfacing with the scanner, manipulating pixel data sets for pattern recognition and watermark extraction, electronically re-registering the image to facilitate pattern recognition/watermark extraction, issuing control signals (e.g. disabling) signals to the photocopier/scanner, etc. While the foregoing applications disabled potential counterfeiting operations upon the detection of either a visible structure or watermarked data, in other applications, both criteria must be met before a banknote is recognized as genuine. Such applications typically involve the receipt or acceptance of banknotes, e.g. by ATMs as discussed above and illustrated in Fig. 8.
(To provide a comprehensive disclosure without unduly lengthening the following specification, applicants incorporate by reference the patent documents cited above.)
From the foregoing, it will be recognized that embodiments according to the present invention allow line art images to serve as subliminal carriers for binary data. Additionally, existing deterrents to banknote counterfeiting have been enhanced to prevent common work-arounds, and other advantages have been provided. Having described and illustrated the principles of my invention with reference to several illustrative embodiments, it will be recognized that these embodiments are exemplary only and should not be taken as limiting the scope of my invention. Guided by the foregoing teachings, it
14
should be apparent that other watermarking, decoding, and anti-counterfeiting technologies can be substituted for, and/or combined with, the elements detailed above to yield similar effects.
For example, while the invention has been described with reference to embodiments employing regular rectangular arrays of grid points, those skilled in the art will recognize that other arrays of points ~ neither rectangular nor regular ~ can alternatively be used.
Similarly, while the invention has been particularly illustrated with reference to banknotes, the principles thereof are equally applicable to other security documents.
While the invention has been described with reference to embodiments that scale the embedded energy in accordance with local image characteristics, in other embodiments a manually crafted energy profile can be implemented. That is, a mask defining embedded signal magnitudes at different parts of the image can be manually devised, and employed to tailor the change in luminance in each region.
Likewise, while one encoding technique has been particularly detailed, the invention is not so limited. Any technique for hiding plural-bit binary data in a security document, so that it can be detected from sampled optical data corresponding to the document, can likewise be employed. Not all such techniques involve slight alterations to the ink (e.g. color, density, distribution, etc.). For example, another approach is to digitally watermark the underlying medium (whether paper, polymer, etc.). This can be done by changing the microtopology of the medium (a la mini-Braille) to manifest the pattern of encoded data. Such texturing can be optically sensed and decoded using a an algorithm corresponding to the encoding algorithm that produced the patterned data. Another option is to employ a laminate on or within the banknote, where the laminate has the watermarking manifested thereon/therein. The laminate can be textured (as above), or its optical transmissivity can vary in accordance with a noise-like pattern that is the watermark, or a chemical property can similarly vary. Such approaches are detailed, e.g., in my laid-open PCT applications cited above. While the invention has been particularly illustrated with reference to certain hardware devices (e.g. scanners, photocopiers, and printers), and certain applications (deterring counterfeiting, and banknote authentication), the technology is not so limited. For example, many different devices can be equipped with the banknote recognition capabilities detailed herein. One example is cash- accepting vending machines, which examine bills for authenticity. Another is personal computers, which can examine data processed thereby to determine whether same originated with a banknote, and to intervene as appropriate.
Still other applications can also be served by this technology as well. One is banknote processing, e.g. the devices used by banks and issuing authorities for counting, sorting, and reading banknotes, and culling from circulation certain aged and counterfeit banknotes. Such a device can include an input (e.g. hopper) for receiving plural banknote bills, and a feed mechanism for conveying bills from the input past an optical detector. The optical detector produces digital image data corresponding to the conveyed bills. This image data is then processed by a processor, which
15
detects plural-bit digital watermark data steganographically encoded in artwork on at least certain of said banknotes. A control unit responds in a manner dependent on the detected watermark data. For example, the control unit can cause the device to sort bills by denomination or age, or cull counterfeits, based on the decoded data. Moreover, it will be recognized that the methods of the invention can be performed by hardware, software, or by a combination (e.g. some image processing operations performed by dedicated hardware, with others performed in software).
In still other embodiments, at least part of a watermark can be printed using photoluminescent ink. This allows, e.g., a merchant presented with a banknote to quickly verify the presence of *some* watermark-like indicia in/on the bill even without resort to a scanner and computer analysis (e.g. by examining under a black light). Such photoluminescent ink can also print human-readable indicia on the bill, such as the denomination of a banknote. (Since ink-jet printers and other common mass-printing technologies employ cyan/magenta/yellow/black to form colors, they can produce only a limited spectrum of colors. Photoluminescent colors are outside their capabilities. Fluorescent colors - such as the yellow, pink and green dyes used in highlighting markers - can similarly be used and have the advantage of being visible without a black light.) An improvement to the detailed, and other, encoding techniques is to add an iterative assessment of the robustness of the digital watermark, with a corresponding adjustment in a re- watermarking operation. Especially when encoding multiple bit watermarks, the characteristics of the underlying content (e.g. artwork) may result in some bits being more robustly (e.g. strongly) encoded than others. In an illustrative technique employing this improvement, a watermark is first encoded. Next, a trial decoding operation is performed. A confidence measure (e.g. signal-to-noise ratio) associated with each bit detected in the decoding operation is then assessed. The bits that appear weakly encoded are identified, and corresponding changes are made to the watermarking parameters to bring up the relative strengths of these bits. The object is then watermarked anew, with the changed parameters. This process can be repeated, as needed, until all of the bits comprising the encoded data are approximately equally detectable from the encoded object, or meet some predetermined signal-to-noise ratio threshold.
Banknotes, like most other media and objects, can generally benefit by use of multiple watermarks. For example, a banknote can be marked once in the spatial domain, and a second time in the spatial frequency domain. (It should be understood that any change in one domain has repercussions in the other. Here I reference the domain in which the change is directly effected.)
Another option is to mark a banknote (or other physical or digital object) with watermarks of two different levels of robustness, or strength. The more robust watermark withstands various types of corruption, and is detectable in the object even after multiple generations of intervening distortion. The less robust watermark can be made frail enough to fail with the first distortion of the object. In a banknote, for example, the less robust watermark serves as an authentication mark. Any
16
scanning and reprinting operation will cause it to become unreadable. Both the robust and the frail watermarks should be present in an authentic banknote; only the former will be present in a counterfeit.
Still another option is to encode two different watermarks in accordance with two different code signals. One watermark can identify an apparatus used in an attempted counterfeiting operation (e.g. by serial number). The other watermark can be used for other purposes.
The foregoing discussion has addressed various technological fixes to various problems, particularly illustrated in connection with banknotes. Exemplary solutions have been detailed. Others will be apparent to the artisan by applying common knowledge to extrapolate from the solutions provided above.
For example, the technology and solutions disclosed herein have made use of elements and techniques known from the cited references. Other elements and techniques from the cited references can similarly be combined to yield further implementations within the scope of the present invention. Thus, for example, holograms with or without watermark data can be employed in banknotes for additional security; single-bit watermarking can commonly be substituted for multi-bit watermarking; technology described as using imperceptible watermarks can sometimes be practiced using visible watermarks (glyphs, etc.); local scaling of watermark energy can be provided to enhance watermark signal-to-noise ratio without increasing human perceptibility; various filtering operations can be employed for various functions; encoding may proceed at the granularity of a single pixel (or DCT coefficient), or may similarly treat adjoining groups of pixels (or DCT coefficients); the encoding can be optimized to withstand expected forms of content corruption. Etc., etc., etc. (The cited variants are generally all further detailed in my laid-open PCT applications.) Thus, the exemplary embodiments are only selected samples of the solutions available by combining the teachings referenced above. The other solutions necessarily are not exhaustively described herein, but are fairly within the understanding of an artisan given the foregoing disclosure and familiarity with the cited art.
Miscellaneous
(Most of the following remarks were originally presented ~ in the U.S. priority applications ~ before the foregoing discussion concerning banknotes. Since the present claims emphasize banknote applications, the following material has been moved to the end of the specification for clarity's sake. While not specifically illustrated with reference to banknote applications, this material provides a helpful context within which the banknote applications may be more fully understood.) An improvement to existing watermark encoding techniques is to add an iterative assessment of the robustness of the mark, with a corresponding adjustment in a re-watermarking
17
operation. Especially when encoding multiple bit watermarks, the characteristics of the underlying content may result in some bits being more robustly (e.g. strongly) encoded than others. In an illustrative technique employing this improvement, a watermark is first embedded in an object. Next, a trial decoding operation is performed. A confidence measure (e.g. signal-to-noise ratio) associated with each bit detected in the decoding operation is then assessed. The bits that appear weakly encoded are identified, and corresponding changes are made to the watermarking parameters to bring up the relative strengths of these bits. The object is then watermarked anew, with the changed parameters. This process can be repeated, as needed, until all of the bits comprising the encoded data are approximately equally detectable from the encoded object, or meet some predetermined signal-to- noise ratio threshold.
While the foregoing analysis evaluated confidence on a per-bit basis, related iterative procedures can evaluate confidence on a per-portion basis. That is, the encoded object is considered in portions, and each portion is analyzed for the robustness of the data encoded thereby. In portions evidencing "weak" encoding, the encoding parameters can be adjusted to strengthen the encoding in one or more subsequent re-encoding operations.
These portions can take different forms, e.g., rectangular patches in a still or moving image; brief temporal excerpts in audio or video; certain DCT/Fourier/wavelet coefficients (or adjoining groups of coefficients) in coefficient-based representations of the object in a transformed domain, etc. By this technique, even if the encoded object is spatially or temporally excerpted, or filtered
(e.g. spectrally), there is increased assurance that the watermark energy remaining after such processing will permit accurate decoding.
In an illustrative embodiment, this iterative process is highly automated and essentially transparent to a user. The user simply instructs a computer-controlled system to watermark an object, and the system responds by performing the trial watermarking, decoding, making successive adjustments, and repeating as necessary until a final encoded object meeting desired watermark- quality requirements is produced.
Watermarking can be applied to myriad forms of information. These include imagery (including video) and audio - whether represented in digital form (e.g. an image comprised of pixels, digital video, MP3/MP4 audio, etc.), or in an analog representation (e.g. non-sampled music, printed imagery, banknotes, etc.) Watermarking can be applied to digital content (e.g. imagery, audio) either before or after compression. Watermarking can also be used in various "description" or "synthesis" language representations of content, such as Structured Audio, Csound, NetSound, SNHC Audio and the like {c.f http://sound.media.mit.edu/mpeg4/) by specifying synthesis commands that generate watermark data as well as the intended audio signal. Watermarking can also be applied to ordinary media, whether or not it conveys information. Examples include paper, plastics, laminates, paper/film emulsions, etc. A watermark can embed a single bit of information, or any number of bits.
18
The physical manifestation of watermarked information most commonly takes the form of altered signal values, such as slightly changed pixel values, picture luminance, picture colors, DCT coefficients, instantaneous audio amplitudes, etc. However, a watermark can also be manifested in other ways, such as changes in the surface micro-topology of a medium, localized chemical changes (e.g. in photographic emulsions), localized variations in optical density, localized changes in luminescence, etc. Watermarks can also be optically implemented in holograms and conventional paper watermarks.
One improvement to existing technology is to employ established web crawler services (e.g. AltaVista, Excite, or Inktomi) to search for watermarked content (on the Web, in internet news groups, BBS systems, on-line systems, etc.) in addition to their usual data collecting/indexing operations. Such crawlers can download files that may have embedded watermarks (e.g. *.JPG, *.WAV, etc.) for later analysis. These files can be processed, as described below, in real time. More commonly, such files are queued and processed by a computer distinct from the crawler computer. Instead of performing watermark-read operations on each such file, a screening technique can be employed to identify those most likely to be conveying watermark data. One such technique is to perform a DCT operation on an image, and look for spectral coefficients associated with certain watermarking techniques (e.g. coefficients associated with an inclined embedded subliminal grid). To decode spread-spectrum based watermarks, the analyzing computer requires access to the noise signal used to spread the data signal. In one embodiment, interested parties submit their noise/key signals to the crawler service so as to enable their marked content to be located. The crawler service maintains such information in confidence, and uses different noise signals in decoding an image (image is used herein as a convenient shorthand for imagery, video, and audio) until watermarked data is found (if present). This allows the use of web crawlers to locate content with privately-coded watermarks, instead of just publicly-coded watermarks, as is presently the case. The queuing of content data for analysis provides certain opportunities for computational shortcuts. For example, like-sized images (e.g. 256 x 256 pixels) can be tiled into a larger image, and examined as a unit for the presence of watermark data. If the decoding technique (or the optional pre-screening technique) employs a DCT transform or the like, the block size of the transform can be tailored to correspond to the tile size (or some integral fraction thereof). Blocks indicated as likely having watermarks can then be subjected to a full read operation. If the queued data is sorted by file name, file size, or checksum, duplicate files can be identified. Once such duplicates are identified, the analysis computer need consider only one instance of the file. If watermark data is decoded from such a file, the content provider can be informed of each URL at which copies of the file were found.
Some commentators have observed that web crawler-based searches for watermarked images can be defeated by breaking a watermarked image into sub-blocks (tiles). HTML instructions, or the like, cause the sub-blocks to be presented in tiled fashion, recreating the complete
19
image. However, due to the small size of the component sub-blocks, watermark reading is not reliably accomplished.
This attack is overcome by instructing the web-crawler to collect the display instructions (e.g. HTML) by which image files are positioned for display on a web page, in addition to the image files themselves. Before files collected from a web page are scrutinized for watermarks, they can be concatenated in the arrangement specified by the display instructions. By this arrangement, the tiles are reassembled, and the watermark data can be reliably recovered.
Another such postulated attack against web crawler detection of image watermarks is to scramble the image (and thus the watermark) in a file, and employ a Java applet or the like to unscramble the image prior to viewing. Existing web crawlers inspect the file as they find it, so the watermark is not detected. However, just as the Java descrambling applet can be invoked when a user wishes access to a file, the same applet can similarly be employed in a web crawler to overcome such attempted circumvention of watermark detection.
Although "content" can be located and indexed by various web crawlers, the contents of the "content" are unknown. A *.JPG file, for example, may include pornography, a photo of a sunset, etc.
Watermarks can be used to indelibly associate meta-data within content (as opposed to stored in a data structure that forms another part of the object, as is conventionally done with metadata). The watermark can include text saying "sunset" or the like. More compact information representations can alternatively be employed (e.g. coded references). Still further, the watermark can include (or consist entirely of) a Unique ID (UID) that serves as an index (key) into a network- connected remote database containing the meta data descriptors. The remote data may contain metadata described by an extensible markup language (XML) tag set. By such arrangements, web crawlers and the like can extract and index the meta-data descriptor tags, allowing searches to be conducted based on semantic descriptions of the file contents, rather than just by file name.
Existing watermarks commonly embed information serving to communicate copyright information. Some systems embed text identifying the copyright holder. Others embed a UID which is used as an index into a database where the name of the copyright owner, and associated information, is stored. Looking ahead, watermarks should serve more than as silent copyright notices. One option is to use watermarks to embed "intelligence" in content. One form of intelligence is knowing its "home." "Home" can be the URL of a site with which the content is associated. A photograph of a car, for example, can be watermarked with data identifying the web site of an auto-dealer that published the image. Wherever the image goes, it serves as a link back to the original disseminator. The same technique can be applied to corporate logos. Wherever they are copied on the internet, a suitably-equipped browser or the like can decode the data and link back to the corporation's home
20
page. (Decoding may be effected by positioning the cursor over the logo and pressing the right- mouse button, which opens a window of options - one of which is Decode Watermark.)
To reduce the data load of the watermark, the intelligence need not be wholly encoded in the content's watermark. Instead, the watermark can again provide a UID - this time identifying a remote database record where the URL of the car dealer, etc., can be retrieved. In this manner, images and the like become marketing agents - linking consumers with vendors (with some visual salesmanship thrown in). In contrast to the copyright paradigm, in which dissemination of imagery was an evil sought to be tracked and stopped, dissemination of the imagery can now be treated as a selling opportunity. A watermarked image becomes a portal to a commercial transaction. (Using an intermediate database between a watermarked content file and its ultimate home
(i.e. indirect linking) serves an important advantage: it allows the disseminator to change the "home" simply by updating a record in the database. Thus, for example, if one company is acquired by another, the former company's smart images can be made to point to the new company's home web page by updating a database record. In contrast, if the old company's home URL is hard-coded (i.e. watermarked) in the object, it may point to a URL that eventually is abandoned. In this sense, the intermediate database serves as a switchboard that couples the file to its current home.
The foregoing techniques are not limited to digital content files. The same approach is equally applicable with printed imagery, etc. A printed catalog, for example, can include a picture illustrating a jacket. Embedded in the picture is watermarked data. This data can be extracted by a simple hand-scanner/decoder device using straightforward scanning and decoding techniques (e.g. those known to artisans in those fields). In watermark-reading applications employing hand-scanners and the like, it is important that the watermark decoder be robust to rotation of the image, since the catalog photo will likely be scanned off-axis. One option is to encode subliminal graticules (e.g. visualization synchronization codes) in the catalog photo so that the set of image data can be post- processed to restore it to proper alignment prior to decoding.
The scanner/decoder device can be coupled to a modem-equipped computer, a telephone, or any other communications device. In the former instance, the device provides URL data to the computer's web browser, linking the browser to the catalog vendor's order page. (The device need not include its own watermark decoder; this task can be performed by the computer.) The vendor's order page can detail the size and color options of the jacket, inventory availability, and solicit ordering instructions (credit card number, delivery options, etc.) - as is conventionally done with online merchants. Such a device connected to a telephone can dial the catalog vendor's toll-free automated order-taking telephone number (known, e.g., from data encoded in the watermark), and identify the jacket to the order center. Voice prompts can then solicit the customer's choice of size, color, and delivery options, which are input by Touch Tone instructions, or by voiced words (using known voice recognition software at the vendor facility).
21
In such applications, the watermark may be conceptualized as an invisible bar code employed in a purchase transaction. Here, as elsewhere, the watermark can serve as a seamless interface bridging the print and digital worlds
Another way of providing content with intelligence is to use the watermark to provide Java or ActiveX code. The code can be embedded in the content, or can be stored remotely and linked to the content. When the watermarked object is activated, the code can be executed (either automatically, or at the option of the user). This code can perform virtually any function. One is to "phone home" - initiating a browser and linking to the object's home The object can then relay any manner of data to its home. This data can specify some attribute of the data, or its use. The code can also prevent accessing the underlying content until permission is received. An example is a digital movie that, when double-clicked, automatically executes a watermark-embedded Java applet which links through a browser to the movie's distributor. The user is then prompted to input a credit card number. After the number has been verified and a charge made, the applet releases the content of the file to the computer's viewer for viewing of the movie. Support for these operations is desirably provided via the computer's operating system, or plug-in software.
Similar functionality can be provided with an operating system to trigger special treatment of watermarked content, provided the operating system is aware of the content type (e.g. in an object- oriented file system, or in a file system that is "media aware"). One exemplary application is in the acquisition of content from external audio/image/video devices. Such arrangements can also be used to collect user-provided demographic information when smart image content is accessed by the consumer of the content. The demographic information can be written to a remote database and can be used for market research, customization or personalization of information about the content provided to the consumer, sales opportunities, advertising, etc.
In audio and video and the like, watermarks can serve to convey related information, such as links to WWW fan sites, actor biographies, advertising for marketing tie-ins (T-shirts, CDs, concert tickets). In such applications, it is desirable (but not necessary) to display on the user interface (e.g. screen) a small logo to signal the presence of additional information. When the consumer selects the logo via some selection device (mouse, remote control button, etc.), the information is revealed to the consumer, who can then interact with it. Much has been written (and patented) on the topic of asset rights management. Sample patent documents include U.S. patents 5,715,403, 5,638,443, 5,634,012, 5,629,980. Again, much of the technical work is memorialized in journal articles, which can be identified by searching for relevant company names and trademarks such as IBM's Cryptolope system, Portland Software's ZipLock system, the Rights Exchange service by Softbank Net Solutions, and the DigiBox system from InterTrust Technologies.
An exemplary asset management system makes content available (e.g. from a web server, or on a new computer's hard disk) in encrypted form. Associated with the encrypted content is data
22
identifying the content (e.g. a preview) and data specifying various rights associated with the content. If a user wants to make fuller use of the content, the user provides a charge authorization (e.g. a credit card) to the distributor, who then provides a decryption key, allowing access to the content. (Such systems are often realized using object-based technology. In such systems, the content is commonly said to be distributed in a "secure container.")
Other asset management systems deliver content "in the clear" - not encrypted or in a secure container. Such arrangements are common, e.g., in enterprise asset management systems (as opposed, for example, to e-commerce applications where container-based approaches are prevalent).
Desirably, the content should be marked (personalized/serialized) so that any illicit use of the content can be tracked. This marking can be performed with watermarking, which assures that the mark travels with the content wherever — and in whatever form — it may go. The watermarking can be effected by the distributor - prior to dissemination of the (encrypted) object - such as by encoding a UID that is associated in a database with that particular container. When access is provided to the content (e.g. an access right is granted to a secure container, or non-encrypted content is made available to a user), the database record can be updated to reflect the purchaser/user, the purchase date, the rights granted, etc. An alternative is to include a watermark encoder in the software tool used to access the content (e.g. a decryption engine, for secure container-based systems). Such an encoder can embed watermark data in the content before it is provided to the user (e.g. before it is released from the container, in secure container systems). The embedded data can include a UID, as described above. This UID can be assigned by the distributor prior to disseminating the content/container. Alternatively, the UID can be a data string not known or created until access (or the right to access) has been granted. In addition to the UID, the watermark can include other data not known to the distributor, e.g. information specific to the time(s) and manner(s) of accessing the content. In still other non-container-based systems, access rights can again be implemented realized with watermarks. Full resolution images, for example, can be freely available on the web. If a user wishes to incorporate the imagery into a web page or a magazine, the user can interrogate the imagery as to its terms and conditions of use. This may entail linking to a web site specified by the embedded watermark (directly, or through an intermediate database), which specifies the desired information. The user can then arrange the necessary payment, and use the image knowing that the necessary rights have been secured.
Tagging of image/video/audio assets at the time of their end-user distribution (e.g. from a managed content repository) need not consist exclusively of a UID. Other data can also be watermarked into the content, e.g. a serial number, the identity of the recipient, the date of distribution, etc. Additionally, the watermarked data can serve to establish a persistent link to metadata contained in an associated asset management database.
23
As noted, digital watermarks can also be realized using conventional (e.g. paper) watermarking technologies. Known techniques for watermarking media (e.g. paper, plastic, polymer) are disclosed in U.S. patents 5,536,468, 5,275,870, 4,760,239, 4,256,652, 4,370,200, and 3,985,927 and can be adapted to display of a visual watermark instead of a logo or the like. Note that some forms of traditional watermarks which are designed to be viewed with transmissive light can also show up as low level signals in reflective light, as is typically used in scanners. Transmissive illumination detection systems can also be employed to detect such watermarks, using optoelectronic traditional-watermark detection technologies known in the art.
As also noted, digital watermarks can be realized as part of optical holograms. Known techniques for producing and securely mounting holograms are disclosed in U.S. patents 5,319,475, 5,694,229, 5,492,370, 5,483,363, 5,658,411 and 5,310,222. To watermark a hologram, the watermark can be represented in the image or data model from which the holographic diffraction grating is produced. In one embodiment, the hologram is produced as before, and displays an object or symbol. The watermark markings appear in the background of the image so that they can be detected from all viewing angles. In this context, it is not critical that the watermark representation be essentially imperceptible to the viewer. If desired, a fairly visible noise-like pattern can be used without impairing the use to which the hologram is put.
Digital watermarks can also be employed in conjunction with labels and tags. In addition to conventional label/tag printing processes, other techniques - tailored to security - can also be employed. Known techniques useful in producing security labels/tags are disclosed in U.S. patents 5,665,194, 5,732,979, 5,651,615, and 4,268,983. The imperceptibility of watermarked data, and the ease of machine decoding, are some of the benefits associated with watermarked tags/labels. Additionally, the cost is far less than many related technologies (e.g. holograms). Watermarks in this application can be used to authenticate the originality of a product label, either to the merchant or to the consumer of the associated product, using a simple scanner device, thereby reducing the rate of counterfeit product sales.
Conveniently ignoring for the time being the inevitable social ramifications of the following apparatus, consider a steganographic setup whereby a given computing system which is based on a "master clock" driving function can, with a small modification, impress a signal upon its electrical operations that could uniquely identify the computing system when that system communicates digitally with a second computing system. The slight modification referred to would be such that it would not interfere at all with the basic operation of the first computing system, nor interfere with the communications between the two systems.
The social ramifications referred to allude to the idea that there still exists the notion of pure anonymity within an internet context and the associated idea of the right to privacy. The "good" of this notion seems to be that non-anonymity can be the seed for exploitation, while the "bad" of this notion seems to be an illusory escape from responsibility for one's own actions. Be this as it all may,
24
there still seems to be ground for the socially neutral concept of simply identifying a given physical object relative to its neighbors, hence the following approach to impressing a unique signal onto a computing system such that a second system in communication with the first can identify the first.
Using spread spectrum principles (e.g. where a "noise like" carrier signal is modulated by an information signal to yield a data signal), the instantaneous phase of a computer's clock signal is modulated to convey steganographically encoded data. In one embodiment, the phase of the clock frequency is instantaneously varied in accordance with the data signal. (In some embodiments, the data signal needn't reflect modulation with a noise-like carrier signal; the unaltered information signal can be sent.) In an illustrative system, the period of the clock signal is 3 nanoseconds, and the instantaneous phase shifting is on the order of +/-10 picoseconds. (These figures will, of course, depend on the particular microprocessor used, and its tolerance for phase noise while still performing within specification.) If a binary "1" is to be encoded, the phase is advanced 10 picoseconds. If a binary "0" is to be encoded, the phase is retarded 10 picoseconds. Successive clock edges are successively advanced or retarded in accordance with corresponding bits of the data signal until the end of data is reached. Then the data signal desirably repeats, so as to provide redundant encoding. Such modulation of the computer's clock system will be manifested throughout its operation, including in its communication with connected devices. Thus, for example, a modem or network connection will convey data that is influenced by the phase modulation of the clock signal. In particular, the signal transitions in any such communication (e.g. RS-232, TCP/IP, wireless modem broadcasts, etc.) will not occur on uniformly-spaced clock edges, but instead will occur on clock edges whose exact temporal characteristics vary slightly in accordance with the data signal with which the clock has been modulated.
A remote computer that receives information sent by such a computer can examine the timing of the signal (e.g. its edge transitions), and discern thereby the encoded data signal. If the channel to the remote computer offers near perfect temporal fidelity (i.e. it introduces essentially no variable delay into the data's propagation), the data can be extracted by simply noting the instantaneous offset of each edge transition from its expected nominal value, and a corresponding element of the data signal is thereby identified. If the data signal was modulated by a noise-like signal, demodulation with the same noise-like signal will provide the original information signal. (No such demodulation with a noise signal is required if the original information signal served as the basis for the clock phase shifting.)
More typically, the channel to the remote computer introduces temporal distortion. An example is the internet, where different packets may follow different routings between the originating computer and the receiving computer. In such environments, a large number of data must be collected and processed to extract the data signal. For example, it may be necessary to average, or otherwise process, thousands of edge transitions corresponding to a given bit in order to determine whether the edge was advanced or retarded, and thereby decode that bit with a high degree of
25
statistical assurance. Thus, in environments with highly variable transmission latencies (e.g. the cited internet example), a message of 100 bits might require collection and processing of on the order of a million edge transitions to assure reliable message recovery. However, given the acceleration of data rates on the internet, this may require only a few seconds of data. In an illustrative embodiment, the master clock of the computer is controlled by a phase locked loop (PLL) circuit configured as a phase modulator and driven by the data signal. Decoding is performed by another PLL circuit, configured as a phase demodulator. (The two PLLs needn't share a common time base, since decoding can proceed by examining deviations of instantaneous edge transitions from the average clock period.) The design of such PLL phase modulators and demodulators is within the routine capabilities of an artisan in this field.
In view of the many possible embodiments to which the principles of the invention may be put, it should be recognized that the detailed embodiments are illustrative only and should not be taken as limiting the scope of my invention. Rather, I claim as my invention all such embodiments as may come within the scope and spirit of the following claims and equivalents thereto.