[go: nahoru, domu]

US20050177361A1 - Multi-band spectral audio encoding - Google Patents

Multi-band spectral audio encoding Download PDF

Info

Publication number
US20050177361A1
US20050177361A1 US11/100,291 US10029105A US2005177361A1 US 20050177361 A1 US20050177361 A1 US 20050177361A1 US 10029105 A US10029105 A US 10029105A US 2005177361 A1 US2005177361 A1 US 2005177361A1
Authority
US
United States
Prior art keywords
frequency
signal
audio
indices
block
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US11/100,291
Inventor
Venugopal Srinivasan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
TNC US Holdings Inc
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to US11/100,291 priority Critical patent/US20050177361A1/en
Publication of US20050177361A1 publication Critical patent/US20050177361A1/en
Assigned to NIELSEN MEDIA RESEARCH, INC, reassignment NIELSEN MEDIA RESEARCH, INC, ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SRINIVASAN, VENUGOPAL
Assigned to CITIBANK, N.A., AS COLLATERAL AGENT reassignment CITIBANK, N.A., AS COLLATERAL AGENT SECURITY AGREEMENT Assignors: AC NIELSEN (US), INC., BROADCAST DATA SYSTEMS, LLC, NIELSEN MEDIA RESEARCH, INC., VNU MARKETING INFORMATION, INC.
Assigned to THE NIELSEN COMPANY (US), LLC, VNU MARKETING INFORMATION, INC. reassignment THE NIELSEN COMPANY (US), LLC RELEASE (REEL 018207 / FRAME 0607) Assignors: CITIBANK, N.A.
Abandoned legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H20/00Arrangements for broadcast or for distribution combined with broadcast
    • H04H20/28Arrangements for simultaneous broadcast of plural pieces of information
    • H04H20/30Arrangements for simultaneous broadcast of plural pieces of information by a single channel
    • H04H20/31Arrangements for simultaneous broadcast of plural pieces of information by a single channel using in-band signals, e.g. subsonic or cue signal

Definitions

  • the present invention relates to a system and method for adding an inaudible code to an audio signal and for subsequently retrieving that code.
  • a code may be used, for example, in an audience measurement application in order to identify a broadcast program.
  • ancillary codes can be hidden in non-viewable portions of video by inserting the codes into either the video's vertical blanking interval or the video's horizontal retrace interval.
  • An exemplary system that hides codes in non-viewable portions of video is referred to as “AMOL” and is taught in U.S. Pat. No. 4,025,851. This system is used by the assignee of the present application in order to monitor broadcasts of television programming as well as the times of such broadcasts.
  • Audio encoding has the obvious advantage of being applicable not only to television, but also to radio broadcasts and to prerecorded music. Moreover, the speaker of a receiver reproduces, in the audio signal output, the ancillary codes that are added to audio signals. Accordingly, audio encoding offers the possibility of non-intrusive interception (i.e., interception of the codes without intrusion into the interior of the receiver) and of decoding the codes with equipment that has microphones as inputs. Moreover, audio encoding permits the measurement of broadcast audiences by the use of portable metering equipment carried by panelists.
  • Jensen et al. in U.S. Pat. No. 5,450,490, teach an arrangement for adding a code at a fixed set of frequencies and using one of two masking signals. The choice of masking signal is made on the basis of a frequency analysis of the audio signal to which the code is to be added. Jensen et al. do not teach arrangements for selecting a maximum acceptable code energy to be used in each of a predetermined set of frequency intervals, nor do Jensen et al. teach energy exchange coding which transfers energy between spectral components and which thereby holds the total acoustic energy constant.
  • Preuss et al. in U.S. Pat. No. 5,319,735, teach a multi-band audio encoding arrangement in which a spread spectrum code is inserted in recorded music at a fixed ratio to the input signal intensity (code-to-music ratio) that is preferably 19 dB.
  • Lee et al. in U.S. Pat. No. 5,687,191, teach an audio coding arrangement suitable for use with digitized audio signals. The code intensity is made to match the input signal by calculating a signal-to-mask ratio in each of several frequency bands and by then inserting the code at an intensity that is a predetermined ratio of the audio input in that band.
  • Lee et al. has also described a method of embedding digital information in a digital waveform in U.S. Pat. No. 5,824,360.
  • Jensen et al. in U.S. Pat. No. 5,764,763, teach a method in which code signals consisting of sinusoidal waves at ten pre-selected frequencies in a high resolution spectrum are added to the original audio in order to represent either a binary bit (0 or 1) and the start and end of an embedded message. Forty unique frequencies are required for encoding these four symbols. Their values range from 1046.9 Hz to 2851.6 Hz in a typical practical embodiment. The frequency separation between adjacent lines in the spectrum is 4 Hz and the minimum separation between frequencies selected to constitute the set of 40 frequencies is 8 Hz.
  • the amplitude of the injected code signal is controlled by a masking analysis. In the decoding process, the injected code signal is distinguished by the fact that its level will be significantly above a noise level computed for a band of frequencies.
  • ancillary codes are preferably inserted at low intensities in order to prevent the codes from distracting a listener of program audio, such codes may be vulnerable to various signal processing operations as well as to interference from extraneous electromagnetic sources.
  • Lee et al. discuss digitized audio signals
  • many of the earlier known approaches to encoding a broadcast audio signal are not compatible with current and proposed digital audio standards, particularly those employing signal compression methods that may reduce the signal's dynamic range (and thereby delete a low level code) or that otherwise may damage an ancillary code.
  • U.S. patent application Ser. No. 09/116,397 filed Jul. 16, 1998 and U.S. patent application Ser. No. 09/428,425 filed Oct. 27, 1999 disclose a system and method for inserting a code into an audio signal so that the code is likely to survive compression and decompression as required by current and proposed digital audio standards.
  • Spectral modulation of the amplitude or phase of the signal at selected code frequencies is used to insert the code into the audio signal.
  • These selected code frequencies which could comprise multiple frequency sets within a given audio block, may be varied from audio block to audio block, and the spectral modulation may be implemented as amplitude modulation, modulation by frequency swapping, phase modulation, and/or odd/even index modulation.
  • an approach is taught to measuring audio quality of each block and of suspending encoding in cases where the code might be audible to a listener.
  • codes are added by manipulating pairs of frequencies that are spaced apart by about 100 Hz. These systems are thus vulnerable to interference, such as reverberation or multi-path distortion, that affect one of the encoded frequencies substantially more than the other.
  • the present invention is arranged to solve one or more of the above noted problems.
  • a system for adding an interference-resistant, inaudible code to an audio signal comprises a sampler, a processor, a frequency transformation, a frequency selector, and an encoder.
  • the sampler is arranged to sample the audio signal at a sampling rate and to generate therefrom a plurality of short blocks of sampled audio, where each of the short blocks has a duration less than a minimum audibly perceivable signal delay.
  • the processor is arranged to combine the plurality of short blocks into a long block having a predetermined minimum duration.
  • the frequency transformation is arranged to transform the long block into a frequency domain signal comprising a plurality of independently modulatable frequency indices, where a frequency difference between two adjacent ones of the indices is determined by the minimum duration and the sampling rate.
  • the frequency selector is arranged to select a neighborhood of frequency indices so that the frequency difference between a lowest index and a highest index within the neighborhood is less than a predetermined value.
  • the encoder is arranged to modulate two or more of the indices in the neighborhood so as to make a selected one of the indices an extremum while keeping the total energy of the neighborhood constant.
  • a method to add a code to a frequency band of a sampled audio portion of a composite signal without thereby introducing a perceptible delay between the encoded audio portion and another portion of the composite signal.
  • the method comprises the steps of: a) selecting a sampling rate and a frequency difference between adjacent ones of a predetermined number of frequency indices included in a frequency neighborhood; b) determining from the sampling rate and from the frequency difference a duration of a block of samples; c) determining an integral number of sequential sub-blocks to make up the block, where the integral number is selected so that each of the sub-blocks has a sub-block duration less than the perceptible delay; d) processing the block so as to modulate a selected one of the frequency indices without changing a total signal energy of the band.
  • an apparatus to read a code from an audio signal.
  • the code comprises a sequence of blocks having a predetermined number of samples of the audio signal, and the code comprises a synchronization block followed by a predetermined number of data blocks.
  • the apparatus comprises a buffer memory, a frequency transformation, a processor, and a vote determiner.
  • the buffer memory is arranged to hold one of the blocks.
  • the frequency transformation is arranged to transform the one block into spectral data spanning a predetermined number of frequency bands, where each of the frequency bands comprises a respective neighborhood of frequency indices.
  • the processor is arranged to determine, for each of the neighborhoods, if a respective predetermined one of the frequency indices is modulated.
  • the vote determiner is arranged to determine that the one block is the synchronization block if, in a majority of the frequency bands, the respective modulated frequency index is a respective index selected for inclusion in the synchronization block.
  • the processor is further arranged to determine if, in one of the data blocks received subsequent to the synchronization block, a respective predetermined one of the frequency indices is modulated.
  • the vote determiner is further arranged to determine if, in a majority of the frequency bands, the respective modulated frequency index is a respective index selected for inclusion in the one data block.
  • a method is provided to read a code from an audio signal by sequentially transforming a sequence of blocks of audio samples into spectral data spanning a predetermined number of frequency bands.
  • Each of the frequency bands comprises a predetermined number of frequency indices, and each of the blocks comprises a predetermined number of the samples.
  • the code comprises a synchronization block followed by a predetermined number of data blocks.
  • the method comprises the steps of: a) determining, in each of the frequency bands of one of the blocks of audio samples, if one of the frequency indices is modulated; b) comparing each modulated frequency index found in step a) with that index selected for modulation in the respective frequency band of the synchronization block; c) determining that the one block is the synchronization block if the majority of the comparisons made in step b) result in a match, and otherwise repeating steps a) through b); d) determining, in each of the frequency bands of one of the data blocks received subsequent to the synchronization block, if a respective one of the frequency indices is modulated; and, e) comparing the respective modulated frequency indices found in step d) with ones of a plurality of predetermined index patterns, each of the index patterns uniquely associated with a respective code bit, and reading the code bit only if the majority of modulated indices match the predetermined index pattern.
  • a system for adding an inaudible code to a tone-like audio portion of a composite signal having two or more portions comprises a sampling apparatus, a processor, a frequency transformation, an encoder, a signal analyzer, and an encoder suspender.
  • the sampling apparatus is arranged to sample audio at a sampling rate and to generate therefrom a plurality of short blocks of sampled audio, where each of the short blocks has a duration less than a minimum audibly perceptible signal delay.
  • the processor is arranged to combine the plurality of short blocks into a long block having a predetermined minimum duration.
  • the frequency transformation is arranged to transform the long block into a frequency domain signal comprising a plurality of independently modulatable frequency indices located in a plurality of frequency bands.
  • the encoder is arranged to modulate two or more of the indices in each of the frequency bands so as to make a respective selected one of the indices an extremum while keeping a total acoustic energy of the audio constant.
  • the signal analyzer is arranged to determine if the tone-like audio portion has a tone-like character within any one of the predetermined number of neighborhoods.
  • the encoder suspender is arranged to suspend the encoding of the encoder within any neighborhood in which the tone-like audio portion has a tone-like character.
  • a method to add an inaudible code to at least one of a predetermined number of frequency neighborhoods within a tone-like audio portion of a composite signal having one or more additional portions.
  • the method comprises the steps of: a) sampling the audio portion and generating from the sampled signal a plurality of short blocks, each of the short blocks having a duration less than a minimum audibly perceptible signal delay; b) combining the plurality of short blocks into a long block having a predetermined minimum duration; c) transforming the long block into a frequency domain signal comprising a plurality of independently modulatable frequency indices; d) identifying those neighborhoods, if any, of the predetermined number of frequency neighborhoods in which the tone-like audio portion has a tone-like character; and, e) modulating a respective index in each neighborhood not identified in step d) so as to make a selected index in such neighborhood an extremum while keeping the total acoustic energy of the audio portion constant, and not modul
  • a broadcast audience measurement system in which an inaudible code added to an audio signal is read by a decoding apparatus located within a statistically sampled dwelling, comprises an encoder, a receiver, and a decoder.
  • the encoder is arranged to add a predetermined code bit to each of a predetermined number of odd frequency bands within a bandwidth of the audio signal.
  • the receiver is within the dwelling and is arranged to receive the encoded audio portion.
  • the decoder has an input from the receiver, and the decoder is arranged to acquire a respective test value of the code bit from each of the frequency bands, to compare the test values, to determine that one of the test values is the code bit only if that test value is acquired from a majority of the frequency bands, and to otherwise determine that no code bit has been read.
  • a broadcast audience measurement system in which an inaudible code added to an audio signal is read within a statistically sampled dwelling unit, comprises an encoding apparatus, a receiver, and a decoder.
  • the encoding apparatus is arranged to add a code bit to a sampled long block of the audio signal, where the long block comprises a predetermined number of short blocks.
  • Each of the short blocks has a predetermined duration that is selected to be short enough not to be perceptible to a member of a broadcast audience.
  • the encoding apparatus is further arranged to modulate a selected frequency index in each of a plurality of frequency neighborhoods so as to make each selected index an extremum in the respective neighborhood thereof while keeping a total energy of the audio signal constant.
  • the receiver is within the dwelling, and is arranged to acquire the encoded audio signal.
  • the decoder is arranged to read the code from the audio signal.
  • the decoder has an input from the receiver, and the decoder comprises a buffer memory arranged to store one of the short blocks.
  • the buffer memory is not arranged to store a long block.
  • a method of encoding an audio signal comprises the following steps: a) generating a plurality of short blocks from the audio signal, wherein each of the short blocks has a duration less than a minimum audibly perceivable signal delay; b) combining the plurality of short blocks into a long block; c) transforming the long block into a spectrum comprising a plurality of independently modulatable frequency indices; and, d) modulating at least two of the indices so as to make one of the indices an extremum while keeping the total energy of a neighborhood of the modulated indices substantially constant.
  • a method of reading a code element from an audio signal comprises the following steps: a) transforming at least a portion of the audio signal into spectral data spanning a predetermined number of frequency bands having a plurality of frequency neighborhoods; b) determining, for each of the neighborhoods, if one of the frequency indices is modulated; and, c) assigning a transmitted code value to the code element if, in a majority of the neighborhoods, the respective modulated frequency index is an index selected for inclusion in the audio signal.
  • FIG. 1 is a schematic depiction of a broadcast audience measurement system employing a program identifying code added to the audio portion of a composite television signal;
  • FIG. 2 is a flow chart depicting an encoding process of the present invention.
  • FIG. 3 is a flow chart depicting a decoding process of the present invention.
  • Audio signals are usually digitized at sampling rates that range between thirty-two kHz and forty-eight kHz. For example, a sampling rate of 44.1 kHz is commonly used during the digital recording of music. However, digital television (“DTV”) is likely to use a forty eight kHz sampling rate.
  • DTV digital television
  • another parameter of interest in digitizing an audio signal is the number of binary bits used to represent the audio signal at each of the instants when it is sampled. This number of binary bits can vary, for example, between sixteen and twenty four bits per sample. The amplitude dynamic range resulting from using sixteen bits per sample of the audio signal is ninety-six dB.
  • the dynamic range resulting from using twenty-four bits per sample is 144 dB.
  • Audio compression is typically accomplished by transform coding.
  • a block of audio consisting of samples may be decomposed, by application of a Fast Fourier Transform or other similar frequency analysis process, into a spectral representation.
  • overlapping blocks of audio are commonly used to produce the samples.
  • a block includes 512 “old” audio samples (i.e., audio samples from a previous block) and 512 “new” or current audio samples.
  • the spectral representation of such a block is divided into critical bands, where each band comprises a group of several neighboring frequencies. The power in each of these bands can be calculated by summing the squares of the amplitudes of the frequency components within the band.
  • Audio compression is based on the following principle of masking: in the presence of high spectral energy at one frequency (i.e., the masking frequency), the human ear is unable to perceive a lower energy signal if the lower energy signal has a frequency (i.e., the masked frequency) near that of the higher energy signal.
  • the lower energy signal at the masked frequency is called a masked signal.
  • a masking threshold which represents either (i) the acoustic energy required at the masked frequency in order to make it audible or (ii) an energy change in the existing spectral value that would be perceptible, can be dynamically computed for each band.
  • the frequency components in a masked band can be represented in a coarse fashion by using fewer bits based on this masking threshold. That is, the masking thresholds and the amplitudes of the frequency components in each band are coded with a smaller number of bits that constitute the compressed audio. Decompression reconstructs the original signal based on these data.
  • the masking threshold depends to some extent on the nature of the sound being masked. Tone-like sounds, in which only one, or a few, frequencies are present in the acoustic spectrum, present special masking problems that are not encountered when dealing with a broad-band acoustic signal. Thus, a signal, that would be masked if added to a passage of speech, might be audible to a listener if added to a passage of music having the same acoustic energy.
  • a television audience measurement system 10 shown in FIG. 1 is an example of a system in which the present invention may be used.
  • the television audience measurement system 10 includes an encoder 12 that adds an ancillary code to an audio signal portion 14 of a broadcast program signal.
  • the encoder 12 may be provided, as is known in the art, at some other location in the program signal distribution chain.
  • a transmitter 16 transmits the encoded audio signal portion along with a video signal portion 18 of the program signal.
  • the audio signal portion of the received program signal is processed to recover the ancillary code, even though the presence of that ancillary code is imperceptible to a listener when the encoded audio signal portion is supplied to speakers 24 of the receiver 20 .
  • a decoder 26 is connected either directly to an audio output 28 available at the receiver 20 or to a microphone 30 placed in the vicinity of the speakers 24 through which the audio is reproduced.
  • the received audio signal can be either in a monaural or stereo format.
  • audio blocks may comprise 512 samples of an audio stream sampled at a 48 kHz sampling rate.
  • the time duration of such a block is 10.6 ms.
  • this arrangement comprises a total delay of about 22 ms, which would be perceptible to a viewer as a loss of synchronization between the video and audio signals.
  • a compensating delay is introduced into the video signal. Because it is preferable to do without such compensating delay, the encoder 12 implements encoding as represented by the flow chart of FIG. 2 in order to avoid loss of video/audio synchronization while at the same time avoiding the use of a compensation delay circuit.
  • the encoding implemented by the encoder 12 reduces the audio encoding delay to an imperceptible 5.3 milliseconds by structuring a complete, or “long”, code block as a sequence of overlapping short blocks that can be processed in a pairwise fashion with correspondingly smaller buffers and that are only 1 ⁇ 2 as long as the blocks used in the '397 and '425 applications.
  • a spectral analysis of a sampled interval of the audio signal that is long enough to form a block of 512 samples collected at a sampling rate of 48 kHz yields frequency “lines” separated from one another by 93.75 Hz.
  • a neighborhood is a set of five consecutive frequency lines covering a neighborhood bandwidth of 468.75 Hz that lies within a selected portion of the overall bandwidth of the audio portion being encoded.
  • a binary data bit is encoded by changing (preferably by boosting) the amplitude of one of the frequencies in the neighborhood such that it becomes a local extremum (i.e., a maximum in the preferred case, although the local extremum could alternatively a minimum).
  • Another frequency in the same neighborhood is changed in the alternate sense (i.e., preferably attenuated) in order to maintain the overall energy within the band at a constant level, a practice that is referred to herein as “energy exchange encoding”. It has been found that the 468.75 Hz neighborhood bandwidth required for a code block is great enough that codes may be subject to interference effects when two frequencies in a single neighborhood undergo different amounts of change.
  • a much longer “long block” sampling interval (8192 samples taken at 48 kHz) is used. This longer sampling interval reduces the spacing between spectral lines to 5.85 Hz.
  • this preferred system writes an energy-exchange code bit in a frequency neighborhood containing eight adjacent frequency indices.
  • this frequency neighborhood requires a bandwidth of less than 50 Hz.
  • This selection of sampling rate, number of samples in a sampling interval, and number of frequency indices in a neighborhood leads to a very small frequency difference in a neighborhood and thereby offers an interference-resistant code having a high degree of invulnerability to narrow-band interference effects.
  • an In Buffer having 256 memory locations is initialized by setting all of its memory locations to zero.
  • an Out Buffer having 128 memory locations is initialized by setting all of its memory locations to zero.
  • a sub-block counter and a long-block counter are both set to zero.
  • data is shifted from the second half of the In Buffer to its first half, and data is copied from the second half of a Temporary Buffer to the first half of the Out Buffer.
  • a short block is constructed at a step 42 by reading 128 samples of new data from the audio signal portion 14 into the second half of the In Buffer which combines these 128 new samples with the last 128 samples of a previous block stored in the first half of the In Buffer as a result of the step 41 .
  • the encoder 12 should preferably use frequencies and critical bands that match those used in compression.
  • a suitable value for N S is 256, for example, and a suitable value for N l is 8192, for example.
  • the short block itself is constructed from the last 128 samples of a previous block and the 128 samples of new data read at the step 42 of FIG. 2 .
  • the samples may be derived from the audio signal portion 14 by the encoder 12 such as by use of an analog to digital converter.
  • the amplitude of the audio signal within a short block may be represented by the time-domain function v(n), where n is the sample index.
  • the time-domain function v(n) is converted to a time value by multiplication by the sample interval at a step 43 .
  • a Discrete Fourier Transform F(u) of v(n)w(n), where u is a frequency index is computed.
  • This Discrete Fourier Transform can be performed using the well-known Fast Fourier Transform (FFT) algorithm.
  • the selected frequencies and indices are shown in the following table: Short Block Long Block Band Index Central index Central Index Long Block Range 0 7 224 220-227 (1287 Hz-1328 Hz) 1 11 352 348-355 (2035 Hz-2077 Hz) 2 15 480 476-483 (2785 Hz-2826 Hz) 3 19 608 604-611 (3533 Hz-3574 Hz) 4 23 736 732-739 (4282 Hz-4323 Hz)
  • each long block in the arrangement shown in the above exemplary table is set up to define neighborhoods having eight long block indices. It will be recognized that different numbers of indices could be used. Adding indices has the effect of increasing the numerical range that can be accommodated in a single block, but it also has the effect of increasing the frequency span of a block, thereby rendering the code more susceptible to interference effects.
  • L a long block consists of 8192 samples made up of 64 sub-blocks, with each sub-block having 128 new samples.
  • a 256-sample short block is constructed from adjacent sub-blocks by the use of the window function of equation (1).
  • L consists of a sequence of sixty four overlapped short blocks, each of which has 256 samples.
  • These short blocks may conveniently by indexed as S i , where the short block index i ranges from 0 to 63.
  • a masking analysis of the sort conventionally used in compression algorithms is preferably applied at the step 44 to the short blocks in order to determine the maximum change in energy E b or in the masking energy level that can occur at any critical frequency band without making the modulation perceptible to a listener.
  • These critical frequency bands may vary in width from single frequency bands at the low end of the spectrum to bands containing ten or more adjacent frequencies at the upper end of the audible spectrum.
  • critical band eighteen includes two frequencies with indexes 19 and 20 of a short audio block.
  • the acoustic energy in each critical band influences the masking energy of its neighbors.
  • Algorithms for computing the masking effect are described in the standards document such as ISO/IEC 13818-7:1997. These analyses may be used to determine for each audio block the masking contribution due to “tonality” as well as “noise” like features of the audio spectrum.
  • the tonality index computed by these algorithms at the step 44 provides a useful tool for determining circumstances under which a sub-block may produce audible degradation when encoded.
  • the analysis can also be used to determine, on a per critical band basis, the amplitude of a time domain code signal that can be added without producing any noticeable audio degradation.
  • a preferred code waveform is constructed using long block indices that are very near to the central index of the corresponding short block for a selected band. For example, if a sub-block S m with a sub-block index m and a coding band b is considered, and if a spectral frequency having a long block index of J b is enhanced, an appropriate code waveform will have 256 samples, which can be denoted as C b (p), where the index p runs from 0 to 255.
  • each of these components is selected to follow the relationship:
  • C b ⁇ ( p ) A b ⁇ ⁇ cos ( ⁇ m + 2 ⁇ ⁇ ⁇ ⁇ ⁇ J b ⁇ p 8192 ) + k b ⁇ A b ⁇ ⁇ cos ( ⁇ + ⁇ j + 2 ⁇ ⁇ ⁇ ⁇ ⁇ j b ⁇ p 256 ) ( 5 )
  • a b is a nominal code amplitude level
  • J b is an index in the long block frequency space
  • j b is the central index of the corresponding short block
  • ⁇ m is given by the following equation:
  • ⁇ m 2 ⁇ ⁇ ⁇ ⁇ J b ⁇ m128 8192 ( 6 )
  • ⁇ m is the starting phase angle for sub-block m
  • ⁇ j is the phase angle of the short block frequency index j b obtained from the Fourier Transform analysis.
  • the quantity ⁇ m ensures that the code component having a frequency index of J b is in phase in all 64 blocks constituting the long block. It may be noted that, in order to simplify the representation, a multiplication of the code signal with a window function (not shown) may be implemented.
  • the first cosine term in equation (5) represents an added energy.
  • the corresponding short block index j b term because of the change in phase angle of ⁇ , subtracts a compensating amount of energy with the assumption that the spectral energy at j b represents the overall energy in the coding band b and includes all of the high resolution coding frequencies in the band.
  • each high resolution frequency component such as J b , influences not only the spectral amplitude at j b but also its neighbors. The most significant impact is on the immediate neighbors j b ⁇ 1 and j b +1.
  • the constant k b with a value in the range 0 to 0.8 is used to control the extent to which a single index j b compensates for the code signal.
  • the window function applied at the step 43 causes further interaction among the short block frequency indexes. Because the high resolution frequencies are close to each other, these amplitude changes are not perceptible. Because of the encoding operation, the desired long block frequency with index J b is enhanced relative to its neighbors in band. For example, if a long block index of 223 is selected, where the corresponding short block central index is seven, and the code energy for all 64 blocks is calculated, a component with frequency index 223 has a higher energy level than the other indices in the neighborhood from 220 to 227.
  • the nominal code amplitude level A b is chosen such that it is the lowest value that permits successful extraction of the embedded code during decoding. For most sub-blocks, the nominal code amplitude level A b is expected to be well below the corresponding masking amplitude level M j . However, in cases where M is not greater than A b , M j replaces A b in equation (5).
  • signal analyzers or signal analyzing algorithms are used to examine each encodable neighborhood of each short block to see if the signal being encoded has a tone-like character within that neighborhood.
  • the tonality index calculated at the step 44 by the masking algorithm described in ISO/IEC 13818-7:1997, for example, provides such a measure.
  • a purely tonal audio block is expected to have a tonality index of 1.0, whereas a “noise-like” block has a tonality index close to 0. If the tonality index for the bands used in coding has a value exceeding a tonal threshold, the encoding operation is suspended for that sub-block.
  • a preferred encoding arrangement of the invention uses a redundant transmission scheme to make the system more robust.
  • five different frequency bands are defined in the exemplary system.
  • the coding arrangement disclosed above was described with respect to only one of these bands. That is, the five bands are essentially independent of each other so that a code symbol can be sent in multiple bands at any given time in the interest of providing redundant transmission.
  • One of the advantages of the encoding method described above is that the processing uses only 256 samples at each stage, of which 128 are new samples and 128 are carried over from the prior processing step.
  • a loss of synchronization of less than about 10 msec between two portions (e.g., left and right stereo channel) of a composite audio signal or between an audio and a video portion of a composite television signal is not perceptible.
  • the encoding method of the present invention does not require introducing a compensating delay in another portion of the signal.
  • the present system has the advantage that it can be used without a video delay circuit and without disturbing the viewer with a perceptible loss of synchronization.
  • a preferred system of the invention defines a synchronization block having a unique structure that differentiates it from other encoded blocks.
  • a synchronization block consisting of 8192 samples is selected when the long block counter has a count of zero such that the synchronization block has the following characteristics: in Band 0, index 220, which is the first frequency line in that neighborhood, is enhanced; in Band 1, the second frequency line, index 349, is enhanced; in Band 2, the third frequency line, index 478, is enhanced; in Band 3, the fourth frequency line, index 607, is enhanced; and, in Band 4, the fifth frequency line, index 736, is enhanced.
  • the decoder When the decoder analyzes a long block by comparing each enhanced frequency index with the respective index selected for enhancement in a synchronization block and finds a match in at least three of the five frequency bands, the system determines that a potential synchronization block has been detected, and interprets the long blocks following a synchronization block as the actual message data.
  • each long block comprises a set of eight indices that can be modulated to form a code.
  • a complete encoded message may comprise forty-eight bits consisting of a sixteen bit Station Identifier (SID) and a thirty-two bit time stamp (TS).
  • SID Station Identifier
  • TS thirty-two bit time stamp
  • the forty-eight bits of data may be grouped into sixteen three-bit sets. The decimal value of each of these three-bit sets can range from zero to seven so that each of the three-bit sets can be encoded by using the selected long blocks.
  • the system encodes a value of k (where k is in the range of zero to seven) by modulating the k th available index.
  • the 6 th index in each band i.e., indices 225, 353, 481, 609, and 737
  • a forty-eight bit data packet can be transmitted as one long synchronization block followed by sixteen long data blocks. For the choice of code blocks and sampling frequency disclosed above, sending these seventeen long blocks requires 2.89 seconds. This arrangement provides a clear distinction from the synchronization block, which has a different index enhanced in each band.
  • each of a plurality of possible code bits has an index pattern uniquely associated with it, and decoding a bit comprises comparing each of plurality of enhanced indices with ones of the index patterns to determine if a majority of the enhanced indices match with one of the predetermined patterns.
  • decoding a bit comprises comparing each of plurality of enhanced indices with ones of the index patterns to determine if a majority of the enhanced indices match with one of the predetermined patterns.
  • This arrangement of sending the same data in each of five bands at the same time fits well with the masking algorithms discussed above. That is, one can select a masking algorithm that suspends coding in one or more of the bands, but that continues to encode in the other ones of the bands.
  • the signal at these frequencies is enhanced at the step 46 assuming that the masking level and the tonality as indicated by the tonality index are acceptable.
  • the samples v(n)w(n) stored in the Temporary Buffer are modified according to equations (5) and (6) and, at a step 47 , the code signal is added to the Temporary Buffer.
  • the first half of the Temporary Buffer is added to the Out Buffer, and the 128 samples in the Out Buffer are passed to the transmitter 16 as encoded data.
  • the sub-block counter is incremented by one and, if the sub-block counter is equal to 64, the long block counter is incremented by one. No other sub-blocks are encoded until the long block counter is incremented.
  • the long block counter is equal to 17, then a complete code message (a synchronization block and sixteen data blocks) has been passed to the transmitter 16 and the long block counter is reset to zero to begin encoding a new message. If the sub-block counter is not equal to 64, or after the long block counter has been reset to zero, program flow returns to the block 41 .
  • a preferred system provides an audio signal acquisition arrangement at a receiving location.
  • This location may be within the statistically selected metering site 22 .
  • the embedded digital code can be recovered from the audio signal available at the audio output 28 of the receiver 20 . When such an output is available, it provides a relatively high quality signal source. However, many receivers 20 do not have the audio output 28 , which constrains the audience research system operator to acquire an analog audio signal with the microphone 30 placed in the vicinity of the speakers 24 .
  • the microphone 30 is preferably placed behind the receiver 20 , where the quality of the signal it acquires is degraded from what would be found if the microphone 30 were placed in front of the receiver 20 .
  • This signal degradation has led to the failure of many prior art systems that attempted to read a buried code from an audio signal picked up with a microphone.
  • the redundancy obtained by encoding five frequency bands as discussed above increases the likelihood that the code can be successfully recovered.
  • the decoder 26 converts the analog audio to a sampled digital output stream at a preferred sampling rate matching the sampling rate of the encoder 12 .
  • the receiver 20 provides digital outputs, the digital outputs are processed directly by the decoder 26 without sampling but at a data rate suitable for the decoder 26 .
  • the ability to decode an audio stream in real-time is highly desirable. It is also highly desirable to transmit the decoded data to a remote central office.
  • the decoder 26 may be arranged to run the decoding algorithm described below in connection with FIG. 3 on Digital Signal Processing (DSP) based hardware of the sort typically used in such applications.
  • DSP Digital Signal Processing
  • the incoming encoded audio signal may be made available to the decoder 26 from either the audio output 28 or from the microphone 30 placed in the vicinity of the speakers 24 .
  • a circular buffer capable of storing 4096 samples is initialized by setting all of its storage locations to zero. Also, a set of frequency bins are set to zero. At a block 51 , 256 samples are read into an audio buffer. Also, a block sample counter is set to zero. Before recovering the actual data bits representing code information, it is necessary to locate the synchronization block which is preferably encoded by enhancing (or diminishing) the amplitude of a unique set of frequencies. In one preferred embodiment these frequencies have indexes 220 , 349 , 478 , 607 , and 736 and each one is in a different coding band.
  • the circular buffer In order to search for the synchronization block, as well as to extract data from subsequent blocks within an incoming audio stream, the circular buffer is used.
  • the circular buffer has a sufficient size to store 4096 samples in the case of half rate sampling. This arrangement is essential in order to implement a near real-time decoding scheme based on a sliding FFT routine which forms part of the decoding algorithm shown in the flow chart of FIG. 3 .
  • the frequency bins are updated at the block 53 with the results of the analysis performed according to equation (7) If the block sample counter has a count which is a multiple of 64, the frequency bins are analyzed and the results of the analysis are stored in a Status Information Structure (SIS) as indicated in step 54 of FIG. 3 .
  • SIS Status Information Structure
  • Each SIS structure is updated at 4096 sample intervals, which corresponds to the length of a long block in the half-sampling rate case.
  • Each SIS structure contains a synchronization flag and a data storage location. Also, the SIS includes a counter.
  • the search for the synchronization block is the first step in the decoding process. Let us assume that at a sample location where the SIS SIS k needs to be updated because a spectrum, which satisfies the characteristics of a synchronization block, is found. In such a spectrum, indexes 220 , 349 , 478 , 607 , 736 are enhanced and possess higher spectral power than their neighbors in the respective bands. Due to factors such as audio compression, audio degradation due to amplifier-speaker-microphone non-linearities, or ambient noise in the case of microphone based decoding systems, it is possible that not all the five bands have the desired characteristics.
  • the redundant transmission feature described above enables detection of a long block as being a synchronization block even if only three of the five bands satisfy the criteria for a synchronization block.
  • a synchronization flag within the corresponding SIS structure is set to one.
  • more than one SIS structure can have its synchronization flag set to one.
  • SIS k ⁇ 2 , SIS k ⁇ 1 , SIS k , SIS k+1 , SIS k+2 may all have synchronization flags set to one because the spectrum of a long audio block does not change rapidly.
  • the algorithm recognizes the synchronization flag and attempts to extract the first three-bit data value encoded in the spectrum. This extraction may be done by means of a voting algorithm that compares test values taken from each of the neighborhoods and that accepts a test value as the data value if the same test value is found in three out of the five band neighborhoods. In addition, if a valid data value in the range zero to seven is extracted, the counter within the SIS is incremented to show that the first member of the sixteen member message data has been extracted. The extracted three-bit datum is also stored within the structure at a corresponding data storage location.
  • the SIS structure's synchronization flag is reset to zero and the counter is reset to zero.
  • the block sample counter is incremented by two corresponding to the two samples read from the audio buffer to the circular buffer at the step 52 . If the block sample counter does not have a count equal to 256, flow returns to the step 52 where two more samples from the audio buffer are read into the circular buffer. On the other hand, if the block sample counter does have a count equal to 256, flow returns to the step 51 where another 256 samples are inserted into the audio buffer.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

An encoder includes a sampler that samples an audio signal and that generates from the samples a plurality of short blocks of sampled audio. Each of the short blocks has a duration less than a minimum audibly perceivable signal delay. A processor combines the plurality of short blocks into a long block. The long block is transformed into a frequency domain signal having a plurality of independently modulatable frequency indices. The frequency difference between adjacent indices is determined by the minimum duration and the sampling rate of the sampler. A neighborhood of frequency indices is selected so that the frequency difference between a lowest index and a highest index within the neighborhood is less than a predetermined value. Two or more of the indices are modulated in the neighborhood so as to make a selected one of the indices an extremum while keeping the total energy of the neighborhood constant. A plurality of frequency bands are so coded. A decoder decides that a bit or bits have been received if, in a majority of the frequency bands, the decoder detects a modulated index.

Description

    RELATED APPLICATION
  • This application contains disclosure similar to the disclosure in U.S. patent application Ser. No. 09/116,397 filed Jul. 16, 1998, in U.S. patent application Ser. No. 09/427,970 filed Oct. 27, 1999, and in U.S. patent application Ser. No. 09/428,425 filed Oct. 27, 1999.
  • TECHNICAL FIELD OF THE INVENTION
  • The present invention relates to a system and method for adding an inaudible code to an audio signal and for subsequently retrieving that code. Such a code may be used, for example, in an audience measurement application in order to identify a broadcast program.
  • BACKGROUND OF THE INVENTION
  • There are many arrangements for adding an ancillary code to a signal in such a way that the added code is not noticed. For example, it is well known in television broadcasting that ancillary codes can be hidden in non-viewable portions of video by inserting the codes into either the video's vertical blanking interval or the video's horizontal retrace interval. An exemplary system that hides codes in non-viewable portions of video is referred to as “AMOL” and is taught in U.S. Pat. No. 4,025,851. This system is used by the assignee of the present application in order to monitor broadcasts of television programming as well as the times of such broadcasts.
  • Other known video encoding systems have sought to bury ancillary codes in a portion of a television signal's transmission bandwidth that otherwise carries little signal energy. Dougherty in U.S. Pat. No. 5,629,739, which is assigned to the assignee of the present application, discloses an example of such a system.
  • It is also known to add ancillary codes to audio signals for the purpose of identifying the signals and, perhaps, for tracing their courses through signal distribution chains. Audio encoding has the obvious advantage of being applicable not only to television, but also to radio broadcasts and to prerecorded music. Moreover, the speaker of a receiver reproduces, in the audio signal output, the ancillary codes that are added to audio signals. Accordingly, audio encoding offers the possibility of non-intrusive interception (i.e., interception of the codes without intrusion into the interior of the receiver) and of decoding the codes with equipment that has microphones as inputs. Moreover, audio encoding permits the measurement of broadcast audiences by the use of portable metering equipment carried by panelists.
  • In the field of audio signal encoding for broadcast audience measurement purposes, Crosby, in U.S. Pat. No. 3,845,391, teaches an audio encoding approach in which the code is inserted in a narrow frequency “notch” from which the original audio signal is deleted. The notch is made at a fixed predetermined frequency (e.g., 40 Hz). This approach leads to codes that are audible when the original audio signal containing the code is of low intensity.
  • A series of improvements followed the Crosby patent. Thus, Howard, in U.S. Pat. No. 4,703,476, teaches the use of two separate notch frequencies for the mark and the space portions of a code signal. Kramer, in U.S. Pat. No. 4,931,871 and in U.S. Pat. No. 4,945,412 teaches, inter alia, using a code signal having an amplitude that tracks the amplitude of the audio signal to which the code is added.
  • Broadcast audience measurement systems in which panelists are expected to carry microphone-equipped audio monitoring devices that can pick up and store inaudible codes broadcast in an audio signal are also known. For example, Aijalla et al., in WO 94/11989 and in U.S. Pat. No. 5,579,124, describe an arrangement in which spread spectrum techniques are used to add a code to an audio signal. The code is either not perceptible, or can be heard only as low level “static” noise.
  • Also, Jensen et al., in U.S. Pat. No. 5,450,490, teach an arrangement for adding a code at a fixed set of frequencies and using one of two masking signals. The choice of masking signal is made on the basis of a frequency analysis of the audio signal to which the code is to be added. Jensen et al. do not teach arrangements for selecting a maximum acceptable code energy to be used in each of a predetermined set of frequency intervals, nor do Jensen et al. teach energy exchange coding which transfers energy between spectral components and which thereby holds the total acoustic energy constant.
  • Preuss et al., in U.S. Pat. No. 5,319,735, teach a multi-band audio encoding arrangement in which a spread spectrum code is inserted in recorded music at a fixed ratio to the input signal intensity (code-to-music ratio) that is preferably 19 dB. Lee et al., in U.S. Pat. No. 5,687,191, teach an audio coding arrangement suitable for use with digitized audio signals. The code intensity is made to match the input signal by calculating a signal-to-mask ratio in each of several frequency bands and by then inserting the code at an intensity that is a predetermined ratio of the audio input in that band. Lee et al. has also described a method of embedding digital information in a digital waveform in U.S. Pat. No. 5,824,360.
  • Jensen et al., in U.S. Pat. No. 5,764,763, teach a method in which code signals consisting of sinusoidal waves at ten pre-selected frequencies in a high resolution spectrum are added to the original audio in order to represent either a binary bit (0 or 1) and the start and end of an embedded message. Forty unique frequencies are required for encoding these four symbols. Their values range from 1046.9 Hz to 2851.6 Hz in a typical practical embodiment. The frequency separation between adjacent lines in the spectrum is 4 Hz and the minimum separation between frequencies selected to constitute the set of 40 frequencies is 8 Hz. The amplitude of the injected code signal is controlled by a masking analysis. In the decoding process, the injected code signal is distinguished by the fact that its level will be significantly above a noise level computed for a band of frequencies.
  • It will be recognized that, because ancillary codes are preferably inserted at low intensities in order to prevent the codes from distracting a listener of program audio, such codes may be vulnerable to various signal processing operations as well as to interference from extraneous electromagnetic sources. For example, although Lee et al. discuss digitized audio signals, many of the earlier known approaches to encoding a broadcast audio signal are not compatible with current and proposed digital audio standards, particularly those employing signal compression methods that may reduce the signal's dynamic range (and thereby delete a low level code) or that otherwise may damage an ancillary code. In this regard, it is particularly important for an ancillary code to survive compression and subsequent de-compression by the AC-3 algorithm or by one of the algorithms recommended in the ISO/IEC 11172 MPEG standard, which is expected to be widely used in future digital television broadcasting systems.
  • U.S. patent application Ser. No. 09/116,397 filed Jul. 16, 1998 and U.S. patent application Ser. No. 09/428,425 filed Oct. 27, 1999 disclose a system and method for inserting a code into an audio signal so that the code is likely to survive compression and decompression as required by current and proposed digital audio standards. Spectral modulation of the amplitude or phase of the signal at selected code frequencies is used to insert the code into the audio signal. These selected code frequencies, which could comprise multiple frequency sets within a given audio block, may be varied from audio block to audio block, and the spectral modulation may be implemented as amplitude modulation, modulation by frequency swapping, phase modulation, and/or odd/even index modulation. Moreover, an approach is taught to measuring audio quality of each block and of suspending encoding in cases where the code might be audible to a listener.
  • In experimental systems of the sort taught in the '397 application and in the '425 application, the audio sampling process during encoding imposes a delay in excess of twenty milliseconds in the audio portion of a television program. Left uncorrected, this delay results in a perceptible loss of synchronization between the audio and video portions of a viewed program. Hence, practical systems of this sort have required the use of a compensating video delay circuit. However, it is preferable to do without such a circuit.
  • Moreover, in systems of the sort taught in the '397 application and in the '425 application, codes are added by manipulating pairs of frequencies that are spaced apart by about 100 Hz. These systems are thus vulnerable to interference, such as reverberation or multi-path distortion, that affect one of the encoded frequencies substantially more than the other.
  • The present invention is arranged to solve one or more of the above noted problems.
  • SUMMARY OF THE INVENTION
  • According to one aspect of the present invention, a system for adding an interference-resistant, inaudible code to an audio signal comprises a sampler, a processor, a frequency transformation, a frequency selector, and an encoder. The sampler is arranged to sample the audio signal at a sampling rate and to generate therefrom a plurality of short blocks of sampled audio, where each of the short blocks has a duration less than a minimum audibly perceivable signal delay. The processor is arranged to combine the plurality of short blocks into a long block having a predetermined minimum duration. The frequency transformation is arranged to transform the long block into a frequency domain signal comprising a plurality of independently modulatable frequency indices, where a frequency difference between two adjacent ones of the indices is determined by the minimum duration and the sampling rate. The frequency selector is arranged to select a neighborhood of frequency indices so that the frequency difference between a lowest index and a highest index within the neighborhood is less than a predetermined value. The encoder is arranged to modulate two or more of the indices in the neighborhood so as to make a selected one of the indices an extremum while keeping the total energy of the neighborhood constant.
  • According to another aspect of the present invention, a method is provided to add a code to a frequency band of a sampled audio portion of a composite signal without thereby introducing a perceptible delay between the encoded audio portion and another portion of the composite signal. The method comprises the steps of: a) selecting a sampling rate and a frequency difference between adjacent ones of a predetermined number of frequency indices included in a frequency neighborhood; b) determining from the sampling rate and from the frequency difference a duration of a block of samples; c) determining an integral number of sequential sub-blocks to make up the block, where the integral number is selected so that each of the sub-blocks has a sub-block duration less than the perceptible delay; d) processing the block so as to modulate a selected one of the frequency indices without changing a total signal energy of the band.
  • According to still another aspect of the present invention, an apparatus is provided to read a code from an audio signal. The code comprises a sequence of blocks having a predetermined number of samples of the audio signal, and the code comprises a synchronization block followed by a predetermined number of data blocks. The apparatus comprises a buffer memory, a frequency transformation, a processor, and a vote determiner. The buffer memory is arranged to hold one of the blocks. The frequency transformation is arranged to transform the one block into spectral data spanning a predetermined number of frequency bands, where each of the frequency bands comprises a respective neighborhood of frequency indices. The processor is arranged to determine, for each of the neighborhoods, if a respective predetermined one of the frequency indices is modulated. The vote determiner is arranged to determine that the one block is the synchronization block if, in a majority of the frequency bands, the respective modulated frequency index is a respective index selected for inclusion in the synchronization block. The processor is further arranged to determine if, in one of the data blocks received subsequent to the synchronization block, a respective predetermined one of the frequency indices is modulated. The vote determiner is further arranged to determine if, in a majority of the frequency bands, the respective modulated frequency index is a respective index selected for inclusion in the one data block.
  • According to yet another aspect of the present invention, a method is provided to read a code from an audio signal by sequentially transforming a sequence of blocks of audio samples into spectral data spanning a predetermined number of frequency bands. Each of the frequency bands comprises a predetermined number of frequency indices, and each of the blocks comprises a predetermined number of the samples. The code comprises a synchronization block followed by a predetermined number of data blocks. The method comprises the steps of: a) determining, in each of the frequency bands of one of the blocks of audio samples, if one of the frequency indices is modulated; b) comparing each modulated frequency index found in step a) with that index selected for modulation in the respective frequency band of the synchronization block; c) determining that the one block is the synchronization block if the majority of the comparisons made in step b) result in a match, and otherwise repeating steps a) through b); d) determining, in each of the frequency bands of one of the data blocks received subsequent to the synchronization block, if a respective one of the frequency indices is modulated; and, e) comparing the respective modulated frequency indices found in step d) with ones of a plurality of predetermined index patterns, each of the index patterns uniquely associated with a respective code bit, and reading the code bit only if the majority of modulated indices match the predetermined index pattern.
  • According to a further aspect of the present invention, a system for adding an inaudible code to a tone-like audio portion of a composite signal having two or more portions comprises a sampling apparatus, a processor, a frequency transformation, an encoder, a signal analyzer, and an encoder suspender. The sampling apparatus is arranged to sample audio at a sampling rate and to generate therefrom a plurality of short blocks of sampled audio, where each of the short blocks has a duration less than a minimum audibly perceptible signal delay. The processor is arranged to combine the plurality of short blocks into a long block having a predetermined minimum duration. The frequency transformation is arranged to transform the long block into a frequency domain signal comprising a plurality of independently modulatable frequency indices located in a plurality of frequency bands. The encoder is arranged to modulate two or more of the indices in each of the frequency bands so as to make a respective selected one of the indices an extremum while keeping a total acoustic energy of the audio constant. The signal analyzer is arranged to determine if the tone-like audio portion has a tone-like character within any one of the predetermined number of neighborhoods. The encoder suspender is arranged to suspend the encoding of the encoder within any neighborhood in which the tone-like audio portion has a tone-like character.
  • According to yet a further aspect of the present invention, a method is provided to add an inaudible code to at least one of a predetermined number of frequency neighborhoods within a tone-like audio portion of a composite signal having one or more additional portions. The method comprises the steps of: a) sampling the audio portion and generating from the sampled signal a plurality of short blocks, each of the short blocks having a duration less than a minimum audibly perceptible signal delay; b) combining the plurality of short blocks into a long block having a predetermined minimum duration; c) transforming the long block into a frequency domain signal comprising a plurality of independently modulatable frequency indices; d) identifying those neighborhoods, if any, of the predetermined number of frequency neighborhoods in which the tone-like audio portion has a tone-like character; and, e) modulating a respective index in each neighborhood not identified in step d) so as to make a selected index in such neighborhood an extremum while keeping the total acoustic energy of the audio portion constant, and not modulating an index in any of those neighborhoods identified in step d).
  • According to still a further aspect of the present invention, a broadcast audience measurement system, in which an inaudible code added to an audio signal is read by a decoding apparatus located within a statistically sampled dwelling, comprises an encoder, a receiver, and a decoder. The encoder is arranged to add a predetermined code bit to each of a predetermined number of odd frequency bands within a bandwidth of the audio signal. The receiver is within the dwelling and is arranged to receive the encoded audio portion. The decoder has an input from the receiver, and the decoder is arranged to acquire a respective test value of the code bit from each of the frequency bands, to compare the test values, to determine that one of the test values is the code bit only if that test value is acquired from a majority of the frequency bands, and to otherwise determine that no code bit has been read.
  • According to another aspect of the present invention, a broadcast audience measurement system, in which an inaudible code added to an audio signal is read within a statistically sampled dwelling unit, comprises an encoding apparatus, a receiver, and a decoder. The encoding apparatus is arranged to add a code bit to a sampled long block of the audio signal, where the long block comprises a predetermined number of short blocks. Each of the short blocks has a predetermined duration that is selected to be short enough not to be perceptible to a member of a broadcast audience. The encoding apparatus is further arranged to modulate a selected frequency index in each of a plurality of frequency neighborhoods so as to make each selected index an extremum in the respective neighborhood thereof while keeping a total energy of the audio signal constant. The receiver is within the dwelling, and is arranged to acquire the encoded audio signal. The decoder is arranged to read the code from the audio signal. The decoder has an input from the receiver, and the decoder comprises a buffer memory arranged to store one of the short blocks. The buffer memory is not arranged to store a long block.
  • According to still aspect of the present invention, a method of encoding an audio signal comprises the following steps: a) generating a plurality of short blocks from the audio signal, wherein each of the short blocks has a duration less than a minimum audibly perceivable signal delay; b) combining the plurality of short blocks into a long block; c) transforming the long block into a spectrum comprising a plurality of independently modulatable frequency indices; and, d) modulating at least two of the indices so as to make one of the indices an extremum while keeping the total energy of a neighborhood of the modulated indices substantially constant.
  • According to yet aspect of the present invention, a method of reading a code element from an audio signal comprises the following steps: a) transforming at least a portion of the audio signal into spectral data spanning a predetermined number of frequency bands having a plurality of frequency neighborhoods; b) determining, for each of the neighborhoods, if one of the frequency indices is modulated; and, c) assigning a transmitted code value to the code element if, in a majority of the neighborhoods, the respective modulated frequency index is an index selected for inclusion in the audio signal.
  • BRIEF DESCRIPTION OF THE DRAWING
  • These and other features and advantages will become more apparent from a detailed consideration of the invention when taken in conjunction with the drawings in which:
  • FIG. 1 is a schematic depiction of a broadcast audience measurement system employing a program identifying code added to the audio portion of a composite television signal;
  • FIG. 2 is a flow chart depicting an encoding process of the present invention; and,
  • FIG. 3 is a flow chart depicting a decoding process of the present invention.
  • DETAILED DESCRIPTION OF THE INVENTION
  • Audio signals are usually digitized at sampling rates that range between thirty-two kHz and forty-eight kHz. For example, a sampling rate of 44.1 kHz is commonly used during the digital recording of music. However, digital television (“DTV”) is likely to use a forty eight kHz sampling rate. Besides the sampling rate, another parameter of interest in digitizing an audio signal is the number of binary bits used to represent the audio signal at each of the instants when it is sampled. This number of binary bits can vary, for example, between sixteen and twenty four bits per sample. The amplitude dynamic range resulting from using sixteen bits per sample of the audio signal is ninety-six dB. This decibel measure is the ratio of the square of the highest audio amplitude (216=65536) to the square of the lowest audio amplitude (12=1). The dynamic range resulting from using twenty-four bits per sample is 144 dB. Raw audio, which is sampled at the 44.1 kHz rate and which is converted to a sixteen-bit per sample representation, results in a data rate of 705.6 kbits/s.
  • Compression of audio signals is performed in order to reduce this data rate to a level which makes it possible to transmit a stereo pair of such data on a channel with a throughput as low as 192 kbits/s. Audio compression is typically accomplished by transform coding. A block of audio consisting of samples, for example, may be decomposed, by application of a Fast Fourier Transform or other similar frequency analysis process, into a spectral representation. In order to prevent errors that may occur at the boundary between one block of audio and the previous or subsequent block of audio, overlapping blocks of audio are commonly used to produce the samples. In one such arrangement where 1024 samples per overlapped block are used, a block includes 512 “old” audio samples (i.e., audio samples from a previous block) and 512 “new” or current audio samples. The spectral representation of such a block is divided into critical bands, where each band comprises a group of several neighboring frequencies. The power in each of these bands can be calculated by summing the squares of the amplitudes of the frequency components within the band.
  • Audio compression is based on the following principle of masking: in the presence of high spectral energy at one frequency (i.e., the masking frequency), the human ear is unable to perceive a lower energy signal if the lower energy signal has a frequency (i.e., the masked frequency) near that of the higher energy signal. The lower energy signal at the masked frequency is called a masked signal. A masking threshold, which represents either (i) the acoustic energy required at the masked frequency in order to make it audible or (ii) an energy change in the existing spectral value that would be perceptible, can be dynamically computed for each band. The frequency components in a masked band can be represented in a coarse fashion by using fewer bits based on this masking threshold. That is, the masking thresholds and the amplitudes of the frequency components in each band are coded with a smaller number of bits that constitute the compressed audio. Decompression reconstructs the original signal based on these data.
  • It may be noted that the masking threshold depends to some extent on the nature of the sound being masked. Tone-like sounds, in which only one, or a few, frequencies are present in the acoustic spectrum, present special masking problems that are not encountered when dealing with a broad-band acoustic signal. Thus, a signal, that would be masked if added to a passage of speech, might be audible to a listener if added to a passage of music having the same acoustic energy.
  • A television audience measurement system 10 shown in FIG. 1 is an example of a system in which the present invention may be used. The television audience measurement system 10 includes an encoder 12 that adds an ancillary code to an audio signal portion 14 of a broadcast program signal. Alternatively, the encoder 12 may be provided, as is known in the art, at some other location in the program signal distribution chain. A transmitter 16 transmits the encoded audio signal portion along with a video signal portion 18 of the program signal.
  • When the encoded signal is received by a receiver 20 located at a statistically selected metering site 22, the audio signal portion of the received program signal is processed to recover the ancillary code, even though the presence of that ancillary code is imperceptible to a listener when the encoded audio signal portion is supplied to speakers 24 of the receiver 20. To this end, a decoder 26 is connected either directly to an audio output 28 available at the receiver 20 or to a microphone 30 placed in the vicinity of the speakers 24 through which the audio is reproduced. The received audio signal can be either in a monaural or stereo format.
  • As disclosed in the '397 application and in the '425 application, audio blocks may comprise 512 samples of an audio stream sampled at a 48 kHz sampling rate. The time duration of such a block is 10.6 ms. Because two blocks are buffered, this arrangement comprises a total delay of about 22 ms, which would be perceptible to a viewer as a loss of synchronization between the video and audio signals. To avoid losing synchronization, a compensating delay is introduced into the video signal. Because it is preferable to do without such compensating delay, the encoder 12 implements encoding as represented by the flow chart of FIG. 2 in order to avoid loss of video/audio synchronization while at the same time avoiding the use of a compensation delay circuit.
  • The encoding implemented by the encoder 12 reduces the audio encoding delay to an imperceptible 5.3 milliseconds by structuring a complete, or “long”, code block as a sequence of overlapping short blocks that can be processed in a pairwise fashion with correspondingly smaller buffers and that are only ½ as long as the blocks used in the '397 and '425 applications.
  • According to the '397 application and the '425 application, a spectral analysis of a sampled interval of the audio signal that is long enough to form a block of 512 samples collected at a sampling rate of 48 kHz yields frequency “lines” separated from one another by 93.75 Hz. In these applications, a neighborhood is a set of five consecutive frequency lines covering a neighborhood bandwidth of 468.75 Hz that lies within a selected portion of the overall bandwidth of the audio portion being encoded. A binary data bit, either a ‘0’or ‘1’, is encoded by changing (preferably by boosting) the amplitude of one of the frequencies in the neighborhood such that it becomes a local extremum (i.e., a maximum in the preferred case, although the local extremum could alternatively a minimum). Another frequency in the same neighborhood is changed in the alternate sense (i.e., preferably attenuated) in order to maintain the overall energy within the band at a constant level, a practice that is referred to herein as “energy exchange encoding”. It has been found that the 468.75 Hz neighborhood bandwidth required for a code block is great enough that codes may be subject to interference effects when two frequencies in a single neighborhood undergo different amounts of change.
  • In a preferred system of the present invention, a much longer “long block” sampling interval (8192 samples taken at 48 kHz) is used. This longer sampling interval reduces the spacing between spectral lines to 5.85 Hz. As will be described in greater detail hereinafter, this preferred system writes an energy-exchange code bit in a frequency neighborhood containing eight adjacent frequency indices. Thus, this frequency neighborhood requires a bandwidth of less than 50 Hz. This selection of sampling rate, number of samples in a sampling interval, and number of frequency indices in a neighborhood leads to a very small frequency difference in a neighborhood and thereby offers an interference-resistant code having a high degree of invulnerability to narrow-band interference effects.
  • Encoding by Spectral Modulation
  • At a step 40 of the encoding implemented by the encoder 12 and shown in FIG. 2, an In Buffer having 256 memory locations is initialized by setting all of its memory locations to zero. Also, an Out Buffer having 128 memory locations is initialized by setting all of its memory locations to zero. Moreover, a sub-block counter and a long-block counter are both set to zero. At a step 41, data is shifted from the second half of the In Buffer to its first half, and data is copied from the second half of a Temporary Buffer to the first half of the Out Buffer.
  • A short block is constructed at a step 42 by reading 128 samples of new data from the audio signal portion 14 into the second half of the In Buffer which combines these 128 new samples with the last 128 samples of a previous block stored in the first half of the In Buffer as a result of the step 41. In order for the encoder 12 to embed a digital code in an audio data stream in a manner compatible with compression technology, the encoder 12 should preferably use frequencies and critical bands that match those used in compression. The short block length Ns of the audio signal that is used for coding may be chosen such that, for example, NS=Nl/j, where j is an integer, and where Nl is the length in samples of a long block. A suitable value for NS is 256, for example, and a suitable value for Nl is 8192, for example. The short block itself is constructed from the last 128 samples of a previous block and the 128 samples of new data read at the step 42 of FIG. 2. The samples may be derived from the audio signal portion 14 by the encoder 12 such as by use of an analog to digital converter.
  • The amplitude of the audio signal within a short block may be represented by the time-domain function v(n), where n is the sample index. The time-domain function v(n) is converted to a time value by multiplication by the sample interval at a step 43. To this end, a “window function” is defined according to the following equation: w ( n ) = 1 - cos ( 2 π n N S ) 2 ( 1 )
    and is applied to v(n) at the step 43 by multiplication to obtain a windowed signal v(n)w(n) which is stored in the Temporary Buffer. At a step 44, a Discrete Fourier Transform F(u) of v(n)w(n), where u is a frequency index, is computed. This Discrete Fourier Transform can be performed using the well-known Fast Fourier Transform (FFT) algorithm.
  • The frequencies resulting from the Fourier Transform are indexed in the range −127 to +127, where an index of 127 corresponds to exactly half the sampling frequency fS. Therefore, for a forty-eight kHz sampling frequency, the highest index would correspond to a frequency of twenty-four kHz. Accordingly, for purposes of this indexing, the index closest to a particular frequency component fj, where frequency is measured in kHz, resulting from the Fourier Transform is given by the following equation: j = 128 f j 24 ( 2 )
    where equation (2) is used in the following discussion to relate a frequency fj to its corresponding short-block index j. As noted above, in the preferred coding arrangement, sequential indices calculated for a short block are separated from each other by a frequency of 187.5 Hz. Correspondingly, in considering a long block made up of 64 sub-blocks of 128 samples each (where the sub-blocks are processed in pairs having 256 samples), an equation relating the long block index J to a high resolution spectral frequency fJ in kHz is given by the following: J = 4096 f J 24 ( 3 )
    From equations (2) and (3), it is clear that J=32j for frequencies which are common to both the high (long block) and low (short block) resolution spectra.
  • In the preferred high resolution encoding arrangement of the present invention, five frequency bands are selected for use in a “voting” arrangement to be discussed in greater detail hereinafter. For each of the selected frequency bands, a high resolution neighborhood of eight long block indices JL=JS−4, JS−3, JS−2, JS−1, JS, JS+1, JS+1, JS+2, JS+3 is defined about a central short block index jS with JS=32js. In one such embodiment, the selected frequencies and indices are shown in the following table:
    Short Block Long Block
    Band Index Central index Central Index Long Block Range
    0 7 224 220-227
    (1287 Hz-1328 Hz)
    1 11 352 348-355
    (2035 Hz-2077 Hz)
    2 15 480 476-483
    (2785 Hz-2826 Hz)
    3 19 608 604-611
    (3533 Hz-3574 Hz)
    4 23 736 732-739
    (4282 Hz-4323 Hz)
  • It may be noted that each long block in the arrangement shown in the above exemplary table is set up to define neighborhoods having eight long block indices. It will be recognized that different numbers of indices could be used. Adding indices has the effect of increasing the numerical range that can be accommodated in a single block, but it also has the effect of increasing the frequency span of a block, thereby rendering the code more susceptible to interference effects.
  • Let it be assumed that a long block L consists of 8192 samples made up of 64 sub-blocks, with each sub-block having 128 new samples. A 256-sample short block is constructed from adjacent sub-blocks by the use of the window function of equation (1). Thus, L consists of a sequence of sixty four overlapped short blocks, each of which has 256 samples. These short blocks may conveniently by indexed as Si, where the short block index i ranges from 0 to 63.
  • A masking analysis of the sort conventionally used in compression algorithms is preferably applied at the step 44 to the short blocks in order to determine the maximum change in energy Eb or in the masking energy level that can occur at any critical frequency band without making the modulation perceptible to a listener. These critical frequency bands, determined by experimental studies carried out on human auditory perception, may vary in width from single frequency bands at the low end of the spectrum to bands containing ten or more adjacent frequencies at the upper end of the audible spectrum. In the psycho-acoustic modeling scheme used in the MPEG-AAC audio compression standard ISO/IEC 13818-7:1997, for example, critical band eighteen includes two frequencies with indexes 19 and 20 of a short audio block. The acoustic energy in each critical band influences the masking energy of its neighbors. Algorithms for computing the masking effect are described in the standards document such as ISO/IEC 13818-7:1997. These analyses may be used to determine for each audio block the masking contribution due to “tonality” as well as “noise” like features of the audio spectrum. The tonality index computed by these algorithms at the step 44 provides a useful tool for determining circumstances under which a sub-block may produce audible degradation when encoded. The analysis can also be used to determine, on a per critical band basis, the amplitude of a time domain code signal that can be added without producing any noticeable audio degradation. Thus, for a short block frequency index j, belonging to a critical band with masking energy Ej, the maximum amplitude of a code signal is given by the following equation:
    M j=128{square root}{square root over (E j )}  (4)
    where 128 is a factor required to convert from a spectral domain to the time domain.
  • A preferred code waveform is constructed using long block indices that are very near to the central index of the corresponding short block for a selected band. For example, if a sub-block Sm with a sub-block index m and a coding band b is considered, and if a spectral frequency having a long block index of Jb is enhanced, an appropriate code waveform will have 256 samples, which can be denoted as Cb(p), where the index p runs from 0 to 255. In a preferred embodiment, each of these components is selected to follow the relationship: C b ( p ) = A b cos ( ϕ m + 2 π J b p 8192 ) + k b A b cos ( π + ϕ j + 2 π j b p 256 ) ( 5 )
    where Ab is a nominal code amplitude level, Jb is an index in the long block frequency space, jb is the central index of the corresponding short block, φm is given by the following equation: ϕ m = 2 π J b m128 8192 ( 6 )
    φm is the starting phase angle for sub-block m, and φj is the phase angle of the short block frequency index jb obtained from the Fourier Transform analysis. The quantity φm ensures that the code component having a frequency index of Jb is in phase in all 64 blocks constituting the long block. It may be noted that, in order to simplify the representation, a multiplication of the code signal with a window function (not shown) may be implemented.
  • The above choice for a code waveform provides an energy exchange coding feature. For a given large block index Jb, the first cosine term in equation (5) represents an added energy. The corresponding short block index jb term, because of the change in phase angle of π, subtracts a compensating amount of energy with the assumption that the spectral energy at jb represents the overall energy in the coding band b and includes all of the high resolution coding frequencies in the band.
  • It should be noted that each high resolution frequency component, such as Jb, influences not only the spectral amplitude at jb but also its neighbors. The most significant impact is on the immediate neighbors jb−1 and jb+1. The constant kb with a value in the range 0 to 0.8 is used to control the extent to which a single index jb compensates for the code signal.
  • The window function applied at the step 43 causes further interaction among the short block frequency indexes. Because the high resolution frequencies are close to each other, these amplitude changes are not perceptible. Because of the encoding operation, the desired long block frequency with index Jb is enhanced relative to its neighbors in band. For example, if a long block index of 223 is selected, where the corresponding short block central index is seven, and the code energy for all 64 blocks is calculated, a component with frequency index 223 has a higher energy level than the other indices in the neighborhood from 220 to 227.
  • The nominal code amplitude level Ab is chosen such that it is the lowest value that permits successful extraction of the embedded code during decoding. For most sub-blocks, the nominal code amplitude level Ab is expected to be well below the corresponding masking amplitude level Mj. However, in cases where M is not greater than Ab, Mj replaces Ab in equation (5).
  • In preferred embodiments of the encoding system of the present invention, signal analyzers or signal analyzing algorithms are used to examine each encodable neighborhood of each short block to see if the signal being encoded has a tone-like character within that neighborhood. The tonality index calculated at the step 44 by the masking algorithm described in ISO/IEC 13818-7:1997, for example, provides such a measure. A purely tonal audio block is expected to have a tonality index of 1.0, whereas a “noise-like” block has a tonality index close to 0. If the tonality index for the bands used in coding has a value exceeding a tonal threshold, the encoding operation is suspended for that sub-block. (See the discussion below regarding step 46.) It is noted that, even if several sub-blocks are tonal, coded data can still be successfully retrieved because there are 64 sub-blocks in each long block. It is the spectrum of the long block that is analyzed during decoding.
  • A preferred encoding arrangement of the invention uses a redundant transmission scheme to make the system more robust. As depicted in the table shown above, five different frequency bands are defined in the exemplary system. The coding arrangement disclosed above was described with respect to only one of these bands. That is, the five bands are essentially independent of each other so that a code symbol can be sent in multiple bands at any given time in the interest of providing redundant transmission.
  • One of the advantages of the encoding method described above is that the processing uses only 256 samples at each stage, of which 128 are new samples and 128 are carried over from the prior processing step. Thus, at a selected sampling rate of 48 kHz, the total buffer capacity required to hold the samples in a “double buffer” is 256 and the corresponding time duration is 256/48000=5.3 milliseconds. As is known to those skilled in the arts of perceptual psychology, a loss of synchronization of less than about 10 msec between two portions (e.g., left and right stereo channel) of a composite audio signal or between an audio and a video portion of a composite television signal is not perceptible. Thus, the encoding method of the present invention does not require introducing a compensating delay in another portion of the signal. When used for television audience research purposes, the present system has the advantage that it can be used without a video delay circuit and without disturbing the viewer with a perceptible loss of synchronization.
  • In order to design a practical encoding scheme, it is essential to develop a synchronization method that will allow the decoding system to determine the start of a new message. As is often done in encoded messaging systems, a preferred system of the invention defines a synchronization block having a unique structure that differentiates it from other encoded blocks. At a step 45, therefore, a synchronization block consisting of 8192 samples is selected when the long block counter has a count of zero such that the synchronization block has the following characteristics: in Band 0, index 220, which is the first frequency line in that neighborhood, is enhanced; in Band 1, the second frequency line, index 349, is enhanced; in Band 2, the third frequency line, index 478, is enhanced; in Band 3, the fourth frequency line, index 607, is enhanced; and, in Band 4, the fifth frequency line, index 736, is enhanced. When the decoder analyzes a long block by comparing each enhanced frequency index with the respective index selected for enhancement in a synchronization block and finds a match in at least three of the five frequency bands, the system determines that a potential synchronization block has been detected, and interprets the long blocks following a synchronization block as the actual message data.
  • As noted above, in discussing the blocks selected for an exemplary system and shown in the above table, each long block comprises a set of eight indices that can be modulated to form a code. In a television audience measurement application of interest to the inventor, a complete encoded message may comprise forty-eight bits consisting of a sixteen bit Station Identifier (SID) and a thirty-two bit time stamp (TS). To match this message to the selected set of indices, the forty-eight bits of data may be grouped into sixteen three-bit sets. The decimal value of each of these three-bit sets can range from zero to seven so that each of the three-bit sets can be encoded by using the selected long blocks. In one preferred arrangement, the system encodes a value of k (where k is in the range of zero to seven) by modulating the kth available index. In this arrangement, for example, to send a code group having a value=five, the 6th index in each band (i.e., indices 225, 353, 481, 609, and 737) is selected at the step 45 for enhancement. In this embodiment, a forty-eight bit data packet can be transmitted as one long synchronization block followed by sixteen long data blocks. For the choice of code blocks and sampling frequency disclosed above, sending these seventeen long blocks requires 2.89 seconds. This arrangement provides a clear distinction from the synchronization block, which has a different index enhanced in each band.
  • More generally speaking, each of a plurality of possible code bits has an index pattern uniquely associated with it, and decoding a bit comprises comparing each of plurality of enhanced indices with ones of the index patterns to determine if a majority of the enhanced indices match with one of the predetermined patterns. The exemplary embodiment recited above is both conceptually straightforward and robust, but may lead to an audible beat phenomenon because each code frequency is separated from its central short block frequency by the same value in all the coding bands. In the case of a code bit of value five, this constant difference frequency is 5.85 Hz, which corresponds to an index difference of one. In another preferred embodiment, this problem is overcome at the step 45 by choosing as the index pattern a pre-determined pseudo-random combination of frequency indexes for each band. Thus, for example, a value of five could be coded by using the following frequency indexes in the five bands: 225, 355, 476, 607, and 737. The beat phenomenon is substantially decreased by this change.
  • This arrangement of sending the same data in each of five bands at the same time fits well with the masking algorithms discussed above. That is, one can select a masking algorithm that suspends coding in one or more of the bands, but that continues to encode in the other ones of the bands.
  • Once the frequencies have been selected at the step 45, the signal at these frequencies is enhanced at the step 46 assuming that the masking level and the tonality as indicated by the tonality index are acceptable. The samples v(n)w(n) stored in the Temporary Buffer are modified according to equations (5) and (6) and, at a step 47, the code signal is added to the Temporary Buffer. At a step 48, the first half of the Temporary Buffer is added to the Out Buffer, and the 128 samples in the Out Buffer are passed to the transmitter 16 as encoded data.
  • At a step 49, the sub-block counter is incremented by one and, if the sub-block counter is equal to 64, the long block counter is incremented by one. No other sub-blocks are encoded until the long block counter is incremented. When the long block counter is equal to 17, then a complete code message (a synchronization block and sixteen data blocks) has been passed to the transmitter 16 and the long block counter is reset to zero to begin encoding a new message. If the sub-block counter is not equal to 64, or after the long block counter has been reset to zero, program flow returns to the block 41.
  • Decoding the Spectrally Modulated Signal
  • A preferred system provides an audio signal acquisition arrangement at a receiving location. This location, for example, may be within the statistically selected metering site 22. In some instances, the embedded digital code can be recovered from the audio signal available at the audio output 28 of the receiver 20. When such an output is available, it provides a relatively high quality signal source. However, many receivers 20 do not have the audio output 28, which constrains the audience research system operator to acquire an analog audio signal with the microphone 30 placed in the vicinity of the speakers 24. Because audience measurement systems generally have a goal of minimizing the intrusion that they make into the measured television viewing environment, the microphone 30 is preferably placed behind the receiver 20, where the quality of the signal it acquires is degraded from what would be found if the microphone 30 were placed in front of the receiver 20. This signal degradation has led to the failure of many prior art systems that attempted to read a buried code from an audio signal picked up with a microphone. However, the redundancy obtained by encoding five frequency bands as discussed above increases the likelihood that the code can be successfully recovered.
  • In the case where the microphone 30 is used, or in the case where the signal on the audio output 28 is analog, the decoder 26 converts the analog audio to a sampled digital output stream at a preferred sampling rate matching the sampling rate of the encoder 12. In decoding systems where there are limitations in terms of memory and computing power, a half-rate sampling could be used. In the case of half-rate sampling, each short block would consist of NS/2=128 samples, and the resolution in the frequency domain (i.e., the frequency difference between successive spectral components) would remain the same as in the full sampling rate case. In the case where the receiver 20 provides digital outputs, the digital outputs are processed directly by the decoder 26 without sampling but at a data rate suitable for the decoder 26.
  • In a practical implementation of audio decoding, such as may be used in a home audience metering system, the ability to decode an audio stream in real-time is highly desirable. It is also highly desirable to transmit the decoded data to a remote central office. The decoder 26 may be arranged to run the decoding algorithm described below in connection with FIG. 3 on Digital Signal Processing (DSP) based hardware of the sort typically used in such applications. As disclosed above, the incoming encoded audio signal may be made available to the decoder 26 from either the audio output 28 or from the microphone 30 placed in the vicinity of the speakers 24.
  • As shown by step 50 in the flow chart of FIG. 3, a circular buffer capable of storing 4096 samples is initialized by setting all of its storage locations to zero. Also, a set of frequency bins are set to zero. At a block 51, 256 samples are read into an audio buffer. Also, a block sample counter is set to zero. Before recovering the actual data bits representing code information, it is necessary to locate the synchronization block which is preferably encoded by enhancing (or diminishing) the amplitude of a unique set of frequencies. In one preferred embodiment these frequencies have indexes 220, 349, 478, 607, and 736 and each one is in a different coding band. In order to search for the synchronization block, as well as to extract data from subsequent blocks within an incoming audio stream, the circular buffer is used. The circular buffer has a sufficient size to store 4096 samples in the case of half rate sampling. This arrangement is essential in order to implement a near real-time decoding scheme based on a sliding FFT routine which forms part of the decoding algorithm shown in the flow chart of FIG. 3.
  • Let it be assumed that, for the audio buffer currently stored in the circular buffer, there are a spectral amplitude B0[J] and a phase angle φ0[J] at a frequency with index J. The spectral amplitude B0[J] and the phase angle φ0[J] represent the spectral values for the 4096 audio samples currently in the circular buffer. If two new time domain samples v4094 and v4095 are read from the audio buffer and are inserted into the circular buffer as indicated by a step 52 so as to replace the two earliest samples v0 and v1 in the circular buffer, then the new spectral amplitude B1[J] and phase angle φ1[J] for each of the indices J are determined at a step 53 in accordance with the following equation: B 1 [ J ] exp ϕ 1 [ J ] = B 0 [ J ] exp ϕ 0 [ J ] + ( v 4094 exp ( 2 π J ( 4096 - 2 ) 4096 ) ) + ( V 4095 exp ( 2 π J ( 4096 - 1 ) 4096 ) ) - ( v 0 exp ( - 2 π J2 4096 ) ) - ( v 1 exp ( - 2 π J 4096 ) ) ( 7 )
    Thus, the spectrum of the circular buffer can be computed merely by updating the existing spectrum for the samples contained in the circular buffer according to equation (7). Even when all the spectral values—amplitude and phase—are initially set to 0 at the step 50, as new data enters the circular buffer, and as old data gets discarded, the spectral values gradually change until they correspond to the actual FFT spectral values for the data currently in the circular buffer. In order to overcome certain instabilities that may arise during computation, multiplication of the incoming audio samples by a stability factor (usually set to 0.99995) and multiplication of the discarded samples by a factor 0.999952048=0.902666 is known to most practitioners in this field. The sliding FFT algorithm provides a computationally efficient means of calculating the spectral components of interest for the 4095 samples preceding the current sample location and the current sample itself. The frequency bins are updated at the block 53 with the results of the analysis performed according to equation (7) If the block sample counter has a count which is a multiple of 64, the frequency bins are analyzed and the results of the analysis are stored in a Status Information Structure (SIS) as indicated in step 54 of FIG. 3. This value 64 may be used because the frequency spectrum of a long block of 4096 samples changes very little over a small number of samples of an audio stream. Even though the sliding FFT algorithm is used to update the spectral values in two sample increments, the analysis of the spectrum to locate the synchronization block and to extract data needs to be performed only every 64 samples. Thus, 4096/64=64 SIS structures are used to track the intermediate results of the decoding operation. These SIS structures are indexed as SIS0, SIS1, . . . SIS63. Each SIS structure is updated at 4096 sample intervals, which corresponds to the length of a long block in the half-sampling rate case. Each SIS structure contains a synchronization flag and a data storage location. Also, the SIS includes a counter.
  • The search for the synchronization block is the first step in the decoding process. Let us assume that at a sample location where the SIS SISk needs to be updated because a spectrum, which satisfies the characteristics of a synchronization block, is found. In such a spectrum, indexes 220, 349, 478, 607, 736 are enhanced and possess higher spectral power than their neighbors in the respective bands. Due to factors such as audio compression, audio degradation due to amplifier-speaker-microphone non-linearities, or ambient noise in the case of microphone based decoding systems, it is possible that not all the five bands have the desired characteristics. The redundant transmission feature described above enables detection of a long block as being a synchronization block even if only three of the five bands satisfy the criteria for a synchronization block. Once a synchronization block has been detected, a synchronization flag within the corresponding SIS structure is set to one. In a practical implementation, more than one SIS structure can have its synchronization flag set to one. Usually several adjacent SIS structures, for example, SISk−2, SISk−1, SISk, SISk+1, SISk+2, may all have synchronization flags set to one because the spectrum of a long audio block does not change rapidly.
  • When SISk is analyzed 4096 samples later, the algorithm recognizes the synchronization flag and attempts to extract the first three-bit data value encoded in the spectrum. This extraction may be done by means of a voting algorithm that compares test values taken from each of the neighborhoods and that accepts a test value as the data value if the same test value is found in three out of the five band neighborhoods. In addition, if a valid data value in the range zero to seven is extracted, the counter within the SIS is incremented to show that the first member of the sixteen member message data has been extracted. The extracted three-bit datum is also stored within the structure at a corresponding data storage location. In the event a valid datum is not found either at the current location or at any one of the fifteen subsequent locations where SISk is updated, the SIS structure's synchronization flag is reset to zero and the counter is reset to zero. These actions frees the SIS to once again look for synchronization blocks. When an SIS structure's counter increments to sixteen, it contains a full message packet consisting of forty-eight bits that could be transmitted out, as indicated in step 55 of the flow chart in FIG. 3. For example, the message packet may be transmitted to a Central Office. When this transmission is done, the synchronization flag is reset to zero and the counter is reset.
  • At a block 56, the block sample counter is incremented by two corresponding to the two samples read from the audio buffer to the circular buffer at the step 52. If the block sample counter does not have a count equal to 256, flow returns to the step 52 where two more samples from the audio buffer are read into the circular buffer. On the other hand, if the block sample counter does have a count equal to 256, flow returns to the step 51 where another 256 samples are inserted into the audio buffer.
  • Although the present invention has been described with respect to several preferred embodiments, many modifications and alterations can be made without departing from the invention. Accordingly, it is intended that all such modifications and alterations be considered as within the spirit and scope of the invention as defined in the attached claims.

Claims (33)

1. A system for adding an interference-resistant, inaudible code to an audio signal comprising:
a sampler arranged to sample the audio signal at a sampling rate and to generate therefrom a plurality of short blocks of sampled audio, each of the short blocks having a duration less than a minimum audibly perceivable signal delay;
a processor arranged to combine the plurality of short blocks into a long block having a predetermined minimum duration;
a frequency transformation arranged to transform the long block into a frequency domain signal comprising a plurality of independently modulatable frequency indices, wherein a frequency difference between two adjacent ones of the indices is determined by the minimum duration and the sampling rate;
a frequency selector arranged to select a neighborhood of frequency indices so that the frequency difference between a lowest index and a highest index within the neighborhood is less than a predetermined value; and,
an encoder arranged to modulate two or more of the indices in the neighborhood so as to make a selected one of the indices an extremum while keeping the total energy of the neighborhood constant.
2. The system of claim 1 wherein the processor comprises a digital computer having a buffer memory.
3. The system of claim 1 wherein the frequency transformation comprises a Fast Fourier Transform algorithm.
4. The system of claim 1 wherein the encoder comprises an algorithm that increases the energy of a selected index in the neighborhood and that decreases the energy of a short block associated therewith.
5. A method of adding a code to a frequency band of a sampled audio portion of a composite signal without thereby introducing a perceptible delay between the encoded audio portion and another portion of the composite signal, the method comprising the steps of:
a) selecting a sampling rate and a frequency difference between adjacent ones of a predetermined number of frequency indices included in a frequency neighborhood;
b) determining from the sampling rate and from the frequency difference a duration of a block of samples;
c) determining an integral number of sequential sub-blocks to make up the block, where the integral number is selected so that each of the sub-blocks has a sub-block duration less than the perceptible delay; and,
d) processing the block so as to modulate a selected one of the frequency indices without changing a total signal energy of the band.
6. The method of claim 5 wherein the composite signal comprises a television broadcast signal and wherein the another portion of the composite signal comprises a video signal.
7. The method of claim 5 wherein in step d) the processing comprises modulating two or more of the frequency indices within the neighborhood so as to make a selected one of the indices an extremum.
8. Apparatus for reading a code from an audio signal, the code comprising a sequence of blocks having a predetermined number of samples of the audio signal, the code comprising a synchronization block followed by a predetermined number of data blocks, the apparatus comprising:
a buffer memory arranged to hold one of the blocks;
a frequency transformation arranged to transform the one block into spectral data spanning a predetermined number of frequency bands, wherein each of the frequency bands comprises a respective neighborhood of frequency indices;
a processor arranged to determine, for each of the neighborhoods, if a respective predetermined one of the frequency indices is modulated; and,
a vote determiner arranged to determine that the one block is the synchronization block if, in a majority of the frequency bands, the respective modulated frequency index is a respective index selected for inclusion in the synchronization block;
wherein the processor is further arranged to determine if, in one of the data blocks received subsequent to the synchronization block, a respective predetermined one of the frequency indices is modulated;
wherein the vote determiner is further arranged to determine if, in a majority of the frequency bands, the respective modulated frequency index is a respective index selected for inclusion in the one data block.
9. The apparatus of claim 8 wherein the frequency transformation comprises a Fast Fourier Transform algorithm executed by a digital computer.
10. The apparatus of claim 8 wherein the processor comprises a general purpose digital computer operating under program control and having a plurality of algorithms stored in a memory.
11. The apparatus of claim 8 wherein the vote determiner comprises an algorithm executed by a digital computer.
12. A method of reading a code from an audio signal by sequentially transforming a sequence of blocks of audio samples into spectral data spanning a predetermined number of frequency bands, wherein each of the frequency bands comprises a predetermined number of frequency indices, wherein each of the blocks comprises a predetermined number of the samples, and wherein the code comprises a synchronization block followed by a predetermined number of data blocks, the method comprising the steps of:
a) determining, in each of the frequency bands of one of the blocks of audio samples, if one of the frequency indices is modulated;
b) comparing each modulated frequency index found in step a) with that index selected for modulation in the respective frequency band of the synchronization block;
c) determining that the one block is the synchronization block if the majority of the comparisons made in step b) result in a match, and otherwise repeating steps a) through b);
d) determining, in each of the frequency bands of one of the data blocks received subsequent to the synchronization block, if a respective one of the frequency indices is modulated; and,
e) comparing the respective modulated frequency indices found in step d) with ones of a plurality of predetermined index patterns, each of the index patterns uniquely associated with a respective code bit, and reading the code bit only if the majority of modulated indices match the predetermined index pattern.
13. The method of claim 12 wherein a value of k is read as the code bit in step e) if the kth index in each of the bands is modulated.
14. The method of claim 12 wherein the predetermined index pattern comprises a pseudo-random sequence.
15. A system for adding an inaudible code to a tone-like audio portion of a composite signal having two or more portions, the system comprising:
a sampling apparatus arranged to sample audio at a sampling rate and to generate therefrom a plurality of short blocks of sampled audio, each of the short blocks having a duration less than a minimum audibly perceptible signal delay;
a processor arranged to combine the plurality of short blocks into a long block having a predetermined minimum duration;
a frequency transformation arranged to transform the long block into a frequency domain signal comprising a plurality of independently modulatable frequency indices located in a plurality of frequency bands;
an encoder arranged to modulate two or more of the indices in each of the frequency bands so as to make a respective selected one of the indices an extremum while keeping a total acoustic energy of the audio constant;
a signal analyzer arranged to determine if the tone-like audio portion has a tone-like character within any one of the predetermined number of neighborhoods; and,
an encoder suspender arranged to suspend the encoding of the encoder within any neighborhood in which the tone-like audio portion has a tone-like character.
16. The system of claim 15 wherein the audio signal is part of a television broadcast signal.
17. The system of claim 15 wherein the frequency transformation comprises a Fast Fourier Transform algorithm.
18. The system of claim 16 wherein the signal analyzer comprises a computer arranged to carry out a masking algorithm described in ISO/IEC 13818-7:1997.
19. A method for adding an inaudible code to at least one of a predetermined number of frequency neighborhoods within a tone-like audio portion of a composite signal having one or more additional portions, the method comprising the steps of:
a) sampling the audio portion and generating from the sampled signal a plurality of short blocks, each of the short blocks having a duration less than a minimum audibly perceptible signal delay;
b) combining the plurality of short blocks into a long block having a predetermined minimum duration;
c) transforming the long block into a frequency domain signal comprising a plurality of independently modulatable frequency indices;
d) identifying those neighborhoods, if any, of the predetermined number of frequency neighborhoods in which the tone-like audio portion has a tone-like character; and,
e) modulating a respective index in each neighborhood not identified in step d) so as to make a selected index in such neighborhood an extremum while keeping the total acoustic energy of the audio portion constant, and not modulating an index in any of those neighborhoods identified in step d).
20. The method of claim 19 wherein the composite signal comprises a television broadcast signal and wherein one of the additional portions comprises a video signal.
21. The method of claim 19 wherein step c) comprises the step of transforming the long block according to a Fast Fourier Transform.
22. The method of claim 19 wherein step c) comprises a sub-step of carrying out a masking algorithm described in ISO/IEC 13818-7:1997.
23. A broadcast audience measurement system in which an inaudible code added to an audio signal is read by a decoding apparatus located within a statistically sampled dwelling, the system comprising:
an encoder arranged to add a predetermined code bit to each of a predetermined number of odd frequency bands within a bandwidth of the audio signal;
a receiver within the dwelling arranged to receive the encoded audio portion; and,
a decoder having an input from the receiver, the decoder arranged to acquire a respective test value of the code bit from each of the frequency bands, to compare the test values, to determine that one of the test values is the code bit only if that test value is acquired from a majority of the frequency bands, and to otherwise determine that no code bit has been read.
24. The broadcast audience measurement system of claim 23 wherein the audio signal is part of a television broadcast signal.
25. The broadcast audience measurement system of claim 23 wherein the receiver includes a microphone.
26. The broadcast audience measurement system of claim 23 wherein the receiver comprises an audio output jack.
27. A broadcast audience measurement system in which an inaudible code added to an audio signal is read within a statistically sampled dwelling unit, the system comprising:
an encoding apparatus arranged to add a code bit to a sampled long block of the audio signal, the long block comprising a predetermined number of short blocks, each of the short blocks having a predetermined duration that is selected to be short enough not to be perceptible to a member of a broadcast audience, the encoding apparatus being further arranged to modulate a selected frequency index in each of a plurality of frequency neighborhoods so as to make each selected index an extremum in the respective neighborhood thereof while keeping a total energy of the audio signal constant;
a receiver within the dwelling, the receiver being arranged to acquire the encoded audio signal; and,
a decoder arranged to read the code from the audio signal, the decoder having an input from the receiver, the decoder comprising a buffer memory arranged to store one of the short blocks, the buffer memory being arranged to store a long block.
28. The broadcast audience system of claim 27 wherein the audio signal is part of a television signal.
29. The broadcast audience system of claim 27 wherein the encoder comprises a frequency transformation arranged to transform the long block into a frequency domain signal.
30. The broadcast audience system of claim 27 wherein the receiver comprises a microphone.
31. The broadcast audience system of claim 27 wherein the receiver comprises an audio output jack.
32. A method of encoding an audio signal comprising the following steps:
a) generating a plurality of short blocks from the audio signal, wherein each of the short blocks has a duration less than a minimum audibly perceivable signal delay;
b) combining the plurality of short blocks into a long block;
c) transforming the long block into a spectrum comprising a plurality of independently modulatable frequency indices; and,
d) modulating at least two of the indices so as to make one of the indices an extremum while keeping the total energy of a neighborhood of the modulated indices substantially constant.
33. A method of reading a code element from an audio signal comprising the following steps:
a) transforming at least a portion of the audio signal into spectral data spanning a predetermined number of frequency bands having a plurality of frequency neighborhoods;
b) determining, for each of the neighborhoods, if one of the frequency indices is modulated; and,
c) assigning a transmitted code value to the code element if, in a majority of the neighborhoods, the respective modulated frequency index is an index selected for inclusion in the audio signal.
US11/100,291 2000-04-06 2005-04-06 Multi-band spectral audio encoding Abandoned US20050177361A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US11/100,291 US20050177361A1 (en) 2000-04-06 2005-04-06 Multi-band spectral audio encoding

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/543,480 US6968564B1 (en) 2000-04-06 2000-04-06 Multi-band spectral audio encoding
US11/100,291 US20050177361A1 (en) 2000-04-06 2005-04-06 Multi-band spectral audio encoding

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US09/543,480 Continuation US6968564B1 (en) 2000-04-06 2000-04-06 Multi-band spectral audio encoding

Publications (1)

Publication Number Publication Date
US20050177361A1 true US20050177361A1 (en) 2005-08-11

Family

ID=24168239

Family Applications (2)

Application Number Title Priority Date Filing Date
US09/543,480 Expired - Lifetime US6968564B1 (en) 2000-04-06 2000-04-06 Multi-band spectral audio encoding
US11/100,291 Abandoned US20050177361A1 (en) 2000-04-06 2005-04-06 Multi-band spectral audio encoding

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US09/543,480 Expired - Lifetime US6968564B1 (en) 2000-04-06 2000-04-06 Multi-band spectral audio encoding

Country Status (11)

Country Link
US (2) US6968564B1 (en)
EP (1) EP1269669B1 (en)
JP (1) JP2003530763A (en)
CN (2) CN1645774A (en)
AU (3) AU5127401A (en)
BR (1) BR0107542A (en)
CA (1) CA2405179C (en)
MX (1) MXPA02009683A (en)
NO (1) NO20024778L (en)
WO (1) WO2001078271A2 (en)
ZA (1) ZA200207800B (en)

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040210922A1 (en) * 2002-01-08 2004-10-21 Peiffer John C. Method and apparatus for identifying a digital audio dignal
US20040267533A1 (en) * 2000-09-14 2004-12-30 Hannigan Brett T Watermarking in the time-frequency domain
US20060072785A1 (en) * 2000-09-11 2006-04-06 Davidson Clayton L Watermark encoding and decoding
US20070006275A1 (en) * 2004-02-17 2007-01-04 Wright David H Methods and apparatus for monitoring video games
WO2008091697A1 (en) * 2007-01-25 2008-07-31 Arbitron, Inc. Research data gathering
US20080281604A1 (en) * 2007-05-08 2008-11-13 Samsung Electronics Co., Ltd. Method and apparatus to encode and decode an audio signal
US7480393B2 (en) 2003-11-19 2009-01-20 Digimarc Corporation Optimized digital watermarking functions for streaming data
US20090192788A1 (en) * 2008-01-25 2009-07-30 Yamaha Corporation Sound Processing Device and Program
US20140180673A1 (en) * 2012-12-21 2014-06-26 Arbitron Inc. Audio Processing Techniques for Semantic Audio Recognition and Report Generation
US8908909B2 (en) 2009-05-21 2014-12-09 Digimarc Corporation Watermark decoding with selective accumulation of components
US9158760B2 (en) 2012-12-21 2015-10-13 The Nielsen Company (Us), Llc Audio decoding with supplemental semantic audio recognition and report generation
US9183849B2 (en) 2012-12-21 2015-11-10 The Nielsen Company (Us), Llc Audio matching with semantic audio recognition and report generation
WO2016061353A1 (en) * 2014-10-15 2016-04-21 Lisnr, Inc. Inaudible signaling tone
US9466307B1 (en) 2007-05-22 2016-10-11 Digimarc Corporation Robust spectral encoding and decoding methods
CN107516528A (en) * 2017-08-31 2017-12-26 惠州华阳通用电子有限公司 A kind of voice frequency link self checking method
US10826623B2 (en) 2017-12-19 2020-11-03 Lisnr, Inc. Phase shift keyed signaling tone
US10885543B1 (en) * 2006-12-29 2021-01-05 The Nielsen Company (Us), Llc Systems and methods to pre-scale media content to facilitate audience measurement
US11074033B2 (en) 2012-05-01 2021-07-27 Lisnr, Inc. Access control and validation using sonic tones
US11189295B2 (en) 2017-09-28 2021-11-30 Lisnr, Inc. High bandwidth sonic tone generation
US11233582B2 (en) 2016-03-25 2022-01-25 Lisnr, Inc. Local tone generation
US11452153B2 (en) 2012-05-01 2022-09-20 Lisnr, Inc. Pairing and gateway connection using sonic tones
US11628853B2 (en) 2018-03-05 2023-04-18 Continental Automotive France Method for inspecting the emission of an audio safety message in a vehicle

Families Citing this family (70)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7756892B2 (en) * 2000-05-02 2010-07-13 Digimarc Corporation Using embedded data with file sharing
US8752118B1 (en) 1999-05-19 2014-06-10 Digimarc Corporation Audio and video content-based methods
KR100865247B1 (en) 2000-01-13 2008-10-27 디지맥 코포레이션 Authenticating metadata and embedding metadata in watermarks of media signals
DE60209888T2 (en) * 2001-05-08 2006-11-23 Koninklijke Philips Electronics N.V. CODING AN AUDIO SIGNAL
US7006662B2 (en) 2001-12-13 2006-02-28 Digimarc Corporation Reversible watermarking using expansion, rate control and iterative embedding
US7239981B2 (en) 2002-07-26 2007-07-03 Arbitron Inc. Systems and methods for gathering audience measurement data
US7395062B1 (en) 2002-09-13 2008-07-01 Nielson Media Research, Inc. A Delaware Corporation Remote sensing system
US9711153B2 (en) 2002-09-27 2017-07-18 The Nielsen Company (Us), Llc Activating functions in processing devices using encoded audio and detecting audio signatures
US8959016B2 (en) 2002-09-27 2015-02-17 The Nielsen Company (Us), Llc Activating functions in processing devices using start codes embedded in audio
WO2004038538A2 (en) 2002-10-23 2004-05-06 Nielsen Media Research, Inc. Digital data insertion apparatus and methods for use with compressed audio/video data
EP1586045A1 (en) 2002-12-27 2005-10-19 Nielsen Media Research, Inc. Methods and apparatus for transcoding metadata
US7142250B1 (en) * 2003-04-05 2006-11-28 Apple Computer, Inc. Method and apparatus for synchronizing audio and video streams
US7043204B2 (en) * 2003-06-26 2006-05-09 The Regents Of The University Of California Through-the-earth radio
CA2562137C (en) 2004-04-07 2012-11-27 Nielsen Media Research, Inc. Data insertion apparatus and methods for use with compressed audio/video data
JP4896455B2 (en) * 2005-07-11 2012-03-14 株式会社エヌ・ティ・ティ・ドコモ Data embedding device, data embedding method, data extracting device, and data extracting method
WO2008008915A2 (en) 2006-07-12 2008-01-17 Arbitron Inc. Methods and systems for compliance confirmation and incentives
US20080134264A1 (en) * 2006-11-30 2008-06-05 Motorola, Inc. Method and apparatus for interactivity with broadcast media
US8060372B2 (en) 2007-02-20 2011-11-15 The Nielsen Company (Us), Llc Methods and appratus for characterizing media
US20100174608A1 (en) * 2007-03-22 2010-07-08 Harkness David H Digital rights management and audience measurement systems and methods
US8458737B2 (en) 2007-05-02 2013-06-04 The Nielsen Company (Us), Llc Methods and apparatus for generating signatures
JP5414684B2 (en) 2007-11-12 2014-02-12 ザ ニールセン カンパニー (ユー エス) エルエルシー Method and apparatus for performing audio watermarking, watermark detection, and watermark extraction
US8457951B2 (en) 2008-01-29 2013-06-04 The Nielsen Company (Us), Llc Methods and apparatus for performing variable black length watermarking of media
US8600531B2 (en) 2008-03-05 2013-12-03 The Nielsen Company (Us), Llc Methods and apparatus for generating signatures
US8275209B2 (en) * 2008-10-10 2012-09-25 Microsoft Corporation Reduced DC gain mismatch and DC leakage in overlap transform processing
US8121830B2 (en) 2008-10-24 2012-02-21 The Nielsen Company (Us), Llc Methods and apparatus to extract data encoded in media content
US9667365B2 (en) * 2008-10-24 2017-05-30 The Nielsen Company (Us), Llc Methods and apparatus to perform audio watermarking and watermark detection and extraction
AU2013203674B2 (en) * 2008-10-24 2016-01-14 The Nielsen Company (Us), Llc Methods and apparatus to perform audio watermarking and watermark detection and extraction
US8359205B2 (en) 2008-10-24 2013-01-22 The Nielsen Company (Us), Llc Methods and apparatus to perform audio watermarking and watermark detection and extraction
US8108887B2 (en) 2008-10-30 2012-01-31 The Nielsen Company (Us), Llc Methods and apparatus for identifying media content using temporal signal characteristics
US20100205628A1 (en) 2009-02-12 2010-08-12 Davis Bruce L Media processing methods and arrangements
US8508357B2 (en) 2008-11-26 2013-08-13 The Nielsen Company (Us), Llc Methods and apparatus to encode and decode audio for shopper location and advertisement presentation tracking
US8265450B2 (en) * 2009-01-16 2012-09-11 Apple Inc. Capturing and inserting closed captioning data in digital video
US10008212B2 (en) * 2009-04-17 2018-06-26 The Nielsen Company (Us), Llc System and method for utilizing audio encoding for measuring media exposure with environmental masking
US20100268573A1 (en) * 2009-04-17 2010-10-21 Anand Jain System and method for utilizing supplemental audio beaconing in audience measurement
US8392004B2 (en) * 2009-04-30 2013-03-05 Apple Inc. Automatic audio adjustment
AU2013203888B2 (en) * 2009-05-01 2015-02-12 The Nielsen Company (Us), Llc Methods, apparatus and articles of manufacture to provide secondary content in association with primary broadcast media content
CA2760677C (en) 2009-05-01 2018-07-24 David Henry Harkness Methods, apparatus and articles of manufacture to provide secondary content in association with primary broadcast media content
US8245249B2 (en) * 2009-10-09 2012-08-14 The Nielson Company (Us), Llc Methods and apparatus to adjust signature matching results for audience measurement
US8175617B2 (en) 2009-10-28 2012-05-08 Digimarc Corporation Sensor-based mobile search, related methods and systems
US9218530B2 (en) 2010-11-04 2015-12-22 Digimarc Corporation Smartphone-based methods and systems
US8121618B2 (en) 2009-10-28 2012-02-21 Digimarc Corporation Intuitive computing methods and systems
US8355910B2 (en) 2010-03-30 2013-01-15 The Nielsen Company (Us), Llc Methods and apparatus for audio watermarking a substantially silent media content presentation
US8676570B2 (en) 2010-04-26 2014-03-18 The Nielsen Company (Us), Llc Methods, apparatus and articles of manufacture to perform audio watermark decoding
US8842842B2 (en) 2011-02-01 2014-09-23 Apple Inc. Detection of audio channel configuration
US8621355B2 (en) 2011-02-02 2013-12-31 Apple Inc. Automatic synchronization of media clips
US9196028B2 (en) 2011-09-23 2015-11-24 Digimarc Corporation Context-based smartphone sensor logic
US9380356B2 (en) 2011-04-12 2016-06-28 The Nielsen Company (Us), Llc Methods and apparatus to generate a tag for media content
TWI450266B (en) * 2011-04-19 2014-08-21 Hon Hai Prec Ind Co Ltd Electronic device and decoding method of audio files
US9209978B2 (en) 2012-05-15 2015-12-08 The Nielsen Company (Us), Llc Methods and apparatus to measure exposure to streaming media
US9515904B2 (en) 2011-06-21 2016-12-06 The Nielsen Company (Us), Llc Monitoring streaming media content
US8965774B2 (en) 2011-08-23 2015-02-24 Apple Inc. Automatic detection of audio compression parameters
US8498627B2 (en) 2011-09-15 2013-07-30 Digimarc Corporation Intuitive computing methods and systems
US9402099B2 (en) 2011-10-14 2016-07-26 Digimarc Corporation Arrangements employing content identification and/or distribution identification data
US9223893B2 (en) 2011-10-14 2015-12-29 Digimarc Corporation Updating social graph data using physical objects identified from images captured by smartphone
US9332363B2 (en) 2011-12-30 2016-05-03 The Nielsen Company (Us), Llc System and method for determining meter presence utilizing ambient fingerprints
US9282366B2 (en) 2012-08-13 2016-03-08 The Nielsen Company (Us), Llc Methods and apparatus to communicate audience measurement information
US9305559B2 (en) 2012-10-15 2016-04-05 Digimarc Corporation Audio watermark encoding with reversing polarity and pairwise embedding
US9401153B2 (en) 2012-10-15 2016-07-26 Digimarc Corporation Multi-mode audio recognition and auxiliary data encoding and decoding
US9313544B2 (en) 2013-02-14 2016-04-12 The Nielsen Company (Us), Llc Methods and apparatus to measure exposure to streaming media
US9311640B2 (en) 2014-02-11 2016-04-12 Digimarc Corporation Methods and arrangements for smartphone payments and transactions
US20150039321A1 (en) 2013-07-31 2015-02-05 Arbitron Inc. Apparatus, System and Method for Reading Codes From Digital Audio on a Processing Device
US9711152B2 (en) 2013-07-31 2017-07-18 The Nielsen Company (Us), Llc Systems apparatus and methods for encoding/decoding persistent universal media codes to encoded audio
US9699499B2 (en) 2014-04-30 2017-07-04 The Nielsen Company (Us), Llc Methods and apparatus to measure exposure to streaming media
EP2980794A1 (en) 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder and decoder using a frequency domain processor and a time domain processor
EP2980795A1 (en) * 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoding and decoding using a frequency domain processor, a time domain processor and a cross processor for initialization of the time domain processor
US10147433B1 (en) 2015-05-03 2018-12-04 Digimarc Corporation Digital watermark encoding and decoding with localization and payload replacement
US9762965B2 (en) 2015-05-29 2017-09-12 The Nielsen Company (Us), Llc Methods and apparatus to measure exposure to streaming media
US10236031B1 (en) 2016-04-05 2019-03-19 Digimarc Corporation Timeline reconstruction using dynamic path estimation from detections in audio-video signals
CN111126001A (en) * 2019-11-19 2020-05-08 深圳追一科技有限公司 Character marking method, device, equipment and storage medium
CN112953873B (en) * 2021-02-10 2022-07-29 西南电子技术研究所(中国电子科技集团公司第十研究所) High-dynamic weak 8PSK/16PSK signal carrier capturing method

Citations (96)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US2573279A (en) * 1946-11-09 1951-10-30 Serge A Scherbatskoy System of determining the listening habits of wave signal receiver users
US2630525A (en) * 1951-05-25 1953-03-03 Musicast Inc System for transmitting and receiving coded entertainment programs
US2766374A (en) * 1951-07-25 1956-10-09 Internat Telementer Corp System and apparatus for determining popularity ratings of different transmitted programs
US2982813A (en) * 1958-08-28 1961-05-02 Sound
US3004104A (en) * 1954-04-29 1961-10-10 Muzak Corp Identification of sound and like signals
US3492577A (en) * 1966-10-07 1970-01-27 Intern Telemeter Corp Audience rating system
US3684838A (en) * 1968-06-26 1972-08-15 Kahn Res Lab Single channel audio signal transmission system
US3696298A (en) * 1970-07-27 1972-10-03 Kahn Res Lab Audio signal transmission system and method
US3733430A (en) * 1970-12-28 1973-05-15 Rca Corp Channel monitoring system
US3735048A (en) * 1971-05-28 1973-05-22 Motorola Inc In-band data transmission system
US3760275A (en) * 1970-10-24 1973-09-18 T Ohsawa Automatic telecasting or radio broadcasting monitoring system
US3845391A (en) * 1969-07-08 1974-10-29 Audicom Corp Communication including submerged identification signal
US4025851A (en) * 1975-11-28 1977-05-24 A.C. Nielsen Company Automatic monitor for programs broadcast
US4134127A (en) * 1975-06-12 1979-01-09 Indesit Industria Elettrodomestici Italiana S.P.A. Color television signal including auxiliary information
US4225967A (en) * 1978-01-09 1980-09-30 Fujitsu Limited Broadcast acknowledgement method and system
US4285498A (en) * 1976-05-17 1981-08-25 Imperial Chemical Industries Limited Control valves
US4313197A (en) * 1980-04-09 1982-01-26 Bell Telephone Laboratories, Incorporated Spread spectrum arrangement for (de)multiplexing speech signals and nonspeech signals
US4379947A (en) * 1979-02-02 1983-04-12 Teleprompter Corporation System for transmitting data simultaneously with audio
US4425642A (en) * 1982-01-08 1984-01-10 Applied Spectrum Technologies, Inc. Simultaneous transmission of two information signals within a band-limited communications channel
US4425661A (en) * 1981-09-03 1984-01-10 Applied Spectrum Technologies, Inc. Data under voice communications system
US4512013A (en) * 1983-04-11 1985-04-16 At&T Bell Laboratories Simultaneous transmission of speech and data over an analog channel
US4523311A (en) * 1983-04-11 1985-06-11 At&T Bell Laboratories Simultaneous transmission of speech and data over an analog channel
US4652915A (en) * 1985-11-12 1987-03-24 Control Data Corporation Method for polling headphones of a passive TV audience meter system
US4677466A (en) * 1985-07-29 1987-06-30 A. C. Nielsen Company Broadcast program identification method and apparatus
US4688255A (en) * 1984-05-29 1987-08-18 Kahn Leonard R Compatible AM broadcast/data transmisison system
US4697209A (en) * 1984-04-26 1987-09-29 A. C. Nielsen Company Methods and apparatus for automatically identifying programs viewed or recorded
US4703476A (en) * 1983-09-16 1987-10-27 Audicom Corporation Encoding of transmitted program material
US4750173A (en) * 1985-05-21 1988-06-07 Polygram International Holding B.V. Method of transmitting audio information and additional information in digital form
US4750053A (en) * 1984-02-02 1988-06-07 Broadcast Advertisers Reports, Inc. Method and system for enabling television commerical monitoring using a marking signal superimposed over an audio signal
US4771455A (en) * 1982-05-17 1988-09-13 Sony Corporation Scrambling apparatus
US4876617A (en) * 1986-05-06 1989-10-24 Thorn Emi Plc Signal identification
US4931871A (en) * 1988-06-14 1990-06-05 Kramer Robert A Method of and system for identification and verification of broadcasted program segments
US4943973A (en) * 1989-03-31 1990-07-24 At&T Company Spread-spectrum identification signal for communications system
US4945412A (en) * 1988-06-14 1990-07-31 Kramer Robert A Method of and system for identification and verification of broadcasting television and radio program segments
US4956709A (en) * 1988-03-11 1990-09-11 Pbs Enterprises, Inc. Forward error correction of data transmitted via television signals
US4972471A (en) * 1989-05-15 1990-11-20 Gary Gross Encoding system
US5079647A (en) * 1989-02-14 1992-01-07 Sony Corporation Method and apparatus for recording/reproducing monaural audio signal mixed with the clock and data signals
US5113437A (en) * 1988-10-25 1992-05-12 Thorn Emi Plc Signal identification system
US5212551A (en) * 1989-10-16 1993-05-18 Conanan Virgilio D Method and apparatus for adaptively superimposing bursts of texts over audio signals and decoder thereof
US5213337A (en) * 1988-07-06 1993-05-25 Robert Sherman System for communication using a broadcast audio signal
US5227874A (en) * 1986-03-10 1993-07-13 Kohorn H Von Method for measuring the effectiveness of stimuli on decisions of shoppers
US5319735A (en) * 1991-12-17 1994-06-07 Bolt Beranek And Newman Inc. Embedded signalling
US5355161A (en) * 1993-07-28 1994-10-11 Concord Media Systems Identification system for broadcast program segments
US5379345A (en) * 1993-01-29 1995-01-03 Radio Audit Systems, Inc. Method and apparatus for the processing of encoded data in conjunction with an audio broadcast
US5394274A (en) * 1988-01-22 1995-02-28 Kahn; Leonard R. Anti-copy system utilizing audible and inaudible protection signals
US5404377A (en) * 1994-04-08 1995-04-04 Moses; Donald W. Simultaneous transmission of data and audio signals by means of perceptual coding
US5450490A (en) * 1994-03-31 1995-09-12 The Arbitron Company Apparatus and methods for including codes in audio signals and decoding
US5463423A (en) * 1992-03-11 1995-10-31 Thomson Consumer Electronics, Inc. Auxiliary video data detector and data slicer
US5481370A (en) * 1992-08-07 1996-01-02 Samsung Electronics Co., Ltd. Apparatus for discriminating audio signals
US5535300A (en) * 1988-12-30 1996-07-09 At&T Corp. Perceptual coding of audio signals using entropy coding and/or multiple power spectra
US5534941A (en) * 1994-05-20 1996-07-09 Encore Media Corporation System for dynamic real-time television channel expansion
US5550593A (en) * 1992-11-30 1996-08-27 Sharp Kabushiki Kaisha Multiplex communication system using separated and multiplexed data
US5574963A (en) * 1995-07-31 1996-11-12 Lee S. Weinblatt Audience measurement during a mute mode
US5574962A (en) * 1991-09-30 1996-11-12 The Arbitron Company Method and apparatus for automatically identifying a program including a sound signal
US5579124A (en) * 1992-11-16 1996-11-26 The Arbitron Company Method and apparatus for encoding/decoding broadcast or recorded segments and monitoring audience exposure thereto
US5594934A (en) * 1994-09-21 1997-01-14 A.C. Nielsen Company Real time correlation meter
US5612729A (en) * 1992-04-30 1997-03-18 The Arbitron Company Method and system for producing a signature characterizing an audio broadcast signal
US5629739A (en) * 1995-03-06 1997-05-13 A.C. Nielsen Company Apparatus and method for injecting an ancillary signal into a low energy density portion of a color television frequency spectrum
US5668805A (en) * 1993-11-25 1997-09-16 Sony Corporation Multiplex broadcasting method and system
US5675388A (en) * 1982-06-24 1997-10-07 Cooper; J. Carl Apparatus and method for transmitting audio signals as part of a television video signal
US5687191A (en) * 1995-12-06 1997-11-11 Solana Technology Development Corporation Post-compression hidden data transport
US5719937A (en) * 1995-12-06 1998-02-17 Solana Technology Develpment Corporation Multi-media copy management system
US5731841A (en) * 1994-05-25 1998-03-24 Wavephore, Inc. High performance data tuner for video systems
US5745604A (en) * 1993-11-18 1998-04-28 Digimarc Corporation Identification/authentication system using robust, distributed coding
US5757417A (en) * 1995-12-06 1998-05-26 International Business Machines Corporation Method and apparatus for screening audio-visual materials presented to a subscriber
US5761606A (en) * 1996-02-08 1998-06-02 Wolzien; Thomas R. Media online services access via address embedded in video or audio program
US5768680A (en) * 1995-05-05 1998-06-16 Thomas; C. David Media monitor
US5774452A (en) * 1995-03-14 1998-06-30 Aris Technologies, Inc. Apparatus and method for encoding and decoding information in audio signals
US5808689A (en) * 1994-04-20 1998-09-15 Shoot The Moon Products, Inc. Method and apparatus for nesting secondary signals within a television signal
US5822436A (en) * 1996-04-25 1998-10-13 Digimarc Corporation Photographic products and methods employing embedded information
US5822360A (en) * 1995-09-06 1998-10-13 Solana Technology Development Corporation Method and apparatus for transporting auxiliary data in audio signals
US5826165A (en) * 1997-01-21 1998-10-20 Hughes Electronics Corporation Advertisement reconciliation system
US5856973A (en) * 1996-09-10 1999-01-05 Thompson; Kenneth M. Data multiplexing in MPEG server to decoder systems
US5930369A (en) * 1995-09-28 1999-07-27 Nec Research Institute, Inc. Secure spread spectrum watermarking for multimedia data
US5972471A (en) * 1994-10-31 1999-10-26 Morton International, Inc. Decorative coating with textured pattern
US6035177A (en) * 1996-02-26 2000-03-07 Donald W. Moses Simultaneous transmission of ancillary and audio signals by means of perceptual coding
US6253185B1 (en) * 1998-02-25 2001-06-26 Lucent Technologies Inc. Multiple description transform coding of audio using optimal transforms of arbitrary dimension
US6266430B1 (en) * 1993-11-18 2001-07-24 Digimarc Corporation Audio or video steganography
US6286100B1 (en) * 1996-11-27 2001-09-04 International Business Machines Corporation Method for hiding message data into media data and a method for extracting that hidden data
US6304966B1 (en) * 1996-12-25 2001-10-16 International Business Machines Corporation Data hiding method and system using statistical properties
US6308150B1 (en) * 1998-06-16 2001-10-23 Matsushita Electric Industrial Co., Ltd. Dynamic bit allocation apparatus and method for audio coding
US6338037B1 (en) * 1996-03-05 2002-01-08 Central Research Laboratories Limited Audio signal identification using code labels inserted in the audio signal
US20020006203A1 (en) * 1999-12-22 2002-01-17 Ryuki Tachibana Electronic watermarking method and apparatus for compressed audio data, and system therefor
US20020010919A1 (en) * 1998-05-12 2002-01-24 Nielsen Media Research, Inc. Audience measurement system for digital television
US6349284B1 (en) * 1997-11-20 2002-02-19 Samsung Sdi Co., Ltd. Scalable audio encoding/decoding method and apparatus
US6353672B1 (en) * 1993-11-18 2002-03-05 Digimarc Corporation Steganography using dynamic codes
US6359573B1 (en) * 1999-08-31 2002-03-19 Yamaha Corporation Method and system for embedding electronic watermark information in main information
US6385329B1 (en) * 2000-02-14 2002-05-07 Digimarc Corporation Wavelet domain watermarks
US20020055398A1 (en) * 1999-03-12 2002-05-09 Halko Roman D. Multilayer golf ball with wound intermediate layer
US6389055B1 (en) * 1998-03-30 2002-05-14 Lucent Technologies, Inc. Integrating digital data with perceptible signals
US6421445B1 (en) * 1994-03-31 2002-07-16 Arbitron Inc. Apparatus and methods for including codes in audio signals
US6512796B1 (en) * 1996-03-04 2003-01-28 Douglas Sherwood Method and system for inserting and retrieving data in an audio signal
US6519769B1 (en) * 1998-11-09 2003-02-11 General Electric Company Audience measurement system employing local time coincidence coding
US6574350B1 (en) * 1995-05-08 2003-06-03 Digimarc Corporation Digital watermarking employing both frail and robust watermarks
US6584138B1 (en) * 1996-03-07 2003-06-24 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Coding process for inserting an inaudible data signal into an audio signal, decoding process, coder and decoder
US6904089B1 (en) * 1998-12-28 2005-06-07 Matsushita Electric Industrial Co., Ltd. Encoding device and decoding device

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3733460A (en) 1971-09-15 1973-05-15 Gec Bridgeport Apparatus for heating dispensed flowable material
DE2757171C3 (en) 1977-12-22 1980-07-10 Standard Elektrik Lorenz Ag, 7000 Stuttgart Method and arrangement for the transmission of two different pieces of information in a single transmission channel with a given bandwidth on a carrier wave
WO1992012607A1 (en) * 1991-01-08 1992-07-23 Dolby Laboratories Licensing Corporation Encoder/decoder for multidimensional sound fields
US5285498A (en) 1992-03-02 1994-02-08 At&T Bell Laboratories Method and apparatus for coding audio signals based on perceptual model
EP0959621B1 (en) 1993-11-18 2001-02-28 Digimarc Corporation Video copy control with plural embedded signals
US5832119C1 (en) 1993-11-18 2002-03-05 Digimarc Corp Methods for controlling systems using control signals embedded in empirical data
US5689822A (en) 1995-02-17 1997-11-18 Zucker; Leo Wireless coupled adapter for decoding information from a broadcast signal to which a radio is tuned
FR2734977B1 (en) * 1995-06-02 1997-07-25 Telediffusion Fse DATA DISSEMINATION SYSTEM.
US5699124A (en) 1995-06-28 1997-12-16 General Instrument Corporation Of Delaware Bandwidth efficient communication of user data in digital television data stream
US5703877A (en) 1995-11-22 1997-12-30 General Instrument Corporation Of Delaware Acquisition and error recovery of audio data carried in a packetized data stream
EP1134724B1 (en) 2000-03-17 2008-07-23 Sony France S.A. Real time audio spatialisation system with high level control

Patent Citations (99)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US2573279A (en) * 1946-11-09 1951-10-30 Serge A Scherbatskoy System of determining the listening habits of wave signal receiver users
US2630525A (en) * 1951-05-25 1953-03-03 Musicast Inc System for transmitting and receiving coded entertainment programs
US2766374A (en) * 1951-07-25 1956-10-09 Internat Telementer Corp System and apparatus for determining popularity ratings of different transmitted programs
US3004104A (en) * 1954-04-29 1961-10-10 Muzak Corp Identification of sound and like signals
US2982813A (en) * 1958-08-28 1961-05-02 Sound
US3492577A (en) * 1966-10-07 1970-01-27 Intern Telemeter Corp Audience rating system
US3684838A (en) * 1968-06-26 1972-08-15 Kahn Res Lab Single channel audio signal transmission system
US3845391A (en) * 1969-07-08 1974-10-29 Audicom Corp Communication including submerged identification signal
US3696298A (en) * 1970-07-27 1972-10-03 Kahn Res Lab Audio signal transmission system and method
US3760275A (en) * 1970-10-24 1973-09-18 T Ohsawa Automatic telecasting or radio broadcasting monitoring system
US3733430A (en) * 1970-12-28 1973-05-15 Rca Corp Channel monitoring system
US3735048A (en) * 1971-05-28 1973-05-22 Motorola Inc In-band data transmission system
US4134127A (en) * 1975-06-12 1979-01-09 Indesit Industria Elettrodomestici Italiana S.P.A. Color television signal including auxiliary information
US4025851A (en) * 1975-11-28 1977-05-24 A.C. Nielsen Company Automatic monitor for programs broadcast
US4285498A (en) * 1976-05-17 1981-08-25 Imperial Chemical Industries Limited Control valves
US4225967A (en) * 1978-01-09 1980-09-30 Fujitsu Limited Broadcast acknowledgement method and system
US4379947A (en) * 1979-02-02 1983-04-12 Teleprompter Corporation System for transmitting data simultaneously with audio
US4313197A (en) * 1980-04-09 1982-01-26 Bell Telephone Laboratories, Incorporated Spread spectrum arrangement for (de)multiplexing speech signals and nonspeech signals
US4425661A (en) * 1981-09-03 1984-01-10 Applied Spectrum Technologies, Inc. Data under voice communications system
US4425642A (en) * 1982-01-08 1984-01-10 Applied Spectrum Technologies, Inc. Simultaneous transmission of two information signals within a band-limited communications channel
US4771455A (en) * 1982-05-17 1988-09-13 Sony Corporation Scrambling apparatus
US5675388A (en) * 1982-06-24 1997-10-07 Cooper; J. Carl Apparatus and method for transmitting audio signals as part of a television video signal
US4523311A (en) * 1983-04-11 1985-06-11 At&T Bell Laboratories Simultaneous transmission of speech and data over an analog channel
US4512013A (en) * 1983-04-11 1985-04-16 At&T Bell Laboratories Simultaneous transmission of speech and data over an analog channel
US4703476A (en) * 1983-09-16 1987-10-27 Audicom Corporation Encoding of transmitted program material
US4750053A (en) * 1984-02-02 1988-06-07 Broadcast Advertisers Reports, Inc. Method and system for enabling television commerical monitoring using a marking signal superimposed over an audio signal
US4697209A (en) * 1984-04-26 1987-09-29 A. C. Nielsen Company Methods and apparatus for automatically identifying programs viewed or recorded
US4688255A (en) * 1984-05-29 1987-08-18 Kahn Leonard R Compatible AM broadcast/data transmisison system
US4750173A (en) * 1985-05-21 1988-06-07 Polygram International Holding B.V. Method of transmitting audio information and additional information in digital form
US4677466A (en) * 1985-07-29 1987-06-30 A. C. Nielsen Company Broadcast program identification method and apparatus
US4652915A (en) * 1985-11-12 1987-03-24 Control Data Corporation Method for polling headphones of a passive TV audience meter system
US5227874A (en) * 1986-03-10 1993-07-13 Kohorn H Von Method for measuring the effectiveness of stimuli on decisions of shoppers
US4876617A (en) * 1986-05-06 1989-10-24 Thorn Emi Plc Signal identification
US5394274A (en) * 1988-01-22 1995-02-28 Kahn; Leonard R. Anti-copy system utilizing audible and inaudible protection signals
US4956709A (en) * 1988-03-11 1990-09-11 Pbs Enterprises, Inc. Forward error correction of data transmitted via television signals
US4931871A (en) * 1988-06-14 1990-06-05 Kramer Robert A Method of and system for identification and verification of broadcasted program segments
US4945412A (en) * 1988-06-14 1990-07-31 Kramer Robert A Method of and system for identification and verification of broadcasting television and radio program segments
US5213337A (en) * 1988-07-06 1993-05-25 Robert Sherman System for communication using a broadcast audio signal
US5113437A (en) * 1988-10-25 1992-05-12 Thorn Emi Plc Signal identification system
US5535300A (en) * 1988-12-30 1996-07-09 At&T Corp. Perceptual coding of audio signals using entropy coding and/or multiple power spectra
US5079647A (en) * 1989-02-14 1992-01-07 Sony Corporation Method and apparatus for recording/reproducing monaural audio signal mixed with the clock and data signals
US4943973A (en) * 1989-03-31 1990-07-24 At&T Company Spread-spectrum identification signal for communications system
US4972471A (en) * 1989-05-15 1990-11-20 Gary Gross Encoding system
US5212551A (en) * 1989-10-16 1993-05-18 Conanan Virgilio D Method and apparatus for adaptively superimposing bursts of texts over audio signals and decoder thereof
US5787334A (en) * 1991-09-30 1998-07-28 Ceridian Corporation Method and apparatus for automatically identifying a program including a sound signal
US5574962A (en) * 1991-09-30 1996-11-12 The Arbitron Company Method and apparatus for automatically identifying a program including a sound signal
US5319735A (en) * 1991-12-17 1994-06-07 Bolt Beranek And Newman Inc. Embedded signalling
US5463423A (en) * 1992-03-11 1995-10-31 Thomson Consumer Electronics, Inc. Auxiliary video data detector and data slicer
US5612729A (en) * 1992-04-30 1997-03-18 The Arbitron Company Method and system for producing a signature characterizing an audio broadcast signal
US5481370A (en) * 1992-08-07 1996-01-02 Samsung Electronics Co., Ltd. Apparatus for discriminating audio signals
US5579124A (en) * 1992-11-16 1996-11-26 The Arbitron Company Method and apparatus for encoding/decoding broadcast or recorded segments and monitoring audience exposure thereto
US5550593A (en) * 1992-11-30 1996-08-27 Sharp Kabushiki Kaisha Multiplex communication system using separated and multiplexed data
US5379345A (en) * 1993-01-29 1995-01-03 Radio Audit Systems, Inc. Method and apparatus for the processing of encoded data in conjunction with an audio broadcast
US5355161A (en) * 1993-07-28 1994-10-11 Concord Media Systems Identification system for broadcast program segments
US5768426A (en) * 1993-11-18 1998-06-16 Digimarc Corporation Graphics processing system employing embedded code signals
US5745604A (en) * 1993-11-18 1998-04-28 Digimarc Corporation Identification/authentication system using robust, distributed coding
US6353672B1 (en) * 1993-11-18 2002-03-05 Digimarc Corporation Steganography using dynamic codes
US6266430B1 (en) * 1993-11-18 2001-07-24 Digimarc Corporation Audio or video steganography
US5668805A (en) * 1993-11-25 1997-09-16 Sony Corporation Multiplex broadcasting method and system
US5764763A (en) * 1994-03-31 1998-06-09 Jensen; James M. Apparatus and methods for including codes in audio signals and decoding
US6421445B1 (en) * 1994-03-31 2002-07-16 Arbitron Inc. Apparatus and methods for including codes in audio signals
US5450490A (en) * 1994-03-31 1995-09-12 The Arbitron Company Apparatus and methods for including codes in audio signals and decoding
US5404377A (en) * 1994-04-08 1995-04-04 Moses; Donald W. Simultaneous transmission of data and audio signals by means of perceptual coding
US5808689A (en) * 1994-04-20 1998-09-15 Shoot The Moon Products, Inc. Method and apparatus for nesting secondary signals within a television signal
US5534941A (en) * 1994-05-20 1996-07-09 Encore Media Corporation System for dynamic real-time television channel expansion
US5731841A (en) * 1994-05-25 1998-03-24 Wavephore, Inc. High performance data tuner for video systems
US5594934A (en) * 1994-09-21 1997-01-14 A.C. Nielsen Company Real time correlation meter
US5972471A (en) * 1994-10-31 1999-10-26 Morton International, Inc. Decorative coating with textured pattern
US5629739A (en) * 1995-03-06 1997-05-13 A.C. Nielsen Company Apparatus and method for injecting an ancillary signal into a low energy density portion of a color television frequency spectrum
US5774452A (en) * 1995-03-14 1998-06-30 Aris Technologies, Inc. Apparatus and method for encoding and decoding information in audio signals
US5768680A (en) * 1995-05-05 1998-06-16 Thomas; C. David Media monitor
US6574350B1 (en) * 1995-05-08 2003-06-03 Digimarc Corporation Digital watermarking employing both frail and robust watermarks
US5574963A (en) * 1995-07-31 1996-11-12 Lee S. Weinblatt Audience measurement during a mute mode
US5822360A (en) * 1995-09-06 1998-10-13 Solana Technology Development Corporation Method and apparatus for transporting auxiliary data in audio signals
US5930369A (en) * 1995-09-28 1999-07-27 Nec Research Institute, Inc. Secure spread spectrum watermarking for multimedia data
US5687191A (en) * 1995-12-06 1997-11-11 Solana Technology Development Corporation Post-compression hidden data transport
US5757417A (en) * 1995-12-06 1998-05-26 International Business Machines Corporation Method and apparatus for screening audio-visual materials presented to a subscriber
US5719937A (en) * 1995-12-06 1998-02-17 Solana Technology Develpment Corporation Multi-media copy management system
US5761606A (en) * 1996-02-08 1998-06-02 Wolzien; Thomas R. Media online services access via address embedded in video or audio program
US6035177A (en) * 1996-02-26 2000-03-07 Donald W. Moses Simultaneous transmission of ancillary and audio signals by means of perceptual coding
US6512796B1 (en) * 1996-03-04 2003-01-28 Douglas Sherwood Method and system for inserting and retrieving data in an audio signal
US6338037B1 (en) * 1996-03-05 2002-01-08 Central Research Laboratories Limited Audio signal identification using code labels inserted in the audio signal
US6584138B1 (en) * 1996-03-07 2003-06-24 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Coding process for inserting an inaudible data signal into an audio signal, decoding process, coder and decoder
US5822436A (en) * 1996-04-25 1998-10-13 Digimarc Corporation Photographic products and methods employing embedded information
US5856973A (en) * 1996-09-10 1999-01-05 Thompson; Kenneth M. Data multiplexing in MPEG server to decoder systems
US6286100B1 (en) * 1996-11-27 2001-09-04 International Business Machines Corporation Method for hiding message data into media data and a method for extracting that hidden data
US6304966B1 (en) * 1996-12-25 2001-10-16 International Business Machines Corporation Data hiding method and system using statistical properties
US5826165A (en) * 1997-01-21 1998-10-20 Hughes Electronics Corporation Advertisement reconciliation system
US6349284B1 (en) * 1997-11-20 2002-02-19 Samsung Sdi Co., Ltd. Scalable audio encoding/decoding method and apparatus
US6253185B1 (en) * 1998-02-25 2001-06-26 Lucent Technologies Inc. Multiple description transform coding of audio using optimal transforms of arbitrary dimension
US6389055B1 (en) * 1998-03-30 2002-05-14 Lucent Technologies, Inc. Integrating digital data with perceptible signals
US20020010919A1 (en) * 1998-05-12 2002-01-24 Nielsen Media Research, Inc. Audience measurement system for digital television
US6308150B1 (en) * 1998-06-16 2001-10-23 Matsushita Electric Industrial Co., Ltd. Dynamic bit allocation apparatus and method for audio coding
US6519769B1 (en) * 1998-11-09 2003-02-11 General Electric Company Audience measurement system employing local time coincidence coding
US6904089B1 (en) * 1998-12-28 2005-06-07 Matsushita Electric Industrial Co., Ltd. Encoding device and decoding device
US20020055398A1 (en) * 1999-03-12 2002-05-09 Halko Roman D. Multilayer golf ball with wound intermediate layer
US6359573B1 (en) * 1999-08-31 2002-03-19 Yamaha Corporation Method and system for embedding electronic watermark information in main information
US20020006203A1 (en) * 1999-12-22 2002-01-17 Ryuki Tachibana Electronic watermarking method and apparatus for compressed audio data, and system therefor
US6385329B1 (en) * 2000-02-14 2002-05-07 Digimarc Corporation Wavelet domain watermarks

Cited By (57)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7657057B2 (en) 2000-09-11 2010-02-02 Digimarc Corporation Watermark encoding and decoding
US20060072785A1 (en) * 2000-09-11 2006-04-06 Davidson Clayton L Watermark encoding and decoding
US8126201B2 (en) 2000-09-11 2012-02-28 Digimarc Corporation Watermark decoding from streaming media
US20040267533A1 (en) * 2000-09-14 2004-12-30 Hannigan Brett T Watermarking in the time-frequency domain
US7330562B2 (en) 2000-09-14 2008-02-12 Digimarc Corporation Watermarking in the time-frequency domain
US20080181449A1 (en) * 2000-09-14 2008-07-31 Hannigan Brett T Watermarking Employing the Time-Frequency Domain
US8077912B2 (en) 2000-09-14 2011-12-13 Digimarc Corporation Signal hiding employing feature modification
US7711144B2 (en) * 2000-09-14 2010-05-04 Digimarc Corporation Watermarking employing the time-frequency domain
US20040210922A1 (en) * 2002-01-08 2004-10-21 Peiffer John C. Method and apparatus for identifying a digital audio dignal
US8548373B2 (en) 2002-01-08 2013-10-01 The Nielsen Company (Us), Llc Methods and apparatus for identifying a digital audio signal
US7742737B2 (en) * 2002-01-08 2010-06-22 The Nielsen Company (Us), Llc. Methods and apparatus for identifying a digital audio signal
US20100014705A1 (en) * 2003-11-19 2010-01-21 Gustafson Ammon E Optimized Digital Watermarking Functions for Streaming Data
US7957552B2 (en) 2003-11-19 2011-06-07 Digimarc Corporation Optimized digital watermarking functions for streaming data
US7480393B2 (en) 2003-11-19 2009-01-20 Digimarc Corporation Optimized digital watermarking functions for streaming data
US9491518B2 (en) 2004-02-17 2016-11-08 The Nielsen Company (Us), Llc Methods and apparatus for monitoring video games
US11115721B2 (en) 2004-02-17 2021-09-07 The Nielsen Company (Us), Llc Methods and apparatus for monitoring video games
US20070006275A1 (en) * 2004-02-17 2007-01-04 Wright David H Methods and apparatus for monitoring video games
US8863218B2 (en) 2004-02-17 2014-10-14 The Nielsen Company (Us), Llc Methods and apparatus for monitoring video games
US10405050B2 (en) 2004-02-17 2019-09-03 The Nielsen Company (Us), Llc Methods and apparatus for monitoring video games
US11568439B2 (en) 2006-12-29 2023-01-31 The Nielsen Company (Us), Llc Systems and methods to pre-scale media content to facilitate audience measurement
US10885543B1 (en) * 2006-12-29 2021-01-05 The Nielsen Company (Us), Llc Systems and methods to pre-scale media content to facilitate audience measurement
US11928707B2 (en) 2006-12-29 2024-03-12 The Nielsen Company (Us), Llc Systems and methods to pre-scale media content to facilitate audience measurement
US10847168B2 (en) 2007-01-25 2020-11-24 The Nielsen Company (Us), Llc Research data gathering
US9824693B2 (en) 2007-01-25 2017-11-21 The Nielsen Company (Us), Llc Research data gathering
WO2008091697A1 (en) * 2007-01-25 2008-07-31 Arbitron, Inc. Research data gathering
US11670309B2 (en) 2007-01-25 2023-06-06 The Nielsen Company (Us), Llc Research data gathering
EP3726528A1 (en) * 2007-01-25 2020-10-21 Arbitron Inc. Research data gathering
EP2122609A4 (en) * 2007-01-25 2015-08-19 Arbitron Inc Research data gathering
US10418039B2 (en) 2007-01-25 2019-09-17 The Nielsen Company (Us), Llc Research data gathering
US20080281604A1 (en) * 2007-05-08 2008-11-13 Samsung Electronics Co., Ltd. Method and apparatus to encode and decode an audio signal
US9773504B1 (en) 2007-05-22 2017-09-26 Digimarc Corporation Robust spectral encoding and decoding methods
US9466307B1 (en) 2007-05-22 2016-10-11 Digimarc Corporation Robust spectral encoding and decoding methods
US20090192788A1 (en) * 2008-01-25 2009-07-30 Yamaha Corporation Sound Processing Device and Program
US8473282B2 (en) * 2008-01-25 2013-06-25 Yamaha Corporation Sound processing device and program
US8908909B2 (en) 2009-05-21 2014-12-09 Digimarc Corporation Watermark decoding with selective accumulation of components
US11074033B2 (en) 2012-05-01 2021-07-27 Lisnr, Inc. Access control and validation using sonic tones
US11126394B2 (en) 2012-05-01 2021-09-21 Lisnr, Inc. Systems and methods for content delivery and management
US11452153B2 (en) 2012-05-01 2022-09-20 Lisnr, Inc. Pairing and gateway connection using sonic tones
US20140180673A1 (en) * 2012-12-21 2014-06-26 Arbitron Inc. Audio Processing Techniques for Semantic Audio Recognition and Report Generation
US9640156B2 (en) 2012-12-21 2017-05-02 The Nielsen Company (Us), Llc Audio matching with supplemental semantic audio recognition and report generation
US9183849B2 (en) 2012-12-21 2015-11-10 The Nielsen Company (Us), Llc Audio matching with semantic audio recognition and report generation
US10366685B2 (en) 2012-12-21 2019-07-30 The Nielsen Company (Us), Llc Audio processing techniques for semantic audio recognition and report generation
US9754569B2 (en) 2012-12-21 2017-09-05 The Nielsen Company (Us), Llc Audio matching with semantic audio recognition and report generation
US9158760B2 (en) 2012-12-21 2015-10-13 The Nielsen Company (Us), Llc Audio decoding with supplemental semantic audio recognition and report generation
US11087726B2 (en) 2012-12-21 2021-08-10 The Nielsen Company (Us), Llc Audio matching with semantic audio recognition and report generation
US11094309B2 (en) 2012-12-21 2021-08-17 The Nielsen Company (Us), Llc Audio processing techniques for semantic audio recognition and report generation
US9812109B2 (en) 2012-12-21 2017-11-07 The Nielsen Company (Us), Llc Audio processing techniques for semantic audio recognition and report generation
US11837208B2 (en) 2012-12-21 2023-12-05 The Nielsen Company (Us), Llc Audio processing techniques for semantic audio recognition and report generation
US9195649B2 (en) * 2012-12-21 2015-11-24 The Nielsen Company (Us), Llc Audio processing techniques for semantic audio recognition and report generation
US10360883B2 (en) 2012-12-21 2019-07-23 The Nielsen Company (US) Audio matching with semantic audio recognition and report generation
US11330319B2 (en) 2014-10-15 2022-05-10 Lisnr, Inc. Inaudible signaling tone
WO2016061353A1 (en) * 2014-10-15 2016-04-21 Lisnr, Inc. Inaudible signaling tone
US11233582B2 (en) 2016-03-25 2022-01-25 Lisnr, Inc. Local tone generation
CN107516528A (en) * 2017-08-31 2017-12-26 惠州华阳通用电子有限公司 A kind of voice frequency link self checking method
US11189295B2 (en) 2017-09-28 2021-11-30 Lisnr, Inc. High bandwidth sonic tone generation
US10826623B2 (en) 2017-12-19 2020-11-03 Lisnr, Inc. Phase shift keyed signaling tone
US11628853B2 (en) 2018-03-05 2023-04-18 Continental Automotive France Method for inspecting the emission of an audio safety message in a vehicle

Also Published As

Publication number Publication date
CA2405179C (en) 2014-07-08
EP1269669A2 (en) 2003-01-02
ZA200207800B (en) 2003-09-29
MXPA02009683A (en) 2004-09-06
NO20024778D0 (en) 2002-10-03
EP1269669B1 (en) 2019-02-20
CN1645774A (en) 2005-07-27
AU5127401A (en) 2001-10-23
WO2001078271A3 (en) 2002-07-04
BR0107542A (en) 2003-01-14
NO20024778L (en) 2002-12-02
AU2001251274B2 (en) 2004-11-25
AU2005200858B2 (en) 2008-01-03
CA2405179A1 (en) 2001-10-18
CN1422466A (en) 2003-06-04
JP2003530763A (en) 2003-10-14
US6968564B1 (en) 2005-11-22
WO2001078271A2 (en) 2001-10-18
AU2005200858A1 (en) 2005-03-17

Similar Documents

Publication Publication Date Title
US6968564B1 (en) Multi-band spectral audio encoding
AU2001251274A1 (en) System and method for adding an inaudible code to an audio signal and method and apparatus for reading a code signal from an audio signal
US6621881B2 (en) Broadcast encoding system and method
US11562752B2 (en) Methods and apparatus to perform audio watermarking and watermark detection and extraction
US7006555B1 (en) Spectral audio encoding
WO2001031816A1 (en) System and method for encoding an audio signal for use in broadcast program identification systems, by adding inaudible codes to the audio signal
US7466742B1 (en) Detection of entropy in connection with audio signals
AU2008201526A1 (en) System and method for adding an inaudible code to an audio signal and method and apparatus for reading a code signal from an audio signal
CN100372270C (en) System and method of broadcast code
MXPA01000433A (en) System and method for encoding an audio signal, by adding an inaudible code to the audio signal, for use in broadcast programme identification systems

Legal Events

Date Code Title Description
AS Assignment

Owner name: NIELSEN MEDIA RESEARCH, INC,, NEW YORK

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:SRINIVASAN, VENUGOPAL;REEL/FRAME:016916/0185

Effective date: 20000403

AS Assignment

Owner name: CITIBANK, N.A., AS COLLATERAL AGENT,NEW YORK

Free format text: SECURITY AGREEMENT;ASSIGNORS:NIELSEN MEDIA RESEARCH, INC.;AC NIELSEN (US), INC.;BROADCAST DATA SYSTEMS, LLC;AND OTHERS;REEL/FRAME:018207/0607

Effective date: 20060809

Owner name: CITIBANK, N.A., AS COLLATERAL AGENT, NEW YORK

Free format text: SECURITY AGREEMENT;ASSIGNORS:NIELSEN MEDIA RESEARCH, INC.;AC NIELSEN (US), INC.;BROADCAST DATA SYSTEMS, LLC;AND OTHERS;REEL/FRAME:018207/0607

Effective date: 20060809

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION

AS Assignment

Owner name: THE NIELSEN COMPANY (US), LLC, NEW YORK

Free format text: RELEASE (REEL 018207 / FRAME 0607);ASSIGNOR:CITIBANK, N.A.;REEL/FRAME:061749/0001

Effective date: 20221011

Owner name: VNU MARKETING INFORMATION, INC., NEW YORK

Free format text: RELEASE (REEL 018207 / FRAME 0607);ASSIGNOR:CITIBANK, N.A.;REEL/FRAME:061749/0001

Effective date: 20221011