Human auditory system modeling with masking energy adaptation

    公开(公告)号:US11145317B1

    公开(公告)日:2021-10-12

    申请号:US16056160

    申请日:2018-08-06

    Abstract: A method for generating a psychoacoustic model from an audio signal transforms a block of samples of an audio signal into a frequency spectrum comprising frequency components. From this frequency spectrum, it derives group masking energies. These group masking energies each correspond to a group of neighboring frequency components in the frequency spectrum. For a group of frequency components, the method allocates the group masking energy to the frequency components in the group in proportion to energy of the frequency components within the group to provide adapted mask energies for the frequency components within the group, the adapted mask energies providing masking thresholds for the psychoacoustic model of the audio signal.

    Assessment of camera phone distortion for digital watermarking

    公开(公告)号:US09697578B2

    公开(公告)日:2017-07-04

    申请号:US14579371

    申请日:2014-12-22

    CPC classification number: G06T1/0028 G06T2201/0051 G06T2201/0202

    Abstract: The present disclosure relates generally to cell phones and cameras, and to digital watermarking involving such cell phones and cameras. One claim recites a method comprising: measuring distortion introduced by a cell phone camera; using a programmed electronic processor, quantifying the distortion; and providing quantified distortion as feedback to adjust a digital watermark embedding process in view of the distortion introduced by the cell phone camera. The act of quantifying distortion may include, e.g., quantifying a spatial frequency response (SFR) of the cell phone camera. Of course, other claims and combinations are provided too.

    Digital watermarks adapted to compensate for time scaling, pitch shifting and mixing

    公开(公告)号:US10236006B1

    公开(公告)日:2019-03-19

    申请号:US15671090

    申请日:2017-08-07

    Abstract: Pre-processing modules are configured to compensate for time and pitch scaling and shifting and provide compensated audio frames to a watermark detector. Audio frames are adjusted for time stretching and shrinking and for pitch shifting. Detection metrics are evaluated to identify candidates to a watermark detector. Various schemes are also detailed for tracking modifications made to audio stems mixed into audio tracks, and for accessing a history of modifications for facilitating identification of audio stems and audio tracks comprised of stems. Various approaches address interference from audio overlays added to channels of audio after embedding. One approach applies informed embedding based on phase differences between corresponding components of the channels. A detector extracts the watermark payload effectively from either additive or subtractive combination of the channels because the informed embedding ensures that the watermark survives both types of processing. Other approaches applies different polarity patterns, watermark mappings, or protocol keys to the channels. These techniques enable the watermark to survive ambient mixing, conversion to mono, as well as channel differencing to reduce interference from voice-overs and other audio overlays.

    Timeline reconstruction using dynamic path estimation from detections in audio-video signals

    公开(公告)号:US10236031B1

    公开(公告)日:2019-03-19

    申请号:US15479959

    申请日:2017-04-05

    Abstract: A dynamic path estimation method reconstructs a program timeline in real time from an incoming stream of audio or visual content in which watermark payloads are redundantly encoded. A receiving device buffers a portion of the incoming signal, executes watermark detection on the contents of the buffer, presents detection results, and then advances the incoming signal in the buffer. Each detection result corresponds to different possible detection paths, as the detector does not reveal the precise position of the watermark payload. The dynamic path estimation method operates on the detection results to determine a global cost function for each possible detection path. As the incoming audio advances through a detection buffer, the method updates cost values for the possible paths, determines a global cost for the paths, and outputs a timeline based on the path of the lowest global cost.

Patent Agency Ranking