Media content identification on mobile devices
    1.
    Type: Granted invention patent
    Legal status: In force

    Publication No.: US09313359B1

    Publication date: 2016-04-12

    Application No.: US13590701

    Filing date: 2012-08-21

    IPC classification: H04N1/32 H04N5/44 H04N21/439

    Abstract: A mobile device responds in real time to media content presented on a media device, such as a television. The mobile device captures temporal fragments of audio-video content on its microphone, camera, or both and generates corresponding audio-video query fingerprints. The query fingerprints are transmitted to a search server located remotely or used with a search function on the mobile device for content search and identification. Audio features are extracted and audio signal global onset detection is used for input audio frame alignment. Additional audio feature signatures are generated from local audio frame onsets, audio frame frequency-domain entropy, and maximum change in the spectral coefficients. Video frames are analyzed to find a television screen in the frames, and a detected active television quadrilateral is used to generate video fingerprints to be combined with audio fingerprints for more reliable content identification.
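
    Two of the audio features named in the abstract, frame frequency-domain entropy and maximum change in the spectral coefficients, can be illustrated with a minimal sketch. The abstract gives no formulas, so everything here is an assumption: the function names are hypothetical, entropy is taken as the Shannon entropy of the normalized power spectrum, and "maximum change" as the largest absolute per-bin difference between two consecutive frames. A naive DFT keeps the example free of third-party dependencies.

```python
import cmath
import math


def magnitude_spectrum(frame):
    """Naive DFT magnitudes for bins 0..N/2 (adequate for short example frames)."""
    n = len(frame)
    mags = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        mags.append(abs(acc))
    return mags


def spectral_entropy(mags):
    """Shannon entropy in bits of the power spectrum normalized to sum to 1."""
    power = [m * m for m in mags]
    total = sum(power)
    if total == 0.0:
        return 0.0  # silent frame: define entropy as zero
    probs = [p / total for p in power]
    return -sum(q * math.log2(q) for q in probs if q > 0.0)


def max_spectral_change(prev_mags, cur_mags):
    """Bin index and size of the largest absolute change between two frames."""
    diffs = [abs(c - p) for p, c in zip(prev_mags, cur_mags)]
    k = max(range(len(diffs)), key=diffs.__getitem__)
    return k, diffs[k]


if __name__ == "__main__":
    n = 64
    tone = [math.sin(2 * math.pi * 4 * t / n) for t in range(n)]  # energy in one bin
    impulse = [1.0] + [0.0] * (n - 1)                              # flat spectrum
    print(spectral_entropy(magnitude_spectrum(tone)))
    print(spectral_entropy(magnitude_spectrum(impulse)))
    print(max_spectral_change(magnitude_spectrum(impulse), magnitude_spectrum(tone)))
```

    Under these assumptions a pure tone yields entropy near zero and an impulse yields the maximum for the number of bins, which is why entropy can separate tonal frames from noise-like ones.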

    Digital video content fingerprinting based on scale invariant interest region detection with an array of anisotropic filters
    2.
    Type: Granted invention patent
    Legal status: In force

    Publication No.: US09396393B2

    Publication date: 2016-07-19

    Application No.: US14298261

    Filing date: 2014-06-06

    Abstract: Video sequence processing is described with various filtering rules applied to extract dominant features for content-based video sequence identification. Active regions are determined in video frames of a video sequence. Video frames are selected in response to temporal statistical characteristics of the determined active regions. A two-pass analysis is used to detect a set of initial interest points and interest regions in the selected video frames to reduce the effective area of images that are refined by complex filters that provide accurate region characterizations resistant to image distortion for identification of the video frames in the video sequence. Extracted features and descriptors are robust with respect to image scaling, aspect ratio change, rotation, camera viewpoint change, illumination and contrast change, video compression/decompression artifacts, and noise. Compact, representative signatures are generated for video sequences to provide effective query video matching and retrieval in a large video database.
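
    The first two steps of the abstract, determining an active region and selecting frames from its temporal statistics, can be sketched as follows. This is a simplified illustration and not the patented method: frames are assumed to be grayscale lists of pixel rows, the active region is assumed to be the bounding box of pixels brighter than a hypothetical `threshold` (for example, after letterbox borders), and the temporal statistic is assumed to be the mean intensity inside that box.

```python
def active_region(frame, threshold=16):
    """Inclusive bounding box (top, left, bottom, right) of pixels above threshold.

    Returns None for a frame with no pixel above the threshold.
    """
    rows = [r for r, row in enumerate(frame) if max(row) > threshold]
    cols = [c for c in range(len(frame[0]))
            if max(row[c] for row in frame) > threshold]
    if not rows or not cols:
        return None
    return rows[0], cols[0], rows[-1], cols[-1]


def mean_intensity(frame, box):
    """Average pixel value inside the inclusive bounding box."""
    top, left, bottom, right = box
    pixels = [frame[r][c]
              for r in range(top, bottom + 1)
              for c in range(left, right + 1)]
    return sum(pixels) / len(pixels)


def select_frames(frames, min_change=10.0):
    """Indices of frames whose active-region mean moved by at least min_change
    since the last selected frame; blank frames are skipped."""
    kept = []
    last_mean = None
    for index, frame in enumerate(frames):
        box = active_region(frame)
        if box is None:
            continue
        mean = mean_intensity(frame, box)
        if last_mean is None or abs(mean - last_mean) >= min_change:
            kept.append(index)
            last_mean = mean
    return kept


if __name__ == "__main__":
    def letterboxed(value):
        # 6x8 frame with a one-pixel black border around uniform content
        return [[value if 1 <= r <= 4 and 1 <= c <= 6 else 0 for c in range(8)]
                for r in range(6)]

    frames = [letterboxed(100), letterboxed(102), letterboxed(150)]
    print(active_region(frames[0]))
    print(select_frames(frames))
```

    Restricting later, more expensive filtering to the selected frames and their active regions is the cost-saving idea the abstract describes; the threshold and statistic above are placeholders for whatever the patent actually specifies.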

    Audio content fingerprinting based on two-dimensional constant Q-factor transform representation and robust audio identification for time-aligned applications
    3.
    Type: Granted invention patent
    Legal status: In force

    Publication No.: US09299364B1

    Publication date: 2016-03-29

    Application No.: US13647996

    Filing date: 2012-10-09

    Abstract: Content identification methods for consumer devices determine robust audio fingerprints that are resilient to audio distortions. One method generates signatures representing audio content based on a constant Q-factor transform (CQT). A 2D spectral representation of a 1D audio signal facilitates generation of region-based signatures within frequency octaves and across the entire 2D signal representation. Also, points of interest are detected within the 2D audio signal representation and interest regions are determined around selected points of interest. Another method generates audio descriptors using an accumulating filter function on bands of the audio spectrum and generates audio transform coefficients. A response of each spectral band is computed and transform coefficients are determined by filtering, by accumulating derivatives with different lags, and computing second-order derivatives. Additionally, time- and frequency-based onset detection determines audio descriptors at events and enhances descriptors with information related to an event.
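
    The frequency axis of a constant Q-factor transform can be illustrated with the standard textbook definition: bin centers are geometrically spaced, $f_k = f_{\min} \cdot 2^{k/B}$ for $B$ bins per octave, and every bin has the same ratio of center frequency to bandwidth, $Q = 1 / (2^{1/B} - 1)$. The sketch below only computes that layout and groups bins by octave, which is the structure the abstract's per-octave region signatures would operate on. The parameter values (55 Hz, 12 bins per octave, 4 octaves) and function names are assumptions, not taken from the patent.

```python
def cqt_bins(f_min, bins_per_octave, n_octaves):
    """Return (Q, [(center_hz, bandwidth_hz), ...]) for a constant-Q filter bank."""
    q = 1.0 / (2.0 ** (1.0 / bins_per_octave) - 1.0)
    bins = []
    for k in range(bins_per_octave * n_octaves):
        center = f_min * 2.0 ** (k / bins_per_octave)
        bins.append((center, center / q))  # bandwidth grows with frequency
    return q, bins


def group_by_octave(bins, bins_per_octave):
    """Split the flat bin list into one sub-list per octave."""
    return [bins[i:i + bins_per_octave]
            for i in range(0, len(bins), bins_per_octave)]


if __name__ == "__main__":
    q, bins = cqt_bins(f_min=55.0, bins_per_octave=12, n_octaves=4)
    for octave in group_by_octave(bins, 12):
        low, high = octave[0][0], octave[-1][0]
        print(f"{low:8.2f} Hz .. {high:8.2f} Hz, Q = {q:.3f}")
```

    Because the bandwidth scales with the center frequency, each octave contributes the same number of rows to the 2D time-frequency representation, unlike a fixed-resolution FFT spectrogram.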

    Digital Video Content Fingerprinting Based on Scale Invariant Interest Region Detection with an Array of Anisotropic Filters
    4.
    Type: Invention patent application
    Legal status: In force

    Publication No.: US20150003731A1

    Publication date: 2015-01-01

    Application No.: US14298261

    Filing date: 2014-06-06

    IPC classification: G06K9/00 H04N5/14

    Abstract: Video sequence processing is described with various filtering rules applied to extract dominant features for content-based video sequence identification. Active regions are determined in video frames of a video sequence. Video frames are selected in response to temporal statistical characteristics of the determined active regions. A two-pass analysis is used to detect a set of initial interest points and interest regions in the selected video frames to reduce the effective area of images that are refined by complex filters that provide accurate region characterizations resistant to image distortion for identification of the video frames in the video sequence. Extracted features and descriptors are robust with respect to image scaling, aspect ratio change, rotation, camera viewpoint change, illumination and contrast change, video compression/decompression artifacts, and noise. Compact, representative signatures are generated for video sequences to provide effective query video matching and retrieval in a large video database.

    Digital video content fingerprinting based on scale invariant interest region detection with an array of anisotropic filters
    5.
    Type: Granted invention patent
    Legal status: In force

    Publication No.: US08781245B2

    Publication date: 2014-07-15

    Application No.: US13455560

    Filing date: 2012-04-25

    IPC classification: G06K9/40

    Abstract: Video sequence processing is described with various filtering rules applied to extract dominant features for content-based video sequence identification. Active regions are determined in video frames of a video sequence. Video frames are selected in response to temporal statistical characteristics of the determined active regions. A two-pass analysis is used to detect a set of initial interest points and interest regions in the selected video frames to reduce the effective area of images that are refined by complex filters that provide accurate region characterizations resistant to image distortion for identification of the video frames in the video sequence. Extracted features and descriptors are robust with respect to image scaling, aspect ratio change, rotation, camera viewpoint change, illumination and contrast change, video compression/decompression artifacts, and noise. Compact, representative signatures are generated for video sequences to provide effective query video matching and retrieval in a large video database.

    Digital Video Content Fingerprinting Based on Scale Invariant Interest Region Detection with an Array of Anisotropic Filters
    6.
    Type: Invention patent application
    Legal status: In force

    Publication No.: US20100303338A1

    Publication date: 2010-12-02

    Application No.: US12612729

    Filing date: 2009-11-05

    IPC classification: G06K9/00 G06K9/46 G06K9/40

    Abstract: Video sequence processing is described with various filtering rules applied to extract dominant features for content-based video sequence identification. Active regions are determined in video frames of a video sequence. Video frames are selected in response to temporal statistical characteristics of the determined active regions. A two-pass analysis is used to detect a set of initial interest points and interest regions in the selected video frames to reduce the effective area of images that are refined by complex filters that provide accurate region characterizations resistant to image distortion for identification of the video frames in the video sequence. Extracted features and descriptors are robust with respect to image scaling, aspect ratio change, rotation, camera viewpoint change, illumination and contrast change, video compression/decompression artifacts, and noise. Compact, representative signatures are generated for video sequences to provide effective query video matching and retrieval in a large video database.

    Digital Video Content Fingerprinting Based on Scale Invariant Interest Region Detection with an Array of Anisotropic Filters
    7.
    Type: Invention patent application
    Legal status: In force

    Publication No.: US20120207402A1

    Publication date: 2012-08-16

    Application No.: US13455560

    Filing date: 2012-04-25

    IPC classification: G06K9/40

    Abstract: Video sequence processing is described with various filtering rules applied to extract dominant features for content-based video sequence identification. Active regions are determined in video frames of a video sequence. Video frames are selected in response to temporal statistical characteristics of the determined active regions. A two-pass analysis is used to detect a set of initial interest points and interest regions in the selected video frames to reduce the effective area of images that are refined by complex filters that provide accurate region characterizations resistant to image distortion for identification of the video frames in the video sequence. Extracted features and descriptors are robust with respect to image scaling, aspect ratio change, rotation, camera viewpoint change, illumination and contrast change, video compression/decompression artifacts, and noise. Compact, representative signatures are generated for video sequences to provide effective query video matching and retrieval in a large video database.

    Digital video content fingerprinting based on scale invariant interest region detection with an array of anisotropic filters
    8.
    Type: Granted invention patent
    Legal status: In force

    Publication No.: US08189945B2

    Publication date: 2012-05-29

    Application No.: US12612729

    Filing date: 2009-11-05

    IPC classification: G06K9/00 G06K9/46 G06K9/40

    Abstract: Video sequence processing is described with various filtering rules applied to extract dominant features for content-based video sequence identification. Active regions are determined in video frames of a video sequence. Video frames are selected in response to temporal statistical characteristics of the determined active regions. A two-pass analysis is used to detect a set of initial interest points and interest regions in the selected video frames to reduce the effective area of images that are refined by complex filters that provide accurate region characterizations resistant to image distortion for identification of the video frames in the video sequence. Extracted features and descriptors are robust with respect to image scaling, aspect ratio change, rotation, camera viewpoint change, illumination and contrast change, video compression/decompression artifacts, and noise. Compact, representative signatures are generated for video sequences to provide effective query video matching and retrieval in a large video database.

    Method to decode temporal watermarks in compressed video
    10.
    Type: Invention patent application
    Legal status: In force

    Publication No.: US20050123168A1

    Publication date: 2005-06-09

    Application No.: US10884832

    Filing date: 2004-07-02

    Applicant: Peter Wendt

    Inventor: Peter Wendt

    Abstract: A system and method for efficient recovery of watermarks from compressed video is disclosed, wherein, in one embodiment, cyclic watermark noise blocks are tiled and embedded in a plurality of frames of compressed video, quantized coefficients are computed for a group of compressed video frames on a pixel-by-pixel basis, the scaled coefficients for the group of compressed video frames are summed into an output transform frame, and the entire summed output transform frame is transformed to recover peak values for the group of compressed video frames to recover the watermark. Additionally, zero band normalization and edge filtering are also provided to increase the accuracy and efficiency of recovering watermarks from video frames.
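
    The reason summing across a group of frames helps can be shown with a one-dimensional toy model. This sketch departs from the abstract in several labeled ways: it works on raw samples instead of compressed-domain coefficients, it uses a length-31 maximal-length sequence as the cyclic noise block (chosen here only because its circular autocorrelation has a single sharp peak), and it finds the peak by direct circular correlation where the abstract describes a transform of the summed frame. All function names are hypothetical.

```python
import random


def m_sequence(length=31):
    """+/-1 maximal-length sequence from the recurrence a[n] = a[n-3] XOR a[n-5]."""
    bits = [1, 0, 0, 0, 0]
    while len(bits) < length:
        bits.append(bits[-3] ^ bits[-5])
    return [1 if b else -1 for b in bits]


def embed(frame, pattern, shift, strength=1.0):
    """Tile the cyclic pattern across the frame, rotated by shift, and add it."""
    p = len(pattern)
    return [x + strength * pattern[(i - shift) % p] for i, x in enumerate(frame)]


def fold_and_sum(frames, period):
    """Accumulate every sample of every frame into one block of length period."""
    acc = [0.0] * period
    for frame in frames:
        for i, x in enumerate(frame):
            acc[i % period] += x
    return acc


def recover_shift(acc, pattern):
    """Lag at which the circular correlation of acc with the pattern peaks."""
    p = len(pattern)
    scores = [sum(acc[(i + lag) % p] * pattern[i] for i in range(p))
              for lag in range(p)]
    return max(range(p), key=scores.__getitem__)


if __name__ == "__main__":
    rng = random.Random(0)
    pattern = m_sequence()
    # 20 frames of Gaussian "content" (std 3), each 8 pattern periods long,
    # carrying a watermark of amplitude 1 rotated by 7 samples.
    frames = [embed([rng.gauss(0.0, 3.0) for _ in range(248)], pattern, shift=7)
              for _ in range(20)]
    print(recover_shift(fold_and_sum(frames, len(pattern)), pattern))
```

    The watermark adds coherently across tiles and frames while the content averages out, so a mark that is weaker than the content in any single frame still produces a clear correlation peak after accumulation. Running the correlation once on the summed block, instead of once per frame, is the efficiency gain.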