Digital video content fingerprinting based on scale invariant interest region detection with an array of anisotropic filters
    1.
    发明授权
    Digital video content fingerprinting based on scale invariant interest region detection with an array of anisotropic filters 有权
    基于尺度不变兴趣区域检测的数字视频内容指纹与各向异性过滤器阵列

    公开(公告)号:US09396393B2

    公开(公告)日:2016-07-19

    申请号:US14298261

    申请日:2014-06-06

    摘要: Video sequence processing is described with various filtering rules applied to extract dominant features for content based video sequence identification. Active regions are determined in video frames of a video sequence. Video frames are selected in response to temporal statistical characteristics of the determined active regions. A two pass analysis is used to detect a set of initial interest points and interest regions in the selected video frames to reduce the effective area of images that are refined by complex filters that provide accurate region characterizations resistant to image distortion for identification of the video frames in the video sequence. Extracted features and descriptors are robust with respect to image scaling, aspect ratio change, rotation, camera viewpoint change, illumination and contrast change, video compression/decompression artifacts and noise. Compact, representative signatures are generated for video sequences to provide effective query video matching and retrieval in a large video database.

    摘要翻译: 使用应用于提取用于基于内容的视频序列识别的主要特征的各种滤波规则来描述视频序列处理。 在视频序列的视频帧中确定有效区域。 响应于所确定的活动区域的时间统计特性来选择视频帧。 双通道分析用于检测所选择的视频帧中的一组初始兴趣点和感兴趣区域,以减少由复杂滤波器精化的图像的有效面积,该复杂滤波器提供抵抗图像失真的精确区域表征以识别视频帧 在视频序列中。 提取的特征和描述符在图像缩放,宽高比变化,旋转,相机视点改变,照明和对比度变化,视频压缩/解压缩伪像和噪声方面是稳健的。 为视频序列生成紧凑的代表性签名,以便在大型视频数据库中提供有效的查询视频匹配和检索。

    Audio content fingerprinting based on two-dimensional constant Q-factor transform representation and robust audio identification for time-aligned applications
    2.
    发明授权
    Audio content fingerprinting based on two-dimensional constant Q-factor transform representation and robust audio identification for time-aligned applications 有权
    基于二维常数Q因子变换表示的音频内容指纹识别和用于时间对齐应用的鲁棒音频识别

    公开(公告)号:US09299364B1

    公开(公告)日:2016-03-29

    申请号:US13647996

    申请日:2012-10-09

    摘要: Content identification methods for consumer devices determine robust audio fingerprints that are resilient to audio distortions. One method generates signatures representing audio content based on a constant Q-factor transform (CQT). A 2D spectral representation of a 1D audio signal facilitates generation of region based signatures within frequency octaves and across the entire 2D signal representation. Also, points of interest are detected within the 2D audio signal representation and interest regions are determined around selected points of interest. Another method generates audio descriptors using an accumulating filter function on bands of the audio spectrum and generates audio transform coefficients. A response of each spectral band is computed and transform coefficients are determined by filtering, by accumulating derivatives with different lags, and computing second order derivatives. Additionally, time and frequency based onset detection determines audio descriptors at events and enhances descriptors with information related to an event.

    摘要翻译: 用于消费者设备的内容识别方法确定对音频失真具有弹性的鲁棒音频指纹。 一种方法基于恒定的Q因子变换(CQT)生成表示音频内容的签名。 1D音频信号的2D频谱表示有助于在频率八度和频率整个2D信号表示内产生基于区域的签名。 此外,在2D音频信号表示中检测兴趣点,并且围绕所选择的兴趣点确定兴趣区域。 另一种方法使用音频频谱的频带上的累积滤波函数来生成音频描述符,并且生成音频变换系数。 计算每个频谱带的响应,并通过滤波,通过累积具有不同滞后的导数和计算二阶导数来确定变换系数。 另外,基于时间和频率的开始检测确定事件处的音频描述符,并且利用与事件相关的信息增强描述符。

    Media content identification on mobile devices
    3.
    发明授权
    Media content identification on mobile devices 有权
    移动设备上的媒体内容标识

    公开(公告)号:US09313359B1

    公开(公告)日:2016-04-12

    申请号:US13590701

    申请日:2012-08-21

    IPC分类号: H04N1/32 H04N5/44 H04N21/439

    摘要: A mobile device responds in real time to media content presented on a media device, such as a television. The mobile device captures temporal fragments of audio-video content on its microphone, camera, or both and generates corresponding audio-video query fingerprints. The query fingerprints are transmitted to a search server located remotely or used with a search function on the mobile device for content search and identification. Audio features are extracted and audio signal global onset detection is used for input audio frame alignment. Additional audio feature signatures are generated from local audio frame onsets, audio frame frequency domain entropy, and maximum change in the spectral coefficients. Video frames are analyzed to find a television screen in the frames, and a detected active television quadrilateral is used to generate video fingerprints to be combined with audio fingerprints for more reliable content identification.

    摘要翻译: 移动设备实时响应媒体设备(如电视机)上呈现的媒体内容。 移动设备捕获其麦克风,相机或两者上的音频 - 视频内容的时间片段,并生成相应的音视频查询指纹。 查询指纹被发送到位于远程的搜索服务器或与移动设备上的搜索功能一起使用,用于内容搜索和识别。 提取音频特征,音频信号全局起始检测用于输入音频帧对齐。 额外的音频特征签名由本地音频帧发送,音频帧频域熵和频谱系数的最大变化产生。 视频帧被分析以在帧中找到电视屏幕,并且使用检测到的活动电视四边形来生成视频指纹以与音频指纹组合以用于更可靠的内容识别。

    Digital Video Content Fingerprinting Based on Scale Invariant Interest Region Detection with an Array of Anisotropic Filters
    4.
    发明申请
    Digital Video Content Fingerprinting Based on Scale Invariant Interest Region Detection with an Array of Anisotropic Filters 有权
    基于尺度不变兴趣区域的数字视频内容指纹识别与各向异性滤波器阵列

    公开(公告)号:US20150003731A1

    公开(公告)日:2015-01-01

    申请号:US14298261

    申请日:2014-06-06

    IPC分类号: G06K9/00 H04N5/14

    摘要: Video sequence processing is described with various filtering rules applied to extract dominant features for content based video sequence identification. Active regions are determined in video frames of a video sequence. Video frames are selected in response to temporal statistical characteristics of the determined active regions. A two pass analysis is used to detect a set of initial interest points and interest regions in the selected video frames to reduce the effective area of images that are refined by complex filters that provide accurate region characterizations resistant to image distortion for identification of the video frames in the video sequence. Extracted features and descriptors are robust with respect to image scaling, aspect ratio change, rotation, camera viewpoint change, illumination and contrast change, video compression/decompression artifacts and noise. Compact, representative signatures are generated for video sequences to provide effective query video matching and retrieval in a large video database.

    摘要翻译: 使用应用于提取用于基于内容的视频序列识别的主要特征的各种滤波规则来描述视频序列处理。 在视频序列的视频帧中确定有效区域。 响应于所确定的活动区域的时间统计特性来选择视频帧。 双通道分析用于检测所选择的视频帧中的一组初始兴趣点和感兴趣区域,以减少由复杂滤波器精化的图像的有效面积,该复杂滤波器提供抵抗图像失真的精确区域表征以识别视频帧 在视频序列中。 提取的特征和描述符在图像缩放,宽高比变化,旋转,相机视点改变,照明和对比度变化,视频压缩/解压缩伪像和噪声方面是稳健的。 为视频序列生成紧凑的代表性签名,以便在大型视频数据库中提供有效的查询视频匹配和检索。

    Digital video content fingerprinting based on scale invariant interest region detection with an array of anisotropic filters
    5.
    发明授权
    Digital video content fingerprinting based on scale invariant interest region detection with an array of anisotropic filters 有权
    基于尺度不变兴趣区域检测的数字视频内容指纹与各向异性过滤器阵列

    公开(公告)号:US08781245B2

    公开(公告)日:2014-07-15

    申请号:US13455560

    申请日:2012-04-25

    IPC分类号: G06K9/40

    摘要: Video sequence processing is described with various filtering rules applied to extract dominant features for content based video sequence identification. Active regions are determined in video frames of a video sequence. Video frames are selected in response to temporal statistical characteristics of the determined active regions. A two pass analysis is used to detect a set of initial interest points and interest regions in the selected video frames to reduce the effective area of images that are refined by complex filters that provide accurate region characterizations resistant to image distortion for identification of the video frames in the video sequence. Extracted features and descriptors are robust with respect to image scaling, aspect ratio change, rotation, camera viewpoint change, illumination and contrast change, video compression/decompression artifacts and noise. Compact, representative signatures are generated for video sequences to provide effective query video matching and retrieval in a large video database.

    摘要翻译: 使用应用于提取用于基于内容的视频序列识别的主要特征的各种滤波规则来描述视频序列处理。 在视频序列的视频帧中确定有效区域。 响应于所确定的活动区域的时间统计特性来选择视频帧。 双通道分析用于检测所选择的视频帧中的一组初始兴趣点和感兴趣区域,以减少由复杂滤波器精化的图像的有效面积,该复杂滤波器提供抵抗图像失真的精确区域表征以识别视频帧 在视频序列中。 提取的特征和描述符在图像缩放,宽高比变化,旋转,相机视点改变,照明和对比度变化,视频压缩/解压缩伪像和噪声方面是稳健的。 为视频序列生成紧凑的代表性签名,以便在大型视频数据库中提供有效的查询视频匹配和检索。

    Method for Efficient Database Formation and Search on Media Devices Acting Synchronously with Television Programming
    6.
    发明申请
    Method for Efficient Database Formation and Search on Media Devices Acting Synchronously with Television Programming 有权
    有效的数据库形成和搜索与电视节目同步的媒体设备的方法

    公开(公告)号:US20130246457A1

    公开(公告)日:2013-09-19

    申请号:US13826502

    申请日:2013-03-14

    IPC分类号: G06F17/30

    摘要: Techniques for efficient database formation and search in applications embedded in a media device are provided. The search may be performed synchronously with presentation of media programming content on a nearby media presentation device. A mobile media device captures some temporal fragments of the presented audio/video content on its microphone and camera, and then generates query fingerprints for the captured fragment. A local reference database resides on the mobile media device and a master reference database resides on a remote server with a most recent chunk of reference fingerprints transferred dynamically to the local mobile media device. A chunk of the query fingerprints generated locally on the mobile media device are searched on the local reference database for continuous content search and identification. The method presented automatically switches between the local search on the mobile media device and a remote search on an external search server.

    摘要翻译: 提供了用于在嵌入在媒体设备中的应用中有效的数据库形成和搜索的技术。 搜索可以与附近的媒体呈现设备上的媒体节目内容的呈现同步地执行。 移动媒体设备捕获其麦克风和相机上呈现的音频/视频内容的一些时间片段,然后为捕获的片段生成查询指纹。 本地参考数据库驻留在移动媒体设备上,主参考数据库驻留在远程服务器上,最近一批参考指纹被动态传输到本地移动媒体设备。 在本地参考数据库上搜索在移动媒体设备上本地生成的查询指纹的块,用于连续内容搜索和识别。 所呈现的方法自动地在移动媒体设备上的本地搜索和外部搜索服务器上的远程搜索之间进行切换。

    Methods and Apparatus for Providing a Scalable Identification of Digital Video Sequences
    7.
    发明申请
    Methods and Apparatus for Providing a Scalable Identification of Digital Video Sequences 有权
    提供数字视频序列可扩展识别的方法和装置

    公开(公告)号:US20120237129A1

    公开(公告)日:2012-09-20

    申请号:US13488568

    申请日:2012-06-05

    IPC分类号: G06K9/48

    CPC分类号: G06K9/00711

    摘要: Scaleable video sequence processing with various filtering rules is applied to extract dominant features, and generate unique set of signatures based on video content. Video sequence structuring and subsequent video sequence characterization is performed by tracking statistical changes in the content of a succession of video frames and selecting suitable frames for further treatment by region based intra-frame segmentation and contour tracing and description. Compact representative signatures are generated on the video sequence structural level as well as on the selected video frame level, resulting in an efficient video database formation and search.

    摘要翻译: 应用具有各种过滤规则的可扩展视频序列处理来提取主要特征,并且基于视频内容生成唯一的签名集合。 通过跟踪一系列视频帧的内容中的统计变化并选择合适的帧以进行基于区域的帧内分割和轮廓跟踪和描述的进一步处理来执行视频序列构造和后续视频序列表征。 在视频序列结构层面以及所选择的视频帧级别上生成紧凑的代表性签名,从而形成和搜索高效的视频数据。

    Method and Apparatus for Multi-Dimensional Content Search and Video Identification
    8.
    发明申请
    Method and Apparatus for Multi-Dimensional Content Search and Video Identification 有权
    用于多维内容搜索和视频识别的方法和装置

    公开(公告)号:US20080313140A1

    公开(公告)日:2008-12-18

    申请号:US12141337

    申请日:2008-06-18

    IPC分类号: G06F17/30

    摘要: A multi-dimensional database and indexes and operations on the multi-dimensional database are described which include video search applications or other similar sequence or structure searches. Traversal indexes utilize highly discriminative information about images and video sequences or about object shapes. Global and local signatures around keypoints are used for compact and robust retrieval and discriminative information content of images or video sequences of interest. For other objects or structures relevant signature of pattern or structure are used for traversal indexes. Traversal indexes are stored in leaf nodes along with distance measures and occurrence of similar images in the database. During a sequence query, correlation scores are calculated for single frame, for frame sequence, and video clips, or for other objects or structures.

    摘要翻译: 描述了多维数据库以及关于多维数据库的索引和操作,其包括视频搜索应用或其他类似的序列或结构搜索。 遍历索引利用关于图像和视频序列或关于对象形状的高度辨别信息。 关键点周围的全局和本地签名用于紧凑和鲁棒的检索和感兴趣的图像或视频序列的辨别信息内容。 对于其他对象或结构,模式或结构的相关签名用于遍历索引。 遍历索引与距离测量一起存储在叶节点中,并在数据库中出现类似的图像。 在序列查询期间,计算单帧,帧序列和视频剪辑或其他对象或结构的相关分数。

    Method and Apparatus for Multi-Dimensional Content Search and Video Identification
    9.
    发明申请
    Method and Apparatus for Multi-Dimensional Content Search and Video Identification 有权
    用于多维内容搜索和视频识别的方法和装置

    公开(公告)号:US20120207387A1

    公开(公告)日:2012-08-16

    申请号:US13432914

    申请日:2012-03-28

    IPC分类号: G06K9/68

    摘要: A multi-dimensional database and indexes and operations on the multi-dimensional database are described which include video search applications or other similar sequence or structure searches. Traversal indexes utilize highly discriminative information about images and video sequences or about object shapes. Global and local signatures around keypoints are used for compact and robust retrieval and discriminative information content of images or video sequences of interest. For other objects or structures relevant signature of pattern or structure are used for traversal indexes. Traversal indexes are stored in leaf nodes along with distance measures and occurrence of similar images in the database. During a sequence query, correlation scores are calculated for single frame, for frame sequence, and video clips, or for other objects or structures.

    摘要翻译: 描述了多维数据库以及关于多维数据库的索引和操作,其包括视频搜索应用或其他类似的序列或结构搜索。 遍历索引利用关于图像和视频序列或关于对象形状的高度辨别信息。 关键点周围的全局和本地签名用于紧凑和鲁棒的检索和感兴趣的图像或视频序列的辨别信息内容。 对于其他对象或结构,模式或结构的相关签名用于遍历索引。 遍历索引与距离测量一起存储在叶节点中,并在数据库中出现类似的图像。 在序列查询期间,计算单帧,帧序列和视频剪辑或其他对象或结构的相关分数。

    Method and apparatus for multi-dimensional content search and video identification
    10.
    发明授权
    Method and apparatus for multi-dimensional content search and video identification 有权
    多维内容搜索和视频识别的方法和装置

    公开(公告)号:US08171030B2

    公开(公告)日:2012-05-01

    申请号:US12141337

    申请日:2008-06-18

    IPC分类号: G06F7/00 G06F17/30

    摘要: A multi-dimensional database and indexes and operations on the multi-dimensional database are described which include video search applications or other similar sequence or structure searches. Traversal indexes utilize highly discriminative information about images and video sequences or about object shapes. Global and local signatures around keypoints are used for compact and robust retrieval and discriminative information content of images or video sequences of interest. For other objects or structures relevant signature of pattern or structure are used for traversal indexes. Traversal indexes are stored in leaf nodes along with distance measures and occurrence of similar images in the database. During a sequence query, correlation scores are calculated for single frame, for frame sequence, and video clips, or for other objects or structures.

    摘要翻译: 描述了多维数据库以及关于多维数据库的索引和操作,其包括视频搜索应用或其他类似的序列或结构搜索。 遍历索引利用关于图像和视频序列或关于对象形状的高度辨别信息。 关键点周围的全局和本地签名用于紧凑和鲁棒的检索和感兴趣的图像或视频序列的辨别信息内容。 对于其他对象或结构,模式或结构的相关签名用于遍历索引。 遍历索引与距离测量一起存储在叶节点中,并在数据库中出现类似的图像。 在序列查询期间,计算单帧,帧序列和视频剪辑或其他对象或结构的相关分数。