Content-based matching of videos using local spatio-temporal fingerprints
    1.
    发明授权
    Content-based matching of videos using local spatio-temporal fingerprints 有权
    使用本地时空指纹的视频内容匹配

    公开(公告)号:US08498487B2

    公开(公告)日:2013-07-30

    申请号:US12262463

    申请日:2008-10-31

    IPC分类号: G06K9/50

    摘要: A computer implemented method computer implemented method for deriving a fingerprint from video data is disclosed, comprising the steps of receiving a plurality of frames from the video data; selecting at least one key frame from the plurality of frames, the at least one key frame being selected from two consecutive frames of the plurality of frames that exhibiting a maximal cumulative difference in at least one spatial feature of the two consecutive frames; detecting at least one 3D spatio-temporal feature within the at least one key frame; and encoding a spatio-temporal fingerprint based on mean luminance of the at least one 3D spatio-temporal feature. The least one spatial feature can be intensity. The at least one 3D spatio-temporal feature can be at least one Maximally Stable Volume (MSV). Also disclosed is a method for matching video data to a database containing a plurality of video fingerprints of the type described above, comprising the steps of calculating at least one fingerprint representing at least one query frame from the video data; indexing into the database using the at least one calculated fingerprint to find a set of candidate fingerprints; applying a score to each of the candidate fingerprints; selecting a subset of candidate fingerprints as proposed frames by rank ordering the candidate fingerprints; and attempting to match at least one fingerprint of at least one proposed frame based on a comparison of gradient-based descriptors associated with the at least one query frame and the at least one proposed frame.

    摘要翻译: 公开了一种用于从视频数据导出指纹的计算机实现方法计算机实现方法,包括以下步骤:从视频数据中接收多个帧; 从所述多个帧中选择至少一个关键帧,所述至少一个关键帧从在所述两个连续帧的至少一个空间特征中呈现最大累积差异的所述多个帧中的两个连续帧中选择; 检测所述至少一个关键帧内的至少一个3D空间 - 时间特征; 以及基于所述至少一个3D时空特征的平均亮度对时空指纹进行编码。 至少一个空间特征可以是强度。 至少一个3D空间 - 时间特征可以是至少一个最大稳定体积(MSV)。 还公开了一种用于将视频数据与包含上述类型的多个视频指纹的数据库进行匹配的方法,包括以下步骤:从视频数据计算表示至少一个查询帧的至少一个指纹; 使用所述至少一个计算的指纹索引到数据库中以找到一组候选指纹; 对每个候选指纹应用分数; 通过对候选指纹进行排序来选择候选指纹的子集作为所提出的帧; 以及基于与所述至少一个查询帧和所述至少一个所提出的帧相关联的基于梯度的描述符的比较,尝试匹配至少一个所提出的帧的至少一个指纹。

    CONTENT-BASED MATCHING OF VIDEOS USING LOCAL SPATIO-TEMPORAL FINGERPRINTS
    2.
    发明申请
    CONTENT-BASED MATCHING OF VIDEOS USING LOCAL SPATIO-TEMPORAL FINGERPRINTS 有权
    使用本地空间指纹图像进行基于内容的视频匹配

    公开(公告)号:US20100049711A1

    公开(公告)日:2010-02-25

    申请号:US12262463

    申请日:2008-10-31

    IPC分类号: G06F17/30 H04N7/26

    摘要: A computer implemented method computer implemented method for deriving a fingerprint from video data is disclosed, comprising the steps of receiving a plurality of frames from the video data; selecting at least one key frame from the plurality of frames, the at least one key frame being selected from two consecutive frames of the plurality of frames that exhibiting a maximal cumulative difference in at least one spatial feature of the two consecutive frames; detecting at least one 3D spatio-temporal feature within the at least one key frame; and encoding a spatio-temporal fingerprint based on mean luminance of the at least one 3D spatio-temporal feature. The least one spatial feature can be intensity. The at least one 3D spatio-temporal feature can be at least one Maximally Stable Volume (MSV). Also disclosed is a method for matching video data to a database containing a plurality of video fingerprints of the type described above, comprising the steps of calculating at least one fingerprint representing at least one query frame from the video data; indexing into the database using the at least one calculated fingerprint to find a set of candidate fingerprints; applying a score to each of the candidate fingerprints; selecting a subset of candidate fingerprints as proposed frames by rank ordering the candidate fingerprints; and attempting to match at least one fingerprint of at least one proposed frame based on a comparison of gradient-based descriptors associated with the at least one query frame and the at least one proposed frame.

    摘要翻译: 公开了一种用于从视频数据导出指纹的计算机实现方法计算机实现方法,包括以下步骤:从视频数据中接收多个帧; 从所述多个帧中选择至少一个关键帧,所述至少一个关键帧从在所述两个连续帧的至少一个空间特征中呈现最大累积差异的所述多个帧中的两个连续帧中选择; 检测所述至少一个关键帧内的至少一个3D空间 - 时间特征; 以及基于所述至少一个3D时空特征的平均亮度对时空指纹进行编码。 至少一个空间特征可以是强度。 至少一个3D空间 - 时间特征可以是至少一个最大稳定体积(MSV)。 还公开了一种用于将视频数据与包含上述类型的多个视频指纹的数据库进行匹配的方法,包括以下步骤:从视频数据计算表示至少一个查询帧的至少一个指纹; 使用所述至少一个计算的指纹索引到数据库中以找到一组候选指纹; 对每个候选指纹应用分数; 通过对候选指纹进行排序来选择候选指纹的子集作为所提出的帧; 以及基于与所述至少一个查询帧和所述至少一个所提出的帧相关联的基于梯度的描述符的比较,尝试匹配至少一个所提出的帧的至少一个指纹。

    FOOD RECOGNITION USING VISUAL ANALYSIS AND SPEECH RECOGNITION
    6.
    发明申请
    FOOD RECOGNITION USING VISUAL ANALYSIS AND SPEECH RECOGNITION 有权
    食品识别使用视觉分析和语音识别

    公开(公告)号:US20100173269A1

    公开(公告)日:2010-07-08

    申请号:US12683124

    申请日:2010-01-06

    IPC分类号: G09B19/00

    CPC分类号: G09B19/0092

    摘要: A method and system for analyzing at least one food item on a food plate is disclosed. A plurality of images of the food plate is received by an image capturing device. A description of the at least one food item on the food plate is received by a recognition device. The description is at least one of a voice description and a text description. At least one processor extracts a list of food items from the description; classifies and segments the at least one food item from the list using color and texture features derived from the plurality of images; and estimates the volume of the classified and segmented at least one food item. The processor is also configured to estimate the caloric content of the at least one food item.

    摘要翻译: 公开了一种用于分析食品板上的至少一种食品的方法和系统。 食品牌的多个图像由图像捕获装置接收。 对食品牌上的至少一种食品的描述由识别装置接收。 描述是语音描述和文本描述中的至少一个。 至少一个处理器从描述中提取食物列表; 使用从多个图像导出的颜色和纹理特征来对列表中的至少一个食物项进行分类和分割; 并估计分类和分段的至少一种食品的体积。 处理器还被配置为估计至少一个食物的热量含量。

    Food recognition using visual analysis and speech recognition
    7.
    发明授权
    Food recognition using visual analysis and speech recognition 有权
    食物识别使用视觉分析和语音识别

    公开(公告)号:US08439683B2

    公开(公告)日:2013-05-14

    申请号:US12683124

    申请日:2010-01-06

    IPC分类号: G09B19/00

    CPC分类号: G09B19/0092

    摘要: A method and system for analyzing at least one food item on a food plate is disclosed. A plurality of images of the food plate is received by an image capturing device. A description of the at least one food item on the food plate is received by a recognition device. The description is at least one of a voice description and a text description. At least one processor extracts a list of food items from the description; classifies and segments the at least one food item from the list using color and texture features derived from the plurality of images; and estimates the volume of the classified and segmented at least one food item. The processor is also configured to estimate the caloric content of the at least one food item.

    摘要翻译: 公开了一种用于分析食品板上的至少一种食品的方法和系统。 食品牌的多个图像由图像捕获装置接收。 对食品牌上的至少一种食品的描述由识别装置接收。 描述是语音描述和文本描述中的至少一个。 至少一个处理器从描述中提取食物列表; 使用从多个图像导出的颜色和纹理特征来对列表中的至少一个食物项进行分类和分割; 并估计分类和分段的至少一种食品的体积。 处理器还被配置为估计至少一个食物的热量含量。

    Secure robust high-fidelity watermarking
    8.
    发明授权
    Secure robust high-fidelity watermarking 有权
    安全可靠的高保真水印

    公开(公告)号:US07298865B2

    公开(公告)日:2007-11-20

    申请号:US10124995

    申请日:2002-04-18

    IPC分类号: G06K9/00 G06K15/00

    CPC分类号: G06T1/0085 G06T1/0071

    摘要: For each small image region (in space and time), a measure of perceptual transparence of each of a set of possible watermark carrier modulations is used to choose a subset of such modulations, from which a secure random number generator selects, for each image region, a single carrier, modulations of which carry the watermark data.

    摘要翻译: 对于每个小图像区域(空间和时间),使用一组可能的水印载波调制中的每一个的感知透明度的度量来选择这样的调制的子集,安全随机数发生器从该子集中为每个图像区域 ,一个载波,其中携带水印数据的调制。