Food recognition using visual analysis and speech recognition
    1.
    Granted invention patent
    Food recognition using visual analysis and speech recognition (legal status: in force)

    Publication number: US08439683B2

    Publication date: 2013-05-14

    Application number: US12683124

    Filing date: 2010-01-06

    IPC classification: G09B19/00

    CPC classification: G09B19/0092

    Abstract: A method and system for analyzing at least one food item on a food plate is disclosed. A plurality of images of the food plate is received by an image capturing device. A description of the at least one food item on the food plate is received by a recognition device. The description is at least one of a voice description and a text description. At least one processor extracts a list of food items from the description; classifies and segments the at least one food item from the list using color and texture features derived from the plurality of images; and estimates the volume of the classified and segmented at least one food item. The processor is also configured to estimate the caloric content of the at least one food item.
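
    The last two steps of the abstract (extracting a food list from the description, then turning a volume estimate into a calorie estimate) can be sketched as below. This is a minimal illustration, not the patented method: the names FOOD_TABLE, extract_food_items and estimate_calories are hypothetical, the density and energy values are placeholders rather than real nutritional data, and the volumes are hard-coded where the patent would derive them from the images.

```python
import re

# Hypothetical lookup table. The numbers are illustrative placeholders,
# not values taken from the patent or from any nutrition database.
FOOD_TABLE = {
    "rice": {"density_g_per_ml": 0.8, "kcal_per_g": 1.3},
    "chicken": {"density_g_per_ml": 1.0, "kcal_per_g": 2.0},
    "broccoli": {"density_g_per_ml": 0.4, "kcal_per_g": 0.35},
}


def extract_food_items(description, vocabulary=FOOD_TABLE):
    """Return known food names in the order they first appear in the text."""
    words = re.findall(r"[a-z]+", description.lower())
    items = []
    for word in words:
        if word in vocabulary and word not in items:
            items.append(word)
    return items


def estimate_calories(volumes_ml, table=FOOD_TABLE):
    """Map per-item volumes (ml) to kcal as volume * density * kcal per gram."""
    return {
        name: volume * table[name]["density_g_per_ml"] * table[name]["kcal_per_g"]
        for name, volume in volumes_ml.items()
    }


# The text description could come from typed input or a speech recogniser.
items = extract_food_items("I am having rice with grilled chicken and more rice")

# Assumed volumes; in the abstract these come from image-based segmentation.
calories = estimate_calories({"rice": 200.0, "chicken": 150.0})
```

    Restricting the classifier to the items named in the description is what lets the spoken or typed input narrow the visual search; the sketch only shows the list extraction, not the color and texture classification itself.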

    FOOD RECOGNITION USING VISUAL ANALYSIS AND SPEECH RECOGNITION
    2.
    Invention application
    FOOD RECOGNITION USING VISUAL ANALYSIS AND SPEECH RECOGNITION (legal status: in force)

    Publication number: US20100173269A1

    Publication date: 2010-07-08

    Application number: US12683124

    Filing date: 2010-01-06

    IPC classification: G09B19/00

    CPC classification: G09B19/0092

    Abstract: A method and system for analyzing at least one food item on a food plate is disclosed. A plurality of images of the food plate is received by an image capturing device. A description of the at least one food item on the food plate is received by a recognition device. The description is at least one of a voice description and a text description. At least one processor extracts a list of food items from the description; classifies and segments the at least one food item from the list using color and texture features derived from the plurality of images; and estimates the volume of the classified and segmented at least one food item. The processor is also configured to estimate the caloric content of the at least one food item.

    Content-based matching of videos using local spatio-temporal fingerprints
    3.
    Granted invention patent
    Content-based matching of videos using local spatio-temporal fingerprints (legal status: in force)

    Publication number: US08498487B2

    Publication date: 2013-07-30

    Application number: US12262463

    Filing date: 2008-10-31

    IPC classification: G06K9/50

    Abstract: A computer-implemented method for deriving a fingerprint from video data is disclosed, comprising the steps of receiving a plurality of frames from the video data; selecting at least one key frame from the plurality of frames, the at least one key frame being selected from two consecutive frames of the plurality of frames that exhibit a maximal cumulative difference in at least one spatial feature of the two consecutive frames; detecting at least one 3D spatio-temporal feature within the at least one key frame; and encoding a spatio-temporal fingerprint based on mean luminance of the at least one 3D spatio-temporal feature. The at least one spatial feature can be intensity. The at least one 3D spatio-temporal feature can be at least one Maximally Stable Volume (MSV). Also disclosed is a method for matching video data to a database containing a plurality of video fingerprints of the type described above, comprising the steps of calculating at least one fingerprint representing at least one query frame from the video data; indexing into the database using the at least one calculated fingerprint to find a set of candidate fingerprints; applying a score to each of the candidate fingerprints; selecting a subset of candidate fingerprints as proposed frames by rank ordering the candidate fingerprints; and attempting to match at least one fingerprint of at least one proposed frame based on a comparison of gradient-based descriptors associated with the at least one query frame and the at least one proposed frame.
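
    The key-frame selection step can be illustrated with a small sketch. It assumes frames are equally sized 2D lists of intensity values, measures the cumulative difference as a sum of absolute per-pixel differences, and returns the later frame of the winning pair; all three choices, and the function names, are assumptions made for illustration, since the abstract does not fix them. MSV detection and fingerprint encoding are not shown.

```python
def cumulative_difference(frame_a, frame_b):
    """Sum of absolute per-pixel intensity differences between two frames."""
    return sum(
        abs(a - b)
        for row_a, row_b in zip(frame_a, frame_b)
        for a, b in zip(row_a, row_b)
    )


def select_key_frame(frames):
    """Return (index, difference) for the consecutive pair that differs most.

    The index refers to the second frame of the pair (an assumption).
    """
    if len(frames) < 2:
        raise ValueError("need at least two frames")
    diffs = [
        cumulative_difference(frames[i], frames[i + 1])
        for i in range(len(frames) - 1)
    ]
    best = max(range(len(diffs)), key=diffs.__getitem__)
    return best + 1, diffs[best]


# Four tiny 2x2 "frames"; the large change happens between frames 1 and 2.
frames = [
    [[0, 0], [0, 0]],
    [[1, 0], [0, 0]],
    [[9, 9], [9, 9]],
    [[9, 9], [9, 8]],
]
key_index, key_diff = select_key_frame(frames)
```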

    CONTENT-BASED MATCHING OF VIDEOS USING LOCAL SPATIO-TEMPORAL FINGERPRINTS
    4.
    Invention application
    CONTENT-BASED MATCHING OF VIDEOS USING LOCAL SPATIO-TEMPORAL FINGERPRINTS (legal status: in force)

    Publication number: US20100049711A1

    Publication date: 2010-02-25

    Application number: US12262463

    Filing date: 2008-10-31

    IPC classification: G06F17/30 H04N7/26

    Abstract: A computer-implemented method for deriving a fingerprint from video data is disclosed, comprising the steps of receiving a plurality of frames from the video data; selecting at least one key frame from the plurality of frames, the at least one key frame being selected from two consecutive frames of the plurality of frames that exhibit a maximal cumulative difference in at least one spatial feature of the two consecutive frames; detecting at least one 3D spatio-temporal feature within the at least one key frame; and encoding a spatio-temporal fingerprint based on mean luminance of the at least one 3D spatio-temporal feature. The at least one spatial feature can be intensity. The at least one 3D spatio-temporal feature can be at least one Maximally Stable Volume (MSV). Also disclosed is a method for matching video data to a database containing a plurality of video fingerprints of the type described above, comprising the steps of calculating at least one fingerprint representing at least one query frame from the video data; indexing into the database using the at least one calculated fingerprint to find a set of candidate fingerprints; applying a score to each of the candidate fingerprints; selecting a subset of candidate fingerprints as proposed frames by rank ordering the candidate fingerprints; and attempting to match at least one fingerprint of at least one proposed frame based on a comparison of gradient-based descriptors associated with the at least one query frame and the at least one proposed frame.

    Calibrating devices by selecting images having a target having fiducial features
    5.
    Granted invention patent
    Calibrating devices by selecting images having a target having fiducial features (legal status: in force)

    Publication number: US09338447B1

    Publication date: 2016-05-10

    Application number: US13419750

    Filing date: 2012-03-14

    IPC classification: H04N17/02

    Abstract: Devices and techniques are described for automatically calibrating a device such as a camera, projector, or both using a video stream. The device undergoing calibration is coupled to a computing device and configured to acquire video comprising a plurality of images. These images, at least some of the time, include targets having known characteristics present in an environment. From these acquired images, calibration data comprising parameters such as intrinsic parameters, extrinsic parameters, or both may be determined. The calibration data may be embedded or otherwise associated with the video stream as calibration matrix metadata.
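
    One way to picture "calibration matrix metadata" is a small record that travels with each frame. The sketch below is only an assumed layout: the CalibrationMetadata class, its field names, the 3x3 intrinsic and 3x4 extrinsic conventions, and the JSON encoding are all hypothetical and are not described in the abstract.

```python
import json
from dataclasses import dataclass, asdict


@dataclass
class CalibrationMetadata:
    """Hypothetical per-frame calibration record."""
    frame_index: int
    intrinsic: list  # 3x3 matrix, row-major
    extrinsic: list  # 3x4 matrix [R | t], row-major


def intrinsic_matrix(fx, fy, cx, cy):
    """Pinhole intrinsic matrix from focal lengths and principal point."""
    return [[fx, 0.0, cx], [0.0, fy, cy], [0.0, 0.0, 1.0]]


def encode_metadata(record):
    """Serialise a record so it can be stored alongside a video frame."""
    return json.dumps(asdict(record))


def decode_metadata(payload):
    """Rebuild a record from its serialised form."""
    return CalibrationMetadata(**json.loads(payload))


identity_pose = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
]
record = CalibrationMetadata(
    frame_index=42,
    intrinsic=intrinsic_matrix(800.0, 800.0, 320.0, 240.0),
    extrinsic=identity_pose,
)
restored = decode_metadata(encode_metadata(record))
```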

    Hand gesture detection
    6.
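    Granted invention patent
    Hand gesture detection (legal status: in force)

    Publication number: US08970479B1

    Publication date: 2015-03-03

    Application number: US13562734

    Filing date: 2012-07-31

    IPC classification: G09G5/00 G06F3/01

    CPC classification: G06F3/017 G06K9/00355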

    Abstract: Techniques are described for detecting a hand gesture made by a user. Fingertips of a hand may be identified and tracked over time. When a user contracts the fingertips from an extended position, hand spread may be calculated based on the area of the hand and fingers. The hand spread over time may be compared to a Gaussian function to evaluate whether the observed motion represents a grasping motion.
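
    Comparing a hand-spread time series with a Gaussian function could look roughly like the sketch below. The abstract does not say how the comparison is made, so the peak normalisation, the root-mean-square error measure, the fixed mean and sigma, and the 0.1 threshold are all assumptions chosen for illustration.

```python
import math


def gaussian(t, amplitude, mean, sigma):
    """Value of a Gaussian curve at time t."""
    return amplitude * math.exp(-((t - mean) ** 2) / (2.0 * sigma ** 2))


def gaussian_fit_error(spread, mean, sigma):
    """RMS error between peak-normalised spread samples and a unit Gaussian."""
    peak = max(spread)
    if peak <= 0:
        raise ValueError("spread must contain a positive value")
    squared = [
        (s / peak - gaussian(t, 1.0, mean, sigma)) ** 2
        for t, s in enumerate(spread)
    ]
    return math.sqrt(sum(squared) / len(squared))


def looks_like_grasp(spread, mean=0.0, sigma=3.0, threshold=0.1):
    """Assumed decision rule: a small fit error means a grasp-like motion."""
    return gaussian_fit_error(spread, mean, sigma) < threshold


# A spread that decays from the extended position like a Gaussian tail,
# and a hand that stays open for the whole window.
closing_hand = [gaussian(t, 50.0, 0.0, 3.0) for t in range(10)]
open_hand = [50.0] * 10

grasp_detected = looks_like_grasp(closing_hand)
open_detected = looks_like_grasp(open_hand)
```

    A practical detector would likely fit the mean and width to the data rather than fix them in advance; they are fixed here only to keep the sketch short.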

    Automatic camera calibration
    7.
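    Granted invention patent
    Automatic camera calibration (legal status: in force)

    Publication number: US08619144B1

    Publication date: 2013-12-31

    Application number: US13419857

    Filing date: 2012-03-14

    IPC classification: H04N17/00

    CPC classification: H04N17/002 G06T7/80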

    Abstract: Devices and techniques are described for automatically calibrating a camera system. The camera system undergoing calibration is coupled to a computing device and an automated positioning platform coupled to a target structure. The computing device acquires images of the target structure from the camera in a plurality of repeatable poses. From these acquired images, intrinsic camera parameters may be determined. Once determined, the parameters may be used to correct images acquired by the camera system.
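
    As an illustration of how intrinsic parameters might be used to correct an image, the sketch below removes lens distortion from a single pixel. It assumes a pinhole model with focal lengths fx and fy, principal point (cx, cy) and one radial distortion coefficient k1; the abstract does not name a distortion model, so this model, the fixed-point inversion and all function names are assumptions.

```python
def distort_pixel(u, v, fx, fy, cx, cy, k1):
    """Apply one-coefficient radial distortion to an ideal pixel."""
    x = (u - cx) / fx
    y = (v - cy) / fy
    factor = 1.0 + k1 * (x * x + y * y)
    return fx * x * factor + cx, fy * y * factor + cy


def undistort_pixel(u, v, fx, fy, cx, cy, k1, iterations=10):
    """Invert the radial model by fixed-point iteration.

    Converges quickly when k1 * r^2 is small, which is assumed here.
    """
    xd = (u - cx) / fx
    yd = (v - cy) / fy
    x, y = xd, yd
    for _ in range(iterations):
        factor = 1.0 + k1 * (x * x + y * y)
        x, y = xd / factor, yd / factor
    return fx * x + cx, fy * y + cy


# Made-up intrinsics for a 640x480 camera with mild barrel distortion.
params = dict(fx=800.0, fy=800.0, cx=320.0, cy=240.0, k1=-0.1)

ideal = (480.0, 320.0)
observed = distort_pixel(*ideal, **params)
corrected = undistort_pixel(*observed, **params)
```

    Correcting a whole image would apply the same mapping to every pixel and resample, which is omitted here.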
