-
公开(公告)号:US20120027304A1
公开(公告)日:2012-02-02
申请号:US12845095
申请日:2010-07-28
IPC分类号: G06K9/46
CPC分类号: G06K9/00718 , G06K9/00369 , G06K9/00664 , G06K9/469 , G06K9/6201 , G06K9/6202 , G06K9/6232 , G06K9/6857
摘要: The invention provides an improved method to detect semantic attributes of human body in computer vision. In detecting semantic attributes of human body in computer vision, the invention maintains a list of semantic attributes, each of which corresponds to a human body part. A computer module then analyzes segments of a frame of a digital video to detect each semantic attribute by finding a most likely attribute for each segment. A threshold is applied to select candidate segments of the frame for further analysis. The candidate segments of the frame then go through geometric and resolution context analysis by applying the physical structure principles of a human body and by analyzing increasingly higher resolution versions of the image to verify the existence and accuracy of parts and attributes. A computer module computes a resolution context score for a lower resolution version of the image based on a weighted average score computed for a higher resolution version of the image by evaluating appearance features, geometric features, and resolution context features when available on the higher resolution version of the image. Finally, an optimal configuration step is performed via dynamic programming to select an optimal output with both semantic attributes and spatial positions of human body parts on the frame.
摘要翻译: 本发明提供了一种用于检测计算机视觉中人体语义属性的改进方法。 在检测计算机视觉中人体的语义属性时,本发明保留了语义属性的列表,每个语义属性对应于人体部分。 然后,计算机模块通过为每个段找到最可能的属性来分析数字视频的帧的段以检测每个语义属性。 应用阈值来选择帧的候选片段用于进一步分析。 然后,帧的候选片段通过应用人体的物理结构原理并通过分析图像的越来越高的分辨率版本来验证部件和属性的存在和准确性来进行几何和分辨率上下文分析。 计算机模块基于通过在更高分辨率版本上可用时评估外观特征,几何特征和分辨率上下文特征来计算针对图像的较高分辨率版本的加权平均得分,来计算图像的较低分辨率版本的分辨率上下文得分 的图像。 最后,通过动态规划执行最佳配置步骤,以选择具有框架上人体部位的语义属性和空间位置的最优输出。
-
公开(公告)号:US08532390B2
公开(公告)日:2013-09-10
申请号:US12845095
申请日:2010-07-28
IPC分类号: G06K9/00
CPC分类号: G06K9/00718 , G06K9/00369 , G06K9/00664 , G06K9/469 , G06K9/6201 , G06K9/6202 , G06K9/6232 , G06K9/6857
摘要: The invention provides an improved method to detect semantic attributes of human body in computer vision. In detecting semantic attributes of human body in computer vision, the invention maintains a list of semantic attributes, each of which corresponds to a human body part. A computer module then analyzes segments of a frame of a digital video to detect each semantic attribute by finding a most likely attribute for each segment. A threshold is applied to select candidate segments of the frame for further analysis. The candidate segments of the frame then go through geometric and resolution context analysis by applying the physical structure principles of a human body and by analyzing increasingly higher resolution versions of the image to verify the existence and accuracy of parts and attributes. A computer module computes a resolution context score for a lower resolution version of the image based on a weighted average score computed for a higher resolution version of the image by evaluating appearance features, geometric features, and resolution context features when available on the higher resolution version of the image. Finally, an optimal configuration step is performed via dynamic programming to select an optimal output with both semantic attributes and spatial positions of human body parts on the frame.
摘要翻译: 本发明提供了一种用于检测计算机视觉中人体语义属性的改进方法。 在检测计算机视觉中人体的语义属性时,本发明保留了语义属性的列表,每个语义属性对应于人体部分。 然后,计算机模块通过为每个段找到最可能的属性来分析数字视频的帧的段以检测每个语义属性。 应用阈值来选择帧的候选片段用于进一步分析。 然后,帧的候选片段通过应用人体的物理结构原理并通过分析图像的越来越高的分辨率版本来验证部件和属性的存在和准确性来进行几何和分辨率上下文分析。 计算机模块基于通过在更高分辨率版本上可用时评估外观特征,几何特征和分辨率上下文特征来计算针对图像的较高分辨率版本的加权平均得分,来计算图像的较低分辨率版本的分辨率上下文得分 的图像。 最后,通过动态规划执行最佳配置步骤,以选择具有框架上人体部位的语义属性和空间位置的最优输出。
-
公开(公告)号:US08520899B2
公开(公告)日:2013-08-27
申请号:US13525905
申请日:2012-06-18
CPC分类号: G06K9/00369 , G06K9/00771 , G06K9/2081 , G06K9/3241 , G06K2209/23
摘要: Techniques for classifying one or more objects in at least one video, wherein the at least one video comprises a plurality of frames are provided. One or more objects in the plurality of frames are tracked. A level of deformation is computed for each of the one or more tracked objects in accordance with at least one change in a plurality of histograms of oriented gradients for a corresponding tracked object. Each of the one or more tracked objects is classified in accordance with the computed level of deformation.
摘要翻译: 用于对至少一个视频中的一个或多个对象进行分类的技术,其中所述至少一个视频包括多个帧。 跟踪多个帧中的一个或多个对象。 根据对应的跟踪对象的定向梯度的多个直方图中的至少一个变化来计算一个或多个跟踪对象中的每一个的变形水平。 一个或多个跟踪物体中的每一个根据计算的变形水平进行分类。
-
公开(公告)号:US20100054535A1
公开(公告)日:2010-03-04
申请号:US12200017
申请日:2008-08-28
IPC分类号: G06K9/62
CPC分类号: G06K9/00369 , G06K9/00771 , G06K9/2081 , G06K9/3241 , G06K2209/23
摘要: Techniques for classifying one or more objects in at least one video, wherein the at least one video comprises a plurality of frames are provided. One or more objects in the plurality of frames are tracked. A level of deformation is computed for each of the one or more tracked objects in accordance with at least one change in a plurality of histograms of oriented gradients for a corresponding tracked object. Each of the one or more tracked objects is classified in accordance with the computed level of deformation.
摘要翻译: 用于对至少一个视频中的一个或多个对象进行分类的技术,其中所述至少一个视频包括多个帧。 跟踪多个帧中的一个或多个对象。 根据对应的跟踪对象的定向梯度的多个直方图中的至少一个变化来计算一个或多个跟踪对象中的每一个的变形水平。 一个或多个跟踪物体中的每一个根据计算的变形水平进行分类。
-
公开(公告)号:US20120257793A1
公开(公告)日:2012-10-11
申请号:US13525905
申请日:2012-06-18
IPC分类号: G06K9/62
CPC分类号: G06K9/00369 , G06K9/00771 , G06K9/2081 , G06K9/3241 , G06K2209/23
摘要: Techniques for classifying one or more objects in at least one video, wherein the at least one video comprises a plurality of frames are provided. One or more objects in the plurality of frames are tracked. A level of deformation is computed for each of the one or more tracked objects in accordance with at least one change in a plurality of histograms of oriented gradients for a corresponding tracked object. Each of the one or more tracked objects is classified in accordance with the computed level of deformation.
摘要翻译: 用于对至少一个视频中的一个或多个对象进行分类的技术,其中所述至少一个视频包括多个帧。 跟踪多个帧中的一个或多个对象。 根据对应的跟踪对象的定向梯度的多个直方图中的至少一个变化来计算一个或多个跟踪对象中的每一个的变形水平。 一个或多个跟踪物体中的每一个根据计算的变形水平进行分类。
-
公开(公告)号:US20100054540A1
公开(公告)日:2010-03-04
申请号:US12200059
申请日:2008-08-28
CPC分类号: G06K9/6267 , G06K9/00369 , G06K9/00771 , G06K9/2081 , G06K9/3241 , G06K2209/23
摘要: Techniques for calibrating a classification system, wherein one or more objects in at least one video are classified, are provided. At least one view associated with the at least one video is obtained. The at least one view is partitioned into at least one region. A given object is classified in accordance with its location in reference to the at least one region. In an additional embodiment, one or more object models are obtained. At least one normalized size of the one or more objects is defined within at least one view associated with the at least one video in accordance with the one or more object models. The one or more objects are classified in accordance with the at least one defined normalized size.
摘要翻译: 提供了一种用于校准分类系统的技术,其中至少一个视频中的一个或多个对象被分类。 获得与至少一个视频相关联的至少一个视图。 至少一个视图被划分为至少一个区域。 给定对象根据其位置参照至少一个区域进行分类。 在另外的实施例中,获得一个或多个对象模型。 根据一个或多个对象模型,在与至少一个视频相关联的至少一个视图内定义一个或多个对象的至少一个归一化大小。 根据至少一个定义的归一化尺寸对一个或多个对象进行分类。
-
公开(公告)号:US08483490B2
公开(公告)日:2013-07-09
申请号:US12200059
申请日:2008-08-28
CPC分类号: G06K9/6267 , G06K9/00369 , G06K9/00771 , G06K9/2081 , G06K9/3241 , G06K2209/23
摘要: Techniques for calibrating a classification system, wherein one or more objects in at least one video are classified, are provided. At least one view associated with the at least one video is obtained. The at least one view is partitioned into at least one region. A given object is classified in accordance with its location in reference to the at least one region. In an additional embodiment, one or more object models are obtained. At least one normalized size of the one or more objects is defined within at least one view associated with the at least one video in accordance with the one or more object models. The one or more objects are classified in accordance with the at least one defined normalized size.
摘要翻译: 提供了一种用于校准分类系统的技术,其中至少一个视频中的一个或多个对象被分类。 获得与至少一个视频相关联的至少一个视图。 至少一个视图被划分为至少一个区域。 给定对象根据其位置参照至少一个区域进行分类。 在另外的实施例中,获得一个或多个对象模型。 根据一个或多个对象模型,在与至少一个视频相关联的至少一个视图内定义一个或多个对象的至少一个归一化大小。 根据至少一个定义的归一化尺寸对一个或多个对象进行分类。
-
公开(公告)号:US08249301B2
公开(公告)日:2012-08-21
申请号:US12200017
申请日:2008-08-28
CPC分类号: G06K9/00369 , G06K9/00771 , G06K9/2081 , G06K9/3241 , G06K2209/23
摘要: Techniques for classifying one or more objects in at least one video, wherein the at least one video comprises a plurality of frames are provided. One or more objects in the plurality of frames are tracked. A level of deformation is computed for each of the one or more tracked objects in accordance with at least one change in a plurality of histograms of oriented gradients for a corresponding tracked object. Each of the one or more tracked objects is classified in accordance with the computed level of deformation.
摘要翻译: 用于对至少一个视频中的一个或多个对象进行分类的技术,其中所述至少一个视频包括多个帧。 跟踪多个帧中的一个或多个对象。 根据对应的跟踪对象的定向梯度的多个直方图中的至少一个变化来计算一个或多个跟踪对象中的每一个的变形水平。 一个或多个跟踪物体中的每一个根据计算的变形水平进行分类。
-
公开(公告)号:US08289392B2
公开(公告)日:2012-10-16
申请号:US12166732
申请日:2008-07-02
申请人: Andrew William Senior , Sharathchandra Pankanti , Arun Hampapur , Lisa Marie Brown , Ying-Li Tian
发明人: Andrew William Senior , Sharathchandra Pankanti , Arun Hampapur , Lisa Marie Brown , Ying-Li Tian
IPC分类号: H04N7/18
CPC分类号: H04N5/232 , H04N5/23216 , H04N5/247 , H04N7/18
摘要: A system for automatically acquiring high-resolution images by steering a pan-tilt-zoom camera at targets detected in a fixed camera view is provided. The system uses automatic or manual calibration between multiple cameras. Using automatic calibration, the homography between the cameras in a home position is estimated together with the effects of pan and tilt controls and the expected height of a person in the image. These calibrations are chained together to steer a slave camera. The manual calibration scheme steers a camera to the desired region of interest and calculates the pan, tile and zoom parameters accordingly.
摘要翻译: 提供了一种用于通过将俯仰变焦相机瞄准在固定摄像机视图中检测到的目标来自动获取高分辨率图像的系统。 该系统使用多台摄像机之间的自动或手动校准。 使用自动校准,估计初始位置的摄像机之间的单应性以及平移和倾斜控制的影响以及人物在图像中的预期高度。 这些校准被链接在一起以引导从照相机。 手动校准方案将相机引导到所需的感兴趣区域,并相应地计算平移,平铺和缩放参数。
-
公开(公告)号:US07796154B2
公开(公告)日:2010-09-14
申请号:US11074383
申请日:2005-03-07
申请人: Andrew William Senior , Sharathchandra Pankanti , Arun Hampapur , Lisa Marie Brown , Ying-Li Tian
发明人: Andrew William Senior , Sharathchandra Pankanti , Arun Hampapur , Lisa Marie Brown , Ying-Li Tian
IPC分类号: H04N7/18
CPC分类号: H04N5/232 , H04N5/23216 , H04N5/247 , H04N7/18
摘要: A system for automatically acquiring high-resolution images by steering a pan-tilt-zoom camera at targets detected in a fixed camera view is provided. The system uses automatic or manual calibration between multiple cameras. Using automatic calibration, the homography between the cameras in a home position is estimated together with the effects of pan and tilt controls and the expected height of a person in the image. These calibrations are chained together to steer a slave camera. The manual calibration scheme steers a camera to the desired region of interest and calculates the pan, tile and zoom parameters accordingly.
摘要翻译: 提供了一种用于通过将俯仰变焦相机瞄准在固定摄像机视图中检测到的目标来自动获取高分辨率图像的系统。 该系统使用多台摄像机之间的自动或手动校准。 使用自动校准,估计初始位置的摄像机之间的单应性以及平移和倾斜控制的效果以及人物在图像中的预期高度。 这些校准被链接在一起以引导从照相机。 手动校准方案将相机引导到所需的感兴趣区域,并相应地计算平移,平铺和缩放参数。
-
-
-
-
-
-
-
-
-