-
公开(公告)号:US08625897B2
公开(公告)日:2014-01-07
申请号:US12790026
申请日:2010-05-28
申请人: Antonio Criminisi , Jamie Daniel Joseph Shotton , Andrew Fitzgibbon , Toby Sharp , Matthew Darius Cook
发明人: Antonio Criminisi , Jamie Daniel Joseph Shotton , Andrew Fitzgibbon , Toby Sharp , Matthew Darius Cook
IPC分类号: G06K9/34
CPC分类号: G06K9/34 , G06K9/38 , G06T7/11 , G06T7/168 , G06T7/187 , G06T7/194 , G06T2207/10016 , G06T2207/20048 , G06T2207/20156 , G06T2207/30196 , H04N13/239
摘要: Foreground and background image segmentation is described. In an example, a seed region is selected in a foreground portion of an image, and a geodesic distance is calculated from each image element to the seed region. A subset of the image elements having a geodesic distance less than a threshold is determined, and this subset of image elements are labeled as foreground. In another example, an image element from an image showing at least a user, a foreground object in proximity to the user, and a background is applied to trained decision trees to obtain probabilities of the image element representing one of these items, and a corresponding classification assigned to the image element. This is repeated for each image element. Image elements classified as belonging to the user are labeled as foreground, and image elements classified as foreground objects or background are labeled as background.
摘要翻译: 描述了前景和背景图像分割。 在一个示例中,在图像的前景部分中选择种子区域,并且从每个图像元素计算到种子区域的测地距离。 确定具有小于阈值的测地距离的图像元素的子集,并且该图像元素的子集被标记为前景。 在另一示例中,将来自显示至少用户的图像,邻近用户的前景对象和背景的图像元素应用于经过训练的决策树,以获得表示这些项目之一的图像元素的概率,以及相应的 分类到图像元素的分类。 对于每个图像元素重复这一点。 分类为属于用户的图像元素被标记为前景,并且被分类为前景对象或背景的图像元素被标记为背景。
-
公开(公告)号:US20110293180A1
公开(公告)日:2011-12-01
申请号:US12790026
申请日:2010-05-28
申请人: Antonio Criminisi , Jamie Daniel Joseph Shotton , Andrew Fitzgibbon , Toby Sharp , Matthew Darius Cook
发明人: Antonio Criminisi , Jamie Daniel Joseph Shotton , Andrew Fitzgibbon , Toby Sharp , Matthew Darius Cook
CPC分类号: G06K9/34 , G06K9/38 , G06T7/11 , G06T7/168 , G06T7/187 , G06T7/194 , G06T2207/10016 , G06T2207/20048 , G06T2207/20156 , G06T2207/30196 , H04N13/239
摘要: Foreground and background image segmentation is described. In an example, a seed region is selected in a foreground portion of an image, and a geodesic distance is calculated from each image element to the seed region. A subset of the image elements having a geodesic distance less than a threshold is determined, and this subset of image elements are labeled as foreground. In another example, an image element from an image showing at least a user, a foreground object in proximity to the user, and a background is applied to trained decision trees to obtain probabilities of the image element representing one of these items, and a corresponding classification assigned to the image element. This is repeated for each image element. Image elements classified as belonging to the user are labeled as foreground, and image elements classified as foreground objects or background are labeled as background.
摘要翻译: 描述了前景和背景图像分割。 在一个示例中,在图像的前景部分中选择种子区域,并且从每个图像元素计算到种子区域的测地距离。 确定具有小于阈值的测地距离的图像元素的子集,并且该图像元素的子集被标记为前景。 在另一示例中,将来自显示至少用户的图像,邻近用户的前景对象和背景的图像元素应用于经过训练的决策树,以获得表示这些项目之一的图像元素的概率,以及相应的 分类到图像元素的分类。 对于每个图像元素重复这一点。 分类为属于用户的图像元素被标记为前景,并且被分类为前景对象或背景的图像元素被标记为背景。
-
公开(公告)号:US20120239174A1
公开(公告)日:2012-09-20
申请号:US13050858
申请日:2011-03-17
申请人: Jamie Daniel Joseph Shotton , Pushmeet Kohli , Ross Brook Girshick , Andrew Fitzgibbon , Antonio Criminisi
发明人: Jamie Daniel Joseph Shotton , Pushmeet Kohli , Ross Brook Girshick , Andrew Fitzgibbon , Antonio Criminisi
CPC分类号: G06F3/017 , G06K9/00362 , G06N5/025
摘要: Predicting joint positions is described, for example, to find joint positions of humans or animals (or parts thereof) in an image to control a computer game or for other applications. In an embodiment image elements of a depth image make joint position votes so that for example, an image element depicting part of a torso may vote for a position of a neck joint, a left knee joint and a right knee joint. A random decision forest may be trained to enable image elements to vote for the positions of one or more joints and the training process may use training images of bodies with specified joint positions. In an example a joint position vote is expressed as a vector representing a distance and a direction of a joint position from an image element making the vote. The random decision forest may be trained using a mixture of objectives.
摘要翻译: 例如,描述关节位置的描述是为了在图像中找到人或动物(或其部分)的联合位置,以控制计算机游戏或用于其他应用。 在一个实施例中,深度图像的图像元素进行联合位置投票,使得例如描绘躯干的一部分的图像元素可以投射颈部关节,左膝关节和右膝关节的位置。 可以对随机决策林进行训练,以使图像元素能够对一个或多个关节的位置进行投票,并且训练过程可以使用具有指定关节位置的身体的训练图像。 在一个例子中,联合立场表决被表示为表示从投票的图像元素的联合位置的距离和方向的向量。 可以使用目标混合来训练随机决策林。
-
公开(公告)号:US08571263B2
公开(公告)日:2013-10-29
申请号:US13050858
申请日:2011-03-17
申请人: Jamie Daniel Joseph Shotton , Pushmeet Kohli , Ross Brook Girshick , Andrew Fitzgibbon , Antonio Criminisi
发明人: Jamie Daniel Joseph Shotton , Pushmeet Kohli , Ross Brook Girshick , Andrew Fitzgibbon , Antonio Criminisi
IPC分类号: G06K9/00
CPC分类号: G06F3/017 , G06K9/00362 , G06N5/025
摘要: Predicting joint positions is described, for example, to find joint positions of humans or animals (or parts thereof) in an image to control a computer game or for other applications. In an embodiment image elements of a depth image make joint position votes so that for example, an image element depicting part of a torso may vote for a position of a neck joint, a left knee joint and a right knee joint. A random decision forest may be trained to enable image elements to vote for the positions of one or more joints and the training process may use training images of bodies with specified joint positions. In an example a joint position vote is expressed as a vector representing a distance and a direction of a joint position from an image element making the vote. The random decision forest may be trained using a mixture of objectives.
摘要翻译: 例如,描述关节位置的描述是为了在图像中找到人或动物(或其部分)的联合位置,以控制计算机游戏或用于其他应用。 在一个实施例中,深度图像的图像元素进行联合位置投票,使得例如描绘躯干的一部分的图像元素可以投射颈部关节,左膝关节和右膝关节的位置。 可以对随机决策林进行训练,以使图像元素能够对一个或多个关节的位置进行投票,并且训练过程可以使用具有指定关节位置的身体的训练图像。 在一个例子中,联合立场表决被表示为表示从投票的图像元素的联合位置的距离和方向的向量。 可以使用目标混合来训练随机决策林。
-
公开(公告)号:US08401242B2
公开(公告)日:2013-03-19
申请号:US13017587
申请日:2011-01-31
申请人: Richard Newcombe , Shahram Izadi , David Molyneaux , Otmar Hilliges , David Kim , Jamie Daniel Joseph Shotton , Pushmeet Kohli , Andrew Fitzgibbon , Stephen Edward Hodges , David Alexander Butler
发明人: Richard Newcombe , Shahram Izadi , David Molyneaux , Otmar Hilliges , David Kim , Jamie Daniel Joseph Shotton , Pushmeet Kohli , Andrew Fitzgibbon , Stephen Edward Hodges , David Alexander Butler
IPC分类号: G06K9/00
CPC分类号: A63F13/00 , A63F13/06 , A63F2300/1087 , A63F2300/69 , G06K9/00 , G06K9/00664 , G06T7/251 , G06T7/30 , G06T2207/10028 , G06T2207/10048 , G06T2207/30244
摘要: Real-time camera tracking using depth maps is described. In an embodiment depth map frames are captured by a mobile depth camera at over 20 frames per second and used to dynamically update in real-time a set of registration parameters which specify how the mobile depth camera has moved. In examples the real-time camera tracking output is used for computer game applications and robotics. In an example, an iterative closest point process is used with projective data association and a point-to-plane error metric in order to compute the updated registration parameters. In an example, a graphics processing unit (GPU) implementation is used to optimize the error metric in real-time. In some embodiments, a dense 3D model of the mobile camera environment is used.
摘要翻译: 描述使用深度图的实时相机跟踪。 在一个实施例中,深度图帧由移动深度相机以每秒20帧的速度捕获,并且用于实时动态地更新一组注册参数,这些参数指定移动深度相机已经移动。 在实例中,实时相机跟踪输出用于计算机游戏应用和机器人。 在一个例子中,迭代最近点处理与投影数据关联和点到平面误差度量一起使用,以便计算更新的注册参数。 在一个例子中,使用图形处理单元(GPU)实现来实时优化误差度量。 在一些实施例中,使用移动照相机环境的密集3D模型。
-
公开(公告)号:US20120194650A1
公开(公告)日:2012-08-02
申请号:US13017518
申请日:2011-01-31
申请人: Shahram Izadi , David Molyneaux , Otmar Hilliges , David Kim , Jamie Daniel Joseph Shotton , Stephen Edward Hodges , David Alexander Butler , Andrew Fitzgibbon , Pushmeet Kohli
发明人: Shahram Izadi , David Molyneaux , Otmar Hilliges , David Kim , Jamie Daniel Joseph Shotton , Stephen Edward Hodges , David Alexander Butler , Andrew Fitzgibbon , Pushmeet Kohli
IPC分类号: H04N13/02
CPC分类号: G06T7/55 , G01B9/08 , G01B11/22 , H04N5/23229 , H04N5/247 , H04N5/33 , H04N5/332 , H04N13/271
摘要: Systems and methods for reducing interference between multiple infra-red depth cameras are described. In an embodiment, the system comprises multiple infra-red sources, each of which projects a structured light pattern into the environment. A controller is used to control the sources in order to reduce the interference caused by overlapping light patterns. Various methods are described including: cycling between the different sources, where the cycle used may be fixed or may change dynamically based on the scene detected using the cameras; setting the wavelength of each source so that overlapping patterns are at different wavelengths; moving source-camera pairs in independent motion patterns; and adjusting the shape of the projected light patterns to minimize overlap. These methods may also be combined in any way. In another embodiment, the system comprises a single source and a mirror system is used to cast the projected structured light pattern around the environment.
摘要翻译: 描述了用于减少多个红外深度摄像机之间的干扰的系统和方法。 在一个实施例中,系统包括多个红外源,每个红外源将结构化的光图案投射到环境中。 控制器用于控制源,以减少由重叠的光图案引起的干扰。 描述了各种方法,包括:在不同的源之间循环,其中使用的周期可以是固定的,或者可以基于使用相机检测的场景动态地改变; 设置每个源的波长,使得重叠图案处于不同的波长; 以独立运动模式移动源摄像机对; 并调整投影光图案的形状以最小化重叠。 这些方法也可以以任何方式组合。 在另一个实施例中,系统包括单个源,并且使用镜子系统将投射的结构化光图案围绕环境投射。
-
公开(公告)号:US09053571B2
公开(公告)日:2015-06-09
申请号:US13154288
申请日:2011-06-06
申请人: Jamie Daniel Joseph Shotton , Shahram Izadi , Otmar Hilliges , David Kim , David Molyneaux , Pushmeet Kohli , Andrew Fitzgibbon , Stephen Edward Hodges
发明人: Jamie Daniel Joseph Shotton , Shahram Izadi , Otmar Hilliges , David Kim , David Molyneaux , Pushmeet Kohli , Andrew Fitzgibbon , Stephen Edward Hodges
CPC分类号: G06T7/251 , G06T17/10 , G06T2200/08 , G06T2207/10028
摘要: Generating computer models of 3D objects is described. In one example, depth images of an object captured by a substantially static depth camera are used to generate the model, which is stored in a memory device in a three-dimensional volume. Portions of the depth image determined to relate to the background are removed to leave a foreground depth image. The position and orientation of the object in the foreground depth image is tracked by comparison to a preceding depth image, and the foreground depth image is integrated into the volume by using the position and orientation to determine where to add data derived from the foreground depth image into the volume. In examples, the object is hand-rotated by a user before the depth camera. Hands that occlude the object are integrated out of the model as they do not move in sync with the object due to re-gripping.
摘要翻译: 描述生成3D对象的计算机模型。 在一个示例中,使用由基本上静态的深度相机拍摄的对象的深度图像来生成存储在三维体积中的存储器设备中的模型。 确定与背景相关的深度图像的部分被去除以留下前景深度图像。 通过与前一个深度图像进行比较来跟踪前景深度图像中的对象的位置和方向,并且通过使用位置和方向来将前景深度图像集成到卷中,以确定在哪里添加从前景深度图像导出的数据 进入卷。 在示例中,该对象在深度相机之前由用户手动旋转。 闭合对象的手从模型中集成出来,因为它们不会因为重新抓取而与对象同步移动。
-
公开(公告)号:US08570320B2
公开(公告)日:2013-10-29
申请号:US13017729
申请日:2011-01-31
申请人: Shahram Izadi , David Molyneaux , Otmar Hilliges , David Kim , Jamie Daniel Joseph Shotton , Pushmeet Kohli , Andrew Fitzgibbon , Stephen Edward Hodges , David Alexander Butler
发明人: Shahram Izadi , David Molyneaux , Otmar Hilliges , David Kim , Jamie Daniel Joseph Shotton , Pushmeet Kohli , Andrew Fitzgibbon , Stephen Edward Hodges , David Alexander Butler
CPC分类号: G06F3/011 , A63F13/65 , A63F2300/1087 , A63F2300/6009 , G06T7/20 , G06T17/00 , G06T2207/10016 , G06T2207/10028 , G06T2207/30244
摘要: Use of a 3D environment model in gameplay is described. In an embodiment, a mobile depth camera is used to capture a series of depth images as it is moved around and a dense 3D model of the environment is generated from this series of depth images. This dense 3D model is incorporated within an interactive application, such as a game. The mobile depth camera is then placed in a static position for an interactive phase, which in some examples is gameplay, and the system detects motion of a user within a part of the environment from a second series of depth images captured by the camera. This motion provides a user input to the interactive application, such as a game. In further embodiments, automatic recognition and identification of objects within the 3D model may be performed and these identified objects then change the way that the interactive application operates.
摘要翻译: 描述了在游戏中使用3D环境模型。 在一个实施例中,移动深度相机被用来捕获一系列深度图像,因为它被移动,并且从该系列深度图像生成环境的密集3D模型。 这种密集的3D模型被并入在诸如游戏的交互式应用中。 然后将移动深度相机放置在用于交互式相位的静态位置,在一些示例中为游戏画面,并且系统从相机拍摄的第二系列深度图像中检测环境部分内的用户的运动。 该运动向诸如游戏的交互式应用提供用户输入。 在另外的实施例中,可以执行3D模型内的对象的自动识别和识别,并且这些识别的对象然后改变交互式应用程序的操作方式。
-
公开(公告)号:US20120194644A1
公开(公告)日:2012-08-02
申请号:US13017474
申请日:2011-01-31
申请人: Richard Newcombe , Shahram Izadi , David Molyneaux , Otmar Hilliges , David Kim , Jamie Daniel Joseph Shotton , Pushmeet Kohli , Andrew Fitzgibbon , Stephen Edward Hodges , David Alexander Butler
发明人: Richard Newcombe , Shahram Izadi , David Molyneaux , Otmar Hilliges , David Kim , Jamie Daniel Joseph Shotton , Pushmeet Kohli , Andrew Fitzgibbon , Stephen Edward Hodges , David Alexander Butler
IPC分类号: H04N5/225
CPC分类号: G06T7/20 , G06T7/74 , G06T2207/10016 , G06T2207/10021 , G06T2207/10024 , G06T2207/10028 , G06T2207/30244
摘要: Mobile camera localization using depth maps is described for robotics, immersive gaming, augmented reality and other applications. In an embodiment a mobile depth camera is tracked in an environment at the same time as a 3D model of the environment is formed using the sensed depth data. In an embodiment, when camera tracking fails, this is detected and the camera is relocalized either by using previously gathered keyframes or in other ways. In an embodiment, loop closures are detected in which the mobile camera revisits a location, by comparing features of a current depth map with the 3D model in real time. In embodiments the detected loop closures are used to improve the consistency and accuracy of the 3D model of the environment.
摘要翻译: 使用深度图的移动摄像机定位被描述为机器人,沉浸式游戏,增强现实和其他应用。 在一个实施例中,移动深度相机在环境中跟踪,同时使用感测的深度数据形成环境的3D模型。 在一个实施例中,当相机跟踪失败时,检测到该相机并且通过使用先前收集的关键帧或以其它方式来重新定位相机。 在一个实施例中,通过将当前深度图与3D模型的特征实时比较,检测到环路闭合,其中移动摄像机重新访问位置。 在实施例中,检测到的环路闭合用于改善环境的3D模型的一致性和准确性。
-
公开(公告)号:US08587583B2
公开(公告)日:2013-11-19
申请号:US13017690
申请日:2011-01-31
申请人: Richard Newcombe , Shahram Izadi , David Molyneaux , Otmar Hilliges , David Kim , Jamie Daniel Joseph Shotton , Stephen Edward Hodges , David Alexander Butler , Andrew Fitzgibbon , Pushmeet Kohli
发明人: Richard Newcombe , Shahram Izadi , David Molyneaux , Otmar Hilliges , David Kim , Jamie Daniel Joseph Shotton , Stephen Edward Hodges , David Alexander Butler , Andrew Fitzgibbon , Pushmeet Kohli
CPC分类号: G06T17/00 , G06T2200/08
摘要: Three-dimensional environment reconstruction is described. In an example, a 3D model of a real-world environment is generated in a 3D volume made up of voxels stored on a memory device. The model is built from data describing a camera location and orientation, and a depth image with pixels indicating a distance from the camera to a point in the environment. A separate execution thread is assigned to each voxel in a plane of the volume. Each thread uses the camera location and orientation to determine a corresponding depth image location for its associated voxel, determines a factor relating to the distance between the associated voxel and the point in the environment at the corresponding location, and updates a stored value at the associated voxel using the factor. Each thread iterates through an equivalent voxel in the remaining planes of the volume, repeating the process to update the stored value.
摘要翻译: 描述了三维环境重建。 在一个示例中,在由存储在存储器件上的体素组成的3D体积中生成真实世界环境的3D模型。 该模型是从描述相机位置和方向的数据构建的,以及具有指示从相机到环境中的点的距离的像素的深度图像。 单独的执行线程被分配给卷的平面中的每个体素。 每个线程使用摄像机位置和方向来确定其相关体素的相应深度图像位置,确定与相关体素和相应位置处的环境中的点之间的距离有关的因子,并更新相关联的体素的存储值 体素使用因子。 每个线程遍历卷的剩余平面中的等效体素,重复更新存储值的过程。
-
-
-
-
-
-
-
-
-