-
公开(公告)号:US08437570B2
公开(公告)日:2013-05-07
申请号:US12126302
申请日:2008-05-23
Applicant: Antonio Criminisi , Toby Sharp
Inventor: Antonio Criminisi , Toby Sharp
CPC classification number: G06T5/002 , G06K9/342 , G06T7/11 , G06T7/155 , G06T2207/10016 , G06T2207/30212
Abstract: A method of geodesic image and video processing is proposed. In an embodiment, the method uses a geodesic distance transform to construct an image filter. The filter can be used in a variety of image editing operations such as segmentation, denoising, texture smoothing, image stitching and cartooning. In one embodiment, the method may be made efficient by utilizing parallelism of the algorithm to carry out processing steps on at least two processing cores concurrently. This efficiency may enable high-resolution images and video to be processed at ‘real time’ rates without the need for specialist hardware.
Abstract translation: 提出了一种测地图像和视频处理方法。 在一个实施例中,该方法使用测地距离变换来构造图像滤波器。 滤镜可用于各种图像编辑操作,如分割,去噪,纹理平滑,图像拼接和卡通。 在一个实施例中,可以通过利用算法的并行性来同时对至少两个处理核执行处理步骤来使该方法有效。 这种效率可以使得高分辨率图像和视频以“实时”速率被处理,而不需要专用硬件。
-
公开(公告)号:US20110293180A1
公开(公告)日:2011-12-01
申请号:US12790026
申请日:2010-05-28
Applicant: Antonio Criminisi , Jamie Daniel Joseph Shotton , Andrew Fitzgibbon , Toby Sharp , Matthew Darius Cook
Inventor: Antonio Criminisi , Jamie Daniel Joseph Shotton , Andrew Fitzgibbon , Toby Sharp , Matthew Darius Cook
CPC classification number: G06K9/34 , G06K9/38 , G06T7/11 , G06T7/168 , G06T7/187 , G06T7/194 , G06T2207/10016 , G06T2207/20048 , G06T2207/20156 , G06T2207/30196 , H04N13/239
Abstract: Foreground and background image segmentation is described. In an example, a seed region is selected in a foreground portion of an image, and a geodesic distance is calculated from each image element to the seed region. A subset of the image elements having a geodesic distance less than a threshold is determined, and this subset of image elements are labeled as foreground. In another example, an image element from an image showing at least a user, a foreground object in proximity to the user, and a background is applied to trained decision trees to obtain probabilities of the image element representing one of these items, and a corresponding classification assigned to the image element. This is repeated for each image element. Image elements classified as belonging to the user are labeled as foreground, and image elements classified as foreground objects or background are labeled as background.
Abstract translation: 描述了前景和背景图像分割。 在一个示例中,在图像的前景部分中选择种子区域,并且从每个图像元素计算到种子区域的测地距离。 确定具有小于阈值的测地距离的图像元素的子集,并且该图像元素的子集被标记为前景。 在另一示例中,将来自显示至少用户的图像,邻近用户的前景对象和背景的图像元素应用于经过训练的决策树,以获得表示这些项目之一的图像元素的概率,以及相应的 分类到图像元素的分类。 对于每个图像元素重复这一点。 分类为属于用户的图像元素被标记为前景,并且被分类为前景对象或背景的图像元素被标记为背景。
-
公开(公告)号:US20110228997A1
公开(公告)日:2011-09-22
申请号:US12725811
申请日:2010-03-17
Applicant: Toby Sharp , Antonio Criminisi , Khan Mohammad Siddiqui
Inventor: Toby Sharp , Antonio Criminisi , Khan Mohammad Siddiqui
IPC: G06K9/62
CPC classification number: G06T19/00 , G06T15/30 , G06T2200/24 , G06T2207/10081 , G06T2210/12 , G06T2219/004 , G06T2219/2012
Abstract: Medical image rendering is described. In an embodiment a medical image visualization engine receives results from an organ recognition system which provide estimated organ centers, bounding boxes and organ classification labels for a given medical image. In examples the visualization engine uses the organ recognition system results to select appropriate transfer functions, bounding regions, clipping planes and camera locations in order to optimally view an organ. For example, a rendering engine uses the selections to render a two-dimensional image of medical diagnostic quality with minimal user input. In an embodiment a graphical user interface populates a list of organs detected in a medical image and a clinician is able to select one organ and immediately be presented with the optimal view of that organ. In an example opacity of background regions of the medical image may be adjusted to provide context for organs presented in a foreground region.
Abstract translation: 描述医学图像呈现。 在一个实施例中,医学图像可视化引擎从提供给定医学图像的估计的器官中心,边界框和器官分类标签的器官识别系统接收结果。 在示例中,可视化引擎使用器官识别系统结果来选择适当的传递函数,边界区域,剪切平面和相机位置,以便最佳地观察器官。 例如,渲染引擎使用选择来以最小的用户输入呈现医学诊断质量的二维图像。 在一个实施例中,图形用户界面填充在医学图像中检测到的器官的列表,并且临床医生能够选择一个器官并且立即呈现该器官的最佳视图。 在示例性医学图像的背景区域的不透明度可以被调整以提供前景区域中呈现的器官的上下文。
-
公开(公告)号:US08411948B2
公开(公告)日:2013-04-02
申请号:US12718232
申请日:2010-03-05
Applicant: Carsten Curt Eckard Rother , Toby Sharp
Inventor: Carsten Curt Eckard Rother , Toby Sharp
Abstract: A method of up-sampling binary images for segmentation is described. In an embodiment, digital images are down-sampled before segmentation. The resulting initial binary segmentation, which has a lower resolution than the original image, is then up-sampled and smoothed to generate an interim non-binary solution which has a higher resolution than the initial binary segmentation. The final binary segmentation for the image is then computed from the interim non-binary solution based on a threshold. This method does not use the original image data in inferring the final binary segmentation solution from the initial binary segmentation. In an embodiment, the method may be applied to all images and in another embodiment, the method may be used for images which comprise a large number of pixels in total or in single dimension and smaller images may not be down-sampled before segmentation.
Abstract translation: 描述了用于分割的二进制图像的上采样方法。 在一个实施例中,在分割之前对数字图像进行下采样。 然后,所得到的具有比原始图像更低分辨率的初始二进制分割被上采样和平滑以产生具有比初始二进制分割更高分辨率的临时非二进制解。 然后基于阈值从临时非二进制解决方案计算图像的最终二进制分割。 该方法不使用原始图像数据从最初的二进制分割推断最终的二进制分割解决方案。 在一个实施例中,该方法可以应用于所有图像,并且在另一个实施例中,该方法可以用于总共或单维度中包含大量像素的图像,并且在分割之前可能不会对较小的图像进行下采样。
-
公开(公告)号:US20080178087A1
公开(公告)日:2008-07-24
申请号:US11625049
申请日:2007-01-19
Applicant: Andrew Fitzgibbon , Toby Sharp
Inventor: Andrew Fitzgibbon , Toby Sharp
CPC classification number: G06T13/20 , G06T19/20 , G06T2219/2016
Abstract: Using in-scene editing, an added title, or object, moves as the camera moves through the imaged scene. Previously this has been complex to achieve, requiring expert users to explicitly align 3D coordinate systems in the image sequence and on the added title or object. For example, this has been used to add 3D objects into live-action footage in big-budget movies or advertising. A simple, easy to use system is described for achieving in-scene editing. A user specifies projection constraints by making 2D actions on one or more images in the image sequence. A 3D motion trajectory is computed for a 3D object model on the basis of the specified projection constraints and a smoothness indicator. Using the computed trajectory the 3D object model is added to the image sequence. Projection constraints may be added, amended or deleted to position the 3D object model and/or to animate it.
Abstract translation: 使用场景编辑,添加的标题或对象,随着相机移动通过成像的场景而移动。 以前,这一点很复杂,要求专家用户明确地对齐图像序列中的3D坐标系和添加的标题或对象。 例如,这已被用于将3D对象添加到大型预算电影或广告中的实时影像中。 描述了一个简单易用的系统,用于实现现场编辑。 用户通过在图像序列中的一个或多个图像上进行2D动作来指定投影约束。 基于指定的投影约束和平滑指标,为3D对象模型计算3D运动轨迹。 使用计算的轨迹,将3D对象模型添加到图像序列中。 可以添加,修改或删除投影约束以定位3D对象模型和/或使其动画化。
-
公开(公告)号:US08605992B2
公开(公告)日:2013-12-10
申请号:US13083271
申请日:2011-04-08
Applicant: Pushmeet Kohli , Toby Sharp , Carsten Curt Eckard Rother
Inventor: Pushmeet Kohli , Toby Sharp , Carsten Curt Eckard Rother
CPC classification number: G06T5/005 , G06T2207/10004 , G06T2207/10024 , G06T2207/10028
Abstract: Image completion using scene geometry is described, for example, to remove marks from digital photographs or complete regions which are blank due to editing. In an embodiment an image depicting, from a viewpoint, a scene of textured objects has regions to be completed. In an example, geometry of the scene is estimated from a depth map and the geometry used to warp the image so that at least some surfaces depicted in the image are fronto-parallel to the viewpoint. An image completion process is guided using distortion applied during the warping. For example, patches used to fill the regions are selected on the basis of distortion introduced by the warping. In examples where the scene comprises regions having only planar surfaces the warping process comprises rotating the image. Where the scene comprises non-planar surfaces, geodesic distances between image elements may be scaled to flatten the non-planar surfaces.
Abstract translation: 描述使用场景几何的图像完成,例如,从数字照片或由于编辑而为空的完整区域中移除标记。 在一个实施例中,从视点描绘纹理对象的场景的图像具有要完成的区域。 在一个示例中,从深度图和用于扭曲图像的几何估计场景的几何形状,使得图像中描绘的至少一些表面与视点平行。 使用在翘曲期间施加的变形来指导图像完成处理。 例如,基于由翘曲引入的失真来选择用于填充区域的补丁。 在场景包括仅具有平面表面的区域的示例中,翘曲过程包括旋转图像。 在场景包括非平面表面的情况下,图像元素之间的测地距离可以被缩放以平坦化非平面表面。
-
公开(公告)号:US20110228055A1
公开(公告)日:2011-09-22
申请号:US12728921
申请日:2010-03-22
Applicant: Toby Sharp
Inventor: Toby Sharp
CPC classification number: G06T15/06 , G06T15/08 , H04N13/275
Abstract: Space skipping for multi-dimensional image rendering is described. In an embodiment a ray-casting engine is used to form a two dimensional image from an at least three dimensional image volume by computing rays extending from a camera location, through the two dimensional image and into the volume. For example, a space skipping logic is used to clip the rays such that computationally expensive aspects of ray-casting only need to be performed along the clipped rays. For example a volume pyramid is formed by repeatedly reducing the resolution of the volume data. In an example, each ray is intersected with the lowest resolution volume of the pyramid and clipped using data from that volume. In examples, the clipping is then repeated at higher resolutions in order to clip the ray closely to non-transparent voxels in the volume and optimize the task of rendering the image.
Abstract translation: 描述了用于多维图像渲染的空间跳过。 在一个实施例中,射线铸造引擎用于通过计算从相机位置延伸通过二维图像并进入体积的从至少三维图像体积形成二维图像。 例如,空间跳过逻辑用于剪切光线,使得射线投射的计算上昂贵的方面仅需要沿着剪切的射线执行。 例如,通过重复地降低体数据的分辨率来形成体积金字塔。 在一个示例中,每个射线与金字塔的最低分辨率体积相交,并使用来自该卷的数据进行剪切。 在示例中,然后以更高分辨率重复剪辑,以将光线紧密地剪切到体积中的不透明体素,并优化渲染图像的任务。
-
公开(公告)号:US20110216975A1
公开(公告)日:2011-09-08
申请号:US12718232
申请日:2010-03-05
Applicant: Carsten Curt Eckard Rother , Toby Sharp
Inventor: Carsten Curt Eckard Rother , Toby Sharp
Abstract: A method of up-sampling binary images for segmentation is described. In an embodiment, digital images are down-sampled before segmentation. The resulting initial binary segmentation, which has a lower resolution than the original image, is then up-sampled and smoothed to generate an interim non-binary solution which has a higher resolution than the initial binary segmentation. The final binary segmentation for the image is then computed from the interim non-binary solution based on a threshold. This method does not use the original image data in inferring the final binary segmentation solution from the initial binary segmentation. In an embodiment, the method may be applied to all images and in another embodiment, the method may be used for images which comprise a large number of pixels in total or in single dimension and smaller images may not be down-sampled before segmentation.
Abstract translation: 描述了用于分割的二进制图像的上采样方法。 在一个实施例中,在分割之前对数字图像进行下采样。 然后,所得到的具有比原始图像更低分辨率的初始二进制分割被上采样和平滑以产生具有比初始二进制分割更高分辨率的临时非二进制解。 然后基于阈值从临时非二进制解决方案计算图像的最终二进制分割。 该方法不使用原始图像数据从最初的二进制分割推断最终的二进制分割解决方案。 在一个实施例中,该方法可以应用于所有图像,并且在另一个实施例中,该方法可以用于总共或单维度中包含大量像素的图像,并且在分割之前可能不会对较小的图像进行下采样。
-
公开(公告)号:US09557836B2
公开(公告)日:2017-01-31
申请号:US13286966
申请日:2011-11-01
Applicant: Toby Sharp , Jamie Daniel Joseph Shotton
Inventor: Toby Sharp , Jamie Daniel Joseph Shotton
CPC classification number: G06F3/038 , A63F13/42 , A63F2300/1093 , A63F2300/6045 , G06F3/017 , G06F3/0304 , G06K9/00335 , G06K9/00362 , G06K9/00369 , G06K9/00375 , G06K9/6269 , G06T2207/10028 , G06T2207/30196 , H04N19/597
Abstract: Depth image compression is described for example, to enable body-part centers of players of a game to be detected in real time from depth images or for other applications such as augmented reality, and human-computer interaction. In an embodiment, depth images which have associated body-part probabilities, are compressed using probability mass which is related to the depth of an image element and a probability of a body part for the image element. In various examples, compression of the depth images using probability mass enables body part center detection, by clustering output elements, to be speeded up. In some examples, the scale of the compression is selected according to a depth of a foreground region and in some cases different scales are used for different image regions. In some examples, certainties of the body-part centers are calculated using probability masses of clustered image elements.
Abstract translation: 例如,深度图像压缩被描述为使得能够从深度图像或诸如增强现实和人机交互的其他应用实时地检测游戏的玩家的身体部位中心。 在一个实施例中,具有相关联的身体部位概率的深度图像使用与图像元素的深度和图像元素的身体部位的概率相关的概率质量进行压缩。 在各种示例中,使用概率质量压缩深度图像可以通过聚类输出元素来加快身体部位中心检测。 在一些示例中,根据前景区域的深度选择压缩的比例,并且在一些情况下,不同的比例尺用于不同的图像区域。 在一些示例中,使用聚类图像元素的概率质量来计算身体部位中心的确定性。
-
公开(公告)号:US08655069B2
公开(公告)日:2014-02-18
申请号:US12718343
申请日:2010-03-05
Applicant: Carsten Curt Eckard Rother , Toby Sharp , Andrew Blake , Vladimir Kolmogorov
Inventor: Carsten Curt Eckard Rother , Toby Sharp , Andrew Blake , Vladimir Kolmogorov
IPC: G06K9/34
CPC classification number: G06K9/34
Abstract: Methods of updating image segmentation following user input are described. In an embodiment, the properties used in computing the different portions of the image are updated as a result of one or more user inputs. Image elements which have been identified by a user input are given more weight when updating the properties than other image elements which have already been assigned to a particular portion of the image. In another embodiment, an updated segmentation is post-processed such that only regions which are connected to an appropriate user input are updated.
Abstract translation: 描述用户输入后更新图像分割的方法。 在一个实施例中,用于计算图像的不同部分的属性被更新为一个或多个用户输入的结果。 当已经被分配给图像的特定部分的其他图像元素更新时,由用户输入识别的图像元素被赋予更多的权重。 在另一个实施例中,更新的分段被后处理,使得仅更新连接到适当的用户输入的区域。
-
-
-
-
-
-
-
-
-