Patent search: ap:("Richard S. Szeliski" OR "Sing Bing Kang" OR "Ce Liu" OR "Charles L. Zitnick") AND inv:"Sing Bing Kang" (page 4)

31.

Granted invention patent
Self-calibration for a catadioptric camera (in force)

Publication No.: US06870563B1

Publication date: 2005-03-22

Application No.: US09591781

Filing date: 2000-06-12

Applicant: Sing Bing Kang

Inventor: Sing Bing Kang

IPC classification: G06T5/00, G06T7/00, H04N5/225, H04N5/228, H04N5/262

CPC classification: H04N5/2628, G06T5/006, G06T7/80

Abstract: A method and a system for self-calibrating a wide field-of-view camera (such as a catadioptric camera) using a sequence of omni-directional images of a scene obtained from the camera. The present invention uses the consistency of pairwise features tracked across at least a portion of the image collection and uses these tracked features to determine unknown calibration parameters based on the characteristics of catadioptric imaging. More specifically, the self-calibration method of the present invention generates a sequence of omni-directional images representing a scene and tracks features across the image sequence. An objective function is defined in terms of the tracked features and an error metric (an image-based error metric in a preferred embodiment). The catadioptric imaging characteristics are defined by calibration parameters, and determination of optimal calibration parameters is accomplished by minimizing the objective function using an optimizing technique. Moreover, the present invention also includes a technique for reformulating a projection equation such that the projection equation is equivalent to that of a rectilinear perspective camera. This technique allows analyses (such as structure from motion) to be applied (subsequent to calibration of the catadioptric camera) in the same direct manner as for rectilinear image sequences.

32.

Granted invention patent
Depth painting for 3-D rendering applications (lapsed)

Publication No.: US06417850B1

Publication date: 2002-07-09

Application No.: US09238250

Filing date: 1999-01-27

Applicant: Sing Bing Kang

Inventor: Sing Bing Kang

IPC classification: G06T15/40

CPC classification: G06T15/205

Abstract: A 3-D effect is added to a single image by adding depth to the single image. Depth can be added to the single image by selecting an arbitrary region or a number of pixels. A user interface simultaneously displays the single image and novel views of the single original image taken from virtual camera positions rotated relative to the original field of view. Depths given to the original image allow pixels to be reprojected onto the novel views to allow the user to observe the depth changes as they are being added. Functions are provided to edit gaps or voids generated in the process of adding depth to the single image. The gaps occur because of depth discontinuities between regions to which depth has been added, and the voids are due to the uncovering of previously occluded surfaces in the original image.
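
The reprojection step described in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes a simple pinhole camera, and the function name `reproject`, the parameters `focal`, `cx`, `cy`, and the choice of rotating the scene about a vertical axis through a pivot at distance `pivot_z` are all hypothetical, introduced only for illustration.

```python
import math

def reproject(u, v, depth, focal, cx, cy, yaw, pivot_z):
    """Reproject pixel (u, v), painted with `depth`, into a novel view.

    Assumption: the novel view is obtained by rotating the scene by `yaw`
    radians about the vertical axis through the pivot (0, 0, pivot_z),
    which is equivalent to orbiting the virtual camera the opposite way.
    Returns (u', v'), or None if the point ends up behind the camera.
    """
    # Back-project the pixel to a 3-D point in the original camera frame.
    x = (u - cx) * depth / focal
    y = (v - cy) * depth / focal
    # Rotate the point about the vertical axis through the pivot.
    c, s = math.cos(yaw), math.sin(yaw)
    zc = depth - pivot_z
    xr = c * x - s * zc
    zr = s * x + c * zc + pivot_z
    if zr <= 0:
        return None
    # Project the rotated point with the same pinhole model.
    return (focal * xr / zr + cx, focal * y / zr + cy)
```

Pixels at the pivot depth stay put, while pixels painted nearer or farther shift in opposite directions, which is what lets a user see the effect of a depth edit in the novel view. Neighbouring pixels with very different depths land far apart, producing the gaps the abstract mentions.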

33.

Granted invention patent
Combined spline and block based motion estimation for coding a sequence of video images (lapsed)

Publication No.: US06295377B1

Publication date: 2001-09-25

Application No.: US09114684

Filing date: 1998-07-13

Applicants: Frederic Dufaux; Sing Bing Kang

Inventors: Frederic Dufaux; Sing Bing Kang

IPC classification: G06K9/36

CPC classification: G06T9/004, G06T7/238, G06T2207/10016

Abstract: In a coder for producing a bitstream representative of a sequence of video images, a previous image is registered with a current image using spline-based registration to produce estimated motion vectors. The estimated motion vectors are used to match blocks of the previous image and the current image to produce translation vectors. The translation vectors compensate for motion while encoding the sequence as a bitstream.
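
The second stage of this abstract, using an estimated motion vector to seed block matching, might look roughly like the sketch below. It assumes the spline-based registration has already produced a per-block estimate (passed in as `seed`); the function names, the sum-of-absolute-differences cost, and the small search `radius` are illustrative assumptions, not details taken from the patent.

```python
def sad(prev, cur, bx, by, dx, dy, size):
    """Sum of absolute differences between the block of `cur` at (bx, by)
    and the block of `prev` displaced by (dx, dy)."""
    total = 0
    for y in range(size):
        for x in range(size):
            total += abs(cur[by + y][bx + x] - prev[by + y + dy][bx + x + dx])
    return total

def refine_vector(prev, cur, bx, by, size, seed, radius=1):
    """Search a small window around `seed` (an estimated motion vector)
    and return the best translation vector and its cost.

    The vector points from the current block to its match in `prev`.
    """
    height, width = len(prev), len(prev[0])
    best, best_cost = None, None
    for dy in range(seed[1] - radius, seed[1] + radius + 1):
        for dx in range(seed[0] - radius, seed[0] + radius + 1):
            # Skip candidates whose block would leave the previous image.
            if bx + dx < 0 or by + dy < 0:
                continue
            if bx + dx + size > width or by + dy + size > height:
                continue
            cost = sad(prev, cur, bx, by, dx, dy, size)
            if best_cost is None or cost < best_cost:
                best, best_cost = (dx, dy), cost
    return best, best_cost
```

Because the search starts from the registration estimate, the window can stay small; a plain block matcher without a seed would need a much larger search range to find the same displacement.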

34.

Granted invention patent
Multi-layer image-based rendering for video synthesis (lapsed)

Publication No.: US06266068B1

Publication date: 2001-07-24

Application No.: US09039022

Filing date: 1998-03-13

Applicants: Sing Bing Kang; James M. Rehg

Inventors: Sing Bing Kang; James M. Rehg

IPC classification: G06T11/60

CPC classification: G06T11/60

Abstract: A computerized method and related computer system synthesize video from a plurality of sources of image data. The sources include a variety of image data types such as a collection of image stills, a sequence of video frames, and 3-D models of objects. Each source provides image data associated with an object. One source provides image data associated with a first object, and a second source provides image data associated with a second object. The image data of the first and second objects are combined to generate composite images of the first and second objects. From the composite images, an output image of the first and second objects as viewed from an arbitrary viewpoint is generated. Gaps of pixels with unspecified pixel values may appear in the output image. Accordingly, a pixel value for each of these “missing pixels” is obtained by using an epipolar search process to determine which one of the sources of image data should provide the pixel value for that missing pixel.

35.

Granted invention patent
Method for reconstructing a three-dimensional object from a closed-loop sequence of images taken by an uncalibrated camera (lapsed)

Publication No.: US6061468A

Publication date: 2000-05-09

Application No.: US901391

Filing date: 1997-07-28

Applicant: Sing Bing Kang

Inventor: Sing Bing Kang

IPC classification: G06T1/00, G06T7/20, G06T17/40, G06K9/00

CPC classification: G06T7/0071, G06T2207/10016

Abstract: In a computerized method, the three-dimensional structure of an object is recovered from a closed-loop sequence of two-dimensional images taken by a camera undergoing some arbitrary motion. In one type of motion, the camera is held fixed while the object completes a full 360° rotation about an arbitrary axis. Alternatively, the camera can make a complete rotation about the object. In the sequence of images, feature tracking points are selected using pair-wise image registration. Ellipses are fitted to the feature tracking points to estimate the tilt of the axis of rotation. A set of variables is set to fixed values while an image-based objective function is minimized to extract a first set of structure and motion parameters. The set of variables is then freed while minimization of the objective function continues, extracting a second set of structure and motion parameters that is substantially the same as the first set.
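
To make the ellipse-to-tilt idea in this abstract concrete, here is a rough sketch under strong simplifying assumptions: orthographic projection, and a feature tracked at evenly spaced angles over one full rotation. Rather than a true ellipse fit, it uses the second moments of the track, whose eigenvalues are proportional to the squared semi-axes in that setting. The name `estimate_tilt` and the definition of tilt as the angle between the rotation axis and the image plane are assumptions made for this example, not the patent's procedure.

```python
import math

def estimate_tilt(points):
    """Estimate the tilt (radians) of the rotation axis out of the image
    plane from one feature track, given as a list of (x, y) positions.

    Assumes orthographic projection and even sampling of a full turn, so
    the track is an ellipse with minor/major axis ratio equal to sin(tilt).
    """
    n = len(points)
    mx = sum(p[0] for p in points) / n
    my = sum(p[1] for p in points) / n
    # Second central moments of the track.
    sxx = sum((p[0] - mx) ** 2 for p in points) / n
    syy = sum((p[1] - my) ** 2 for p in points) / n
    sxy = sum((p[0] - mx) * (p[1] - my) for p in points) / n
    # Eigenvalues of the 2x2 covariance matrix (closed form).
    mean = (sxx + syy) / 2.0
    dev = math.hypot((sxx - syy) / 2.0, sxy)
    major = mean + dev
    minor = max(mean - dev, 0.0)
    if major == 0.0:
        raise ValueError("track has no spatial extent")
    # Eigenvalues scale with the squared semi-axes, hence the square root.
    return math.asin(math.sqrt(minor / major))
```

A tilt of zero corresponds to a track that collapses to a line (axis parallel to the image plane), and a tilt of 90° to a circular track (axis pointing at the camera). With perspective projection or partial tracks, a proper ellipse fit would be needed instead of this moment-based shortcut.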

36.

Granted invention patent
Compressing and decompressing multiple, layered, video streams employing multi-directional spatial encoding (in force)

Publication No.: US08774274B2

Publication date: 2014-07-08

Application No.: US13348262

Filing date: 2012-01-11

Applicants: Simon Winder; Matthew T. Uyttendaele; Charles Lawrence Zitnick, III; Richard Szeliski; Sing Bing Kang

Inventors: Simon Winder; Matthew T. Uyttendaele; Charles Lawrence Zitnick, III; Richard Szeliski; Sing Bing Kang

IPC classification: H04N7/12

CPC classification: H04N21/4347, H04N13/111, H04N19/109, H04N19/39, H04N19/593, H04N19/597, H04N19/61, H04N19/70, H04N19/96, H04N21/2365

Abstract: A process for compressing and decompressing non-keyframes in sequential sets of contemporaneous video frames making up multiple video streams where the video frames in a set depict substantially the same scene from different viewpoints. Each set of contemporaneous video frames has a plurality of frames designated as keyframes with the remaining being non-keyframes. In one embodiment, the non-keyframes are compressed using a multi-directional spatial prediction technique. In another embodiment, the non-keyframes of each set of contemporaneous video frames are compressed using a combined chaining and spatial prediction compression technique. The spatial prediction compression technique employed can be a single direction technique where just one reference frame, and so one chain, is used to predict each non-keyframe, or it can be a multi-directional technique where two or more reference frames, and so chains, are used to predict each non-keyframe.

37.

Granted invention patent
Stereo movie editing (in force)

Publication No.: US08330802B2

Publication date: 2012-12-11

Application No.: US12331419

Filing date: 2008-12-09

Applicants: Sanjeev J. Koppal; Sing Bing Kang; Charles Lawrence Zitnick, III; Michael F. Cohen; Bryan Kent Ressler

Inventors: Sanjeev J. Koppal; Sing Bing Kang; Charles Lawrence Zitnick, III; Michael F. Cohen; Bryan Kent Ressler

IPC classification: H04N13/02

CPC classification: H04N13/10

Abstract: The stereo movie editing technique described herein combines knowledge of both multi-view stereo algorithms and human depth perception. The technique creates a digital editor, specifically for stereographic cinema. The technique employs an interface that allows intuitive manipulation of the different parameters in a stereo movie setup, such as camera locations and screen position. Using the technique, it is possible to reduce or enhance well-known stereo movie effects such as cardboarding and miniaturization. The technique also provides new editing techniques such as directing the user's attention and easier transitions between scenes.

38.

Invention patent application
NOISE SUPPRESSION IN LOW LIGHT IMAGES (in force)

Publication No.: US20120224789A1

Publication date: 2012-09-06

Application No.: US13037366

Filing date: 2011-03-01

Applicants: Priyam Chatterjee; Neel Joshi; Sing Bing Kang; Yasuyuki Matsushita

Inventors: Priyam Chatterjee; Neel Joshi; Sing Bing Kang; Yasuyuki Matsushita

IPC classification: G06K9/40

CPC classification: G06T5/002, G06T3/4015

Abstract: A low light noise reduction mechanism may perform denoising prior to demosaicing, and may also use parameters determined during the denoising operation for performing demosaicing. The denoising operation may attempt to find several patches of an image that are similar to a first patch, and use a weighted average based on similarity to determine an appropriate value for denoising a raw digital image. The same weighted average and similar patches may be used for demosaicing the same image after the denoising operation.
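
A similarity-weighted average over patches, as this abstract describes in general terms, can be sketched as follows. This is a generic non-local-means-style illustration on a single-channel grid, not the mechanism claimed in the application: the Gaussian weighting, the parameters `r`, `search`, and `h`, and the function names are all assumptions, and the Bayer colour-filter layout of a real raw image is ignored.

```python
import math

def patch(img, cy, cx, r):
    """Flatten the (2r+1) x (2r+1) patch centred on (cy, cx)."""
    return [img[y][x]
            for y in range(cy - r, cy + r + 1)
            for x in range(cx - r, cx + r + 1)]

def denoise_pixel(img, cy, cx, r=1, search=2, h=10.0):
    """Denoise one pixel as a similarity-weighted average of the centres
    of nearby patches. (cy, cx) must lie at least r pixels from the border.
    """
    height, width = len(img), len(img[0])
    ref = patch(img, cy, cx, r)
    num = den = 0.0
    for y in range(max(r, cy - search), min(height - r, cy + search + 1)):
        for x in range(max(r, cx - search), min(width - r, cx + search + 1)):
            cand = patch(img, y, x, r)
            # Mean squared difference between the two patches.
            d2 = sum((a - b) ** 2 for a, b in zip(ref, cand)) / len(ref)
            # More similar patches receive larger weights.
            w = math.exp(-d2 / (h * h))
            num += w * img[y][x]
            den += w
    return num / den
```

The abstract notes that the same similar patches and weights may be reused for demosaicing; in a sketch like this, that would mean returning the per-candidate weights alongside the denoised value instead of discarding them.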

39.

Granted invention patent
Video completion by motion field transfer (in force)

Publication No.: US08243805B2

Publication date: 2012-08-14

Application No.: US11559861

Filing date: 2006-11-14

Applicants: Yasuyuki Matsushita; Sing Bing Kang; Xiaoou Tang; Takaaki Shiratori

Inventors: Yasuyuki Matsushita; Sing Bing Kang; Xiaoou Tang; Takaaki Shiratori

IPC classification: H04N7/12

CPC classification: G06T5/005, G06T5/20, G06T5/50, G06T2207/10016, G06T2207/20016

Abstract: Systems and methods for video completion by motion field transfer are described. In one aspect, a spatio-temporal target patch of an input video data sequence is filled in or replaced by motion field transfer from a spatio-temporal source patch of the input video data sequence. Color is propagated to corresponding portions of the spatio-temporal target patch by treating the transferred motion information as directed edges. These motion field transfer and color propagation operations result in a video completed spatio-temporal target patch. The systems and methods present the video data sequence, which now includes the video completed spatio-temporal target patch, to a user for viewing.

40.

Invention patent application
ESTABLISHING CLUSTERS OF USER PREFERENCES FOR IMAGE ENHANCEMENT (in force)

Publication No.: US20120155759A1

Publication date: 2012-06-21

Application No.: US12973922

Filing date: 2010-12-21

Applicants: Sing Bing Kang; Ashish Kapoor; Juan Caicedo Rueda

Inventors: Sing Bing Kang; Ashish Kapoor; Juan Caicedo Rueda

IPC classification: G06K9/68, G06K9/00

CPC classification: G06T5/00, G06T2207/20092

Abstract: An image enhancement system may match images to a matrix having various enhancements of images for groups of users. The matrix may define image enhancement settings for the particular images and groups of users, and the matching may apply enhancements to a new image that closely matches a user's preferences. After the matrix is initially populated, new users and new images may be added to increase the matrix's accuracy. The image enhancement system may be deployed as a cloud service, where images may be enhanced as a standalone application or as part of a social network or image sharing website. In some embodiments, the image enhancement system may be deployed on a personal computer or as a component of an image capture device.
