Patent search: ap:("Matthew T. Uyttendaele" OR "Richard S. Szeliski") AND inv:"Richard S. Szeliski" (page 1)

1.

Granted invention patent
Panoramic video (in force)

Publication (announcement) No.: US06788333B1

Publication (announcement) date: 2004-09-07

Application No.: US09611646

Filing date: 2000-07-07

Applicant: Matthew T. Uyttendaele, Richard S. Szeliski

Inventor: Matthew T. Uyttendaele, Richard S. Szeliski

IPC classification: H04N7/00

CPC classification: H04N5/23238, G03B37/04, G06T7/30, G06T7/80, G06T2200/32, H04N17/002

Abstract: A system and process for generating a panoramic video. Essentially, the panoramic video is created by first acquiring multiple videos of the scene being depicted. Preferably, these videos collectively depict a full 360 degree view of the surrounding scene and are captured using a multiple camera rig. The acquisition phase also includes a calibration procedure that provides information about the camera rig used to capture the videos; this information is used in the next phase to create the panoramic video. This next phase, referred to as the authoring phase, involves mosaicing or stitching individual frames of the videos, which were captured at approximately the same moment in time, to form each frame of the panoramic video. A series of texture maps is then constructed for each frame of the panoramic video. Each texture map coincides with a portion of a prescribed environment model of the scene. The texture map representations of each frame of the panoramic video are encoded so as to facilitate their transfer and viewing. This can include compressing the panoramic video frames. Such a procedure is useful in applications where the panoramic video is to be transferred over a network, such as the Internet.

2.

Granted invention patent
Automatic digital image grouping using criteria based on image metadata and spatial information (in force)

Publication (announcement) No.: US07580952B2

Publication (announcement) date: 2009-08-25

Application No.: US11069662

Filing date: 2005-02-28

Applicant: Ronald K. Logan, Richard S. Szeliski, Matthew T. Uyttendaele

Inventor: Ronald K. Logan, Richard S. Szeliski, Matthew T. Uyttendaele

IPC classification: G06F17/00

CPC classification: G06F17/30265, Y10S707/99942, Y10S707/99943, Y10S707/99945, Y10S707/99948, Y10S707/99953

Abstract: An automatic digital image grouping system and method for automatically generating groupings of related images based on criteria that include image metadata and spatial information. The system and method take an unordered and unorganized set of digital images and organize and group related images into image subsets. The criteria for defining an image subset vary and can be customized depending on the needs of the user. Metadata (such as EXIF tags) already embedded inside the images is used to extract likely image subsets. This metadata may include the temporal proximity of images, focal length, color overlap, and geographical location. The first component of the automatic image grouping system and method is a subset image stage that analyzes the metadata and generates potential image subsets containing related images. The second component is an overlap detection stage, where each potential image subset is analyzed and verified by examining pixels of the related images.
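
The metadata stage described in this abstract can be illustrated with a minimal Python sketch that uses only one of the listed cues, temporal proximity. The function name `group_by_time`, the 30-second gap threshold, and the sample capture times are assumptions made for illustration; the abstract does not state how the patent combines or thresholds its criteria, and the pixel-based overlap verification stage is not modeled here.

```python
from datetime import datetime, timedelta


def group_by_time(timestamps, max_gap=timedelta(seconds=30)):
    """Split capture times into candidate subsets.

    Hypothetical rule: a new subset starts whenever the gap to the
    previous shot exceeds max_gap (an assumed threshold).
    """
    groups = []
    for t in sorted(timestamps):
        if groups and t - groups[-1][-1] <= max_gap:
            groups[-1].append(t)   # close in time: same candidate subset
        else:
            groups.append([t])     # large gap: start a new subset
    return groups


# Illustrative capture times, as might be read from EXIF tags.
shots = [
    datetime(2005, 2, 28, 10, 0, 0),
    datetime(2005, 2, 28, 10, 0, 12),
    datetime(2005, 2, 28, 10, 0, 25),
    datetime(2005, 2, 28, 11, 30, 0),
]
candidate_subsets = group_by_time(shots)
```

Each resulting subset would then be a candidate for the second stage, which checks actual pixel overlap before accepting the grouping.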

3.

Granted invention patent
Deghosting panoramic video (in force)

Publication (announcement) No.: US06701030B1

Publication (announcement) date: 2004-03-02

Application No.: US09612830

Filing date: 2000-07-07

Applicant: Matthew T. Uyttendaele, Richard S. Szeliski

Inventor: Matthew T. Uyttendaele, Richard S. Szeliski

IPC classification: G06K9/36

CPC classification: G06K9/03, G06K2009/2045, G06T3/4038, H04N5/23238

Abstract: A system and process for deghosting mosaiced images created by stitching together images of a scene captured from different viewpoints is presented. When images captured by different cameras at different viewpoints are mosaiced, localized double images of objects can occur. Essentially, this double imaging or ghosting will occur if an object in the scene is close to the cameras capturing the images. However, this localized ghosting can be compensated for by estimating the amount of local mis-registration and then locally warping each image in the mosaiced image to reduce any ghosting.

4.

Granted invention patent
Bayesian demosaicing using a two-color image (in force)

Publication (announcement) No.: US07706609B2

Publication (announcement) date: 2010-04-27

Application No.: US11343581

Filing date: 2006-01-30

Applicant: Eric P. Bennett, Matthew T. Uyttendaele, Charles L. Zitnick, Sing Bing Kang, Richard S. Szeliski

Inventor: Eric P. Bennett, Matthew T. Uyttendaele, Charles L. Zitnick, Sing Bing Kang, Richard S. Szeliski

IPC classification: G06K9/00, G06K9/40, G06K9/32

CPC classification: H04N9/646, H04N1/58, H04N9/045

Abstract: A Bayesian two-color image demosaicer and method for processing a digital color image to demosaic the image in such a way as to reduce image artifacts. The method and system are an improvement on and an enhancement to previous demosaicing techniques. A preliminary demosaicing pass is performed on the image to assign each pixel a fully specified RGB triple color value. The final color value of each pixel in the processed image is restricted to be a linear combination of two colors. The fully specified RGB triple color values for each pixel in an image are used to find two clusters representing the two favored colors. The amount of contribution from these two favored colors to the final color value is then determined. The method and system can also process multiple images to improve the demosaicing results. When using multiple images, sampling can be performed at a finer resolution, known as super resolution.
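
The two-color restriction in this abstract can be illustrated with a short Python sketch. It assumes the two favored colors are already known and picks the mixing weight by a plain least-squares projection of the preliminary RGB value onto the line segment between them. That projection is a stand-in chosen for illustration: the patent determines the contribution with a Bayesian estimate and finds the two colors by clustering, neither of which is modeled here. The function names and sample colors are hypothetical.

```python
def blend_weight(pixel, color_a, color_b):
    """Weight alpha in [0, 1] such that alpha*color_a + (1-alpha)*color_b
    is the point on the segment closest to pixel (least squares)."""
    direction = [a - b for a, b in zip(color_a, color_b)]
    denom = sum(d * d for d in direction)
    if denom == 0:
        return 1.0  # both favored colors coincide; any weight works
    num = sum((p - b) * d for p, b, d in zip(pixel, color_b, direction))
    return min(1.0, max(0.0, num / denom))  # clamp to stay on the segment


def restrict_to_two_colors(pixel, color_a, color_b):
    """Replace pixel with a linear combination of the two favored colors."""
    alpha = blend_weight(pixel, color_a, color_b)
    return tuple(alpha * a + (1 - alpha) * b for a, b in zip(color_a, color_b))


# A preliminary RGB value with a green/blue fringe, near a red-to-black edge.
preliminary = (100, 60, 20)
final = restrict_to_two_colors(preliminary, (200, 0, 0), (0, 0, 0))
```

In this example the off-axis green and blue components are discarded because no mixture of the two favored colors contains them, which is how the restriction suppresses color fringing.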

5.

Granted invention patent
Method and system for combining multiple exposure images having scene and camera motion (in force)

Publication (announcement) No.: US07239805B2

Publication (announcement) date: 2007-07-03

Application No.: US11049597

Filing date: 2005-02-01

Applicant: Matthew T. Uyttendaele, Richard S. Szeliski, Ashley Michelle Eden

Inventor: Matthew T. Uyttendaele, Richard S. Szeliski, Ashley Michelle Eden

IPC classification: G03B7/00, G03B41/00, G06K9/32

CPC classification: G06T3/4038, G06T5/009, G06T5/50, G06T2207/20208

Abstract: A panoramic high-dynamic range (HDR) image method and system for combining multiple images having different exposures and at least partial spatial overlap, wherein each of the images may have scene motion, camera motion, or both. The major part of the panoramic HDR image method and system is a two-pass optimization-based approach that first defines the position of the objects in a scene and then fills in the dynamic range when possible and consistent. Data costs are created to encourage radiance values that are both consistent with object placement (defined by the first pass) and of a higher signal-to-noise ratio. Seam costs are used to ensure that transitions occur in regions of consistent radiances. The result is a high-quality panoramic HDR image having the full available spatial extent of the scene along with the full available exposure range.

6.

Granted invention patent
System and method deghosting mosaics using multiperspective plane sweep (lapsed)

Publication (announcement) No.: US07006709B2

Publication (announcement) date: 2006-02-28

Application No.: US10172859

Filing date: 2002-06-15

Applicant: Sing Bing Kang, Richard S. Szeliski, Matthew T. Uyttendaele

Inventor: Sing Bing Kang, Richard S. Szeliski, Matthew T. Uyttendaele

IPC classification: G06K9/32

CPC classification: G06T5/20, G06K9/32, G06K2009/2045, G06T3/00, G06T5/003, G06T7/30, G06T7/55, G06T15/503

Abstract: A system and method for deghosting mosaics provides a novel multiperspective plane sweep approach for generating an image mosaic from a sequence of still images, video images, scanned photographic images, computer generated images, etc. This multiperspective plane sweep approach uses virtual camera positions to compute depth maps for columns of overlapping pixels in adjacent images. Object distortions and ghosting caused by image parallax when generating the image mosaics are then minimized by blending pixel colors, or grey values, for each computed depth to create a common composite area for each of the overlapping images. Further, the multiperspective plane sweep approach described herein is computationally efficient and applicable both to the case of limited overlap between the images used for creating the image mosaics and to the case of extensive or increased image overlap.

7.

Granted invention patent
System and method for selective decoding and decompression (in force)

Publication (announcement) No.: US06859557B1

Publication (announcement) date: 2005-02-22

Application No.: US09611649

Filing date: 2000-07-07

Applicant: Matthew T. Uyttendaele, Richard S. Szeliski

Inventor: Matthew T. Uyttendaele, Richard S. Szeliski

IPC classification: G06K9/36, H04N7/26

CPC classification: H04N21/4325, H04N19/102, H04N19/162, H04N19/17, H04N19/174, H04N19/44, H04N19/46, H04N21/4143, H04N21/43615, H04N21/4728

Abstract: A system and process that segments the video frames, thereby allowing selective decoding and possible decompression of just those specific regions that are to be viewed. Each frame is segmented into a plurality of regions such that the segmented regions correspond from one frame to the next. Each segmented region is then encoded separately. Once the frames have been segmented and encoded (which may include compressing them), they can be transferred to the viewer. One way to transfer the files involves an interactive approach and a network connection. A viewing system identifies what portions of the scene the user wants to view and informs a server of the portions of the frame of the video that are needed to render the desired view of the scene to the user. The server then transfers only the requested portions of the next frame to the viewer. This process is repeated for each frame of the video. If an interactive approach is not possible, all the files associated with each segmented region of a panoramic frame are sent to the viewer. Once the files are received, the viewer selectively processes and decompresses (if necessary) only those segments required to display the desired portion of the scene. The segmented and encoded frames can also be stored on a storage medium that is accessible by the viewer. In this case, the viewer reads only those segments needed to render the desired view. Thus, here too the viewer selectively processes only those segments required to display the desired portion of the scene.
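
The region-selection step in this abstract can be sketched in Python under simplifying assumptions: each frame is divided into a uniform grid of rectangular tiles, the desired view is an axis-aligned rectangle in frame pixel coordinates, and horizontal wrap-around of the panorama is ignored. The function `tiles_for_view`, the grid layout, and the sample dimensions are hypothetical; the abstract does not say how the patent shapes its regions.

```python
def tiles_for_view(view, frame_size, grid):
    """Return the (row, col) indices of the tiles that a view touches.

    view       -- (x, y, width, height) of the desired view, in pixels
    frame_size -- (frame_width, frame_height) of the panoramic frame
    grid       -- (columns, rows) of the assumed uniform tiling
    """
    x, y, w, h = view
    frame_w, frame_h = frame_size
    cols, rows = grid
    tile_w = frame_w / cols
    tile_h = frame_h / rows
    # First and last tile index touched along each axis, clamped to the grid.
    first_col = max(0, int(x // tile_w))
    last_col = min(cols - 1, int((x + w - 1) // tile_w))
    first_row = max(0, int(y // tile_h))
    last_row = min(rows - 1, int((y + h - 1) // tile_h))
    return [(r, c)
            for r in range(first_row, last_row + 1)
            for c in range(first_col, last_col + 1)]


# A 640x480 view into a 4096x1024 frame split into 8x2 tiles of 512x512.
needed = tiles_for_view((900, 100, 640, 480), (4096, 1024), (8, 2))
```

Here only 6 of the 16 tiles would need to be requested from the server, or read and decompressed locally, for each frame.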

8.

Granted invention patent
System and method for exposure compensation (in force)

Publication (announcement) No.: US06813391B1

Publication (announcement) date: 2004-11-02

Application No.: US09611645

Filing date: 2000-07-07

Applicant: Matthew T. Uyttendaele, Richard S. Szeliski

Inventor: Matthew T. Uyttendaele, Richard S. Szeliski

IPC classification: G06K9/36

CPC classification: G06K9/03, G06K2009/2045, G06T5/50, G06T2207/10016, G06T2207/10144, H04N5/217, H04N5/23238

Abstract: A system and method for adjusting exposure in a mosaiced or stitched image. A stitched composite image is typically represented by a set of images and a set of associated transformations. Each transformation corresponds to one image in the input image sequence and represents the mapping between image pixels in each image and a three-dimensional coordinate system. Every triplet of images in the mosaiced image, having a first, center and third image, is input into the system and method according to the present invention. Before exposure adjustment can be performed, the regions of overlap between the input images in the mosaiced image are calculated. Once the areas of overlap associated with the first and third images are found, the areas of overlap associated with these images are warped to the coordinate frame of the center image. This allows the corresponding pixel locations in the first and center images, and the corresponding pixel locations in the third and center images, to be determined for the overlapping areas. Additionally, the luminance of each pixel in these areas is identified. The exposure correction factors for each overlapping area are then calculated. Once the exposure correction factors for each overlapping area are calculated, the luminance of all pixels in the center image is adjusted using these factors to yield a mosaiced image that appears to be consistent in exposure. This process is repeated until all sets of triplets have been considered.
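
The per-triplet adjustment in this abstract can be sketched in Python, with two loudly stated assumptions that do not come from the abstract: each correction factor is taken as the ratio of mean luminance in the neighbor's overlap to mean luminance in the center image's overlap, and the factor applied to a center pixel is linearly interpolated across columns between the left and right factors. The overlaps are assumed to be already warped into the center image's coordinate frame and are given as flat lists of luminance values. All function names and sample values are hypothetical.

```python
def mean(values):
    return sum(values) / len(values)


def correction_factor(center_overlap, neighbor_overlap):
    """Assumed rule: ratio of the neighbor's mean luminance to the
    center image's mean luminance over the same overlapping area."""
    return mean(neighbor_overlap) / mean(center_overlap)


def adjust_center(center_rows, left_factor, right_factor):
    """Scale every luminance in the center image by a factor that is
    linearly interpolated from the left edge to the right edge."""
    width = len(center_rows[0])
    adjusted = []
    for row in center_rows:
        new_row = []
        for col, lum in enumerate(row):
            t = col / (width - 1) if width > 1 else 0.0
            factor = (1 - t) * left_factor + t * right_factor
            new_row.append(lum * factor)
        adjusted.append(new_row)
    return adjusted


# First image is brighter than the center, third image is darker.
left = correction_factor(center_overlap=[100, 100], neighbor_overlap=[120, 120])
right = correction_factor(center_overlap=[100, 100], neighbor_overlap=[80, 80])
balanced = adjust_center([[100, 100, 100]], left, right)
```

With these inputs the center image is brightened toward its left neighbor and darkened toward its right neighbor, so the seams on both sides become less visible.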

9.

Granted invention patent
System and process for viewing panoramic video (in force)

Publication (announcement) No.: US06559846B1

Publication (announcement) date: 2003-05-06

Application No.: US09611987

Filing date: 2000-07-07

Applicant: Matthew T. Uyttendaele, Richard S. Szeliski

Inventor: Matthew T. Uyttendaele, Richard S. Szeliski

IPC classification: G06T17/00

CPC classification: H04N21/44012, G06T15/005, H04N7/17318, H04N21/21805, H04N21/4728

Abstract: The primary components of the panoramic video viewer include a decoder module. The purpose of the decoder module is to input incoming encoded panoramic video data and to output a decoded version thereof. The incoming data may be provided over a network and originate from a server, or it may simply be read from a storage medium, such as a hard drive, CD or DVD. Once decoded, the data associated with each video frame is preferably stored in a storage module and made available to a 3D rendering module. The 3D rendering module is essentially a texture mapper that takes the frame data and maps the desired views onto a prescribed environment model. The output of the 3D rendering module is provided to a display module where the panoramic video is viewed by a user of the system. Typically, the user will be viewing just a portion of the scene depicted in the panoramic video at any one time, and will be able to control what portion is viewed. Preferably, the panoramic video viewer will allow the user to pan through the scene to the left, right, up or down. In addition, the user would preferably be able to zoom in or out within the portion of the scene being viewed. The user could also be allowed to select what video should be played, choose when to play or pause the video, and specify what temporal part of the video should be played.

10.

Granted invention patent
Analysis of images located within three-dimensional environments (in force)

Publication (announcement) No.: US09122368B2

Publication (announcement) date: 2015-09-01

Application No.: US12722280

Filing date: 2010-03-11

Applicant: Richard S. Szeliski, Blaise H. Aguera y Arcas, Tomasz S. M. Kasperkiewicz

Inventor: Richard S. Szeliski, Blaise H. Aguera y Arcas, Tomasz S. M. Kasperkiewicz

IPC classification: G06F17/00, G06F3/0481, G06F17/30, G06T19/00

CPC classification: G06F3/04815, G06F17/30274, G06T19/003

Abstract: Images are analyzed within a 3D environment that is generated based on spatial relationships of the images and that allows users to experience the images in the 3D environment. Image analysis may include ranking images based on user viewing information, such as the number of users who have viewed an image and how long an image was viewed. Image analysis may further include analyzing the spatial density of images within a 3D environment to determine points of user interest.
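
The ranking idea in this abstract can be illustrated with a minimal Python sketch. It assumes a log of `(image_id, user_id, seconds_viewed)` events and orders images first by the number of distinct viewers and then by total viewing time. The event format, the function name `rank_images`, and the tie-breaking order are assumptions for illustration; the abstract names the two signals but not how they are weighted.

```python
from collections import defaultdict


def rank_images(view_events):
    """Order image ids by distinct viewer count, then total seconds viewed.

    view_events -- iterable of (image_id, user_id, seconds_viewed) tuples
                   (a hypothetical log format).
    """
    viewers = defaultdict(set)
    seconds = defaultdict(float)
    for image_id, user_id, secs in view_events:
        viewers[image_id].add(user_id)   # count each user once per image
        seconds[image_id] += secs        # accumulate viewing time
    return sorted(viewers,
                  key=lambda i: (len(viewers[i]), seconds[i]),
                  reverse=True)


events = [
    ("a", "u1", 5), ("a", "u2", 3),
    ("b", "u1", 20),
    ("c", "u1", 2), ("c", "u3", 9),
]
ranking = rank_images(events)
```

Images "a" and "c" each have two distinct viewers, so total viewing time decides between them, and "b" ranks last despite its long single viewing.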
