Method and apparatus for recovering a three-dimensional scene from two-dimensional images
    1.
    发明授权
    Method and apparatus for recovering a three-dimensional scene from two-dimensional images 有权
    从二维图像中恢复三维场景的方法和装置

    公开(公告)号:US07352386B1

    公开(公告)日:2008-04-01

    申请号:US09338176

    申请日:1999-06-22

    IPC分类号: H04N13/00

    摘要: A method and apparatus for recovering a three-dimensional (3D) scene from two-dimensional (2D) images. A sequence of images is divided into a number of smaller segments and a 3D reconstruction is performed on each segment individually. All the reconstructed segments are then combined together through an efficient bundle adjustment to complete the 3D reconstruction. Segmenting may be achieved by dividing the segments based on the number of feature points that are in each frame. The number of frames per segment is reduced by creating virtual key frames. The virtual key frames encode the 3D structure for each segment, but are only a small subset of the original frames in the segment. A final bundle adjustment is performed on the virtual key frames, rather than all of the original frames. Thus, the final bundle adjustment is two orders of magnitude faster than a conventional bundle adjustment.

    摘要翻译: 一种用于从二维(2D)图像中恢复三维(3D)场景的方法和装置。 图像序列被分成多个较小的段,并且对每个段单独执行3D重建。 然后通过有效的束调整将所有重建的段组合在一起以完成3D重建。 可以通过基于每个帧中的特征点的数量来划分段来实现分段。 通过创建虚拟键帧来减少每个段的帧数。 虚拟关键帧对每个段的3D结构进行编码,但只是该段中原始帧的一小部分。 在虚拟关键帧上执行最终的捆绑调整,而不是所有的原始帧。 因此,最终的捆绑调整比常规捆绑调整快两个数量级。

    Rendering with concentric mosaics
    2.
    发明授权
    Rendering with concentric mosaics 有权
    呈现同心马赛克

    公开(公告)号:US06750860B1

    公开(公告)日:2004-06-15

    申请号:US09309753

    申请日:1999-05-11

    IPC分类号: G06T1500

    CPC分类号: G06T3/4007 G06T3/4038

    摘要: An image based system and process for rendering novel views of a real or synthesized 3D scene based on a series of concentric mosaics depicting the scene. In one embodiment, each concentric mosaic represents a collection of consecutive slit images of the surrounding 3D scene taken from a different viewpoint tangent to a circle on a plane within the scene. Novel views from viewpoints within circular regions of the aforementioned circle plane defined by the concentric mosaics are rendered using these concentric mosaics. Specifically, a slit image can be identified by a ray originating at its viewpoint on the circle plane and extending toward the longitudinal midline of the slit image. Each of the rays associated with the slit images needed to construct a novel view will either coincide with one of the rays associated with a previously captured slit image, or it will pass between two of the concentric circles on the circle plane. If it coincides, then the previously captured slit image associated with the coinciding ray can be used directly to construct part of the novel view. If the ray passes between two of the concentric circles of the plane, then the needed slit image is interpolated using the two previously captured slit images associated with the rays originating from the adjacent concentric circles that are parallel to the non-coinciding ray. If the objects in the 3D scene are close to the camera, depth correction is applied to reduce image distortion for pixels located above and below the circle plane. In another embodiment, a single camera is used to capture a sequence of images. Each image includes image data that has a ray direction associated therewith. To render an image at a novel viewpoint, multiple ray directions from the novel viewpoint are chosen. Image data is combined from the sequence of images by selecting image data that has a ray direction substantially aligning with the ray direction from the novel viewpoint.

    摘要翻译: 基于图像的系统和过程,用于基于描绘场景的一系列同心的马赛克来呈现真实或合成的3D场景的新颖视图。 在一个实施例中,每个同心马赛克表示从与场景内的平面上的圆的切线相切的不同视点拍摄的围绕3D场景的连续狭缝图像的集合。 由同心马赛克定义的上述圆形平面的圆形区域内的观点的新观点使用这些同心马赛克。 具体地说,狭缝图像可以通过其在圆平面上的视点产生的射线来识别,并且朝向狭缝图像的纵向中线延伸。 与构建新视图所需的狭缝图像相关联的每个光线将与与先前捕获的狭缝图像相关联的一条光线重合,或者它将在圆平面上的两个同心圆之间通过。 如果它一致,则可以直接使用与重合光线相关联的先前捕获的狭缝图像来构造新颖视图的一部分。 如果光线穿过平面的两个同心圆之间,则使用与源于与不一致的光线平行的相邻同心圆的光线相关联的两个先前捕获的狭缝图像来插值所需的狭缝图像。 如果3D场景中的对象靠近相机,则应用深度校正来减少位于圆平面上方和下方的像素的图像失真。 在另一个实施例中,单个照相机用于捕获一系列图像。 每个图像包括具有与其相关联的射线方向的图像数据。 为了以新的观点呈现图像,从新观点选择多个射线方向。 从新颖的视点,通过选择具有与射线方向基本对准的射线方向的图像数据,从图像序列组合图像数据。

    Scalable face image retrieval
    3.
    发明授权
    Scalable face image retrieval 有权
    可扩展的面部图像检索

    公开(公告)号:US08498455B2

    公开(公告)日:2013-07-30

    申请号:US12792750

    申请日:2010-06-03

    IPC分类号: G06K9/00

    CPC分类号: G06K9/4676 G06K9/00281

    摘要: A system for identifying individuals in digital images and for providing matching digital images is provided. A set of images that include faces of known individuals is received. Faces are detected in the images and facial components are identified in each face. Visual words corresponding to the facial components are generated, stored, and associated with identifiers of the individuals. At a later time, a user may provide an image that includes the face of one of the known individuals. Visual words are determined from the face of the individual in the provided image and matched against the stored visual words. Images associated with matching visual words are ranked and presented to the user.

    摘要翻译: 提供了一种用于识别数字图像中的个体并提供匹配的数字图像的系统。 接收包括已知个人面孔的一组图像。 在图像中检测到面部,并且在每个面部中识别面部组件。 对应于面部组件的视觉词被生成,存储并与个人的标识符相关联。 在稍后的时间,用户可以提供包括已知个体之一的脸部的图像。 视觉词是从提供的图像中的个体的脸部确定的,并且与存储的视觉词匹配。 与匹配的视觉词相关联的图像被排序并呈现给用户。

    SCALABLE FACE IMAGE RETRIEVAL
    4.
    发明申请
    SCALABLE FACE IMAGE RETRIEVAL 有权
    可扩展的脸部图像检索

    公开(公告)号:US20110299743A1

    公开(公告)日:2011-12-08

    申请号:US12792750

    申请日:2010-06-03

    IPC分类号: G06T7/00

    CPC分类号: G06K9/4676 G06K9/00281

    摘要: A system for identifying individuals in digital images and for providing matching digital images is provided. A set of images that include faces of known individuals is received. Faces are detected in the images and facial components are identified in each face. Visual words corresponding to the facial components are generated, stored, and associated with identifiers of the individuals. At a later time, a user may provide an image that includes the face of one of the known individuals. Visual words are determined from the face of the individual in the provided image and matched against the stored visual words. Images associated with matching visual words are ranked and presented to the user.

    摘要翻译: 提供了一种用于识别数字图像中的个体并提供匹配的数字图像的系统。 接收包括已知个人面孔的一组图像。 在图像中检测到面部,并且在每个面部中识别面部组件。 对应于面部组件的视觉词被生成,存储并与个人的标识符相关联。 在稍后的时间,用户可以提供包括已知个体之一的脸部的图像。 视觉词是从提供的图像中的个体的脸部确定的,并且与存储的视觉词匹配。 与匹配视觉词相关联的图像被排序并呈现给用户。

    Vouching for user account using social networking relationship
    5.
    发明授权
    Vouching for user account using social networking relationship 有权
    使用社交网络关系为用户帐户提供支持

    公开(公告)号:US08745738B2

    公开(公告)日:2014-06-03

    申请号:US13350806

    申请日:2012-01-15

    摘要: Trusted user accounts of an application provider are determined. Graphs, such as trees, are created with each node corresponding to a trusted account. Each of the nodes is associated with a vouching quota, or the nodes may share a vouching quota. Untrusted user accounts are determined. For each of these untrusted accounts, a trusted user account that has a social networking relationship is determined. If the node corresponding to the trusted user account has enough vouching quota to vouch for the untrusted user account, then the quota is debited, a node is added for the untrusted user account to the graph, and the untrusted user account is vouched for. If not, available vouching quota may be borrowed from other nodes in the graph.

    摘要翻译: 确定应用程序提供程序的可信用户帐户。 使用与受信任帐户相对应的每个节点来创建诸如树之类的图形。 每个节点都与一个备份配额相关联,或者节点可以共享一个备份配额。 不信任的用户帐户被确定。 对于每个这些不受信任的帐户,确定具有社交网络关系的可信用户帐户。 如果与受信任用户帐户相对应的节点具有足够的备用配额来保证不受信任的用户帐户,则会将配额扣除,为图中不可信任的用户帐户添加一个节点,并为不受信任的用户帐户进行验证。 如果不是,可以从图中的其他节点借用可用的支票配额。

    OPTIMIZING DATA PARTITIONING FOR DATA-PARALLEL COMPUTING
    6.
    发明申请
    OPTIMIZING DATA PARTITIONING FOR DATA-PARALLEL COMPUTING 有权
    优化用于数据并行计算的数据分区

    公开(公告)号:US20130152057A1

    公开(公告)日:2013-06-13

    申请号:US13325049

    申请日:2011-12-13

    IPC分类号: G06F9/44

    CPC分类号: G06F8/453

    摘要: A data partitioning plan is automatically generated that—given a data-parallel program and a large input dataset, and without having to first run the program on the input dataset—substantially optimizes performance of the distributed execution system that explicitly measures and infers various properties of both data and computation to perform cost estimation and optimization. Estimation may comprise inferring the cost of a candidate data partitioning plan, and optimization may comprise generating an optimal partitioning plan based on the estimated costs of computation and input/output.

    摘要翻译: 自动生成数据分区计划,给定数据并行程序和大型输入数据集,无需首先在输入数据集上运行程序,从而大大优化了分布式执行系统的性能,从而明确地测量和推断出 数据和计算都要进行成本估算和优化。 估计可以包括推断候选数据分割计划的成本,并且优化可以包括基于计算和输入/输出的估计成本来生成最优分割计划。

    Partition min-hash for partial-duplicate image determination
    7.
    发明授权
    Partition min-hash for partial-duplicate image determination 有权
    部分重复图像确定的分区最小散列

    公开(公告)号:US08452106B2

    公开(公告)日:2013-05-28

    申请号:US12729250

    申请日:2010-03-23

    IPC分类号: G06K9/66

    CPC分类号: G06K9/6202 G06K9/4642

    摘要: Images in a database or collection of images are each divided into multiple partitions with each partition corresponding to an area of an image. The partitions in an image may overlap with each other. Min-hash sketches are generated for each of the partitions and stored with the images. A user may submit an image and request that an image that is a partial match for the submitted image be located in the image collection. The submitted image is similarly divided into partitions and min-hash sketches are generated from the partitions. The min-hash sketches are compared with the stored min-hash sketches for matches, and images having partitions whose sketches are matches are returned as partial matching images.

    摘要翻译: 数据库或图像集合中的图像被分成多个分区,每个分区对应于图像的区域。 图像中的分区可能会彼此重叠。 为每个分区生成最小散列草图,并与图像一起存储。 用户可以提交图像并请求作为所提交图像的部分匹配的图像位于图像集合中。 提交的图像类似地划分为分区,并且从分区生成最小哈希草图。 将最小哈希草图与存储的最小哈希草图进行比较,并将具有其草图匹配的分区的图像作为部分匹配图像返回。

    User interface for three-dimensional navigation
    8.
    发明授权
    User interface for three-dimensional navigation 有权
    三维导航用户界面

    公开(公告)号:US08276088B2

    公开(公告)日:2012-09-25

    申请号:US11827530

    申请日:2007-07-11

    IPC分类号: G06F3/048

    摘要: The present invention uses invisible junctions which are a set of local features unique to every page of the electronic document to match the captured image to a part of an electronic document. The present invention includes: an image capture device, a feature extraction and recognition system and database. When an electronic document is printed, the feature extraction and recognition system captures an image of the document page. The features in the captured image are then extracted, indexed and stored in the database. Given a query image, usually a small patch of some document page captured by a low resolution image capture device, the features in the query image are extracted and compared against those stored in the database to identify the query image. The present invention also includes methods for recognizing and tracking the viewing region and look at point corresponding to the input query image. This information is combined with a rendering of the original input document to generate a new graphical user interface to the user. This user interface can be displayed on a conventional browser or even on the display of an image capture device.

    摘要翻译: 本发明使用作为电子文档的每一页特有的一组局部特征的不可见结,以将捕获的图像与电子文档的一部分相匹配。 本发明包括:图像捕获装置,特征提取和识别系统和数据库。 当打印电子文档时,特征提取和识别系统捕获文档页面的图像。 然后将捕获的图像中的特征提取,索引并存储在数据库中。 给定查询图像,通常是由低分辨率图像捕获设备捕获的一些文档页面的小补丁,提取查询图像中的特征并将其与存储在数据库中的特征进行比较以识别查询图像。 本发明还包括用于识别和跟踪观看区域并查看与输入查询图像相对应的点的方法。 该信息与原始输入文档的呈现相结合,以向用户生成新的图形用户界面。 该用户界面可以显示在常规浏览器上,甚至可以在图像捕获设备的显示器上显示。

    Synthetic image and video generation from ground truth data
    9.
    发明授权
    Synthetic image and video generation from ground truth data 有权
    地面真相数据的合成图像和视频生成

    公开(公告)号:US08238609B2

    公开(公告)日:2012-08-07

    申请号:US13168638

    申请日:2011-06-24

    IPC分类号: G06K9/00

    摘要: A system and a method are disclosed for generating video. Object information is received. A path of motion of the object relative to a reference point is generated. A series of images and ground for a reference frame are generated from the ground truth and the generated path. A system and a method are disclosed for generating an image. Object information is received. Image data and ground truth may be generated using position, the image description, the camera characteristics, and image distortion parameters. A positional relationship between the document and a reference point is determined. An image of the document and ground truth are generated from the object information and the positional relationship and in response to user specified environment of the document.

    摘要翻译: 公开了一种用于产生视频的系统和方法。 收到对象信息。 生成对象相对于参考点的运动路径。 从地面真值和生成的路径生成一系列用于参考帧的图像和地面。 公开了一种用于生成图像的系统和方法。 收到对象信息。 可以使用位置,图像描述,相机特性和图像失真参数来生成图像数据和地面真实。 确定文件与参考点之间的位置关系。 从对象信息和位置关系以及响应于用户指定的文档环境生成文档和地面真值的图像。

    Recognition and tracking using invisible junctions
    10.
    发明授权
    Recognition and tracking using invisible junctions 有权
    识别和跟踪使用隐形路口

    公开(公告)号:US08184155B2

    公开(公告)日:2012-05-22

    申请号:US11776530

    申请日:2007-07-11

    IPC分类号: G06K9/34

    摘要: The present invention uses invisible junctions which are a set of local features unique to every page of the electronic document to match the captured image to a part of an electronic document. The present invention includes: an image capture device, a feature extraction and recognition system and database. When an electronic document is printed, the feature extraction and recognition system captures an image of the document page. The features in the captured image are then extracted, indexed and stored in the database. Given a query image, usually a small patch of some document page captured by a low resolution image capture device, the features in the query image are extracted and compared against those stored in the database to identify the query image. The present invention also includes methods for recognizing and tracking the viewing region and look at point corresponding to the input query image. This information is combined with a rendering of the original input document to generate a new graphical user interface to the user. This user interface can be displayed on a conventional browser or even on the display of an image capture device.

    摘要翻译: 本发明使用作为电子文档的每一页特有的一组局部特征的不可见结,以将捕获的图像与电子文档的一部分相匹配。 本发明包括:图像捕获装置,特征提取和识别系统和数据库。 当打印电子文档时,特征提取和识别系统捕获文档页面的图像。 然后将捕获的图像中的特征提取,索引并存储在数据库中。 给定查询图像,通常是由低分辨率图像捕获设备捕获的一些文档页面的小补丁,提取查询图像中的特征并将其与存储在数据库中的特征进行比较以识别查询图像。 本发明还包括用于识别和跟踪观看区域并查看与输入查询图像相对应的点的方法。 该信息与原始输入文档的呈现相结合,以向用户生成新的图形用户界面。 该用户界面可以显示在常规浏览器上,甚至可以在图像捕获设备的显示器上显示。