Retrieving and sharing electronic documents using paper
    11.
    发明授权
    Retrieving and sharing electronic documents using paper 有权
    使用纸张检索和分享电子文件

    公开(公告)号:US08139860B2

    公开(公告)日:2012-03-20

    申请号:US12184124

    申请日:2008-07-31

    IPC分类号: G06K9/34

    摘要: In an embodiment of the invention, an electronic document (e-document) can be searched and found by capturing an image of the printed document. Instead of typing in a file name or searching through multiple directories, the user simply takes a picture of the document with a camera and the system uses the document image to locate the e-document. In an alternative embodiment of the invention, an image of a printed document can be useful for remote document sharing. In various embodiments of the invention, sharing an image of a printed document can be used to email a high quality paper document, send a high quality fax, or open a document to a page containing an annotation. Through co-design of the feature extraction and search algorithm in the system, the image feature detection robustness and search speed are improved at the same time.

    摘要翻译: 在本发明的实施例中,可以通过捕获打印文档的图像来搜索和发现电子文档(电子文档)。 用户只需使用相机对文档进行照片,而不是输入文件名或搜索多个目录,系统会使用文档图像来定位电子文档。 在本发明的替代实施例中,打印文档的图像可用于远程文档共享。 在本发明的各种实施例中,共享打印文档的图像可以用于向高质量的纸质文档发送高质量的传真,或者向包含注释的页面打开文档。 通过系统中特征提取和搜索算法的共同设计,同时提高了图像特征检测的鲁棒性和搜索速度。

    System and method for video access from notes or summaries
    12.
    发明授权
    System and method for video access from notes or summaries 失效
    从笔记或摘要视频访问的系统和方法

    公开(公告)号:US07647555B1

    公开(公告)日:2010-01-12

    申请号:US09548685

    申请日:2000-04-13

    IPC分类号: G06F3/00

    摘要: Recorded video is accessed from printed notes or summaries derived from the video. Summaries may be created automatically by analyzing the recorded video, and annotations are made by a user on a device for note-taking with digital ink and video. The notes and/or summaries are printed along with data glyphs that provide time based indexes or offsets into the recorded video. The indexes or offsets are retrieved by scanning the glyph on the printout. The glyph information can be embedded in the printouts in many ways. One method is to associate block glyphs with annotations or images on the printed pages. Another method is to provide an address carpet in an annotated timeline. Yet another method is to provide a two-dimensional address carpet with X-Y position mapped to time which can be used to provide selected access to the video. The accessed video may be played back on the note-taking device on a pen computer, or on a summary interface on a Web browser-type device.

    摘要翻译: 录制的视频可从打印的笔记或从视频派生的摘要访问。 可以通过分析录制的视频来自动创建摘要,并且用户在设备上进行注释,以便使用数字墨水和视频进行笔记。 笔记和/或摘要与将记录的视频中提供基于时间的索引或偏移量的数据字形一起打印。 通过扫描打印输出上的字形来检索索引或偏移量。 字形信息可以以许多方式嵌入到打印输出中。 一种方法是将块字形与印刷页面上的注释或图像相关联。 另一种方法是在注释时间线中提供地址毯。 另一种方法是提供二维地址毯,其中X-Y位置映射到时间,其可以用于提供对视频的选择的访问。 所访问的视频可以在笔式计算机上的笔记本设备上或者在Web浏览器型设备的汇总接口上播放。

    Reduced representations of video sequences
    13.
    发明授权
    Reduced representations of video sequences 有权
    减少视频序列的表示

    公开(公告)号:US07149974B2

    公开(公告)日:2006-12-12

    申请号:US10116012

    申请日:2002-04-03

    IPC分类号: G06F3/048 H04N5/262

    摘要: A system that represents a video sequence comprising plurality of video clips as a number of images. The plurality of video clips is represented as a reduced representation of video images. A video clip is represented as a keyframe, wherein multiple keyframes may then be arranged according to chronological order. All or only a representative portion of the video clips can be represented as keyframes. The size of the keyframe may be configured to represent the length or importance of the video clip. The keyframe may depict an entire frame of a video clip, or a region of meaningful information within a frame of a video clip. Multiple keyframes may be arranged in a two dimensional array, in an S-shaped curve, or some other pattern. The keyframes may depict motion of an object occurring over time in the video clip by configuring groups of pixels in the key frame. Configuring groups of pixels may include colorizing pixel groups and depicting pixel groups at a semi-transparent level according to the number of frames between the keyframe and the frame containing the object in motion.

    摘要翻译: 一种系统,其将包括多个视频剪辑的视频序列表示为多个图像。 多个视频剪辑被表示为视频图像的缩小表示。 视频剪辑被表示为关键帧,其中可以根据时间顺序来布置多个关键帧。 视频剪辑的全部或仅有代表性部分可以表示为关键帧。 可以将关键帧的大小配置为表示视频剪辑的长度或重要性。 关键帧可以描绘视频剪辑的整个帧或视频剪辑的帧内的有意义的信息区域。 多个关键帧可以以二维阵列,S形曲线或其他一些图案排列。 关键帧可以通过配置关键帧中的像素组来描绘视频剪辑中随时间发生的对象的运动。 配置像素组可以包括根据在关键帧和包含运动对象的帧之间的帧数来对像素组进行着色和描绘半透明级别的像素组。

    Capturing and producing shared multi-resolution video
    14.
    发明授权
    Capturing and producing shared multi-resolution video 有权
    捕获和制作共享多分辨率视频

    公开(公告)号:US06839067B2

    公开(公告)日:2005-01-04

    申请号:US10205739

    申请日:2002-07-26

    摘要: A method and apparatus for providing multi-resolution video to multiple users under hybrid human and automatic control. Initial environment and close-up images are captured using a first camera and a PTZ camera. The initial images are then stored in memory. Current environment and close-up images are captured and the an estimated difference between the initial and current images and the true image is determined. The estimated differences are weighted and compared and the stored images are updated. A close-up image is then provided to each user of the system. The close-up camera is then directed to a portion of the environment image having high distortion, and current environment and close-up images are captured again.

    摘要翻译: 一种用于在混合人力和自动控制下向多个用户提供多分辨率视频的方法和装置。 使用第一台摄像机和一台PTZ摄像机拍摄初始环境和特写图像。 然后将初始图像存储在存储器中。 捕获当前环境和特写图像,并确定初始图像和当前图像与真实图像之间的估计差异。 估计的差异被加权和比较,并且存储的图像被更新。 然后将特写图像提供给系统的每个用户。 特写相机然后被引导到具有高失真的环境图像的一部分,并且再次捕获当前环境和特写图像。

    Method and apparatus for dynamically grouping a plurality of graphic
objects
    15.
    发明授权
    Method and apparatus for dynamically grouping a plurality of graphic objects 失效
    用于动态地分组多个图形对象的方法和装置

    公开(公告)号:US5889523A

    公开(公告)日:1999-03-30

    申请号:US977810

    申请日:1997-11-25

    摘要: When dynamically grouping a plurality of graphic objects, such as displayed on a graphic input display apparatus, a cluster tree is formed for the plurality of graphic objects. The cluster tree is based on a plurality of different types of distance measures. These include a time distance and a spatial distance. These distances are combined to form a distance metric indicting a distance between a pair of the graphic objects. Each level of the cluster tree defines a new cluster of the graphic objects. At least one of the graphic objects is selected. The different cluster levels of the cluster tree containing the selected graphic object are displayable. The displayed cluster of the graphic objects can be modified to increase or decrease the cluster level of the cluster containing the selected graphic object.

    摘要翻译: 当对诸如在图形输入显示装置上显示的多个图形对象进行动态分组时,为多个图形对象形成簇树。 集群树基于多种不同类型的距离度量。 这些包括时间距离和空间距离。 这些距离被组合以形成指示一对图形对象之间的距离的距离度量。 集群树的每个级别都定义了一个新的图形对象集群。 选择至少一个图形对象。 包含所选图形对象的集群树的不同集群级别是可显示的。 可以修改显示的图形对象集群,以增加或减少包含所选图形对象的集群的集群级别。

    Method and apparatus for organizing digital media based on face recognition
    16.
    发明授权
    Method and apparatus for organizing digital media based on face recognition 有权
    基于人脸识别的数字媒体组织方法和装置

    公开(公告)号:US07822233B2

    公开(公告)日:2010-10-26

    申请号:US10734259

    申请日:2003-12-15

    IPC分类号: G06K9/00

    CPC分类号: G06K9/00677 G06F17/30247

    摘要: In one aspect, the present invention is directed to a method and an apparatus for organizing digital media, particularly digital photos, using face recognition. According to a first aspect of the present invention, a computer-based method for organizing digital photos comprises: extracting objects of interest from a plurality of photographs; cropping said plurality of photographs to generate images of isolated objects of interest; applying a recognition algorithm to determine the similarity of isolated objects of interest with a reference; displaying a plurality of objects arranged as a function of the determined similarity; and receiving user input to associate said objects with a particular classification.

    摘要翻译: 一方面,本发明涉及一种使用人脸识别来组织数字媒体,特别是数字照片的方法和装置。 根据本发明的第一方面,一种用于组织数字照片的基于计算机的方法包括:从多张照片提取感兴趣的对象; 裁剪所述多张照片以产生感兴趣的孤立物体的图像; 应用识别算法来确定所分离的感兴趣对象与参考的相似性; 显示作为所确定的相似度的函数排列的多个对象; 以及接收用户输入以将所述对象与特定分类相关联。

    Browsing video collections using hypervideo summaries derived from hierarchical clustering
    17.
    发明申请
    Browsing video collections using hypervideo summaries derived from hierarchical clustering 审中-公开
    使用从层次聚类导出的超视频摘要浏览视频集

    公开(公告)号:US20080127270A1

    公开(公告)日:2008-05-29

    申请号:US11498686

    申请日:2006-08-02

    IPC分类号: G06F3/00

    摘要: The invention provides for quickly browsing through a large set of video clips to locate video clips of interest. In an embodiment of the present invention, hierarchical clustering of the video clips can be undertaken enabling the user to successively identify the subgroup of video clips of interest. This approach generates a video summary for the contents of each cluster by selecting representative video clips from individual videos and lower level clusters within the cluster. Links are added between the more general, higher-level clusters and the elements they contain. Thus, starting at the top of the set of videos being browsed or returned by the search engine and continuing at each subsequent cluster level, the user is presented with video summaries for the relevant parts of videos and those of next lower-level clusters. The user can then follow the navigational link to the desired video or lower-level cluster.

    摘要翻译: 本发明提供了快速浏览大量视频剪辑以定位感兴趣的视频剪辑。 在本发明的实施例中,可以进行视频剪辑的分层聚类,使得用户能够连续地识别感兴趣的视频剪辑的子组。 该方法通过从群集中的各个视频和较低级别的群集中选择代表性的视频片段,为每个群集的内容生成视频摘要。 链接被添加在更一般的,更高级别的集群和它们包含的元素之间。 因此,从搜索引擎浏览或返回的视频集合的顶部开始,并在每个后续的集群级别继续,向用户呈现视频的相关部分和下一级下级集群的视频摘要。 然后,用户可以跟踪到所需视频或较低级别集群的导航链接。

    Calendar-based interfaces for browsing and manipulation of digital images
    18.
    发明授权
    Calendar-based interfaces for browsing and manipulation of digital images 有权
    基于日历的界面,用于浏览和操纵数字图像

    公开(公告)号:US07325198B2

    公开(公告)日:2008-01-29

    申请号:US10334473

    申请日:2002-12-31

    IPC分类号: G06F3/00

    CPC分类号: G06F17/30274 G06Q10/109

    摘要: Embodiments of the present invention provide the ability to navigate, view, and manipulate a collection of digital images utilizing a GUI that has the familiar context of a calendar. Graphical objects representative of digital images are displayed within a particular day displayed in a calendar-based GUI. A user may group digital images into groups, modify the date with which a digital image is associated and perform various other manipulations using embodiments of a calendar-based GUI.

    摘要翻译: 本发明的实施例提供使用具有熟悉的日历上下文的GUI来导航,查看和操纵数字图像的集合的能力。 表示数字图像的图形对象在基于日历的GUI中显示的特定日期显示。 用户可以将数字图像分组成组,修改与数字图像相关联的日期,并使用基于日历的GUI的实施例执行各种其他操作。

    Automatic video segmentation using hidden markov model
    19.
    发明授权
    Automatic video segmentation using hidden markov model 失效
    使用隐马尔可夫模型的自动视频分割

    公开(公告)号:US6072542A

    公开(公告)日:2000-06-06

    申请号:US977808

    申请日:1997-11-25

    摘要: Detection of video shot boundaries using a Video Segmenting Hidden Markov Model to model the sequence of states of a video. The Video Segmenting Hidden Markov Model determines the state sequence based on feature values. Using Hidden Markov Model techniques allows for automatic learning and use of multiple features including motion vectors, audio differences and histogram differences, without the need for manual adjustments of these thresholds.

    摘要翻译: 使用视频分段隐马尔可夫模型检测视频拍摄边界以对视频的状态序列进行建模。 视频分割隐马尔可夫模型基于特征值确定状态序列。 使用隐马尔科夫模型技术允许自动学习和使用多个功能,包括运动矢量,音频差异和直方图差异,而无需手动调整这些阈值。