Method and apparatus for organizing digital media based on face recognition
    1.
    发明申请
    Method and apparatus for organizing digital media based on face recognition 有权
    基于人脸识别的数字媒体组织方法和装置

    公开(公告)号:US20050105806A1

    公开(公告)日:2005-05-19

    申请号:US10734259

    申请日:2003-12-15

    CPC分类号: G06K9/00677 G06F17/30247

    摘要: In one aspect, the present invention is directed to a method and an apparatus for organizing digital media, particularly digital photos, using face recognition. According to a first aspect of the present invention, a computer-based method for organizing digital photos comprises: extracting objects of interest from a plurality of photographs; cropping said plurality of photographs to generate images of isolated objects of interest; applying a recognition algorithm to determine the similarity of isolated objects of interest with a reference; displaying a plurality of objects arranged as a function of the determined similarity; and receiving user input to associate said objects with a particular classification.

    摘要翻译: 一方面,本发明涉及一种使用人脸识别来组织数字媒体,特别是数字照片的方法和装置。 根据本发明的第一方面,一种用于组织数字照片的基于计算机的方法包括:从多张照片提取感兴趣的对象; 裁剪所述多张照片以产生感兴趣的孤立物体的图像; 应用识别算法来确定所分离的感兴趣对象与参考的相似性; 显示作为所确定的相似度的函数排列的多个对象; 以及接收用户输入以将所述对象与特定分类相关联。

    Method and apparatus for organizing digital media based on face recognition
    2.
    发明授权
    Method and apparatus for organizing digital media based on face recognition 有权
    基于人脸识别的数字媒体组织方法和装置

    公开(公告)号:US08781178B2

    公开(公告)日:2014-07-15

    申请号:US12858097

    申请日:2010-08-17

    IPC分类号: G06K9/00

    CPC分类号: G06K9/00677 G06F17/30247

    摘要: In one aspect, the present invention is directed to a method and an apparatus for organizing digital media, particularly digital photos, using face recognition. According to a first aspect of the present invention, a computer-based method for organizing digital photos comprises: extracting objects of interest from a plurality of photographs; cropping said plurality of photographs to generate images of isolated objects of interest; applying a recognition algorithm to determine the similarity of isolated objects of interest with a reference; displaying a plurality of objects arranged as a function of the determined similarity; and receiving user input to associate said objects with a particular classification.

    摘要翻译: 一方面,本发明涉及一种使用人脸识别来组织数字媒体,特别是数字照片的方法和装置。 根据本发明的第一方面,一种用于组织数字照片的基于计算机的方法包括:从多张照片提取感兴趣的对象; 裁剪所述多张照片以产生感兴趣的孤立物体的图像; 应用识别算法来确定所分离的感兴趣对象与参考的相似性; 显示作为所确定的相似度的函数排列的多个对象; 以及接收用户输入以将所述对象与特定分类相关联。

    Method and apparatus for organizing digital media based on face recognition
    3.
    发明授权
    Method and apparatus for organizing digital media based on face recognition 有权
    基于人脸识别的数字媒体组织方法和装置

    公开(公告)号:US07822233B2

    公开(公告)日:2010-10-26

    申请号:US10734259

    申请日:2003-12-15

    IPC分类号: G06K9/00

    CPC分类号: G06K9/00677 G06F17/30247

    摘要: In one aspect, the present invention is directed to a method and an apparatus for organizing digital media, particularly digital photos, using face recognition. According to a first aspect of the present invention, a computer-based method for organizing digital photos comprises: extracting objects of interest from a plurality of photographs; cropping said plurality of photographs to generate images of isolated objects of interest; applying a recognition algorithm to determine the similarity of isolated objects of interest with a reference; displaying a plurality of objects arranged as a function of the determined similarity; and receiving user input to associate said objects with a particular classification.

    摘要翻译: 一方面,本发明涉及一种使用人脸识别来组织数字媒体,特别是数字照片的方法和装置。 根据本发明的第一方面,一种用于组织数字照片的基于计算机的方法包括:从多张照片提取感兴趣的对象; 裁剪所述多张照片以产生感兴趣的孤立物体的图像; 应用识别算法来确定所分离的感兴趣对象与参考的相似性; 显示作为所确定的相似度的函数排列的多个对象; 以及接收用户输入以将所述对象与特定分类相关联。

    System and method for identifying query-relevant keywords in documents with latent semantic analysis
    4.
    发明申请
    System and method for identifying query-relevant keywords in documents with latent semantic analysis 有权
    在具有潜在语义分析的文档中识别查询相关关键词的系统和方法

    公开(公告)号:US20060106767A1

    公开(公告)日:2006-05-18

    申请号:US10987377

    申请日:2004-11-12

    IPC分类号: G06F17/30

    摘要: A system and method for identifying query-related keywords in documents found in a search using latent semantic analysis. The documents are represented as a document term matrix M containing one or more document term-weight vectors d, which may be term-frequency (tf) vectors or term-frequency inverse-document-frequency (tf-idf) vectors. This matrix is subjected to a truncated singular value decomposition. The resulting transform matrix U can be used to project a query term-weight vector q into the reduced N-dimensional space, followed by its expansion back into the full vector space using the inverse of U. To perform a search, the similarity of qexpanded is measured relative to each candidate document vector in this space. Exemplary similarity functions are dot product and cosine similarity. Keywords are selected with the highest values in qexpanded that are also comprised in at least one document. Matching keywords from the query may be highlighted in the search results.

    摘要翻译: 用于使用潜在语义分析在搜索中发现的文档中识别查询相关关键字的系统和方法。 这些文件被表示为包含一个或多个文档术语权重向量d的文档术语矩阵,其可以是术语频率(tf)向量或术语频率逆文档频率(tf) -idf)载体。 该矩阵经历截断的奇异值分解。 所得到的变换矩阵 U可用于将查询项权重向量q投影到缩小的N维空间中,然后使用 U。 为了执行搜索,相对于该空间中的每个候选文档向量测量q expanded 的相似度。 示例性相似度函数是点积和余弦相似度。 关键字被选择,其中也包含在至少一个文档中的q 扩展的最高值。 查询中的匹配关键字可能会在搜索结果中突出显示。

    System and method for presenting video search results

    公开(公告)号:US20060106764A1

    公开(公告)日:2006-05-18

    申请号:US10986735

    申请日:2004-11-12

    IPC分类号: G06F17/30

    摘要: The invention displays video search results in a form that makes it easy for users to determine which results are truly relevant. Each story returned as a search result is visualized as a collage of keyframes from the story's shots. The selected keyframes and their sizes depend on the corresponding shots' respective relevance. Shot relevance depends on the search retrieval score of the shot and, in some embodiments, also depends on the search retrieval score of the shot's parent story. Once areas have been determined, the keyframes are scaled and/or cropped to fit into the area. In one embodiment, users can mark one or more shots as being relevant to the search. In one embodiment, a timeline is created and displayed with one or more neighbor stories that are each part of the video and which are closest in time of creation to the selected story.

    Method and system for analyzing fixed-camera video via the selection, visualization, and interaction with storyboard keyframes
    6.
    发明授权
    Method and system for analyzing fixed-camera video via the selection, visualization, and interaction with storyboard keyframes 有权
    通过选择,可视化和与故事板关键帧的交互来分析固定摄像机视频的方法和系统

    公开(公告)号:US08089563B2

    公开(公告)日:2012-01-03

    申请号:US11324557

    申请日:2006-01-03

    IPC分类号: H04N5/14 H04N7/18 G06K9/34

    摘要: Techniques for generating a storyboard are disclosed. In one embodiment of the invention the storyboard is comprised of videos from one or more cameras based on the identification of activity in the video. Various embodiments of the invention include an assessment of the importance of the activity, the creation of a storyboard presentation based on importance and interaction techniques for seeing more details or alternate views of the video. In one embodiment, motion detection is used to determine activity in one or more synchronized video streams. Periods of activity are recognized and assigned importance assessments based on the activity, important locations in the video streams, and events from other sensors. In different embodiments, the interface consists of a storyboard and a map.

    摘要翻译: 公开了生成故事板的技术。 在本发明的一个实施例中,故事板由来自一个或多个相机的视频组成,基于视频中活动的识别。 本发明的各种实施例包括评估活动的重要性,基于重要性的创建故事板呈现以及用于观看视频的更多细节或替代视图的交互技术。 在一个实施例中,运动检测用于确定一个或多个同步视频流中的活动。 根据活动,视频流中的重要位置以及来自其他传感器的事件,识别和分配活动的时间段并进行重要性评估。 在不同的实施例中,界面由故事板和地图组成。

    Methods and interfaces for event timeline and logs of video streams
    7.
    发明授权
    Methods and interfaces for event timeline and logs of video streams 有权
    事件时间线和视频流日志的方法和接口

    公开(公告)号:US07996771B2

    公开(公告)日:2011-08-09

    申请号:US11324971

    申请日:2006-01-03

    IPC分类号: G06F3/00

    摘要: Techniques for generating timelines and event logs from one or more fixed-position cameras based on the identification of activity in the video are presented. Various embodiments of the invention include an assessment of the importance of the activity, the creation of a timeline identifying events of interest, and interaction techniques for seeing more details of an event or alternate views of the video. In one embodiment, motion detection is used to determine activity in one or more synchronized video streams. In another embodiment, events are determined based on periods of activity and assigned importance assessments based on the activity, important locations in the video streams, and events from other sensors. In different embodiments, the interface consists of a timeline, event log, and map.

    摘要翻译: 提出了基于视频中的活动识别从一个或多个固定位置摄像机生成时间线和事件日志的技术。 本发明的各种实施例包括评估活动的重要性,创建识别感兴趣事件的时间线以及用于查看视频的事件或替代视图的更多细节的交互技术。 在一个实施例中,运动检测用于确定一个或多个同步视频流中的活动。 在另一个实施例中,基于活动的周期和基于活动的重要性评估,视频流中的重要位置以及来自其他传感器的事件来确定事件。 在不同的实施例中,接口由时间线,事件日志和地图组成。

    Systems and methods for organizing files in a graph-based layout
    8.
    发明授权
    Systems and methods for organizing files in a graph-based layout 有权
    用于在基于图形的布局中组织文件的系统和方法

    公开(公告)号:US08832119B2

    公开(公告)日:2014-09-09

    申请号:US12137718

    申请日:2008-06-12

    CPC分类号: G06F17/30058

    摘要: An adaptive, interactive visual workspace for viewing groups of files based on their relationships. Relationships of files are visualized using iterative refinement of categories through a direct-manipulation graph-based layout. The visual workspace starts with a fully connected graph linking thumbnail images of related files that is then partitioned into neighborhoods in response to a user creating file stacks corresponding to different categories. Normalized spring lengths improve the overall quality of the layout. Different modes for membership in neighborhoods avoid confusing motion of files and help a user to manually organize the workspace. Additionally, retrieved files can be added without having to significantly move the previous files. Different visualization techniques indicate which files are related to each other. Different zoom rates are used for file location, and surrogate sizes allow users to increase the separation between files while still increasing the surrogate sizes.

    摘要翻译: 一个自适应的交互式视觉工作区,用于根据他们的关系来查看文件组。 文件的关系通过直接操作图形布局的类别迭代细化进行可视化。 可视化工作区以完全连接的图形开始,将图形相关联的文件链接起来,然后将其分成邻域,以响应用户创建对应于不同类别的文件堆栈。 标准化的弹簧长度提高了布局的整体质量。 用于居民区成员身份的不同模式可避免混淆文件运动,并帮助用户手动组织工作区。 此外,可以添加检索到的文件,而不必显着移动以前的文件。 不同的可视化技术指出哪些文件是相互关联的。 文件位置使用不同的缩放比例,替代尺寸允许用户增加文件之间的间隔,同时仍然增加代理大小。

    Genetic segmentation method for data, such as image data streams
    9.
    发明授权
    Genetic segmentation method for data, such as image data streams 有权
    用于数据的遗传分割方法,如图像数据流

    公开(公告)号:US06819795B1

    公开(公告)日:2004-11-16

    申请号:US09611389

    申请日:2000-07-07

    IPC分类号: G06K934

    CPC分类号: G11B27/28

    摘要: A method, information system, and computer-readable medium is provided for segmenting a plurality of data, such as multimedia data, and in particular an image document stream. Segment boundary points may be used for retrieving and/or browsing the plurality of data. Similarly, segment boundary points may be used to summarize the plurality of data. Examples of image document streams include video, PowerPoint slides, and NoteLook pages. A genetic method having a fitness or evaluation function using information retrieval concepts, such as importance and precedence, is used to obtain segment boundary points. The genetic method is able to evaluate a large amount of data in a cost effective manner. The genetic method is also able to run incrementally on streaming video and adapt to usage patterns by considering frequently accessed images.

    摘要翻译: 提供了一种方法,信息系统和计算机可读介质,用于分割多个数据,例如多媒体数据,特别是图像文档流。 段边界点可用于检索和/或浏览多个数据。 类似地,段边界点可以用于汇总多个数据。 图像文档流的示例包括视频,PowerPoint幻灯片和NoteLook页面。 使用具有使用诸如重要性和优先级之类的信息检索概念的适应度或评价函数的遗传方法来获得段边界点。 遗传方法能够以成本有效的方式评估大量数据。 遗传方法还能够在流式视频上逐步运行,并通过考虑频繁访问的图像来适应使用模式。

    Methods and interfaces for visualizing activity across video frames in an action keyframe
    10.
    发明授权
    Methods and interfaces for visualizing activity across video frames in an action keyframe 有权
    用于在动作关键帧中的视频帧之间可视化活动的方法和接口

    公开(公告)号:US07623677B2

    公开(公告)日:2009-11-24

    申请号:US11324355

    申请日:2006-01-03

    IPC分类号: G06K9/00 H04N7/18 H04N5/14

    CPC分类号: G06F17/30811

    摘要: Techniques for generating action keyframes for a fixed-position camera based on the identification of activity in the video, an assessment of the importance of the activity, object recognition in the video, and interaction techniques for seeing more details of the video are presented. In different embodiments of the invention, the importance of activity is determined based on the amount of activity, important locations in the video streams, detected features such as faces, and events from other sensors.

    摘要翻译: 提供了基于视频中的活动识别,活动重要性的评估,视频中的对象识别以及用于观看视频的更多细节的交互技术来生成针对固定位置摄像机的动作关键帧的技术。 在本发明的不同实施例中,活动的重要性基于活动量,视频流中的重要位置,检测到的特征(例如面部)和来自其它传感器的事件来确定。