Method and apparatus for organizing digital media based on face recognition
    1.
    Granted invention patent
    Status: in force

    Publication No.: US08781178B2

    Publication date: 2014-07-15

    Application No.: US12858097

    Filing date: 2010-08-17

    IPC classification: G06K9/00

    CPC classification: G06K9/00677 G06F17/30247

    Abstract: In one aspect, the present invention is directed to a method and an apparatus for organizing digital media, particularly digital photos, using face recognition. According to a first aspect of the present invention, a computer-based method for organizing digital photos comprises: extracting objects of interest from a plurality of photographs; cropping said plurality of photographs to generate images of isolated objects of interest; applying a recognition algorithm to determine the similarity of isolated objects of interest with a reference; displaying a plurality of objects arranged as a function of the determined similarity; and receiving user input to associate said objects with a particular classification.
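
    The step of displaying objects "arranged as a function of the determined similarity" can be sketched as a sort against a reference. This is a minimal illustration, not the patented method: it assumes each cropped face has already been reduced to a numeric feature vector, and it uses cosine similarity as a stand-in for the unspecified recognition algorithm. The names `cosine_similarity`, `rank_by_similarity`, and the sample vectors are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_by_similarity(reference, faces):
    """Return face ids ordered from most to least similar to the reference.

    `faces` maps a face id to its feature vector (an assumed representation).
    """
    return sorted(
        faces,
        key=lambda face_id: cosine_similarity(reference, faces[face_id]),
        reverse=True,
    )


# Hypothetical feature vectors for three cropped faces.
faces = {
    "crop_a": [0.9, 0.1, 0.0],
    "crop_b": [0.0, 1.0, 0.2],
    "crop_c": [0.7, 0.3, 0.1],
}
reference = [1.0, 0.0, 0.0]
ranking = rank_by_similarity(reference, faces)
```

    A user interface could then lay out the crops in `ranking` order and let the user confirm or correct the classification of each one.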

    Method and apparatus for organizing digital media based on face recognition
    2.
    Granted invention patent
    Status: in force

    Publication No.: US07822233B2

    Publication date: 2010-10-26

    Application No.: US10734259

    Filing date: 2003-12-15

    IPC classification: G06K9/00

    CPC classification: G06K9/00677 G06F17/30247

    Abstract: In one aspect, the present invention is directed to a method and an apparatus for organizing digital media, particularly digital photos, using face recognition. According to a first aspect of the present invention, a computer-based method for organizing digital photos comprises: extracting objects of interest from a plurality of photographs; cropping said plurality of photographs to generate images of isolated objects of interest; applying a recognition algorithm to determine the similarity of isolated objects of interest with a reference; displaying a plurality of objects arranged as a function of the determined similarity; and receiving user input to associate said objects with a particular classification.

    System and method for presenting video search results
    3.
    Granted invention patent
    Status: in force

    Publication No.: US07555718B2

    Publication date: 2009-06-30

    Application No.: US10986735

    Filing date: 2004-11-12

    IPC classification: G06F3/00

    Abstract: The invention displays video search results in a form that makes it easy for users to determine which results are truly relevant. Each story returned as a search result is visualized as a collage of keyframes from the story's shots. The selected keyframes and their sizes depend on the corresponding shots' respective relevance. Shot relevance depends on the search retrieval score of the shot and, in some embodiments, also depends on the search retrieval score of the shot's parent story. Once areas have been determined, the keyframes are scaled and/or cropped to fit into the area. In one embodiment, users can mark one or more shots as being relevant to the search. In one embodiment, a timeline is created and displayed with one or more neighbor stories that are each part of the video and which are closest in time of creation to the selected story.
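
    The dependence of keyframe selection and size on shot relevance can be sketched as follows. The abstract does not say how the shot score and the parent-story score are combined, so the linear blend in `shot_relevance`, the proportional area split, and all names and numbers here are assumptions made for illustration only.

```python
def shot_relevance(shot_score, story_score, story_weight=0.5):
    """Blend a shot's retrieval score with its parent story's score.

    The linear mix and the default weight are assumptions, not from the source.
    """
    return (1.0 - story_weight) * shot_score + story_weight * story_score


def allocate_areas(shot_scores, story_score, total_area, max_keyframes=4):
    """Pick the most relevant shots and split the collage area among them
    in proportion to relevance (at least one shot must have relevance > 0)."""
    relevance = {
        shot_id: shot_relevance(score, story_score)
        for shot_id, score in shot_scores.items()
    }
    chosen = sorted(relevance, key=relevance.get, reverse=True)[:max_keyframes]
    total = sum(relevance[shot_id] for shot_id in chosen)
    return {shot_id: total_area * relevance[shot_id] / total for shot_id in chosen}


# Hypothetical retrieval scores for three shots of one story.
areas = allocate_areas(
    {"shot1": 0.8, "shot2": 0.2, "shot3": 0.4},
    story_score=0.6,
    total_area=12000,
    max_keyframes=2,
)
```

    Each keyframe would then be scaled and/or cropped to the area assigned to its shot.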

    System and method for identifying query-relevant keywords in documents with latent semantic analysis
    4.
    Granted invention patent
    Status: in force

    Publication No.: US07440947B2

    Publication date: 2008-10-21

    Application No.: US10987377

    Filing date: 2004-11-12

    IPC classification: G06F7/00

    Abstract: A system and method for identifying query-related keywords in documents found in a search using latent semantic analysis. The documents are represented as a document term matrix M containing one or more document term-weight vectors d, which may be term-frequency (tf) vectors or term-frequency inverse-document-frequency (tf-idf) vectors. This matrix is subjected to a truncated singular value decomposition. The resulting transform matrix U can be used to project a query term-weight vector q into the reduced N-dimensional space, followed by its expansion back into the full vector space using the inverse of U. To perform a search, the similarity of q_expanded is measured relative to each candidate document vector in this space. Exemplary similarity functions are dot product and cosine similarity. Keywords are selected with the highest values in q_expanded that are also comprised in at least one document. Matching keywords from the query may be highlighted in the search results.
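
    The project-then-expand step can be illustrated on a toy corpus. This is a sketch under stated assumptions rather than the patented implementation: rows of `doc_term` are documents, U is taken to hold the top N term-space singular vectors of M, and because those vectors are orthonormal the "inverse" used for expansion is taken to be the transpose. To stay dependency-free, the truncated SVD is replaced by power iteration with deflation on the term-term matrix, which yields the same vectors for this small example. All function names and data are hypothetical.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def top_eigenvectors(sym, k, iters=200):
    """Top-k eigenvectors of a symmetric positive semi-definite matrix,
    found by power iteration with deflation (stand-in for a library SVD)."""
    sym = [list(row) for row in sym]
    n = len(sym)
    vectors = []
    for _ in range(k):
        v = [1.0] * n
        for _ in range(iters):
            w = [dot(row, v) for row in sym]
            norm = math.sqrt(dot(w, w))
            v = [x / norm for x in w]
        eigenvalue = dot(v, [dot(row, v) for row in sym])
        vectors.append(v)
        # Deflate: subtract the component just found so the next pass finds the next one.
        for i in range(n):
            for j in range(n):
                sym[i][j] -= eigenvalue * v[i] * v[j]
    return vectors


def expand_query(doc_term, q, n_dims):
    """Project q into the reduced space, then expand it back to term space."""
    n_terms = len(q)
    gram = [[sum(d[i] * d[j] for d in doc_term) for j in range(n_terms)]
            for i in range(n_terms)]
    u = top_eigenvectors(gram, n_dims)      # columns of U, one list per column
    reduced = [dot(col, q) for col in u]    # projection: U^T q
    return [sum(c * col[i] for c, col in zip(reduced, u)) for i in range(n_terms)]


def select_keywords(terms, doc_term, q_expanded, count):
    """Highest-valued terms of q_expanded that occur in at least one document."""
    present = [i for i in range(len(terms)) if any(d[i] > 0 for d in doc_term)]
    best = sorted(present, key=lambda i: q_expanded[i], reverse=True)[:count]
    return [terms[i] for i in best]


terms = ["car", "auto", "engine", "flower", "petal"]
doc_term = [
    [1, 0, 1, 0, 0],  # car engine
    [0, 1, 1, 0, 0],  # auto engine
    [0, 0, 0, 1, 1],  # flower petal
    [0, 0, 0, 1, 1],  # flower petal
]
q = [1, 0, 0, 0, 0]   # query: "car"
q_expanded = expand_query(doc_term, q, n_dims=2)
keywords = select_keywords(terms, doc_term, q_expanded, count=1)
```

    In this example the expanded query gives "auto" a positive weight even though it never appears in the query, because both terms co-occur with "engine"; the unrelated terms stay at zero.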

    Calendar-based interfaces for browsing and manipulation of digital images
    5.
    Granted invention patent
    Status: in force

    Publication No.: US07325198B2

    Publication date: 2008-01-29

    Application No.: US10334473

    Filing date: 2002-12-31

    IPC classification: G06F3/00

    CPC classification: G06F17/30274 G06Q10/109

    Abstract: Embodiments of the present invention provide the ability to navigate, view, and manipulate a collection of digital images utilizing a GUI that has the familiar context of a calendar. Graphical objects representative of digital images are displayed within a particular day displayed in a calendar-based GUI. A user may group digital images into groups, modify the date with which a digital image is associated and perform various other manipulations using embodiments of a calendar-based GUI.

    Dynamically Controlling a Cursor on a Screen when Using a Video Camera as a Pointing Device
    6.
    Invention patent application
    Status: in force

    Publication No.: US20090046061A1

    Publication date: 2009-02-19

    Application No.: US11838817

    Filing date: 2007-08-14

    IPC classification: G06F3/033

    CPC classification: G06F3/0304

    Abstract: A system provides for controlling a cursor on a screen automatically and dynamically when using a video camera as a pointing device. A computer displays static or dynamic content to a screen. A video camera connected to the computer points at the screen. As the video camera films the screen, frames captured by the video camera are sent to the computer. A target image is displayed by the computer onto the screen and marks the position of the screen cursor of the video camera. Frames captured by the video camera include the target image, and the computer dynamically moves the target image on the screen to ensure that the target image stays in the center of the view of the video camera.
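
    The feedback loop that keeps the target image centered in the camera view can be sketched as a proportional controller. The abstract does not give a control law, so the gain, the idealized camera model (axes aligned with the screen, a fixed scale between screen units and frame pixels), and the names `update_target` and `simulate` are all assumptions.

```python
def update_target(target, detected, frame_center, pixels_per_screen_unit, gain=0.5):
    """Move the on-screen target so that it drifts toward the centre of the camera view.

    target: (x, y) of the target image on the screen.
    detected: (x, y) where the target was found in the captured frame.
    """
    dx = (frame_center[0] - detected[0]) / pixels_per_screen_unit
    dy = (frame_center[1] - detected[1]) / pixels_per_screen_unit
    return (target[0] + gain * dx, target[1] + gain * dy)


def simulate(aim_point, target, steps=40, scale=2.0, frame_center=(320.0, 240.0)):
    """Run the loop against an idealised camera aimed at `aim_point` on the screen."""
    for _ in range(steps):
        # In this model the target appears offset from the frame centre by its
        # on-screen distance from the aim point, multiplied by `scale`.
        detected = (
            frame_center[0] + (target[0] - aim_point[0]) * scale,
            frame_center[1] + (target[1] - aim_point[1]) * scale,
        )
        target = update_target(target, detected, frame_center, scale)
    return target


final = simulate(aim_point=(500.0, 300.0), target=(100.0, 100.0))
```

    With a gain below 1 the target moves a fraction of the remaining offset on every frame, so it settles on the point the camera is aimed at instead of overshooting.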

    Methods and apparatus for source location estimation from microphone-array time-delay estimates
    7.
    Granted invention patent
    Status: lapsed

    Publication No.: US5737431A

    Publication date: 1998-04-07

    Application No.: US399427

    Filing date: 1995-03-07

    IPC classification: G01V1/00 H04N7/15 H04R3/00

    CPC classification: H04N7/15 G01V1/001

    Abstract: Apparatus for determining the location of a signal-generating source (e.g., a conferee in a telephone conference) includes at least three sensors (e.g., microphones) arranged in a plurality of sets, each having two or more sensors. A surface-finding element responds to receipt at each sensor set of signals (e.g., speech) from the source for identifying a geometric surface (e.g., the surface of a hyperboloid or cone) representing potential locations of the source as a function of sensor locations and time difference of arrival of the signals. A location-approximating element coupled to two or more of the sets identifies a line that further defines potential source locations at the intersection of the surfaces. A location signal representing those potential locations is generated in accord with parameters of that line. Further functionality provides for generating the location signal as a function of closest intersections of plural ones of the aforementioned lines.
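
    The cone mentioned in the abstract can be illustrated with the standard far-field relation for a single microphone pair: a delay tau between two microphones a distance d apart confines the source to a cone around the pair's axis with half-angle theta, where cos(theta) = c * tau / d. The sketch below covers only that one geometric step and makes its own assumptions: a speed of sound of 343 m/s, a far-field source, and the hypothetical function name `cone_half_angle`. Intersecting the surfaces from several pairs, as the abstract describes, is not shown.

```python
import math

SPEED_OF_SOUND = 343.0  # metres per second; an assumed value for room-temperature air


def cone_half_angle(delay_s, mic_spacing_m, speed=SPEED_OF_SOUND):
    """Half-angle (radians) of the far-field cone of possible source directions.

    delay_s is the arrival time at microphone B minus the arrival time at
    microphone A; the angle is measured from the axis pointing from B to A.
    """
    cos_theta = speed * delay_s / mic_spacing_m
    # Clamp to guard against measurement noise or rounding pushing |cos| above 1.
    cos_theta = max(-1.0, min(1.0, cos_theta))
    return math.acos(cos_theta)


broadside = cone_half_angle(0.0, 0.5)               # zero delay: source is broadside
oblique = cone_half_angle(0.25 / SPEED_OF_SOUND, 0.5)
```

    A zero delay places the source on the plane perpendicular to the pair's axis, while the largest physically possible delay, d / c, places it on the axis itself.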

    Systems and methods for embedding data by dimensional compression and expansion
    8.

    Publication No.: US06999598B2

    Publication date: 2006-02-14

    Application No.: US10104017

    Filing date: 2002-03-25

    IPC classification: G06K9/00

    CPC classification: G06T1/0085 G06T1/0057

    Abstract: The systems and methods of this invention watermark an original data file using dimensional compression and expansion. The original data file extends along a given dimension and has portions that extend along that given dimension. The information is embedded into the data file by selectively dimensionally compressing or expanding a size of each of some or all of the portions along the given dimension, which can be space or time. The portions of the data file are selectively dimensionally expanded or compressed according to a given encoding scheme. This encoding scheme can use the kind of modification, the relationships between the type of modification between adjacent portions, or the duration or degree of compression or expansion to store a portion of the embedded information. The portions of the embedded information can be individual bits of binary or trinary information, or can be a portion of analog information.
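
    One of the encoding schemes the abstract allows, using the kind of modification to carry one binary bit per portion, can be sketched on a one-dimensional signal. This is an illustrative toy, not the patented scheme: it assumes fixed-length portions, nearest-neighbour resampling, and a decoder that already knows the portion boundaries and the nominal portion length. The names `resample`, `embed`, and `extract` are hypothetical.

```python
def resample(portion, new_len):
    """Nearest-neighbour stretch or squeeze of one portion to `new_len` samples."""
    old_len = len(portion)
    return [portion[i * old_len // new_len] for i in range(new_len)]


def embed(signal, bits, portion_len, delta):
    """Expand a portion by `delta` samples for a 1 bit, compress it by `delta` for a 0 bit.

    Returns the modified portions as separate lists so that the boundaries stay
    known; recovering boundaries from a joined signal is outside this sketch.
    """
    portions = []
    for k, bit in enumerate(bits):
        portion = signal[k * portion_len:(k + 1) * portion_len]
        new_len = portion_len + delta if bit else portion_len - delta
        portions.append(resample(portion, new_len))
    return portions


def extract(portions, portion_len):
    """Read back one bit per portion by comparing its length with the nominal length."""
    return [1 if len(portion) > portion_len else 0 for portion in portions]


signal = list(range(40))
bits = [1, 0, 0, 1]
marked = embed(signal, bits, portion_len=10, delta=2)
recovered = extract(marked, portion_len=10)
```

    A trinary variant could leave a portion unmodified to encode a third symbol, and the degree of expansion could carry further information, as the abstract notes.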

    Methods and apparatus for source location estimation from microphone-array time-delay estimates
    9.
    Granted invention patent
    Status: lapsed

    Publication No.: US06243471B1

    Publication date: 2001-06-05

    Application No.: US09055556

    Filing date: 1998-04-06

    IPC classification: H04R3/00

    CPC classification: H04N7/15 G01V1/001

    Abstract: Apparatus for determining the location of a signal-generating source (e.g., a conferee in a telephone conference) includes at least three sensors (e.g., microphones) arranged in a plurality of sets, each having two or more sensors. A surface-finding element responds to receipt at each sensor set of signals (e.g., speech) from the source for identifying a geometric surface (e.g., the surface of a hyperboloid or cone) representing potential locations of the source as a function of sensor locations and time difference of arrival of the signals. A location-approximating element coupled to two or more of the sets identifies a line that further defines potential source locations at the intersection of the surfaces. A location signal representing those potential locations is generated in accord with parameters of that line. Further functionality provides for generating the location signal as a function of closest intersections of plural ones of the aforementioned lines.
