Method and apparatus for organizing digital media based on face recognition
    11.
    Granted invention patent
    Method and apparatus for organizing digital media based on face recognition (in force)

    Publication No.: US08781178B2

    Publication date: 2014-07-15

    Application No.: US12858097

    Filing date: 2010-08-17

    IPC classification: G06K9/00

    CPC classification: G06K9/00677 G06F17/30247

    Abstract: In one aspect, the present invention is directed to a method and an apparatus for organizing digital media, particularly digital photos, using face recognition. According to a first aspect of the present invention, a computer-based method for organizing digital photos comprises: extracting objects of interest from a plurality of photographs; cropping said plurality of photographs to generate images of isolated objects of interest; applying a recognition algorithm to determine the similarity of isolated objects of interest with a reference; displaying a plurality of objects arranged as a function of the determined similarity; and receiving user input to associate said objects with a particular classification.
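
    The similarity-ordering step in this abstract can be illustrated with a minimal sketch. It assumes each cropped face has already been reduced to a numeric feature vector and uses cosine similarity as the comparison; the abstract names no specific recognition algorithm, so the functions, face ids, vectors, and the "Alice" label below are all hypothetical.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def rank_by_similarity(reference, faces):
    """Return face ids ordered from most to least similar to the reference."""
    scored = [(cosine_similarity(reference, vec), face_id)
              for face_id, vec in faces.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [face_id for _, face_id in scored]

# Hypothetical feature vectors for faces cropped from three photographs.
faces = {
    "img1_face0": [0.9, 0.1, 0.0],
    "img2_face0": [0.1, 0.9, 0.2],
    "img3_face1": [0.8, 0.2, 0.1],
}
reference = [1.0, 0.0, 0.0]

# Display order: most similar faces first.
ordering = rank_by_similarity(reference, faces)

# Simulated user input: the two closest faces are assigned one classification.
labels = {face_id: "Alice" for face_id in ordering[:2]}
```

    Sorting by score keeps likely matches adjacent, so a user can label a run of faces in one action.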

    Dynamically controlling a cursor on a screen when using a video camera as a pointing device
    12.
    Granted invention patent
    Dynamically controlling a cursor on a screen when using a video camera as a pointing device (in force)

    Publication No.: US08144123B2

    Publication date: 2012-03-27

    Application No.: US11838817

    Filing date: 2007-08-14

    IPC classification: G09G5/00 G09G5/08 G06F3/033

    CPC classification: G06F3/0304

    Abstract: A system provides for controlling a cursor on a screen automatically and dynamically when using a video camera as a pointing device. A computer displays static or dynamic content to a screen. A video camera connected to the computer points at the screen. As the video camera films the screen, frames captured by the video camera are sent to the computer. A target image is displayed by the computer onto the screen and marks the position of the screen cursor of the video camera. Frames captured by the video camera include the target image, and the computer dynamically moves the target image on the screen to ensure that the target image stays in the center of the view of the video camera.
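
    The re-centering step in this abstract can be sketched as follows. The sketch assumes the camera sees the screen without rotation or perspective distortion, and that a known scale factor converts frame pixels to screen pixels. The function name and all parameters are hypothetical and not taken from the patent.

```python
def recenter_target(target_xy, detected_xy, frame_size, scale, screen_size):
    """Return the new on-screen target position that puts the target
    back at the center of the camera's view.

    target_xy   -- current target position on the screen (screen pixels)
    detected_xy -- where the target was found in the captured frame
    frame_size  -- (width, height) of the captured frame
    scale       -- assumed screen pixels per frame pixel
    screen_size -- (width, height) of the screen
    """
    center_x, center_y = frame_size[0] / 2, frame_size[1] / 2
    # Offset from the detected target to the frame center, in screen pixels.
    dx = (center_x - detected_xy[0]) * scale
    dy = (center_y - detected_xy[1]) * scale
    # Move the target by that offset, clamped to the visible screen area.
    x = min(max(target_xy[0] + dx, 0), screen_size[0] - 1)
    y = min(max(target_xy[1] + dy, 0), screen_size[1] - 1)
    return (x, y)

# The target was detected 20 frame pixels left of the frame center.
new_pos = recenter_target((500, 400), (300, 240), (640, 480), 2.0, (1920, 1080))
```

    Calling this once per captured frame makes the target follow the point the camera is aimed at, which is what lets the target act as a cursor.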

    System and method for detecting user actions in a video stream
    13.
    Granted invention patent
    System and method for detecting user actions in a video stream (in force)

    Publication No.: US08117544B2

    Publication date: 2012-02-14

    Application No.: US10973198

    Filing date: 2004-10-26

    IPC classification: G06F9/00

    Abstract: Embodiments of the present invention include a video server that can detect and track the image of a pointing indicator in an input video stream representation of a computer display. The video server checks ordered frames of the video signal and determines movements for a pointing indicator such as a mouse arrow. Certain motions by the pointing indicator, such as lingering over a button or menu item or circling a button or menu item, can provoke a control action on the server.
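
    The "lingering" motion in this abstract can be illustrated with a minimal sketch. It assumes the pointer position has already been tracked in each ordered frame and that a button is an axis-aligned rectangle. The function name, the rectangle format, and the frame threshold are hypothetical.

```python
def detect_dwell(positions, region, min_frames):
    """Return the index of the frame at which the pointer has stayed inside
    `region` for `min_frames` consecutive frames, or None if it never does.

    positions -- list of (x, y) pointer positions, one per ordered frame
    region    -- (left, top, right, bottom) bounds of a button or menu item
    """
    left, top, right, bottom = region
    run = 0  # length of the current run of consecutive in-region frames
    for index, (x, y) in enumerate(positions):
        if left <= x <= right and top <= y <= bottom:
            run += 1
            if run >= min_frames:
                return index
        else:
            run = 0  # the pointer left the region, so the dwell restarts
    return None

# Hypothetical tracked positions: the pointer leaves the button once,
# then returns and stays for three frames.
track = [(0, 0), (12, 12), (13, 12), (50, 50), (11, 11), (12, 11), (12, 12)]
button = (10, 10, 20, 20)
trigger_frame = detect_dwell(track, button, min_frames=3)
```

    A server could treat a non-None result as the trigger for a control action on that button.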

    System and method for identifying query-relevant keywords in documents with latent semantic analysis
    14.
    Granted invention patent
    System and method for identifying query-relevant keywords in documents with latent semantic analysis (in force)

    Publication No.: US07440947B2

    Publication date: 2008-10-21

    Application No.: US10987377

    Filing date: 2004-11-12

    IPC classification: G06F7/00

    Abstract: A system and method for identifying query-related keywords in documents found in a search using latent semantic analysis. The documents are represented as a document term matrix M containing one or more document term-weight vectors d, which may be term-frequency (tf) vectors or term-frequency inverse-document-frequency (tf-idf) vectors. This matrix is subjected to a truncated singular value decomposition. The resulting transform matrix U can be used to project a query term-weight vector q into the reduced N-dimensional space, followed by its expansion back into the full vector space using the inverse of U. To perform a search, the similarity of q_expanded is measured relative to each candidate document vector in this space. Exemplary similarity functions are dot product and cosine similarity. Keywords are selected with the highest values in q_expanded that are also comprised in at least one document. Matching keywords from the query may be highlighted in the search results.
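
    The projection and expansion described in this abstract can be written out as a worked sketch under stated assumptions. It assumes M is arranged with terms as rows and documents as columns, that T denotes the number of terms (a symbol introduced here), and that the truncated factor U_N has orthonormal columns so its transpose acts as its pseudo-inverse. The abstract does not say which of U and its inverse is applied at which step, so the assignment below is one plausible reading.

```latex
\begin{align*}
M &\approx U_N \Sigma_N V_N^{\top},
  & U_N &\in \mathbb{R}^{T \times N} \\
q_{\text{reduced}} &= U_N^{\top} q,
  & q_{\text{reduced}} &\in \mathbb{R}^{N} \\
q_{\text{expanded}} &= U_N \, q_{\text{reduced}} = U_N U_N^{\top} q,
  & q_{\text{expanded}} &\in \mathbb{R}^{T} \\
\operatorname{sim}(q_{\text{expanded}}, d) &=
  \frac{q_{\text{expanded}}^{\top} d}
       {\lVert q_{\text{expanded}} \rVert \, \lVert d \rVert}
  & &\text{(cosine similarity)}
\end{align*}
```

    Under this reading, q_expanded can carry nonzero weight on terms that were absent from q but co-occur with its terms across the documents, which is what makes those terms candidates for query-relevant keywords.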

    Method and apparatus for organizing digital media based on face recognition
    15.
    Granted invention patent
    Method and apparatus for organizing digital media based on face recognition (in force)

    Publication No.: US07822233B2

    Publication date: 2010-10-26

    Application No.: US10734259

    Filing date: 2003-12-15

    IPC classification: G06K9/00

    CPC classification: G06K9/00677 G06F17/30247

    Abstract: In one aspect, the present invention is directed to a method and an apparatus for organizing digital media, particularly digital photos, using face recognition. According to a first aspect of the present invention, a computer-based method for organizing digital photos comprises: extracting objects of interest from a plurality of photographs; cropping said plurality of photographs to generate images of isolated objects of interest; applying a recognition algorithm to determine the similarity of isolated objects of interest with a reference; displaying a plurality of objects arranged as a function of the determined similarity; and receiving user input to associate said objects with a particular classification.

    Calendar-based interfaces for browsing and manipulation of digital images
    16.
    Granted invention patent
    Calendar-based interfaces for browsing and manipulation of digital images (in force)

    Publication No.: US07325198B2

    Publication date: 2008-01-29

    Application No.: US10334473

    Filing date: 2002-12-31

    IPC classification: G06F3/00

    CPC classification: G06F17/30274 G06Q10/109

    Abstract: Embodiments of the present invention provide the ability to navigate, view, and manipulate a collection of digital images utilizing a GUI that has the familiar context of a calendar. Graphical objects representative of digital images are displayed within a particular day displayed in a calendar-based GUI. A user may group digital images into groups, modify the date with which a digital image is associated and perform various other manipulations using embodiments of a calendar-based GUI.
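
    The data model behind such a calendar view can be sketched minimally: images are bucketed by the day they are associated with, and changing an image's date moves it between buckets. The function names, file names, and dates below are hypothetical, and the sketch covers only the grouping logic, not the GUI.

```python
from collections import defaultdict
from datetime import date

def group_by_day(images):
    """Map each calendar day to the sorted names of its associated images.

    images -- iterable of (name, day) pairs, where day is a datetime.date
    """
    days = defaultdict(list)
    for name, day in images:
        days[day].append(name)
    return {day: sorted(names) for day, names in days.items()}

def move_image(days, name, old_day, new_day):
    """Re-associate one image with a different day, in place."""
    days[old_day].remove(name)
    if not days[old_day]:
        del days[old_day]  # drop days that no longer hold any image
    days[new_day] = sorted(days.get(new_day, []) + [name])

# Hypothetical collection: two images on one day, one on the next.
images = [
    ("a.jpg", date(2002, 12, 31)),
    ("b.jpg", date(2002, 12, 31)),
    ("c.jpg", date(2003, 1, 1)),
]
days = group_by_day(images)

# Simulated user action: drag c.jpg onto the previous day's cell.
move_image(days, "c.jpg", date(2003, 1, 1), date(2002, 12, 31))
```

    A calendar GUI would render one cell per key of `days` and draw a thumbnail for each name in that cell.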

    Systems and methods for embedding data by dimensional compression and expansion
    17.

    Publication No.: US06999598B2

    Publication date: 2006-02-14

    Application No.: US10104017

    Filing date: 2002-03-25

    IPC classification: G06K9/00

    CPC classification: G06T1/0085 G06T1/0057

    Abstract: The systems and methods of this invention watermark an original data file using dimensional compression and expansion. The original data file extends along a given dimension and has portions that extend along that given dimension. The information is embedded into the data file by selectively dimensionally compressing or expanding a size of each of some or all of the portions along the given dimension, which can be space or time. The portions of the data file are selectively dimensionally expanded or compressed according to a given encoding scheme. This encoding scheme can use the kind of modification, the relationships between the type of modification between adjacent portions, or the duration or degree of compression or expansion to store a portion of the embedded information. The portions of the embedded information can be individual bits of binary or trinary information, or can be a portion of analog information.
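
    One of the simplest encoding schemes this abstract allows, where the kind of modification stores one binary bit per portion, can be sketched as follows. The sketch models each portion only by its size along the given dimension and assumes the decoder has the original sizes for comparison. The function names, the 10% factor, and the sample sizes are hypothetical.

```python
def embed_bits(sizes, bits, factor=0.1):
    """Return new portion sizes with one bit embedded per portion.

    A 1 bit expands the portion by `factor`; a 0 bit compresses it by
    `factor`. Portions beyond len(bits) are left unchanged.
    """
    if len(bits) > len(sizes):
        raise ValueError("more bits than portions to carry them")
    marked = list(sizes)
    for i, bit in enumerate(bits):
        marked[i] = sizes[i] * (1 + factor) if bit else sizes[i] * (1 - factor)
    return marked

def extract_bits(original_sizes, marked_sizes, count):
    """Recover `count` bits by checking whether each portion grew or shrank."""
    return [1 if marked_sizes[i] > original_sizes[i] else 0
            for i in range(count)]

# Hypothetical portion sizes, e.g. durations in ms or widths in pixels.
sizes = [100.0, 100.0, 80.0, 120.0]
payload = [1, 0, 1]

marked = embed_bits(sizes, payload)
recovered = extract_bits(sizes, marked, len(payload))
```

    A practical scheme would also need to resample the underlying data to the new sizes and tolerate measurement noise; both are outside this sketch.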

    Methods and apparatus for source location estimation from microphone-array time-delay estimates
    18.
    Granted invention patent
    Methods and apparatus for source location estimation from microphone-array time-delay estimates (lapsed)

    Publication No.: US06243471B1

    Publication date: 2001-06-05

    Application No.: US09055556

    Filing date: 1998-04-06

    IPC classification: H04R3/00

    CPC classification: H04N7/15 G01V1/001

    Abstract: Apparatus for determining the location of a signal-generating source (e.g., a conferee in a telephone conference) includes at least three sensors (e.g., microphones) arranged in a plurality of sets, each having two or more sensors. A surface-finding element responds to receipt at each sensor set of signals (e.g., speech) from the source for identifying a geometric surface (e.g., the surface of a hyperboloid or cone) representing potential locations of the source as a function of sensor locations and time difference of arrival of the signals. A location-approximating element coupled to two or more of the sets identifies a line that further defines potential source locations at the intersection of the surfaces. A location signal representing those potential locations is generated in accord with parameters of that line. Further functionality generates the location signal as a function of the closest intersections of plural ones of the aforementioned lines.
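
    The cone mentioned in this abstract can be illustrated for a single sensor pair. For a source far from the pair, the path difference c·τ equals d·cos θ, where d is the sensor spacing and θ is the angle between the pair's axis and the direction to the source, so the source lies on a cone of half-angle θ. The sketch below assumes a speed of sound of 343 m/s and defines the delay as arrival time at sensor 2 minus arrival time at sensor 1, with θ measured from the axis pointing from sensor 2 toward sensor 1. The function name and parameters are hypothetical.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value for air at room temperature

def cone_half_angle(delay_s, spacing_m, c=SPEED_OF_SOUND):
    """Return the half-angle (radians) of the far-field cone of possible
    source directions for one sensor pair.

    delay_s   -- arrival time at sensor 2 minus arrival time at sensor 1
    spacing_m -- distance between the two sensors
    """
    ratio = c * delay_s / spacing_m
    # Measurement noise can push the ratio slightly outside [-1, 1].
    ratio = max(-1.0, min(1.0, ratio))
    return math.acos(ratio)

# Zero delay: the source is broadside to the pair (90 degrees off axis).
broadside = cone_half_angle(0.0, 0.5)

# Path difference equal to half the spacing: 60 degrees off axis.
oblique = cone_half_angle(0.25 / SPEED_OF_SOUND, 0.5)
```

    Each sensor pair yields one such surface; intersecting the surfaces from two or more pairs narrows the candidates to a line, as the abstract describes.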

    Dynamically Controlling a Cursor on a Screen when Using a Video Camera as a Pointing Device
    19.
    Invention patent application
    Dynamically Controlling a Cursor on a Screen when Using a Video Camera as a Pointing Device (in force)

    Publication No.: US20090046061A1

    Publication date: 2009-02-19

    Application No.: US11838817

    Filing date: 2007-08-14

    IPC classification: G06F3/033

    CPC classification: G06F3/0304

    Abstract: A system provides for controlling a cursor on a screen automatically and dynamically when using a video camera as a pointing device. A computer displays static or dynamic content to a screen. A video camera connected to the computer points at the screen. As the video camera films the screen, frames captured by the video camera are sent to the computer. A target image is displayed by the computer onto the screen and marks the position of the screen cursor of the video camera. Frames captured by the video camera include the target image, and the computer dynamically moves the target image on the screen to ensure that the target image stays in the center of the view of the video camera.

    Methods and apparatus for source location estimation from microphone-array time-delay estimates
    20.
    Granted invention patent
    Methods and apparatus for source location estimation from microphone-array time-delay estimates (lapsed)

    Publication No.: US5737431A

    Publication date: 1998-04-07

    Application No.: US399427

    Filing date: 1995-03-07

    IPC classification: G01V1/00 H04N7/15 H04R3/00

    CPC classification: H04N7/15 G01V1/001

    Abstract: Apparatus for determining the location of a signal-generating source (e.g., a conferee in a telephone conference) includes at least three sensors (e.g., microphones) arranged in a plurality of sets, each having two or more sensors. A surface-finding element responds to receipt at each sensor set of signals (e.g., speech) from the source for identifying a geometric surface (e.g., the surface of a hyperboloid or cone) representing potential locations of the source as a function of sensor locations and time difference of arrival of the signals. A location-approximating element coupled to two or more of the sets identifies a line that further defines potential source locations at the intersection of the surfaces. A location signal representing those potential locations is generated in accord with parameters of that line. Further functionality generates the location signal as a function of the closest intersections of plural ones of the aforementioned lines.
