-
公开(公告)号:US08339459B2
公开(公告)日:2012-12-25
申请号:US12561154
申请日:2009-09-16
申请人: Zhengyou Zhang , Aswin Sankaranarayanan , Qing Zhang , Zicheng Liu , Qin Cai
发明人: Zhengyou Zhang , Aswin Sankaranarayanan , Qing Zhang , Zicheng Liu , Qin Cai
IPC分类号: H04N5/225
CPC分类号: H04N7/142 , G06K9/00261 , G06T7/75 , G06T7/85 , G06T2207/10012 , G06T2207/30201
摘要: Techniques and technologies for tracking a face with a plurality of cameras wherein a geometry between the cameras is initially unknown. One disclosed method includes detecting a head with two of the cameras and registering a head model with the image of the head (as detected by one of the cameras). The method also includes back projecting the other detected face image to the head model and determining a head pose from the back-projected head image. Furthermore, the determined geometry is used to track the face with at least one of the cameras.
摘要翻译: 用于跟踪具有多个相机的面部的技术和技术,其中相机之间的几何形状最初是未知的。 一种公开的方法包括使用两个摄像机检测头部,并用头部的图像(由相机之一检测到)登记头部模型。 该方法还包括将另一个检测到的脸部图像反投影到头部模型并且从后投影的头部图像确定头部姿势。 此外,所确定的几何形状用于利用至少一个相机跟踪面部。
-
公开(公告)号:US20110063403A1
公开(公告)日:2011-03-17
申请号:US12561154
申请日:2009-09-16
申请人: Zhengyou Zhang , Aswin Sankaranarayanan , Qing Zhang , Zicheng Liu , Qin Cai
发明人: Zhengyou Zhang , Aswin Sankaranarayanan , Qing Zhang , Zicheng Liu , Qin Cai
CPC分类号: H04N7/142 , G06K9/00261 , G06T7/75 , G06T7/85 , G06T2207/10012 , G06T2207/30201
摘要: Techniques and technologies for tracking a face with a plurality of cameras wherein a geometry between the cameras is initially unknown. One disclosed method includes detecting a head with two of the cameras and registering a head model with the image of the head (as detected by one of the cameras). The method also includes back projecting the other detected face image to the head model and determining a head pose from the back-projected head image. Furthermore, the determined geometry is used to track the face with at least one of the cameras.
摘要翻译: 用于跟踪具有多个相机的面部的技术和技术,其中相机之间的几何形状最初是未知的。 一种公开的方法包括使用两个摄像机检测头部,并用头部的图像(由相机之一检测到)登记头部模型。 该方法还包括将另一个检测到的脸部图像反投影到头部模型并且从后投影的头部图像确定头部姿势。 此外,所确定的几何形状用于利用至少一个相机跟踪面部。
-
公开(公告)号:US20100074433A1
公开(公告)日:2010-03-25
申请号:US12235104
申请日:2008-09-22
申请人: Zhengyou Zhang , Qin Cai
发明人: Zhengyou Zhang , Qin Cai
CPC分类号: H04M9/082
摘要: A multi-party spatial audio conferencing system is configured to receive far end signals from remote participants. The system comprises a speaker array that outputs spatialized sound signals and one or more microphones that capture and relay a sound signal comprising an echo of the spatialized sound signal to a multichannel acoustic echo cancellation (MC-AEC) unit having a plurality of echo cancellers. Respective echo cancellers perform cancellation of an echo associated with a far end signal from one of the multiple participants according to an algorithm based upon echo cancellation coefficients. The echo cancellation coefficients are determined from the input channel signals, the spatialization parameters associated with each input channel, and the audio signals captured by the microphones. This allows respective echo cancellation filters to be updated simultaneously even though the corresponding remote participant is not talking.
摘要翻译: 多方空间音频会议系统被配置为从远程参与者接收远端信号。 该系统包括输出空间声音信号的扬声器阵列和一个或多个麦克风,其捕获并将包括空间化声音信号的回波的声音信号中继到具有多个回声消除器的多通道声学回声消除(MC-AEC)单元。 相应的回波消除器根据基于回声消除系数的算法来执行与多个参与者之一的远端信号相关联的回波的消除。 回波消除系数由输入通道信号,与每个输入通道相关联的空间参数以及由麦克风捕获的音频信号确定。 这允许相应的回声消除滤波器被同时更新,即使相应的远程参与者不在说话。
-
公开(公告)号:US08675926B2
公开(公告)日:2014-03-18
申请号:US12796470
申请日:2010-06-08
申请人: Zhengyou Zhang , Qin Cai , Pieter R. Kasselman , Arthur H. Baker
发明人: Zhengyou Zhang , Qin Cai , Pieter R. Kasselman , Arthur H. Baker
IPC分类号: G06K9/00
CPC分类号: G06K9/00228 , G06K9/00906
摘要: Multiple images including a face presented by a user are accessed. One or more determinations are made based on the multiple images, such as a determination of whether the face included in the multiple images is a 3-dimensional structure or a flat surface and/or a determination of whether motion is present in one or more face components (e.g., eyes or mouth). If it is determined that the face included in the multiple images is a 3-dimensional structure or that that motion is present in the one or more face components, then an indication is provided that the user can be authenticated. However, if it is determined that the face included in the multiple images is a flat surface or that motion is not present in the one or more face components, then an indication is provided that the user cannot be authenticated.
摘要翻译: 访问包括用户呈现的脸部的多个图像。 基于多个图像进行一个或多个确定,例如确定包括在多个图像中的面是三维结构还是平面,和/或确定运动是否存在于一个或多个面中 组分(如眼睛或嘴巴)。 如果确定包括在多个图像中的面是三维结构或者该一个或多个面部组件中存在该运动,则提供用户可被认证的指示。 然而,如果确定包括在多个图像中的面是平面或者一个或多个面部组件中不存在运动,则提供用户不能被认证的指示。
-
公开(公告)号:US20110299741A1
公开(公告)日:2011-12-08
申请号:US12796470
申请日:2010-06-08
申请人: Zhengyou Zhang , Qin Cai , Pieter R. Kasselman , Arthur H. Baker
发明人: Zhengyou Zhang , Qin Cai , Pieter R. Kasselman , Arthur H. Baker
IPC分类号: G06K9/00
CPC分类号: G06K9/00228 , G06K9/00906
摘要: Multiple images including a face presented by a user are accessed. One or more determinations are made based on the multiple images, such as a determination of whether the face included in the multiple images is a 3-dimensional structure or a flat surface and/or a determination of whether motion is present in one or more face components (e.g., eyes or mouth). If it is determined that the face included in the multiple images is a 3-dimensional structure or that that motion is present in the one or more face components, then an indication is provided that the user can be authenticated. However, if it is determined that the face included in the multiple images is a flat surface or that motion is not present in the one or more face components, then an indication is provided that the user cannot be authenticated.
摘要翻译: 访问包括用户呈现的脸部的多个图像。 基于多个图像进行一个或多个确定,例如确定包括在多个图像中的面是三维结构还是平面,和/或确定运动是否存在于一个或多个面中 组分(如眼睛或嘴巴)。 如果确定包括在多个图像中的面是三维结构或者该一个或多个面部组件中存在该运动,则提供用户可被认证的指示。 然而,如果确定包括在多个图像中的面是平面或者一个或多个面部组件中不存在运动,则提供用户不能被认证的指示。
-
公开(公告)号:US08605890B2
公开(公告)日:2013-12-10
申请号:US12235104
申请日:2008-09-22
申请人: Zhengyou Zhang , Qin Cai
发明人: Zhengyou Zhang , Qin Cai
IPC分类号: H04M9/08
CPC分类号: H04M9/082
摘要: A multi-party spatial audio conferencing system is configured to receive far end signals from remote participants. The system comprises a speaker array that outputs spatialized sound signals and one or more microphones that capture and relay a sound signal comprising an echo of the spatialized sound signal to a multichannel acoustic echo cancellation (MC-AEC) unit having a plurality of echo cancellers. Respective echo cancellers perform cancellation of an echo associated with a far end signal from one of the multiple participants according to an algorithm based upon echo cancellation coefficients. The echo cancellation coefficients are determined from the input channel signals, the spatialization parameters associated with each input channel, and the audio signals captured by the microphones. This allows respective echo cancellation filters to be updated simultaneously even though the corresponding remote participant is not talking.
摘要翻译: 多方空间音频会议系统被配置为从远程参与者接收远端信号。 该系统包括输出空间声音信号的扬声器阵列和一个或多个麦克风,其捕获并将包括空间化声音信号的回波的声音信号中继到具有多个回声消除器的多通道声学回声消除(MC-AEC)单元。 相应的回波消除器根据基于回声消除系数的算法来执行与多个参与者之一的远端信号相关联的回波的消除。 回波消除系数由输入通道信号,与每个输入通道相关联的空间参数以及由麦克风捕获的音频信号确定。 这允许相应的回声消除滤波器被同时更新,即使相应的远程参与者不在说话。
-
公开(公告)号:US07844456B2
公开(公告)日:2010-11-30
申请号:US11716210
申请日:2007-03-09
申请人: Qin Cai , John Hamaker
发明人: Qin Cai , John Hamaker
摘要: Architecture for testing an application grammar for the presence of confusable terms. A grammar confusability metric (GCM) is generated for describing a likelihood that a reference term will be confused by the speech recognizer with another term phrase currently allowed by active grammar rules. The GCM is used to flag processing of two phrases in the grammar that have different semantic meaning, but that the speech recognizer could have difficulty distinguishing reliably. A built-in acoustic model is analyzed and feature vectors generated that are close to the acoustic properties of the input term. The feature vectors are then sent for recognition. A statistically random sampling method is applied to explore the acoustic properties of feature vectors of the input term phrase spatially and temporally. The feature vectors are perturbed in the neighborhood of the time domain and the Gaussian mixture model to which the feature vectors belong.
摘要翻译: 用于测试应用程序语法的架构,用于存在混淆的术语。 生成语法混淆度量(GCM),用于描述参考术语将被语音识别器与当前被活动语法规则允许的另一术语短语混淆的可能性。 GCM用于标记具有不同语义含义的语法中的两个短语的处理,但语音识别器可能难以区分可靠。 分析内置的声学模型,生成与输入项的声学特性接近的特征向量。 然后发送特征向量进行识别。 应用统计学随机抽样方法,在空间和时间上探索输入项短语的特征向量的声学特性。 特征向量在时域附近被扰乱,特征向量属于该高斯混合模型。
-
公开(公告)号:US20050108176A1
公开(公告)日:2005-05-19
申请号:US10836646
申请日:2004-04-30
申请人: Scott Jarol , Erik Selberg , Randy Meyerson , Qin Cai , Chelsea Krueger , David Hedbor
发明人: Scott Jarol , Erik Selberg , Randy Meyerson , Qin Cai , Chelsea Krueger , David Hedbor
CPC分类号: G06Q30/02
摘要: In accordance with at least one aspect of the present invention, a content provider functions as a content rights broker facilitating the acquisition of content item consumption rights from one or more licensing or rights provisioning authorities on behalf of consumers, based upon configurable business rules logic used by the content provider to derive appropriate sets of rights based upon a content item to be consumed.
摘要翻译: 根据本发明的至少一个方面,内容提供者作为内容权利代理人,基于所使用的可配置业务规则逻辑,便于代表消费者从一个或多个许可或权利提供机构获取内容项消费权 由内容提供商基于要消费的内容项导出适当的权限集合。
-
公开(公告)号:US07748030B1
公开(公告)日:2010-06-29
申请号:US10608278
申请日:2003-06-27
申请人: Erik W. Selberg , Qin Cai , David B. Hedbor , Chelsea C. Krueger
发明人: Erik W. Selberg , Qin Cai , David B. Hedbor , Chelsea C. Krueger
IPC分类号: G06F17/30
摘要: Systems and methods for packaging, delivery, and use of digital content. In one embodiment, a license matrix is created and used for evaluating the license rights applicable to a requested use of a content item. The license matrix can comprise a plurality of licenses grouped in license chains, and each license can include a set of entitlement rules and a set of grants. A license chain can be associated with an identifier corresponding to a use of content. In another embodiment, it is provided a method of packaging unsecured content for secured delivery over a computer network. In yet another embodiment, a system and method are provided for applying digital management rules to a content item at the time a user requests a use of the content item. In some embodiments, systems and methods allow evaluation of a requested use against a group of licenses to determine the applicable license rights for the requested use.
摘要翻译: 数字内容的包装,传送和使用的系统和方法。 在一个实施例中,创建许可证矩阵并用于评估适用于所请求的使用内容项的许可权。 许可证矩阵可以包括分组在许可证链中的多个许可证,并且每个许可证可以包括一组授权规则和一组授权。 许可证链可以与对应于使用内容的标识符相关联。 在另一个实施例中,提供了一种包装不安全内容以在计算机网络上进行安全传递的方法。 在另一个实施例中,提供了一种系统和方法,用于在用户请求使用内容项时将数字管理规则应用于内容项。 在一些实施例中,系统和方法允许针对一组许可证来评估所请求的使用以确定所请求使用的适用许可权。
-
-
-
-
-
-
-
-