专利检索 ap:("Yong Rui" OR "Zicheng Liu") AND inv:"Zicheng Liu" 第 12 页

111.

发明申请
Learning image enhancement 有权
标题翻译：学习图像增强

公开(公告)号：US20080279467A1

公开(公告)日：2008-11-13

申请号：US11801620

申请日：2007-05-10

申请人： Zicheng Liu , Cha Zhang , Zhengyou Zhang

发明人： Zicheng Liu , Cha Zhang , Zhengyou Zhang

IPC分类号： G06K9/40

CPC分类号： G06K9/00234 , H04N1/62 , H04N1/628

摘要： Image enhancement techniques are described to enhance an image in accordance with a set of training images. In an implementation, an image color tone map is generated for a facial region included in an image. The image color tone map may be normalized to a color tone map for a set of training images so that the image color tone map matches the map for the training images. The normalized color tone map may be applied to the image to enhance the in-question image. In further implementations, the procedure may be updated when the average color intensity in non-facial regions differs from an accumulated mean by a threshold amount.

摘要翻译： 描述图像增强技术以根据一组训练图像来增强图像。在实现中，为包括在图像中的面部区域生成图像色调映射。图像色调图可以被归一化为用于一组训练图像的色调图，使得图像色调图匹配训练图像的图。归一化色调图可以应用于图像以增强问题图像。在进一步的实施中，当非面部区域中的平均颜色强度与积累的平均值不同阈值量时，可以更新该过程。

112.

发明申请
MULTI-MODAL DEVICE POWER/MODE MANAGEMENT 有权
标题翻译：多模式设备功率/模式管理

公开(公告)号：US20080126282A1

公开(公告)日：2008-05-29

申请号：US12014419

申请日：2008-01-15

申请人： Michael J. Sinclair , David W. Williams , Zhengyou Zhang , Zicheng Liu

发明人： Michael J. Sinclair , David W. Williams , Zhengyou Zhang , Zicheng Liu

IPC分类号： G06F1/26 , G06N5/00

CPC分类号： G06F1/3203 , G06F1/263 , G06F1/3231 , Y02D10/173

摘要： A system that facilitates managing resources (e.g., functionality, services) based at least in part upon an established context. More particularly, a context determination component can be employed to establish a context by processing sensor inputs or learning/inferring a user action/preference. Once the context is established via context determination component, a power/mode management component can be employed to activate and/or mask resources in accordance with the established context. The power and mode management of the device can extend life of a power source (e.g., battery) and mask functionality in accordance with a user and/or device state.

摘要翻译： 一种有助于至少部分地基于建立的上下文来管理资源（例如，功能，服务）的系统。更具体地，可以采用上下文确定组件来通过处理传感器输入或学习/推断用户动作/偏好来建立上下文。一旦通过上下文确定组件建立了上下文，则可以使用功率/模式管理组件来根据建立的上下文激活和/或掩蔽资源。设备的功率和模式管理可以根据用户和/或设备状态延长电源（例如电池）的寿命和屏蔽功能。

113.

发明授权
Multi-sensory speech enhancement using a clean speech prior 有权
标题翻译：使用干净语音的多感官语音增强

公开(公告)号：US07346504B2

公开(公告)日：2008-03-18

申请号：US11156434

申请日：2005-06-20

申请人： Zicheng Liu , Alejandro Acero , Zhengyou Zhang

发明人： Zicheng Liu , Alejandro Acero , Zhengyou Zhang

IPC分类号： G10L21/02

CPC分类号： H04R3/005 , G10L21/0208 , H04R2460/13

摘要： A method and apparatus determine a channel response for an alternative sensor using an alternative sensor signal, an air conduction microphone signal. The channel response and a prior probability distribution for clean speech values are then used to estimate a clean speech value.

摘要翻译： 方法和装置使用替代传感器信号，空气传导麦克风信号确定替代传感器的信道响应。然后使用信道响应和干净语音值的先验概率分布来估计干净的语音值。

114.

发明申请
Calibration based beamforming, non-linear adaptive filtering, and multi-sensor headset 有权
标题翻译：基于校准的波束成形，非线性自适应滤波和多传感器耳机

公开(公告)号：US20070088544A1

公开(公告)日：2007-04-19

申请号：US11251164

申请日：2005-10-14

申请人： Alejandro Acero , Michael Seltzer , Zhengyou Zhang , Zicheng Liu

发明人： Alejandro Acero , Michael Seltzer , Zhengyou Zhang , Zicheng Liu

IPC分类号： G10L21/02

CPC分类号： G10L21/02 , G10L25/78 , G10L2021/02166

摘要： A first set of signals from an array of one or more microphones, and a second signal from a reference microphone are used to calibrate a set of filter parameters such that the filter parameters minimize a difference between the second signal and a beamformer output signal that is based on the first set of signals. Once calibrated, the filter parameters are used to form a beamformer output signal that is filtered using a non-linear adaptive filter that is adapted based on portions of a signal that do not contain speech, as determined by a speech detection sensor.

摘要翻译： 使用来自一个或多个麦克风的阵列的第一组信号和来自参考麦克风的第二信号来校准一组滤波器参数，使得滤波器参数最小化第二信号与波束形成器输出信号之间的差异，基于第一组信号。一旦被校准，滤波器参数被用于形成使用非线性自适应滤波器进行滤波的波束形成器输出信号，所述非线性自适应滤波器基于由语音检测传感器确定的不包含语音的信号的部分而被适配。

115.

发明授权
System and method for low bandwidth video streaming for face-to-face teleconferencing 失效
标题翻译：用于面对面电话会议的低带宽视频流的系统和方法

公开(公告)号：US07184602B2

公开(公告)日：2007-02-27

申请号：US10428989

申请日：2003-05-02

申请人： Michael Cohen , Zicheng Liu , Zhen Wen , Ke Zheng

发明人： Michael Cohen , Zicheng Liu , Zhen Wen , Ke Zheng

IPC分类号： G06K9/36

CPC分类号： G06T9/001 , G06K9/00248 , H04N7/147 , H04N19/503

摘要： A system and method for facilitating low bandwidth video image transmission in video conferencing systems. A target is acquired (video image of a person's head) and processed to identify one or more sub-regions (e.g., background, eyes, mouth and head). The invention incorporates a fast feature matching methodology to match a current sub-region with previously stored sub-regions. If a match is found, an instruction is sent to the receiving computer to generate the next frame of video data from the previously stored blocks utilizing a texture synthesis technique. The invention is applicable for video conferencing in low bandwidth environments.

摘要翻译： 一种用于促进视频会议系统中低带宽视频图像传输的系统和方法。获取目标（人的头部的视频）并被处理以识别一个或多个子区域（例如，背景，眼睛，嘴和头）。本发明结合了快速特征匹配方法以将当前子区域与先前存储的子区域相匹配。如果发现匹配，则使用纹理合成技术将指令发送到接收计算机以从先前存储的块生成下一帧视频数据。本发明适用于低带宽环境中的视频会议。

116.

发明申请
Multimodal note taking, annotation, and gaming 失效
标题翻译：多模式笔记，注释和游戏

公开(公告)号：US20070022372A1

公开(公告)日：2007-01-25

申请号：US11172127

申请日：2005-06-29

申请人： Zicheng Liu , Zhengyou Zhang , David Kurlander , David Williams

发明人： Zicheng Liu , Zhengyou Zhang , David Kurlander , David Williams

IPC分类号： G06F17/00

CPC分类号： G06F17/30029 , G06F17/30056 , G06K9/222

摘要： A multimodal, multilanguage mobile device which can be employed to enhance note taking and/or annotation of a document, and gaming. Input data types such as optical character recognition (OCR), speech, handwriting, and visual information (e.g., image and/or video), etc., can be fused to generate rich documents with a multidimensional level of data to provide an increased level of context over conventional documents. Such architecture can be utilized by students for homework management, as well as entertainment (e.g., gaming).

摘要翻译： 一种多模式多语言移动设备，可用于增强文档的记录和/或注释以及游戏。可以融合诸如光学字符识别（OCR），语音，手写和视觉信息（例如，图像和/或视频）等的输入数据类型以产生具有多维数据级别的丰富文档，以提供增加的级别与常规文件相关的上下文。这种架构可以由学生用于家庭作业管理以及娱乐（例如游戏）。

117.

发明申请
Multimodal authentication 有权
标题翻译：多模式认证

公开(公告)号：US20070005988A1

公开(公告)日：2007-01-04

申请号：US11171145

申请日：2005-06-29

申请人： Zhengyou Zhang , David Williams , Yuan Kong , Zicheng Liu , David Kurlander , Mike Sinclair

发明人： Zhengyou Zhang , David Williams , Yuan Kong , Zicheng Liu , David Kurlander , Mike Sinclair

IPC分类号： H04L9/32 , H04K1/00 , G06K9/00 , H04L9/00 , G06F17/30 , G06F7/04 , G06F12/14 , G06F12/00 , G06F13/00 , G06F7/58 , G06K19/00 , G11C7/00

CPC分类号： H04L63/08 , G06F21/32 , G06K9/00885 , G06K9/6293 , H04L63/0861

摘要： A multimodal system that employs a plurality of sensing modalities which can be processed concurrently to increase confidence in connection with authentication. The multimodal system and/or set of various devices can provide several points of information entry in connection with authentication. Authentication can be improved, for example, by combining face recognition, biometrics, speech recognition, handwriting recognition, gait recognition, retina scan, thumb/hand prints, or subsets thereof. Additionally, portable multimodal devices (e.g., a smartphone) can be used as credit cards, and authentication in connection with such use can mitigate unauthorized transactions.

摘要翻译： 采用多个感测模式的多模式系统，可以同时处理以增加与认证相关联的置信度。多模式系统和/或各种设备的集合可以提供与认证相关联的多个信息点。可以通过组合人脸识别，生物识别，语音识别，手写识别，步态识别，视网膜扫描，拇指/手印或其子集来改进认证。此外，便携式多模式设备（例如，智能电话）可以用作信用卡，并且与此类使用相关的认证可以减轻未经授权的交易。

118.

发明授权
Rapid computer modeling of faces for animation 有权
标题翻译：动画面部快速计算机建模

公开(公告)号：US07158658B2

公开(公告)日：2007-01-02

申请号：US11119597

申请日：2005-05-02

申请人： Zicheng Liu , Zhengyou Zhang , Michael F. Cohen , Charles E. Jacobs

发明人： Zicheng Liu , Zhengyou Zhang , Michael F. Cohen , Charles E. Jacobs

IPC分类号： G06K9/00

CPC分类号： G06T13/40 , G06K9/00201 , G06K9/00248 , G06K9/00268 , G06K9/00281 , G06T7/251 , G06T7/55 , G06T7/579 , G06T7/74 , G06T9/001 , G06T15/205 , G06T17/00 , G06T17/10 , G06T17/20 , G06T2200/08 , G06T2207/10012 , G06T2207/10016 , G06T2207/10021 , G06T2207/30201 , H04N19/162 , H04N19/503

摘要： Described herein is a technique for creating a 3D face model using images obtained from an inexpensive camera associated with a general-purpose computer. Two still images of the user are captured, and two video sequences. The user is asked to identify five facial features, which are used to calculate a mask and to perform fitting operations. Based on a comparison of the still images, deformation vectors are applied to a neutral face model to create the 3D model. The video sequences are used to create a texture map. The process of creating the texture map references the previously obtained 3D model to determine poses of the sequential video images.

摘要翻译： 这里描述了使用从与通用计算机相关联的廉价摄像机获得的图像来创建3D人脸模型的技术。捕获用户的两个静止图像，以及两个视频序列。要求用户识别五个面部特征，用于计算面罩并执行装配操作。基于静止图像的比较，将变形向量应用于中性面部模型以创建3D模型。视频序列用于创建纹理贴图。创建纹理图的过程参考先前获得的3D模型以确定顺序视频图像的姿态。

119.

发明申请
Multi-sensory speech enhancement using a clean speech prior 有权
标题翻译：使用干净语音的多感官语音增强

公开(公告)号：US20060287852A1

公开(公告)日：2006-12-21

申请号：US11156434

申请日：2005-06-20

申请人： Zicheng Liu , Alejandro Acero , Zhengyou Zhang

发明人： Zicheng Liu , Alejandro Acero , Zhengyou Zhang

IPC分类号： G10L21/02

CPC分类号： H04R3/005 , G10L21/0208 , H04R2460/13

摘要： A method and apparatus determine a channel response for an alternative sensor using an alternative sensor signal, an air conduction microphone signal. The channel response and a prior probability distribution for clean speech values are then used to estimate a clean speech value.

摘要翻译： 方法和装置使用替代传感器信号，空气传导麦克风信号确定替代传感器的信道响应。然后使用信道响应和干净语音值的先验概率分布来估计干净的语音值。

120.

发明授权
Real-time wide-angle image correction system and method for computer image viewing 有权

公开(公告)号：US07113650B2

公开(公告)日：2006-09-26

申请号：US11193274

申请日：2005-07-30

申请人： Zicheng Liu , Michael Cohen

发明人： Zicheng Liu , Michael Cohen

IPC分类号： G06K9/36 , G09G5/00 , H04N5/225

CPC分类号： G06K9/00228 , G06T3/0062 , G06T5/006

摘要： The present invention includes a real-time wide-angle image correction system and a method for alleviating distortion and perception problems in images captured by wide-angle cameras. In general, the real-time wide-angle image correction method generates warp table from pixel coordinates of a wide-angle image and applies the warp table to the wide-angle image to create a corrected wide-angle image. The corrections are performed using a parametric class of warping functions that include Spatially Varying Uniform (SVU) scaling functions. The SVU scaling functions and scaling factors are used to perform vertical scaling and horizontal scaling on the wide-angle image pixel coordinates. A horizontal distortion correction is performed using the SVU scaling functions at and at least two different scaling factors. This processing generates a warp table that can be applied to the wide-angle image to yield the corrected wide-angle image.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类