专利检索 ap:("Wei-ge Chen" OR "Zhengyou Zhang" OR "Yoomi Hur") AND inv:"Zhengyou Zhang" 第 1 页

1.

发明申请
VIRTUAL AUDIO ENVIRONMENT FOR MULTIDIMENSIONAL CONFERENCING 有权
标题翻译：多媒体会议的虚拟音频环境

公开(公告)号：US20120155680A1

公开(公告)日：2012-06-21

申请号：US12970964

申请日：2010-12-17

申请人： Wei-ge Chen , Zhengyou Zhang , Yoomi Hur

发明人： Wei-ge Chen , Zhengyou Zhang , Yoomi Hur

IPC分类号： H04R5/02

CPC分类号： H04R27/00 , H04R2227/003 , H04R2227/005

摘要： The disclosed architecture employs signal processing techniques to provide audio perception only, or audio perception that matches the visual perception. This also provides spatial audio reproduction for multiparty teleconferencing such that the teleconferencing participants perceive themselves as if they were sitting in the same room. The solution is based on the premise that people perceive sounds as a reconstructed wavefront, and hence, the wavefronts are used to provide the spatial perceptual cues. The differences between the spatial perceptual cues derived from the reconstructed wavefront of sound waves and the ideal wavefront of sound waves form an objective metric for spatial perceptual quality, and provide the means of evaluating the overall system performance. Additionally, compensation filters are employed to improve the spatial perceptual quality of stereophonic systems by optimizing the objective metrics.

摘要翻译： 所公开的架构采用信号处理技术来仅提供音频感知，或者与视觉感知匹配的音频感知。这也为多方电话会议提供了空间音频再现，使得电话会议参与者将自己视为坐在同一个房间中。解决方案是基于人们将声音视为重建波前的前提，因此波前用于提供空间感知线索。从声波重构波前衍生的空间感知线索与声波理想波阵面之间的差异形成了空间感知质量的客观指标，并提供了评估整体系统性能的手段。另外，通过优化客观指标，采用补偿滤波器来提高立体声系统的空间感知质量。

2.

发明授权
Virtual audio environment for multidimensional conferencing 有权
标题翻译：用于多维会议的虚拟音频环境

公开(公告)号：US08693713B2

公开(公告)日：2014-04-08

申请号：US12970964

申请日：2010-12-17

申请人： Wei-ge Chen , Zhengyou Zhang , Yoomi Hur

发明人： Wei-ge Chen , Zhengyou Zhang , Yoomi Hur

IPC分类号： H04R5/02

CPC分类号： H04R27/00 , H04R2227/003 , H04R2227/005

摘要： The disclosed architecture employs signal processing techniques to provide audio perception only, or audio perception that matches the visual perception. This also provides spatial audio reproduction for multiparty teleconferencing such that the teleconferencing participants perceive themselves as if they were sitting in the same room. The solution is based on the premise that people perceive sounds as a reconstructed wavefront, and hence, the wavefronts are used to provide the spatial perceptual cues. The differences between the spatial perceptual cues derived from the reconstructed wavefront of sound waves and the ideal wavefront of sound waves form an objective metric for spatial perceptual quality, and provide the means of evaluating the overall system performance. Additionally, compensation filters are employed to improve the spatial perceptual quality of stereophonic systems by optimizing the objective metrics.

摘要翻译： 所公开的架构采用信号处理技术来仅提供音频感知，或者与视觉感知匹配的音频感知。这也为多方电话会议提供了空间音频再现，使得电话会议参与者将自己视为坐在同一个房间中。解决方案是基于人们将声音视为重建波前的前提，因此波前用于提供空间感知线索。从声波重构波前衍生的空间感知线索与声波理想波阵面之间的差异形成了空间感知质量的客观指标，并提供了评估整体系统性能的手段。另外，通过优化客观指标，采用补偿滤波器来提高立体声系统的空间感知质量。

3.

发明授权
Spatialized audio over headphones 有权
标题翻译：通过耳机进行空间化音频

公开(公告)号：US08737648B2

公开(公告)日：2014-05-27

申请号：US12472080

申请日：2009-05-26

申请人： Wei-ge Chen , Zhengyou Zhang

发明人： Wei-ge Chen , Zhengyou Zhang

IPC分类号： H04R5/02

CPC分类号： H04R27/00

摘要： A spatial element is added to communications, including over telephone conference calls heard through headphones or a stereo speaker setup. Functions are created to modify signals from different callers to create the illusion that the callers are speaking from different parts of the room.

摘要翻译： 一个空间元素添加到通信中，包括通过耳机听到的电话会议通话或立体声扬声器设置。创建功能来修改来自不同呼叫者的信号，以创建呼叫者从房间的不同部分讲话的错觉。

4.

发明申请
HARMONICITY-BASED SINGLE-CHANNEL SPEECH QUALITY ESTIMATION 有权
标题翻译：基于谐波的单通道语音质量估计

公开(公告)号：US20130151244A1

公开(公告)日：2013-06-13

申请号：US13316430

申请日：2011-12-09

申请人： Wei-ge Chen , Zhengyou Zhang , Jaemo Yang

发明人： Wei-ge Chen , Zhengyou Zhang , Jaemo Yang

IPC分类号： G10L19/14

CPC分类号： G10L25/69

摘要： Speech quality estimation technique embodiments are described which generally involve estimating the human speech quality of an audio frame in a single-channel audio signal. A representation of a harmonic component of the frame is synthesized and used to compute a non-harmonic component of the frame. The synthesized harmonic component representation and the non-harmonic component are then used to compute a harmonic to non-harmonic ratio (HnHR). This HnHR is indicative of the quality of a user's speech and is designated as an estimate of the speech quality of the frame. In one implementation, the HnHR is used to establish a minimum speech quality threshold below which the quality of the user's speech is considered unacceptable. Feedback to the user is then provided based on whether the HnHR falls below the threshold.

摘要翻译： 描述了通常涉及在单声道音频信号中估计音频帧的人类语音质量的语音质量估计技术实施例。合成帧的谐波分量的表示，并用于计算帧的非谐波分量。然后使用合成谐波分量表示和非谐波分量来计算谐波到非谐波比（HnHR）。该HnHR表示用户语音的质量，并且被指定为帧的语音质量的估计。在一个实现中，HnHR用于建立最小语音质量阈值，低于该最低语音质量阈值，用户语音的质量被认为是不可接受的。然后基于HnHR是否低于阈值来提供对用户的反馈。

5.

发明申请
SPATIALIZED AUDIO OVER HEADPHONES 有权
标题翻译：耳机上的空间音频

公开(公告)号：US20100303266A1

公开(公告)日：2010-12-02

申请号：US12472080

申请日：2009-05-26

申请人： Wei-ge Chen , Zhengyou Zhang

发明人： Wei-ge Chen , Zhengyou Zhang

IPC分类号： H04R5/02

CPC分类号： H04R27/00

摘要： A spatial element is added to communications, including over telephone conference calls heard through headphones or a stereo speaker setup. Functions are created to modify signals from different callers to create the illusion that the callers are speaking from different parts of the room.

摘要翻译： 一个空间元素添加到通信中，包括通过耳机听到的电话会议通话或立体声扬声器设置。创建功能来修改来自不同呼叫者的信号，以创建呼叫者从房间的不同部分讲话的错觉。

6.

发明授权
Harmonicity-based single-channel speech quality estimation 有权
标题翻译：基于谐波的单通道语音质量估计

公开(公告)号：US08731911B2

公开(公告)日：2014-05-20

申请号：US13316430

申请日：2011-12-09

申请人： Wei-ge Chen , Zhengyou Zhang , Jaemo Yang

发明人： Wei-ge Chen , Zhengyou Zhang , Jaemo Yang

IPC分类号： G10L21/00

CPC分类号： G10L25/69

摘要： Speech quality estimation technique embodiments are described which generally involve estimating the human speech quality of an audio frame in a single-channel audio signal. A representation of a harmonic component of the frame is synthesized and used to compute a non-harmonic component of the frame. The synthesized harmonic component representation and the non-harmonic component are then used to compute a harmonic to non-harmonic ratio (HnHR). This HnHR is indicative of the quality of a user's speech and is designated as an estimate of the speech quality of the frame. In one implementation, the HnHR is used to establish a minimum speech quality threshold below which the quality of the user's speech is considered unacceptable. Feedback to the user is then provided based on whether the HnHR falls below the threshold.

摘要翻译： 描述了通常涉及在单声道音频信号中估计音频帧的人类语音质量的语音质量估计技术实施例。合成帧的谐波分量的表示，并用于计算帧的非谐波分量。然后使用合成谐波分量表示和非谐波分量来计算谐波到非谐波比（HnHR）。该HnHR表示用户语音的质量，并且被指定为帧的语音质量的估计。在一个实现中，HnHR用于建立最小语音质量阈值，低于该最低语音质量阈值，用户语音的质量被认为是不可接受的。然后基于HnHR是否低于阈值来提供对用户的反馈。

7.

发明申请
STEREOPHONIC TELECONFERENCING USING A MICROPHONE ARRAY 审中-公开
标题翻译：使用麦克风阵列的立体声电话

公开(公告)号：US20120262536A1

公开(公告)日：2012-10-18

申请号：US13086632

申请日：2011-04-14

申请人： Wei-ge Chen , Zhengyou Zhang

发明人： Wei-ge Chen , Zhengyou Zhang

IPC分类号： H04N7/14 , H04R5/02 , H04R5/00

CPC分类号： H04M3/568 , H04M2203/509 , H04N7/15 , H04R1/406 , H04R2201/401 , H04S7/30 , H04S2400/11 , H04S2420/01

摘要： Stereophonic teleconferencing system embodiments are described which advantageously employ a microphone array at a remote conference site having multiple conferencees to produce a separate output channel from the each microphone in the array. Audio data streams each representing one of the audio output channels from the microphone array are then sent to a local conference site where a local conferencee is in attendance. The voices of the aforementioned remote conferencees are spatialized within a sound-field of the local site using multiple loudspeakers. Generally, this involves receiving the monophonic audio data streams from the remote site, and processing them to generate an audio signal for each loudspeaker. Each of the generated audio signals is then played through its respective loudspeaker to produce a spatial audio sound-field which is audibly perceived by the local conferencee as having the voice of each of the remote conferencees coming from a different location.

摘要翻译： 描述了立体声电话会议系统实施例，其有利地在具有多个会议的远程会议站采用麦克风阵列，以从阵列中的每个麦克风产生单独的输出通道。然后将每个表示来自麦克风阵列的音频输出声道之一的音频数据流发送到本地会议室出席的本地会议现场。使用多个扬声器，上述远程会议的声音在本地站点的声场内被空间化。通常，这涉及从远程站点接收单声道音频数据流，并且处理它们以产生每个扬声器的音频信号。然后通过其相应的扬声器播放所生成的每个音频信号，以产生由本地会议室听得见的具有每个远程会议的声音来自不同位置的空间音频声场。

8.

发明授权
Dynamic hand gesture recognition using depth data 有权
标题翻译：使用深度数据的动态手势识别

公开(公告)号：US09536135B2

公开(公告)日：2017-01-03

申请号：US13526501

申请日：2012-06-18

申请人： Zhengyou Zhang , Alexey Vladimirovich Kurakin

发明人： Zhengyou Zhang , Alexey Vladimirovich Kurakin

IPC分类号： G06K9/00

CPC分类号： G06F3/017 , G06K9/00355 , G06K9/6277 , G06K9/6297

摘要： The subject disclosure is directed towards a technology by which dynamic hand gestures are recognized by processing depth data, including in real-time. In an offline stage, a classifier is trained from feature values extracted from frames of depth data that are associated with intended hand gestures. In an online stage, a feature extractor extracts feature values from sensed depth data that corresponds to an unknown hand gesture. These feature values are input to the classifier as a feature vector to receive a recognition result of the unknown hand gesture. The technology may be used in real time, and may be robust to variations in lighting, hand orientation, and the user's gesturing speed and style.

摘要翻译： 主题公开涉及一种通过处理深度数据（包括实时）来识别动态手势的技术。在离线阶段，从与预期的手势相关联的深度数据的帧中提取的特征值训练分类器。在在线阶段，特征提取器从对应于未知手势的感测深度数据中提取特征值。将这些特征值作为特征向量输入到分类器，以接收未知手势的识别结果。该技术可以实时使用，并且对于照明，手取向和用户的手势速度和风格的变化可能是鲁棒的。

9.

发明授权
Data buddy 有权
标题翻译：资料好友

公开(公告)号：US09055607B2

公开(公告)日：2015-06-09

申请号：US12323570

申请日：2008-11-26

申请人： Michael J. Sinclair , Yuan Kong , Zhengyou Zhang , Behrooz Chitsaz , David W. Williams , Silviu-Petru Cucerzan , Zicheng Liu

发明人： Michael J. Sinclair , Yuan Kong , Zhengyou Zhang , Behrooz Chitsaz , David W. Williams , Silviu-Petru Cucerzan , Zicheng Liu

IPC分类号： H04M1/00 , H04W88/06 , H04M1/725 , H04W8/24 , H04W92/02 , H04W92/10

CPC分类号： H04W88/06 , H04M1/72572 , H04M2250/12 , H04M2250/58 , H04W8/245 , H04W92/02 , H04W92/10

摘要： Multi-modal, multi-lingual devices can be employed to consolidate numerous items including, but not limited to, keys, remote controls, image capture devices, audio recorders, cellular telephone functionalities, location/direction detectors, health monitors, calendars, gaming devices, smart home inputs, pens, optical pointing devices or the like. For example, a corner of a cellular telephone can be used as an electronic pen. Moreover, the device can be used to snap multiple pictures stitching them together to create a panoramic image. A device can automate ignition of an automobile, initiate appliances, etc. based upon relative distance. The device can provide for near to eye capabilities for enhanced image viewing. Multiple cameras/sensors can be provided on a single device to provide for stereoscopic capabilities. The device can also provide assistance to blind, privacy, etc. by consolidating services.

摘要翻译： 可以使用多模式，多语言设备来整合许多项目，包括但不限于键，遥控器，图像捕获设备，音频记录器，蜂窝电话功能，位置/方向检测器，健康监视器，日历，游戏设备智能家庭输入，笔，光学指向装置等。例如，蜂窝电话的角落可以用作电子笔。此外，该设备可以用于将多个图片拼接在一起以创建全景图像。设备可以基于相对距离自动点火汽车，起动电器等。该设备可以提供近眼睛的功能，以增强图像观看效果。可以在单个设备上提供多个摄像机/传感器以提供立体能力。该设备还可以通过整合服务来提供盲人，隐私等方面的帮助。

10.

发明授权
Ambulatory presence features 有权
标题翻译：动态存在功能

公开(公告)号：US08941710B2

公开(公告)日：2015-01-27

申请号：US13584633

申请日：2012-08-13

申请人： Christian Huitema , William A. S. Buxton , Jonathan E. Paff , Zicheng Liu , Rajesh Kutpadi Hegde , Zhengyou Zhang , Kori Marie Quinn , Jin Li , Michel Pahud

发明人： Christian Huitema , William A. S. Buxton , Jonathan E. Paff , Zicheng Liu , Rajesh Kutpadi Hegde , Zhengyou Zhang , Kori Marie Quinn , Jin Li , Michel Pahud

IPC分类号： H04N7/15 , H04N7/14 , H04N21/422 , H04N21/4223 , H04N21/442 , H04N21/4788 , H04L12/18

CPC分类号： H04N7/147 , H04L12/1827 , H04N7/142 , H04N7/15 , H04N21/42203 , H04N21/4223 , H04N21/44213 , H04N21/4788 , H04N2007/145

摘要： A system facilitates managing one or more devices utilized for communicating data within a telepresence session. A telepresence session can be initiated within a communication framework that includes a first user and one or more second users. In response to determining a temporary absence of the first user from the telepresence session, a recordation of the telepresence session is initialized to enable a playback of a portion or a summary of the telepresence session that the first user has missed.

摘要翻译： 系统便于管理用于在远程呈现会话内传送数据的一个或多个设备。可以在包括第一用户和一个或多个第二用户的通信框架内启动远程呈现会话。响应于从远程呈现会话确定暂时不存在第一用户，初始化远程呈现会话的记录，以便能够播放第一用户已经错过的远程呈现会话的部分或摘要。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类