VIRTUAL AUDIO ENVIRONMENT FOR MULTIDIMENSIONAL CONFERENCING
    1.
    Patent application
    VIRTUAL AUDIO ENVIRONMENT FOR MULTIDIMENSIONAL CONFERENCING (in force)

    Publication number: US20120155680A1

    Publication date: 2012-06-21

    Application number: US12970964

    Filing date: 2010-12-17

    IPC classification: H04R5/02

    Abstract: The disclosed architecture employs signal processing techniques to provide audio perception only, or audio perception that matches the visual perception. This also provides spatial audio reproduction for multiparty teleconferencing such that the teleconferencing participants perceive themselves as if they were sitting in the same room. The solution is based on the premise that people perceive sounds as a reconstructed wavefront, and hence, the wavefronts are used to provide the spatial perceptual cues. The differences between the spatial perceptual cues derived from the reconstructed wavefront of sound waves and the ideal wavefront of sound waves form an objective metric for spatial perceptual quality, and provide the means of evaluating the overall system performance. Additionally, compensation filters are employed to improve the spatial perceptual quality of stereophonic systems by optimizing the objective metrics.
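
    The metric described in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the abstract does not say which perceptual cues are used, how their difference is aggregated, or what form the compensation filters take. The sketch treats the cues as plain numeric vectors, uses a root-mean-square difference as the metric, and stands in for a compensation filter with a single hypothetical scalar gain chosen from a candidate list.

```python
import math


def cue_error(ideal_cues, reconstructed_cues):
    """Root-mean-square difference between two equal-length cue vectors.

    RMS is an assumed aggregation; the abstract only says the differences
    between the cues "form an objective metric".
    """
    if len(ideal_cues) != len(reconstructed_cues):
        raise ValueError("cue vectors must have the same length")
    if not ideal_cues:
        raise ValueError("cue vectors must not be empty")
    squared = [(i - r) ** 2 for i, r in zip(ideal_cues, reconstructed_cues)]
    return math.sqrt(sum(squared) / len(squared))


def pick_compensation_gain(ideal_cues, raw_cues, candidate_gains):
    """Return the candidate gain that minimises cue_error.

    A single scalar gain is a hypothetical stand-in for a real
    (frequency-dependent) compensation filter.
    """
    return min(
        candidate_gains,
        key=lambda g: cue_error(ideal_cues, [g * c for c in raw_cues]),
    )


# Made-up cue values, for illustration only.
ideal = [6.0, 3.0, 0.0, -3.0]
raw = [4.0, 2.0, 0.0, -2.0]
best = pick_compensation_gain(ideal, raw, [1.0, 1.25, 1.5, 1.75])
print(cue_error(ideal, raw), best)
```

    The point of the sketch is the loop structure: compute the cue difference for each candidate compensation, then keep the candidate with the smallest metric.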

    Virtual audio environment for multidimensional conferencing
    2.
    Granted patent
    Virtual audio environment for multidimensional conferencing (in force)

    Publication number: US08693713B2

    Publication date: 2014-04-08

    Application number: US12970964

    Filing date: 2010-12-17

    IPC classification: H04R5/02

    Abstract: The disclosed architecture employs signal processing techniques to provide audio perception only, or audio perception that matches the visual perception. This also provides spatial audio reproduction for multiparty teleconferencing such that the teleconferencing participants perceive themselves as if they were sitting in the same room. The solution is based on the premise that people perceive sounds as a reconstructed wavefront, and hence, the wavefronts are used to provide the spatial perceptual cues. The differences between the spatial perceptual cues derived from the reconstructed wavefront of sound waves and the ideal wavefront of sound waves form an objective metric for spatial perceptual quality, and provide the means of evaluating the overall system performance. Additionally, compensation filters are employed to improve the spatial perceptual quality of stereophonic systems by optimizing the objective metrics.

    Spatialized audio over headphones
    3.
    Granted patent
    Spatialized audio over headphones (in force)

    Publication number: US08737648B2

    Publication date: 2014-05-27

    Application number: US12472080

    Filing date: 2009-05-26

    IPC classification: H04R5/02

    CPC classification: H04R27/00

    Abstract: A spatial element is added to communications, including telephone conference calls heard through headphones or a stereo speaker setup. Functions are created to modify signals from different callers to create the illusion that the callers are speaking from different parts of the room.
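
    A minimal sketch of the idea, under stated assumptions: the abstract does not describe the functions it applies to each caller's signal, so this example substitutes a plain constant-power (sine/cosine) stereo pan law, which is a much simpler technique than a real headphone spatializer is likely to use. The names pan_gains and mix_callers, and the position scale from -1 (left) to +1 (right), are hypothetical.

```python
import math


def pan_gains(position):
    """Constant-power pan law. position: -1.0 = full left, +1.0 = full right.

    Returns (left_gain, right_gain) with left**2 + right**2 == 1.
    """
    if not -1.0 <= position <= 1.0:
        raise ValueError("position must be in [-1, 1]")
    theta = (position + 1.0) * math.pi / 4.0  # maps [-1, 1] onto [0, pi/2]
    return math.cos(theta), math.sin(theta)


def mix_callers(callers):
    """Mix mono callers into a stereo pair.

    callers: list of (mono_samples, position) tuples.
    Returns (left_samples, right_samples), padded to the longest caller.
    """
    length = max((len(samples) for samples, _ in callers), default=0)
    left = [0.0] * length
    right = [0.0] * length
    for samples, position in callers:
        gain_left, gain_right = pan_gains(position)
        for n, x in enumerate(samples):
            left[n] += gain_left * x
            right[n] += gain_right * x
    return left, right


# Two callers: one placed hard left, one hard right (made-up samples).
left, right = mix_callers([([1.0, 1.0], -1.0), ([2.0], 1.0)])
```

    Giving each caller a distinct position value is what produces the impression of separate talkers; a production system would also need to handle sample formats and clipping.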

    HARMONICITY-BASED SINGLE-CHANNEL SPEECH QUALITY ESTIMATION
    4.
    Patent application
    HARMONICITY-BASED SINGLE-CHANNEL SPEECH QUALITY ESTIMATION (in force)

    Publication number: US20130151244A1

    Publication date: 2013-06-13

    Application number: US13316430

    Filing date: 2011-12-09

    IPC classification: G10L19/14

    CPC classification: G10L25/69

    Abstract: Speech quality estimation technique embodiments are described which generally involve estimating the human speech quality of an audio frame in a single-channel audio signal. A representation of a harmonic component of the frame is synthesized and used to compute a non-harmonic component of the frame. The synthesized harmonic component representation and the non-harmonic component are then used to compute a harmonic-to-non-harmonic ratio (HnHR). This HnHR is indicative of the quality of a user's speech and is designated as an estimate of the speech quality of the frame. In one implementation, the HnHR is used to establish a minimum speech quality threshold below which the quality of the user's speech is considered unacceptable. Feedback to the user is then provided based on whether the HnHR falls below the threshold.
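
    The steps in the abstract can be sketched as follows. This is an illustration under assumptions, not the patented method: the abstract does not say how the harmonic component is synthesized (here it is simply passed in), how the non-harmonic component is derived from it (here, by time-domain subtraction), or how the ratio is scaled (here, an energy ratio in decibels). The function names and the 10 dB threshold in the usage lines are hypothetical.

```python
import math


def hnhr_db(frame, harmonic):
    """Harmonic-to-non-harmonic energy ratio of one frame, in dB.

    frame:    samples of the audio frame.
    harmonic: synthesized harmonic component of the same frame
              (assumed to come from a pitch-based synthesizer, not shown).
    """
    if len(frame) != len(harmonic):
        raise ValueError("frame and harmonic component must have equal length")
    # Assumed definition: whatever the harmonic model does not explain.
    non_harmonic = [x - h for x, h in zip(frame, harmonic)]
    energy_h = sum(h * h for h in harmonic)
    energy_n = sum(r * r for r in non_harmonic)
    if energy_h == 0.0:
        return -math.inf  # no harmonic energy at all
    if energy_n == 0.0:
        return math.inf  # frame is perfectly harmonic
    return 10.0 * math.log10(energy_h / energy_n)


def feedback(hnhr, threshold_db):
    """Map an HnHR value to a user-facing message (threshold is hypothetical)."""
    return "ok" if hnhr >= threshold_db else "speech quality too low"


# Made-up frame: a harmonic part plus a small residual.
harmonic = [1.0, -1.0, 1.0, -1.0]
residual = [0.1, 0.1, -0.1, -0.1]
frame = [h + r for h, r in zip(harmonic, residual)]
score = hnhr_db(frame, harmonic)
message = feedback(score, threshold_db=10.0)
```

    In this toy frame the harmonic energy is about 100 times the residual energy, so the score is about 20 dB and the feedback is "ok".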

    SPATIALIZED AUDIO OVER HEADPHONES
    5.
    Patent application
    SPATIALIZED AUDIO OVER HEADPHONES (in force)

    Publication number: US20100303266A1

    Publication date: 2010-12-02

    Application number: US12472080

    Filing date: 2009-05-26

    IPC classification: H04R5/02

    CPC classification: H04R27/00

    Abstract: A spatial element is added to communications, including telephone conference calls heard through headphones or a stereo speaker setup. Functions are created to modify signals from different callers to create the illusion that the callers are speaking from different parts of the room.

    Harmonicity-based single-channel speech quality estimation
    6.
    Granted patent
    Harmonicity-based single-channel speech quality estimation (in force)

    Publication number: US08731911B2

    Publication date: 2014-05-20

    Application number: US13316430

    Filing date: 2011-12-09

    IPC classification: G10L21/00

    CPC classification: G10L25/69

    Abstract: Speech quality estimation technique embodiments are described which generally involve estimating the human speech quality of an audio frame in a single-channel audio signal. A representation of a harmonic component of the frame is synthesized and used to compute a non-harmonic component of the frame. The synthesized harmonic component representation and the non-harmonic component are then used to compute a harmonic-to-non-harmonic ratio (HnHR). This HnHR is indicative of the quality of a user's speech and is designated as an estimate of the speech quality of the frame. In one implementation, the HnHR is used to establish a minimum speech quality threshold below which the quality of the user's speech is considered unacceptable. Feedback to the user is then provided based on whether the HnHR falls below the threshold.

    STEREOPHONIC TELECONFERENCING USING A MICROPHONE ARRAY
    7.
    Patent application
    STEREOPHONIC TELECONFERENCING USING A MICROPHONE ARRAY (under examination, published)

    Publication number: US20120262536A1

    Publication date: 2012-10-18

    Application number: US13086632

    Filing date: 2011-04-14

    IPC classification: H04N7/14 H04R5/02 H04R5/00

    Abstract: Stereophonic teleconferencing system embodiments are described which advantageously employ a microphone array at a remote conference site having multiple conferencees to produce a separate output channel from each microphone in the array. Audio data streams each representing one of the audio output channels from the microphone array are then sent to a local conference site where a local conferencee is in attendance. The voices of the aforementioned remote conferencees are spatialized within a sound-field of the local site using multiple loudspeakers. Generally, this involves receiving the monophonic audio data streams from the remote site, and processing them to generate an audio signal for each loudspeaker. Each of the generated audio signals is then played through its respective loudspeaker to produce a spatial audio sound-field which is audibly perceived by the local conferencee as having the voice of each of the remote conferencees coming from a different location.
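
    The local-site step (several monophonic streams in, one signal per loudspeaker out) can be sketched as a gain-matrix mix. The abstract does not say what processing is used, so the details are assumptions for illustration: each stream is treated as one source placed at an evenly spaced point along the loudspeaker line, and it is cross-faded linearly between the two nearest loudspeakers. The names spread_gains and render are hypothetical.

```python
def spread_gains(num_streams, num_speakers):
    """Build a gain matrix gains[k][m] for stream k and loudspeaker m.

    Streams are placed at evenly spaced positions along the loudspeaker
    line and cross-faded linearly between the two nearest loudspeakers.
    """
    if num_streams < 1 or num_speakers < 2:
        raise ValueError("need at least one stream and two loudspeakers")
    gains = []
    for k in range(num_streams):
        # Position in loudspeaker-index units, from 0 to num_speakers - 1.
        if num_streams == 1:
            pos = 0.5 * (num_speakers - 1)
        else:
            pos = k * (num_speakers - 1) / (num_streams - 1)
        lower = min(int(pos), num_speakers - 2)
        frac = pos - lower
        row = [0.0] * num_speakers
        row[lower] = 1.0 - frac
        row[lower + 1] = frac
        gains.append(row)
    return gains


def render(streams, gains):
    """Mix mono streams into one sample list per loudspeaker."""
    num_speakers = len(gains[0])
    length = max(len(stream) for stream in streams)
    out = [[0.0] * length for _ in range(num_speakers)]
    for stream, row in zip(streams, gains):
        for m, gain in enumerate(row):
            if gain == 0.0:
                continue
            for n, x in enumerate(stream):
                out[m][n] += gain * x
    return out


# Three one-sample streams rendered over two loudspeakers (made-up values).
gains = spread_gains(3, 2)
speaker_signals = render([[1.0], [2.0], [4.0]], gains)
```

    With three streams and two loudspeakers, the first stream plays only from the first loudspeaker, the last only from the second, and the middle one equally from both.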

    Dynamic hand gesture recognition using depth data
    8.
    Granted patent
    Dynamic hand gesture recognition using depth data (in force)

    Publication number: US09536135B2

    Publication date: 2017-01-03

    Application number: US13526501

    Filing date: 2012-06-18

    IPC classification: G06K9/00

    Abstract: The subject disclosure is directed towards a technology by which dynamic hand gestures are recognized by processing depth data, including in real time. In an offline stage, a classifier is trained from feature values extracted from frames of depth data that are associated with intended hand gestures. In an online stage, a feature extractor extracts feature values from sensed depth data that corresponds to an unknown hand gesture. These feature values are input to the classifier as a feature vector to receive a recognition result of the unknown hand gesture. The technology may be used in real time, and may be robust to variations in lighting, hand orientation, and the user's gesturing speed and style.
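
    The offline/online split can be illustrated with a small sketch. The abstract names neither the classifier type nor the features, so both are assumptions here: a nearest-centroid classifier stands in for the trained classifier, and the two-element feature vectors and gesture labels are made up. Feature extraction from depth frames is not shown.

```python
import math


def train(labelled_features):
    """Offline stage: average the feature vectors of each gesture label.

    labelled_features: iterable of (label, feature_vector) pairs.
    Returns a dict mapping each label to its centroid vector.
    """
    sums = {}
    counts = {}
    for label, vector in labelled_features:
        if label not in sums:
            sums[label] = [0.0] * len(vector)
            counts[label] = 0
        sums[label] = [s + v for s, v in zip(sums[label], vector)]
        counts[label] += 1
    return {
        label: [s / counts[label] for s in total]
        for label, total in sums.items()
    }


def classify(centroids, feature_vector):
    """Online stage: return the label whose centroid is nearest (Euclidean)."""
    return min(
        centroids,
        key=lambda label: math.dist(centroids[label], feature_vector),
    )


# Made-up training data: (label, feature vector) pairs.
training = [
    ("swipe_left", [-1.0, 0.0]),
    ("swipe_left", [-0.8, 0.1]),
    ("swipe_right", [1.0, 0.0]),
    ("swipe_right", [0.9, -0.1]),
]
model = train(training)
result = classify(model, [0.7, 0.05])
```

    Training happens once on labelled examples; at run time only the cheap nearest-centroid lookup is needed, which is one way a recognizer of this shape could keep up with live sensor data.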

    Data buddy
    9.
    Granted patent
    Data buddy (in force)

    Publication number: US09055607B2

    Publication date: 2015-06-09

    Application number: US12323570

    Filing date: 2008-11-26

    Abstract: Multi-modal, multi-lingual devices can be employed to consolidate numerous items including, but not limited to, keys, remote controls, image capture devices, audio recorders, cellular telephone functionalities, location/direction detectors, health monitors, calendars, gaming devices, smart home inputs, pens, optical pointing devices or the like. For example, a corner of a cellular telephone can be used as an electronic pen. Moreover, the device can be used to snap multiple pictures and stitch them together to create a panoramic image. A device can automate ignition of an automobile, initiate appliances, etc. based upon relative distance. The device can provide near-to-eye capabilities for enhanced image viewing. Multiple cameras/sensors can be provided on a single device to provide stereoscopic capabilities. The device can also provide assistance to the blind, privacy, etc. by consolidating services.
