Adaptive ambient sound suppression and speech tracking
    1.
    Granted invention patent
    Adaptive ambient sound suppression and speech tracking (status: in force)

    Publication (announcement) No.: US08219394B2

    Publication (announcement) date: 2012-07-10

    Application No.: US12690827

    Filing date: 2010-01-20

    IPC classification: G10L11/00 G10L21/02 G10L21/00

    Abstract: A device for suppressing ambient sounds from speech received by a microphone array is provided. One embodiment of the device comprises a microphone array, a processor, an analog-to-digital converter, and memory comprising instructions stored therein that are executable by the processor. The instructions stored in the memory are configured to receive a plurality of digital sound signals, each digital sound signal based on an analog sound signal originating at the microphone array, receive a multi-channel speaker signal, generate a monophonic approximation signal of the multi-channel speaker signal, apply a linear acoustic echo canceller to suppress a first ambient sound portion of each digital sound signal, generate a combined directionally-adaptive sound signal from a combination of each digital sound signal by a combination of time-invariant and adaptive beamforming techniques, and apply one or more nonlinear noise suppression techniques to suppress a second ambient sound portion of the combined directionally-adaptive sound signal.
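
    The first stages named in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the averaging downmix, the NLMS adaptation rule, the fixed integer delays, and all function and parameter names are assumptions chosen for illustration, and the adaptive beamforming and nonlinear suppression stages are left out.

```python
def mono_approximation(speaker_channels):
    """Downmix a multi-channel speaker signal by averaging the channels
    sample by sample (one simple choice of monophonic approximation)."""
    n = len(speaker_channels)
    return [sum(frame) / n for frame in zip(*speaker_channels)]


def nlms_echo_cancel(mic, reference, taps=8, mu=0.5, eps=1e-9):
    """Linear acoustic echo canceller (assumed here to be NLMS): adapt an
    FIR filter so the filtered reference tracks the echo in `mic`, and
    return the residual after subtracting the echo estimate."""
    weights = [0.0] * taps
    history = [0.0] * taps  # most recent reference samples, newest first
    residual = []
    for d, x in zip(mic, reference):
        history = [x] + history[:-1]
        estimate = sum(w * h for w, h in zip(weights, history))
        err = d - estimate
        power = sum(h * h for h in history) + eps
        weights = [w + mu * err * h / power for w, h in zip(weights, history)]
        residual.append(err)
    return residual


def delay_and_sum(channels, delays):
    """Time-invariant beamformer: delay each channel by a fixed number of
    samples, then average. Samples before the start count as silence."""
    out = []
    for t in range(len(channels[0])):
        total = sum(ch[t - d] for ch, d in zip(channels, delays) if t - d >= 0)
        out.append(total / len(channels))
    return out
```

    In use, each microphone channel would be passed through `nlms_echo_cancel` with the output of `mono_approximation` as its reference, and the cleaned channels would then be combined with `delay_and_sum`.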

    ADAPTIVE AMBIENT SOUND SUPPRESSION AND SPEECH TRACKING
    2.
    Invention patent application
    ADAPTIVE AMBIENT SOUND SUPPRESSION AND SPEECH TRACKING (status: in force)

    Publication (announcement) No.: US20110178798A1

    Publication (announcement) date: 2011-07-21

    Application No.: US12690827

    Filing date: 2010-01-20

    IPC classification: G10L21/02

    Abstract: A device for suppressing ambient sounds from speech received by a microphone array is provided. One embodiment of the device comprises a microphone array, a processor, an analog-to-digital converter, and memory comprising instructions stored therein that are executable by the processor. The instructions stored in the memory are configured to receive a plurality of digital sound signals, each digital sound signal based on an analog sound signal originating at the microphone array, receive a multi-channel speaker signal, generate a monophonic approximation signal of the multi-channel speaker signal, apply a linear acoustic echo canceller to suppress a first ambient sound portion of each digital sound signal, generate a combined directionally-adaptive sound signal from a combination of each digital sound signal by a combination of time-invariant and adaptive beamforming techniques, and apply one or more nonlinear noise suppression techniques to suppress a second ambient sound portion of the combined directionally-adaptive sound signal.

    ADAPTIVE AMBIENT SOUND SUPPRESSION AND SPEECH TRACKING
    3.
    Invention patent application
    ADAPTIVE AMBIENT SOUND SUPPRESSION AND SPEECH TRACKING (status: under examination, published)

    Publication (announcement) No.: US20120245933A1

    Publication (announcement) date: 2012-09-27

    Application No.: US13491952

    Filing date: 2012-06-08

    Abstract: A device for suppressing ambient sounds from speech received by a microphone array is provided. One embodiment of the device comprises a microphone array, a processor, an analog-to-digital converter, and memory comprising instructions stored therein that are executable by the processor. The instructions stored in the memory are configured to receive a plurality of digital sound signals, each digital sound signal based on an analog sound signal originating at the microphone array, receive a multi-channel speaker signal, generate a monophonic approximation signal of the multi-channel speaker signal, apply a linear acoustic echo canceller to suppress a first ambient sound portion of each digital sound signal, generate a combined directionally-adaptive sound signal from a combination of each digital sound signal by a combination of time-invariant and adaptive beamforming techniques, and apply one or more nonlinear noise suppression techniques to suppress a second ambient sound portion of the combined directionally-adaptive sound signal.

    SYSTEM AND METHOD FOR HIGH-PRECISION 3-DIMENSIONAL AUDIO FOR AUGMENTED REALITY
    4.
    Invention patent application
    SYSTEM AND METHOD FOR HIGH-PRECISION 3-DIMENSIONAL AUDIO FOR AUGMENTED REALITY (status: in force)

    Publication (announcement) No.: US20120093320A1

    Publication (announcement) date: 2012-04-19

    Application No.: US12903610

    Filing date: 2010-10-13

    IPC classification: H04R5/00

    Abstract: Techniques are provided for providing 3D audio, which may be used in augmented reality. A 3D audio signal may be generated based on sensor data collected from the actual room in which the listener is located and the actual position of the listener in the room. The 3D audio signal may include a number of components that are determined based on the collected sensor data and the listener's location. For example, a number of (virtual) sound paths between a virtual sound source and the listener may be determined. The sensor data may be used to estimate materials in the room, such that the effect that those materials would have on sound as it travels along the paths can be determined. In some embodiments, sensor data may be used to collect physical characteristics of the listener such that a suitable HRTF may be determined from a library of HRTFs.
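
    As a rough illustration of how estimated room materials could shape one component of the 3D audio signal, the sketch below computes a delay and an amplitude gain for a single virtual sound path. The abstract does not give a formula: the 1/r spreading law, the square-root reflection factor, the absorption values, and the names `ABSORPTION` and `path_component` are all assumptions made for this example.

```python
import math

SPEED_OF_SOUND = 343.0  # metres per second in air at roughly 20 °C

# Hypothetical energy absorption coefficients. The values are illustrative
# placeholders, not measurements and not taken from the patent.
ABSORPTION = {"carpet": 0.30, "glass": 0.05, "drywall": 0.10}


def path_component(length_m, materials):
    """Return (delay_seconds, amplitude_gain) for one virtual sound path.

    `materials` lists the surfaces the path reflects off, in order.
    """
    gain = 1.0 / max(length_m, 1e-3)  # spherical spreading, clamped near zero
    for name in materials:
        # A surface absorbing a fraction a of the energy reflects the
        # fraction (1 - a), so the pressure amplitude scales by sqrt(1 - a).
        gain *= math.sqrt(1.0 - ABSORPTION[name])
    return length_m / SPEED_OF_SOUND, gain
```

    Summing delayed, scaled copies of the source signal over all paths would give a simple room response, to which a listener-specific HRTF could then be applied.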

    System and method for high-precision 3-dimensional audio for augmented reality
    5.
    Granted invention patent
    System and method for high-precision 3-dimensional audio for augmented reality (status: in force)

    Publication (announcement) No.: US08767968B2

    Publication (announcement) date: 2014-07-01

    Application No.: US12903610

    Filing date: 2010-10-13

    IPC classification: H04R5/00 H04R5/02 G09G5/00

    Abstract: Techniques are provided for providing 3D audio, which may be used in augmented reality. A 3D audio signal may be generated based on sensor data collected from the actual room in which the listener is located and the actual position of the listener in the room. The 3D audio signal may include a number of components that are determined based on the collected sensor data and the listener's location. For example, a number of (virtual) sound paths between a virtual sound source and the listener may be determined. The sensor data may be used to estimate materials in the room, such that the effect that those materials would have on sound as it travels along the paths can be determined. In some embodiments, sensor data may be used to collect physical characteristics of the listener such that a suitable HRTF may be determined from a library of HRTFs.

    Controlling Power Levels Of Electronic Devices Through User Interaction
    7.
    Invention patent application
    Controlling Power Levels Of Electronic Devices Through User Interaction (status: in force)

    Publication (announcement) No.: US20110298967A1

    Publication (announcement) date: 2011-12-08

    Application No.: US12794406

    Filing date: 2010-06-04

    IPC classification: H04N5/225 H02J4/00

    Abstract: A processor-implemented method, system, and computer-readable medium for intelligently controlling the power level of an electronic device in a multimedia system based on user intent are provided. The method includes receiving data relating to a first user interaction with a device in a multimedia system. The method includes determining if the first user interaction corresponds to a user's intent to interact with the device. The method then includes setting a power level for the device based on the first user interaction. The method further includes receiving data relating to a second user interaction with the device. The method then includes altering the power level of the device based on the second user interaction to activate the device for the user.
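
    The two-step flow in this abstract can be read as a small state machine. The sketch below is one possible interpretation, not the claimed method: the three level names, the one-step-per-interaction rule, and the choice to leave the level unchanged when no intent is detected are assumptions.

```python
from enum import IntEnum


class PowerLevel(IntEnum):
    # Hypothetical levels; the abstract does not name specific levels.
    STANDBY = 0
    INTERMEDIATE = 1
    ACTIVE = 2


def update_power(level, shows_intent):
    """Return the power level after one user interaction.

    An interaction judged to show intent raises the level by one step,
    capped at ACTIVE. Any other interaction leaves the level unchanged.
    """
    if not shows_intent:
        return PowerLevel(level)
    return PowerLevel(min(level + 1, PowerLevel.ACTIVE))
```

    Under these assumptions, a first interaction that shows intent moves a device from `STANDBY` to `INTERMEDIATE`, and a second one activates it.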

    DE-ALIASING DEPTH IMAGES
    8.
    Invention patent application
    DE-ALIASING DEPTH IMAGES (status: in force)

    Publication (announcement) No.: US20110234756A1

    Publication (announcement) date: 2011-09-29

    Application No.: US12732918

    Filing date: 2010-03-26

    IPC classification: H04N13/02 G06K9/00

    CPC classification: G06T5/002 G06T2207/10028

    Abstract: Techniques are provided for de-aliasing depth images. The depth image may have been generated based on phase differences between a transmitted and received modulated light beam. A method may include accessing a depth image that has a depth value for a plurality of locations in the depth image. Each location has one or more neighbor locations. Potential depth values are determined for each of the plurality of locations based on the depth value in the depth image for the location and potential aliasing in the depth image. A cost function is determined based on differences between the potential depth values of each location and its neighboring locations. Determining the cost function includes assigning a higher cost for greater differences in potential depth values between neighboring locations. The cost function is substantially minimized to select one of the potential depth values for each of the locations.
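
    The cost-minimization idea can be sketched on a single scanline. This is a simplified illustration under stated assumptions, not the patented method: each pixel's candidates are taken to be the measured depth plus a whole number of ambiguity ranges, the cost is the sum of absolute differences between horizontally adjacent pixels, and the minimum is found exactly by dynamic programming. The function and parameter names are hypothetical.

```python
def dealias_scanline(measured, ambiguity_range, max_wraps=2):
    """Pick one candidate depth per pixel so that the summed absolute
    difference between neighbouring pixels is minimal."""
    if not measured:
        return []
    # Potential depth values: the measurement plus 0, 1, ... full wraps.
    candidates = [[d + k * ambiguity_range for k in range(max_wraps)]
                  for d in measured]
    best = [0.0] * max_wraps  # best[k]: minimal cost of a labelling ending in k
    back = []                 # back[i - 1][k]: best predecessor of k at pixel i
    for i in range(1, len(candidates)):
        row, ptr = [], []
        for cur in candidates[i]:
            # Larger jumps between neighbours cost more.
            costs = [best[j] + abs(cur - prev)
                     for j, prev in enumerate(candidates[i - 1])]
            j_min = min(range(max_wraps), key=costs.__getitem__)
            row.append(costs[j_min])
            ptr.append(j_min)
        best = row
        back.append(ptr)
    # Trace the cheapest labelling back from the last pixel.
    k = min(range(max_wraps), key=best.__getitem__)
    chosen = [k]
    for ptr in reversed(back):
        k = ptr[k]
        chosen.append(k)
    chosen.reverse()
    return [candidates[i][k] for i, k in enumerate(chosen)]
```

    Because the cost uses only differences between neighbours, shifting every pixel by the same number of wraps costs the same; ties are resolved here in favour of the smaller wrap count. A full 2D image would need a different optimizer, since this dynamic program only handles a chain of pixels.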

    Video synchronization by adjusting video parameters
    10.
    Invention patent application
    Video synchronization by adjusting video parameters (status: lapsed)

    Publication (announcement) No.: US20060017847A1

    Publication (announcement) date: 2006-01-26

    Application No.: US10897278

    Filing date: 2004-07-22

    Applicant: John Tardif

    Inventor: John Tardif

    IPC classification: H03L7/00

    Abstract: When playing back audio/video streams, many playback devices try to recreate the audio and video clocks used for encoding. One means typically employed to recreate such clocks is a Phase-Locked Loop (PLL) circuit. The audio and video should remain synchronized. However, many reasonably priced PLLs cannot recreate the exact video clock used for encoding. The synchronization of the video to the audio can be resolved by adjusting one or more of the dimensions (or other variables) that define the video being recreated. Changing the dimensions (or other variables) of the video allows for an adjustment of the output frequency of the PLL to a value that can be implemented.
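
    The relationship behind this abstract is that the pixel clock equals total pixels per line times total lines per frame times the frame rate, so changing a video dimension changes which clock frequency is needed. The sketch below searches line widths near a nominal value for the one whose required clock a simple PLL can produce most closely. The PLL model (reference frequency times an integer multiplier over an integer divider), the search ranges, and the name `best_timing` are assumptions for illustration, not details from the patent.

```python
from fractions import Fraction


def best_timing(target_fps, v_total, h_nominal, ref_hz,
                max_mult=32, max_div=32, h_slack=8):
    """Return (h_total, mult, div, achieved_fps) minimising the frame-rate
    error, where the PLL output is assumed to be ref_hz * mult / div."""
    best = None
    for h_total in range(h_nominal - h_slack, h_nominal + h_slack + 1):
        for mult in range(1, max_mult + 1):
            for div in range(1, max_div + 1):
                clock = Fraction(ref_hz * mult, div)
                fps = clock / (h_total * v_total)  # frames per second
                err = abs(fps - target_fps)
                # Strict comparison keeps the first of any tied settings.
                if best is None or err < best[0]:
                    best = (err, h_total, mult, div, fps)
    _, h_total, mult, div, fps = best
    return h_total, mult, div, fps
```

    Calling the function with `h_slack=0` shows the error when the line width is fixed. Allowing some slack can only match or reduce that error, which is the effect the abstract describes.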
