专利检索 ap:("Amit S. Chhetri" OR "Kavitha Velusamy" OR "Wai C. Chu" OR "Ramya Gopalan") AND inv:"Kavitha Velusamy" 第 1 页

1.

发明授权
Null-forming techniques to improve acoustic echo cancellation 有权
标题翻译：用于改善声学回声消除的零成形技术

公开(公告)号：US08885815B1

公开(公告)日：2014-11-11

申请号：US13532570

申请日：2012-06-25

申请人： Kavitha Velusamy , Amit S. Chhetri , Ramya Gopalan , Wai C. Chu , Wei Li

发明人： Kavitha Velusamy , Amit S. Chhetri , Ramya Gopalan , Wai C. Chu , Wei Li

IPC分类号： H04M9/08 , H04R3/02 , G10L21/0216

CPC分类号： H04R3/02 , G01S3/80 , G10L21/0208 , G10L2021/02166 , H04M9/08 , H04M9/082 , H04R3/005

摘要： A plurality of microphones of a communication device is grouped into multiple microphone groups, such that each microphone group includes two or more microphones. For each microphone group, output of the corresponding microphones is processed to form an acoustic null in a corresponding spatial direction, such that sound from the corresponding spatial direction is attenuated in the processed output. One of the microphone groups is selected based on various factors leading to maximal echo attenuation and rejection of reverberant components of the room. The selected microphone group is then used to detect sound from a near end talker of the communication device.

摘要翻译： 通信设备的多个麦克风被分组成多个麦克风组，使得每个麦克风组包括两个或更多麦克风。对于每个麦克风组，相应麦克风的输出被处理以在相应的空间方向上形成声音零点，使得来自对应的空间方向的声音在经处理的输出中衰减。麦克风组中的一个是基于导致最大回波衰减和拒绝房间混响分量的各种因素来选择的。所选择的麦克风组然后用于检测来自通信设备的近端讲话者的声音。

2.

发明授权
Acoustic echo cancellation processing based on feedback from speech recognizer 有权
标题翻译：基于来自语音识别器的反馈的声学回声消除处理

公开(公告)号：US09373338B1

公开(公告)日：2016-06-21

申请号：US13532649

申请日：2012-06-25

申请人： Ramya Gopalan , Kavitha Velusamy , Wai C. Chu , Amit S. Chhetri

发明人： Ramya Gopalan , Kavitha Velusamy , Wai C. Chu , Amit S. Chhetri

IPC分类号： G10L21/02

CPC分类号： G10L21/02 , G10L15/22 , G10L15/26 , H04M9/082

摘要： An automatic speech recognition engine receives an acoustic-echo processed signal from an acoustic-echo processing (AEP) module, where said echo processed signal contains mainly the speech from the near-end talker. The automatic speech recognition engine analyzes the content of the acoustic-echo processed signal to determine whether words or keywords are present. Based upon the results of this analysis, the automatic speech recognition engine produces a value reflecting the likelihood that some words or keywords are detected. Said value is provided to the AEP module. Based upon the value, the AEP module determines if there is double talk and processes the incoming signals accordingly to enhance its performance.

摘要翻译： 自动语音识别引擎从声学回波处理（AEP）模块接收声学回波处理的信号，其中所述回波处理的信号主要包含来自近端讲话者的语音。自动语音识别引擎分析声回波处理信号的内容，以确定是否存在词或关键词。基于该分析的结果，自动语音识别引擎产生反映检测到某些单词或关键词的可能性的值。所述值被提供给AEP模块。基于该值，AEP模块确定是否存在双重通话，并相应地处理输入信号以增强其性能。

3.

发明授权
Acoustic echo cancellation using visual cues 有权

公开(公告)号：US09767828B1

公开(公告)日：2017-09-19

申请号：US13535232

申请日：2012-06-27

申请人： Kavitha Velusamy , Wai C. Chu , Ramya Gopalan , Amit S. Chhetri

发明人： Kavitha Velusamy , Wai C. Chu , Ramya Gopalan , Amit S. Chhetri

IPC分类号： G06K9/00 , G10L25/78 , H04N21/442 , H04B3/23

CPC分类号： G10L25/78 , G06K9/00268 , G06K9/00335 , G06K9/00362 , G10L17/00 , G10L21/0208 , G10L2021/02082 , H04B3/23 , H04B3/231 , H04N21/44218

摘要： Techniques for enhancing an acoustic echo canceller based on visual cues are described herein. The techniques include changing adaptation of a filter of the acoustic echo canceller, calibrating the filter, or reducing background noise from an audio signal processed by the acoustic echo canceller. The changing, calibrating, and reducing are responsive to visual cues that describe acoustic characteristics of a location of a device that includes the acoustic echo canceller. Such visual cues may indicate that no human being is present at the location, that some subject(s) are engaged in speaking or sound generating activities, or that motion associated with an echo path change has occurred at the location.

4.

发明授权
Acoustic echo cancellation using blind source separation 有权
标题翻译：使用盲源分离的声回波消除

公开(公告)号：US08855295B1

公开(公告)日：2014-10-07

申请号：US13532617

申请日：2012-06-25

申请人： Amit S. Chhetri , Kavitha Velusamy , Wai C. Chu , Ramya Gopalan

发明人： Amit S. Chhetri , Kavitha Velusamy , Wai C. Chu , Ramya Gopalan

IPC分类号： H04M9/08 , G10K11/00

CPC分类号： H04M9/082 , G10L21/0272

摘要： Techniques for utilizing blind source separation as a front-end to an acoustic echo canceller are described herein. The techniques include removing a first portion of an acoustic echo from an audio signal using blind source separation and a reference signal. The techniques then further remove a second portion of the acoustic echo using an acoustic echo canceller and the reference signal. Further, output of the blind source separation may be used to improve double-talk detection.

摘要翻译： 本文描述了将盲源分离用作声学回声消除器的前端的技术。这些技术包括使用盲源分离和参考信号从音频信号中去除声学回声的第一部分。然后，技术使用声学回声消除器和参考信号进一步去除声学回声的第二部分。此外，盲源分离的输出可以用于改善双方通话检测。

5.

发明授权
Signal-enhancing beamforming in an augmented reality environment 有权

公开(公告)号：US09973848B2

公开(公告)日：2018-05-15

申请号：US13165620

申请日：2011-06-21

申请人： Amit S. Chhetri , Kavitha Velusamy , Edward Dietz Crump

发明人： Amit S. Chhetri , Kavitha Velusamy , Edward Dietz Crump

IPC分类号： H04R3/00 , H04R1/40

CPC分类号： H04R3/005 , H04R1/406 , H04R2201/401 , H04R2201/403 , H04R2430/20 , H04R2430/21

摘要： An augmented reality environment allows interaction between virtual and real objects. Beamforming techniques are applied to signals acquired by an array of microphones to allow for simultaneous spatial tracking and signal acquisition from multiple users. Localization information such as from other sensors in the environment may be used to select a particular set of beamformer coefficients and resulting beampattern focused on a signal source. Alternately, a series of beampatterns may be used iteratively to localize the signal source in a computationally efficient fashion. The beamformer coefficients may be pre-computed.

6.

发明申请
Signal-enhancing Beamforming in an Augmented Reality Environment 有权
标题翻译：增强现实环境中的信号增强波束形成

公开(公告)号：US20120327115A1

公开(公告)日：2012-12-27

申请号：US13165620

申请日：2011-06-21

申请人： Amit S. Chhetri , Kavitha Velusamy , Edward Dietz Crump

发明人： Amit S. Chhetri , Kavitha Velusamy , Edward Dietz Crump

IPC分类号： G09G5/00

CPC分类号： H04R3/005 , H04R1/406 , H04R2201/401 , H04R2201/403 , H04R2430/20 , H04R2430/21

摘要： An augmented reality environment allows interaction between virtual and real objects. Beamforming techniques are applied to signals acquired by an array of microphones to allow for simultaneous spatial tracking and signal acquisition from multiple users. Localization information such as from other sensors in the environment may be used to select a particular set of beamformer coefficients and resulting beampattern focused on a signal source. Alternately, a series of beampatterns may be used iteratively to localize the signal source in a computationally efficient fashion. The beamformer coefficients may be pre-computed.

摘要翻译： 增强的现实环境允许虚拟和实际对象之间的交互。波束形成技术应用于由麦克风阵列获取的信号，以允许来自多个用户的同时空间跟踪和信号采集。可以使用诸如来自环境中的其他传感器的定位信息来选择波束形成器系数的特定集合，并且将聚焦在信号源上的所得到的波形图进行选择。或者，可以迭代地使用一系列波形图来以计算有效的方式定位信号源。波束形成系数可以被预先计算。

7.

发明授权
Echo cancellation based on shared reference signals 有权

公开(公告)号：US09779731B1

公开(公告)日：2017-10-03

申请号：US13589967

申请日：2012-08-20

申请人： Menashe Haskin , Kavitha Velusamy

发明人： Menashe Haskin , Kavitha Velusamy

IPC分类号： G10L15/00 , G10L15/06 , G10L15/20 , G10L21/00 , G10L21/02

CPC分类号： G10L15/20 , G10L15/00 , G10L19/018 , G10L21/0208 , G10L2021/02082

摘要： An audio processing system configured to generate, based at least in part on captured sound, an audio signal that includes a speech component corresponding to a user's speech utterance and an audio component corresponding to audio output of another device is described herein. The audio processing system is also configured to receive a reference signal that corresponds to the audio output of the other device. The reference signal may be received as ultrasonic audio output of the other device or from a remote server. The audio processing device then processes the generated audio signal to remove at least a part of the generated audio signal that corresponds to the reference signal.

8.

发明申请
Time Difference of Arrival Determination with Direct Sound 有权
标题翻译：具有直接声音的到达时间差异

公开(公告)号：US20120327746A1

公开(公告)日：2012-12-27

申请号：US13168759

申请日：2011-06-24

申请人： Kavitha Velusamy

发明人： Kavitha Velusamy

IPC分类号： G01S3/80

CPC分类号： G01S5/22 , G01S3/86

摘要： Acoustic signals may be localized such that their position in space is determined. Time-difference-of-arrival data from multiple microphones may be used for this localization. Signal data from the microphones may be degraded by reverberation and other environmental distortions, resulting in erroneous localization. By detecting a portion of the signal resulting from sound directly reaching a microphone rather than from a reverberation, accuracy of the localization is improved.

摘要翻译： 声信号可以被定位成使得它们在空间中的位置被确定。来自多个麦克风的到达时间差数据可以用于该定位。来自麦克风的信号数据可能由于混响和其他环境变形而降级，导致错误的定位。通过检测由直接到达麦克风的声音产生的信号的一部分，而不是从混响，提高了定位的精度。

9.

发明授权
Sound localization with artificial neural network 有权
标题翻译：用人造神经网络进行声音定位

公开(公告)号：US08676728B1

公开(公告)日：2014-03-18

申请号：US13076201

申请日：2011-03-30

申请人： Kavitha Velusamy , Edward Dietz Crump

发明人： Kavitha Velusamy , Edward Dietz Crump

IPC分类号： G06F15/18

CPC分类号： G06N3/08 , H04R1/406 , H04R3/005 , H04R2430/20

摘要： The location of a sound within a given spatial volume may be used in applications such as augmented reality environments. An artificial neural network processes time-difference-of-arrival data (TDOA) from a known microphone array to determine a spatial location of the sound. The neural network may be located locally or available as a cloud service. The artificial neural network is trained with perturbed and non-perturbed TDOA data.

摘要翻译： 给定空间体积内的声音的位置可以用于诸如增强现实环境的应用中。人造神经网络处理来自已知麦克风阵列的时差差数据（TDOA）以确定声音的空间位置。神经网络可以位于本地或可用作云服务。人工神经网络用扰动和非扰动TDOA数据训练。

10.

发明申请
TECHNIQUES FOR PERFORMING KEY FRAME REQUESTS IN MEDIA SERVERS AND ENDPOINT DEVICES 有权
标题翻译：在媒体服务器和端点设备中执行关键帧请求的技术

公开(公告)号：US20130106984A1

公开(公告)日：2013-05-02

申请号：US13287155

申请日：2011-11-02

申请人： Duanpei Wu , Kavitha Velusamy

发明人： Duanpei Wu , Kavitha Velusamy

IPC分类号： H04N7/15

CPC分类号： H04N7/15

摘要： Techniques are provided for sending and receiving key frames and key frame request messages. At a video conference bridge, a key frame request message is received from a first endpoint device. The key frame request message comprises a request for a key frame from a second endpoint device. When a prior key frame request message is received before the key frame request message, a key frame request time value is determined that corresponds to an amount of time between receiving the key frame request message and receiving the prior key frame request message. This value is compared to a threshold time value. When the key frame request time is greater than the threshold time, a key frame request forwarding message is generated, and the key frame request forwarding message is sent to the second endpoint device to request the key frame from the second endpoint device.

摘要翻译： 提供了发送和接收关键帧和关键帧请求消息的技术。在视频会议桥上，从第一端点设备接收关键帧请求消息。关键帧请求消息包括对来自第二端点设备的关键帧的请求。当在关键帧请求消息之前接收到先前的关键帧请求消息时，确定对应于接收到关键帧请求消息和接收先前关键帧请求消息之间的时间量的关键帧请求时间值。该值与阈值时间值进行比较。当关键帧请求时间大于阈值时间时，生成关键帧请求转发消息，并将关键帧请求转发消息发送到第二终端设备，从第二终端设备请求关键帧。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类