-
公开(公告)号:US20100315905A1
公开(公告)日:2010-12-16
申请号:US12482773
申请日:2009-06-11
申请人: Bowon Lee , Kar-Han Tan
发明人: Bowon Lee , Kar-Han Tan
IPC分类号: G01S3/80
CPC分类号: G01S5/28
摘要: Various embodiments of the present invention are directed to systems and methods for multimodal object localization using one or more depth sensors and two or more microphones. In one aspect, a method comprises capturing three-dimensional images of a region of space wherein the object is located. The images comprise three-dimensional depth sensor observations. The method collects ambient audio generated by the object, providing acoustic observation regarding the ambient audio time difference of arrival at the audio sensors. The method determines a coordinate location of the object corresponding to the maximum of a joint probability distribution characterizing the probability of the acoustic observations emanating from each coordinate location in the region of space and the probability of each coordinate location in the region of space given depth sensor observations.
摘要翻译: 本发明的各种实施例涉及使用一个或多个深度传感器和两个或更多个麦克风的多模态对象定位的系统和方法。 一方面,一种方法包括捕获物体所位于的空间区域的三维图像。 图像包括三维深度传感器观察。 该方法收集由对象产生的环境音频,提供关于到达音频传感器的环境音频时差的声学观察。 该方法确定对应于对应于联合概率分布的最大值的对象的坐标位置,其表征从空间区域中的每个坐标位置发出的声学观察的概率和给定深度传感器的空间区域中的每个坐标位置的概率 观察。
-
公开(公告)号:US08174932B2
公开(公告)日:2012-05-08
申请号:US12482773
申请日:2009-06-11
申请人: Bowon Lee , Kar-Han Tan
发明人: Bowon Lee , Kar-Han Tan
IPC分类号: G01S3/80
CPC分类号: G01S5/28
摘要: Various embodiments of the present invention are directed to systems and methods for multimodal object localization using one or more depth sensors and two or more microphones. In one aspect, a method comprises capturing three-dimensional images of a region of space wherein the object is located. The images comprise three-dimensional depth sensor observations. The method collects ambient audio generated by the object, providing acoustic observation regarding the ambient audio time difference of arrival at the audio sensors. The method determines a coordinate location of the object corresponding to the maximum of a joint probability distribution characterizing the probability of the acoustic observations emanating from each coordinate location in the region of space and the probability of each coordinate location in the region of space given depth sensor observations.
摘要翻译: 本发明的各种实施例涉及使用一个或多个深度传感器和两个或更多个麦克风的多模态对象定位的系统和方法。 一方面,一种方法包括捕获物体所位于的空间区域的三维图像。 图像包括三维深度传感器观察。 该方法收集由对象产生的环境音频,提供关于到达音频传感器的环境音频时差的声学观察。 该方法确定对应于对应于联合概率分布的最大值的对象的坐标位置,其表征从空间区域中的每个坐标位置发出的声学观察的概率和给定深度传感器的空间区域中的每个坐标位置的概率 观察。
-
公开(公告)号:US08558894B2
公开(公告)日:2013-10-15
申请号:US12947191
申请日:2010-11-16
申请人: Kar-Han Tan , Bowon Lee
发明人: Kar-Han Tan , Bowon Lee
IPC分类号: H04N5/225
摘要: A method for presentation interaction. The method includes, receiving by a computer system an indication of a manual selection of a region proximate to an audience member of an audience wherein the indication is received via an interaction with a displayed image of the audience. The method also includes utilizing a microphone array communicatively coupled with a beam-forming component of the computer system to focus audio pickup from the region proximate to the audience member in response to receiving the indication. The method also includes displaying an enhanced image of the region proximate to the audience member using the computer system in response to receiving the indication.
摘要翻译: 呈现交互的方法。 该方法包括:由计算机系统接收关于接近受众的观众成员的区域的手动选择的指示,其中通过与观众的显示图像的交互来接收该指示。 该方法还包括利用与计算机系统的波束形成部件通信耦合的麦克风阵列,以便响应于接收到指示,将来自接近受众成员的区域的音频拾取聚焦。 该方法还包括响应于接收到指示而使用计算机系统显示接近受众成员的区域的增强图像。
-
公开(公告)号:US08411126B2
公开(公告)日:2013-04-02
申请号:US12822532
申请日:2010-06-24
申请人: Bowon Lee , Ton Kalker , Kar Han Tan
发明人: Bowon Lee , Ton Kalker , Kar Han Tan
摘要: Disclosed herein are multimedia-conferencing systems and methods enabling local participants to hear remote participants from the direction the remote participants are rendered on a display. In one aspect, a method includes a computing device receives a remote participant's image and sound information collected at a remote site. The remote participant's image is rendered on a display at a local site. When the local participant is in close proximity to the display, sounds generated by the remote participant are played over stereo loudspeakers so that the local participant perceives the sounds as emanating from the remote participant's location rendered on the display.
摘要翻译: 这里公开的是多媒体会议系统和方法,使得本地参与者能够从远程参与者在显示器上呈现的方向听到远程参与者。 一方面,一种方法包括计算设备接收在远程站点处收集的远程参与者的图像和声音信息。 远程参与者的图像呈现在本地站点的显示器上。 当本地参与者靠近显示器时,由远程参与者产生的声音通过立体声扬声器播放,使得本地参与者将声音从显示器上呈现的远程参与者的位置感知为发出的声音。
-
公开(公告)号:US20120124602A1
公开(公告)日:2012-05-17
申请号:US12947191
申请日:2010-11-16
申请人: Kar-Han TAN , Bowon Lee
发明人: Kar-Han TAN , Bowon Lee
IPC分类号: H04N7/16
摘要: A method for presentation interaction. The method includes, receiving by a computer system an indication of a manual selection of a region proximate to an audience member of an audience wherein the indication is received via an interaction with a displayed image of the audience. The method also includes utilizing a microphone array communicatively coupled with a beam-forming component of the computer system to focus audio pickup from the region proximate to the audience member in response to receiving the indication. The method also includes displaying an enhanced image of the region proximate to the audience member using the computer system in response to receiving the indication.
摘要翻译: 呈现交互的方法。 该方法包括:由计算机系统接收关于接近受众的观众成员的区域的手动选择的指示,其中通过与观众的显示图像的交互来接收该指示。 该方法还包括利用与计算机系统的波束形成部件通信耦合的麦克风阵列,以便响应于接收到指示,将来自接近受众成员的区域的音频拾取聚焦。 该方法还包括响应于接收到指示而使用计算机系统显示接近受众成员的区域的增强图像。
-
公开(公告)号:US20110316966A1
公开(公告)日:2011-12-29
申请号:US12822532
申请日:2010-06-24
申请人: Bowon Lee , Ton Kalker , Kar Han Tan
发明人: Bowon Lee , Ton Kalker , Kar Han Tan
IPC分类号: H04N7/15
摘要: Disclosed herein are multimedia-conferencing systems and methods enabling local participants to hear remote participants from the direction the remote participants are rendered on a display. In one aspect, a method includes a computing device receives a remote participant's image and sound information collected at a remote site. The remote participant's image is rendered on a display at a local site. When the local participant is in close proximity to the display, sounds generated by the remote participant are played over stereo loudspeakers so that the local participant perceives the sounds as emanating from the remote participant's location rendered on the display.
摘要翻译: 这里公开的是多媒体会议系统和方法,使得本地参与者能够从远程参与者在显示器上呈现的方向听到远程参与者。 一方面,一种方法包括计算设备接收在远程站点处收集的远程参与者的图像和声音信息。 远程参与者的图像呈现在本地站点的显示器上。 当本地参与者靠近显示器时,由远程参与者产生的声音通过立体声扬声器播放,使得本地参与者将声音从显示器上呈现的远程参与者的位置感知为发出的声音。
-
公开(公告)号:US20120093336A1
公开(公告)日:2012-04-19
申请号:US12904921
申请日:2010-10-14
申请人: Amir Said , Bowon Lee , Ton Kalker
发明人: Amir Said , Bowon Lee , Ton Kalker
IPC分类号: H04R3/00
CPC分类号: G10L25/78 , G01S5/18 , H04R3/005 , H04R2201/405
摘要: Systems and methods for performing sound source localization are provided. In one aspect, a method for locating a sound source using a computing device subdivides a space into subregions. The method then computes a sound source power for each of subregions and determines which of the sound source energies is the largest. When the volume of the subregion is less than a threshold volume, the method outputs the subregion having the largest sound source power. Otherwise, the stages of partitioning, computing, and determining the subregion having the largest sound source power is repeated.
摘要翻译: 提供了用于执行声源定位的系统和方法。 一方面,使用计算设备定位声源的方法将空间细分为子区域。 该方法然后计算每个子区域的声源功率,并确定哪个声源能量最大。 当该子区域的音量小于阈值音量时,该方法输出具有最大声源功率的子区域。 否则,重复分区,计算和确定具有最大声源功率的子区域的阶段。
-
公开(公告)号:US08553904B2
公开(公告)日:2013-10-08
申请号:US12904921
申请日:2010-10-14
申请人: Amir Said , Bowon Lee , Ton Kalker
发明人: Amir Said , Bowon Lee , Ton Kalker
IPC分类号: H04R3/00
CPC分类号: G10L25/78 , G01S5/18 , H04R3/005 , H04R2201/405
摘要: Systems and methods for performing sound source localization are provided. In one aspect, a method for locating a sound source using a computing device subdivides a space into subregions. The method then computes a sound source power for each of subregions and determines which of the sound source energies is the largest. When the volume of the subregion is less than a threshold volume, the method outputs the subregion having the largest sound source power. Otherwise, the stages of partitioning, computing, and determining the subregion having the largest sound source power is repeated.
摘要翻译: 提供了用于执行声源定位的系统和方法。 一方面,使用计算设备定位声源的方法将空间细分为子区域。 该方法然后计算每个子区域的声源功率,并确定哪个声源能量最大。 当该子区域的音量小于阈值音量时,该方法输出具有最大声源功率的子区域。 否则,重复分区,计算和确定具有最大声源功率的子区域的阶段。
-
公开(公告)号:US08451315B2
公开(公告)日:2013-05-28
申请号:US12956033
申请日:2010-11-30
申请人: Bowon Lee
发明人: Bowon Lee
IPC分类号: H04N7/15
摘要: Embodiments of the present invention disclose a system and method for distributed meeting capture. According to one embodiment, the system includes a plurality of personal devices configured to capture video data and audio data associated with at least one operating user. A media hub includes a plurality of I/O ports and is configured to receive video and audio data from the plurality of personal devices. In addition, the media hub is configured to collect the video data and/or audio data from the plurality of personal devices and output at least one audio-visual data stream for facilitating video conferencing over a network.
摘要翻译: 本发明的实施例公开了一种用于分布式会议捕获的系统和方法。 根据一个实施例,系统包括被配置为捕获与至少一个操作用户相关联的视频数据和音频数据的多个个人设备。 媒体集线器包括多个I / O端口,并且被配置为从多个个人设备接收视频和音频数据。 此外,媒体集线器被配置为从多个个人设备收集视频数据和/或音频数据,并输出至少一个视听数据流,以促进通过网络的视频会议。
-
公开(公告)号:US08218780B2
公开(公告)日:2012-07-10
申请号:US12484686
申请日:2009-06-15
CPC分类号: H04M9/082
摘要: Various embodiments of the present invention are directed to methods for dereverberation of audio generated in a room. In one aspect, a method for dereverberating reverberant digital signals comprises transforming a reverberant digital signal from the time domain into Fourier domain signals using a computing device, each Fourier domain signal corresponding to a subband. For each subband of the Fourier domain signal, the method computes autoregressive model coefficients of the reverberation with the current and previous magnitudes of the Fourier digital signal, and inverse filters the magnitude of the Fourier domain signal using the computing device, based on the autoregressive model coefficients and previous magnitudes of the Fourier digital signal. The method includes inverse transforming the Fourier domain signals with filtered magnitudes into an approximate dereverberated digital signal.
摘要翻译: 本发明的各种实施例涉及用于在室内产生的音频的混响的方法。 一方面,一种用于去混响混响数字信号的方法包括使用计算装置将混响数字信号从时域变换成傅立叶域信号,每个傅立叶域信号对应于子带。 对于傅立叶域信号的每个子带,该方法利用傅里叶数字信号的当前和先前幅度来计算混响的自回归模型系数,并且使用计算装置基于自回归模型对傅立叶域信号的幅度进行滤波 傅里叶数字信号的系数和先前幅度。 该方法包括将具有滤波幅度的傅立叶域信号逆变换为近似的非反相数字信号。
-
-
-
-
-
-
-
-
-