专利检索 ap:("Randall B. Baird" OR "Scott S. Firestone" OR "Luke K. Surazski" OR "Duanpei Wu") AND inv:"Duanpei Wu" 第 1 页

1.

发明授权
System and method for performing distributed multipoint video conferencing 有权
标题翻译：执行分布式多点视频会议的系统和方法

公开(公告)号：US08614732B2

公开(公告)日：2013-12-24

申请号：US11210325

申请日：2005-08-24

申请人： Randall B. Baird , Scott S. Firestone , Luke K. Surazski , Duanpei Wu

发明人： Randall B. Baird , Scott S. Firestone , Luke K. Surazski , Duanpei Wu

IPC分类号： H04N7/15

CPC分类号： H04N7/15 , H04N7/152

摘要： According to an embodiment of the present invention, an apparatus for performing video conferencing is provided that includes an I-frame injector element operable to intercept I-frame requests from one or more end points and to attempt to service the I-frame requests such that at least a portion of the requests are prevented from propagating back to an originating sender. In more specific embodiments, when a receiver endpoint sends a fast video update (FVU) request upstream, it is intercepted by the I-frame injector element and rather than passing the FVU request to the sender the I-frame injector element replaces a next P-frame from the sender with an I-frame, whereby the I-frame is constructed so that when decoded, it matches the P-frame that it replaced. In still more detailed embodiments, the I-frame injector element operates in one of three modes that are associated with bandwidth parameters.

摘要翻译： 根据本发明的实施例，提供了一种用于执行视频会议的装置，其包括I帧注入器元件，其可操作以截取来自一个或多个端点的I帧请求并尝试对I帧请求进行服务，使得至少一部分请求被阻止传播回始发端。在更具体的实施例中，当接收机端点在上游发送快速视频更新（FVU）请求时，它被I帧注入器元件拦截，而不是将FVU请求传递给发送器，I帧注入器元件替换下一个P - 帧从具有I帧的发送器发送，由此构造I帧，使得当被解码时，它与其所取代的P帧匹配。在更详细的实施例中，I帧注入器元件以与带宽参数相关联的三种模式之一操作。

2.

发明授权
System and method for performing distributed video conferencing 有权
标题翻译：用于执行分布式视频会议的系统和方法

公开(公告)号：US07477282B2

公开(公告)日：2009-01-13

申请号：US11180826

申请日：2005-07-12

申请人： Scott S. Firestone , Walter R. Friedrich , Nermin M. Ismail , Keith A. Lantz , Shantanu Sarkar , Luke K. Surazski , Duanpei Wu

发明人： Scott S. Firestone , Walter R. Friedrich , Nermin M. Ismail , Keith A. Lantz , Shantanu Sarkar , Luke K. Surazski , Duanpei Wu

IPC分类号： H04N7/14 , H04M3/42

CPC分类号： H04L65/605 , H04L12/1822 , H04L65/4046 , H04M3/567 , H04N7/15 , H04N7/152

摘要： A method for executing a video conference is provided that includes receiving one or more audio streams associated with a video conference from one or more end points and determining an active speaker associated with one of the end points. Audio information associated with the active speaker may be received at one or more media switches. One or more video streams may be suppressed except for a selected video stream associated with the active speaker, the selected video stream propagating to one or more of the media switches during the video conference. The selected video stream may be replicated such that it may be communicated to one or more of the end points associated with a selected one of the media switches.

摘要翻译： 提供一种用于执行视频会议的方法，其包括从一个或多个端点接收与视频会议相关联的一个或多个音频流，并且确定与所述终点中的一个相关联的活动扬声器。可以在一个或多个媒体交换机处接收与主动扬声器相关联的音频信息。除了与活动扬声器相关联的所选视频流之外，可以抑制一个或多个视频流，所选择的视频流在视频会议期间传播到一个或多个媒体交换机。可以复制所选择的视频流，使得其可以被传送到与所选择的一个媒体交换机相关联的一个或多个终点。

3.

发明授权
System and method for performing distributed video conferencing 有权
标题翻译：用于执行分布式视频会议的系统和方法

公开(公告)号：US08659636B2

公开(公告)日：2014-02-25

申请号：US10680918

申请日：2003-10-08

申请人： Scott S. Firestone , Walter R. Friedrich , Nermin M. Ismail , Keith A. Lantz , Shantanu Sarkar , Luke K. Surazski , Duanpei Wu

发明人： Scott S. Firestone , Walter R. Friedrich , Nermin M. Ismail , Keith A. Lantz , Shantanu Sarkar , Luke K. Surazski , Duanpei Wu

IPC分类号： H04N7/14

CPC分类号： H04N7/14 , H04L12/1822 , H04L65/4046 , H04L65/605 , H04M3/567 , H04M2207/20 , H04N7/15

摘要： A method for executing a video conference is provided that includes receiving one or more audio streams associated with a video conference from one or more end points and determining an active speaker associated with one of the end points. Audio information associated with the active speaker may be received at one or more media switches. One or more video streams may be suppressed except for a selected video stream associated with the active speaker, the selected video stream propagating to one or more of the media switches during the video conference. The selected video stream may be replicated such that it may be communicated to one or more of the end points associated with a selected one of the media switches.

摘要翻译： 提供一种用于执行视频会议的方法，其包括从一个或多个端点接收与视频会议相关联的一个或多个音频流，并且确定与所述终点中的一个相关联的活动扬声器。可以在一个或多个媒体交换机处接收与主动扬声器相关联的音频信息。除了与活动扬声器相关联的所选视频流之外，可以抑制一个或多个视频流，所选择的视频流在视频会议期间传播到一个或多个媒体交换机。可以复制所选择的视频流，使得其可以被传送到与所选择的一个媒体交换机相关联的一个或多个终点。

4.

发明授权
System and method for providing video conferencing synchronization 有权
标题翻译：用于提供视频会议同步的系统和方法

公开(公告)号：US07084898B1

公开(公告)日：2006-08-01

申请号：US10715687

申请日：2003-11-18

申请人： Scott S. Firestone , Walter R. Friedrich , Nermin M. Ismail , Keith A. Lantz , Shantanu Sarkar , Luke K. Surazski , Duanpei Wu

发明人： Scott S. Firestone , Walter R. Friedrich , Nermin M. Ismail , Keith A. Lantz , Shantanu Sarkar , Luke K. Surazski , Duanpei Wu

IPC分类号： H04N7/14

CPC分类号： H04N7/152

摘要： An audio mixer on a first device receives one or more incoming audio streams. Each of the one or more incoming audio streams has an associated timestamp. The audio mixer generates a mixed audio stream from the one or more incoming audio streams. The audio mixer determines differences in the time base of each of the one or more incoming audio streams and the time base for the mixed audio stream. The audio mixer generates mapping parameters associated with the determined differences and transforms the timestamp of each of the one or more incoming audio streams to a corresponding output timestamp associated with the mixed audio stream according to the mapping parameters. the mapping parameters are provided to a video mixer for similar processing and transformation such that the mixed audio stream is in synchronization with a mixed video stream.

摘要翻译： 第一设备上的音频混合器接收一个或多个输入音频流。一个或多个输入音频流中的每一个具有关联的时间戳。音频混合器从一个或多个输入音频流产生混合音频流。音频混合器确定一个或多个输入音频流中的每一个的时基和混合音频流的时基的差异。音频混合器产生与确定的差异相关联的映射参数，并且根据映射参数将一个或多个输入音频流中的每一个的时间戳转换为与混合音频流相关联的对应输出时间戳。将映射参数提供给用于类似处理和变换的视频混合器，使得混合音频流与混合视频流同步。

5.

发明授权
System and method for performing distributed video conferencing 有权

公开(公告)号：US06989856B2

公开(公告)日：2006-01-24

申请号：US10703859

申请日：2003-11-06

申请人： Scott S. Firestone , Walter R. Friedrich , Nermin M. Ismail , Keith A. Lantz , Shantanu Sarkar , Luke K. Surazski , Duanpei Wu

发明人： Scott S. Firestone , Walter R. Friedrich , Nermin M. Ismail , Keith A. Lantz , Shantanu Sarkar , Luke K. Surazski , Duanpei Wu

IPC分类号： H04N7/14

CPC分类号： H04N7/14 , H04L12/1822 , H04L65/4046 , H04L65/605 , H04M3/567 , H04M2207/20 , H04N7/15

摘要： A method for executing a video conference is provided that includes receiving one or more audio streams associated with a video conference from one or more end points and determining an active speaker associated with one of the end points. Audio information associated with the active speaker may be received at one or more media switches. One or more video streams may be suppressed except for a selected video stream associated with the active speaker, the selected video stream propagating to one or more of the media switches during the video conference. The selected video stream may be replicated such that it may be communicated to one or more of the end points associated with a selected one of the media switches.

6.

发明授权
System for concealing missing audio waveforms 有权
标题翻译：隐藏丢失音频波形的系统

公开(公告)号：US08340078B1

公开(公告)日：2012-12-25

申请号：US11644062

申请日：2006-12-21

申请人： Duanpei Wu , Luke K. Surazski

发明人： Duanpei Wu , Luke K. Surazski

IPC分类号： H04L12/66 , G10L11/04 , G10L11/06

CPC分类号： G10L25/90 , G10L19/005 , G10L21/047

摘要： In one embodiment, a method can include: (i) establishing an internet protocol (IP) connection; (ii) forming a buffered version of a plurality of voice frame slices from received audio packets; and (iii) when an erasure is detected, performing a packet loss concealment (PLC) to provide a synthesized speech signal for the erasure, where the PLC can include: (a) identifying first and second pitches from the buffered version of the plurality of voice frame slices; and (b) forming the synthesized speech signal by using the first and second pitches, and more if needed, followed by an overlay-add (OLA).

摘要翻译： 在一个实施例中，一种方法可以包括：（i）建立因特网协议（IP）连接; （ii）从接收到的音频分组形成多个语音帧片段的缓冲版本; 和（iii）当检测到擦除时，执行分组丢失隐藏（PLC）以提供用于擦除的合成语音信号，其中PLC可以包括：（a）从缓冲版本中识别第一和第二间距语音片段; 和（b）通过使用第一和第二音调形成合成的语音信号，如果需要，则更多，然后是叠加（OLA）。

7.

发明申请
Method and architecture for distributed video switching using media notifications 有权
标题翻译：使用媒体通知进行分布式视频切换的方法和架构

公开(公告)号：US20070153712A1

公开(公告)日：2007-07-05

申请号：US11327541

申请日：2006-01-05

申请人： Steven Fry , Thiyagesan Ramalingam , Nermin Ismail , Walter Friedrich , Duanpei Wu

发明人： Steven Fry , Thiyagesan Ramalingam , Nermin Ismail , Walter Friedrich , Duanpei Wu

IPC分类号： H04L12/16

CPC分类号： H04L29/06027 , H04L49/355 , H04L65/4046 , H04L65/607 , H04N7/152

摘要： Disclosed are video conferencing systems, devices, architectures, and methods for using media notifications to coordinate switching between video in a distributed arrangement. An exemplary media switch in accordance with embodiments can include: a first interface configured for a first type communication with an endpoint; a second interface configured for the first type communication with another media switch, the second interface being configured to receive a first video stream having a first characteristic and a second video stream having a second characteristic; a third interface configured for a second type communication with a stream controller, the stream controller being configured to provide a notification; and a fourth interface configured for the second type communication with a controlling server, whereby the media switch is configured to re-target an active stream in response to the notification or a difference between the first and second characteristics.

摘要翻译： 公开了用于使用媒体通知来协调分布式布置中的视频之间的切换的视频会议系统，设备，架构和方法。根据实施例的示例性媒体交换机可以包括：被配置用于与端点进行第一类型通信的第一接口; 第二接口，被配置用于与另一媒体交换机的第一类型通信，所述第二接口被配置为接收具有第一特征的第一视频流和具有第二特征的第二视频流; 配置用于与流控制器的第二类型通信的第三接口，所述流控制器被配置为提供通知; 以及配置用于与控制服务器的第二类型通信的第四接口，由此媒体交换机被配置为响应于该通知或第一和第二特性之间的差异而重新定位活动流。

8.

发明授权
Weighted frequency-channel background noise suppressor 失效
标题翻译：加权频道背景噪声抑制器

公开(公告)号：US06826528B1

公开(公告)日：2004-11-30

申请号：US09691878

申请日：2000-10-18

申请人： Duanpei Wu , Miyuki Tanaka , Xavier Menendez-Pidal

发明人： Duanpei Wu , Miyuki Tanaka , Xavier Menendez-Pidal

IPC分类号： G10L2102

CPC分类号： G10L21/0208 , G10L21/0232 , G10L25/18 , G10L25/78

摘要： A method for implementing a noise suppressor in a speech recognition system comprises a filter bank for separating source speech data into discrete frequency sub-bands to generate filtered channel energy, and a noise suppressor for weighting the frequency sub-bands to improve the signal-to-noise ratio of the resultant noise-suppressed channel energy. The noise suppressor preferably includes a noise calculator for calculating background noise values, a speech energy calculator for calculating speech energy values for each channel of the filter bank, and a weighting module for applying calculated weighting values to the projected channel energy to generate the noise-suppressed channel energy.

摘要翻译： 一种用于在语音识别系统中实现噪声抑制器的方法包括：滤波器组，用于将源语音数据分离成离散频率子带以产生经滤波的信道能量;以及噪声抑制器，用于对频率子带进行加权以改善信号到噪声抑制通道能量的噪声比。噪声抑制器优选地包括用于计算背景噪声值的噪声计算器，用于计算滤波器组的每个通道的语音能量值的语音能量计算器，以及用于将计算的加权值应用于投影的通道能量以产生噪声抑制器的加权模块，抑制通道能量。

9.

发明授权
Speech detection with noise suppression based on principal components analysis 失效
标题翻译：基于主成分分析的噪声抑制语音检测

公开(公告)号：US06230122B1

公开(公告)日：2001-05-08

申请号：US09176178

申请日：1998-10-21

申请人： Duanpei Wu , Miyuki Tanaka , Mariscela Amador-Hernandez

发明人： Duanpei Wu , Miyuki Tanaka , Mariscela Amador-Hernandez

IPC分类号： G10L2102

CPC分类号： G10L21/0208 , G10L21/0232

摘要： A method for effectively suppressing background noise in a speech detection system comprises a filter bank for separating source speech data into discrete frequency sub-bands to generate filtered channel energy, and a noise suppressor for weighting the frequency sub-bands to improve the signal-to-noise ratio of the resultant noise-suppressed channel energy. The noise suppressor preferably includes a subspace module for using a Karhunen-Loeve transformation to create a subspace based on the background noise, a projection module for generating projected channel energy by projecting the filtered channel energy onto the created subspace, and a weighting module for applying calculated weighting values to the projected channel energy to generate the noise-suppressed channel energy.

摘要翻译： 一种用于有效地抑制语音检测系统中的背景噪声的方法包括用于将源语音数据分离成离散频率子带以产生经滤波的信道能量的滤波器组，以及用于对频率子带进行加权以改善信号到噪声抑制通道能量的噪声比。噪声抑制器优选地包括用于使用Karhunen-Loeve变换来创建基于背景噪声的子空间的子空间模块，用于通过将滤波的信道能量投影到所创建的子空间上来产生投影通道能量的投影模块，以及用于应用的加权模块计算加权值到投影通道能量以产生噪声抑制的通道能量。

10.

发明授权
Method for implementing a speech recognition system to determine speech endpoints during conditions with background noise 失效
标题翻译：用于在具有背景噪声的条件下实现语音识别系统以确定语音端点的方法

公开(公告)号：US06216103B1

公开(公告)日：2001-04-10

申请号：US08957875

申请日：1997-10-20

申请人： Duanpei Wu , Miyuki Tanaka , Ruxin Chen , Lex Olorenshaw

发明人： Duanpei Wu , Miyuki Tanaka , Ruxin Chen , Lex Olorenshaw

IPC分类号： G01L300

CPC分类号： G10L25/87 , G10L15/20

摘要： A method for implementing a speech recognition system for use during conditions with background noise includes the steps of calculating, in real-time, sequential short-term delta energy parameters for speech energy from a spoken utterance, determining threshold values in the speech energy, and identifying a beginning point and an ending point for the spoken utterance based on the relationship between the threshold values and the short-term delta energy parameters.

摘要翻译： 用于实现在具有背景噪声的条件期间使用的语音识别系统的方法包括以下步骤：实时地从语音话语中计算语音能量的连续短期增量能量参数，确定语音能量中的阈值，以及基于阈值和短期δ能量参数之间的关系来识别口语发音的起始点和终点。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类