Patent search ap:("Bin Yu" OR "Yong Rui") AND inv:"Yong Rui" Page 10

91.

Patent application
Multi-Label Active Learning (status: active)

Publication No.: US20090125461A1

Publication date: 2009-05-14

Application No.: US11958050

Filing date: 2007-12-17

Applicant: Guo-Jun Qi, Xian-Sheng Hua, Yong Rui, Hong-Jiang Zhang, Shipeng Li

Inventor: Guo-Jun Qi, Xian-Sheng Hua, Yong Rui, Hong-Jiang Zhang, Shipeng Li

IPC: G06F15/18

CPC classification number: G06N99/005

Abstract: Multi-label active learning may entail training a classifier with a set of training samples having multiple labels per sample. In an example embodiment, a method includes accepting a set of training samples, with the set of training samples having multiple respective samples that are each respectively associated with multiple labels. The set of training samples is analyzed to select a sample-label pair responsive to at least one error parameter. The selected sample-label pair is then submitted to an oracle for labeling.
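
The selection step in this abstract can be illustrated with a minimal sketch. The abstract does not say which error parameter is used, so the code below substitutes a simple stand-in: the binary entropy of a classifier's predicted probability for each sample-label pair. The function names and the probability table are hypothetical and are not taken from the patent.

```python
import math


def binary_entropy(p):
    """Entropy (in bits) of a Bernoulli(p) prediction; 0 when p is 0 or 1."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))


def select_sample_label_pair(probabilities, labeled_pairs):
    """Return the not-yet-labeled (sample, label) pair with the highest entropy.

    probabilities: dict of sample id -> dict of label -> P(label applies)
    labeled_pairs: set of (sample id, label) pairs the oracle already answered
    Entropy is an assumed stand-in for the patent's unspecified error parameter.
    """
    best_pair, best_score = None, -1.0
    for sample_id, per_label in probabilities.items():
        for label, p in per_label.items():
            if (sample_id, label) in labeled_pairs:
                continue  # the oracle has already labeled this pair
            score = binary_entropy(p)
            if score > best_score:
                best_pair, best_score = (sample_id, label), score
    return best_pair


# Hypothetical classifier outputs for two images and two labels.
probabilities = {
    "img1": {"sky": 0.95, "water": 0.52},
    "img2": {"sky": 0.10, "water": 0.80},
}
already_labeled = {("img1", "sky")}
print(select_sample_label_pair(probabilities, already_labeled))
```

The returned pair is the one that would be submitted to the oracle; after it is labeled, the pair is added to `labeled_pairs`, the classifier is retrained, and the selection repeats.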

92.

Granted patent
Decentralized architecture and protocol for voice conferencing (status: active)

Publication No.: US07417983B2

Publication date: 2008-08-26

Application No.: US11277905

Filing date: 2006-03-29

Applicant: Li-wei He, Dinei A. Florencio, Yong Rui

Inventor: Li-wei He, Dinei A. Florencio, Yong Rui

IPC: H04L12/66

CPC classification number: H04L12/1822, H04L67/104, H04L67/1061

Abstract: A decentralized computer network architecture and method that gathers metadata from local and remote clients and, based on that metadata, locally makes a decision whether to send a packet over the network. Each client listens to what other clients are doing, and only sends when the total number of concurrent speakers is below some threshold. In a multi-party voice conferencing embodiment, the threshold is a number of concurrent speakers that is restricted to less than a certain number. Under the decentralized computer network architecture, the type of network topology used to connect the clients is flexible, as long as each client is running a peer-aware system to decide locally whether to send their packets. The decentralized computer network architecture and method is distributed to run on each client, making it suitable for a wide variety of network topologies (such as full-mesh, bridge-based, or a hybrid of the two).
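
The local send decision described in this abstract can be sketched in a few lines. This is only an illustration under stated assumptions: the function name, its parameters, and the way speaker metadata is represented (a set of client ids) are hypothetical and do not come from the patent.

```python
def should_send(is_speaking, active_speakers, max_concurrent_speakers):
    """Decide locally whether this client should send its next audio packet.

    is_speaking: True if this client currently has speech to send
    active_speakers: set of ids of the *other* clients heard speaking right now
    max_concurrent_speakers: the threshold on concurrent speakers (assumed name)
    """
    if not is_speaking:
        return False  # nothing worth sending
    # Send only while the number of concurrent speakers is below the threshold.
    return len(active_speakers) < max_concurrent_speakers


# Each client runs the same check on its own view of the conference.
print(should_send(True, {"alice"}, 2))         # one other speaker, limit of two
print(should_send(True, {"alice", "bob"}, 2))  # limit already reached
```

Because every client evaluates the rule on its own metadata, no central server has to arbitrate who may speak, which is what makes the approach independent of the network topology.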

93.

Granted patent
Automated video production system and method using expert video production rules for online publishing of lectures (status: lapsed)

Publication No.: US07349005B2

Publication date: 2008-03-25

Application No.: US09681835

Filing date: 2001-06-14

Applicant: Yong Rui, Anoop Gupta, Qiong Liu

Inventor: Yong Rui, Anoop Gupta, Qiong Liu

IPC: H04N7/14

CPC classification number: H04N7/188, H04N7/181

Abstract: An automated system and method for producing videos using expert video production rules. The automated video production system and method of the present invention is particularly well-suited for the online publishing of lectures. The system of the present invention includes a camera system for capturing the lecture, such as an audience-tracking camera and a non-intrusive lecturer-tracking camera. Moreover, each camera may be controlled by a cinematographer. Tracking of a subject is performed using a history-based, reduced-motion tracker that sets up a camera shot based on the subject's movement history and leaves the shot fixed until the camera is switched. The system also includes a virtual director module having a probabilistic finite state machine (FSM) module that uses probabilistic rules and the expert video production rules to determine a current camera view.
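
A probabilistic finite state machine of the kind mentioned for the virtual director can be sketched as a table of weighted transitions between camera views. The view names, the weights, and the rule that a shot never transitions to itself are invented for illustration; the abstract does not list the actual production rules.

```python
import random

# Hypothetical transition table: current view -> {next view: probability}.
# Omitting self-transitions stands in for an assumed rule such as
# "always cut to a different camera when switching".
TRANSITIONS = {
    "lecturer": {"audience": 0.3, "overview": 0.7},
    "audience": {"lecturer": 0.8, "overview": 0.2},
    "overview": {"lecturer": 0.9, "audience": 0.1},
}


def next_view(current_view, rng):
    """Draw the next camera view according to the transition probabilities."""
    options = TRANSITIONS[current_view]
    views = list(options)
    weights = [options[view] for view in views]
    return rng.choices(views, weights=weights, k=1)[0]


# Simulate a short sequence of camera switches with a seeded generator.
rng = random.Random(0)
view = "lecturer"
sequence = [view]
for _ in range(5):
    view = next_view(view, rng)
    sequence.append(view)
print(sequence)
```

Keeping the rules in a data table rather than in code makes it straightforward to adjust the probabilities without touching the state machine itself.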

94.

Patent application
Identification Of People Using Multiple Types Of Input (status: active)

Publication No.: US20070297682A1

Publication date: 2007-12-27

Application No.: US11425967

Filing date: 2006-06-22

Applicant: Cha Zhang, Paul A. Viola, Pei Yin, Ross G. Cutler, Xinding Sun, Yong Rui

Inventor: Cha Zhang, Paul A. Viola, Pei Yin, Ross G. Cutler, Xinding Sun, Yong Rui

IPC: G06K9/46, G06K9/62

CPC classification number: G06K9/6256, G06K9/4614, G10L25/78, G10L2021/02166, H04N7/147, H04N7/15, H04N21/42203, H04N21/4223, H04N21/4394, H04N21/44008, H04N21/44213, H04N21/4788

Abstract: Systems and methods for detecting people or speakers in an automated fashion are disclosed. A pool of features including more than one type of input (like audio input and video input) may be identified and used with a learning algorithm to generate a classifier that identifies people or speakers. The resulting classifier may be evaluated to detect people or speakers.

95.

Granted patent
Skimming continuous multimedia content (status: lapsed)

Publication No.: US07293280B1

Publication date: 2007-11-06

Application No.: US09565819

Filing date: 2000-05-05

Applicant: Anoop Gupta, Li-Wei He, Francis C. Li, Yong Rui

Inventor: Anoop Gupta, Li-Wei He, Francis C. Li, Yong Rui

IPC: H04N7/173, H04N5/93

CPC classification number: H04N21/26603, G06F17/241, G11B27/105, G11B27/11, G11B27/28, G11B2220/20, H04N5/76, H04N5/781, H04N5/85, H04N5/907, H04N7/17318, H04N9/8042, H04N21/23418, H04N21/4312, H04N21/4314, H04N21/4334, H04N21/440281, H04N21/4622, H04N21/4782, H04N21/6125, H04N21/84, H04N21/8456

Abstract: A program distribution system includes a plurality of set-top boxes that receive broadcast programming and segmentation data from content and information providers. The segmentation information indicates portions of programs that are to be included in skimmed or condensed versions of the received programming, and is produced using manual or automated methods. Automated methods include the use of ancillary production data to detect the most important parts of a program. A user interface allows a user to control time scale modification and skimming during playback, and also allows the user to easily browse to different points within the current program.

96.

Patent application
Image segmentation using spatial-color Gaussian mixture models (status: lapsed)

Publication No.: US20070237393A1

Publication date: 2007-10-11

Application No.: US11393576

Filing date: 2006-03-30

Applicant: Cha Zhang, Michael Cohen, Yong Rui, Ting Yu

Inventor: Cha Zhang, Michael Cohen, Yong Rui, Ting Yu

IPC: G06K9/34

CPC classification number: G06K9/38, G06T7/11, G06T7/143, G06T7/162, G06T7/194, G06T2207/10016, G06T2207/10024, G06T2207/20072

Abstract: A spatial-color Gaussian mixture model (SCGMM) image segmentation technique for segmenting images. The SCGMM image segmentation technique specifies foreground objects in the first frame of an image sequence, either manually or automatically. From the initial segmentation, the SCGMM segmentation system learns two spatial-color Gaussian mixture models (SCGMM) for the foreground and background objects. These models are built into a first-order Markov random field (MRF) energy function. The minimization of the energy function leads to a binary segmentation of the images in the image sequence, which can be solved efficiently using a conventional graph cut procedure.

97.

Granted patent
Automatic detection and tracking of multiple individuals using multiple cues (status: active)

Publication No.: US07130446B2

Publication date: 2006-10-31

Application No.: US10006927

Filing date: 2001-12-03

Applicant: Yong Rui, Yunqiang Chen

Inventor: Yong Rui, Yunqiang Chen

IPC: G06K9/00

CPC classification number: G06K9/00234, G06T7/251, G06T2207/10016, G06T2207/30196, G06T2207/30201

Abstract: Automatic detection and tracking of multiple individuals includes receiving a frame of video and/or audio content and identifying a candidate area for a new face region in the frame. One or more hierarchical verification levels are used to verify whether a human face is in the candidate area, and an indication made that the candidate area includes a face if the one or more hierarchical verification levels verify that a human face is in the candidate area. A plurality of audio and/or video cues are used to track each verified face in the video content from frame to frame.

98.

Granted patent
System and process for time delay estimation in the presence of correlated noise and reverberation (status: active)

Publication No.: US07113605B2

Publication date: 2006-09-26

Application No.: US11182633

Filing date: 2005-07-14

Applicant: Yong Rui, Dinei Florencio

Inventor: Yong Rui, Dinei Florencio

IPC: H04R3/00, H04B15/00, H03B29/00

CPC classification number: H04R3/005, H04R2430/23

Abstract: A system and process for estimating the time delay of arrival (TDOA) between a pair of audio sensors of a microphone array is presented. Generally, a generalized cross-correlation (GCC) technique is employed. However, this technique is improved to include provisions for both reducing the influence (including interference) from correlated ambient noise and reverberation noise in the sensor signals prior to computing the TDOA estimate. Two unique correlated ambient noise reduction procedures are also proposed. One involves the application of Wiener filtering, and the other a combination of Wiener filtering with a Gnn subtraction technique. In addition, two unique reverberation noise reduction procedures are proposed. Both involve applying a weighting factor to the signals prior to computing the TDOA which combines the effects of a traditional maximum likelihood (TML) weighting function and a phase transformation (PHAT) weighting function.
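
For orientation, the generalized cross-correlation baseline that this abstract builds on can be sketched with plain PHAT weighting. This is not the patented method: it omits the Wiener filtering, the Gnn subtraction, and the combined maximum-likelihood/PHAT weighting, and it assumes a circular (wrap-around) delay between two equal-length signals. The function names and the test signals are hypothetical; a naive DFT is used only to keep the sketch dependency-free.

```python
import cmath


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform (adequate for short signals)."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        total = 0j
        for t in range(n):
            total += x[t] * cmath.exp(sign * 2j * cmath.pi * k * t / n)
        out.append(total / n if inverse else total)
    return out


def gcc_phat_delay(sig_a, sig_b):
    """Estimate the delay of sig_b relative to sig_a, in samples.

    Uses plain PHAT weighting: the cross-spectrum is normalized to unit
    magnitude so that only phase information shapes the correlation peak.
    """
    n = len(sig_a)
    spec_a = dft(sig_a)
    spec_b = dft(sig_b)
    cross = [b * a.conjugate() for a, b in zip(spec_a, spec_b)]
    whitened = [c / abs(c) if abs(c) > 1e-12 else 0j for c in cross]
    corr = [value.real for value in dft(whitened, inverse=True)]
    peak = max(range(n), key=lambda i: corr[i])
    # Indices past the midpoint correspond to negative delays.
    return peak if peak <= n // 2 else peak - n


# Two 16-sample signals: the second is the first delayed by 3 samples.
mic_a = [0.0] * 16
mic_b = [0.0] * 16
mic_a[2] = 1.0
mic_b[5] = 1.0
print(gcc_phat_delay(mic_a, mic_b))
```

Dividing the estimated delay in samples by the sampling rate gives the time delay of arrival in seconds.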

99.

Granted patent
System and process for time delay estimation in the presence of correlated noise and reverberation (status: lapsed)

Publication No.: US07039200B2

Publication date: 2006-05-02

Application No.: US10404219

Filing date: 2003-03-31

Applicant: Yong Rui, Dinei A. Florencio

Inventor: Yong Rui, Dinei A. Florencio

IPC: H04R3/00, H04N5/232

CPC classification number: H04R3/005, H04R2430/23

Abstract: A system and process for estimating the time delay of arrival (TDOA) between a pair of audio sensors of a microphone array is presented. Generally, a generalized cross-correlation (GCC) technique is employed. However, this technique is improved to include provisions for both reducing the influence (including interference) from correlated ambient noise and reverberation noise in the sensor signals prior to computing the TDOA estimate. Two unique correlated ambient noise reduction procedures are also proposed. One involves the application of Wiener filtering, and the other a combination of Wiener filtering with a Gnn subtraction technique. In addition, two unique reverberation noise reduction procedures are proposed. Both involve applying a weighting factor to the signals prior to computing the TDOA which combines the effects of a traditional maximum likelihood (TML) weighting function and a phase transformation (PHAT) weighting function.

100.

Patent application
Automated online broadcasting system and method using an omni-directional camera system for viewing meetings over a computer network (status: lapsed)

Publication No.: US20050285933A1

Publication date: 2005-12-29

Application No.: US11193702

Filing date: 2005-07-29

Applicant: Yong Rui, Anoop Gupta, Johnathan Cadiz, Ross Cutler

Inventor: Yong Rui, Anoop Gupta, Johnathan Cadiz, Ross Cutler

IPC: H04N7/14, H04N7/15

CPC classification number: H04N7/142, H04N7/147, H04N7/15

Abstract: An automated system and method for broadcasting meetings over a computer network. The meeting is filmed using an omni-directional camera system and is capable of being presented to a viewer both live and on-demand. The system of the present invention includes an automated camera management system for controlling the camera system and an analysis module determining the location of meeting participants in the meeting environments. The method of the present invention includes using the system of the present invention to broadcast an event to a viewer over a computer network. In particular, the method includes filming the event using an omni-directional camera system. Next, the method determines the location of each event participant in the event environment. Finally, a viewer is provided with a user interface for viewing the broadcast event. This user interface allows a viewer to choose which event participant the viewer would like to view.
