Patent search ap:("Bin Yu" OR "Yong Rui") AND inv:"Yong Rui" Page 3

21.

发明授权
System and method for mode-based multi-hypothesis tracking using parametric contours 有权

公开(公告)号：US06999599B2

公开(公告)日：2006-02-14

申请号：US10164947

申请日：2002-06-07

Applicant: Yong Rui , Yunqiang Chen

Inventor： Yong Rui , Yunqiang Chen

IPC: G06K9/00

CPC classification number: G06K9/00234 , G06K9/3216 , G06K9/6207 , G06T7/251 , G06T7/277 , G06T2207/10016 , G06T2207/30201

Abstract: A system and method for object tracking using probabilistic mode-based multi-hypothesis tracking (MHT) provides for robust and computationally efficient tracking of moving objects such as heads and faces in complex environments. A mode-based multi-hypothesis tracker uses modes that are local maximums which are refined from initial samples in a parametric state space. Because the modes are highly representative, the mode-based multi-hypothesis tracker effectively models non-linear probabilistic distributions using a small number of hypotheses. Real-time tracking performance is achieved by using a parametric causal contour model to refine initial contours to nearby modes. In addition, one common drawback of conventional MHT schemes, i.e., producing only maximum likelihood estimates instead of a desired posterior probability distribution, is addressed by introducing an importance sampling framework into MHT, and estimating the posterior probability distribution from the importance function.

22.

发明申请
System and process for locating a speaker using 360 degree sound source localization 有权
Title translation: 使用360度声源定位来定位扬声器的系统和过程

公开(公告)号：US20050265562A1

公开(公告)日：2005-12-01

申请号：US11182142

申请日：2005-07-15

Applicant: Yong Rui

Inventor： Yong Rui

IPC: G10L25/93 , H04R3/00 , G10L11/06

CPC classification number: H04R3/005 , H04R2201/401

Abstract: A system and process is described for estimating the location of a speaker using signals output by a microphone array characterized by multiple pairs of audio sensors. The location of a speaker is estimated by first determining whether the signal data contains human speech components and filtering out noise attributable to stationary sources. The location of the person speaking is then estimated using a time-delay-of-arrival based SSL technique on those parts of the data determined to contain human speech components. A consensus location for the speaker is computed from the individual location estimates associated with each pair of microphone array audio sensors taking into consideration the uncertainty of each estimate. A final consensus location is also computed from the individual consensus locations computed over a prescribed number of sampling periods using a temporal filtering technique.

Abstract translation: 描述了一种系统和过程，用于使用由多对音频传感器表征的麦克风阵列输出的信号来估计扬声器的位置。通过首先确定信号数据是否包含人类语音分量并滤除归因于固定源的噪声来估计扬声器的位置。然后，使用基于时间延迟的SSL技术来估计说话人的位置，以确定包含人类语音组件的数据的那些部分。考虑到每个估计的不确定性，从与每对麦克风阵列音频传感器相关联的各个位置估计计算扬声器的共识位置。还可以使用时间滤波技术从在规定数量的采样周期上计算的单个共识位置计算最终共识位置。

23.

发明申请
System and process for tracking an object state using a particle filter sensor fusion technique 失效
Title translation: 使用粒子滤波器传感器融合技术跟踪物体状态的系统和过程

公开(公告)号：US20050114079A1

公开(公告)日：2005-05-26

申请号：US10985243

申请日：2004-11-10

Applicant: Yong Rui , Yunqiang Chen

Inventor： Yong Rui , Yunqiang Chen

IPC: G06T7/20 , G10L21/02 , G01S13/00

CPC classification number: G06T7/277 , G10L2021/02166

Abstract: A system and process for tracking an object state over time using particle filter sensor fusion and a plurality of logical sensor modules is presented. This new fusion framework combines both the bottom-up and top-down approaches to sensor fusion to probabilistically fuse multiple sensing modalities. At the lower level, individual vision and audio trackers can be designed to generate effective proposals for the fuser. At the higher level, the fuser performs reliable tracking by verifying hypotheses over multiple likelihood models from multiple cues. Different from the traditional fusion algorithms, the present framework is a closed-loop system where the fuser and trackers coordinate their tracking information. Furthermore, to handle non-stationary situations, the present framework evaluates the performance of the individual trackers and dynamically updates their object states. A real-time speaker tracking system based on the proposed framework is feasible by fusing object contour, color and sound source location.

Abstract translation: 提出了一种使用粒子滤波器传感器融合和多个逻辑传感器模块跟踪物体状态随时间变化的系统和过程。这种新的融合框架将自下而上和自顶向下的方法与传感器融合相结合，以概率地融合多种感测模式。在较低级别，个人视觉和音频跟踪器可以设计用于为定影器生成有效的建议。在较高级别，定影器通过从多个线索的多个似然模型上验证假设来执行可靠的跟踪。与传统融合算法不同，本框架是闭环系统，其中定影器和跟踪器协调其跟踪信息。此外，为了处理非平稳情况，本框架评估各个跟踪器的性能并动态更新其对象状态。基于提出的框架的实时扬声器跟踪系统可以通过融合对象轮廓，颜色和声源位置来实现。

24.

发明授权
Methods and systems for estimating network available bandwidth using packet pairs and spatial filtering 有权
Title translation: 使用分组对和空间过滤估计网络可用带宽的方法和系统

公开(公告)号：US08817645B2

公开(公告)日：2014-08-26

申请号：US13300027

申请日：2011-11-18

Applicant: Yong Rui , Andres Vega-Garcia

Inventor： Yong Rui , Andres Vega-Garcia

IPC: H04L12/26

CPC classification number: H04L43/0882 , H04L43/022 , H04L43/045 , H04L43/0852

Abstract: Estimation of available bandwidth on a network uses packet pairs and spatially filtering. Packet pairs are transmitted over the network. The dispersion of the packet pairs is used to generate samples of the available bandwidth, which are then classified into bins to generate a histogram. The bins can have uniform bin widths, and the histogram data can be aged so that older samples are given less weight in the estimation. The histogram data is then spatially filtered. Kernel density algorithms can be used to spatially filter the histogram data. The network available bandwidth is estimated using the spatially filtered histogram data. Alternatively, the spatially filtered histogram data can be temporally filtered before the available bandwidth is estimated.

Abstract translation: 网络上可用带宽的估计使用数据包对和空间过滤。分组对通过网络传输。分组对的分散被用于生成可用带宽的样本，然后将其分类为分组以生成直方图。箱体可以具有统一的箱体宽度，并且直方图数据可以老化，以便在估计中给予较小的重量。然后将直方图数据进行空间滤波。内核密度算法可用于对直方图数据进行空间过滤。使用空间滤波的直方图数据估计网络可用带宽。或者，空间滤波的直方图数据可以在估计可用带宽之前进行时间滤波。

25.

发明授权
Kernelized spatial-contextual image classification 有权
Title translation: 内核空间上下文图像分类

公开(公告)号：US08131086B2

公开(公告)日：2012-03-06

申请号：US12237298

申请日：2008-09-24

Applicant: Xian-Sheng Hua , Guo-Jun Qi , Yong Rui , Hong-Jiang Zhang

Inventor： Xian-Sheng Hua , Guo-Jun Qi , Yong Rui , Hong-Jiang Zhang

IPC: G06K9/68

CPC classification number: G06K9/469 , G06K9/6297

Abstract: Kernelized spatial-contextual image classification is disclosed. One embodiment comprises generating a first spatial-contextual model to represent a first image, the first spatial-contextual model having a plurality of interconnected nodes arranged in a first pattern of connections with each node connected to at least one other node, generating a second spatial-contextual model to represent a second image using the first pattern of connections, and estimating the distance between corresponding nodes in the first spatial-contextual model and the second spatial-contextual model based on a relationship with adjacent connected nodes to determine a distance between the first image and the second image.

Abstract translation: 公开了内核空间上下文图像分类。一个实施例包括生成第一空间上下文模型以表示第一图像，第一空间上下文模型具有以与连接到至少一个其他节点的每个节点连接的第一连接方式布置的多个互连节点，产生第二空间 - 使用所述第一连接模式来表示第二图像，以及基于与相邻连接节点的关系来估计所述第一空间 - 上下文模型中的对应节点与所述第二空间 - 上下文模型之间的距离，以确定所述第二图像之间的距离第一个图像和第二个图像。

26.

发明申请
IDENTIFICATION OF PEOPLE USING MULTIPLE TYPES OF INPUT 有权
Title translation: 使用多种输入类型识别人

公开(公告)号：US20110313766A1

公开(公告)日：2011-12-22

申请号：US13221640

申请日：2011-08-30

Applicant: Cha Zhang , Paul A. Viola , Pei Yin , Ross G. Cutler , Xinding Sun , Yong Rui

Inventor： Cha Zhang , Paul A. Viola , Pei Yin , Ross G. Cutler , Xinding Sun , Yong Rui

IPC: G10L17/00

CPC classification number: G06K9/6256 , G06K9/4614 , G10L25/78 , G10L2021/02166 , H04N7/147 , H04N7/15 , H04N21/42203 , H04N21/4223 , H04N21/4394 , H04N21/44008 , H04N21/44213 , H04N21/4788

Abstract: Systems and methods for detecting people or speakers in an automated fashion are disclosed. A pool of features including more than one type of input (like audio input and video input) may be identified and used with a learning algorithm to generate a classifier that identifies people or speakers. The resulting classifier may be evaluated to detect people or speakers.

Abstract translation: 公开了以自动方式检测人或扬声器的系统和方法。可以识别包括多于一种类型的输入（例如音频输入和视频输入）的功能池，并与学习算法一起使用以生成识别人或扬声器的分类器。可以评估所得分类器以检测人或扬声器。

27.

发明授权
Correlative multi-label image annotation 有权
Title translation: 相关多标签图像注释

公开(公告)号：US07996762B2

公开(公告)日：2011-08-09

申请号：US12030616

申请日：2008-02-13

Applicant: Guo-Jun Qi , Xian-Sheng Hua , Yong Rui , Hong-Jiang Zhang , Shipeng Li

Inventor： Guo-Jun Qi , Xian-Sheng Hua , Yong Rui , Hong-Jiang Zhang , Shipeng Li

IPC: G06F17/00

CPC classification number: G06F17/30799 , G06K9/00711

Abstract: Correlative multi-label image annotation may entail annotating an image by indicating respective labels for respective concepts. In an example embodiment, a classifier is to annotate an image by implementing a labeling function that maps an input feature space and a label space to a combination feature vector. The combination feature vector models both features of individual ones of the concepts and correlations among the concepts.

Abstract translation: 相关多标签图像注释可能需要通过针对相应概念指示相应的标签来注释图像。在示例实施例中，分类器是通过实现将输入特征空间和标签空间映射到组合特征向量的标记功能来注释图像。组合特征向量模拟各个概念的特征和概念之间的相关性。

28.

发明授权
Image segmentation using spatial-color Gaussian mixture models 失效
Title translation: 使用空间色高斯混合模型的图像分割

公开(公告)号：US07885463B2

公开(公告)日：2011-02-08

申请号：US11393576

申请日：2006-03-30

Applicant: Cha Zhang , Michael Cohen , Yong Rui , Ting Yu

Inventor： Cha Zhang , Michael Cohen , Yong Rui , Ting Yu

IPC: G06K9/34

CPC classification number: G06K9/38 , G06T7/11 , G06T7/143 , G06T7/162 , G06T7/194 , G06T2207/10016 , G06T2207/10024 , G06T2207/20072

Abstract: A spatial-color Gaussian mixture model (SCGMM) image segmentation technique for segmenting images. The SCGMM image segmentation technique specifies foreground objects in the first frame of an image sequence, either manually or automatically. From the initial segmentation, the SCGMM segmentation system learns two spatial-color Gaussian mixture models (SCGMM) for the foreground and background objects. These models are built into a first-order Markov random field (MRF) energy function. The minimization of the energy function leads to a binary segmentation of the images in the image sequence, which can be solved efficiently using a conventional graph cut procedure.

Abstract translation: 用于分割图像的空间色彩高斯混合模型（SCGMM）图像分割技术。 SCGMM图像分割技术手动或自动地指定图像序列的第一帧中的前景对象。从初始分割，SCGMM分割系统为前景和背景对象学习两个空间色高斯混合模型（SCGMM）。这些模型内置于一阶马尔科夫随机场（MRF）能量函数中。能量函数的最小化导致图像序列中的图像的二进制分割，这可以使用常规的图形切割程序有效地解决。

29.

发明授权
Systems and methods for real-time audio-visual communication and data collaboration in a network conference environment 有权
Title translation: 网络会议环境中实时视听通信和数据协作的系统和方法

公开(公告)号：US07634533B2

公开(公告)日：2009-12-15

申请号：US10836778

申请日：2004-04-30

Applicant: Eric Rudolph , Yong Rui , Henrique S Malvar , Li-Wei He , Michael F Cohen , Ivan Tashev

Inventor： Eric Rudolph , Yong Rui , Henrique S Malvar , Li-Wei He , Michael F Cohen , Ivan Tashev

IPC: G06F15/16

CPC classification number: H04L12/1827 , H04L12/1831 , H04L29/06027 , H04L65/403 , H04L65/4038

Abstract: Systems and methods are disclosed that facilitate real-time information exchange in a multimedia conferencing environment. Data Client(s) facilitate data collaboration between users and are maintained separately from audio/video (AV) Clients that provide real-time communication functionality. Data Clients can be remotely located with respect to one another and with respect to a server. A remote user Stand-in Device can be provided that comprises a display to present a remote user to local users, a digital automatic pan/tilt/zoom camera to capture imagery in, for example, a conference room and provide real-time information to an AV Client in a remote office, and a microphone array that can similarly provide real-time audio information from the conference room to an AV Client in the remote office. The invention further facilitates file transfer and presentation broadcast between Data Clients in a single location or in a plurality of disparate locations.

Abstract translation: 公开了促进多媒体会议环境中的实时信息交换的系统和方法。数据客户端促进用户之间的数据协作，并与提供实时通信功能的音频/视频（AV）客户端分开维护。数据客户端可以相对于彼此和相对于服务器远程定位。可以提供远程用户待机设备，其包括向本地用户呈现远程用户的显示器，用于在例如会议室中捕获图像的数字自动摇摄/俯仰/变焦相机，并且提供实时信息远程办公室中的AV客户端以及可以类似地从会议室向远程办公室中的AV客户端提供实时音频信息的麦克风阵列。本发明进一步便于在单个位置或多个不同位置的数据客户端之间的文件传送和呈现广播。

30.

发明授权
Automated online broadcasting system and method using an omni-directional camera system for viewing meetings over a computer network 失效

公开(公告)号：US07580054B2

公开(公告)日：2009-08-25

申请号：US11193976

申请日：2005-07-29

Applicant: Yong Rui , Anoop Gupta , Johnathan J. Cadiz , Ross Garrett Cutler

Inventor： Yong Rui , Anoop Gupta , Johnathan J. Cadiz , Ross Garrett Cutler

IPC: H04N7/14 , G06F3/00

CPC classification number: H04N7/142 , H04N7/147 , H04N7/15

Abstract: An automated system and method for broadcasting meetings over a computer network. The meeting is filmed using an omni-directional camera system and capable of being presented to a viewer both live and on-demand. The system of the present invention includes an automated camera management system for controlling the camera system and an analysis module determining the location of meeting participants in the meeting environments. The method of the present invention includes using the system of the present invention to broadcast an event to a viewer over a computer network. In particular, the method includes filming the event using an omni-directional camera system. Next, the method determines the location of each event participant in the event environment. Finally, a viewer is provided with a user interface for viewing the broadcast event. This user interface allows a viewer to choose which event participant that the viewer would like to view.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification