Patent search ap:("Yong Rui") AND inv:"Yong Rui" Page 8

71.

发明申请
System and process for robust sound source localization 有权
Title translation: 强大的声源定位系统和过程

公开(公告)号：US20060215850A1

公开(公告)日：2006-09-28

申请号：US11267678

申请日：2005-11-04

Applicant: Yong Rui , Dinei Florencio

Inventor： Yong Rui , Dinei Florencio

IPC: H04R3/00

CPC classification number: H04R3/005 , G10L21/0272 , G10L2021/02165

Abstract: A system and process for finding the location of a sound source using direct approaches having weighting factors that mitigate the effect of both correlated and reverberation noise is presented. When more than two microphones are used, the traditional time-delay-of-arrival (TDOA) based sound source localization (SSL) approach involves two steps. The first step computes TDOA for each microphone pair, and the second step combines these estimates. This two-step process discards relevant information in the first step, thus degrading the SSL accuracy and robustness. In the present invention, direct, one-step, approaches are employed. Namely, a one-step TDOA SSL approach and a steered beam (SB) SSL approach are employed. Each of these approaches provides an accuracy and robustness not available with the traditional two-step approaches.

Abstract translation: 提出了使用具有减轻相关和混响噪声的影响的加权因子的直接方法来发现声源的位置的系统和过程。当使用两个以上的麦克风时，传统的延迟延时（TDOA）声源定位（SSL）方法涉及两个步骤。第一步计算每个麦克风对的TDOA，第二步合并这些估计。这两步过程在第一步中丢弃相关信息，从而降低了SSL的准确性和鲁棒性。在本发明中，采用直接的一步法。也就是说，采用一步式TDOA SSL方法和转向束（SB）SSL方法。这些方法中的每一种提供了传统的两步方法不可用的精度和鲁棒性。

72.

发明授权
Event-based system and process for recording and playback of collaborative electronic presentations 有权
Title translation: 基于事件的系统和记录和回放协作电子演示的过程

公开(公告)号：US07099798B2

公开(公告)日：2006-08-29

申请号：US10973186

申请日：2004-10-25

Applicant: Bin Yu , Yong Rui

Inventor： Bin Yu , Yong Rui

IPC: G06F11/30 , G21C17/00

CPC classification number: G06Q10/10

Abstract: An event-based system and process for recording and playback of collaborative electronic presentations is presented. The present system and process includes a technique for recording collaborative electronic presentations by capturing and storing the interactions between each participant and presentation data where each interaction event is timestamped and linked to a data file comprising the presentation data. The present system and process also includes a technique for playing back the recorded collaborative electronic presentation, which involves displaying the presentation data in an order it was originally presented and reproducing the recorded interactions between each participant and the displayed presentation data at the same point in the presentation that they were originally performed, based on the aforementioned timestamps.

Abstract translation: 介绍了一个基于事件的系统和进程，用于录制和回放协同电子演示文稿。本系统和过程包括通过捕获和存储每个参与者与呈现数据之间的交互来记录协同电子呈现的技术，其中每个交互事件被加时间戳并链接到包括呈现数据的数据文件。本系统和过程还包括一种用于回放记录的协同电子表现的技术，其涉及以原始呈现的顺序显示呈现数据，并且在相同的点处再现每个参与者和所显示的呈现数据之间的记录的交互基于上述时间戳，他们最初执行的演示。

73.

发明申请
Mode- based multi-hypothesis tracking using parametric contours 有权
Title translation: 基于模式的多假设跟踪使用参数轮廓

公开(公告)号：US20060078163A1

公开(公告)日：2006-04-13

申请号：US11282365

申请日：2005-11-17

Applicant: Yong Rui , Yunqiang Chen

Inventor： Yong Rui , Yunqiang Chen

IPC: G06K9/00

CPC classification number: G06K9/00234 , G06K9/3216 , G06K9/6207 , G06T7/251 , G06T7/277 , G06T2207/10016 , G06T2207/30201

Abstract: A system and method for object tracking using probabilistic mode-based multi-hypothesis tracking (MHT) provides for robust and computationally efficient tracking of moving objects such as heads and faces in complex environments. A mode-based multi-hypothesis tracker uses modes that are local maximums which are refined from initial samples in a parametric state space. Because the modes are highly representative, the mode-based multi-hypothesis tracker effectively models non-linear probabilistic distributions using a small number of hypotheses. Real-time tracking performance is achieved by using a parametric causal contour model to refine initial contours to nearby modes. In addition, one common drawback of conventional MHT schemes, i.e., producing only maximum likelihood estimates instead of a desired posterior probability distribution, is addressed by introducing an importance sampling framework into MHT, and estimating the posterior probability distribution from the importance function.

Abstract translation: 使用基于概率模式的多假设跟踪（MHT）的对象跟踪的系统和方法提供了在复杂环境中运动对象（例如头部和面部）的鲁棒和计算上有效的跟踪。基于模式的多假设跟踪器使用在参数状态空间中从初始样本精化的局部最大值的模式。由于模式具有很高的代表性，所以基于模式的多假设跟踪器使用少量假设来有效地建模非线性概率分布。通过使用参数因果轮廓模型来将初始轮廓细化到附近模式，可以实现实时跟踪性能。另外，常规MHT方案的一个共同缺点，即仅产生最大似然估计而不是期望的后验概率分布，通过将重要性采样框架引入到MHT中，并从重要性函数估计后验概率分布来解决。

74.

发明授权
Annotating programs for automatic summary generation 失效

公开(公告)号：US07028325B1

公开(公告)日：2006-04-11

申请号：US09660529

申请日：2000-09-13

Applicant: Yong Rui , Anoop Gupta , Alejandro Acero

Inventor： Yong Rui , Anoop Gupta , Alejandro Acero

IPC: G06F3/00 , G06F13/00 , H04N5/445

CPC classification number: G06F17/30787 , G06F17/30743 , G06F17/30749 , G06F17/30758 , G06F17/30843 , G06K9/00711

Abstract: Audio/video programming content is made available to a receiver from a content provider, and meta data is made available to the receiver from a meta data provider. The meta data corresponds to the programming content, and identifies, for each of multiple portions of the programming content, an indicator of a likelihood that the portion is an exciting portion of the content. In one implementation, the meta data includes probabilities that segments of a baseball program are exciting, and is generated by analyzing the audio data of the baseball program for both excited speech and baseball hits. The meta data can then be used to generate a summary for the baseball program.

75.

发明申请
System and method for communicating audio data signals via an audio communications medium 审中-公开
Title translation: 用于经由音频通信介质传送音频数据信号的系统和方法

公开(公告)号：US20060009867A1

公开(公告)日：2006-01-12

申请号：US11117844

申请日：2005-04-29

Applicant: Roy Leban , Ross Cutler , Henrique Malvar , Yong Rui

Inventor： Roy Leban , Ross Cutler , Henrique Malvar , Yong Rui

IPC: G06F17/00 , H04M11/00

CPC classification number: H04L29/12113 , H04L61/1541 , H04L67/16 , H04M3/567 , H04M11/06 , H04M11/08

Abstract: A system for communicating audio data signals comprises a source computer that performs an action, generates an event message corresponding to the action, converts the event message into an audio data signal, and communicates the audio data signal through its speaker. A source telephone receives a voice signal from a participant and the audio data signal through its microphone and communicates the audio data signal and voice as coherent sound via an audio communications medium. A recipient telephone receives the audio data signal from the coherent sound communicated via the audio communications medium and communicates the audio data signal via its speaker. A recipient computer receives the audio data signal through its microphone, extracts the event message from the audio data signal, and performs an action based on the event message from the audio data signal. The audio communications medium can comprise a telephone communications system or air.

Abstract translation: 用于传送音频数据信号的系统包括执行动作的源计算机，产生与动作相对应的事件消息，将事件消息转换成音频数据信号，并通过其扬声器传送音频数据信号。源电话通过其麦克风接收来自参与者的语音信号和音频数据信号，并通过音频通信介质将音频数据信号和声音作为相干声传送。接收者电话从经由音频通信介质传送的相干声音接收音频数据信号，并通过其扬声器传送音频数据信号。接收者计算机通过其麦克风接收音频数据信号，从音频数据信号中提取事件消息，并根据来自音频数据信号的事件消息执行动作。音频通信介质可以包括电话通信系统或空气。

76.

发明申请
Systems and methods for novel real-time audio-visual communication and data collaboration 有权
Title translation: 新型实时视听通信和数据协作的系统和方法

公开(公告)号：US20050262201A1

公开(公告)日：2005-11-24

申请号：US10836778

申请日：2004-04-30

Applicant: Eric Rudolph , Yong Rui , Henrique Malvar , Li-Wei He , Michael Cohen , Ivan Tashev

Inventor： Eric Rudolph , Yong Rui , Henrique Malvar , Li-Wei He , Michael Cohen , Ivan Tashev

IPC: H04N7/15 , H04L12/18 , H04L29/06 , H04M3/56 , G06F15/16

CPC classification number: H04L12/1827 , H04L12/1831 , H04L29/06027 , H04L65/403 , H04L65/4038

Abstract: Systems and methods are disclosed that facilitate real-time information exchange in a multimedia conferencing environment. Data Client(s) facilitate data collaboration between users and are maintained separately from audio/video (AV) Clients that provide real-time communication functionality. Data Clients can be remotely located with respect to one another and with respect to a server. A remote user Stand-in Device can be provided that comprises a display to present a remote user to local users, a digital automatic pan/tilt/zoom camera to capture imagery in, for example, a conference room and provide real-time information to an AV Client in a remote office, and a microphone array that can similarly provide real-time audio information from the conference room to an AV Client in the remote office. The invention further facilitates file transfer and presentation broadcast between Data Clients in a single location or in a plurality of disparate locations.

Abstract translation: 公开了促进多媒体会议环境中的实时信息交换的系统和方法。数据客户端促进用户之间的数据协作，并与提供实时通信功能的音频/视频（AV）客户端分开维护。数据客户端可以相对于彼此和相对于服务器远程定位。可以提供远程用户待机设备，其包括向本地用户呈现远程用户的显示器，用于在例如会议室中捕获图像的数字自动摇摄/俯仰/变焦相机，并且提供实时信息远程办公室中的AV客户端以及可以类似地从会议室向远程办公室中的AV客户端提供实时音频信息的麦克风阵列。本发明进一步便于在单个位置或多个不同位置的数据客户端之间的文件传送和呈现广播。

77.

发明申请
Automatic detection and tracking of multiple individuals using multiple cues 有权

公开(公告)号：US20050210103A1

公开(公告)日：2005-09-22

申请号：US11042766

申请日：2005-01-25

Applicant: Yong Rui , Yunqiang Chen

Inventor： Yong Rui , Yunqiang Chen

IPC: G06T1/00 , G06K9/00 , G06T1/20 , G06T7/00 , G06T7/20 , H04N7/15 , H04N7/26 , G06F15/16

CPC classification number: G06K9/00234 , G06T7/251 , G06T2207/10016 , G06T2207/30196 , G06T2207/30201

Abstract: Automatic detection and tracking of multiple individuals includes receiving a frame of video and/or audio content and identifying a candidate area for a new face region in the frame. One or more hierarchical verification levels are used to verify whether a human face is in the candidate area, and an indication made that the candidate area includes a face if the one or more hierarchical verification levels verify that a human face is in the candidate area. A plurality of audio and/or video cues are used to track each verified face in the video content from frame to frame.

78.

发明申请
Annotating programs for automatic summary generation 有权
Title translation: 注释自动汇总生成程序

公开(公告)号：US20050159956A1

公开(公告)日：2005-07-21

申请号：US11073144

申请日：2005-03-04

Applicant: Yong Rui , Anoop Gupta , Alejandro Acero

Inventor： Yong Rui , Anoop Gupta , Alejandro Acero

IPC: G06F17/30 , H04N7/16

CPC classification number: G06F17/30787 , G06F17/30743 , G06F17/30749 , G06F17/30758 , G06F17/30843 , G06K9/00711

Abstract: Audio/video programming content is made available to a receiver from a content provider, and meta data is made available to the receiver from a meta data provider. The meta data corresponds to the programming content, and identifies, for each of multiple portions of the programming content, an indicator of a likelihood that the portion is an exciting portion of the content. In one implementation, the meta data includes probabilities that segments of a baseball program are exciting, and is generated by analyzing the audio data of the baseball program for both excited speech and baseball hits. The meta data can then be used to generate a summary for the baseball program.

Abstract translation: 音频/视频节目内容从内容提供商可用于接收者，并且元数据从元数据提供者向接收者提供。元数据对应于节目内容，并且针对节目内容的多个部分中的每一个识别该部分是内容的激动部分的可能性的指示符。在一个实现中，元数据包括棒球节目的节段令人兴奋的概率，并且通过分析用于激发的语音和棒球命中的棒球节目的音频数据而产生。然后可以使用元数据来生成棒球程序的摘要。

79.

发明申请
USER INTERFACE FOR THREE-DIMENSIONAL MODELING 有权
Title translation: 用户界面进行三维建模

公开(公告)号：US20140368620A1

公开(公告)日：2014-12-18

申请号：US13919933

申请日：2013-06-17

Applicant: Zhiwei Li , Rui Cai , Jiawei Gu , Lei Zhang , Yong Rui

Inventor： Zhiwei Li , Rui Cai , Jiawei Gu , Lei Zhang , Yong Rui

IPC: H04N13/02 , H04N5/232

CPC classification number: H04N13/282 , G06F3/00 , G06T19/006 , H04N5/23216 , H04N5/23293 , H04N13/221 , H04N13/296

Abstract: A method of acquiring a set of images useable to 3D model a physical object includes imaging the physical object with a camera, and displaying with the camera a current view of the physical object as imaged by the camera from a current perspective. The method further includes displaying with the camera a visual cue overlaying the current view and indicating perspectives from which the physical object is to be imaged to acquire the set of images.

Abstract translation: 获取可用于对物理对象进行3D建模的一组图像的方法包括用相机对物理对象进行成像，并且从当前的角度通过相机显示物理对象的当前视图。该方法还包括用相机显示覆盖当前视图的视觉提示，并且指示要从其中成像物理对象的视角以获取该组图像。

80.

发明申请
IDENTIFICATION OF PEOPLE USING MULTIPLE TYPES OF INPUT 有权
Title translation: 使用多种输入类型识别人

公开(公告)号：US20120278077A1

公开(公告)日：2012-11-01

申请号：US13546153

申请日：2012-07-11

Applicant: Cha Zhang , Paul A. Viola , Pei Yin , Ross G. Cutler , Xinding Sun , Yong Rui

Inventor： Cha Zhang , Paul A. Viola , Pei Yin , Ross G. Cutler , Xinding Sun , Yong Rui

IPC: G10L17/00

CPC classification number: G06K9/6256 , G06K9/4614 , G10L25/78 , G10L2021/02166 , H04N7/147 , H04N7/15 , H04N21/42203 , H04N21/4223 , H04N21/4394 , H04N21/44008 , H04N21/44213 , H04N21/4788

Abstract: Systems and methods for detecting people or speakers in an automated fashion are disclosed. A pool of features including more than one type of input (like audio input and video input) may be identified and used with a learning algorithm to generate a classifier that identifies people or speakers. The resulting classifier may be evaluated to detect people or speakers.

Abstract translation: 公开了以自动方式检测人或扬声器的系统和方法。可以识别包括多于一种类型的输入（例如音频输入和视频输入）的功能池，并与学习算法一起使用以生成识别人或扬声器的分类器。可以评估所得分类器以检测人或扬声器。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification