Patent search ap:("Bin Yu" OR "Yong Rui") AND inv:"Yong Rui" Page 8

71.

发明授权
Mode-based multi-hypothesis tracking using parametric contours 有权
Title translation: 基于模式的多假设跟踪使用参数轮廓

公开(公告)号：US07231064B2

公开(公告)日：2007-06-12

申请号：US11282365

申请日：2005-11-17

Applicant: Yong Rui , Yunqiang Chen

Inventor： Yong Rui , Yunqiang Chen

IPC: G06K9/00

CPC classification number: G06K9/00234 , G06K9/3216 , G06K9/6207 , G06T7/251 , G06T7/277 , G06T2207/10016 , G06T2207/30201

Abstract: A system and method for object tracking using probabilistic mode-based multi-hypothesis tracking (MHT) provides for robust and computationally efficient tracking of moving objects such as heads and faces in complex environments. A mode-based multi-hypothesis tracker uses modes that are local maximums which are refined from initial samples in a parametric state space. Because the modes are highly representative, the mode-based multi-hypothesis tracker effectively models non-linear probabilistic distributions using a small number of hypotheses. Real-time tracking performance is achieved by using a parametric causal contour model to refine initial contours to nearby modes. In addition, one common drawback of conventional MHT schemes, i.e., producing only maximum likelihood estimates instead of a desired posterior probability distribution, is addressed by introducing an importance sampling framework into MHT, and estimating the posterior probability distribution from the importance function.

Abstract translation: 使用基于概率模式的多假设跟踪（MHT）的对象跟踪的系统和方法提供了在复杂环境中运动对象（例如头部和面部）的鲁棒和计算上有效的跟踪。基于模式的多假设跟踪器使用在参数状态空间中从初始样本精化的局部最大值的模式。由于模式具有很高的代表性，所以基于模式的多假设跟踪器使用少量假设来有效地建模非线性概率分布。通过使用参数因果轮廓模型来将初始轮廓细化到附近模式，可以实现实时跟踪性能。另外，常规MHT方案的一个共同缺点，即仅产生最大似然估计而不是期望的后验概率分布，通过将重要性采样框架引入到MHT中，并从重要性函数估计后验概率分布来解决。

72.

发明申请
Combined digital and mechanical tracking of a person or object using a single video camera 有权
Title translation: 使用单个摄像机对人物或物体的组合数字和机械跟踪

公开(公告)号：US20070120979A1

公开(公告)日：2007-05-31

申请号：US11284496

申请日：2005-11-21

Applicant: Cha Zhang , Li-wei He , Yong Rui

Inventor： Cha Zhang , Li-wei He , Yong Rui

IPC: H04N7/18

CPC classification number: H04N7/185 , G08B13/19667 , H04N7/188

Abstract: A combined digital and mechanical tracking system and process for generating a video using a single digital video camera that tracks a person or object of interest moving in a scene is presented. This generally involves operating the camera at a higher resolution than is needed for the application, and cropping a sub-region out of the image captured that is output as the output video. The person or object being tracked is at least partially contained within the cropped sub-region. As the person or object moves within the field of view of the camera, the location of the cropped sub-region is also moved so as to keep the subject of interest within its boundaries. When the subject of interest moves to the boundary of the FOV of the camera, the camera is mechanically panned to keep the person or object inside its FOV.

Abstract translation: 呈现了组合的数字和机械跟踪系统和用于使用跟踪在场景中移动的感兴趣的对象的单个数字摄像机生成视频的过程。这通常涉及以比应用所需要的更高的分辨率来操作相机，以及从作为输出视频输出的捕获的图像中剪切一个子区域。被跟踪的人或物体至少部分地包含在裁剪的子区域内。随着人或物体在照相机的视场内移动，裁剪的子区域的位置也被移动，以将感兴趣的对象保持在其边界内。当感兴趣的主题移动到相机的FOV的边界时，相机被机械地平移以将人或物体保持在其FOV内。

73.

发明申请
System and method for applying digital make-up in video conferencing 失效

公开(公告)号：US20060268101A1

公开(公告)日：2006-11-30

申请号：US11137252

申请日：2005-05-25

Applicant: Li-wei He , Michael Cohen , Yong Rui , Shinichi Manaka

Inventor： Li-wei He , Michael Cohen , Yong Rui , Shinichi Manaka

IPC: H04N7/14

CPC classification number: H04N7/147

Abstract: A method of digitally adding the appearance of makeup to a videoconferencing participant. The system and method for applying digital make-up operates in a loop processing sequential video frames. For each input frame, there are typically three general steps: 1) Locating the face and eye and mouth regions; 2) Applying digital make-up to the face, preferably with the exception of the eye and open mouth areas; and 3) Blending the make-up region with the rest of the face. In one embodiment of the invention, the background in the frame containing a video conferencing participant can also be modified so that other video conferencing participants cannot clearly see the background behind the participant in the image frame. In one such embodiment of the invention, the video conferencing participant tries to make his or her own image look comical or altered. In another embodiment of the invention, a particular remote participant tries to make another participant look funny to the other participants.

74.

发明授权
System and process for robust sound source localization 有权

公开(公告)号：US07127071B2

公开(公告)日：2006-10-24

申请号：US11267678

申请日：2005-11-04

Applicant: Yong Rui , Dinei Florencio

Inventor： Yong Rui , Dinei Florencio

IPC: H04R3/00 , H04N7/14

CPC classification number: H04R3/005 , G10L21/0272 , G10L2021/02165

Abstract: A system and process for finding the location of a sound source using direct approaches having weighting factors that mitigate the effect of both correlated and reverberation noise is presented. When more than two microphones are used, the traditional time-delay-of-arrival (TDOA) based sound source localization (SSL) approach involves two steps. The first step computes TDOA for each microphone pair, and the second step combines these estimates. This two-step process discards relevant information in the first step, thus degrading the SSL accuracy and robustness. In the present invention, direct, one-step, approaches are employed. Namely, a one-step TDOA SSL approach and a steered beam (SB) SSL approach are employed. Each of these approaches provides an accuracy and robustness not available with the traditional two-step approaches.

75.

发明申请
System and process for robust sound source localization 有权
Title translation: 强大的声源定位系统和过程

公开(公告)号：US20060215850A1

公开(公告)日：2006-09-28

申请号：US11267678

申请日：2005-11-04

Applicant: Yong Rui , Dinei Florencio

Inventor： Yong Rui , Dinei Florencio

IPC: H04R3/00

CPC classification number: H04R3/005 , G10L21/0272 , G10L2021/02165

Abstract: A system and process for finding the location of a sound source using direct approaches having weighting factors that mitigate the effect of both correlated and reverberation noise is presented. When more than two microphones are used, the traditional time-delay-of-arrival (TDOA) based sound source localization (SSL) approach involves two steps. The first step computes TDOA for each microphone pair, and the second step combines these estimates. This two-step process discards relevant information in the first step, thus degrading the SSL accuracy and robustness. In the present invention, direct, one-step, approaches are employed. Namely, a one-step TDOA SSL approach and a steered beam (SB) SSL approach are employed. Each of these approaches provides an accuracy and robustness not available with the traditional two-step approaches.

Abstract translation: 提出了使用具有减轻相关和混响噪声的影响的加权因子的直接方法来发现声源的位置的系统和过程。当使用两个以上的麦克风时，传统的延迟延时（TDOA）声源定位（SSL）方法涉及两个步骤。第一步计算每个麦克风对的TDOA，第二步合并这些估计。这两步过程在第一步中丢弃相关信息，从而降低了SSL的准确性和鲁棒性。在本发明中，采用直接的一步法。也就是说，采用一步式TDOA SSL方法和转向束（SB）SSL方法。这些方法中的每一种提供了传统的两步方法不可用的精度和鲁棒性。

76.

发明申请
Mode- based multi-hypothesis tracking using parametric contours 有权
Title translation: 基于模式的多假设跟踪使用参数轮廓

公开(公告)号：US20060078163A1

公开(公告)日：2006-04-13

申请号：US11282365

申请日：2005-11-17

Applicant: Yong Rui , Yunqiang Chen

Inventor： Yong Rui , Yunqiang Chen

IPC: G06K9/00

CPC classification number: G06K9/00234 , G06K9/3216 , G06K9/6207 , G06T7/251 , G06T7/277 , G06T2207/10016 , G06T2207/30201

Abstract: A system and method for object tracking using probabilistic mode-based multi-hypothesis tracking (MHT) provides for robust and computationally efficient tracking of moving objects such as heads and faces in complex environments. A mode-based multi-hypothesis tracker uses modes that are local maximums which are refined from initial samples in a parametric state space. Because the modes are highly representative, the mode-based multi-hypothesis tracker effectively models non-linear probabilistic distributions using a small number of hypotheses. Real-time tracking performance is achieved by using a parametric causal contour model to refine initial contours to nearby modes. In addition, one common drawback of conventional MHT schemes, i.e., producing only maximum likelihood estimates instead of a desired posterior probability distribution, is addressed by introducing an importance sampling framework into MHT, and estimating the posterior probability distribution from the importance function.

Abstract translation: 使用基于概率模式的多假设跟踪（MHT）的对象跟踪的系统和方法提供了在复杂环境中运动对象（例如头部和面部）的鲁棒和计算上有效的跟踪。基于模式的多假设跟踪器使用在参数状态空间中从初始样本精化的局部最大值的模式。由于模式具有很高的代表性，所以基于模式的多假设跟踪器使用少量假设来有效地建模非线性概率分布。通过使用参数因果轮廓模型来将初始轮廓细化到附近模式，可以实现实时跟踪性能。另外，常规MHT方案的一个共同缺点，即仅产生最大似然估计而不是期望的后验概率分布，通过将重要性采样框架引入到MHT中，并从重要性函数估计后验概率分布来解决。

77.

发明授权
Annotating programs for automatic summary generation 失效

公开(公告)号：US07028325B1

公开(公告)日：2006-04-11

申请号：US09660529

申请日：2000-09-13

Applicant: Yong Rui , Anoop Gupta , Alejandro Acero

Inventor： Yong Rui , Anoop Gupta , Alejandro Acero

IPC: G06F3/00 , G06F13/00 , H04N5/445

CPC classification number: G06F17/30787 , G06F17/30743 , G06F17/30749 , G06F17/30758 , G06F17/30843 , G06K9/00711

Abstract: Audio/video programming content is made available to a receiver from a content provider, and meta data is made available to the receiver from a meta data provider. The meta data corresponds to the programming content, and identifies, for each of multiple portions of the programming content, an indicator of a likelihood that the portion is an exciting portion of the content. In one implementation, the meta data includes probabilities that segments of a baseball program are exciting, and is generated by analyzing the audio data of the baseball program for both excited speech and baseball hits. The meta data can then be used to generate a summary for the baseball program.

78.

发明申请
System and method for communicating audio data signals via an audio communications medium 审中-公开
Title translation: 用于经由音频通信介质传送音频数据信号的系统和方法

公开(公告)号：US20060009867A1

公开(公告)日：2006-01-12

申请号：US11117844

申请日：2005-04-29

Applicant: Roy Leban , Ross Cutler , Henrique Malvar , Yong Rui

Inventor： Roy Leban , Ross Cutler , Henrique Malvar , Yong Rui

IPC: G06F17/00 , H04M11/00

CPC classification number: H04L29/12113 , H04L61/1541 , H04L67/16 , H04M3/567 , H04M11/06 , H04M11/08

Abstract: A system for communicating audio data signals comprises a source computer that performs an action, generates an event message corresponding to the action, converts the event message into an audio data signal, and communicates the audio data signal through its speaker. A source telephone receives a voice signal from a participant and the audio data signal through its microphone and communicates the audio data signal and voice as coherent sound via an audio communications medium. A recipient telephone receives the audio data signal from the coherent sound communicated via the audio communications medium and communicates the audio data signal via its speaker. A recipient computer receives the audio data signal through its microphone, extracts the event message from the audio data signal, and performs an action based on the event message from the audio data signal. The audio communications medium can comprise a telephone communications system or air.

Abstract translation: 用于传送音频数据信号的系统包括执行动作的源计算机，产生与动作相对应的事件消息，将事件消息转换成音频数据信号，并通过其扬声器传送音频数据信号。源电话通过其麦克风接收来自参与者的语音信号和音频数据信号，并通过音频通信介质将音频数据信号和声音作为相干声传送。接收者电话从经由音频通信介质传送的相干声音接收音频数据信号，并通过其扬声器传送音频数据信号。接收者计算机通过其麦克风接收音频数据信号，从音频数据信号中提取事件消息，并根据来自音频数据信号的事件消息执行动作。音频通信介质可以包括电话通信系统或空气。

79.

发明申请
Systems and methods for novel real-time audio-visual communication and data collaboration 有权
Title translation: 新型实时视听通信和数据协作的系统和方法

公开(公告)号：US20050262201A1

公开(公告)日：2005-11-24

申请号：US10836778

申请日：2004-04-30

Applicant: Eric Rudolph , Yong Rui , Henrique Malvar , Li-Wei He , Michael Cohen , Ivan Tashev

Inventor： Eric Rudolph , Yong Rui , Henrique Malvar , Li-Wei He , Michael Cohen , Ivan Tashev

IPC: H04N7/15 , H04L12/18 , H04L29/06 , H04M3/56 , G06F15/16

CPC classification number: H04L12/1827 , H04L12/1831 , H04L29/06027 , H04L65/403 , H04L65/4038

Abstract: Systems and methods are disclosed that facilitate real-time information exchange in a multimedia conferencing environment. Data Client(s) facilitate data collaboration between users and are maintained separately from audio/video (AV) Clients that provide real-time communication functionality. Data Clients can be remotely located with respect to one another and with respect to a server. A remote user Stand-in Device can be provided that comprises a display to present a remote user to local users, a digital automatic pan/tilt/zoom camera to capture imagery in, for example, a conference room and provide real-time information to an AV Client in a remote office, and a microphone array that can similarly provide real-time audio information from the conference room to an AV Client in the remote office. The invention further facilitates file transfer and presentation broadcast between Data Clients in a single location or in a plurality of disparate locations.

Abstract translation: 公开了促进多媒体会议环境中的实时信息交换的系统和方法。数据客户端促进用户之间的数据协作，并与提供实时通信功能的音频/视频（AV）客户端分开维护。数据客户端可以相对于彼此和相对于服务器远程定位。可以提供远程用户待机设备，其包括向本地用户呈现远程用户的显示器，用于在例如会议室中捕获图像的数字自动摇摄/俯仰/变焦相机，并且提供实时信息远程办公室中的AV客户端以及可以类似地从会议室向远程办公室中的AV客户端提供实时音频信息的麦克风阵列。本发明进一步便于在单个位置或多个不同位置的数据客户端之间的文件传送和呈现广播。

80.

发明申请
Automatic detection and tracking of multiple individuals using multiple cues 有权

公开(公告)号：US20050210103A1

公开(公告)日：2005-09-22

申请号：US11042766

申请日：2005-01-25

Applicant: Yong Rui , Yunqiang Chen

Inventor： Yong Rui , Yunqiang Chen

IPC: G06T1/00 , G06K9/00 , G06T1/20 , G06T7/00 , G06T7/20 , H04N7/15 , H04N7/26 , G06F15/16

CPC classification number: G06K9/00234 , G06T7/251 , G06T2207/10016 , G06T2207/30196 , G06T2207/30201

Abstract: Automatic detection and tracking of multiple individuals includes receiving a frame of video and/or audio content and identifying a candidate area for a new face region in the frame. One or more hierarchical verification levels are used to verify whether a human face is in the candidate area, and an indication made that the candidate area includes a face if the one or more hierarchical verification levels verify that a human face is in the candidate area. A plurality of audio and/or video cues are used to track each verified face in the video content from frame to frame.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification