System and method for applying digital make-up in video conferencing
    41.
    发明授权
    System and method for applying digital make-up in video conferencing 失效
    在视频会议中应用数字化妆的系统和方法

    公开(公告)号:US07612794B2

    公开(公告)日:2009-11-03

    申请号:US11137252

    申请日:2005-05-25

    CPC classification number: H04N7/147

    Abstract: A method of digitally adding the appearance of makeup to a videoconferencing participant. The system and method for applying digital make-up operates in a loop processing sequential video frames. For each input frame, there are typically three general steps: 1) Locating the face and eye and mouth regions; 2) Applying digital make-up to the face, preferably with the exception of the eye and open mouth areas; and 3) Blending the make-up region with the rest of the face. In one embodiment of the invention, the background in the frame containing a video conferencing participant can also be modified so that other video conferencing participants cannot clearly see the background behind the participant in the image frame. In one such embodiment of the invention, the video conferencing participant tries to make his or her own image look comical or altered. In another embodiment of the invention, a particular remote participant tries to make another participant look funny to the other participants.

    Abstract translation: 将化妆品外观数字化添加到视频会议参与者的方法。 用于应用数字化妆的系统和方法在循环处理顺序视频帧中操作。 对于每个输入框架,通常有三个一般步骤:1)定位脸部和眼睛和嘴部区域; 2)应用数字化妆面部,最好除了眼睛和开口区域外; 和3)将化妆区域与面部的其余部分混合。 在本发明的一个实施例中,还可以修改包含视频会议参与者的帧的背景,使得其他视频会议参与者不能清楚地看到图像帧中的参与者后面的背景。 在本发明的一个这样的实施例中,视频会议参与者尝试使他或她自己的图像看上去滑稽或改变。 在本发明的另一个实施例中,特定的远程参与者尝试使另一参与者对其他参与者看起来很滑稽。

    Automatic detection and tracking of multiple individuals using multiple cues
    42.
    发明授权
    Automatic detection and tracking of multiple individuals using multiple cues 有权
    使用多个线索自动检测和跟踪多个人

    公开(公告)号:US07428315B2

    公开(公告)日:2008-09-23

    申请号:US11042766

    申请日:2005-01-25

    Abstract: Automatic detection and tracking of multiple individuals includes receiving a frame of video and/or audio content and identifying a candidate area for a new face region in the frame. One or more hierarchical verification levels are used to verify whether a human face is in the candidate area, and an indication made that the candidate area includes a face if the one or more hierarchical verification levels verify that a human face is in the candidate area. A plurality of audio and/or video cues are used to track each verified face in the video content from frame to frame.

    Abstract translation: 多个人的自动检测和跟踪包括接收视频和/或音频内容的帧并且识别帧中的新的面部区域的候选区域。 使用一个或多个分级验证级别来验证人脸是否在候选区域中,并且如果所述一个或多个分层验证级别验证人脸在候选区域中,则指示使候选区域包括面部。 使用多个音频和/或视频提示从帧到帧跟踪视频内容中的每个验证的面部。

    Automatic detection and tracking of multiple individuals using multiple cues
    44.
    发明授权
    Automatic detection and tracking of multiple individuals using multiple cues 有权
    使用多个线索自动检测和跟踪多个人

    公开(公告)号:US07171025B2

    公开(公告)日:2007-01-30

    申请号:US11042457

    申请日:2005-01-25

    Abstract: Automatic detection and tracking of multiple individuals includes receiving a frame of video and/or audio content and identifying a candidate area for a new face region in the frame. One or more hierarchical verification levels are used to verify whether a human face is in the candidate area, and an indication made that the candidate area includes a face if the one or more hierarchical verification levels verify that a human face is in the candidate area. A plurality of audio and/or video cues are used to track each verified face in the video content from frame to frame.

    Abstract translation: 多个人的自动检测和跟踪包括接收视频和/或音频内容的帧并且识别帧中的新的面部区域的候选区域。 使用一个或多个分级验证级别来验证人脸是否在候选区域中,并且如果所述一个或多个分层验证级别验证人脸在候选区域中,则指示使候选区域包括面部。 使用多个音频和/或视频提示从帧到帧跟踪视频内容中的每个验证的面部。

    System and process for locating a speaker using 360 degree sound source localization
    45.
    发明授权
    System and process for locating a speaker using 360 degree sound source localization 失效
    使用360度声源定位来定位扬声器的系统和过程

    公开(公告)号:US07039199B2

    公开(公告)日:2006-05-02

    申请号:US10228210

    申请日:2002-08-26

    Applicant: Yong Rui

    Inventor: Yong Rui

    CPC classification number: H04R3/005 H04R2201/401

    Abstract: A system and process is described for estimating the location of a speaker using signals output by a microphone array characterized by multiple pairs of audio sensors. The location of a speaker is estimated by first determining whether the signal data contains human speech components and filtering out noise attributable to stationary sources. The location of the person speaking is then estimated using a time-delay-of-arrival based SSL technique on those parts of the data determined to contain human speech components. A consensus location for the speaker is computed from the individual location estimates associated with each pair of microphone array audio sensors taking into consideration the uncertainty of each estimate. A final consensus location is also computed from the individual consensus locations computed over a prescribed number of sampling periods using a temporal filtering technique.

    Abstract translation: 描述了一种系统和过程,用于使用由多对音频传感器表征的麦克风阵列输出的信号来估计扬声器的位置。 通过首先确定信号数据是否包含人类语音分量并滤除归因于固定源的噪声来估计扬声器的位置。 然后使用基于时间延迟的SSL技术来估计说话人的位置,以确定包含人类语音组件的数据的那些部分。 考虑到每个估计的不确定性,从与每对麦克风阵列音频传感器相关联的各个位置估计计算扬声器的共识位置。 还可以使用时间滤波技术从在规定数量的采样周期上计算的单个共识位置计算最终共识位置。

    Event-based system and process for recording and playback of collaborative electronic presentations

    公开(公告)号:US20060089820A1

    公开(公告)日:2006-04-27

    申请号:US10973186

    申请日:2004-10-25

    Applicant: Bin Yu Yong Rui

    Inventor: Bin Yu Yong Rui

    CPC classification number: G06Q10/10

    Abstract: An event-based system and process for recording and playback of collaborative electronic presentations is presented. The present system and process includes a technique for recording collaborative electronic presentations by capturing and storing the interactions between each participant and presentation data where each interaction event is timestamped and linked to a data file comprising the presentation data. The present system and process also includes a technique for playing back the recorded collaborative electronic presentation, which involves displaying the presentation data in an order it was originally presented and reproducing the recorded interactions between each participant and the displayed presentation data at the same point in the presentation that they were originally performed, based on the aforementioned timestamps.

    System and process for tracking an object state using a particle filter sensor fusion technique
    47.
    发明授权
    System and process for tracking an object state using a particle filter sensor fusion technique 失效
    使用粒子滤波器传感器融合技术跟踪物体状态的系统和过程

    公开(公告)号:US07035764B2

    公开(公告)日:2006-04-25

    申请号:US10985243

    申请日:2004-11-10

    CPC classification number: G06T7/277 G10L2021/02166

    Abstract: A system and process for tracking an object state over time using particle filter sensor fusion and a plurality of logical sensor modules is presented. This new fusion framework combines both the bottom-up and top-down approaches to sensor fusion to probabilistically fuse multiple sensing modalities. At the lower level, individual vision and audio trackers can be designed to generate effective proposals for the fuser. At the higher level, the fuser performs reliable tracking by verifying hypotheses over multiple likelihood models from multiple cues. Different from the traditional fusion algorithms, the present framework is a closed-loop system where the fuser and trackers coordinate their tracking information. Furthermore, to handle non-stationary situations, the present framework evaluates the performance of the individual trackers and dynamically updates their object states. A real-time speaker tracking system based on the proposed framework is feasible by fusing object contour, color and sound source location.

    Abstract translation: 提出了一种使用粒子滤波器传感器融合和多个逻辑传感器模块跟踪物体状态随时间变化的系统和过程。 这种新的融合框架将自下而上和自顶向下的方法与传感器融合相结合,以概率地融合多种感测模式。 在较低级别,个人视觉和音频跟踪器可以设计用于为定影器生成有效的建议。 在较高级别,定影器通过从多个线索的多个似然模型中验证假设来执行可靠的跟踪。 与传统融合算法不同,本框架是闭环系统,其中定影器和跟踪器协调其跟踪信息。 此外,为了处理非平稳情况,本框架评估各个跟踪器的性能并动态更新其对象状态。 基于提出的框架的实时扬声器跟踪系统可以通过融合对象轮廓,颜色和声源位置来实现。

    System and process for robust sound source localization

    公开(公告)号:US06999593B2

    公开(公告)日:2006-02-14

    申请号:US10446924

    申请日:2003-05-28

    CPC classification number: H04R3/005 G10L21/0272 G10L2021/02165

    Abstract: A system and process for finding the location of a sound source using direct approaches having weighting factors that mitigate the effect of both correlated and reverberation noise is presented. When more than two microphones are used, the traditional time-delay-of-arrival (TDOA) based sound source localization (SSL) approach involves two steps. The first step computes TDOA for each microphone pair, and the second step combines these estimates. This two-step process discards relevant information in the first step, thus degrading the SSL accuracy and robustness. In the present invention, direct, one-step, approaches are employed. Namely, a one-step TDOA SSL approach and a steered beam (SB) SSL approach are employed. Each of these approaches provides an accuracy and robustness not available with the traditional two-step approaches.

    Portable solution for automatic camera management

    公开(公告)号:US20060005136A1

    公开(公告)日:2006-01-05

    申请号:US10883123

    申请日:2004-06-30

    CPC classification number: H04M9/082

    Abstract: A “virtual video studio”, as described herein, provides a highly portable real-time capability to automatically capture, record, and edit a plurality of video streams of a presentation, such as, for example, a speech, lecture, seminar, classroom instruction, talk-show, teleconference, etc., along with any accompanying exhibits, such as a corresponding slide presentation, using a suite of one or more unmanned cameras controlled by a set of videography rules. The resulting video output may then either be stored for later use, or broadcast in real-time to a remote audience. This real-time capability is achieved by using an abstraction of “virtual cameramen” and physical cameras in combination with a scriptable interface to the aforementioned videography rules for capturing and editing the recorded video to create a composite video of the presentation in real-time under the control of a “virtual director.”

    System and process for time delay estimation in the presence of correlated noise and reverberation
    50.
    发明申请
    System and process for time delay estimation in the presence of correlated noise and reverberation 有权
    在存在相关噪声和混响的情况下进行延时估计的系统和过程

    公开(公告)号:US20050249038A1

    公开(公告)日:2005-11-10

    申请号:US11182633

    申请日:2005-07-14

    CPC classification number: H04R3/005 H04R2430/23

    Abstract: A system and process for estimating the time delay of arrival (TDOA) between a pair of audio sensors of a microphone array is presented. Generally, a generalized cross-correlation (GCC) technique is employed. However, this technique is improved to include provisions for both reducing the influence (including interference) from correlated ambient noise and reverberation noise in the sensor signals prior to computing the TDOA estimate. Two unique correlated ambient noise reduction procedures are also proposed. One involves the application of Wiener filtering, and the other a combination of Wiener filtering with a Gnn subtraction technique. In addition, two unique reverberation noise reduction procedures are proposed. Both involve applying a weighting factor to the signals prior to computing the TDOA which combines the effects of a traditional maximum likelihood (TML) weighting function and a phase transformation (PHAT) weighting function.

    Abstract translation: 提出了一种用于估计麦克风阵列的一对音频传感器之间的到达时间延迟(TDOA)的系统和过程。 通常,采用广义互相关(GCC)技术。 然而,该技术被改进为包括在计算TDOA估计之前减少传感器信号中相关环境噪声和混响噪声的影响(包括干扰)的规定。 还提出了两个独特的相关环境降噪程序。 一个涉及Wiener滤波的应用,另一个涉及Wiener滤波与G>减法技术的组合。 另外还提​​出了两个独特的混响降噪程序。 两者都涉及在计算结合了传统最大似然(TML)加权函数和相变(PHAT)加权函数的效果的TDOA之前对信号应用加权因子。

Patent Agency Ranking