Automatic detection and tracking of multiple individuals using multiple cues
    101.
    发明申请
    Automatic detection and tracking of multiple individuals using multiple cues 有权
    使用多个线索自动检测和跟踪多个人

    公开(公告)号:US20050188013A1

    公开(公告)日:2005-08-25

    申请号:US11042453

    申请日:2005-01-25

    Abstract: Automatic detection and tracking of multiple individuals includes receiving a frame of video and/or audio content and identifying a candidate area for a new face region in the frame. One or more hierarchical verification levels are used to verify whether a human face is in the candidate area, and an indication made that the candidate area includes a face if the one or more hierarchical verification levels verify that a human face is in the candidate area. A plurality of audio and/or video cues are used to track each verified face in the video content from frame to frame.

    Abstract translation: 多个人的自动检测和跟踪包括接收视频和/或音频内容的帧并且识别帧中的新的面部区域的候选区域。 使用一个或多个分级验证级别来验证人脸是否在候选区域中,并且如果所述一个或多个分层验证级别验证人脸在候选区域中,则指示使候选区域包括面部。 使用多个音频和/或视频提示从帧到帧跟踪视频内容中的每个验证的面部。

    System and method for communicating audio data signals via an audio communications medium
    102.
    发明授权
    System and method for communicating audio data signals via an audio communications medium 失效
    用于经由音频通信介质传送音频数据信号的系统和方法

    公开(公告)号:US06934370B1

    公开(公告)日:2005-08-23

    申请号:US10462243

    申请日:2003-06-16

    Abstract: A system for communicating audio data signals comprises a source computer that performs an action, generates an event message corresponding to the action, converts the event message into an audio data signal, and communicates the audio data signal through its speaker. A source telephone receives a voice signal from a participant and the audio data signal through its microphone and communicates the audio data signal and voice as coherent sound via an audio communications medium. A recipient telephone receives the audio data signal from the coherent sound communicated via the audio communications medium and communicates the audio data signal via its speaker. A recipient computer receives the audio data signal through its microphone, extracts the event message from the audio data signal, and performs an action based on the event message from the audio data signal. The audio communications medium can comprise a telephone communications system or air.

    Abstract translation: 用于传送音频数据信号的系统包括执行动作的源计算机,产生与动作相对应的事件消息,将事件消息转换成音频数据信号,并通过其扬声器传送音频数据信号。 源电话通过其麦克风接收来自参与者的语音信号和音频数据信号,并通过音频通信介质将音频数据信号和声音作为相干声传送。 接收者电话从经由音频通信介质传送的相干声音接收音频数据信号,并通过其扬声器传送音频数据信号。 接收者计算机通过其麦克风接收音频数据信号,从音频数据信号中提取事件消息,并根据来自音频数据信号的事件消息执行动作。 音频通信介质可以包括电话通信系统或空气。

    Annotating programs for automatic summary generations
    103.
    发明申请
    Annotating programs for automatic summary generations 有权
    注释自动汇总的程序

    公开(公告)号:US20050160457A1

    公开(公告)日:2005-07-21

    申请号:US11081118

    申请日:2005-03-15

    Abstract: Audio/video programming content is made available to a receiver from a content provider, and meta data is made available to the receiver from a meta data provider. The meta data corresponds to the programming content, and identifies, for each of multiple portions of the programming content, an indicator of a likelihood that the portion is an exciting portion of the content. In one implementation, the meta data includes probabilities that segments of a baseball program are exciting, and is generated by analyzing the audio data of the baseball program for both excited speech and baseball hits. The meta data can then be used to generate a summary for the baseball program.

    Abstract translation: 音频/视频节目内容从内容提供商可用于接收者,并且元数据从元数据提供者向接收者提供。 元数据对应于节目内容,并且针对节目内容的多个部分中的每一个识别该部分是内容的激动部分的可能性的指示符。 在一个实现中,元数据包括棒球节目的节段令人兴奋的概率,并且通过分析用于激发的语音和棒球命中的棒球节目的音频数据而产生。 然后可以使用元数据来生成棒球程序的摘要。

    System and process for tracking an object state using a particle filter sensor fusion technique

    公开(公告)号:US06882959B2

    公开(公告)日:2005-04-19

    申请号:US10428470

    申请日:2003-05-02

    CPC classification number: G06T7/277 G10L2021/02166

    Abstract: A system and process for tracking an object state over time using particle filter sensor fusion and a plurality of logical sensor modules is presented. This new fusion framework combines both the bottom-up and top-down approaches to sensor fusion to probabilistically fuse multiple sensing modalities. At the lower level, individual vision and audio trackers can be designed to generate effective proposals for the fuser. At the higher level, the fuser performs reliable tracking by verifying hypotheses over multiple likelihood models from multiple cues. Different from the traditional fusion algorithms, the present framework is a closed-loop system where the fuser and trackers coordinate their tracking information. Furthermore, to handle non-stationary situations, the present framework evaluates the performance of the individual trackers and dynamically updates their object states. A real-time speaker tracking system based on the proposed framework is feasible by fusing object contour, color and sound source location.

    Methods and systems for participant sourcing indication in multi-party conferencing and for audio source discrimination
    106.
    发明申请
    Methods and systems for participant sourcing indication in multi-party conferencing and for audio source discrimination 有权
    用于多方会议和音频源歧视的参与者采购指示的方法和系统

    公开(公告)号:US20050076081A1

    公开(公告)日:2005-04-07

    申请号:US10677213

    申请日:2003-10-01

    Abstract: Indications of which participant is providing information during a multi-party conference. Each participant has equipment to display information being transferred during the conference. A sourcing signaler residing in the participant equipment provides a signal that indicates the identity of its participant when this participant is providing information to the conference. The source indicators of the other participant equipment receive the signal and cause a UI to indicate that the participant identified by the received signal is providing information (e.g. the UI can causes the identifier to change appearance). An audio discriminator is used to distinguish between an acoustic signal that was generated by a person speaking from that generated in a band-limited manner. The audio discriminator analyzes the spectrum of detected audio signals and generates several parameters from the spectrum and from past determinations to determine the source of an audio signal on a frame-by-frame basis.

    Abstract translation: 参与者在多方会议期间提供信息的指示。 每个参与者都有设备在会议期间显示要传送的信息。 驻留在参与者设备中的采购信号器提供当该参与者向会议提供信息时指示其参与者身份的信号。 其他参与者设备的源指示符接收信号并使UI指示由接收到的信号识别的参与者提供信息(例如,UI可以使得标识符改变外观)。 使用音频鉴别器来区分由以频带限制的方式产生的声音所产生的声信号。 音频鉴别器分析检测到的音频信号的频谱,并从频谱和过去的确定中产生几个参数,以逐帧确定音频信号的来源。

    System and method for devising a human interactive proof that determines whether a remote client is a human or a computer program
    107.
    发明申请
    System and method for devising a human interactive proof that determines whether a remote client is a human or a computer program 有权
    用于设计确定远程客户端是人类还是计算机程序的人类交互式证明的系统和方法

    公开(公告)号:US20050065802A1

    公开(公告)日:2005-03-24

    申请号:US10664657

    申请日:2003-09-19

    CPC classification number: G06Q30/02

    Abstract: A system and method for automatically determining if a remote client is a human or a computer. A set of HIP design guidelines which are important to ensure the security and usability of a HIP system are described. Furthermore, one embodiment of this new HIP system and method is based on human face and facial feature detection. Because human face is the most familiar object to all human users the embodiment of the invention employing a face is possibly the most universal HIP system so far.

    Abstract translation: 用于自动确定远程客户端是人机还是计算机的系统和方法。 描述了一套重要的HIP设计指南,以确保HIP系统的安全性和可用性。 此外,这种新的HIP系统和方法的一个实施例是基于人脸和面部特征检测。 因为人脸是所有人类用户最熟悉的对象,所以使用脸部的发明的实施方式可能是迄今为止最普遍的HIP系统。

Patent Agency Ranking