System and process for robust sound source localization
    71.
    发明申请
    System and process for robust sound source localization 有权
    强大的声源定位系统和过程

    公开(公告)号:US20060215850A1

    公开(公告)日:2006-09-28

    申请号:US11267678

    申请日:2005-11-04

    CPC classification number: H04R3/005 G10L21/0272 G10L2021/02165

    Abstract: A system and process for finding the location of a sound source using direct approaches having weighting factors that mitigate the effect of both correlated and reverberation noise is presented. When more than two microphones are used, the traditional time-delay-of-arrival (TDOA) based sound source localization (SSL) approach involves two steps. The first step computes TDOA for each microphone pair, and the second step combines these estimates. This two-step process discards relevant information in the first step, thus degrading the SSL accuracy and robustness. In the present invention, direct, one-step, approaches are employed. Namely, a one-step TDOA SSL approach and a steered beam (SB) SSL approach are employed. Each of these approaches provides an accuracy and robustness not available with the traditional two-step approaches.

    Abstract translation: 提出了使用具有减轻相关和混响噪声的影响的加权因子的直接方法来发现声源的位置的系统和过程。 当使用两个以上的麦克风时,传统的延迟延时(TDOA)声源定位(SSL)方法涉及两个步骤。 第一步计算每个麦克风对的TDOA,第二步合并这些估计。 这两步过程在第一步中丢弃相关信息,从而降低了SSL的准确性和鲁棒性。 在本发明中,采用直接的一步法。 也就是说,采用一步式TDOA SSL方法和转向束(SB)SSL方法。 这些方法中的每一种提供了传统的两步方法不可用的精度和鲁棒性。

    Event-based system and process for recording and playback of collaborative electronic presentations
    72.
    发明授权
    Event-based system and process for recording and playback of collaborative electronic presentations 有权
    基于事件的系统和记录和回放协作电子演示的过程

    公开(公告)号:US07099798B2

    公开(公告)日:2006-08-29

    申请号:US10973186

    申请日:2004-10-25

    Applicant: Bin Yu Yong Rui

    Inventor: Bin Yu Yong Rui

    CPC classification number: G06Q10/10

    Abstract: An event-based system and process for recording and playback of collaborative electronic presentations is presented. The present system and process includes a technique for recording collaborative electronic presentations by capturing and storing the interactions between each participant and presentation data where each interaction event is timestamped and linked to a data file comprising the presentation data. The present system and process also includes a technique for playing back the recorded collaborative electronic presentation, which involves displaying the presentation data in an order it was originally presented and reproducing the recorded interactions between each participant and the displayed presentation data at the same point in the presentation that they were originally performed, based on the aforementioned timestamps.

    Abstract translation: 介绍了一个基于事件的系统和进程,用于录制和回放协同电子演示文稿。 本系统和过程包括通过捕获和存储每个参与者与呈现数据之间的交互来记录协同电子呈现的技术,其中每个交互事件被加时间戳并链接到包括呈现数据的数据文件。 本系统和过程还包括一种用于回放记录的协同电子表现的技术,其涉及以原始呈现的顺序显示呈现数据,并且在相同的点处再现每个参与者和所显示的呈现数据之间的记录的交互 基于上述时间戳,他们最初执行的演示。

    Mode- based multi-hypothesis tracking using parametric contours
    73.
    发明申请
    Mode- based multi-hypothesis tracking using parametric contours 有权
    基于模式的多假设跟踪使用参数轮廓

    公开(公告)号:US20060078163A1

    公开(公告)日:2006-04-13

    申请号:US11282365

    申请日:2005-11-17

    Abstract: A system and method for object tracking using probabilistic mode-based multi-hypothesis tracking (MHT) provides for robust and computationally efficient tracking of moving objects such as heads and faces in complex environments. A mode-based multi-hypothesis tracker uses modes that are local maximums which are refined from initial samples in a parametric state space. Because the modes are highly representative, the mode-based multi-hypothesis tracker effectively models non-linear probabilistic distributions using a small number of hypotheses. Real-time tracking performance is achieved by using a parametric causal contour model to refine initial contours to nearby modes. In addition, one common drawback of conventional MHT schemes, i.e., producing only maximum likelihood estimates instead of a desired posterior probability distribution, is addressed by introducing an importance sampling framework into MHT, and estimating the posterior probability distribution from the importance function.

    Abstract translation: 使用基于概率模式的多假设跟踪(MHT)的对象跟踪的系统和方法提供了在复杂环境中运动对象(例如头部和面部)的鲁棒和计算上有效的跟踪。 基于模式的多假设跟踪器使用在参数状态空间中从初始样本精化的局部最大值的模式。 由于模式具有很高的代表性,所以基于模式的多假设跟踪器使用少量假设来有效地建模非线性概率分布。 通过使用参数因果轮廓模型来将初始轮廓细化到附近模式,可以实现实时跟踪性能。 另外,常规MHT方案的一个共同缺点,即仅产生最大似然估计而不是期望的后验概率分布,通过将重要性采样框架引入到MHT中,并从重要性函数估计后验概率分布来解决。

    System and method for communicating audio data signals via an audio communications medium
    75.
    发明申请
    System and method for communicating audio data signals via an audio communications medium 审中-公开
    用于经由音频通信介质传送音频数据信号的系统和方法

    公开(公告)号:US20060009867A1

    公开(公告)日:2006-01-12

    申请号:US11117844

    申请日:2005-04-29

    Abstract: A system for communicating audio data signals comprises a source computer that performs an action, generates an event message corresponding to the action, converts the event message into an audio data signal, and communicates the audio data signal through its speaker. A source telephone receives a voice signal from a participant and the audio data signal through its microphone and communicates the audio data signal and voice as coherent sound via an audio communications medium. A recipient telephone receives the audio data signal from the coherent sound communicated via the audio communications medium and communicates the audio data signal via its speaker. A recipient computer receives the audio data signal through its microphone, extracts the event message from the audio data signal, and performs an action based on the event message from the audio data signal. The audio communications medium can comprise a telephone communications system or air.

    Abstract translation: 用于传送音频数据信号的系统包括执行动作的源计算机,产生与动作相对应的事件消息,将事件消息转换成音频数据信号,并通过其扬声器传送音频数据信号。 源电话通过其麦克风接收来自参与者的语音信号和音频数据信号,并通过音频通信介质将音频数据信号和声音作为相干声传送。 接收者电话从经由音频通信介质传送的相干声音接收音频数据信号,并通过其扬声器传送音频数据信号。 接收者计算机通过其麦克风接收音频数据信号,从音频数据信号中提取事件消息,并根据来自音频数据信号的事件消息执行动作。 音频通信介质可以包括电话通信系统或空气。

    Systems and methods for novel real-time audio-visual communication and data collaboration
    76.
    发明申请
    Systems and methods for novel real-time audio-visual communication and data collaboration 有权
    新型实时视听通信和数据协作的系统和方法

    公开(公告)号:US20050262201A1

    公开(公告)日:2005-11-24

    申请号:US10836778

    申请日:2004-04-30

    Abstract: Systems and methods are disclosed that facilitate real-time information exchange in a multimedia conferencing environment. Data Client(s) facilitate data collaboration between users and are maintained separately from audio/video (AV) Clients that provide real-time communication functionality. Data Clients can be remotely located with respect to one another and with respect to a server. A remote user Stand-in Device can be provided that comprises a display to present a remote user to local users, a digital automatic pan/tilt/zoom camera to capture imagery in, for example, a conference room and provide real-time information to an AV Client in a remote office, and a microphone array that can similarly provide real-time audio information from the conference room to an AV Client in the remote office. The invention further facilitates file transfer and presentation broadcast between Data Clients in a single location or in a plurality of disparate locations.

    Abstract translation: 公开了促进多媒体会议环境中的实时信息交换的系统和方法。 数据客户端促进用户之间的数据协作,并与提供实时通信功能的音频/视频(AV)客户端分开维护。 数据客户端可以相对于彼此和相对于服务器远程定位。 可以提供远程用户待机设备,其包括向本地用户呈现远程用户的显示器,用于在例如会议室中捕获图像的数字自动摇摄/俯仰/变焦相机,并且提供实时信息 远程办公室中的AV客户端以及可以类似地从会议室向远程办公室中的AV客户端提供实时音频信息的麦克风阵列。 本发明进一步便于在单个位置或多个不同位置的数据客户端之间的文件传送和呈现广播。

    Annotating programs for automatic summary generation
    78.
    发明申请
    Annotating programs for automatic summary generation 有权
    注释自动汇总生成程序

    公开(公告)号:US20050159956A1

    公开(公告)日:2005-07-21

    申请号:US11073144

    申请日:2005-03-04

    Abstract: Audio/video programming content is made available to a receiver from a content provider, and meta data is made available to the receiver from a meta data provider. The meta data corresponds to the programming content, and identifies, for each of multiple portions of the programming content, an indicator of a likelihood that the portion is an exciting portion of the content. In one implementation, the meta data includes probabilities that segments of a baseball program are exciting, and is generated by analyzing the audio data of the baseball program for both excited speech and baseball hits. The meta data can then be used to generate a summary for the baseball program.

    Abstract translation: 音频/视频节目内容从内容提供商可用于接收者,并且元数据从元数据提供者向接收者提供。 元数据对应于节目内容,并且针对节目内容的多个部分中的每一个识别该部分是内容的激动部分的可能性的指示符。 在一个实现中,元数据包括棒球节目的节段令人兴奋的概率,并且通过分析用于激发的语音和棒球命中的棒球节目的音频数据而产生。 然后可以使用元数据来生成棒球程序的摘要。

    USER INTERFACE FOR THREE-DIMENSIONAL MODELING
    79.
    发明申请
    USER INTERFACE FOR THREE-DIMENSIONAL MODELING 有权
    用户界面进行三维建模

    公开(公告)号:US20140368620A1

    公开(公告)日:2014-12-18

    申请号:US13919933

    申请日:2013-06-17

    Abstract: A method of acquiring a set of images useable to 3D model a physical object includes imaging the physical object with a camera, and displaying with the camera a current view of the physical object as imaged by the camera from a current perspective. The method further includes displaying with the camera a visual cue overlaying the current view and indicating perspectives from which the physical object is to be imaged to acquire the set of images.

    Abstract translation: 获取可用于对物理对象进行3D建模的一组图像的方法包括用相机对物理对象进行成像,并且从当前的角度通过相机显示物理对象的当前视图。 该方法还包括用相机显示覆盖当前视图的视觉提示,并且指示要从其中成像物理对象的视角以获取该组图像。

Patent Agency Ranking