Image segmentation using spatial-color Gaussian mixture models
    21.
    发明授权
    Image segmentation using spatial-color Gaussian mixture models 失效
    使用空间色高斯混合模型的图像分割

    公开(公告)号:US07885463B2

    公开(公告)日:2011-02-08

    申请号:US11393576

    申请日:2006-03-30

    Abstract: A spatial-color Gaussian mixture model (SCGMM) image segmentation technique for segmenting images. The SCGMM image segmentation technique specifies foreground objects in the first frame of an image sequence, either manually or automatically. From the initial segmentation, the SCGMM segmentation system learns two spatial-color Gaussian mixture models (SCGMM) for the foreground and background objects. These models are built into a first-order Markov random field (MRF) energy function. The minimization of the energy function leads to a binary segmentation of the images in the image sequence, which can be solved efficiently using a conventional graph cut procedure.

    Abstract translation: 用于分割图像的空间色彩高斯混合模型(SCGMM)图像分割技术。 SCGMM图像分割技术手动或自动地指定图像序列的第一帧中的前景对象。 从初始分割,SCGMM分割系统为前景和背景对象学习两个空间色高斯混合模型(SCGMM)。 这些模型内置于一阶马尔科夫随机场(MRF)能量函数中。 能量函数的最小化导致图像序列中的图像的二进制分割,这可以使用常规的图形切割程序有效地解决。

    Systems and methods for real-time audio-visual communication and data collaboration in a network conference environment
    22.
    发明授权
    Systems and methods for real-time audio-visual communication and data collaboration in a network conference environment 有权
    网络会议环境中实时视听通信和数据协作的系统和方法

    公开(公告)号:US07634533B2

    公开(公告)日:2009-12-15

    申请号:US10836778

    申请日:2004-04-30

    Abstract: Systems and methods are disclosed that facilitate real-time information exchange in a multimedia conferencing environment. Data Client(s) facilitate data collaboration between users and are maintained separately from audio/video (AV) Clients that provide real-time communication functionality. Data Clients can be remotely located with respect to one another and with respect to a server. A remote user Stand-in Device can be provided that comprises a display to present a remote user to local users, a digital automatic pan/tilt/zoom camera to capture imagery in, for example, a conference room and provide real-time information to an AV Client in a remote office, and a microphone array that can similarly provide real-time audio information from the conference room to an AV Client in the remote office. The invention further facilitates file transfer and presentation broadcast between Data Clients in a single location or in a plurality of disparate locations.

    Abstract translation: 公开了促进多媒体会议环境中的实时信息交换的系统和方法。 数据客户端促进用户之间的数据协作,并与提供实时通信功能的音频/视频(AV)客户端分开维护。 数据客户端可以相对于彼此和相对于服务器远程定位。 可以提供远程用户待机设备,其包括向本地用户呈现远程用户的显示器,用于在例如会议室中捕获图像的数字自动摇摄/俯仰/变焦相机,并且提供实时信息 远程办公室中的AV客户端以及可以类似地从会议室向远程办公室中的AV客户端提供实时音频信息的麦克风阵列。 本发明进一步便于在单个位置或多个不同位置的数据客户端之间的文件传送和呈现广播。

    Portable solution for automatic camera management
    24.
    发明授权
    Portable solution for automatic camera management 失效
    用于自动相机管理的便携式解决方案

    公开(公告)号:US07512883B2

    公开(公告)日:2009-03-31

    申请号:US10883123

    申请日:2004-06-30

    CPC classification number: H04M9/082

    Abstract: A “virtual video studio”, as described herein, provides a highly portable real-time capability to automatically capture, record, and edit a plurality of video streams of a presentation, such as, for example, a speech, lecture, seminar, classroom instruction, talk-show, teleconference, etc., along with any accompanying exhibits, such as a corresponding slide presentation, using a suite of one or more unmanned cameras controlled by a set of videography rules. The resulting video output may then either be stored for later use, or broadcast in real-time to a remote audience. This real-time capability is achieved by using an abstraction of “virtual cameramen” and physical cameras in combination with a scriptable interface to the aforementioned videography rules for capturing and editing the recorded video to create a composite video of the presentation in real-time under the control of a “virtual director.”

    Abstract translation: 如本文所述,“虚拟视频工作室”提供了高度便携的实时能力,以自动地捕获,记录和编辑演示文稿的多个视频流,例如语音,讲座,研讨会,教室 指导,谈话节目,电话会议等,以及任何伴随的展览,例如相应的幻灯片呈现,使用由一组视频规则控制的一个或多个无人摄像机的套件。 然后,所得到的视频输出可以被存储以供以后使用,或者实时地向远程受众广播。 这种实时功能是通过使用“虚拟摄像头”和物理摄像机的抽象结合上述视频规则的脚本接口来实现的,用于捕获和编辑记录的视频,以便实时地创建演示文稿的复合视频 控制“虚拟导演”。

    SYSTEM AND METHOD FOR DISTRIBUTED MEETINGS
    25.
    发明申请
    SYSTEM AND METHOD FOR DISTRIBUTED MEETINGS 有权
    分布式会议的系统和方法

    公开(公告)号:US20090046139A1

    公开(公告)日:2009-02-19

    申请号:US12191270

    申请日:2008-08-13

    CPC classification number: H04N7/15 H04N7/152 H04N7/155

    Abstract: A system and method for teleconferencing and recording of meetings. The system uses a variety of capture devices (a novel 360° camera, a whiteboard camera, a presenter view camera, a remote view camera, and a microphone array) to provide a rich experience for people who want to participate in a meeting from a distance. The system is also combined with speaker clustering, spatial indexing, and time compression to provide a rich experience for people who miss a meeting and want to watch it afterward.

    Abstract translation: 电话会议和会议记录的系统和方法。 该系统使用各种捕获设备(新颖的360度相机,白板摄像头,演示者相机,遥控摄像头和麦克风阵列),为希望参加会议的人们提供丰富的体验 距离。 该系统还结合扬声器群集,空间索引和时间压缩,为错过会议并希望观看的人们提供丰富的体验。

    Automated camera management system and method for capturing presentations using videography rules
    26.
    发明授权
    Automated camera management system and method for capturing presentations using videography rules 失效
    使用摄影规则捕获演示的自动相机管理系统和方法

    公开(公告)号:US07349008B2

    公开(公告)日:2008-03-25

    申请号:US10307088

    申请日:2002-11-30

    CPC classification number: H04N7/181 H04N7/185 H04N7/188

    Abstract: An automated camera management system and method for capturing presentations using videography rules. The system and method use technology components and aesthetic components represented by the videography rules to capture a presentation. In general, the automated camera management method captures a presentation using videography rules to determine camera positioning, camera movement, and switching or transition between cameras. The videography rules depend on the type of presentation room and the number of audio-visual camera units used to capture the presentation. The automated camera management system of the invention uses the above method to capture a presentation in a presentation room. The system includes a least one audio-visual (A-V) camera unit for capturing and tracking a subject based on vision or sound. The (A-V) camera unit includes any combination of the following components: (1) a pan-tilt-zoom (PTZ) camera; (2) a fixed camera; and (3) a microphone array.

    Abstract translation: 一种使用摄影规则捕获演示的自动相机管理系统和方法。 该系统和方法使用由视频规则表示的技术组件和美学组件来捕获演示文稿。 通常,自动相机管理方法使用摄影规则捕获演示,以确定相机定位,相机移动以及相机之间的切换或转换。 视频规则取决于演示室的类型和用于捕获演示的视听相机单元的数量。 本发明的自动相机管理系统使用上述方法在演示室中捕获演示。 该系统包括至少一个视听(A-V)摄像机单元,用于基于视觉或声音捕获和跟踪被摄体。 (A-V)相机单元包括以下组件的任意组合:(1)俯仰放大(PTZ)相机; (2)固定摄像头; 和(3)麦克风阵列。

    System and process for locating a speaker using 360 degree sound source localization
    27.
    发明授权
    System and process for locating a speaker using 360 degree sound source localization 有权
    使用360度声源定位来定位扬声器的系统和过程

    公开(公告)号:US07305095B2

    公开(公告)日:2007-12-04

    申请号:US11182142

    申请日:2005-07-15

    Applicant: Yong Rui

    Inventor: Yong Rui

    CPC classification number: H04R3/005 H04R2201/401

    Abstract: A system and process is described for estimating the location of a speaker using signals output by a microphone array characterized by multiple pairs of audio sensors. The location of a speaker is estimated by first determining whether the signal data contains human speech components and filtering out noise attributable to stationary sources. The location of the person speaking is then estimated using a time-delay-of-arrival based SSL technique on those parts of the data determined to contain human speech components. A consensus location for the speaker is computed from the individual location estimates associated with each pair of microphone array audio sensors taking into consideration the uncertainty of each estimate. A final consensus location is also computed from the individual consensus locations computed over a prescribed number of sampling periods using a temporal filtering technique.

    Abstract translation: 描述了一种系统和过程,用于使用由多对音频传感器表征的麦克风阵列输出的信号来估计扬声器的位置。 通过首先确定信号数据是否包含人类语音分量并滤除归因于固定源的噪声来估计扬声器的位置。 然后,使用基于时间延迟的SSL技术来估计说话人的位置,以确定包含人类语音组件的数据的那些部分。 考虑到每个估计的不确定性,从与每对麦克风阵列音频传感器相关联的各个位置估计计算扬声器的共识位置。 还可以使用时间滤波技术从在规定数量的采样周期上计算的单个共识位置计算最终共识位置。

    Interactive table based platform to facilitate collaborative activities
    28.
    发明申请
    Interactive table based platform to facilitate collaborative activities 审中-公开
    基于交互式表的平台,促进协作活动

    公开(公告)号:US20070124370A1

    公开(公告)日:2007-05-31

    申请号:US11289671

    申请日:2005-11-29

    CPC classification number: G06Q10/10

    Abstract: A unique system and method that facilitates multi-user collaborative interactions is provided. Multiple users can provide input to an interactive surface at or about the same time without yielding control of the surface to any one user. The multiple users can share control of the surface and perform operations on various objects displayed on the surface. The objects can undergo a variety of manipulations and modifications depending on the particular application in use. Objects can be moved or copied between the interactive surface (a public workspace) and a more private workspace where a single user controls the workspace. The objects can also be grouped as desired

    Abstract translation: 提供了一种促进多用户协作交互的独特系统和方法。 多个用户可以在或大约相同的时间向交互式表面提供输入,而不会使表面对任何一个用户的控制。 多个用户可以共享表面的控制,并对表面上显示的各种对象执行操作。 根据使用中的具体应用,对象可以进行各种操作和修改。 可以在交互式表面(公共工作区)和单个用户控制工作空间的更私有工作区之间移动或复制对象。 对象也可以根据需要进行分组

    Distributed presentations employing inputs from multiple video cameras located at multiple sites and customizable display screen configurations
    29.
    发明申请
    Distributed presentations employing inputs from multiple video cameras located at multiple sites and customizable display screen configurations 失效
    分布式演示文稿,采用位于多个位置的多台摄像机的输入和可定制的显示屏幕配置

    公开(公告)号:US20070118868A1

    公开(公告)日:2007-05-24

    申请号:US11286651

    申请日:2005-11-23

    CPC classification number: H04N7/181 H04N21/4223 H04N21/4316

    Abstract: A computer network-based distributed presentation system and process is presented that controls the display of one or more video streams output by multiple video cameras located across multiple presentation sites on display screens located at each presentation site. The distributed presentation system and process provides the ability for a user at a site to customize the screen configuration (i.e., what video streams are display at any one time and in what format) for that site via a two-layer display director module. In the design layer of the module, a user interface is provided for a user to specify display priorities dictating what video streams are to be displayed on the screen over time. These display priorities are then provided to the execution layer of the module which translates them into probabilistic timed automata and uses the automata to control what is displayed on the display screen.

    Abstract translation: 提出了一种基于计算机网络的分布式呈现系统和过程,其控制由位于每个呈现站点的显示屏幕上的多个呈现站点上的多个摄像机输出的一个或多个视频流的显示。 分布式呈现系统和过程提供了一个站点用户通过两层显示导演模块定制屏幕配置(即,任何一个时间和以什么格式显示什么视频流)的能力。 在模块的设计层中,为用户提供用户界面,以指定显示优先级,指定在屏幕上随时间显示哪些视频流。 然后将这些显示优先级提供给模块的执行层,将其转换为概率定时自动机,并使用自动机来控制显示屏上显示的内容。

    System and process for muting audio transmission during a computer network-based, multi-party teleconferencing session
    30.
    发明申请
    System and process for muting audio transmission during a computer network-based, multi-party teleconferencing session 有权
    在基于计算机网络的多方电话会议期间,静音音频传输的系统和过程

    公开(公告)号:US20060167995A1

    公开(公告)日:2006-07-27

    申请号:US11035115

    申请日:2005-01-12

    Applicant: Yong Rui

    Inventor: Yong Rui

    Abstract: A system and process for muting the audio transmission from a location of a participant engaged in a multi-party, computer network-based teleconference when that participant is working on a keyboard, is presented. The audio is muted as it is assumed the participant is doing something other than actively participation in the meeting when typing on the keyboard. If left un-muted the sound of typing would distract the other participant in the teleconference.

    Abstract translation: 提出了一种系统和过程,用于在参与者正在使用键盘时从参与多方计算机网络的电话会议中的参与者的位置静音音频传输。 音频被静音,因为假设在键盘上键入时,参与者正在做积极参与会议的其他事情。 如果没有静音,打字的声音会分散电话会议中的其他参与者的注意力。

Patent Agency Ranking