Systems and methods for the automatic segmentation and clustering of ordered information
    11.
    Granted invention patent
    Status: Lapsed

    Publication No.: US06915009B2

    Publication date: 2005-07-05

    Application No.: US09947385

    Filing date: 2001-09-07

    CPC classification number: G06K9/00711 G06K9/00718 G06K9/00765 G06K9/6218

    Abstract: Techniques for segmenting ordered information such as audio, video and text are provided by windowing and parameterizing an ordered information stream and storing the windowed, parameterized information in a two-dimensional representation such as a matrix. The similarity between the parameter vectors is determined, and an orthogonal matrix decomposition such as singular value decomposition is applied to the similarity matrix. The singular values or eigenvalues of the resulting decomposition indicate major components or segments of the ordered information. The boundaries of the major components may be determined from the singular vectors. Applications include smart cut-and-paste of ordered information, in which boundaries are automatically identified by the singular vectors; automatic categorization and retrieval of ordered information; and automatic summarization of ordered information.
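
    The pipeline in this abstract (feature vectors per window, a similarity matrix, a decomposition, boundaries read from the leading vectors) can be sketched roughly as follows. This is an illustrative sketch only, not the patented method: cosine similarity, power iteration with deflation, and the rule of labelling each window by its largest vector component are all assumptions, as are the function names. For a symmetric positive semi-definite similarity matrix the eigenvalues coincide with the singular values, which is why a simple eigen-solver stands in for SVD here. Feature vectors are assumed to be non-zero.

```python
import math

def cosine_similarity_matrix(frames):
    """Pairwise cosine similarity between per-window feature vectors."""
    norms = [math.sqrt(sum(x * x for x in f)) for f in frames]
    return [[sum(a * b for a, b in zip(fi, fj)) / (ni * nj)
             for fj, nj in zip(frames, norms)]
            for fi, ni in zip(frames, norms)]

def top_components(sim, k, iters=200):
    """Leading k (eigenvalue, eigenvector) pairs of a symmetric matrix,
    found by power iteration with deflation."""
    n = len(sim)
    work = [row[:] for row in sim]          # deflated copy of the matrix
    pairs = []
    for _ in range(k):
        v = [1.0] * n                       # deterministic starting vector
        for _ in range(iters):
            w = [sum(work[i][j] * v[j] for j in range(n)) for i in range(n)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm < 1e-12:                # nothing left to extract
                break
            v = [x / norm for x in w]
        # Rayleigh quotient gives the eigenvalue for the converged vector.
        lam = sum(v[i] * sum(work[i][j] * v[j] for j in range(n))
                  for i in range(n))
        pairs.append((lam, v))
        for i in range(n):                  # remove the found component
            for j in range(n):
                work[i][j] -= lam * v[i] * v[j]
    return pairs

def segment_boundaries(frames, k):
    """Label each window by its dominant component (assumed rule) and
    report the indices where the label changes."""
    pairs = top_components(cosine_similarity_matrix(frames), k)
    labels = [max(range(k), key=lambda c: abs(pairs[c][1][i]))
              for i in range(len(frames))]
    return [i for i in range(1, len(labels)) if labels[i] != labels[i - 1]]
```

    With four windows of one feature pattern followed by three of another, the two leading eigenvalues reflect the two segment sizes and a single boundary is reported between them.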

    Methods and apparatuses for interactive similarity searching, retrieval, and browsing of video
    12.
    Granted invention patent
    Status: In force

    Publication No.: US06774917B1

    Publication date: 2004-08-10

    Application No.: US09266558

    Filing date: 1999-03-11

    CPC classification number: G06K9/00758 G06F17/30814 G06F17/30825 G06F17/3084

    Abstract: Methods for interactively selecting training images from a video for a video similarity search, and for displaying the results of the similarity search, are disclosed. The user selects a time interval in the video as a query definition of training images for training an image class statistical model. Time intervals can be as short as one frame or consist of disjoint segments or shots. A statistical model of the image class defined by the training images is calculated on-the-fly from feature vectors extracted from transforms of the training images. For each frame in the video, a feature vector is extracted from the transform of the frame, and a similarity measure is calculated using the feature vector and the image class statistical model. The similarity measure is derived from the likelihood of a Gaussian model producing the frame. The similarity is then presented graphically, which allows the time structure of the video to be visualized and browsed. Similarity can be rapidly calculated for other video files as well, which enables content-based retrieval by example. A content-aware video browser featuring interactive similarity measurement is presented. A method for selecting training segments involves mouse click-and-drag operations over a time bar representing the duration of the video; similarity results are displayed as shades in the time bar. Another method involves selecting periodic frames of the video as endpoints for the training segment.
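
    As a rough illustration of the likelihood-based similarity described above, the sketch below fits a Gaussian to the feature vectors of a selected training interval and scores every frame by its log-likelihood. The diagonal covariance, the variance floor (which keeps a one-frame training interval usable), and all function names are assumptions made for the example; the abstract does not specify them, and feature extraction from frame transforms is left out.

```python
import math

def fit_diagonal_gaussian(vectors, var_floor=1e-6):
    """Per-dimension mean and variance of the training feature vectors.
    The floor avoids zero variance when few training frames are given."""
    n = len(vectors)
    dims = len(vectors[0])
    mean = [sum(v[d] for v in vectors) / n for d in range(dims)]
    var = [max(sum((v[d] - mean[d]) ** 2 for v in vectors) / n, var_floor)
           for d in range(dims)]
    return mean, var

def log_likelihood(x, model):
    """Log-density of feature vector x under the diagonal Gaussian."""
    mean, var = model
    return sum(-0.5 * (math.log(2 * math.pi * s) + (xi - m) ** 2 / s)
               for xi, m, s in zip(x, mean, var))

def similarity_track(frames, train_start, train_end):
    """Score every frame against a model trained on frames[train_start:train_end]."""
    model = fit_diagonal_gaussian(frames[train_start:train_end])
    return [log_likelihood(f, model) for f in frames]
```

    The returned list has one score per frame, so it could be mapped to shades along a time bar in the way the abstract describes.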

    Video enabled tele-presence control host
    13.
    Granted invention patent
    Status: In force

    Publication No.: US07995090B2

    Publication date: 2011-08-09

    Application No.: US10629403

    Filing date: 2003-07-28

    CPC classification number: H04N7/147 G06F19/00 H04N7/15 H04N21/4622

    Abstract: A method for exchanging information in a shared interactive environment, comprising selecting a first physical device in a first live video image wherein the first physical device has information associated with it, causing the information to be transferred to a second physical device in a second live video image wherein the transfer is brought about by manipulating a visual representation of the information, wherein the manipulation includes interacting with the first live video image and the second live video image, wherein the first physical device and the second physical device are part of the shared interactive environment, and wherein the first physical device and the second physical device are not the same.

    Methods and systems for discriminative keyframe selection
    14.
    Granted invention patent
    Status: In force

    Publication No.: US07778469B2

    Publication date: 2010-08-17

    Application No.: US10678935

    Filing date: 2003-10-03

    CPC classification number: G11B27/28 G06K9/00711

    Abstract: Embodiments of the present invention provide a system and method for discriminatively selecting keyframes that are representative of segments of a source digital media and at the same time distinguishable from other keyframes representing other segments of the digital media. The method and system, in one embodiment, include pre-processing the source digital media to obtain feature vectors for frames of the media. A keyframe is then discriminatively selected as a representative for each segment of the source digital media, wherein the discriminative selection includes determining a similarity measure for each candidate keyframe, determining a dissimilarity measure for each candidate keyframe, and selecting the keyframe with the highest goodness value computed from the similarity and dissimilarity measures.
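
    The selection rule above can be sketched as follows. The abstract does not say how the similarity, dissimilarity, or goodness values are computed, so everything specific here is an assumption for illustration: similarity is exp(-Euclidean distance), dissimilarity is one minus similarity, and goodness is the mean in-segment similarity plus the mean dissimilarity to frames of other segments.

```python
import math

def similarity(a, b):
    """Assumed similarity: 1.0 for identical vectors, decaying with distance."""
    return math.exp(-math.dist(a, b))

def select_keyframes(features, segments):
    """Pick one keyframe index per segment.

    features: list of per-frame feature vectors.
    segments: list of lists of frame indices, one list per segment.
    """
    keyframes = []
    for seg in segments:
        # Frames belonging to every other segment.
        others = [i for other in segments if other is not seg for i in other]

        def goodness(c):
            # How well candidate c represents its own segment.
            sim = sum(similarity(features[c], features[i]) for i in seg) / len(seg)
            # How distinguishable candidate c is from the other segments.
            dis = (sum(1.0 - similarity(features[c], features[i]) for i in others)
                   / len(others)) if others else 0.0
            return sim + dis

        keyframes.append(max(seg, key=goodness))
    return keyframes
```

    The dissimilarity term pulls the choice away from frames that resemble other segments, which is the difference from picking the frame nearest the segment average.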

    Automatic generation of multimedia presentation
    16.
    Granted invention patent
    Status: In force

    Publication No.: US07383509B2

    Publication date: 2008-06-03

    Application No.: US10243220

    Filing date: 2002-09-13

    CPC classification number: G09B7/00 G09B5/00 G10L15/26 G10L2021/105

    Abstract: The present invention provides a system and method for automatically combining image and audio data to create a multimedia presentation. In one embodiment, audio and image data are received by the system. The audio data includes a list of events that correspond to points of interest in an audio file. The audio data may also include an audio file or audio stream. The received images are then matched to the audio file or stream using the event times. In one embodiment, the events represent times within the audio file or stream at which there is a certain feature or characteristic in the audio file. The audio events list may be processed to remove, sort, predict or otherwise generate audio events. Image processing may also occur, and may include image analysis to determine image matching to the event list, deleting images, and processing images to incorporate effects. Image effects may include cropping, panning, zooming and other visual effects.
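
    A minimal sketch of matching images to audio events by time is given below. It assumes the simplest possible policy, which the abstract does not prescribe: events are sorted, events outside the audio are dropped, images are assigned in the order received, and each image stays on screen until the next used event or the end of the audio. The function name and tuple layout are hypothetical.

```python
def build_slideshow(event_times, images, audio_duration):
    """Return (image, start_seconds, end_seconds) tuples, one per shown image."""
    # Sort the events and drop any that fall outside the audio.
    times = sorted(t for t in event_times if 0 <= t < audio_duration)
    # Only as many slides as there are both events and images.
    count = min(len(times), len(images))
    slides = []
    for i in range(count):
        # A slide ends at the next used event, or when the audio ends.
        end = times[i + 1] if i + 1 < count else audio_duration
        slides.append((images[i], times[i], end))
    return slides
```

    For example, `build_slideshow([12.0, 0.0, 5.5], ["a.jpg", "b.jpg", "c.jpg"], 20.0)` shows `a.jpg` from 0.0 to 5.5, `b.jpg` from 5.5 to 12.0, and `c.jpg` from 12.0 to 20.0.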

    Video production and compaction with collage picture frame user interface
    18.
    Granted invention patent
    Status: In force

    Publication No.: US07203380B2

    Publication date: 2007-04-10

    Application No.: US09992617

    Filing date: 2001-11-16

    CPC classification number: H04N5/262

    Abstract: A method, system, and apparatus for easily creating a video collage from a video are provided. By segmenting the video into a set number of video segments and providing an interface for a user to select images which represent the video segments and insert the selected images into a video collage template, a video collage may be easily created in a short amount of time. The system is designed to assign values to the video inserted in a video collage and compact the video based on these values, thereby creating a small file which may be easily stored or transmitted.
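
    The abstract does not say how the assigned values drive compaction. Purely as an illustration of the idea, the sketch below assumes each collage segment carries a positive value and keeps a share of each segment's duration proportional to its value relative to the highest value. The function name and the proportional rule are hypothetical.

```python
def compact_durations(segments):
    """segments: list of (duration_seconds, value) pairs with positive values.
    Returns the number of seconds kept from each segment."""
    top = max(value for _, value in segments)
    # The highest-valued segment is kept whole; others shrink in proportion.
    return [duration * value / top for duration, value in segments]
```

    Under this assumed rule, segments valued 4, 2 and 1 would keep all, half and a quarter of their original durations respectively.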

    Systems and methods for embedding data by dimensional compression and expansion

    Publication No.: US06999598B2

    Publication date: 2006-02-14

    Application No.: US10104017

    Filing date: 2002-03-25

    CPC classification number: G06T1/0085 G06T1/0057

    Abstract: The systems and methods of this invention watermark an original data file using dimensional compression and expansion. The original data file extends along a given dimension and has portions that extend along that given dimension. The information is embedded into the data file by selectively dimensionally compressing or expanding a size of each of some or all of the portions along the given dimension, which can be space or time. The portions of the data file are selectively dimensionally expanded or compressed according to a given encoding scheme. This encoding scheme can use the kind of modification, the relationships between the type of modification between adjacent portions, or the duration or degree of compression or expansion to store a portion of the embedded information. The portions of the embedded information can be individual bits of binary or trinary information, or can be a portion of analog information.
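
    One of the encoding schemes mentioned above, where the kind of modification stores a bit, can be sketched as follows for a one-dimensional signal. This is an assumed, simplified instance: fixed-length portions, linear interpolation for resizing, expansion for a 1 bit and compression for a 0 bit. The decoder here is handed the modified portion lengths directly; recovering those lengths from the watermarked signal is outside the sketch. All names and parameter values are hypothetical, and the signal is assumed to hold at least `len(bits) * portion_len` samples.

```python
def resample(portion, new_len):
    """Stretch or shrink a portion to new_len samples by linear interpolation."""
    if new_len == 1:
        return [portion[0]]
    scale = (len(portion) - 1) / (new_len - 1)
    out = []
    for k in range(new_len):
        pos = k * scale
        lo = int(pos)
        hi = min(lo + 1, len(portion) - 1)
        frac = pos - lo
        out.append(portion[lo] * (1 - frac) + portion[hi] * frac)
    return out

def embed_bits(samples, bits, portion_len=10, delta=2):
    """Expand a portion for bit 1, compress it for bit 0.
    Returns the modified signal and the new length of each portion."""
    out, lengths = [], []
    for i, bit in enumerate(bits):
        portion = samples[i * portion_len:(i + 1) * portion_len]
        new_len = portion_len + delta if bit else portion_len - delta
        out.extend(resample(portion, new_len))
        lengths.append(new_len)
    # Samples beyond the last encoded portion are copied unchanged.
    out.extend(samples[len(bits) * portion_len:])
    return out, lengths

def decode_bits(lengths, portion_len=10):
    """A portion longer than nominal reads as 1, shorter as 0."""
    return [1 if n > portion_len else 0 for n in lengths]
```

    The abstract also mentions encoding by the relationship between adjacent portions or by the degree of compression or expansion; those variants would change only how `new_len` is chosen and read back.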
