Extraction of high-level features from low-level features of multimedia content
    1.
    发明授权
    Extraction of high-level features from low-level features of multimedia content 有权
    从多媒体内容的低级功能中提取高级功能

    公开(公告)号:US06763069B1

    公开(公告)日:2004-07-13

    申请号:US09610763

    申请日:2000-07-06

    IPC分类号: H04N712

    CPC分类号: G06K9/00711

    摘要: A method extracts high-level features from a video including a sequence of frames. Low-level features are extracted from each frame of the video. Each frame of the video is labeled according to the extracted low-level features to generate sequences of labels. Each sequence of labels is associated with one of the extracted low-level feature. The sequences of labels are analyzed using learning machine learning techniques to extract high-level features of the video.

    摘要翻译: 一种方法从包括帧序列的视频中提取高级特征。 从视频的每个帧中提取低级功能。 视频的每个帧根据提取的低级特征进行标记,以生成标签序列。 标签的每个序列与提取的低级特征之一相关联。 使用学习机器学习技术来分析标签序列以提取视频的高级特征。

    Method for representing and comparing multimedia content according to rank
    3.
    发明授权
    Method for representing and comparing multimedia content according to rank 失效
    根据等级表示和比较多媒体内容的方法

    公开(公告)号:US07383504B1

    公开(公告)日:2008-06-03

    申请号:US09518937

    申请日:2000-03-06

    IPC分类号: G06T11/20 G06K9/45 G06F3/00

    CPC分类号: G06K9/00711 G06K9/6878

    摘要: A method for generating a representation of multimedia content by first segmenting the multimedia content spatially and temporally to extract objects. Feature extraction is applied to the objects to produce semantic and syntactic attributes, relations, and a containment set of content entities. The content entities are coded to produce directed acyclic graphs of the content entities, where each directed acyclic graph represents a particular interpretation of the multimedia content. Attributes of each content entity are measured and the measured attributes are assigned to each corresponding content entity in the directed acyclic graphs to rank order the multimedia content.

    摘要翻译: 一种用于通过首先在空间和时间上分割多媒体内容以提取对象来生成多媒体内容的表示的方法。 特征提取被应用于对象以产生语义和句法属性,关系以及内容实体的包含集合。 内容实体被编码以产生内容实体的有向非循环图,其中每个有向无环图表示多媒体内容的特定解释。 测量每个内容实体的属性,并将测量的属性分配给有向非循环图中的每个对应的内容实体以对多媒体内容排序。

    Networked surveillance and control system
    4.
    发明授权
    Networked surveillance and control system 有权
    网络监控系统

    公开(公告)号:US06646676B1

    公开(公告)日:2003-11-11

    申请号:US09612876

    申请日:2000-07-10

    IPC分类号: H04N718

    摘要: A surveillance and control system includes a feature extraction unit to dynamically extract low-level features from a compressed digital video signal, a description encoder, coupled to the feature extraction unit, to encode the low-level features as content descriptors. An event detector is coupled to the description encoder to detect security events from the content descriptors, and a control signal processor, coupled to the event detector, to generate control signals in response to detecting the security events.

    摘要翻译: 监视和控制系统包括特征提取单元,其从压缩数字视频信号动态地提取低级特征,耦合到特征提取单元的描述编码器,以将低级特征编码为内容描述符。 事件检测器耦合到描述编码器以检测来自内容描述符的安全事件,以及耦合到事件检测器的控制信号处理器,以响应于检测到安全事件而产生控制信号。

    Adaptable compressed bitstream transcoder
    5.
    发明授权
    Adaptable compressed bitstream transcoder 失效
    适应压缩比特流代码转换器

    公开(公告)号:US06542546B1

    公开(公告)日:2003-04-01

    申请号:US09496706

    申请日:2000-02-02

    IPC分类号: H04N726

    摘要: A multi-media delivery system for delivering a compressed bitstream through a network to a user device includes a transcoder and a manager. The transcoder is configured to operate on the bit stream using in any one of a plurality of conversion modes. The manager is configured to selecting a particular one of the plurality of conversion modes dependent on semantic content of the bitstream and network characteristics. The system also includes a content classifier to determine the content characteristics, and a model predicator to determine the network characteristics, and user device characteristics. An integrator of the manager generates an optimal rate-quality function to be used for selecting the particular conversion model for a given available bit rate of the network.

    摘要翻译: 用于通过网络将压缩比特流传送到用户设备的多媒体传送系统包括代码转换器和管理器。 代码转换器被配置为使用多种转换模式中的任何一种对比特流进行操作。 管理器被配置为根据比特流的语义内容和网络特性来选择多个转换模式中的特定一个。 该系统还包括内容分类器,用于确定内容特征,以及一个模型预测器来确定网络特性,以及用户设备特性。 管理者的积分器产生最佳速率质量函数,用于为网络的给定可用比特率选择特定的转换模型。

    Method for representing and comparing multimedia content
    7.
    发明授权
    Method for representing and comparing multimedia content 失效
    用于表示和比较多媒体内容的方法

    公开(公告)号:US06546135B1

    公开(公告)日:2003-04-08

    申请号:US09385169

    申请日:1999-08-30

    IPC分类号: G06K946

    摘要: A method for generating a representation of multimedia content by first segmenting the multimedia content spatially and temporally to extract objects. Feature extraction is applied to the objects to produce semantic and syntactic attributes, relations, and a containment set of content entities. The content entities are coded to produce directed acyclic graphs of the content entities, where each directed acyclic graph represents a particular interpretation of the multimedia content.

    摘要翻译: 一种用于通过首先在空间和时间上分割多媒体内容以提取对象来生成多媒体内容的表示的方法。 特征提取被应用于对象以产生语义和句法属性,关系以及内容实体的包含集合。 内容实体被编码以产生内容实体的有向非循环图,其中每个有向无环图表示多媒体内容的特定解释。

    Method and System for Decoding Multiview Videos with Prediction Dependencies
    8.
    发明申请
    Method and System for Decoding Multiview Videos with Prediction Dependencies 有权
    用预测依赖关系解码多视点视频的方法和系统

    公开(公告)号:US20100322311A1

    公开(公告)日:2010-12-23

    申请号:US12871249

    申请日:2010-08-30

    IPC分类号: H04N7/32

    摘要: Multiview videos are acquired of a scene with corresponding cameras arranged at poses, such that there is view overlap between any pair of cameras. V-frames are generated from the multiview videos. The V-frames are encoded using only spatial prediction. Then, the V-frames are inserted periodically in an encoded bit stream to provide random temporal access to the multiview videos. Additional view dependency information enables the decoding of a reduced number of frames prior to accessing randomly a target frame for a specified view and time, and decoding the target frame. The method also decodes multiview videos by maintaining a reference picture list for a current frame of a plurality of multiview videos, and predicting each current frame of the plurality of multiview videos according to reference pictures indexed by the associated reference picture list.

    摘要翻译: 多视点视频被采集到具有以姿势布置的相应摄像机的场景,使得在任何一对摄像机之间存在视图重叠。 V帧是从多视角视频生成的。 V帧仅使用空间预测编码。 然后,将V帧周期性地插入到编码比特流中,以便为多视点视频提供随机的时间访问。 附加的视图依赖性信息使得能够在对指定的视图和时间的随机访问目标帧之前对减少数量的帧进行解码,以及对目标帧进行解码。 该方法还通过维护多个多视角视频的当前帧的参考图片列表,并根据由相关参考图片列表索引的参考图片来预测多个多视角视频中的每个当前帧来对多视点视频进行解码。