Image retrieval system and image retrieval method
    32.
    发明申请
    Image retrieval system and image retrieval method 有权
    图像检索系统和图​​像检索方法

    公开(公告)号:US20010004739A1

    公开(公告)日:2001-06-21

    申请号:US09773570

    申请日:2001-02-02

    IPC分类号: G06F017/30

    摘要: When a retrieval condition of an attribute list is input from a user interface unit to a retrieval processing unit, the attribute list stored in an attribute list storing unit is retrieved in the retrieval processing unit. Thereafter, attribute information conforming to the retrieval condition is output to and displayed on a displaying unit. Thereafter, when a retrieval condition of the similarity retrieval is input from the user interface unit to the retrieval processing unit, image data stored in the image information storing unit is retrieved in the retrieval processing unit, and specific image data relating to a characteristic descriptor set conforming to the retrieval condition is selected in the retrieval processing unit. Thereafter, the specific image data is output to and displayed on the displaying unit.

    摘要翻译: 当从用户接口单元向检索处理单元输入属性列表的检索条件时,在检索处理单元中检索存储在属性列表存储单元中的属性列表。 此后,将符合检索条件的属性信息输出并显示在显示单元上。 此后,当从用户接口单元向检索处理单元输入相似性检索的检索条件时,在检索处理单元中检索存储在图像信息存储单元中的图像数据,并且与特征描述符集合有关的特定图像数据 在检索处理单元中选择符合检索条件的信息。 此后,将特定图像数据输出并显示在显示单元上。

    System and method for extracting text captions from video and generating video summaries
    33.
    发明授权
    System and method for extracting text captions from video and generating video summaries 有权
    从视频中提取文字字幕并生成视频摘要的系统和方法

    公开(公告)号:US08488682B2

    公开(公告)日:2013-07-16

    申请号:US11960424

    申请日:2007-12-19

    摘要: Caption boxes which are embedded in video content can be located and the text within the caption boxes decoded. Real time processing is enhanced by locating caption box regions in the compressed video domain and performing pixel based processing operations within the region of the video frame in which a caption box is located. The captions boxes are further refined by identifying word regions within the caption boxes and then applying character and word recognition processing to the identified word regions. Domain based models are used to improve text recognition results. The extracted caption box text can be used to detect events of interest in the video content and a semantic model applied to extract a segment of video of the event of interest.

    摘要翻译: 嵌入在视频内容中的字幕框可以被定位,并且标题框内的文本被解码。 通过将标题框区域定位在压缩视频域中并且在字幕框所在的视频帧的区域内执行基于像素的处理操作来增强实时处理。 通过识别字幕框内的字区域,然后对识别的字区域应用字符和字识别处理,进一步细化字幕框。 基于域的模型用于改进文本识别结果。 提取的字幕框文本可用于检测视频内容中感兴趣的事件和应用于提取感兴趣事件的视频片段的语义模型。

    SEMANTIC EVENT DETECTION USING CROSS-DOMAIN KNOWLEDGE
    34.
    发明申请
    SEMANTIC EVENT DETECTION USING CROSS-DOMAIN KNOWLEDGE 有权
    使用跨域知识的语义事件检测

    公开(公告)号:US20090299999A1

    公开(公告)日:2009-12-03

    申请号:US12408140

    申请日:2009-03-20

    IPC分类号: G06F17/30

    摘要: A method for facilitating semantic event classification of a group of image records related to an event. The method using an event detector system for providing: extracting a plurality of visual features from each of the image records; wherein the visual features include segmenting an image record into a number of regions, in which the visual features are extracted; generating a plurality of concept scores for each of the image records using the visual features, wherein each concept score corresponds to a visual concept and each concept score is indicative of a probability that the image record includes the visual concept; generating a feature vector corresponding to the event based on the concept scores of the image records; and supplying the feature vector to an event classifier that identifies at least one semantic event classifier that corresponds to the event.

    摘要翻译: 一种用于促进与事件相关的一组图像记录的语义事件分类的方法。 该方法使用事件检测器系统来提供:从每个图像记录提取多个视觉特征; 其中所述视觉特征包括将图像记录分割成其中提取所述视觉特征的多个区域; 使用所述视觉特征为每个所述图像记录生成多个概念分数,其中每个概念分数对应于视觉概念,并且每个概念分数指示所述图像记录包括所述视觉概念的概率; 基于所述图像记录的概念分数生成与所述事件相对应的特征向量; 以及将特征向量提供给识别与该事件相对应的至少一个语义事件分类器的事件分类器。

    SEMANTIC EVENT DETECTION FOR DIGITAL CONTENT RECORDS
    35.
    发明申请
    SEMANTIC EVENT DETECTION FOR DIGITAL CONTENT RECORDS 有权
    用于数字内容记录的语义事件检测

    公开(公告)号:US20090297032A1

    公开(公告)日:2009-12-03

    申请号:US12331927

    申请日:2008-12-10

    IPC分类号: G06K9/46

    摘要: A system and method for semantic event detection in digital image content records is provided in which an event-level “Bag-of-Features” (BOF) representation is used to model events, and generic semantic events are detected in a concept space instead of an original low-level visual feature space based on the BOF representation.

    摘要翻译: 提供了一种用于数字图像内容记录中的语义事件检测的系统和方法,其中使用事件级“Bag-of-Features”(BOF)表示来建模事件,并且在概念空间中检测通用语义事件,而不是 基于BOF表示的原始低级视觉特征空间。

    Video mining using unsupervised clustering of video content
    36.
    发明授权
    Video mining using unsupervised clustering of video content 失效
    使用无监督的视频内容聚类进行视频挖掘

    公开(公告)号:US07375731B2

    公开(公告)日:2008-05-20

    申请号:US10285831

    申请日:2002-11-01

    摘要: A method mines unknown content of a video by first selecting one or more low-level features of the video. For each selected feature, or combination of features, time series data is generated. The time series data is then self-correlated to identify similar segments of the video according to the low-level features. The similar segments are grouped into clusters to discover high-level patterns in the unknown content of video.

    摘要翻译: 一种通过首先选择视频的一个或多个低级特征来挖掘视频的未知内容的方法。 对于每个选定的特征或特征的组合,生成时间序列数据。 然后,时间序列数据被自相关,以根据低级特征来识别视频的类似片段。 类似的段被分组成群集以发现视频的未知内容中的高级模式。

    Method of retrieving video picture and apparatus therefor
    37.
    发明授权
    Method of retrieving video picture and apparatus therefor 失效
    检索影像的方法及其设备

    公开(公告)号:US07020192B1

    公开(公告)日:2006-03-28

    申请号:US09363881

    申请日:1999-07-30

    IPC分类号: H04B1/66

    CPC分类号: G06F17/30805 G06F17/30808

    摘要: An apparatus for retrieving a video picture includes a decoder section for decoding a coded bit stream of video picture data representing an arbitrary shape object and including shape information and texture information, a retrieval condition input section for inputting a retrieval condition for retrieval of a desired picture, a retrieval section for retrieving a picture meeting the retrieval condition by using shape information of the object decoded by the decoder section, and a display section for outputting the retrieved result obtained by the retrieval section.

    摘要翻译: 一种用于检索视频图像的装置包括:解码器部分,用于对表示任意形状对象的视频图像数据的编码比特流进行解码,并包括形状信息和纹理信息;检索条件输入部分,用于输入用于检索所需图像的检索条件 检索部分,用于通过使用由解码器部分解码的对象的形状信息来检索符合检索条件的图像;以及显示部分,用于输出由检索部分获得的检索结果。

    Scalably presenting a collection of media objects
    39.
    发明申请
    Scalably presenting a collection of media objects 有权
    可扩展地呈现媒体对象的集合

    公开(公告)号:US20040128308A1

    公开(公告)日:2004-07-01

    申请号:US10334769

    申请日:2002-12-31

    发明人: Pere Obrador

    IPC分类号: G06F007/00

    摘要: Systems and methods of presenting media objects are described. In one aspect, a group of media objects is selected from the collection based upon media object relevance to one or more data structures of a selected media file of indexed, temporally-ordered data structures. One or more of the selected media file and the media objects of the selected group are transmitted to a client for contemporaneous presentation at a selected summarization level. In another aspect, media objects in the collection are grouped into multiple clusters based upon one or more media object relevance criteria. The media object clusters are arranged into a hierarchy of two or more levels. A selected cluster is transmitted to a client for contemporaneous presentation at a selected summarization level.

    摘要翻译: 描述介绍媒体对象的系统和方法。 在一个方面,基于媒体对象与索引的,时间有序的数据结构的所选媒体文件的一个或多个数据结构的相关性,从集合中选择一组媒体对象。 所选择的组的所选择的媒体文件和媒体对象中的一个或多个被发送到客户端以在选择的摘要级别进行同时呈现。 在另一方面,基于一个或多个媒体对象相关性标准将集合中的媒体对象分组成多个集群。 媒体对象集群被布置成两个或多个层次的层次结构。 所选择的集群被发送到客户端以在选定的摘要级别进行同时呈现。

    Algorithms and system for object-oriented content-based video search
    40.
    发明授权
    Algorithms and system for object-oriented content-based video search 有权
    面向对象内容视频搜索的算法和系统

    公开(公告)号:US06741655B1

    公开(公告)日:2004-05-25

    申请号:US09423409

    申请日:2000-02-22

    IPC分类号: H04N718

    摘要: Object-oriented methods and systems for permitting a user to locate one or more video objects from one or more video clips over an interactive network are disclosed. The system includes one or more server computers (110) comprising storage (111) for video clips and databases of video object attributes, a communications network (120), and a client computer (130). The client computer contains a query interface to specify video object attribute information, including motion trajectory information (134), a browser interface to browse through stored video object attributes within the server computers, and an interactive video player.

    摘要翻译: 公开了一种用于允许用户通过交互网络从一个或多个视频剪辑定位一个或多个视频对象的面向对象的方法和系统。 该系统包括一个或多个服务器计算机(110),其包括用于视频剪辑的存储(111)和视频对象属性的数据库,通信网络(120)和客户端计算机(130)。 客户端计算机包含用于指定视频对象属性信息的查询界面,包括运动轨迹信息(134),浏览服务器计算机内存储的视频对象属性的浏览器界面和交互式视频播放器。