专利检索 cpc:"G06F17/30808" 第 4 页

31.

发明申请
Feature identification of events in multimedia 失效
标题翻译：多媒体事件的特征识别

公开(公告)号：US20050251532A1

公开(公告)日：2005-11-10

申请号：US10922781

申请日：2004-08-20

申请人： Regunathan Radhakrishnan , Isao Otsuka , Ajay Divakaran

发明人： Regunathan Radhakrishnan , Isao Otsuka , Ajay Divakaran

IPC分类号： H04N5/91 , G06F7/00 , G06F17/30 , G06K9/00 , G06K9/34 , G06K9/54 , G06K9/62 , G06T7/20 , G10L11/00 , G10L15/10 , G10L15/14

CPC分类号： G06K9/00711 , G06F17/30787 , G06F17/30802 , G06F17/30808 , G06F17/30811 , G06F17/30843 , Y10S707/99942 , Y10S707/99943 , Y10S707/99945 , Y10S707/99948

摘要： A method detects events in multimedia. Features are extracted from the multimedia. The features are sampled using a sliding window to obtain samples. A context model is constructed for each sample. The context models form a time series. An affinity matrix is determined from the time series models and a commutative distance metric between each pair of context models. A second generalized eigenvector is determined for the affinity matrix, and the samples are then clustered into events according to the second generalized eigenvector.

摘要翻译： 一种方法来检测多媒体中的事件。功能从多媒体提取。使用滑动窗口对特征进行采样以获得样品。为每个样本构建上下文模型。上下文模型形成时间序列。从时间序列模型和每对上下文模型之间的交换距离度量确定亲和度矩阵。对于亲和度矩阵确定第二广义特征向量，然后根据第二广义特征向量将样本聚类成事件。

32.

发明申请
Image retrieval system and image retrieval method 有权
标题翻译：图像检索系统和图像检索方法

公开(公告)号：US20010004739A1

公开(公告)日：2001-06-21

申请号：US09773570

申请日：2001-02-02

发明人： Shunichi Sekiguchi , Yoshimi Isu , Hirofumi Nishikawa , Yoshihisa Yamada , Kohtaro Asai

IPC分类号： G06F017/30

CPC分类号： G06F17/30247 , G06F17/30256 , G06F17/30799 , G06F17/30802 , G06F17/30808 , G06F17/30811 , G06F17/30817 , G06F17/30858 , G06K9/00744 , H04N9/8042 , H04N9/8047 , H04N9/8233 , Y10S707/99933 , Y10S707/99945

摘要： When a retrieval condition of an attribute list is input from a user interface unit to a retrieval processing unit, the attribute list stored in an attribute list storing unit is retrieved in the retrieval processing unit. Thereafter, attribute information conforming to the retrieval condition is output to and displayed on a displaying unit. Thereafter, when a retrieval condition of the similarity retrieval is input from the user interface unit to the retrieval processing unit, image data stored in the image information storing unit is retrieved in the retrieval processing unit, and specific image data relating to a characteristic descriptor set conforming to the retrieval condition is selected in the retrieval processing unit. Thereafter, the specific image data is output to and displayed on the displaying unit.

摘要翻译： 当从用户接口单元向检索处理单元输入属性列表的检索条件时，在检索处理单元中检索存储在属性列表存储单元中的属性列表。此后，将符合检索条件的属性信息输出并显示在显示单元上。此后，当从用户接口单元向检索处理单元输入相似性检索的检索条件时，在检索处理单元中检索存储在图像信息存储单元中的图像数据，并且与特征描述符集合有关的特定图像数据在检索处理单元中选择符合检索条件的信息。此后，将特定图像数据输出并显示在显示单元上。

33.

发明授权
System and method for extracting text captions from video and generating video summaries 有权
标题翻译：从视频中提取文字字幕并生成视频摘要的系统和方法

公开(公告)号：US08488682B2

公开(公告)日：2013-07-16

申请号：US11960424

申请日：2007-12-19

申请人： Shih-Fu Chang , Dongqing Zhang

发明人： Shih-Fu Chang , Dongqing Zhang

IPC分类号： H04N7/12 , H04N11/02 , H04N11/04 , H04B1/66

CPC分类号： H04N7/0882 , G06F17/30796 , G06F17/30808 , G06F17/30811 , G06F17/30843 , G06K9/3266 , G06K2209/01 , G06T7/75 , G11B27/031 , G11B27/034 , G11B27/28 , H04N5/93 , H04N7/025 , H04N21/434 , H04N21/4884

摘要： Caption boxes which are embedded in video content can be located and the text within the caption boxes decoded. Real time processing is enhanced by locating caption box regions in the compressed video domain and performing pixel based processing operations within the region of the video frame in which a caption box is located. The captions boxes are further refined by identifying word regions within the caption boxes and then applying character and word recognition processing to the identified word regions. Domain based models are used to improve text recognition results. The extracted caption box text can be used to detect events of interest in the video content and a semantic model applied to extract a segment of video of the event of interest.

摘要翻译： 嵌入在视频内容中的字幕框可以被定位，并且标题框内的文本被解码。通过将标题框区域定位在压缩视频域中并且在字幕框所在的视频帧的区域内执行基于像素的处理操作来增强实时处理。通过识别字幕框内的字区域，然后对识别的字区域应用字符和字识别处理，进一步细化字幕框。基于域的模型用于改进文本识别结果。提取的字幕框文本可用于检测视频内容中感兴趣的事件和应用于提取感兴趣事件的视频片段的语义模型。

34.

发明申请
SEMANTIC EVENT DETECTION USING CROSS-DOMAIN KNOWLEDGE 有权
标题翻译：使用跨域知识的语义事件检测

公开(公告)号：US20090299999A1

公开(公告)日：2009-12-03

申请号：US12408140

申请日：2009-03-20

申请人： Alexander C. Loui , Wei Jiang

发明人： Alexander C. Loui , Wei Jiang

IPC分类号： G06F17/30

CPC分类号： G06F17/30256 , G06F17/30802 , G06F17/30805 , G06F17/30808 , G06K9/00664 , G06K9/00711

摘要： A method for facilitating semantic event classification of a group of image records related to an event. The method using an event detector system for providing: extracting a plurality of visual features from each of the image records; wherein the visual features include segmenting an image record into a number of regions, in which the visual features are extracted; generating a plurality of concept scores for each of the image records using the visual features, wherein each concept score corresponds to a visual concept and each concept score is indicative of a probability that the image record includes the visual concept; generating a feature vector corresponding to the event based on the concept scores of the image records; and supplying the feature vector to an event classifier that identifies at least one semantic event classifier that corresponds to the event.

摘要翻译： 一种用于促进与事件相关的一组图像记录的语义事件分类的方法。该方法使用事件检测器系统来提供：从每个图像记录提取多个视觉特征; 其中所述视觉特征包括将图像记录分割成其中提取所述视觉特征的多个区域; 使用所述视觉特征为每个所述图像记录生成多个概念分数，其中每个概念分数对应于视觉概念，并且每个概念分数指示所述图像记录包括所述视觉概念的概率; 基于所述图像记录的概念分数生成与所述事件相对应的特征向量; 以及将特征向量提供给识别与该事件相对应的至少一个语义事件分类器的事件分类器。

35.

发明申请
SEMANTIC EVENT DETECTION FOR DIGITAL CONTENT RECORDS 有权
标题翻译：用于数字内容记录的语义事件检测

公开(公告)号：US20090297032A1

公开(公告)日：2009-12-03

申请号：US12331927

申请日：2008-12-10

申请人： Alexander C. LOUI , Jiang WEI

发明人： Alexander C. LOUI , Jiang WEI

IPC分类号： G06K9/46

CPC分类号： G06K9/00664 , G06F17/30256 , G06F17/30802 , G06F17/30805 , G06F17/30808 , G06K9/00711

摘要： A system and method for semantic event detection in digital image content records is provided in which an event-level “Bag-of-Features” (BOF) representation is used to model events, and generic semantic events are detected in a concept space instead of an original low-level visual feature space based on the BOF representation.

摘要翻译： 提供了一种用于数字图像内容记录中的语义事件检测的系统和方法，其中使用事件级“Bag-of-Features”（BOF）表示来建模事件，并且在概念空间中检测通用语义事件，而不是基于BOF表示的原始低级视觉特征空间。

36.

发明授权
Video mining using unsupervised clustering of video content 失效
标题翻译：使用无监督的视频内容聚类进行视频挖掘

公开(公告)号：US07375731B2

公开(公告)日：2008-05-20

申请号：US10285831

申请日：2002-11-01

申请人： Ajay Divakaran , Kadir A. Peker

发明人： Ajay Divakaran , Kadir A. Peker

IPC分类号： G09G5/00 , H04N9/475 , H04N7/12 , H04N7/00 , H04N11/02

CPC分类号： G06K9/00751 , G06F17/30787 , G06F17/30802 , G06F17/30808 , G06F17/30811 , G06F17/30843

摘要： A method mines unknown content of a video by first selecting one or more low-level features of the video. For each selected feature, or combination of features, time series data is generated. The time series data is then self-correlated to identify similar segments of the video according to the low-level features. The similar segments are grouped into clusters to discover high-level patterns in the unknown content of video.

摘要翻译： 一种通过首先选择视频的一个或多个低级特征来挖掘视频的未知内容的方法。对于每个选定的特征或特征的组合，生成时间序列数据。然后，时间序列数据被自相关，以根据低级特征来识别视频的类似片段。类似的段被分组成群集以发现视频的未知内容中的高级模式。

37.

发明授权
Method of retrieving video picture and apparatus therefor 失效
标题翻译：检索影像的方法及其设备

公开(公告)号：US07020192B1

公开(公告)日：2006-03-28

申请号：US09363881

申请日：1999-07-30

申请人： Noboru Yamaguchi , Toshiaki Watanabe , Takashi Ida , Yoko Sambonsugi , Osamu Hori , Toshimitsu Kaneko

发明人： Noboru Yamaguchi , Toshiaki Watanabe , Takashi Ida , Yoko Sambonsugi , Osamu Hori , Toshimitsu Kaneko

IPC分类号： H04B1/66

CPC分类号： G06F17/30805 , G06F17/30808

摘要： An apparatus for retrieving a video picture includes a decoder section for decoding a coded bit stream of video picture data representing an arbitrary shape object and including shape information and texture information, a retrieval condition input section for inputting a retrieval condition for retrieval of a desired picture, a retrieval section for retrieving a picture meeting the retrieval condition by using shape information of the object decoded by the decoder section, and a display section for outputting the retrieved result obtained by the retrieval section.

摘要翻译： 一种用于检索视频图像的装置包括：解码器部分，用于对表示任意形状对象的视频图像数据的编码比特流进行解码，并包括形状信息和纹理信息;检索条件输入部分，用于输入用于检索所需图像的检索条件检索部分，用于通过使用由解码器部分解码的对象的形状信息来检索符合检索条件的图像;以及显示部分，用于输出由检索部分获得的检索结果。

38.

发明授权
Method and apparatus for efficiently representing storing and accessing video information 失效
标题翻译：用于有效地表示存储和访问视频信息的方法和装置

公开(公告)号：US06956573B1

公开(公告)日：2005-10-18

申请号：US08970889

申请日：1997-11-14

申请人： James Russell Bergen , Curtis R. Carlson , Rakesh Kumar , Harpreet S. Sawhney

发明人： James Russell Bergen , Curtis R. Carlson , Rakesh Kumar , Harpreet S. Sawhney

IPC分类号： G06F17/30 , G06T15/00 , G11B27/10 , G11B27/28 , G11B27/34 , H04N7/173

CPC分类号： G06F17/30802 , G06F17/30808 , G06F17/30817 , G06F17/30843 , G11B27/105 , G11B27/28 , G11B27/34 , H04N5/91 , H04N7/17318 , H04N9/8205 , H04N9/8227 , H04N21/44008 , H04N21/440227 , H04N21/47202 , H04N21/4828 , H04N21/8153 , H04N21/8456

摘要： A method and concomitant apparatus for comprehensively representing video information in a manner facilitating indexing of the video information. Specifically, a method according to the inveniton comprises the steps of dividing a continuous video stream into a plurality of video scenes; and at least one of the steps of dividing, using intra-scene motion analysis, at least one of the plurality of scenes into one or more layers; representing, as a mosaic, at least one of the pluraliy of scenes; computing, for at least one layer or scene, one or more content-related appearance attributes; and storing, in a database, the content-related appearance attributes or said mosaic representations.

摘要翻译： 一种用于以便于索引视频信息的方式全面地表示视频信息的方法和伴随装置。具体地，根据本发明的方法包括将连续视频流分割成多个视频场景的步骤; 以及将所述多个场景中的至少一个划分为一个或多个层的步骤中的至少一个步骤; 作为马赛克代表多个场景中的至少一个; 对于至少一个层或场景，计算一个或多个内容相关的外观属性; 以及在数据库中存储与内容相关的外观属性或所述马赛克表示。

39.

发明申请
Scalably presenting a collection of media objects 有权
标题翻译：可扩展地呈现媒体对象的集合

公开(公告)号：US20040128308A1

公开(公告)日：2004-07-01

申请号：US10334769

申请日：2002-12-31

发明人： Pere Obrador

IPC分类号： G06F007/00

CPC分类号： G06F17/30787 , G06F17/30056 , G06F17/30802 , G06F17/30808 , G06F17/30817 , G06F17/30849 , G11B27/031 , G11B27/11

摘要： Systems and methods of presenting media objects are described. In one aspect, a group of media objects is selected from the collection based upon media object relevance to one or more data structures of a selected media file of indexed, temporally-ordered data structures. One or more of the selected media file and the media objects of the selected group are transmitted to a client for contemporaneous presentation at a selected summarization level. In another aspect, media objects in the collection are grouped into multiple clusters based upon one or more media object relevance criteria. The media object clusters are arranged into a hierarchy of two or more levels. A selected cluster is transmitted to a client for contemporaneous presentation at a selected summarization level.

摘要翻译： 描述介绍媒体对象的系统和方法。在一个方面，基于媒体对象与索引的，时间有序的数据结构的所选媒体文件的一个或多个数据结构的相关性，从集合中选择一组媒体对象。所选择的组的所选择的媒体文件和媒体对象中的一个或多个被发送到客户端以在选择的摘要级别进行同时呈现。在另一方面，基于一个或多个媒体对象相关性标准将集合中的媒体对象分组成多个集群。媒体对象集群被布置成两个或多个层次的层次结构。所选择的集群被发送到客户端以在选定的摘要级别进行同时呈现。

40.

发明授权
Algorithms and system for object-oriented content-based video search 有权
标题翻译：面向对象内容视频搜索的算法和系统

公开(公告)号：US06741655B1

公开(公告)日：2004-05-25

申请号：US09423409

申请日：2000-02-22

申请人： Shih-Fu Chang , William Chen , Horace J. Meng , Hari Sundaram , Di Zhong

发明人： Shih-Fu Chang , William Chen , Horace J. Meng , Hari Sundaram , Di Zhong

IPC分类号： H04N718

CPC分类号： G06F17/30802 , G06F17/30247 , G06F17/30805 , G06F17/30808 , G06F17/30811 , G06F17/30831

摘要： Object-oriented methods and systems for permitting a user to locate one or more video objects from one or more video clips over an interactive network are disclosed. The system includes one or more server computers (110) comprising storage (111) for video clips and databases of video object attributes, a communications network (120), and a client computer (130). The client computer contains a query interface to specify video object attribute information, including motion trajectory information (134), a browser interface to browse through stored video object attributes within the server computers, and an interactive video player.

摘要翻译： 公开了一种用于允许用户通过交互网络从一个或多个视频剪辑定位一个或多个视频对象的面向对象的方法和系统。该系统包括一个或多个服务器计算机（110），其包括用于视频剪辑的存储（111）和视频对象属性的数据库，通信网络（120）和客户端计算机（130）。客户端计算机包含用于指定视频对象属性信息的查询界面，包括运动轨迹信息（134），浏览服务器计算机内存储的视频对象属性的浏览器界面和交互式视频播放器。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类