System and method for extracting text captions from video and generating video summaries
    21.
    Granted patent
    System and method for extracting text captions from video and generating video summaries (in force)

    Publication No.: US07339992B2

    Publication date: 2008-03-04

    Application No.: US10494739

    Filing date: 2002-12-06

    Abstract: Caption boxes which are embedded in video content can be located and the text within the caption boxes decoded. Real-time processing is enhanced by locating caption box regions in the compressed video domain (210) and performing pixel-based processing operations within the region of the video frame in which a caption box is located. The caption boxes are further refined by identifying word regions (240) within the caption boxes and then applying character and word recognition processing (250) to the identified word regions. Domain-based models are used to improve text recognition results. The extracted caption box text can be used to detect events of interest in the video content, and a semantic model can be applied to extract a segment of video of the event of interest.

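The abstract does not say how the domain-based models improve recognition results. As a minimal sketch, assuming the domain model is nothing more than a vocabulary of words expected in the captions, noisy recognizer output can be snapped to the closest vocabulary entry. The function name `correct_with_domain_vocabulary`, the sports vocabulary, and the 0.6 similarity cutoff are illustrative assumptions and are not taken from the patent.

```python
from difflib import get_close_matches
from typing import Iterable, List


def correct_with_domain_vocabulary(
    words: Iterable[str], vocabulary: Iterable[str], cutoff: float = 0.6
) -> List[str]:
    """Replace each recognized word with its closest vocabulary entry, if any.

    Hypothetical stand-in for a domain-based model: a word is replaced only
    when some vocabulary entry is at least `cutoff` similar to it; otherwise
    the raw recognizer output is kept unchanged.
    """
    vocab = list(vocabulary)
    corrected = []
    for word in words:
        matches = get_close_matches(word, vocab, n=1, cutoff=cutoff)
        corrected.append(matches[0] if matches else word)
    return corrected


if __name__ == "__main__":
    # Assumed example: caption words from a sports broadcast with OCR errors.
    sports_vocabulary = ["BALL", "STRIKE", "OUT", "INNING"]
    print(correct_with_domain_vocabulary(["STR1KE", "0UT", "XYZ"], sports_vocabulary))
```

A vocabulary lookup is the simplest possible domain constraint; a real system might also constrain word order or field positions within the caption box.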

    Video description system and method
    22.
    Patent application
    Video description system and method (lapsed)

    Publication No.: US20070245400A1

    Publication date: 2007-10-18

    Application No.: US11448114

    Filing date: 2006-06-06

    IPC classification: H04N7/16

    Abstract: Systems and methods for describing video content establish video description records which include an object set (24), an object hierarchy (26) and entity relation graphs (28). Video objects can include global objects, segment objects and local objects. The video objects are further defined by a number of features organized in classes, which in turn are further defined by a number of feature descriptors (36, 38, and 40). The relationships (44) between and among the objects in the object set (24) are defined by the object hierarchy (26) and entity relation graphs (28). The video description records provide a standard vehicle for describing the content and context of video information for subsequent access and processing by computer applications such as search engines, filters and archive systems.

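The record structure named in the abstract (object set, object hierarchy, entity relation graphs, and features grouped into classes of descriptors) can be pictured with a small data model. This is only a sketch under stated assumptions: the class names, field names, and the triple representation of relations are hypothetical and do not reproduce the schema defined in the patent.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class VideoObject:
    object_id: str
    kind: str  # one of "global", "segment", "local"
    # feature class -> descriptor name -> descriptor value (assumed layout)
    features: Dict[str, Dict[str, object]] = field(default_factory=dict)


@dataclass
class VideoDescriptionRecord:
    object_set: Dict[str, VideoObject] = field(default_factory=dict)
    # object hierarchy stored as parent id -> list of child ids
    hierarchy: Dict[str, List[str]] = field(default_factory=dict)
    # entity relation graph stored as (source id, relation name, target id)
    relations: List[Tuple[str, str, str]] = field(default_factory=list)

    def add_object(self, obj: VideoObject, parent: str = None) -> None:
        """Add an object to the set and, optionally, under a known parent."""
        if parent is not None and parent not in self.object_set:
            raise KeyError(f"unknown parent object: {parent}")
        self.object_set[obj.object_id] = obj
        if parent is not None:
            self.hierarchy.setdefault(parent, []).append(obj.object_id)

    def relate(self, source: str, relation: str, target: str) -> None:
        """Record a named relation between two objects already in the set."""
        for object_id in (source, target):
            if object_id not in self.object_set:
                raise KeyError(f"unknown object: {object_id}")
        self.relations.append((source, relation, target))


if __name__ == "__main__":
    record = VideoDescriptionRecord()
    record.add_object(VideoObject("video", "global"))
    record.add_object(VideoObject("shot1", "segment"), parent="video")
    record.add_object(
        VideoObject("ball", "local", {"visual": {"color": "white"}}),
        parent="shot1",
    )
    record.relate("ball", "appears_in", "shot1")
    print(record.hierarchy)
    print(record.relations)
```

Keeping the hierarchy and the relation graph separate from the object set mirrors the abstract's statement that relationships among objects are defined by those two structures rather than by the objects themselves.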

    Video description system and method
    24.
    Granted patent
    Video description system and method (in force)

    Publication No.: US07143434B1

    Publication date: 2006-11-28

    Application No.: US09831218

    Filing date: 1999-11-05

    IPC classification: G09G5/00 H04B1/66 G06K9/00

    Abstract: The present invention relates to a system for generating a description record from multimedia information including, e.g., video data. A multimedia information input interface is used to receive multimedia information. A computer processor receives the multimedia information, processes the video information by performing video object extraction processing to generate video object descriptions from the video information, processes the generated video object descriptions by object hierarchy construction and extraction processing to generate video object hierarchy descriptions, and processes the generated video object descriptions by entity relation graph descriptions.


    Method and apparatus for processing echocardiogram video images

    Publication No.: US06514207B2

    Publication date: 2003-02-04

    Application No.: US09835166

    Filing date: 2001-04-16

    IPC classification: A61B806

    Abstract: Methods and a system are disclosed for processing an echocardiogram video of a patient's heart. The echocardiogram comprises at least a first sequence of consecutive video frames corresponding to a first view of the patient's heart concatenated with a second sequence of consecutive video frames corresponding to a second view of the patient's heart. The end-diastole phase of the patient's heart is monitored in each frame by detecting the electrocardiograph wave, and a key frame is selected upon the occurrence of the R-wave peak in the electrocardiograph wave in each of the first sequence of consecutive video frames and in the second sequence of consecutive video frames. The shape and color content of the echocardiogram image window is monitored in certain video frames, and a transition is detected when there is a change in the first feature between adjacent frames. A summary is generated which comprises the video frames corresponding to the end-diastole phase.

    Video description system and method
    26.
    Granted patent
    Video description system and method (lapsed)

    Publication No.: US08370869B2

    Publication date: 2013-02-05

    Application No.: US11448114

    Filing date: 2006-06-06

    IPC classification: H04H60/32 G06K9/00

    Abstract: Systems and methods for describing video content establish video description records which include an object set (24), an object hierarchy (26) and entity relation graphs (28). Video objects can include global objects, segment objects and local objects. The video objects are further defined by a number of features organized in classes, which in turn are further defined by a number of feature descriptors (36, 38, and 40). The relationships (44) between and among the objects in the object set (24) are defined by the object hierarchy (26) and entity relation graphs (28). The video description records provide a standard vehicle for describing the content and context of video information for subsequent access and processing by computer applications such as search engines, filters and archive systems.


    Multimedia integration description scheme, method and system for MPEG-7
    27.
    Granted patent
    Multimedia integration description scheme, method and system for MPEG-7 (in force)

    Publication No.: US07970822B2

    Publication date: 2011-06-28

    Application No.: US12372052

    Filing date: 2009-02-17

    IPC classification: G06F15/16 G06F12/00

    Abstract: The invention provides a system and method for integrating multimedia descriptions in a way that allows humans, software components or devices to easily identify, represent, manage, retrieve, and categorize the multimedia content. In this manner, a user who may be interested in locating a specific piece of multimedia content from a database, Internet, or broadcast media, for example, may search for and find the multimedia content. In this regard, the invention provides a system and method that receives multimedia content and separates the multimedia content into separate components which are assigned to multimedia categories, such as image, video, audio, synthetic and text. Within each of the multimedia categories, the multimedia content is classified and descriptions of the multimedia content are generated. The descriptions are then formatted and integrated using a multimedia integration description scheme, and the multimedia integration description is generated for the multimedia content. The multimedia description is then stored into a database. As a result, a user may query a search engine which then retrieves the multimedia content from the database whose integration description matches the query criteria specified by the user. The search engine can then provide the user a useful search result based on the multimedia integration description.


    VIDEO CONCEPT CLASSIFICATION USING AUDIO-VISUAL ATOMS
    28.
    Patent application
    VIDEO CONCEPT CLASSIFICATION USING AUDIO-VISUAL ATOMS (in force)

    Publication No.: US20110081082A1

    Publication date: 2011-04-07

    Application No.: US12574716

    Filing date: 2009-10-07

    IPC classification: G06K9/00 G06K9/62 G10L11/00

    CPC classification: G06K9/00765 G10L25/00

    Abstract: A method for determining a classification for a video segment, comprising the steps of: breaking the video segment into a plurality of short-term video slices, each including a plurality of video frames and an audio signal; analyzing the video frames for each short-term video slice to form a plurality of region tracks; analyzing each region track to form a visual feature vector and a motion feature vector; analyzing the audio signal for each short-term video slice to determine an audio feature vector; forming a plurality of short-term audio-visual atoms for each short-term video slice by combining the visual feature vector and the motion feature vector for a particular region track with the corresponding audio feature vector; and using a classifier to determine a classification for the video segment responsive to the short-term audio-visual atoms.

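The atom-forming step in the abstract can be sketched as follows, assuming that "combining" means concatenating each region track's visual and motion vectors with the audio vector of the same slice. The abstract leaves the feature extractors and the classifier unspecified, so the feature values here are placeholders and the nearest-centroid rule over mean-pooled atoms is a swapped-in toy classifier. All class and function names are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Sequence


@dataclass
class RegionTrack:
    visual: List[float]  # visual feature vector of one region track
    motion: List[float]  # motion feature vector of the same track


@dataclass
class VideoSlice:
    tracks: List[RegionTrack]
    audio: List[float]  # one audio feature vector per short-term slice


def build_atoms(slices: Sequence[VideoSlice]) -> List[List[float]]:
    """One atom per region track: visual + motion + the slice's audio vector."""
    atoms = []
    for video_slice in slices:
        for track in video_slice.tracks:
            atoms.append(track.visual + track.motion + video_slice.audio)
    return atoms


def classify(atoms: Sequence[Sequence[float]], centroids: Dict[str, List[float]]) -> str:
    """Toy classifier (assumption): mean-pool the atoms, pick the nearest centroid."""
    if not atoms:
        raise ValueError("at least one atom is required")
    dim = len(atoms[0])
    mean = [sum(atom[i] for atom in atoms) / len(atoms) for i in range(dim)]

    def squared_distance(centroid: Sequence[float]) -> float:
        return sum((m - c) ** 2 for m, c in zip(mean, centroid))

    return min(centroids, key=lambda label: squared_distance(centroids[label]))


if __name__ == "__main__":
    slices = [
        VideoSlice([RegionTrack([1.0], [0.0]), RegionTrack([0.8], [0.2])], [0.5]),
        VideoSlice([RegionTrack([0.9], [0.1])], [0.4]),
    ]
    atoms = build_atoms(slices)
    print(len(atoms), classify(atoms, {"a": [1.0, 0.0, 0.5], "b": [0.0, 1.0, 0.5]}))
```

Because the audio vector is shared by every track in a slice, the number of atoms per slice equals the number of region tracks in that slice.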

    SYSTEMS AND METHODS FOR IMAGE ARCHEOLOGY
    29.
    Patent application
    SYSTEMS AND METHODS FOR IMAGE ARCHEOLOGY (in force)

    Publication No.: US20110025710A1

    Publication date: 2011-02-03

    Application No.: US12861377

    Filing date: 2010-08-23

    IPC classification: G09G5/00 G06F17/30 G06K9/36

    Abstract: Systems and methods are described for determining manipulation history among a plurality of images. The described techniques include selecting a pair of images from the plurality of images, detecting one or more manipulations operable to transform one of the images to the other, and based on the manipulations detected, determining a parent-child relationship between the pair or pairs of images. The described techniques can further include repeating the selecting of two images, detecting manipulations, and determining the parent-child relationship for each pair of images in the plurality of images, constructing a visual migration map for the images, and presenting the visual migration map in a user-readable format.

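The pairwise procedure in the abstract can be sketched with toy detectors. The assumption here is that some manipulations are directional (an image can be shrunk or converted to grayscale, but not the reverse without loss), so a manipulation detected in only one direction suggests which image is the parent. The `ImageInfo` fields, the two detectors, and the adjacency-list form of the migration map are hypothetical simplifications, not the detectors described in the application.

```python
from dataclasses import dataclass
from itertools import combinations
from typing import Dict, List, Optional, Sequence, Tuple


@dataclass(frozen=True)
class ImageInfo:
    name: str
    width: int
    height: int
    grayscale: bool


def detect_manipulations(a: ImageInfo, b: ImageInfo) -> List[str]:
    """Toy detectors: manipulations that could have turned image a into image b."""
    found = []
    if b.width < a.width and b.height < a.height:
        found.append("downscale_or_crop")
    if b.grayscale and not a.grayscale:
        found.append("grayscale")
    return found


def parent_child(a: ImageInfo, b: ImageInfo) -> Optional[Tuple[str, str]]:
    """Return (parent, child) when manipulations are found in one direction only."""
    a_to_b = detect_manipulations(a, b)
    b_to_a = detect_manipulations(b, a)
    if a_to_b and not b_to_a:
        return (a.name, b.name)
    if b_to_a and not a_to_b:
        return (b.name, a.name)
    return None  # ambiguous or unrelated under these toy detectors


def migration_map(images: Sequence[ImageInfo]) -> Dict[str, List[str]]:
    """Repeat the pairwise test over all pairs; store parent -> children edges."""
    edges: Dict[str, List[str]] = {image.name: [] for image in images}
    for a, b in combinations(images, 2):
        relation = parent_child(a, b)
        if relation is not None:
            edges[relation[0]].append(relation[1])
    return edges


if __name__ == "__main__":
    images = [
        ImageInfo("original", 800, 600, False),
        ImageInfo("small", 400, 300, False),
        ImageInfo("small_gray", 400, 300, True),
    ]
    for parent, children in migration_map(images).items():
        print(parent, "->", children)
```

This sketch keeps every detected edge, including indirect ones such as original to small_gray; pruning those to leave only direct parent-child links would be a separate step.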

    Systems and methods for interoperable multimedia content descriptions
    30.
    Granted patent
    Systems and methods for interoperable multimedia content descriptions (lapsed)

    Publication No.: US07653635B1

    Publication date: 2010-01-26

    Application No.: US09830899

    Filing date: 1999-11-05

    IPC classification: G06F17/30

    Abstract: Systems and methods for generating standard description records from multimedia information are provided. The system includes at least one multimedia information input interface (180) receiving multimedia information, a computer processor, and a data storage system (150), operatively coupled to said processor, for storing said at least one description record. The processor performs object extraction processing to generate multimedia object descriptions (200, 201, 205) from the multimedia information, and object hierarchy processing (410, 420) to generate multimedia object hierarchy descriptions, to generate at least one description record including the multimedia object descriptions (200, 201, 205) and multimedia object hierarchy descriptions for content embedded within the multimedia information.
