System and method for semantic video segmentation based on joint audiovisual and text analysis
    1.
    发明申请
    System and method for semantic video segmentation based on joint audiovisual and text analysis 失效
    基于联合视听和文本分析的语义视频分割系统和方法

    公开(公告)号:US20070055695A1

    公开(公告)日:2007-03-08

    申请号:US11210305

    申请日:2005-08-24

    IPC分类号: G06F17/00

    CPC分类号: G06F17/30787 G06F17/30796

    摘要: System and method for partitioning a video into a series of semantic units where each semantic unit relates to a generally complete thematic topic. A computer implemented method for partitioning a video into a series of semantic units wherein each semantic unit relates to a theme or a topic, comprises dividing a video into a plurality of homogeneous segments, analyzing audio and visual content of the video, extracting a plurality of keywords from the speech content of each of the plurality of homogeneous segments of the video, and detecting and merging a plurality of groups of semantically related and temporally adjacent homogeneous segments into a series of semantic units in accordance with the results of both the audio and visual analysis and the keyword extraction. The present invention can be applied to generate important table-of-contents as well as index tables for videos to facilitate efficient video topic searching and browsing.

    摘要翻译: 将视频分割成一系列语义单元的系统和方法,其中每个语义单元涉及一般完整的主题。 一种用于将视频分割成一系列语义单元的计算机实现的方法,其中每个语义单元涉及主题或主题,包括将视频划分为多个同构段,分析视频的音频和视觉内容,提取多个 根据视频的多个同构段的每个的语音内容的关键字,以及根据音频和视频的结果检测和合并多个语义相关和时间上相邻的同构段的组成一系列语义单元 分析和关键词提取。 本发明可以应用于产生重要的内容表以及用于视频的索引表,以便于有效的视频主题搜索和浏览。

    System and method for semantic video segmentation based on joint audiovisual and text analysis
    2.
    发明授权
    System and method for semantic video segmentation based on joint audiovisual and text analysis 失效
    基于联合视听和文本分析的语义视频分割系统和方法

    公开(公告)号:US08121432B2

    公开(公告)日:2012-02-21

    申请号:US12055023

    申请日:2008-03-25

    IPC分类号: G06K9/36

    CPC分类号: G06F17/30787 G06F17/30796

    摘要: System and method for partitioning a video into a series of semantic units where each semantic unit relates to a generally complete thematic topic. A computer implemented method for partitioning a video into a series of semantic units wherein each semantic unit relates to a theme or a topic, comprises dividing a video into a plurality of homogeneous segments, analyzing audio and visual content of the video, extracting a plurality of keywords from the speech content of each of the plurality of homogeneous segments of the video, and detecting and merging a plurality of groups of semantically related and temporally adjacent homogeneous segments into a series of semantic units in accordance with the results of both the audio and visual analysis and the keyword extraction. The present invention can be applied to generate important table-of-contents as well as index tables for videos to facilitate efficient video topic searching and browsing.

    摘要翻译: 将视频分割成一系列语义单元的系统和方法,其中每个语义单元涉及一般完整的主题。 一种用于将视频分割成一系列语义单元的计算机实现的方法,其中每个语义单元涉及主题或主题,包括将视频划分为多个同构段,分析视频的音频和视觉内容,提取多个 根据视频的多个同构段的每个的语音内容的关键字,以及根据音频和视频的结果检测和合并多个语义相关和时间上相邻的同构段的组成一系列语义单元 分析和关键词提取。 本发明可以应用于产生重要的内容表以及用于视频的索引表,以便于有效的视频主题搜索和浏览。

    SYSTEM AND METHOD FOR SEMANTIC VIDEO SEGMENTATION BASED ON JOINT AUDIOVISUAL AND TEXT ANALYSIS
    3.
    发明申请
    SYSTEM AND METHOD FOR SEMANTIC VIDEO SEGMENTATION BASED ON JOINT AUDIOVISUAL AND TEXT ANALYSIS 失效
    基于联合音视频分析的语义视频分割系统与方法

    公开(公告)号:US20080175556A1

    公开(公告)日:2008-07-24

    申请号:US12055023

    申请日:2008-03-25

    IPC分类号: H04N5/93

    CPC分类号: G06F17/30787 G06F17/30796

    摘要: System and method for partitioning a video into a series of semantic units where each semantic unit relates to a generally complete thematic topic. A computer implemented method for partitioning a video into a series of semantic units wherein each semantic unit relates to a theme or a topic, comprises dividing a video into a plurality of homogeneous segments, analyzing audio and visual content of the video, extracting a plurality of keywords from the speech content of each of the plurality of homogeneous segments of the video, and detecting and merging a plurality of groups of semantically related and temporally adjacent homogeneous segments into a series of semantic units in accordance with the results of both the audio and visual analysis and the keyword extraction. The present invention can be applied to generate important table-of-contents as well as index tables for videos to facilitate efficient video topic searching and browsing.

    摘要翻译: 将视频分割成一系列语义单元的系统和方法,其中每个语义单元涉及一般完整的主题。 一种用于将视频分割成一系列语义单元的计算机实现的方法,其中每个语义单元涉及主题或主题,包括将视频划分为多个同构段,分析视频的音频和视觉内容,提取多个 根据视频的多个同构段的每个的语音内容的关键字,以及根据音频和视频的结果检测和合并多个语义相关和时间上相邻的同构段的组成一系列语义单元 分析和关键词提取。 本发明可以应用于产生重要的内容表以及用于视频的索引表,以便于有效的视频主题搜索和浏览。

    System and method for semantic video segmentation based on joint audiovisual and text analysis
    4.
    发明授权
    System and method for semantic video segmentation based on joint audiovisual and text analysis 失效
    基于联合视听和文本分析的语义视频分割系统和方法

    公开(公告)号:US07382933B2

    公开(公告)日:2008-06-03

    申请号:US11210305

    申请日:2005-08-24

    IPC分类号: G06K9/36

    CPC分类号: G06F17/30787 G06F17/30796

    摘要: System and method for partitioning a video into a series of semantic units where each semantic unit relates to a generally complete thematic topic. A computer implemented method for partitioning a video into a series of semantic units wherein each semantic unit relates to a theme or a topic, comprises dividing a video into a plurality of homogeneous segments, analyzing audio and visual content of the video, extracting a plurality of keywords from the speech content of each of the plurality of homogeneous segments of the video, and detecting and merging a plurality of groups of semantically related and temporally adjacent homogeneous segments into a series of semantic units in accordance with the results of both the audio and visual analysis and the keyword extraction. The present invention can be applied to generate important table-of-contents as well as index tables for videos to facilitate efficient video topic searching and browsing.

    摘要翻译: 将视频分割成一系列语义单元的系统和方法,其中每个语义单元涉及一般完整的主题。 一种用于将视频分割成一系列语义单元的计算机实现的方法,其中每个语义单元涉及主题或主题,包括将视频划分为多个同构段,分析视频的音频和视觉内容,提取多个 根据视频的多个同构段的每个的语音内容的关键字,以及根据音频和视频的结果检测和合并多个语义相关和时间上相邻的同构段的组成一系列语义单元 分析和关键词提取。 本发明可以应用于产生重要的内容表以及用于视频的索引表,以便于有效的视频主题搜索和浏览。

    System and method for detecting topic shift boundaries in multimedia streams using joint audio, visual and text cues
    5.
    发明申请
    System and method for detecting topic shift boundaries in multimedia streams using joint audio, visual and text cues 审中-公开
    用于使用联合音频,视觉和文本提示来检测多媒体流中的主题移位边界的系统和方法

    公开(公告)号:US20080066136A1

    公开(公告)日:2008-03-13

    申请号:US11509250

    申请日:2006-08-24

    摘要: Computer implemented method, system and computer usable program code for detecting topic shift boundaries in a multimedia stream. A computer implemented method for detecting topic shift boundaries in a multimedia stream includes receiving a multimedia stream, and performing multimodal analysis on the multimedia stream to locate a plurality of temporal positions within the multimedia stream at which topic changes have an increased likelihood of occurring to provide a sequence of multimedia portions. Characteristics for a sliding window for each multimedia portion in the sequence of multimedia portions are automatically determined, and topic shift boundaries are detected in each multimedia portion by applying a text-based topic shift detector over the media stream's text transcript using a sliding window, wherein the sliding window used with each multimedia portion has the characteristics determined from its respective multimedia portion.

    摘要翻译: 用于检测多媒体流中的主题移位边界的计算机实现的方法,系统和计算机可用程序代码。 一种用于检测多媒体流中的主题移位边界的计算机实现方法,包括:接收多媒体流,以及对所述多媒体流执行多模态分析,以定位所述多媒体流内的多个时间位置,在该多媒体流中,主题变化具有增加的发生可能性以提供 多媒体部分的序列。 自动确定多媒体部分序列中的每个多媒体部分的滑动窗口的特征,并且通过使用滑动窗口在媒体流的文本转录本上应用基于文本的主题移位检测器,在每个多媒体部分中检测主题移位边界,其中 与每个多媒体部分一起使用的滑动窗口具有从其各自的多媒体部分确定的特征。

    System and method for automatically creating personal profiles for video characters
    6.
    发明申请
    System and method for automatically creating personal profiles for video characters 审中-公开
    自动创建视频角色个人资料的系统和方法

    公开(公告)号:US20080059522A1

    公开(公告)日:2008-03-06

    申请号:US11511816

    申请日:2006-08-29

    申请人: Ying Li Youngja Park

    发明人: Ying Li Youngja Park

    IPC分类号: G06F17/00

    摘要: A system, method and computer program product for generating personal profiles for subjects appearing in a video media source. The method includes extracting audiovisual-related personal information related to a subject appearing in the video media source; extracting text-related personal information that are related to the subject in the video source; correlating the extracted audiovisual-related personal information and the extracted text-related personal information related to the subject; and assembling a personal profile data structure for the subject, the personal profile data structure comprising the text-related personal information and audiovisual-related personal information related to the subject. The text-related personal information forms the name identity of the subject, while the audiovisual-related personal information includes audiovisual-related features including information forming one or more of: a visual identity, a kinematic identity and, a voice identity of the subject. In an alternate embodiment, in an iterative manner, the correlated extracted audiovisual-related personal information and extracted text-related personal information may be fed back and utilized for performing an additional search from external information sources, via a search engine, to obtain additional texts relating to the subject or obtain additional video media sources having the subject. There is further enabled the updating of an assembled personal profile of a subject as a new video media source having said subject becomes available.

    摘要翻译: 一种用于生成出现在视频媒体源中的对象的个人简档的系统,方法和计算机程序产品。 该方法包括提取与视频媒体源中出现的对象相关的与视听有关的个人信息; 提取与视频源中的主题相关的文本相关个人信息; 将所提取的与视听有关的个人信息与所提取的与被摄体有关的文本相关个人信息相关联; 以及组合个体简档数据结构,该个人简档数据结构包括与该对象相关的文本相关个人信息和与视听相关的个人信息。 与文本相关的个人信息形成主题的姓名身份,而与视听相关的个人信息包括视听相关特征,包括形成以下中的一个或多个的信息:视觉身份,运动身份和主体的语音身份。 在替代实施例中,以迭代方式,相关联的提取的与视听有关的个人信息和提取的与文本相关的个人信息可以被反馈并用于经由搜索引擎从外部信息源执行附加搜索以获得附加文本 涉及该主题或获得具有该主题的附加视频媒体源。 此外,当具有所述对象的新的视频媒体源变得可用时,还能够更新被摄体的组合个人简档。

    System and method for extracting salient keywords for videos
    7.
    发明申请
    System and method for extracting salient keywords for videos 审中-公开
    提取视频关键词的系统和方法

    公开(公告)号:US20070185857A1

    公开(公告)日:2007-08-09

    申请号:US11337371

    申请日:2006-01-23

    IPC分类号: G06F17/30

    摘要: Computer implemented method, system and computer program product for extracting salient keywords for videos. A computer implemented method for extracting salient keywords for videos includes extracting a set of candidate keywords from a text source of a video, assigning a salience value to each candidate keyword based on statistical information to provide a set of statistically significant keywords, exploiting additional cues that are available to the video and that can be used to further measure the significance of existing keywords or to extract new keywords, and selecting a set of salient keywords for the video based on the set of statistically significant keywords and the additional cues.

    摘要翻译: 计算机实现方法,系统和计算机程序产品,用于提取视频的关键词。 用于提取视频的关键词的计算机实现方法包括从视频的文本源提取一组候选关键字,基于统计信息为每个候选关键字分配突出值,以提供一组统计上显着的关键字,利用附加线索 可用于视频,可用于进一步测量现有关键字的重要性或提取新关键字,以及基于一组统计学上重要的关键字和其他线索为视频选择一组显着的关键字。

    Techniques for Generating Balanced and Class-Independent Training Data From Unlabeled Data Set
    8.
    发明申请
    Techniques for Generating Balanced and Class-Independent Training Data From Unlabeled Data Set 审中-公开
    从非标准数据集中生成平衡和类别独立训练数据的技术

    公开(公告)号:US20130097103A1

    公开(公告)日:2013-04-18

    申请号:US13274002

    申请日:2011-10-14

    IPC分类号: G06F15/18 G06F17/30

    CPC分类号: G06N20/00

    摘要: Techniques for creating training sets for predictive modeling are provided. In one aspect, a method for generating training data from an unlabeled data set is provided which includes the following steps. A small initial set of data is selected from the unlabeled data set. Labels are acquired for the initial set of data selected from the unlabeled data set resulting in labeled data. The data in the unlabeled data set is clustered using a semi-supervised clustering process along with the labeled data to produce data clusters. Data samples are chosen from each of the clusters to use as the training data. The selecting, presenting, clustering and choosing steps are repeated with one or more additional sets of data selected from the unlabeled data set until a desired amount of training data has been obtained, wherein at each iteration an amount of the labeled data is increased.

    摘要翻译: 提供了用于创建预测建模训练集的技术。 一方面,提供了一种用于从未标记的数据集生成训练数据的方法,包括以下步骤。 从未标记的数据集中选择一小段初始数据。 从未标记的数据集中选择的初始数据集中获取标签,从而产生标记数据。 未标记数据集中的数据使用半监督聚类过程与标记数据一起聚类以产生数据集群。 从每个群集中选择数据样本以用作训练数据。 使用从未标记的数据集中选择的一个或多个附加数据集重复选择,呈现,聚类和选择步骤,直到获得了所需量的训练数据,其中在每次迭代时,标记数据的量增加。

    SYSTEM AND METHOD FOR AUTOMATIC CALL SEGMENTATION AT CALL CENTER
    9.
    发明申请
    SYSTEM AND METHOD FOR AUTOMATIC CALL SEGMENTATION AT CALL CENTER 有权
    呼叫中心自动呼叫分段系统及方法

    公开(公告)号:US20100104086A1

    公开(公告)日:2010-04-29

    申请号:US12257037

    申请日:2008-10-23

    申请人: Youngja Park

    发明人: Youngja Park

    IPC分类号: H04M3/00

    摘要: A system and method for automatic call segmentation including steps and means for automatically detecting boundaries between utterances in the call transcripts; automatically classifying utterances into target call sections; automatically partitioning the call transcript into call segments; and outputting a segmented call transcript. A training method and apparatus for training the system to perform automatic call segmentation includes steps and means for providing at least one training transcript with annotated call sections; normalizing the at least one training transcript; and performing statistical analysis on the at least one training transcript.

    摘要翻译: 一种用于自动呼叫分段的系统和方法,包括用于自动检测呼叫转录中的话语之间的边界的步骤和装置; 自动将话语分类为目标通话部分; 自动将通话记录分成通话段; 并输出分段呼叫转录。 用于训练系统执行自动呼叫分段的训练方法和装置包括用于提供至少一个具有注释呼叫部分的训练成绩单的步骤和装置; 规范化至少一个训练成绩单; 以及对所述至少一个训练成绩单执行统计分析。

    System, method, program product, and networking use for recognizing words and their parts of speech in one or more natural languages
    10.
    发明授权
    System, method, program product, and networking use for recognizing words and their parts of speech in one or more natural languages 有权
    系统,方法,程序产品和网络使用,用于以一种或多种自然语言识别单词及其部分语音

    公开(公告)号:US07680649B2

    公开(公告)日:2010-03-16

    申请号:US10173931

    申请日:2002-06-17

    申请人: Youngja Park

    发明人: Youngja Park

    IPC分类号: G06F17/21 G06F17/20

    CPC分类号: G06F17/2755 G06F17/278

    摘要: A system, method, and computer program are disclosed for recognizing one or more words not listed in a dictionary database. One or more sequences of characters in the word are checked to determine a probability that the word is valid. A prefix removal process removes any prefixes from a word, and obtains information about the removed prefix. A suffix removal process removes any suffixes from the word, and obtains information about the removed suffix. A root process obtains information about a root word from the dictionary database. A combination process then determines if the prefix, the root, and the suffix can be combined into a valid word as defined by one or more combination rules, obtains one or more of the possible parts of speech of the valid word, and stores the parts of speech with the valid word in the dictionary database.

    摘要翻译: 公开了一种用于识别字典数据库中未列出的一个或多个字的系统,方法和计算机程序。 检查单词中的一个或多个字符序列以确定单词有效的概率。 前缀删除过程从单词中删除任何前缀,并获取有关已删除的前缀的信息。 后缀删除过程从单词中删除任何后缀,并获取有关已删除后缀的信息。 根进程从字典数据库获取有关根词的信息。 然后,组合处理确定前缀,根和后缀是否可以组合成由一个或多个组合规则定义的有效字,获得有效字的一个或多个可能的语音部分,并存储部分 的词典数据库中的有效单词。