Knowledge management system, program product and method
    1.
    发明授权
    Knowledge management system, program product and method 有权
    知识管理系统,程序产品和方法

    公开(公告)号:US07657546B2

    公开(公告)日:2010-02-02

    申请号:US11340246

    申请日:2006-01-26

    IPC分类号: G06F7/00 G06F17/00

    CPC分类号: G06F17/30864 G06F17/30705

    摘要: An ontology directory service tool, computer program product and method of automatically discovering ontology file categories. A web search unit searches a network (e.g., the Internet) for semantic data files, e.g., semantic web pages. A preprocessing unit generates an ontology file from the content of each identified semantic data file. A category discovery unit identifies a domain for each ontology file and provides training sets for training ontology file classification. A classification unit trained using the training sets, classifies ontology file instances into inherent ontology categories.

    摘要翻译: 本体目录服务工具,计算机程序产品和自动发现本体文件类别的方法。 网络搜索单元在网络(例如,因特网)中搜索语义数据文件,例如语义网页。 预处理单元从每个识别的语义数据文件的内容生成本体文件。 类别发现单元识别每个本体文件的域,并提供用于训练本体文件分类的训练集。 使用训练集训练的分类单元,将本体文件实例分类为固有的本体类别。

    Knowledge management system, program product and method
    2.
    发明申请
    Knowledge management system, program product and method 有权
    知识管理系统,程序产品和方法

    公开(公告)号:US20070174270A1

    公开(公告)日:2007-07-26

    申请号:US11340246

    申请日:2006-01-26

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864 G06F17/30705

    摘要: An ontology directory service tool, computer program product and method of automatically discovering ontology file categories. A web search unit searches a network (e.g., the Internet) for semantic data files, e.g., semantic web pages. A preprocessing unit generates an ontology file from the content of each identified semantic data file. A category discovery unit identifies a domain for each ontology file and provides training sets for training ontology file classification. A classification unit trained using the training sets, classifies ontology file instances into inherent ontology categories.

    摘要翻译: 本体目录服务工具,计算机程序产品和自动发现本体文件类别的方法。 网络搜索单元在网络(例如,因特网)中搜索语义数据文件,例如语义网页。 预处理单元从每个识别的语义数据文件的内容生成本体文件。 类别发现单元识别每个本体文件的域,并提供用于训练本体文件分类的训练集。 使用训练集训练的分类单元,将本体文件实例分类为固有的本体类别。

    Techniques for Generating Balanced and Class-Independent Training Data From Unlabeled Data Set
    3.
    发明申请
    Techniques for Generating Balanced and Class-Independent Training Data From Unlabeled Data Set 审中-公开
    从非标准数据集中生成平衡和类别独立训练数据的技术

    公开(公告)号:US20130097103A1

    公开(公告)日:2013-04-18

    申请号:US13274002

    申请日:2011-10-14

    IPC分类号: G06F15/18 G06F17/30

    CPC分类号: G06N20/00

    摘要: Techniques for creating training sets for predictive modeling are provided. In one aspect, a method for generating training data from an unlabeled data set is provided which includes the following steps. A small initial set of data is selected from the unlabeled data set. Labels are acquired for the initial set of data selected from the unlabeled data set resulting in labeled data. The data in the unlabeled data set is clustered using a semi-supervised clustering process along with the labeled data to produce data clusters. Data samples are chosen from each of the clusters to use as the training data. The selecting, presenting, clustering and choosing steps are repeated with one or more additional sets of data selected from the unlabeled data set until a desired amount of training data has been obtained, wherein at each iteration an amount of the labeled data is increased.

    摘要翻译: 提供了用于创建预测建模训练集的技术。 一方面,提供了一种用于从未标记的数据集生成训练数据的方法,包括以下步骤。 从未标记的数据集中选择一小段初始数据。 从未标记的数据集中选择的初始数据集中获取标签,从而产生标记数据。 未标记数据集中的数据使用半监督聚类过程与标记数据一起聚类以产生数据集群。 从每个群集中选择数据样本以用作训练数据。 使用从未标记的数据集中选择的一个或多个附加数据集重复选择,呈现,聚类和选择步骤,直到获得了所需量的训练数据,其中在每次迭代时,标记数据的量增加。

    System and method for semantic video segmentation based on joint audiovisual and text analysis
    4.
    发明授权
    System and method for semantic video segmentation based on joint audiovisual and text analysis 失效
    基于联合视听和文本分析的语义视频分割系统和方法

    公开(公告)号:US08121432B2

    公开(公告)日:2012-02-21

    申请号:US12055023

    申请日:2008-03-25

    IPC分类号: G06K9/36

    CPC分类号: G06F17/30787 G06F17/30796

    摘要: System and method for partitioning a video into a series of semantic units where each semantic unit relates to a generally complete thematic topic. A computer implemented method for partitioning a video into a series of semantic units wherein each semantic unit relates to a theme or a topic, comprises dividing a video into a plurality of homogeneous segments, analyzing audio and visual content of the video, extracting a plurality of keywords from the speech content of each of the plurality of homogeneous segments of the video, and detecting and merging a plurality of groups of semantically related and temporally adjacent homogeneous segments into a series of semantic units in accordance with the results of both the audio and visual analysis and the keyword extraction. The present invention can be applied to generate important table-of-contents as well as index tables for videos to facilitate efficient video topic searching and browsing.

    摘要翻译: 将视频分割成一系列语义单元的系统和方法,其中每个语义单元涉及一般完整的主题。 一种用于将视频分割成一系列语义单元的计算机实现的方法,其中每个语义单元涉及主题或主题,包括将视频划分为多个同构段,分析视频的音频和视觉内容,提取多个 根据视频的多个同构段的每个的语音内容的关键字,以及根据音频和视频的结果检测和合并多个语义相关和时间上相邻的同构段的组成一系列语义单元 分析和关键词提取。 本发明可以应用于产生重要的内容表以及用于视频的索引表,以便于有效的视频主题搜索和浏览。

    SYSTEM AND METHOD FOR AUTOMATIC CALL SEGMENTATION AT CALL CENTER
    5.
    发明申请
    SYSTEM AND METHOD FOR AUTOMATIC CALL SEGMENTATION AT CALL CENTER 有权
    呼叫中心自动呼叫分段系统及方法

    公开(公告)号:US20100104086A1

    公开(公告)日:2010-04-29

    申请号:US12257037

    申请日:2008-10-23

    申请人: Youngja Park

    发明人: Youngja Park

    IPC分类号: H04M3/00

    摘要: A system and method for automatic call segmentation including steps and means for automatically detecting boundaries between utterances in the call transcripts; automatically classifying utterances into target call sections; automatically partitioning the call transcript into call segments; and outputting a segmented call transcript. A training method and apparatus for training the system to perform automatic call segmentation includes steps and means for providing at least one training transcript with annotated call sections; normalizing the at least one training transcript; and performing statistical analysis on the at least one training transcript.

    摘要翻译: 一种用于自动呼叫分段的系统和方法,包括用于自动检测呼叫转录中的话语之间的边界的步骤和装置; 自动将话语分类为目标通话部分; 自动将通话记录分成通话段; 并输出分段呼叫转录。 用于训练系统执行自动呼叫分段的训练方法和装置包括用于提供至少一个具有注释呼叫部分的训练成绩单的步骤和装置; 规范化至少一个训练成绩单; 以及对所述至少一个训练成绩单执行统计分析。

    System, method, program product, and networking use for recognizing words and their parts of speech in one or more natural languages
    6.
    发明授权
    System, method, program product, and networking use for recognizing words and their parts of speech in one or more natural languages 有权
    系统,方法,程序产品和网络使用,用于以一种或多种自然语言识别单词及其部分语音

    公开(公告)号:US07680649B2

    公开(公告)日:2010-03-16

    申请号:US10173931

    申请日:2002-06-17

    申请人: Youngja Park

    发明人: Youngja Park

    IPC分类号: G06F17/21 G06F17/20

    CPC分类号: G06F17/2755 G06F17/278

    摘要: A system, method, and computer program are disclosed for recognizing one or more words not listed in a dictionary database. One or more sequences of characters in the word are checked to determine a probability that the word is valid. A prefix removal process removes any prefixes from a word, and obtains information about the removed prefix. A suffix removal process removes any suffixes from the word, and obtains information about the removed suffix. A root process obtains information about a root word from the dictionary database. A combination process then determines if the prefix, the root, and the suffix can be combined into a valid word as defined by one or more combination rules, obtains one or more of the possible parts of speech of the valid word, and stores the parts of speech with the valid word in the dictionary database.

    摘要翻译: 公开了一种用于识别字典数据库中未列出的一个或多个字的系统,方法和计算机程序。 检查单词中的一个或多个字符序列以确定单词有效的概率。 前缀删除过程从单词中删除任何前缀,并获取有关已删除的前缀的信息。 后缀删除过程从单词中删除任何后缀,并获取有关已删除后缀的信息。 根进程从字典数据库获取有关根词的信息。 然后,组合处理确定前缀,根和后缀是否可以组合成由一个或多个组合规则定义的有效字,获得有效字的一个或多个可能的语音部分,并存储部分 的词典数据库中的有效单词。

    SYSTEM AND METHOD FOR SEMANTIC VIDEO SEGMENTATION BASED ON JOINT AUDIOVISUAL AND TEXT ANALYSIS
    7.
    发明申请
    SYSTEM AND METHOD FOR SEMANTIC VIDEO SEGMENTATION BASED ON JOINT AUDIOVISUAL AND TEXT ANALYSIS 失效
    基于联合音视频分析的语义视频分割系统与方法

    公开(公告)号:US20080175556A1

    公开(公告)日:2008-07-24

    申请号:US12055023

    申请日:2008-03-25

    IPC分类号: H04N5/93

    CPC分类号: G06F17/30787 G06F17/30796

    摘要: System and method for partitioning a video into a series of semantic units where each semantic unit relates to a generally complete thematic topic. A computer implemented method for partitioning a video into a series of semantic units wherein each semantic unit relates to a theme or a topic, comprises dividing a video into a plurality of homogeneous segments, analyzing audio and visual content of the video, extracting a plurality of keywords from the speech content of each of the plurality of homogeneous segments of the video, and detecting and merging a plurality of groups of semantically related and temporally adjacent homogeneous segments into a series of semantic units in accordance with the results of both the audio and visual analysis and the keyword extraction. The present invention can be applied to generate important table-of-contents as well as index tables for videos to facilitate efficient video topic searching and browsing.

    摘要翻译: 将视频分割成一系列语义单元的系统和方法,其中每个语义单元涉及一般完整的主题。 一种用于将视频分割成一系列语义单元的计算机实现的方法,其中每个语义单元涉及主题或主题,包括将视频划分为多个同构段,分析视频的音频和视觉内容,提取多个 根据视频的多个同构段的每个的语音内容的关键字,以及根据音频和视频的结果检测和合并多个语义相关和时间上相邻的同构段的组成一系列语义单元 分析和关键词提取。 本发明可以应用于产生重要的内容表以及用于视频的索引表,以便于有效的视频主题搜索和浏览。

    System and method to govern sensitive data exchange with mobile devices based on threshold sensitivity values
    8.
    发明授权
    System and method to govern sensitive data exchange with mobile devices based on threshold sensitivity values 有权
    基于阈值灵敏度值来管理与移动设备的敏感数据交换的系统和方法

    公开(公告)号:US08560722B2

    公开(公告)日:2013-10-15

    申请号:US13051679

    申请日:2011-03-18

    IPC分类号: G06F15/16

    摘要: Techniques for limiting the risk of loss of sensitive data from a mobile device are provided. In one aspect, a method for managing sensitive data on a mobile device is provided. The method includes the following steps. A sensitivity of a data item to be transferred to the mobile device is determined. It is determined whether an aggregate sensitivity of data items already present on the mobile device plus the data item to be transferred exceeds a current threshold sensitivity value for the mobile device. If the aggregate sensitivity exceeds the current threshold sensitivity value, measures are employed to ensure the aggregate sensitivity remains below the current threshold sensitivity value for the mobile device. Otherwise the data item is transferred to the mobile device.

    摘要翻译: 提供了用于限制从移动设备丢失敏感数据的风险的技术。 一方面,提供了一种用于管理移动设备上的敏感数据的方法。 该方法包括以下步骤。 确定要传送到移动设备的数据项目的灵敏度。 确定移动设备上已经存在的数据项的总体灵敏度加上要传送的数据项是否超过了移动设备的当前阈值灵敏度值。 如果总灵敏度超过当前阈值灵敏度值,则采用措施来确保总灵敏度保持在移动设备的当前阈值灵敏度值以下。 否则将数据项传送到移动设备。

    System and Method to Govern Data Exchange with Mobile Devices
    10.
    发明申请
    System and Method to Govern Data Exchange with Mobile Devices 有权
    用移动设备管理数据交换的系统和方法

    公开(公告)号:US20120240238A1

    公开(公告)日:2012-09-20

    申请号:US13051679

    申请日:2011-03-18

    IPC分类号: H04N7/16

    摘要: Techniques for limiting the risk of loss of sensitive data from a mobile device are provided. In one aspect, a method for managing sensitive data on a mobile device is provided. The method includes the following steps. A sensitivity of a data item to be transferred to the mobile device is determined. It is determined whether an aggregate sensitivity of data items already present on the mobile device plus the data item to be transferred exceeds a current threshold sensitivity value for the mobile device. If the aggregate sensitivity exceeds the current threshold sensitivity value, measures are employed to ensure the aggregate sensitivity remains below the current threshold sensitivity value for the mobile device. Otherwise the data item is transferred to the mobile device.

    摘要翻译: 提供了用于限制从移动设备丢失敏感数据的风险的技术。 一方面,提供了一种用于管理移动设备上的敏感数据的方法。 该方法包括以下步骤。 确定要传送到移动设备的数据项目的灵敏度。 确定移动设备上已经存在的数据项的总体灵敏度加上要传送的数据项是否超过了移动设备的当前阈值灵敏度值。 如果总灵敏度超过当前阈值灵敏度值,则采用措施来确保总灵敏度保持在移动设备的当前阈值灵敏度值以下。 否则将数据项传送到移动设备。