Data Security System
    1.
    发明申请
    Data Security System 有权
    数据安全系统

    公开(公告)号:US20120136646A1

    公开(公告)日:2012-05-31

    申请号:US12956739

    申请日:2010-11-30

    IPC分类号: G06F17/28

    摘要: A method, computer system, and computer program product for translating information. The computer system receives the information for a translation. The computer system identifies portions of the information based on a set of rules for security for the information in response to receiving the information. The computer system sends the portions of the information to a plurality of translation systems. In response to receiving translation results from the plurality of translation systems for respective portions of the information, the computer system combines the translation results for the respective portions to form a consolidated translation of the information.

    摘要翻译: 一种用于翻译信息的方法,计算机系统和计算机程序产品。 计算机系统接收翻译信息。 计算机系统基于用于响应于接收信息的信息的安全性的一组规则来识别信息的部分。 计算机系统将该部分信息发送到多个翻译系统。 响应于从信息的各个部分接收来自多个翻译系统的翻译结果,计算机系统组合各个部分的翻译结果以形成信息的综合翻译。

    Data security system for natural language translation
    2.
    发明授权
    Data security system for natural language translation 有权
    自然语言翻译数据安全系统

    公开(公告)号:US09002696B2

    公开(公告)日:2015-04-07

    申请号:US12956739

    申请日:2010-11-30

    摘要: A method, computer system, and computer program product for translating information. The computer system receives the information for a translation. The computer system identifies portions of the information based on a set of rules for security for the information in response to receiving the information. The computer system sends the portions of the information to a plurality of translation systems. In response to receiving translation results from the plurality of translation systems for respective portions of the information, the computer system combines the translation results for the respective portions to form a consolidated translation of the information.

    摘要翻译: 一种用于翻译信息的方法,计算机系统和计算机程序产品。 计算机系统接收翻译信息。 计算机系统基于用于响应于接收信息的信息的安全性的一组规则来识别信息的部分。 计算机系统将该部分信息发送到多个翻译系统。 响应于从信息的各个部分接收来自多个翻译系统的翻译结果,计算机系统组合各个部分的翻译结果以形成信息的综合翻译。

    METHOD AND APPARATUS FOR FAST TRANSLATION MEMORY SEARCH
    3.
    发明申请
    METHOD AND APPARATUS FOR FAST TRANSLATION MEMORY SEARCH 有权
    用于快速翻译记忆搜索的方法和装置

    公开(公告)号:US20130231916A1

    公开(公告)日:2013-09-05

    申请号:US13412104

    申请日:2012-03-05

    IPC分类号: G06F17/28

    CPC分类号: G06F17/2827 G06F17/2836

    摘要: Methods and systems for fast translation memory search include, in response to an input query string, identifying a plurality of hypothesis strings stored in a translation memory as candidates to match the query string. One or more candidates are eliminated, using a processor, where string lengths between the candidates and the query string are at least a cutoff value representing a string edit distance. One or more candidates are eliminated where differences in word frequency distributions between the candidates and the query string are at least the cutoff value. One or more candidates are eliminated by employing a dynamic programming matrix where string edit distances between the candidates and the query string are at least the cutoff value. A number of remaining candidates are outputted as matches to the query string.

    摘要翻译: 用于快速翻译存储器搜索的方法和系统包括响应于输入查询字符串,将存储在翻译存储器中的多个假设字符串识别为与查询字符串匹配的候选。 消除一个或多个候选者,使用处理器,候选者和查询字符串之间的串长度至少是表示字符串编辑距离的截止值。 删除一个或多个候选者,其中候选和查询字符串之间的字频率分布的差异至少为截止值。 通过采用动态编程矩阵来消除一个或多个候选者,其中候选和查询串之间的字符串编辑距离至少为截止值。 剩余的多个候选者作为匹配输出到查询字符串。

    Method and apparatus for fast translation memory search
    4.
    发明授权
    Method and apparatus for fast translation memory search 有权
    快速翻译记忆搜索的方法和装置

    公开(公告)号:US08874428B2

    公开(公告)日:2014-10-28

    申请号:US13412104

    申请日:2012-03-05

    IPC分类号: G06F17/28

    CPC分类号: G06F17/2827 G06F17/2836

    摘要: Methods and systems for fast translation memory search include, in response to an input query string, identifying a plurality of hypothesis strings stored in a translation memory as candidates to match the query string. One or more candidates are eliminated, using a processor, where string lengths between the candidates and the query string are at least a cutoff value representing a string edit distance. One or more candidates are eliminated where differences in word frequency distributions between the candidates and the query string are at least the cutoff value. One or more candidates are eliminated by employing a dynamic programming matrix where string edit distances between the candidates and the query string are at least the cutoff value. A number of remaining candidates are outputted as matches to the query string.

    摘要翻译: 用于快速翻译存储器搜索的方法和系统包括响应于输入查询字符串,将存储在翻译存储器中的多个假设字符串识别为与查询字符串匹配的候选。 消除一个或多个候选者,使用处理器,候选者和查询字符串之间的串长度至少是表示字符串编辑距离的截止值。 删除一个或多个候选者,其中候选和查询字符串之间的字频率分布的差异至少为截止值。 通过采用动态编程矩阵来消除一个或多个候选者,其中候选和查询串之间的字符串编辑距离至少为截止值。 剩余的多个候选者作为匹配输出到查询字符串。

    TRANSLATED SENTENCE QUALITY ESTIMATION
    5.
    发明申请
    TRANSLATED SENTENCE QUALITY ESTIMATION 审中-公开
    翻译质量评估

    公开(公告)号:US20120209590A1

    公开(公告)日:2012-08-16

    申请号:US13028555

    申请日:2011-02-16

    IPC分类号: G06F17/28

    CPC分类号: G06F17/2854 G06F17/2818

    摘要: A method, system, and computer readable storage medium including a computer readable program are provided. The method includes storing a set of sentences in a memory device. The method further includes receiving an input translated phrase and searching the set of sentences for a subset of sentences closest to the input translated phrase based on a set of respective distances to the sentences in the set with respect to the input translated phrase. The method also includes calculating and outputting a language model score for the subset of sentences based on a function of a subset of respective distances pertaining to the subset of sentences.

    摘要翻译: 提供了包括计算机可读程序的方法,系统和计算机可读存储介质。 该方法包括将一组句子存储在存储器件中。 该方法还包括接收输入翻译短语,并且基于相对于输入翻译短语的集合中的句子的相应距离的集合,搜索最接近输入翻译短语的语句的子集的句子集合。 该方法还包括基于与语句子集相关的相应距离的子集的函数来计算和输出语句子集的语言模型分数。

    Natural language system and method based on unisolated performance metric
    6.
    发明授权
    Natural language system and method based on unisolated performance metric 有权
    自然语言系统和基于非隔离性能度量的方法

    公开(公告)号:US07574358B2

    公开(公告)日:2009-08-11

    申请号:US11067819

    申请日:2005-02-28

    IPC分类号: G10L15/00

    CPC分类号: G10L15/18 G10L15/065

    摘要: A natural language business system and method is developed to understand the underlying meaning of a person's speech, such as during a transaction with the business system. The system includes a speech recognition engine, and action classification engine, and a control module. The control module causes the system to execute an inventive method wherein the speech recognition and action classification models may be recursively optimized on an unisolated performance metric that is pertinent to the overall performance of the natural language business system, as opposed to the isolated model-specific criteria previously employed.

    摘要翻译: 开发一种自然语言业务系统和方法,以了解一个人的言论的基本含义,例如在与业务系统的交易期间。 该系统包括语音识别引擎,动作分类引擎和控制模块。 控制模块使得系统执行本发明的方法,其中语音识别和动作分类模型可以针对与自然语言业务系统的整体性能相关的非隔离性能度量递归地优化,而不是孤立的模型特定 以前使用的标准。

    Method and apparatus for detecting data anomalies in statistical natural language applications
    9.
    发明申请
    Method and apparatus for detecting data anomalies in statistical natural language applications 审中-公开
    用于检测统计自然语言应用中的数据异常的方法和装置

    公开(公告)号:US20070016399A1

    公开(公告)日:2007-01-18

    申请号:US11179789

    申请日:2005-07-12

    IPC分类号: G06F17/28

    CPC分类号: G06F17/2818 G06K9/6284

    摘要: Techniques for detecting data anomalies in a natural language understanding (NLU) system are provided. A number of categorized sentences, categorized into a number of categories, are obtained. Sentences within a given one of the categories are clustered into a number of sub clusters, and the sub clusters are analyzed to identify data anomalies. The clustering can be based on surface forms of the sentences. The anomalies can be, for example, ambiguities or inconsistencies. The clustering can be performed, for example, with a K-means clustering algorithm.

    摘要翻译: 提供了一种用于检测自然语言理解(NLU)系统中数据异常的技术。 获得分类为多个类别的多个分类句子。 给定一个类别中的句子被聚集成多个子集群,并且分析子集群以识别数据异常。 聚类可以基于句子的表面形式。 异常可能是例如歧义或矛盾。 可以使用K-means聚类算法来执行聚类。