Linguistic error detection
    1.
    Granted invention patent
    Linguistic error detection (in force)

    Publication No.: US08855997B2

    Publication date: 2014-10-07

    Application No.: US13193248

    Filing date: 2011-07-28

    IPC classification: G06F17/27

    Abstract: Potential linguistic errors within a sequence of words of a sentence are identified based on analysis of a configurable sliding window. The analysis is performed based on an assumption that if a sequence of words occurs frequently enough within a large, well-formed corpus, its joint probability of occurring in a sentence is very likely to be greater than that of the same words randomly ordered.
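
The assumption in this abstract can be illustrated with a minimal sketch. It is not the patented method: the function names `ngram_counts` and `flag_windows`, the toy corpus, the use of raw n-gram counts in place of joint probabilities, and the rule "flag a window when some reordering of its words is more frequent in the corpus" are all assumptions made for illustration. The window size `n` stands in for the configurable sliding window.

```python
from collections import Counter
from itertools import permutations


def ngram_counts(corpus_sentences, n):
    """Count every contiguous n-word sequence in a list of tokenized sentences."""
    counts = Counter()
    for tokens in corpus_sentences:
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    return counts


def flag_windows(tokens, counts, n):
    """Return (start_index, window) pairs whose word order looks suspicious.

    Assumed rule: a window is flagged when some reordering of the same
    words occurs more often in the corpus than the order actually used.
    """
    flagged = []
    for i in range(len(tokens) - n + 1):
        window = tuple(tokens[i:i + n])
        observed = counts[window]  # Counter returns 0 for unseen n-grams
        best_reordered = max(
            (counts[p] for p in permutations(window) if p != window),
            default=0,  # every permutation equals the window, e.g. repeated words
        )
        if best_reordered > observed:
            flagged.append((i, window))
    return flagged


# Toy corpus standing in for a "large, well-formed corpus" (hypothetical data).
corpus = [
    "the cat sat on the mat".split(),
    "the dog sat on the rug".split(),
]
counts = ngram_counts(corpus, n=3)

print(flag_windows("the cat on sat the mat".split(), counts, n=3))
# [(1, ('cat', 'on', 'sat')), (2, ('on', 'sat', 'the'))]
print(flag_windows("the cat sat on the mat".split(), counts, n=3))
# []
```

Enumerating all permutations grows factorially with the window size, so this sketch is only practical for small `n`. A realistic system would presumably use smoothed probabilities from a much larger corpus.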


    TECHNIQUES FOR DATA AGGREGATION, ANALYSIS, AND DISTRIBUTION
    3.
    Invention application
    TECHNIQUES FOR DATA AGGREGATION, ANALYSIS, AND DISTRIBUTION (under examination, published)

    Publication No.: US20100185631A1

    Publication date: 2010-07-22

    Application No.: US12355806

    Filing date: 2009-01-19

    IPC classification: G06F17/30

    Abstract: Various technologies and techniques are disclosed for aggregating and using data collected from multiple computers to modify a later behavior of those computers. In one implementation, a data aggregation system is described. A data collector is operable to collect behavior data over a network from one or more applications used by the computers, and to save the behavior data to a data store. A data installer is operable to access the behavior data in the data store and convert the behavior data into a format that will modify a future operation of at least one of the applications that is used on at least one of the computers. A method for creating and distributing a custom dictionary from data collected from multiple computers is described. A method for identifying related documents from data collected from multiple computers is also described.
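
The collector, data store, and installer roles for the custom-dictionary case can be sketched as follows. The abstract does not say what behavior data is collected, how it is stored, or what format the installer produces. The names `DataStore`, `collect`, and `build_custom_dictionary`, the choice of "words users added to a local dictionary" as the behavior data, the `min_computers` threshold, and the newline-separated output format are therefore all hypothetical.

```python
from collections import defaultdict


class DataStore:
    """In-memory stand-in for the data store (assumption: a real one is networked)."""

    def __init__(self):
        # Maps each normalized word to the set of computers that reported it.
        self.reporters = defaultdict(set)


def collect(store, computer_id, added_words):
    """Collector step: record words a user added to a local dictionary."""
    for word in added_words:
        store.reporters[word.lower()].add(computer_id)


def build_custom_dictionary(store, min_computers=2):
    """Installer step: convert the stored data into a newline-separated word list.

    Only words reported by at least `min_computers` distinct computers are kept.
    """
    words = sorted(
        word for word, ids in store.reporters.items() if len(ids) >= min_computers
    )
    return "\n".join(words)


store = DataStore()
collect(store, "pc-1", ["Kubernetes", "foo"])
collect(store, "pc-2", ["kubernetes", "bar"])
collect(store, "pc-3", ["bar", "Kubernetes"])

print(build_custom_dictionary(store))
# bar
# kubernetes
```

Requiring agreement from several computers is one plausible way to keep one-off typos out of the shared dictionary. The source does not state which filtering rule, if any, is used.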
