Method and apparatus for automatic detection of spelling errors in one or more documents
    1.
    发明授权
    Method and apparatus for automatic detection of spelling errors in one or more documents 有权
    用于自动检测一个或多个文档中的拼写错误的方法和装置

    公开(公告)号:US09465791B2

    公开(公告)日:2016-10-11

    申请号:US11673173

    申请日:2007-02-09

    IPC分类号: G06F17/27

    CPC分类号: G06F17/2735 G06F17/273

    摘要: Methods and apparatus are provided for automatically detecting spelling errors in one or more documents, such as documents being processed for the creation of a lexicon According to one aspect of the invention, a spelling error is detected in one or more documents by determining if at least one given word in the one or more documents satisfies a predefined misspelling criteria, wherein the predefined misspelling criteria comprises the at least one given word having a frequency below a predefined low threshold and the at least one given word being within a predefined edit distance of one or more other words in the one or more documents having a frequency above a predefined high threshold; and identifying a given word as a potentially misspelled word if the given word satisfies the predefined misspelling criteria.

    摘要翻译: 提供了用于自动检测一个或多个文档中的拼写错误的方法和装置,例如为了创建词典而被处理的文档。根据本发明的一个方面,通过确定是否至少在一个或多个文档中检测到拼写错误 一个或多个文档中的一个给定的单词满足预定义的拼写错误标准,其中预定义拼错标准包括具有低于预定义低阈值的频率的至少一个给定单词,并且至少一个给定单词在预定义的编辑距离之内 或更多的其他单词在一个或多个文档中具有高于预定义的高阈值的频率; 并且如果给定的单词满足预定义的拼写标准,则将给定的单词识别为潜在的拼写错误的单词。

    Method and Apparatus for Automatic Detection of Spelling Errors in One or More Documents
    2.
    发明申请
    Method and Apparatus for Automatic Detection of Spelling Errors in One or More Documents 有权
    一种或多种文件中自动检测拼写错误的方法和装置

    公开(公告)号:US20080195940A1

    公开(公告)日:2008-08-14

    申请号:US11673173

    申请日:2007-02-09

    IPC分类号: G06F17/00 G06F17/27 B41J5/30

    CPC分类号: G06F17/2735 G06F17/273

    摘要: Methods and apparatus are provided for automatically detecting spelling errors in one or more documents, such as documents being processed for the creation of a lexicon According to one aspect of the invention, a spelling error is detected in one or more documents by determining if at least one given word in the one or more documents satisfies a predefined misspelling criteria, wherein the predefined misspelling criteria comprises the at least one given word having a frequency below a predefined low threshold and the at least one given word being within a predefined edit distance of one or mote other words in the one or more documents having a frequency above a predefined high threshold; and identifying a given word as a potentially misspelled word if the given word satisfies the predefined misspelling criteria

    摘要翻译: 提供了用于自动检测一个或多个文档中的拼写错误的方法和装置,例如为了创建词典而被处理的文档。根据本发明的一个方面,通过确定是否至少在一个或多个文档中检测到拼写错误 一个或多个文档中的一个给定的单词满足预定义的拼写错误标准,其中预定义拼错标准包括具有低于预定义低阈值的频率的至少一个给定单词,并且至少一个给定单词在一个预定义的编辑距离内 或将具有高于预定义高阈值的频率的一个或多个文档中的其他单词粉碎; 并且如果给定的单词满足预定义的拼写标准,则将给定的单词识别为潜在的拼写错误的单词

    Methods and apparatus for performing spelling corrections using one or more variant hash tables
    3.
    发明申请
    Methods and apparatus for performing spelling corrections using one or more variant hash tables 有权
    使用一个或多个变体哈希表进行拼写校正的方法和装置

    公开(公告)号:US20080059876A1

    公开(公告)日:2008-03-06

    申请号:US11513782

    申请日:2006-08-31

    IPC分类号: G06F17/00

    摘要: Methods and apparatus are provided for performing spelling corrections using one or more variant hash tables. The spelling of at least one candidate word is corrected by obtaining at least one variant dictionary hash table based on variants of a set of known correctly spelled words, wherein the variants are obtained by applying one or more of a deletion, insertion, replacement, and transposition operation on the correctly spelled words; obtaining from the candidate word one or more lookup variants using one or more of the deletion, insertion, replacement, and transposition operations; evaluating one or more of the candidate word and the lookup variants against the at least one variant dictionary hash table; and indicating a candidate correction if there is at least one match in the at least one variant dictionary hash table.

    摘要翻译: 提供了使用一个或多个变体散列表来执行拼写校正的方法和装置。 通过基于一组已知的正确拼写的单词的变体获得至少一个变体词典散列表来校正至少一个候选词的拼写,其中通过应用删除,插入,替换和 对正确拼写词的转置操作; 使用删除,插入,替换和替换操作中的一个或多个从候选词获得一个或多个查找变体; 根据所述至少一个变体字典哈希表评估候选词和查找变体中的一个或多个; 并且如果在所述至少一个变体字典散列表中存在至少一个匹配,则指示候选者校正。

    Methods and apparatus for performing spelling corrections using one or more variant hash tables
    4.
    发明授权
    Methods and apparatus for performing spelling corrections using one or more variant hash tables 有权
    使用一个或多个变体哈希表进行拼写校正的方法和装置

    公开(公告)号:US09552349B2

    公开(公告)日:2017-01-24

    申请号:US11513782

    申请日:2006-08-31

    IPC分类号: G06F17/00 G06F17/27

    摘要: Methods and apparatus are provided for performing spelling corrections using one or more variant hash tables. The spelling of at least one candidate word is corrected by obtaining at least one variant dictionary hash table based on variants of a set of known correctly spelled words, wherein the variants are obtained by applying one or more of a deletion, insertion, replacement, and transposition operation on the correctly spelled words; obtaining from the candidate word one or more lookup variants using one or more of the deletion, insertion, replacement, and transposition operations; evaluating one or more of the candidate word and the lookup variants against the at least one variant dictionary hash table; and indicating a candidate correction if there is at least one match in the at least one variant dictionary hash table.

    摘要翻译: 提供了使用一个或多个变体散列表来执行拼写校正的方法和装置。 通过基于一组已知的正确拼写的单词的变体获得至少一个变体字典散列表来校正至少一个候选词的拼写,其中通过应用删除,插入,替换和 对正确拼写词的转置操作; 使用删除,插入,替换和替换操作中的一个或多个从候选词获得一个或多个查找变体; 根据所述至少一个变体字典哈希表评估候选词和查找变体中的一个或多个; 并且如果在所述至少一个变体字典散列表中存在至少一个匹配,则指示候选者校正。

    Two step method for correcting spelling of a word or phrase in a document
    7.
    发明授权
    Two step method for correcting spelling of a word or phrase in a document 有权
    纠正文字中单词或短语拼写的两步法

    公开(公告)号:US06616704B1

    公开(公告)日:2003-09-09

    申请号:US09665897

    申请日:2000-09-20

    IPC分类号: G06F1721

    CPC分类号: G06F17/273

    摘要: A very fast method for correcting the spelling of a word or phrase in a document proceeds in two steps: first applying a very fast approximate method for eliminating most candidate words from consideration (without computing the exact edit distance between the given word whose spelling is to be corrected and any candidate word), followed by a “slow method” which computes the exact edit distance between the word whose spelling is to be corrected and each of the few remaining candidate words. The combination results in a method that is almost as fast as the fast approximate method and as exact as the slow method.

    摘要翻译: 用于纠正文档中的单词或短语的拼写的非常快速的方法分为两个步骤:首先应用非常快速的近似方法来消除考虑中的大多数候选词(不计算给定单词的精确编辑距离,其拼写是 被修正和任何候选词),然后是“慢法”,其计算要修正其拼写的单词和几个剩余的候选词之间的精确编辑距离。 该组合产生的方法几乎与快速近似方法一样快,并且与慢速方法一样精确。

    Page-ranking method and system
    8.
    发明申请
    Page-ranking method and system 有权
    页面排名方法和系统

    公开(公告)号:US20070219993A1

    公开(公告)日:2007-09-20

    申请号:US11377413

    申请日:2006-03-17

    IPC分类号: G06F7/00

    摘要: A page-ranking method includes mining a portion of content of a user workstation which is connectable to a network to detect references to pages of the network. The pages may be ranked based on the detected references.

    摘要翻译: 页面排序方法包括挖掘可连接到网络的用户工作站的一部分内容以检测对网页的引用。 可以基于检测到的参考来对页面进行排名。