Method and Apparatus for Automatic Detection of Spelling Errors in One or More Documents
    2.
    Patent application
    Method and Apparatus for Automatic Detection of Spelling Errors in One or More Documents (legal status: in force)

    Publication No.: US20080195940A1

    Publication date: 2008-08-14

    Application No.: US11673173

    Filing date: 2007-02-09

    IPC classification: G06F17/00 G06F17/27 B41J5/30

    CPC classification: G06F17/2735 G06F17/273

    Abstract: Methods and apparatus are provided for automatically detecting spelling errors in one or more documents, such as documents being processed for the creation of a lexicon. According to one aspect of the invention, a spelling error is detected in one or more documents by determining if at least one given word in the one or more documents satisfies a predefined misspelling criteria, wherein the predefined misspelling criteria comprises the at least one given word having a frequency below a predefined low threshold and the at least one given word being within a predefined edit distance of one or more other words in the one or more documents having a frequency above a predefined high threshold; and identifying a given word as a potentially misspelled word if the given word satisfies the predefined misspelling criteria.


    Method and apparatus for automatic detection of spelling errors in one or more documents
    3.
    Granted patent
    Method and apparatus for automatic detection of spelling errors in one or more documents (legal status: in force)

    Publication No.: US09465791B2

    Publication date: 2016-10-11

    Application No.: US11673173

    Filing date: 2007-02-09

    IPC classification: G06F17/27

    CPC classification: G06F17/2735 G06F17/273

    Abstract: Methods and apparatus are provided for automatically detecting spelling errors in one or more documents, such as documents being processed for the creation of a lexicon. According to one aspect of the invention, a spelling error is detected in one or more documents by determining if at least one given word in the one or more documents satisfies a predefined misspelling criteria, wherein the predefined misspelling criteria comprises the at least one given word having a frequency below a predefined low threshold and the at least one given word being within a predefined edit distance of one or more other words in the one or more documents having a frequency above a predefined high threshold; and identifying a given word as a potentially misspelled word if the given word satisfies the predefined misspelling criteria.

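The misspelling criterion described in the abstract (a rare word lying within a small edit distance of a frequent word) can be illustrated with a minimal Python sketch. This is not the patented implementation: the function names, the tokenizer, the default threshold values, and the choice of Levenshtein distance as the edit-distance measure are all assumptions made for illustration, since the abstract only says the thresholds and the edit distance are "predefined".

```python
import re
from collections import Counter


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance via the standard two-row dynamic program.

    Assumption: the abstract does not name a specific edit-distance measure.
    """
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def find_potential_misspellings(documents, low_threshold=2,
                                high_threshold=5, max_distance=1):
    """Map each potentially misspelled word to its frequent near neighbours.

    The threshold defaults are hypothetical placeholder values.
    """
    # Word frequencies across all documents (simple lowercase tokenizer).
    counts = Counter(
        word
        for doc in documents
        for word in re.findall(r"[a-z]+", doc.lower())
    )
    # Words whose frequency is above the high threshold.
    frequent = [w for w, c in counts.items() if c > high_threshold]

    flagged = {}
    for word, count in counts.items():
        if count >= low_threshold:
            continue  # not rare enough to be a candidate
        neighbours = sorted(
            w for w in frequent
            if w != word and edit_distance(word, w) <= max_distance
        )
        if neighbours:
            flagged[word] = neighbours
    return flagged


if __name__ == "__main__":
    docs = ["the lexicon is large"] * 6 + ["the lexicom is large"]
    print(find_potential_misspellings(docs))  # {'lexicom': ['lexicon']}
```

In the usage example, "lexicom" occurs once (below the low threshold of 2) and is one substitution away from "lexicon", which occurs six times (above the high threshold of 5), so it is flagged as a potentially misspelled word.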