Publication No.: US20060048055A1
Publication date: 2006-03-02
Application No.: US10928131
Filing date: 2004-08-25
Applicant: Jun Wu, Liren Chen
Inventor: Jun Wu, Liren Chen
IPC classification: G06F17/24
CPC classification: G06F17/273
Abstract: Fault-tolerant systems and methods for processing and correcting input spelling errors in non-Roman-based languages such as Chinese, Japanese, and Korean (CJK) are disclosed. The method may be applied to a Chinese input method using pinyin. For example, the method may generally include: receiving a pinyin input representing Chinese characters, the input having at least one original pinyin; identifying potentially incorrect pinyins in the input; expanding each potentially incorrect pinyin to at least one alternative pinyin, each pair of potentially incorrect pinyin and corresponding alternative pinyin having a proximity measurement; converting each pinyin in the input and each alternative pinyin to Chinese characters; computing likelihoods of possible conversions of the pinyin input to Chinese characters, each possible conversion being a combination of the converted original and/or alternative pinyins, the likelihoods being based on the proximity measurement and optionally on the context of the possible conversion; and determining the most likely Chinese conversion from the possible conversions.
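The steps listed in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the lexicon, the confusion table, the proximity values, the bigram context scores, and the function names `expand` and `convert` are all hypothetical and chosen only for illustration. The sketch treats any syllable that has an entry in the confusion table as potentially incorrect, and it ranks conversions by an unnormalized score rather than a true probability.

```python
from itertools import product

# Hypothetical toy data; none of these values come from the patent.
# Pinyin syllable -> candidate Chinese characters with a unigram weight.
LEXICON = {
    "zhong": {"中": 0.6, "种": 0.4},
    "zong": {"总": 0.7, "宗": 0.3},
    "guo": {"国": 0.8, "过": 0.2},
}

# Assumed confusion table: syllable -> {alternative: proximity in (0, 1]}.
ALTERNATIVES = {
    "zong": {"zhong": 0.5},
    "zhong": {"zong": 0.5},
}

# Assumed context scores for adjacent character pairs (default 1.0).
BIGRAM = {("中", "国"): 5.0}


def expand(syllable):
    """Return {candidate syllable: proximity}; the original gets 1.0."""
    candidates = {syllable: 1.0}
    candidates.update(ALTERNATIVES.get(syllable, {}))
    return candidates


def convert(pinyin_input):
    """Return (best conversion, score) for space-separated pinyin."""
    per_position = []
    for syllable in pinyin_input.split():
        options = []
        for candidate, proximity in expand(syllable).items():
            for char, weight in LEXICON.get(candidate, {}).items():
                # Score of one character choice: proximity times weight.
                options.append((char, proximity * weight))
        per_position.append(options)

    best, best_score = None, 0.0
    # Enumerate every combination of original and alternative readings.
    for combo in product(*per_position):
        chars = [char for char, _ in combo]
        score = 1.0
        for _, char_score in combo:
            score *= char_score
        # Optional context term over adjacent characters.
        for pair in zip(chars, chars[1:]):
            score *= BIGRAM.get(pair, 1.0)
        if score > best_score:
            best, best_score = "".join(chars), score
    return best, best_score
```

With this toy data, `convert("zong guo")` selects 中国: the alternative reading "zhong" is penalized by its proximity value, but the context score for 中国 outweighs that penalty. Exhaustive enumeration is used only to keep the sketch short; a practical system would more likely use dynamic programming over the candidate lattice.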