-
公开(公告)号:US08738356B2
公开(公告)日:2014-05-27
申请号:US13110484
申请日:2011-05-18
申请人: Hisami Suzuki , Vikram Dendi , Christopher Brian Quirk , Pallavi Choudhury , Jianfeng Gao , Achraf Chalabi
发明人: Hisami Suzuki , Vikram Dendi , Christopher Brian Quirk , Pallavi Choudhury , Jianfeng Gao , Achraf Chalabi
IPC分类号: G06F17/28
CPC分类号: G06F17/27
摘要: The universal text input technique described herein addresses the difficulties of typing text in various languages and scripts, and offers a unified solution, which combines character conversion, next word prediction, spelling correction and automatic script switching to make it extremely simple to type any language from any device. The technique provides a rich and seamless input experience in any language through a universal IME (input method editor). It allows a user to type in any script for any language using a regular qwerty keyboard via phonetic input and at the same time allows for auto-completion and spelling correction of words and phrases while typing. The technique also provides a modeless input that automatically turns on and off an input mode that changes between different types of script.
摘要翻译: 本文描述的通用文本输入技术解决了以各种语言和脚本输入文本的困难,并提供了一种统一的解决方案,它将字符转换,下一个字预测,拼写校正和自动脚本切换相结合,使其非常简单, 任何设备。 该技术通过通用IME(输入法编辑器)为任何语言提供了丰富且无缝的输入体验。 它允许用户使用普通qwerty键盘通过语音输入为任何语言输入任何脚本,同时允许在打字时自动完成和拼写校正单词和短语。 该技术还提供了无模式输入,可自动打开和关闭在不同类型脚本之间进行更改的输入模式。
-
公开(公告)号:US20080301121A1
公开(公告)日:2008-12-04
申请号:US11807410
申请日:2007-05-29
申请人: Hisami Suzuki , Satoshi Sekine
发明人: Hisami Suzuki , Satoshi Sekine
IPC分类号: G06F17/30
CPC分类号: G06F17/30734
摘要: Methods are disclosed for acquiring ontological knowledge using query logs. In one embodiment, query logs are first utilized as a basis for identifying important contexts associated with terms belonging to a semantic category. Then, those contexts are as a basis for identifying new terms belonging to the same category or, in another embodiment, as a basis for removing extraneous or obsolete terms identified as being in the same category.
摘要翻译: 公开了使用查询日志获取本体知识的方法。 在一个实施例中,首先将查询日志用作识别与属于语义类别的术语相关联的重要上下文的基础。 然后,这些背景作为确定属于相同类别的新术语的基础,或者在另一实施例中,作为用于去除被识别为相同类别的外来或过时术语的基础。
-
公开(公告)号:US20070282596A1
公开(公告)日:2007-12-06
申请号:US11483943
申请日:2006-07-10
申请人: Hisami Suzuki , Kristina Toutanova
发明人: Hisami Suzuki , Kristina Toutanova
IPC分类号: G06F17/27
CPC分类号: G06F17/2775 , G06F17/2863
摘要: Grammatical element prediction is used to predict grammatical elements in text fragments (such as phrases or sentences). In one embodiment, a statistical model, using syntax features, is used to predict grammatical elements.
摘要翻译: 语法元素预测用于预测文本片段(如短语或句子)中的语法元素。 在一个实施例中,使用语法特征的统计模型用于预测语法元素。
-
公开(公告)号:US20070282590A1
公开(公告)日:2007-12-06
申请号:US11504285
申请日:2006-08-15
申请人: Hisami Suzuki , Kristina Toutanova
发明人: Hisami Suzuki , Kristina Toutanova
IPC分类号: G06F17/28
CPC分类号: G06F17/2818 , G06F17/274
摘要: Grammatical element prediction is used in the context of machine translation. Features from both the source language and the target language sentences (or other text fragments) are used in predicting the grammatical elements.
摘要翻译: 语法元素预测用于机器翻译的上下文。 源语言和目标语言句子(或其他文本片段)的特征用于预测语法元素。
-
公开(公告)号:US07290005B2
公开(公告)日:2007-10-30
申请号:US10979626
申请日:2004-11-02
IPC分类号: G06F17/30
CPC分类号: G06F17/30684 , B42D15/00 , B42D15/0053 , Y10S707/917 , Y10S707/954 , Y10S707/956 , Y10S707/99932 , Y10S707/99933 , Y10S707/99934 , Y10S707/99935 , Y10S707/99936 , Y10S707/99942 , Y10S707/99943 , Y10S707/99944 , Y10S707/99945
摘要: A system is utilized for determining a relationship between first and second textual inputs. The system identifies constituents in the first textual input, having predetermined characteristics indicative of usefulness in determining the relationship. The relationship is then determined based on the constituents identified. The constituents can be eliminated from the first textual input, weighted in the first textual input, or simply annotated in one of a variety of ways.
-
公开(公告)号:US07290004B2
公开(公告)日:2007-10-30
申请号:US10979299
申请日:2004-11-02
IPC分类号: G06F17/30
CPC分类号: G06F17/30684 , B42D15/00 , B42D15/0053 , Y10S707/917 , Y10S707/954 , Y10S707/956 , Y10S707/99932 , Y10S707/99933 , Y10S707/99934 , Y10S707/99935 , Y10S707/99936 , Y10S707/99942 , Y10S707/99943 , Y10S707/99944 , Y10S707/99945
摘要: A system is utilized for determining a relationship between first and second textual inputs. The system identifies constituents in the first textual input, having predetermined characteristics indicative of usefulness in determining the relationship. The relationship is then determined based on the constituents identified. The constituents can be eliminated from the first textual input, weighted in the first textual input, or simply annotated in one of a variety of ways.
-
公开(公告)号:US20060247912A1
公开(公告)日:2006-11-02
申请号:US11115498
申请日:2005-04-27
申请人: Hisami Suzuki , Jianfeng Gao
发明人: Hisami Suzuki , Jianfeng Gao
IPC分类号: G06F17/20
CPC分类号: G06F17/2211 , G06F11/3616
摘要: A method and apparatus for generating a score for a system that generates text is provided. The method and apparatus identify errors in the text generated by the system and identify errors in a second text generated by a second system. The number of errors that are generated by the system but not generated by the second system is divided by the number of errors that are generated by the second system but not by the system to generate the score.
摘要翻译: 提供了一种用于生成用于生成文本的系统的得分的方法和装置。 所述方法和装置识别由系统生成的文本中的错误并识别由第二系统生成的第二文本中的错误。 由系统生成但不由第二系统生成的错误的数量除以第二系统生成的错误的数量,而不是系统生成的分数的错误数。
-
公开(公告)号:US20050043936A1
公开(公告)日:2005-02-24
申请号:US10957973
申请日:2004-10-04
CPC分类号: G06F17/30684 , B42D15/00 , B42D15/0053 , Y10S707/917 , Y10S707/954 , Y10S707/956 , Y10S707/99932 , Y10S707/99933 , Y10S707/99934 , Y10S707/99935 , Y10S707/99936 , Y10S707/99942 , Y10S707/99943 , Y10S707/99944 , Y10S707/99945
摘要: A system is utilized for determining a relationship between first and second textual inputs. The system identifies constituents in the first textual input, having predetermined characteristics indicative of usefulness in determining the relationship. The relationship is then determined based on the constituents identified. The constituents can be eliminated from the first textual input, weighted in the first textual input, or simply annotated in one of a variety of ways.
-
公开(公告)号:US5963893A
公开(公告)日:1999-10-05
申请号:US672638
申请日:1996-06-28
CPC分类号: G06F17/2755 , G06F17/274 , G06F17/277 , G06F17/2863
摘要: A word breaking facility operates to identify words within a Japanese text string. The word breaking facility performs morphological processing to identify postfix bound morphemes and prefix bound morphemes. The word breaking facility also performs opheme matching to identify likely stem characters. A scoring heuristic is applied to determine an optimal analysis that includes a postfix analysis, a stem analysis, and a prefix analysis. The morphological analyses are stored in an efficient compressed format to minimize the amount of memory they occupy and maximize the analysis speed. The morphological analyses of postfixes, stems, and prefixes is performed in a right-to-left fashion. The word breaking facility may be used in applications that demand identity of selection granularity, autosummarization applications, content indexing applications, and natural language processing applications.
-
公开(公告)号:US20120296627A1
公开(公告)日:2012-11-22
申请号:US13110484
申请日:2011-05-18
申请人: Hisami Suzuki , Vikram Dendi , Christopher Brian Quirk , Pallavi Choudhury , Jianfeng Gao , Achraf Chalabi
发明人: Hisami Suzuki , Vikram Dendi , Christopher Brian Quirk , Pallavi Choudhury , Jianfeng Gao , Achraf Chalabi
IPC分类号: G06F17/27
CPC分类号: G06F17/27
摘要: The universal text input technique described herein addresses the difficulties of typing text in various languages and scripts, and offers a unified solution, which combines character conversion, next word prediction, spelling correction and automatic script switching to make it extremely simple to type any language from any device. The technique provides a rich and seamless input experience in any language through a universal IME (input method editor). It allows a user to type in any script for any language using a regular qwerty keyboard via phonetic input and at the same time allows for auto-completion and spelling correction of words and phrases while typing. The technique also provides a modeless input that automatically turns on and off an input mode that changes between different types of script.
摘要翻译: 本文描述的通用文本输入技术解决了以各种语言和脚本输入文本的困难,并提供了一种统一的解决方案,它将字符转换,下一个字预测,拼写校正和自动脚本切换相结合,使其非常简单, 任何设备。 该技术通过通用IME(输入法编辑器)为任何语言提供了丰富且无缝的输入体验。 它允许用户使用普通qwerty键盘通过语音输入为任何语言输入任何脚本,同时允许在打字时自动完成和拼写校正单词和短语。 该技术还提供了无模式输入,可自动打开和关闭在不同类型脚本之间进行更改的输入模式。
-
-
-
-
-
-
-
-
-