Identification of words in Japanese text by a computer system

发明授权

US5963893A Identification of words in Japanese text by a computer system 失效

请登陆查看更多内容

专利标题： Identification of words in Japanese text by a computer system
申请号： US672638

申请日： 1996-06-28
公开(公告)号： US5963893A

公开(公告)日： 1999-10-05
发明人: Patrick H. Halstead, Jr. , Hisami Suzuki
申请人： Patrick H. Halstead, Jr. , Hisami Suzuki
申请人地址： WA Redmond
专利权人： Microsoft Corporation
当前专利权人： Microsoft Corporation
当前专利权人地址： WA Redmond
主分类号： G06F17/27
IPC分类号： G06F17/27 ; G06F17/28

Identification of words in Japanese text by a computer system

摘要：

A word breaking facility operates to identify words within a Japanese text string. The word breaking facility performs morphological processing to identify postfix bound morphemes and prefix bound morphemes. The word breaking facility also performs opheme matching to identify likely stem characters. A scoring heuristic is applied to determine an optimal analysis that includes a postfix analysis, a stem analysis, and a prefix analysis. The morphological analyses are stored in an efficient compressed format to minimize the amount of memory they occupy and maximize the analysis speed. The morphological analyses of postfixes, stems, and prefixes is performed in a right-to-left fashion. The word breaking facility may be used in applications that demand identity of selection granularity, autosummarization applications, content indexing applications, and natural language processing applications.

公开/授权文献

US5123561A Closure with tamper-evident tear-off panel 公开/授权日：1992-06-23

信息查询

Global Dossier Espacenet