Acquiring ontological knowledge from query logs
    7.
    发明授权
    Acquiring ontological knowledge from query logs 有权
    从查询日志获取本体知识

    公开(公告)号:US08051056B2

    公开(公告)日:2011-11-01

    申请号:US11807410

    申请日:2007-05-29

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30734

    摘要: Methods are disclosed for acquiring ontological knowledge using query logs. In one embodiment, query logs are first utilized as a basis for identifying important contexts associated with terms belonging to a semantic category. Then, those contexts are as a basis for identifying new terms belonging to the same category or, in another embodiment, as a basis for removing extraneous or obsolete terms identified as being in the same category.

    摘要翻译: 公开了使用查询日志获取本体知识的方法。 在一个实施例中,首先将查询日志用作识别与属于语义类别的术语相关联的重要上下文的基础。 然后,这些背景作为确定属于相同类别的新术语的基础,或者在另一实施例中,作为用于去除被识别为相同类别的外来或过时术语的基础。

    Identification of words in Japanese text by a computer system
    10.
    发明授权
    Identification of words in Japanese text by a computer system 失效
    通过计算机系统识别日语文本中的单词

    公开(公告)号:US5946648A

    公开(公告)日:1999-08-31

    申请号:US121655

    申请日:1998-07-24

    IPC分类号: G06F17/27 G06F17/28

    摘要: A word breaking facility operates to identify words within a Japanese text string. The word breaking facility performs morphological processing to identify postfix bound morphemes and prefix bound morphemes. The word breaking facility also performs opheme matching to identify likely stem characters. A scoring heuristic is applied to determine an optimal analysis that includes a postfix analysis, a stem analysis, and a prefix analysis. The morphological analyses are stored in an efficient compressed format to minimize the amount of memory they occupy and maximize the analysis speed. The morphological analyses of postfixes, stems, and prefixes is performed in a right-to-left fashion. The word breaking facility may be used in applications that demand identity of selection granularity, autosummarization applications, content indexing applications, and natural language processing applications.

    摘要翻译: 单词断开设施用于识别日语文本字符串中的单词。 词突破设施执行形态处理以识别后缀绑定语素和前缀绑定语素。 单词破解工具还执行opheme匹配以识别可能的字符字符。 应用得分启发式来确定包括后缀分析,茎分析和前缀分析的最佳分析。 形态分析以有效的压缩格式存储,以最小化其占用的内存量并最大化分析速度。 后缀,茎和前缀的形态学分析以从右到左的方式进行。 单词破解工具可用于需要选择粒度,自动归类应用程序,内容索引应用程序和自然语言处理应用程序的身份的应用程序。