Information search using knowledge agents
    1.
    发明授权
    Information search using knowledge agents 有权
    信息搜索使用知识代理

    公开(公告)号:US06636848B1

    公开(公告)日:2003-10-21

    申请号:US09610705

    申请日:2000-07-06

    IPC分类号: G06F1730

    摘要: A method for searching a corpus of documents, such as the World Wide Web, includes defining a knowledge domain and identifying a set of reference documents in the corpus pertinent to the domain. Upon inputting a query, the corpus is searched using the set of reference documents to find one or more of the documents in the corpus that contain information in the domain relevant to the query. The set of reference documents is updated with the found documents that are most relevant to the domain. The updated set is used in searching the corpus for information in the domain relevant to subsequent queries.

    摘要翻译: 用于搜索诸如万维网的文档语料库的方法包括定义知识域并且在与域相关的语料库中标识一组参考文档。 在输入查询后,使用一组参考文档搜索语料库,以找到语料库中包含与查询相关的域中的信息的一个或多个文档。 参考文档的集合将更新为与域最相关的找到的文档。 更新的集合用于在语料库中搜索与后续查询相关的域中的信息。

    Information search using knowledge agents
    3.
    发明授权
    Information search using knowledge agents 有权
    信息搜索使用知识代理

    公开(公告)号:US07809708B2

    公开(公告)日:2010-10-05

    申请号:US11923676

    申请日:2007-12-10

    IPC分类号: G06F17/30

    摘要: A method for searching a corpus of documents, such as the World Wide Web, includes defining a knowledge domain and identifying a set of reference documents in the corpus pertinent to the domain. Upon inputting a query, the corpus is searched using the set of reference documents to find one or more of the documents in the corpus that contain information in the domain relevant to the query. The set of reference documents is updated with the found documents that are most relevant to the domain. The updated set is used in searching the corpus for information in the domain relevant to subsequent queries.

    摘要翻译: 用于搜索诸如万维网的文档语料库的方法包括定义知识域并且在与该域相关的语料库中标识一组参考文档。 在输入查询后,使用一组参考文档搜索语料库,以找到语料库中包含与查询相关的域中的信息的一个或多个文档。 参考文档的集合将更新为与域最相关的找到的文档。 更新的集合用于在语料库中搜索与后续查询相关的域中的信息。

    Information search using knowledge agents
    4.
    发明授权
    Information search using knowledge agents 有权
    信息搜索使用知识代理

    公开(公告)号:US07318057B2

    公开(公告)日:2008-01-08

    申请号:US10634319

    申请日:2003-08-01

    IPC分类号: G06F17/30

    摘要: A method for searching a corpus of documents, such as the World Wide Web, includes defining a knowledge domain and identifying a set of reference documents in the corpus pertinent to the domain. Upon inputting a query, the corpus is searched using the set of reference documents to find one or more of the documents in the corpus that contain information in the domain relevant to the query. The set of reference documents is updated with the found documents that are most relevant to the domain. The updated set is used in searching the corpus for information in the domain relevant to subsequent queries.

    摘要翻译: 用于搜索诸如万维网的文档语料库的方法包括定义知识域并且在与域相关的语料库中标识一组参考文档。 在输入查询后,使用一组参考文档搜索语料库,以找到语料库中包含与查询相关的域中的信息的一个或多个文档。 参考文档的集合将更新为与域最相关的找到的文档。 更新的集合用于在语料库中搜索与后续查询相关的域中的信息。

    Morphological disambiguation
    6.
    发明授权
    Morphological disambiguation 失效
    形态消歧

    公开(公告)号:US07072827B1

    公开(公告)日:2006-07-04

    申请号:US09606326

    申请日:2000-06-29

    IPC分类号: G06F17/27

    摘要: A method for morphological disambiguation includes receiving an input string and morphologically analyzing the string to generate a list of candidate analyses of the string, each candidate analysis including a respective word and a linguistic pattern of the word. The pattern of each of the analyses is evaluated against a predefined criterion in order to select one or more of the analyses from the list. The method is suitable particularly for computerized analysis and searching in Hebrew and other Semitic languages.

    摘要翻译: 用于形态消歧的方法包括接收输入字符串和形态地分析字符串以生成字符串的候选分析列表,每个候选分析包括单词的单词和语言模式。 根据预定义的标准评估每个分析的模式,以从列表中选择一个或多个分析。 该方法特别适用于希伯来语和其他闪族语言的计算机化分析和搜索。

    System, method and computer program product for performing unstructured information management and automatic text analysis, including a search operator functioning as a weighted and (WAND)
    7.
    发明授权
    System, method and computer program product for performing unstructured information management and automatic text analysis, including a search operator functioning as a weighted and (WAND) 有权
    用于执行非结构化信息管理和自动文本分析的系统,方法和计算机程序产品,包括用作加权的搜索运算符和(WAND)

    公开(公告)号:US07512602B2

    公开(公告)日:2009-03-31

    申请号:US11607080

    申请日:2006-11-30

    IPC分类号: G06F17/30

    摘要: Disclosed is a system architecture, components and a searching technique for an Unstructured Information Management System (UIMS). The UIMS may be provided as middleware for the effective management and interchange of unstructured information over a wide array of information sources. The architecture generally includes a search engine, data storage, analysis engines containing pipelined document annotators and various adapters. The searching technique makes use of a two-level searching technique. A search query includes a search operator containing of a plurality of search sub-expressions each having an associated weight value. The search engine returns a document or documents having a weight value sum that exceeds a threshold weight value sum. The search operator is implemented as a Boolean predicate that functions as a Weighted AND (WAND).

    摘要翻译: 公开了一种用于非结构化信息管理系统(UIMS)的系统架构,组件和搜索技术。 UIMS可以作为中间件提供,用于通过广泛的信息源有效地管理和交换非结构化信息。 该架构通常包括搜索引擎,数据存储,包含流水线文档注释器和各种适配器的分析引擎。 搜索技术利用了两级搜索技术。 搜索查询包括包含多个搜索子表达式的搜索运算符,每个搜索子表达式具有相关联的权重值。 搜索引擎返回具有超过阈值权重值和的权重值和的文档或文档。 搜索运算符实现为一个布尔谓词,用作加权AND(WAND)。

    System, method and computer program product for performing unstructured information management and automatic text analysis, including a search operator functioning as a Weighted AND (WAND)
    9.
    发明授权
    System, method and computer program product for performing unstructured information management and automatic text analysis, including a search operator functioning as a Weighted AND (WAND) 有权
    用于执行非结构化信息管理和自动文本分析的系统,方法和计算机程序产品,包括用作加权AND(WAND)的搜索运算符,

    公开(公告)号:US07146361B2

    公开(公告)日:2006-12-05

    申请号:US10449265

    申请日:2003-05-30

    IPC分类号: G06F17/30

    摘要: Disclosed is a system architecture, components and a searching technique for an Unstructured Information Management System (UIMS). The UIMS may be provided as middleware for the effective management and interchange of unstructured information over a wide array of information sources. The architecture generally includes a search engine, data storage, analysis engines containing pipelined document annotators and various adapters. The searching technique makes use of a two-level searching technique. A search query includes a search operator containing of a plurality of search sub-expressions each having an associated weight value. The search engine returns a document or documents having a weight value sum that exceeds a threshold weight value sum. The search operator is implemented as a Boolean predicate that functions as a Weighted AND (WAND).

    摘要翻译: 公开了一种用于非结构化信息管理系统(UIMS)的系统架构,组件和搜索技术。 UIMS可以作为中间件提供,用于通过广泛的信息源有效地管理和交换非结构化信息。 该架构通常包括搜索引擎,数据存储,包含流水线文档注释器和各种适配器的分析引擎。 搜索技术利用了两级搜索技术。 搜索查询包括包含多个搜索子表达式的搜索运算符,每个搜索子表达式具有相关联的权重值。 搜索引擎返回具有超过阈值权重值和的权重值和的文档或文档。 搜索运算符实现为一个布尔谓词,用作加权AND(WAND)。

    System, method and computer program product for performing unstructured information management and automatic text analysis, including a search operator functioning as a Weighted AND (WAND)
    10.
    发明授权
    System, method and computer program product for performing unstructured information management and automatic text analysis, including a search operator functioning as a Weighted AND (WAND) 有权
    用于执行非结构化信息管理和自动文本分析的系统,方法和计算机程序产品,包括用作加权AND(WAND)的搜索运算符,

    公开(公告)号:US08280903B2

    公开(公告)日:2012-10-02

    申请号:US12138857

    申请日:2008-06-13

    IPC分类号: G06F17/30

    摘要: Disclosed is a system architecture, components and a searching technique for an Unstructured Information Management System (UIMS). The UIMS may be provided as middleware for the effective management and interchange of unstructured information over a wide array of information sources. The architecture generally includes a search engine, data storage, analysis engines containing pipelined document annotators and various adapters. The searching technique makes use of a two-level searching technique. A search query includes a search operator containing of a plurality of search sub-expressions each having an associated weight value. The search engine returns a document or documents having a weight value sum that exceeds a threshold weight value sum. The search operator is implemented as a Boolean predicate that functions as a Weighted AND (WAND).

    摘要翻译: 公开了一种用于非结构化信息管理系统(UIMS)的系统架构,组件和搜索技术。 UIMS可以作为中间件提供,用于通过广泛的信息源有效地管理和交换非结构化信息。 该架构通常包括搜索引擎,数据存储,包含流水线文档注释器和各种适配器的分析引擎。 搜索技术利用了两级搜索技术。 搜索查询包括包含多个搜索子表达式的搜索运算符,每个搜索子表达式具有相关联的权重值。 搜索引擎返回具有超过阈值权重值和的权重值和的文档或文档。 搜索运算符实现为一个布尔谓词,用作加权AND(WAND)。