-
公开(公告)号:US07296009B1
公开(公告)日:2007-11-13
申请号:US10030331
申请日:2000-06-30
申请人: Jason Jiang , Bhavani Laxman Raskutti , Christopher David Rowles , Simon David Ryan , Wilson Wen
发明人: Jason Jiang , Bhavani Laxman Raskutti , Christopher David Rowles , Simon David Ryan , Wilson Wen
CPC分类号: G06F17/3069 , Y10S707/99931 , Y10S707/99932 , Y10S707/99933 , Y10S707/99934 , Y10S707/99935
摘要: A search engine and system for data, such as Internet web pages, including a query analyser for processing a query to assign respective weights to terms of the query and to generate a query vector including the weights, and an index network responsive to the query vector to output at least one index to data in response to the query. The index network is a self-generating neural network built using training examples derived from a feature extractor. The feature extractor is used during both the search and training phase. A clusterer is used to group search results.
摘要翻译: 一种用于数据的搜索引擎和系统,例如因特网网页,包括用于处理查询以将相应权重分配给查询的条款并生成包括权重的查询向量的查询分析器,以及响应于查询向量的索引网络 以响应于查询将至少一个索引输出到数据。 索引网络是使用从特征提取器导出的训练样本构建的自生成神经网络。 特征提取器在搜索和训练阶段都被使用。 群集器用于对搜索结果进行分组。
-
公开(公告)号:US07870118B2
公开(公告)日:2011-01-11
申请号:US11938758
申请日:2007-11-12
申请人: Jason Jiang , Bhavani Laxman Raskutti , Christopher David Rowles , Simon David Ryan , Wilson Wen
发明人: Jason Jiang , Bhavani Laxman Raskutti , Christopher David Rowles , Simon David Ryan , Wilson Wen
IPC分类号: G06F17/30
CPC分类号: G06F17/3069 , Y10S707/99931 , Y10S707/99932 , Y10S707/99933 , Y10S707/99934 , Y10S707/99935
摘要: A search engine and system for data, such as Internet web pages, including a query analyser for processing a query to assign respective weights to terms of the query and to generate a query vector including the weights, and an index network responsive to the query vector to output at least one index to data in response to the query. The index network is a self-generating neural network built using training examples derived from a feature extractor. The feature extractor is used during both the search and training phase. A clusterer is used to group search results.
摘要翻译: 一种用于数据的搜索引擎和系统,例如因特网网页,包括用于处理查询以将相应权重分配给查询的条款并生成包括权重的查询向量的查询分析器,以及响应于查询向量的索引网络 以响应于查询将至少一个索引输出到数据。 索引网络是使用从特征提取器导出的训练样本构建的自生成神经网络。 特征提取器在搜索和训练阶段都被使用。 群集器用于对搜索结果进行分组。
-
公开(公告)号:US20080133508A1
公开(公告)日:2008-06-05
申请号:US11938758
申请日:2007-11-12
申请人: Jason JIANG , Bhavani Laxman Raskutti , Christopher David Rowles , Simon David Ryan , Wilson Wen
发明人: Jason JIANG , Bhavani Laxman Raskutti , Christopher David Rowles , Simon David Ryan , Wilson Wen
IPC分类号: G06F17/30
CPC分类号: G06F17/3069 , Y10S707/99931 , Y10S707/99932 , Y10S707/99933 , Y10S707/99934 , Y10S707/99935
摘要: A search engine and system for data, such as Internet web pages, including a query analyser for processing a query to assign respective weights to terms of the query and to generate a query vector including the weights, and an index network responsive to the query vector to output at least one index to data in response to the query. The index network is a self-generating neural network built using training examples derived from a feature extractor. The feature extractor is used during both the search and training phase. A clusterer is used to group search results.
摘要翻译: 一种用于数据的搜索引擎和系统,例如因特网网页,包括用于处理查询以将相应权重分配给查询的条款并生成包括权重的查询向量的查询分析器,以及响应于查询向量的索引网络 以响应于查询将至少一个索引输出到数据。 索引网络是使用从特征提取器导出的训练样本构建的自生成神经网络。 特征提取器在搜索和训练阶段都被使用。 群集器用于对搜索结果进行分组。
-
公开(公告)号:US08793261B2
公开(公告)日:2014-07-29
申请号:US10399587
申请日:2001-10-17
IPC分类号: G06F17/30
CPC分类号: G06F17/30011 , G06F17/2705 , G06F17/28 , G06F17/30616 , G06F17/30663 , G06F17/30684 , G10L15/00
摘要: An information retrieval system including a natural language parser (3) for parsing documents of a document space (1) to identify key terms of each document based on linguistic structure, and for parsing a search query to determine the search term, a feature extractor (4) for determining an importance score for terms of the document space based on distribution of the terms in the document space, an index term generator (5) for generating index terms using the key terms identified by the parser and the extractor and having an importance score above a threshold level, and a query clarifier (16) for selecting from the index terms, on the basis of the search term, index terms for selecting a document from the document space. A speech recognition engine (12) generates the query, and a bi-gram language module (6) generates grammar rules for the speech recognition engine using the index terms.
摘要翻译: 一种信息检索系统,包括用于解析文档空间(1)的文档的自然语言解析器(3),以基于语言结构识别每个文档的关键术语,以及用于解析搜索查询以确定搜索项,特征提取器( 4)用于基于文档空间中的项的分布来确定文档空间的术语的重要性得分,索引项生成器(5),用于使用由解析器和提取器识别的关键术语来生成索引项,并且具有重要性 得分高于阈值水平;以及查询澄清器(16),用于根据搜索项从索引项中选择用于从文档空间中选择文档的索引项。 语音识别引擎(12)生成查询,并且双语言语言模块(6)使用索引项生成语音识别引擎的语法规则。
-
-
-