Constructing web query hierarchies from click-through data
    83.
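    Granted invention patent
    Constructing web query hierarchies from click-through data (in force)

    Publication (announcement) No.: US07870132B2

    Publication (announcement) date: 2011-01-11

    Application No.: US12020574

    Filing date: 2008-01-28

    IPC classification: G06F17/30

    CPC classification: G06F17/30864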

    Abstract: The claimed subject matter is directed to constructing query hierarchies in response to a query request. To construct a query hierarchy, a list of related candidate queries is generated in response to the received query request. The list of related candidate queries is generated by determining the relative coverage of information shared by the candidate queries and the query request. Relationships between the submitted query request and the candidate queries in the list are determined based upon the extent of relative coverage of information shared by the candidate queries and the query request. A query hierarchy is then constructed to reflect the determined relationships between the query request and the candidate queries.
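
The coverage-based construction described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: each query is represented by the set of URLs users clicked for it, coverage is taken to be the fraction of one query's clicked URLs that the other query shares, and the 0.6 threshold is arbitrary. The abstract does not define any of these, and the function names are hypothetical.

```python
from typing import Dict, List, Set


def coverage(a: Set[str], b: Set[str]) -> float:
    """Fraction of b's clicked URLs that a also covers (0.0 if b is empty)."""
    return len(a & b) / len(b) if b else 0.0


def build_hierarchy(query: str, clicks: Dict[str, Set[str]],
                    threshold: float = 0.6) -> Dict[str, List[str]]:
    """Classify candidate queries relative to `query` by asymmetric coverage."""
    target = clicks[query]
    hierarchy: Dict[str, List[str]] = {"parents": [], "children": [], "siblings": []}
    for cand, urls in clicks.items():
        if cand == query or not (urls & target):
            continue  # skip the query itself and candidates with no shared clicks
        cand_covers_query = coverage(urls, target)
        query_covers_cand = coverage(target, urls)
        if cand_covers_query >= threshold and query_covers_cand < threshold:
            hierarchy["parents"].append(cand)    # candidate is the broader query
        elif query_covers_cand >= threshold and cand_covers_query < threshold:
            hierarchy["children"].append(cand)   # candidate is the narrower query
        else:
            hierarchy["siblings"].append(cand)   # overlap without clear dominance
    return hierarchy


# Hypothetical click-through data: query -> set of clicked URL ids.
clicks = {
    "jaguar": {"u1", "u2", "u3", "u4"},
    "jaguar car": {"u1", "u2"},
    "big cats": {"u3", "u4", "u5", "u6", "u7", "u8"},
    "animals": {"u1", "u2", "u3", "u4", "u5", "u6", "u7", "u8", "u9"},
}
result = build_hierarchy("jaguar", clicks)
```

With this toy data, "animals" covers all of the clicks for "jaguar" and is placed above it, "jaguar car" is fully covered by "jaguar" and is placed below it, and "big cats" only partially overlaps.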

    Key phrase navigation map for document navigation
    84.
    Granted invention patent
    Key phrase navigation map for document navigation (lapsed)

    Publication (announcement) No.: US07861149B2

    Publication (announcement) date: 2010-12-28

    Application No.: US11372365

    Filing date: 2006-03-09

    IPC classification: G06F3/048 G06F17/30

    Abstract: Computer-readable media having computer-executable instructions and apparatuses provide a keyphrase navigation map (KNM) for a document page. Keyphrases are extracted from the document page. Keyphrase clusters are subsequently formed by a measure of relevancy, and a salient keyphrase is determined for each cluster. A thumbnail is formed with tags corresponding to the salient keyphrases. A selected tag is expanded with associated keyphrases. An associated keyphrase may be further selected in order to facilitate the navigation of the document page. The displayed tags on the thumbnail are positioned in accordance with locations of associated keyphrases in the document page.
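
The clustering and tagging steps in this abstract might look like the sketch below. Everything concrete here is an assumption made for illustration: the `Keyphrase` fields, the word-overlap relevancy measure, the greedy clustering, and the choice to place each tag at its salient keyphrase's position. The abstract does not specify any of them.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Keyphrase:
    text: str
    y: float      # vertical position in the page, 0.0 (top) to 1.0 (bottom)
    score: float  # extraction weight; higher means more salient


def related(a: Keyphrase, b: Keyphrase) -> bool:
    """Assumed relevancy measure: the two phrases share at least one word."""
    return bool(set(a.text.lower().split()) & set(b.text.lower().split()))


def cluster(phrases: List[Keyphrase]) -> List[List[Keyphrase]]:
    """Greedy clustering: join the first cluster that has a related member."""
    clusters: List[List[Keyphrase]] = []
    for p in phrases:
        for c in clusters:
            if any(related(p, q) for q in c):
                c.append(p)
                break
        else:
            clusters.append([p])  # no related cluster found, start a new one
    return clusters


def tags(phrases: List[Keyphrase]) -> List[dict]:
    """One thumbnail tag per cluster: salient phrase, position, expandable members."""
    out = []
    for c in cluster(phrases):
        salient = max(c, key=lambda k: k.score)
        out.append({
            "tag": salient.text,
            "y": salient.y,
            "expand": [k.text for k in c if k is not salient],
        })
    return out


# Hypothetical keyphrases extracted from one document page.
page = [
    Keyphrase("query hierarchy", 0.10, 0.9),
    Keyphrase("click-through data", 0.15, 0.7),
    Keyphrase("candidate query", 0.40, 0.5),
    Keyphrase("click-through rate", 0.80, 0.4),
]
nav = tags(page)
```

Each entry in `nav` stands for one tag on the thumbnail; selecting a tag would reveal the phrases listed under `expand`.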

    Forecasting time-dependent search queries
    85.
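    Granted invention patent
    Forecasting time-dependent search queries (lapsed)

    Publication (announcement) No.: US07693823B2

    Publication (announcement) date: 2010-04-06

    Application No.: US11770385

    Filing date: 2007-06-28

    IPC classification: G06F17/30

    CPC classification: G06F17/30864 G06Q30/02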

    Abstract: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.
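
For the periodicity-based forecasting of time-dependent queries, a minimal sketch is given below. It assumes that periodicity is detected with sample autocorrelation and that the forecast is a seasonal mean. The abstract names neither technique, so both are stand-ins, and the 0.5 threshold and function names are hypothetical.

```python
from statistics import mean
from typing import List, Optional


def autocorr(series: List[float], lag: int) -> float:
    """Sample autocorrelation of `series` at the given lag."""
    mu = mean(series)
    denom = sum((x - mu) ** 2 for x in series)
    if denom == 0:
        return 0.0  # a constant series has no periodic structure
    num = sum((series[i] - mu) * (series[i + lag] - mu)
              for i in range(len(series) - lag))
    return num / denom


def find_period(series: List[float], max_lag: int,
                threshold: float = 0.5) -> Optional[int]:
    """Return the lag (2..max_lag) with the strongest autocorrelation,
    or None if it stays below the threshold (treated as time-independent)."""
    best = max(range(2, max_lag + 1), key=lambda lag: autocorr(series, lag))
    return best if autocorr(series, best) >= threshold else None


def forecast_next(series: List[float], period: int) -> float:
    """Seasonal-mean forecast: average the values at the same phase in earlier cycles."""
    n = len(series)
    return mean(series[i] for i in range(n - period, -1, -period))


# Hypothetical daily counts for one query: a weekend spike repeating every 7 days.
weekly = [10, 12, 11, 13, 12, 30, 28] * 4
period = find_period(weekly, max_lag=10)
next_value = forecast_next(weekly, period) if period else None
```

A query for which `find_period` returns `None` would fall into the time-independent group, which the abstract forecasts from the events of other queries instead.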

    Identification of events of search queries
    86.
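    Granted invention patent
    Identification of events of search queries (in force)

    Publication (announcement) No.: US07689622B2

    Publication (announcement) date: 2010-03-30

    Application No.: US11770423

    Filing date: 2007-06-28

    IPC classification: G06F17/30

    CPC classification: G06F17/30864 G06Q30/02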

    Abstract: Techniques for analyzing and modeling the frequency of queries are provided by a query analysis system. A query analysis system analyzes frequencies of a query over time to determine whether the query is time-dependent or time-independent. The query analysis system forecasts the frequency of time-dependent queries based on their periodicities. The query analysis system forecasts the frequency of time-independent queries based on causal relationships with other queries. To forecast the frequency of time-independent queries, the query analysis system analyzes the frequency of a query over time to identify significant increases in the frequency, which are referred to as “query events” or “events.” The query analysis system forecasts frequencies of time-independent queries based on queries with events that tend to causally precede events of the query to be forecasted.
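
Identifying "query events" as significant increases in frequency could be sketched as below. The rule used here (a value exceeding the trailing-window mean by `k` standard deviations) and the `precedes` measure of how often one query's events come shortly before another's are assumptions chosen for illustration. The abstract does not say how "significant" or "causally precede" are defined.

```python
from statistics import mean, pstdev
from typing import List


def detect_events(freq: List[float], window: int = 5, k: float = 3.0) -> List[int]:
    """Return indices where frequency jumps well above the trailing-window baseline."""
    events = []
    for t in range(window, len(freq)):
        history = freq[t - window:t]
        baseline, spread = mean(history), pstdev(history)
        # Floor the spread so a nearly flat history does not flag tiny changes.
        if freq[t] > baseline + k * max(spread, 1.0):
            events.append(t)
    return events


def precedes(events_a: List[int], events_b: List[int], max_gap: int = 3) -> float:
    """Fraction of b's events that follow some event of a within `max_gap` steps."""
    if not events_b:
        return 0.0
    hits = sum(any(0 < b - a <= max_gap for a in events_a) for b in events_b)
    return hits / len(events_b)


# Hypothetical daily counts for two queries; the second spikes two days after the first.
trailer = [5, 6, 5, 6, 5, 40, 7, 6, 5, 6, 5, 6, 5, 6, 5, 6]
tickets = [3, 3, 4, 3, 3, 3, 3, 30, 4, 3, 3, 3, 4, 3, 3, 3]
ev_trailer = detect_events(trailer)
ev_tickets = detect_events(tickets)
lead_score = precedes(ev_trailer, ev_tickets)
```

A high `lead_score` suggests that an event in the first query could be used as a signal when forecasting the second, which is the kind of relationship the abstract relies on.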

    System and method for exploring a semantic file network
    87.
    Granted invention patent
    System and method for exploring a semantic file network (lapsed)

    Publication (announcement) No.: US07624130B2

    Publication (announcement) date: 2009-11-24

    Application No.: US11392640

    Filing date: 2006-03-30

    IPC classification: G06F17/30

    Abstract: Extraction of semantic information and the generation of semantic attributes allows for improved organization and management of data. Semantic attributes are automatically generated and eliminate the need for manual entry of attribute information. A semantic file network may further be constructed based on similarities between files that are based on the semantic attribute information. Semantic links representing a semantic relationship may be built between similar or relevant files. In addition, user operations and user operation patterns may also be considered in building the file network. Semantic attributes and information may further facilitate browsing the file systems as well as improve the accuracy and speed of queries.
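
Building semantic links from attribute similarity can be illustrated with the sketch below. It assumes that each file's semantic attributes are a set of strings and that similarity is Jaccard overlap with an arbitrary 0.4 threshold. The abstract does not name a similarity measure, and user operation patterns are left out here.

```python
from itertools import combinations
from typing import Dict, List, Set, Tuple


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity of two attribute sets (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def build_links(attrs: Dict[str, Set[str]],
                threshold: float = 0.4) -> List[Tuple[str, str, float]]:
    """Create a semantic link between every pair of files that is similar enough."""
    links = []
    for f1, f2 in combinations(sorted(attrs), 2):
        sim = jaccard(attrs[f1], attrs[f2])
        if sim >= threshold:
            links.append((f1, f2, sim))
    return links


def neighbors(links: List[Tuple[str, str, float]], name: str) -> List[str]:
    """Files reachable from `name` over one semantic link, for browsing."""
    return sorted(b if a == name else a for a, b, _ in links if name in (a, b))


# Hypothetical files with automatically generated semantic attributes.
files = {
    "budget.xls": {"finance", "2006", "q1"},
    "forecast.doc": {"finance", "2006", "planning"},
    "holiday.jpg": {"photo", "beach"},
}
links = build_links(files)
```

Browsing then amounts to following links: `neighbors(links, "budget.xls")` lists the related files regardless of where they sit in the directory tree.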

    Comparative web search
    88.
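    Granted invention patent
    Comparative web search (in force)

    Publication (announcement) No.: US07571162B2

    Publication (announcement) date: 2009-08-04

    Application No.: US11365961

    Filing date: 2006-03-01

    IPC classification: G06F17/30 G06F15/16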

    Abstract: Methods and systems are provided for performing a comparative search. In one example, the comparative search is performed over a network, such as the web, or a database. In one exemplary implementation, a user transmits a plurality of queries which represent the topics that a user wants to compare, and a computing system can automatically retrieve and rank web pages or documents based on both their relevance to queries and the comparative contents they contain. In one such example, the comparative pages are displayed in a pair or other form of a grouping. In another example, comparative results having similar contents may be clustered into meaningful themes.
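
Ranking page pairs on both query relevance and comparative content might be sketched as follows. The scoring is entirely assumed: relevance is the share of page tokens that are query terms, comparative content is the Jaccard overlap of the two pages' non-query vocabulary, and `alpha` blends the two. The abstract does not describe how either quantity is computed.

```python
from itertools import permutations
from typing import Dict, List, Tuple


def tokens(text: str) -> List[str]:
    return text.lower().split()


def relevance(query: str, page: str) -> float:
    """Assumed relevance: share of page tokens that are query terms."""
    words = tokens(page)
    terms = set(tokens(query))
    return sum(w in terms for w in words) / len(words) if words else 0.0


def comparativeness(page_a: str, page_b: str, queries: Tuple[str, str]) -> float:
    """Assumed comparative score: Jaccard overlap of the non-query vocabulary."""
    query_terms = set(tokens(queries[0])) | set(tokens(queries[1]))
    a = set(tokens(page_a)) - query_terms
    b = set(tokens(page_b)) - query_terms
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def rank_pairs(queries: Tuple[str, str], pages: Dict[str, str],
               alpha: float = 0.5) -> List[Tuple[float, str, str]]:
    """Score ordered page pairs (first for query 1, second for query 2), best first."""
    scored = []
    for name_a, name_b in permutations(pages, 2):
        rel_a = relevance(queries[0], pages[name_a])
        rel_b = relevance(queries[1], pages[name_b])
        if rel_a == 0.0 or rel_b == 0.0:
            continue  # each page in a pair must be relevant to its own query
        comp = comparativeness(pages[name_a], pages[name_b], queries)
        scored.append((alpha * min(rel_a, rel_b) + (1 - alpha) * comp, name_a, name_b))
    return sorted(scored, key=lambda s: s[0], reverse=True)


# Hypothetical pages, keyed by an id.
pages = {
    "a": "python speed benchmark results",
    "b": "java speed benchmark results",
    "c": "java coffee recipes",
}
ranking = rank_pairs(("python", "java"), pages)
```

Pages "a" and "b" discuss the same aspects of the two topics, so that pair ranks above the pair of "a" with "c", which is relevant to the second query but shares no comparative content.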

    WEB CONTENT MINING OF PAIR-BASED DATA
    90.
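    Invention patent application
    WEB CONTENT MINING OF PAIR-BASED DATA (in force)

    Publication (announcement) No.: US20090132530A1

    Publication (announcement) date: 2009-05-21

    Application No.: US11941968

    Filing date: 2007-11-19

    IPC classification: G06F17/30

    CPC classification: G06F17/30864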

    Abstract: Described herein is technology for, among other things, mining pair-based data on the web. The technology involves an online pair-based data mining system as well as an offline SVM training system. By subjecting pair-based input data to the systems, one may grow a pool of pair-based data that shares characteristics of the pair-based input data in a more efficient manner.
