MINING INTENT OF QUERIES FROM SEARCH LOG DATA
    1.
    Patent application
    Status: pending (published)

    Publication No.: US20120290575A1

    Publication date: 2012-11-15

    Application No.: US13103989

    Filing date: 2011-05-09

    IPC classification: G06F17/30

    CPC classification: G06F16/3325 G06F16/9535

    Abstract: An architecture that mines the intent of a query from search log data. For a given query, the architecture finds the intent, the major URLs for the intent, and the intent attributes. The input is search log data, and the output is a database that contains the query intents mined from the log data. Data mining techniques are employed to discover the major intents of queries in the click-through log data of a search engine. For each query, expanded queries are created and used together with the co-clicks of the original query and the expanded queries in the log data. For each query, clustering is performed on the co-click data of the query and its expanded queries to find the major intents of the query.
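
The co-click clustering step in the abstract above can be illustrated with a minimal sketch. Everything specific here is an assumption rather than the patent's method: expanded queries are taken to be logged queries that contain every token of the original query, and a simple connected-components grouping (two URLs fall in the same intent if some expanded query clicked both) stands in for the unspecified clustering algorithm. The function names and the sample log are hypothetical.

```python
from collections import defaultdict


def expanded_queries(query, log):
    """Logged queries containing every token of `query` (assumed definition)."""
    base = set(query.split())
    return {q for q, _ in log if base <= set(q.split())}


def mine_intents(query, log):
    """Group clicked URLs into intents via co-clicks of the expanded queries."""
    # The bare query is left out: it is ambiguous and would link every intent.
    family = expanded_queries(query, log) - {query}
    parent = {}

    def find(u):
        # Union-find lookup with path halving.
        while parent[u] != u:
            parent[u] = parent[parent[u]]
            u = parent[u]
        return u

    first_url = {}
    for q, url in log:
        if q not in family:
            continue
        parent.setdefault(url, url)
        if q in first_url:
            # URLs clicked for the same expanded query share an intent.
            parent[find(url)] = find(first_url[q])
        else:
            first_url[q] = url

    clusters = defaultdict(set)
    for url in parent:
        clusters[find(url)].add(url)
    return sorted((sorted(c) for c in clusters.values()), key=lambda c: c[0])


# Hypothetical click-through log of (query, clicked URL) pairs.
log = [
    ("jaguar", "jaguar.com"),
    ("jaguar", "wiki/Jaguar_animal"),
    ("jaguar car", "jaguar.com"),
    ("jaguar car", "cars.example/jaguar"),
    ("jaguar price", "cars.example/jaguar"),
    ("jaguar animal", "wiki/Jaguar_animal"),
    ("jaguar habitat", "wiki/Jaguar_animal"),
    ("puma", "puma.com"),
]
intents = mine_intents("jaguar", log)
```

On this sample log the sketch separates a car-related intent from an animal-related one; a production system would presumably use a similarity-based clustering over much larger co-click data.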

Extracting Search-Focused Key N-Grams and/or Phrases for Relevance Rankings in Searches
    3.
    Patent application
    Status: pending (published)

    Publication No.: US20130173610A1

    Publication date: 2013-07-04

    Application No.: US13339532

    Filing date: 2011-12-29

    Applicant: Yunhua Hu, Hang Li

    Inventor: Yunhua Hu, Hang Li

    IPC classification: G06F17/30

    CPC classification: G06F16/951

    Abstract: An n-gram and/or phrase extraction model may be trained based at least in part on search-focused information mined from a search-query log. The n-gram and/or phrase extraction model may extract key n-grams and/or phrases from retrieved electronic documents based at least in part on features and/or characteristics of the key n-grams and/or phrases and based at least in part on features and/or characteristics of the search-focused information. The extracted key n-grams and/or phrases may be weighted. A relevancy ranking model may be trained based at least in part on the information extracted by the n-gram and/or phrase extraction model. The relevancy ranking model may provide a relevancy ranking score for electronic documents listed in a search result based at least in part on weights of extracted key n-grams and/or phrases.
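
The flow from query log to weighted key n-grams to a relevancy score can be sketched as follows. This is an illustration under stated assumptions, not the trained models the abstract describes: a frequency heuristic (document count times query-log count) replaces the learned extraction model, and a plain sum of weights replaces the learned ranking model. All function names and sample strings are hypothetical.

```python
from collections import Counter


def ngrams(text, max_n=2):
    """All word n-grams of length 1..max_n, lowercased."""
    tokens = text.lower().split()
    return [
        " ".join(tokens[i:i + n])
        for n in range(1, max_n + 1)
        for i in range(len(tokens) - n + 1)
    ]


def key_ngram_weights(document, query_log, max_n=2):
    """Weight document n-grams that also occur in the query log.

    Assumed heuristic: weight = (count in document) * (count in query log).
    """
    log_counts = Counter(g for q in query_log for g in ngrams(q, max_n))
    doc_counts = Counter(ngrams(document, max_n))
    return {g: doc_counts[g] * log_counts[g] for g in doc_counts if g in log_counts}


def relevance_score(query, weights, max_n=2):
    """Sum the weights of the key n-grams that the query shares with the document."""
    return sum(weights.get(g, 0) for g in set(ngrams(query, max_n)))


# Hypothetical data.
query_log = ["python tutorial", "python list sort", "tutorial"]
document = "A short Python tutorial about list sort"
weights = key_ngram_weights(document, query_log)
score = relevance_score("python tutorial", weights)
```

N-grams that never appear in the query log receive no weight, which is one simple way to make the extraction "search-focused".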

Mining and Conveying Social Relationships
    5.
    Patent application
    Status: pending (published)

    Publication No.: US20110078188A1

    Publication date: 2011-03-31

    Application No.: US12568622

    Filing date: 2009-09-28

    IPC classification: G06F17/30 G06F3/00

    CPC classification: G06Q30/02 G06Q50/01

    Abstract: Techniques and tools described herein mine social information from a source and store the social information in a database. Responsive to a search object, the techniques search the stored social information and determine social relationships. The techniques further provide, via a graphical user interface, the social relationships determined from the social information stored in the database. In several embodiments, the techniques enable social relationship feedback.

RANKING SEARCH RESULTS USING AUTHOR EXTRACTION
    6.
    Patent application
    Status: pending (published)

    Publication No.: US20090182723A1

    Publication date: 2009-07-16

    Application No.: US11972613

    Filing date: 2008-01-10

    IPC classification: G06F17/30

    CPC classification: G06F16/38

    Abstract: An architecture that extracts author information from general documents and uses the author information for search results ranking. The architecture performs automatic author value extraction and makes the extracted value available at index time for subsequent use in query processing and results ranking. Machine learning (e.g., a perceptron algorithm) is employed, and a set of input features for the perceptron algorithm is utilized for author value extraction. The extracted author value is converted into a feature for input to a ranking function that generates a ranking score for each document. The input features can also be weighted according to weighting criteria.
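
The abstract names a perceptron algorithm for author value extraction, so a minimal sketch of a binary perceptron that classifies candidate text spans as author or non-author may help. The three input features (span follows the word "by", span is title-cased, span appears near the top of the document) and the training samples are assumptions made for illustration; the abstract does not list the actual feature set.

```python
def train_perceptron(samples, epochs=10):
    """Classic perceptron: update weights only on misclassified samples.

    `samples` is a list of (feature_vector, label) with label in {+1, -1}.
    """
    weights = [0.0] * len(samples[0][0])
    bias = 0.0
    for _ in range(epochs):
        for x, y in samples:
            activation = sum(w * xi for w, xi in zip(weights, x)) + bias
            if y * activation <= 0:  # wrong side of (or on) the boundary
                weights = [w + y * xi for w, xi in zip(weights, x)]
                bias += y
    return weights, bias


def predict(weights, bias, x):
    """Return +1 (author) or -1 (not an author) for a feature vector."""
    return 1 if sum(w * xi for w, xi in zip(weights, x)) + bias > 0 else -1


# Hypothetical features per candidate span: [follows "by", title-cased, near top]
samples = [
    ([1, 1, 1], +1),
    ([1, 1, 0], +1),
    ([0, 1, 0], -1),
    ([0, 0, 1], -1),
    ([0, 1, 1], -1),
]
weights, bias = train_perceptron(samples)
```

A span predicted as an author could then be stored at index time and exposed to the ranking function as an additional feature, as the abstract describes.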

Enterprise Search Method and System
    7.
    Patent application
    Status: granted (in force)

    Publication No.: US20100228711A1

    Publication date: 2010-09-09

    Application No.: US12391484

    Filing date: 2009-02-24

    IPC classification: G06F7/06 G06F17/30 G06F3/048

    Abstract: A system and method for enterprise search includes one or more computer-readable media storing computer-executable instructions that, when executed on one or more processors, perform acts including extracting one or more of term data, personal data and metadata from one or more predetermined resources; retrieving a set of information derived from the extracted term data, personal data and metadata responsive to a query; and receiving feedback responsive to the set of information, the feedback augmenting at least one of the one or more predetermined resources.

CONTEXT-AWARE SEARCHING
    8.
    Patent application
    Status: pending (published)

    Publication No.: US20110208730A1

    Publication date: 2011-08-25

    Application No.: US12710608

    Filing date: 2010-02-23

    Applicant: Daxin Jiang, Hang Li

    Inventor: Daxin Jiang, Hang Li

    IPC classification: G06F17/30

    CPC classification: G06F16/951

    Abstract: A model generated from search log data predicts a hidden state based on a query to determine a context of the query, such as for providing re-ranked search results, query suggestions and/or URL recommendations.

Search Log Online Analytic Processing
    9.
    Patent application
    Status: pending (published)

    Publication No.: US20110179013A1

    Publication date: 2011-07-21

    Application No.: US12691109

    Filing date: 2010-01-21

    Applicant: Daxin Jiang, Hang Li

    Inventor: Daxin Jiang, Hang Li

    IPC classification: G06F17/30

    CPC classification: G06F16/951

    Abstract: A suffix-tree index may be constructed from search engine search logs. This suffix tree is scalable and suitable for use in a distributed computing environment. Data mining against the indexed data may proceed with functions including a forward search, a backward search, and/or query session retrieval.
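
To make the forward and backward search functions concrete, here is a minimal single-machine sketch. It is an assumption-laden simplification: an uncompressed suffix trie over query sessions stands in for the scalable, distributed suffix tree the abstract mentions, and "forward search" is interpreted as finding which queries follow a given query sequence, with "backward search" finding which queries precede it. The function names and sample sessions are hypothetical.

```python
def build_suffix_trie(sessions):
    """Index every suffix of every session; each node counts how often its path occurs."""
    root = {"count": 0, "children": {}}
    for session in sessions:
        for start in range(len(session)):
            node = root
            for query in session[start:]:
                node = node["children"].setdefault(
                    query, {"count": 0, "children": {}}
                )
                node["count"] += 1
    return root


def following(trie, sequence):
    """Queries that directly follow `sequence` in the indexed sessions, with counts."""
    node = trie
    for query in sequence:
        node = node["children"].get(query)
        if node is None:
            return {}
    return {q: child["count"] for q, child in node["children"].items()}


# Hypothetical query sessions (each a list of consecutive queries by one user).
sessions = [
    ["ipod", "ipod nano", "ipod price"],
    ["ipod", "ipod nano", "apple store"],
    ["zune", "ipod"],
]

# Forward search: index the sessions as they are.
forward_index = build_suffix_trie(sessions)
next_queries = following(forward_index, ["ipod", "ipod nano"])

# Backward search: index the reversed sessions and reverse the probe sequence.
backward_index = build_suffix_trie([s[::-1] for s in sessions])
previous_queries = following(backward_index, ["ipod"][::-1])
```

Reusing one lookup routine over a reversed index keeps the backward search symmetric with the forward search; a compressed suffix tree would reduce memory but is omitted here for brevity.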

Context-aware query suggestion by mining log data
    10.
    Granted patent
    Status: granted (in force)

    Publication No.: US09330165B2

    Publication date: 2016-05-03

    Application No.: US12371296

    Filing date: 2009-02-13

    Applicant: Daxin Jiang, Hang Li

    Inventor: Daxin Jiang, Hang Li

    IPC classification: G06F17/30

    Abstract: The techniques described herein provide a context-aware query suggestion process. The context of a current query may be calculated by analyzing a sequence of previous queries. Historical search data may be mined to generate groups of query suggestion candidates. Using the context of the current query, the current query may be matched with the groups of query suggestion candidates to find a matching query suggestion candidate, which may be provided to the user.
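
The matching step can be sketched as follows, under simplifying assumptions that are not taken from the patent: the "context" is just the last few raw queries of the session, the candidate groups are counts of which query followed each observed query sequence in historical sessions, and matching prefers the longest context that has candidates. All names and the sample sessions are hypothetical.

```python
from collections import Counter, defaultdict


def mine_candidates(sessions, max_context=2):
    """Map each observed query sequence (up to max_context long) to its follow-up queries."""
    table = defaultdict(Counter)
    for session in sessions:
        for i in range(1, len(session)):
            for k in range(1, max_context + 1):
                if i - k >= 0:
                    table[tuple(session[i - k:i])][session[i]] += 1
    return table


def suggest(table, previous_queries, current_query, max_context=2, top=1):
    """Suggest follow-up queries, preferring the longest matching context."""
    history = list(previous_queries) + [current_query]
    for k in range(min(max_context, len(history)), 0, -1):
        candidates = table.get(tuple(history[-k:]))
        if candidates:
            return [query for query, _ in candidates.most_common(top)]
    return []


# Hypothetical historical sessions.
sessions = [
    ["beautiful mind", "gladiator", "russell crowe"],
    ["roman empire", "gladiator", "colosseum"],
    ["gladiator", "gladiator movie"],
    ["gladiator", "gladiator movie"],
]
table = mine_candidates(sessions)

with_history = suggest(table, ["roman empire"], "gladiator")
without_history = suggest(table, [], "gladiator")
```

With the preceding query "roman empire" the sketch suggests a history-related follow-up, while the same current query with no context falls back to the most frequent follow-up overall. Matching on raw query strings is brittle; grouping similar queries before mining would generalize better.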
