Method and system for adapting search results to personal information needs
    1.
    Granted patent
    Method and system for adapting search results to personal information needs (Status: Active)

    Publication No.: US07849089B2

    Publication date: 2010-12-07

    Application No.: US12616739

    Filing date: 2009-11-11

    IPC classification: G06F7/00 G10L15/00

    Abstract: A method and system for adapting search results of a query to the information needs of the user submitting the query is provided. A search system analyzes click-through triplets indicating that a user submitted a query and that the user selected a document from the results of the query. To overcome the large size and sparseness of the click-through data, the search system, when presented with an input triplet comprising a user, a query, and a document, determines a probability that the user will find the input document important by smoothing the click-through triplets. The search system then orders documents of the result based on the probability of their importance to the input user.
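
    The abstract does not say which smoothing technique is applied to the click-through triplets. As a minimal sketch only, the Python below assumes simple linear interpolation: a per-user click estimate is blended with an all-users estimate for the same query, so a user with no history for that query falls back to the crowd's behavior. The function name `rank_documents`, the weight `lam`, and the sample data are hypothetical and are not taken from the patent.

```python
from collections import Counter

def rank_documents(triplets, user, query, candidates, lam=0.7):
    """Order candidates by a smoothed estimate of P(doc | user, query).

    Assumption: linear interpolation between the user's own clicks and
    everyone's clicks for the query; the patent may use another method.
    """
    uq_counts = Counter()  # clicks by this user for this query
    q_counts = Counter()   # clicks by all users for this query
    for u, q, d in triplets:
        if q == query:
            q_counts[d] += 1
            if u == user:
                uq_counts[d] += 1
    uq_total = sum(uq_counts.values())
    q_total = sum(q_counts.values())

    def score(doc):
        p_global = q_counts[doc] / q_total if q_total else 0.0
        if uq_total == 0:
            return p_global  # no personal data: back off completely
        p_personal = uq_counts[doc] / uq_total
        return lam * p_personal + (1 - lam) * p_global

    return sorted(candidates, key=score, reverse=True)

# Hypothetical click-through triplets: (user, query, clicked document)
clicks = [
    ("alice", "jaguar", "cars.example"),
    ("alice", "jaguar", "cars.example"),
    ("bob", "jaguar", "zoo.example"),
    ("carol", "jaguar", "zoo.example"),
    ("carol", "jaguar", "zoo.example"),
]
candidates = ["zoo.example", "cars.example"]
ranking_alice = rank_documents(clicks, "alice", "jaguar", candidates)
ranking_dave = rank_documents(clicks, "dave", "jaguar", candidates)
print(ranking_alice)  # ['cars.example', 'zoo.example']
print(ranking_dave)   # ['zoo.example', 'cars.example']
```

    Here "dave" has no recorded clicks, so his ordering follows the aggregate click counts, while "alice" sees her own preference ranked first.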

    METHOD AND SYSTEM FOR ADAPTING SEARCH RESULTS TO PERSONAL INFORMATION NEEDS
    2.
    Patent application
    METHOD AND SYSTEM FOR ADAPTING SEARCH RESULTS TO PERSONAL INFORMATION NEEDS (Status: Active)

    Publication No.: US20100057798A1

    Publication date: 2010-03-04

    Application No.: US12616739

    Filing date: 2009-11-11

    IPC classification: G06F17/30

    Abstract: A method and system for adapting search results of a query to the information needs of the user submitting the query is provided. A search system analyzes click-through triplets indicating that a user submitted a query and that the user selected a document from the results of the query. To overcome the large size and sparseness of the click-through data, the search system, when presented with an input triplet comprising a user, a query, and a document, determines a probability that the user will find the input document important by smoothing the click-through triplets. The search system then orders documents of the result based on the probability of their importance to the input user.

    Method and system for adapting search results to personal information needs
    3.
    Granted patent
    Method and system for adapting search results to personal information needs (Status: Lapsed)

    Publication No.: US07630976B2

    Publication date: 2009-12-08

    Application No.: US11125839

    Filing date: 2005-05-10

    IPC classification: G06F17/30 G06Q30/00

    Abstract: A method and system for adapting search results of a query to the information needs of the user submitting the query is provided. A search system analyzes click-through triplets indicating that a user submitted a query and that the user selected a document from the results of the query. To overcome the large size and sparseness of the click-through data, the search system, when presented with an input triplet comprising a user, a query, and a document, determines a probability that the user will find the input document important by smoothing the click-through triplets. The search system then orders documents of the result based on the probability of their importance to the input user.

    Determining relevance using queries as surrogate content
    5.
    Patent application
    Determining relevance using queries as surrogate content (Status: Under examination, published)

    Publication No.: US20070005588A1

    Publication date: 2007-01-04

    Application No.: US11174438

    Filing date: 2005-07-01

    IPC classification: G06F17/30

    CPC classification: G06F16/951

    Abstract: A method and system for determining the relevance of a document to a query based on surrogate content is provided. The relevance system associates queries with documents. The relevance system calculates the relevance of a document to a query based at least in part on the similarity of the associated queries to the query. When multiple queries are associated with a document, the relevance system may provide a weight for each query for calculating a combined relevance score for the associated queries.
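
    The idea of scoring a document through its associated queries can be sketched as follows. This is an illustration under stated assumptions, not the patented method: token-level Jaccard overlap stands in for the unspecified similarity measure, the combined score is a weighted average, and the weights (for example, how often each past query led to the document) are hypothetical.

```python
def jaccard(a, b):
    """Token-set overlap between two query strings (assumed similarity measure)."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0

def surrogate_relevance(query, associated):
    """Weighted average similarity between `query` and a document's
    associated queries, given as a list of (past_query, weight) pairs."""
    total = sum(weight for _, weight in associated)
    if total == 0:
        return 0.0
    return sum(weight * jaccard(query, past) for past, weight in associated) / total

# Hypothetical queries previously associated with one document.
doc_queries = [("cheap flights paris", 3.0), ("paris hotels", 1.0)]
relevance = surrogate_relevance("flights to paris", doc_queries)
print(relevance)  # 0.4375
```

    The document's own text is never inspected here; the associated queries act as its surrogate content.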

    Collaborative filtering using cluster-based smoothing
    6.
    Patent application
    Collaborative filtering using cluster-based smoothing (Status: Under examination, published)

    Publication No.: US20070239553A1

    Publication date: 2007-10-11

    Application No.: US11377130

    Filing date: 2006-03-16

    IPC classification: G06Q30/00

    Abstract: In an embodiment, a method of predicting an active user's rating for an item is disclosed. A database of users may be sorted into clusters. The data associated with the users in each cluster may be smoothed to fill in ratings for items that the users have not personally rated. An active user may then be compared to a set of users, where the set may be all or some portion of the database, to determine the K users that are most similar to the active user. The ratings of the K users regarding the item may be used to predict the active user's rating for the item. In an embodiment, the rating of each of the K users is assigned a confidence value associated with whether the user personally rated the item or the rating was generated by the data smoothing process.
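
    The steps in this abstract (cluster, smooth, pick the K nearest users, weight by confidence) can be sketched roughly as below. Several details are assumptions made for illustration: clusters are supplied rather than computed, missing ratings are filled with the cluster's mean for the item, similarity is 1 / (1 + mean absolute rating difference), and smoothed ratings receive a fixed confidence of 0.5. None of these choices is stated in the abstract.

```python
def smooth(ratings, clusters):
    """Fill unrated items with the cluster mean for that item.
    Returns {user: {item: (rating, personally_rated)}}."""
    items = {i for row in ratings.values() for i in row}
    smoothed = {}
    for members in clusters:
        for item in items:
            vals = [ratings[u][item] for u in members if item in ratings[u]]
            mean = sum(vals) / len(vals) if vals else None
            for u in members:
                row = smoothed.setdefault(u, {})
                if item in ratings[u]:
                    row[item] = (ratings[u][item], True)
                elif mean is not None:
                    row[item] = (mean, False)
    return smoothed

def similarity(a, b):
    """Assumed measure: 1 / (1 + mean absolute difference on shared items)."""
    common = set(a) & set(b)
    if not common:
        return 0.0
    diff = sum(abs(a[i] - b[i]) for i in common) / len(common)
    return 1.0 / (1.0 + diff)

def predict(active, item, smoothed, k=2, smoothed_conf=0.5):
    """Predict the active user's rating from the K most similar users."""
    neighbours = []
    for row in smoothed.values():
        if item in row:
            plain = {i: r for i, (r, _) in row.items()}
            neighbours.append((similarity(active, plain), row[item]))
    top = sorted(neighbours, key=lambda n: n[0], reverse=True)[:k]
    num = den = 0.0
    for sim, (rating, personal) in top:
        weight = sim * (1.0 if personal else smoothed_conf)
        num += weight * rating
        den += weight
    return num / den if den else None

# Hypothetical ratings and precomputed clusters.
ratings = {
    "u1": {"A": 5, "B": 4},
    "u2": {"A": 5, "C": 2},
    "u3": {"A": 1, "B": 2, "C": 5},
    "u4": {"A": 2, "C": 4},
}
clusters = [["u1", "u2"], ["u3", "u4"]]
smoothed = smooth(ratings, clusters)
prediction = predict({"A": 5, "B": 4}, "C", smoothed)
print(prediction)  # 2.0
```

    In this toy data the two nearest users are u1 and u2; u1's rating for item C was produced by smoothing, so it counts for half as much as u2's personal rating.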

    Method and system for detecting when an outgoing communication contains certain content
    9.
    Granted patent
    Method and system for detecting when an outgoing communication contains certain content (Status: Lapsed)

    Publication No.: US07594277B2

    Publication date: 2009-09-22

    Application No.: US10881867

    Filing date: 2004-06-30

    Abstract: A method and system for detecting whether an outgoing communication contains confidential information or other target information is provided. The detection system is provided with a collection of documents that contain confidential information, referred to as “confidential documents.” When the detection system is provided with an outgoing communication, it compares the content of the outgoing communication to the content of the confidential documents. If the outgoing communication contains confidential information, then the detection system may prevent the outgoing communication from being sent outside the organization. The detection system detects confidential information based on the similarity between the content of an outgoing communication and the content of confidential documents that are known to contain confidential information.
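
    A similarity check of this kind might look like the sketch below. The abstract does not name a similarity measure, so this example assumes word-level 3-gram (shingle) containment with an arbitrary threshold of 0.3; the function names and sample texts are hypothetical.

```python
def shingles(text, n=3):
    """Set of n-word sequences in the text (lowercased, whitespace-split)."""
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}

def containment(message, document, n=3):
    """Fraction of the message's shingles that also occur in the document."""
    msg = shingles(message, n)
    if not msg:
        return 0.0
    return len(msg & shingles(document, n)) / len(msg)

def should_block(message, confidential_docs, threshold=0.3):
    """True if the message overlaps enough with any confidential document."""
    return any(containment(message, doc) >= threshold for doc in confidential_docs)

# Hypothetical confidential collection and outgoing messages.
confidential = ["the merger with acme corp closes on the first of march"]
leak = "fyi the merger with acme corp closes soon"
harmless = "lunch is at noon in the cafeteria today"
print(should_block(leak, confidential))      # True
print(should_block(harmless, confidential))  # False
```

    Containment is measured relative to the outgoing message, so a short message that quotes a long confidential document can still score highly.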

    Method and system for classifying and identifying messages as question or not a question within a discussion thread
    10.
    Granted patent
    Method and system for classifying and identifying messages as question or not a question within a discussion thread (Status: Lapsed)

    Publication No.: US07590603B2

    Publication date: 2009-09-15

    Application No.: US10957329

    Filing date: 2004-10-01

    IPC classification: G06F15/18

    CPC classification: G06F17/30707

    Abstract: A method and system for classifying messages of a discussion thread as questions is provided. A classification system generates a classifier to classify messages of discussion threads as question messages or non-question messages. The system trains the classifier using the feature vectors and input classifications derived from a training set of discussion threads. After the classifier is trained, the classification system uses the classifier to classify messages within a corpus of discussion threads as question or non-question messages. To classify a message, the classification system generates a feature vector for the message and submits that feature vector to the classifier. The classifier generates a score for the message indicating a likelihood that the message is a question message.
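
    The train-then-score flow described here can be illustrated with a small stand-in. The abstract names neither the features nor the classifier, so this sketch assumes two hand-picked features (presence of a question mark, a leading wh-word) plus a bias term, and a logistic regression trained by batch gradient descent. The training messages are invented for the example.

```python
import math

WH_WORDS = {"who", "what", "when", "where", "why", "how"}

def features(message):
    """Hypothetical feature vector: [has '?', starts with wh-word, bias]."""
    words = message.lower().split()
    return [
        1.0 if "?" in message else 0.0,
        1.0 if words and words[0] in WH_WORDS else 0.0,
        1.0,
    ]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train(examples, epochs=2000, lr=0.1):
    """Fit logistic-regression weights on (message, label) pairs."""
    data = [(features(m), y) for m, y in examples]
    weights = [0.0, 0.0, 0.0]
    for _ in range(epochs):
        grad = [0.0, 0.0, 0.0]
        for x, y in data:
            err = sigmoid(sum(w * xi for w, xi in zip(weights, x))) - y
            for j, xi in enumerate(x):
                grad[j] += err * xi
        weights = [w - lr * g for w, g in zip(weights, grad)]
    return weights

def score(weights, message):
    """Likelihood-style score that the message is a question."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, features(message))))

# Invented training set: 1 = question message, 0 = non-question message.
training = [
    ("How do I reset my password?", 1),
    ("Does anyone know a fix?", 1),
    ("What is the right driver for this", 1),
    ("Thanks, that worked.", 0),
    ("I reinstalled the driver yesterday.", 0),
    ("Try rebooting first.", 0),
]
weights = train(training)
question_score = score(weights, "Why does it crash?")
statement_score = score(weights, "It crashed again.")
print(question_score > 0.5, statement_score < 0.5)
```

    A real system would derive many more features from the thread, such as the position of a message or its author; the two used here only show the shape of the pipeline.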