System and method for inferring user interest based on analysis of user-generated metadata
    1.
    Type: Granted patent
    Status: Active

    Publication No.: US08707160B2

    Publication date: 2014-04-22

    Application No.: US11502869

    Filing date: 2006-08-10

    IPC classification: G06F17/00

    CPC classification: G06Q30/02

    Abstract: User-generated tags from viewing web-based content are collected over a predetermined period of time. A subset of distinct or unique tags is identified from among the collected tags. A z-score is calculated for each identified distinct tag, where the z-score is a measure of the statistical significance of the tag. The subset of distinct tags is then sorted based on their corresponding z-scores. All distinct tags having a corresponding z-score lower than a predetermined threshold are rejected, and the remaining distinct tags, having a corresponding z-score higher than the threshold, are used to infer a user's interests. Inferring a user's interests from the remaining distinct tags may thus benefit web-based applications by predicting the interests of users with a high degree of accuracy, leveraging the user-generated content tags and keywords.
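
    The collect, score, sort, and threshold steps in the abstract above can be sketched as follows. The abstract does not say how the z-score is computed, so this sketch assumes a binomial model: each tag's observed count is compared with the count expected from a background tag frequency. The names `infer_interests`, `background_prob`, `default_prob`, and the threshold value are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def infer_interests(user_tags, background_prob, threshold=2.0, default_prob=1e-4):
    """Return (tag, z) pairs whose z-score exceeds the threshold, highest first.

    Assumption (not from the abstract): z = (observed - n*p) / sqrt(n*p*(1-p)),
    where p is the background probability of the tag across all users.
    """
    counts = Counter(user_tags)          # distinct tags with their counts
    n = len(user_tags)                   # total tags collected in the period
    scored = []
    for tag, observed in counts.items():
        p = background_prob.get(tag, default_prob)
        std = math.sqrt(n * p * (1.0 - p))
        if std == 0:                     # degenerate background rate, skip the tag
            continue
        scored.append((tag, (observed - n * p) / std))
    scored.sort(key=lambda pair: pair[1], reverse=True)   # sort by z-score
    return [(tag, z) for tag, z in scored if z > threshold]


if __name__ == "__main__":
    tags = ["jazz"] * 8 + ["news"] * 2
    background = {"jazz": 0.05, "news": 0.30}
    print(infer_interests(tags, background))
```

    In this toy input, "jazz" is used far more often than its background rate predicts, so it survives the threshold, while "news" is used less often than expected and is rejected.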

    System and method for inferring user interest based on analysis of user-generated metadata
    2.
    Type: Patent application
    Status: Active

    Publication No.: US20080040301A1

    Publication date: 2008-02-14

    Application No.: US11502869

    Filing date: 2006-08-10

    IPC classification: G06F15/18 G06F7/00

    CPC classification: G06Q30/02

    Abstract: Methods and systems are provided for inferring a user's interests from user-generated tags of web-based content. In accordance with the invention, user-generated tags from viewing web-based content are collected over a predetermined period of time. A subset of distinct or unique tags is identified from among the collected tags. A z-score is calculated for each identified distinct tag, where the z-score is a measure of the statistical significance of the tag. The subset of distinct tags is then sorted based on their corresponding z-scores. All distinct tags having a corresponding z-score lower than a predetermined threshold are rejected, and the remaining distinct tags, having a corresponding z-score higher than the threshold, are used to infer a user's interests. Inferring a user's interests from the remaining distinct tags may thus benefit web-based applications by predicting the interests of users with a high degree of accuracy, leveraging the user-generated content tags and keywords.

    Session based click features for recency ranking
    3.
    Type: Granted patent
    Status: Active

    Publication No.: US08326815B2

    Publication date: 2012-12-04

    Application No.: US12725357

    Filing date: 2010-03-16

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30864

    Abstract: In one embodiment, access one or more query chains, wherein each one of the query chains comprises two or more search queries, {q1, . . . , qn}, which are recency-sensitive, are related to the same subject matter, and are issued to a search engine sequentially, and actual click-through information associated with each one of the query chains; and smooth each one of the query chains using the actual click-through information associated with the query chain. To smooth one of the query chains comprises, for each one of search queries, qj, in the query chain, where 2≦j≦n, if one of the network resources identified for qj has actually been clicked in connection with qj by the corresponding one network user, then presume that the one network resource has been clicked in connection with one or more search queries, qk, in the query chain, where 1≦k

    Session based click features for recency ranking
    4.
    Type: Granted patent
    Status: Active

    Publication No.: US08255390B2

    Publication date: 2012-08-28

    Application No.: US12725310

    Filing date: 2010-03-16

    IPC classification: G06F7/00 G06F17/30

    CPC classification: G06F17/30864

    Abstract: In one embodiment, access one or more query-resource pairs, wherein for each one of the query-resource pairs comprising one of one or more search queries and one of one or more network resources, the one search query is recency-sensitive with respect to a particular time period, and the one network resource is identified for the one search query, and a resource-view count and a resource-click count associated with each one of the query-resource pairs; and construct one or more first click features using the resource-view counts and the resource-click counts associated with the query-resource pairs. Constructing one of the first click features in connection with one of the query-resource pairs comprises determining an only-resource-click count associated with the one query-resource pair, and calculating a ratio between the only-resource-click count and the resource-view count associated with the one query-resource pair as the one first click feature.
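
    The ratio described in the abstract above can be illustrated with a short sketch. The abstract does not define the only-resource-click count, so this sketch assumes it is the number of sessions in which the resource was the only result clicked for the query. The function name `only_click_ratio` and the session tuple layout are hypothetical.

```python
from collections import defaultdict


def only_click_ratio(sessions):
    """Map (query, resource) -> only-resource-click count / resource-view count.

    Each session is (query, shown_resources, clicked_resources).
    Assumption: a click counts as an "only" click when exactly one distinct
    resource was clicked in that session.
    """
    views = defaultdict(int)
    only_clicks = defaultdict(int)
    for query, shown, clicked in sessions:
        for resource in shown:
            views[(query, resource)] += 1          # resource-view count
        distinct = set(clicked)
        if len(distinct) == 1:
            (resource,) = distinct
            only_clicks[(query, resource)] += 1    # only-resource-click count
    return {pair: only_clicks.get(pair, 0) / count for pair, count in views.items()}


if __name__ == "__main__":
    log = [
        ("quake news", ["a", "b"], ["a"]),        # "a" is the only click
        ("quake news", ["a", "b"], ["a", "b"]),   # two clicks, no "only" click
        ("quake news", ["a"], []),                # viewed, never clicked
    ]
    print(only_click_ratio(log))
```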

    Detection of abnormal user click activity in a search results page
    5.
    Type: Granted patent
    Status: Active

    Publication No.: US07860870B2

    Publication date: 2010-12-28

    Application No.: US11755972

    Filing date: 2007-05-31

    IPC classification: G06F17/30

    CPC classification: G06F17/30867

    Abstract: The present invention provides for the detection of abnormal user behavior for a query session of an electronic search engine. A query session is initiated upon receipt of a user search request that includes one or more search terms. The search engine, in accordance with known search technology, generates a search results page that includes various hyperlinks, including, for example, web content hyperlinks, page navigation hyperlinks, and advertising hyperlinks. Tracking user activities generates the clickstream associated with the search results page. The present invention determines a probability score for the clickstream, and this score is then normalized. A comparison of the normalized probability score with other normalized probability scores for similar query sessions determines the normalcy of the query session.
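
    A minimal sketch of the scoring and comparison steps in the abstract above follows. The abstract names neither the probability model nor the normalization, so this sketch assumes a first-order Markov model over click types, a log-probability normalized by clickstream length, and a z-score comparison against peer sessions. The names `normalized_score`, `is_abnormal`, `transitions`, and `max_z` are hypothetical.

```python
import math
from statistics import mean, pstdev


def normalized_score(clickstream, transitions, floor=1e-6):
    """Length-normalized log-probability of a clickstream.

    Assumption: transitions[(previous, current)] holds a first-order Markov
    probability; unseen transitions fall back to a small floor value.
    """
    if not clickstream:
        return 0.0
    log_p = 0.0
    previous = "start"
    for action in clickstream:
        log_p += math.log(transitions.get((previous, action), floor))
        previous = action
    return log_p / len(clickstream)       # normalize by number of clicks


def is_abnormal(score, peer_scores, max_z=3.0):
    """Flag a session whose score lies far from scores of similar sessions."""
    mu = mean(peer_scores)
    sigma = pstdev(peer_scores)
    if sigma == 0:
        return score != mu
    return abs(score - mu) / sigma > max_z


if __name__ == "__main__":
    transitions = {
        ("start", "web"): 0.6, ("web", "web"): 0.3, ("web", "nav"): 0.2,
        ("start", "ad"): 0.1, ("ad", "ad"): 0.01,
    }
    peers = [normalized_score(s, transitions)
             for s in (["web", "web"], ["web", "nav"], ["web"])]
    suspect = normalized_score(["ad"] * 5, transitions)
    print(is_abnormal(suspect, peers))
```

    Here a session that clicks the same advertising link five times in a row scores far below its peers and is flagged, while an ordinary two-click session is not.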

    Value Maximizing Recommendation Systems
    6.
    Type: Patent application
    Status: Active

    Publication No.: US20120016772A1

    Publication date: 2012-01-19

    Application No.: US12838169

    Filing date: 2010-07-16

    IPC classification: G06Q30/00 G06F17/30

    Abstract: A server determines a plurality of immediate candidate items for a first web page to recommend to a user. For each particular immediate candidate item of the plurality of immediate candidate items, the server determines a separate sequence of two or more subsequent possible candidate items for subsequent web pages to recommend to the user in the event that the user selects the particular immediate candidate item. Further, the server selects a particular immediate candidate item from the plurality of immediate candidate items for the first web page to recommend to the user. The first web page that recommends the plurality of immediate candidate items is generated and sent over the Internet to the user.
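
    The selection step in the abstract above can be sketched as a look-ahead over each candidate's follow-up sequence. The abstract does not state the selection criterion, so this sketch assumes the server picks the immediate candidate with the highest expected total value, where each follow-up item contributes its value weighted by the probability that the user reaches it. The names `pick_immediate`, `follow_ups`, `value`, and `click_prob` are hypothetical.

```python
def pick_immediate(candidates, follow_ups, value, click_prob):
    """Pick the immediate candidate with the highest expected total value.

    Assumption: expected value = value of the immediate item plus, for each
    item in its follow-up sequence, the item's value times the probability
    that the user has clicked through every follow-up up to and including it.
    """
    def expected_value(item):
        total = value[item]
        reach = 1.0
        for nxt in follow_ups.get(item, []):
            reach *= click_prob[nxt]       # chance the user gets this far
            total += reach * value[nxt]
        return total

    return max(candidates, key=expected_value)


if __name__ == "__main__":
    follow_ups = {"a": ["a1", "a2"], "b": ["b1", "b2"]}
    value = {"a": 1.0, "b": 0.8, "a1": 0.5, "a2": 0.5, "b1": 3.0, "b2": 2.0}
    click_prob = {"a1": 0.1, "a2": 0.1, "b1": 0.5, "b2": 0.5}
    print(pick_immediate(["a", "b"], follow_ups, value, click_prob))
```

    In this toy data, item "a" has the higher immediate value, but item "b" leads to a more valuable sequence, so the look-ahead selects "b" where a greedy choice would select "a".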

    SESSION BASED CLICK FEATURES FOR RECENCY RANKING
    7.
    Type: Patent application
    Status: Active

    Publication No.: US20110231390A1

    Publication date: 2011-09-22

    Application No.: US12725310

    Filing date: 2010-03-16

    IPC classification: G06F17/30 G06F15/18

    CPC classification: G06F17/30864

    Abstract: In one embodiment, access one or more query-resource pairs, wherein for each one of the query-resource pairs comprising one of one or more search queries and one of one or more network resources, the one search query is recency-sensitive with respect to a particular time period, and the one network resource is identified for the one search query, and a resource-view count and a resource-click count associated with each one of the query-resource pairs; and construct one or more first click features using the resource-view counts and the resource-click counts associated with the query-resource pairs. Constructing one of the first click features in connection with one of the query-resource pairs comprises determining an only-resource-click count associated with the one query-resource pair, and calculating a ratio between the only-resource-click count and the resource-view count associated with the one query-resource pair as the one first click feature.

    DETECTION OF ABNORMAL USER CLICK ACTIVITY IN A SEARCH RESULTS PAGE
    8.
    Type: Patent application
    Status: Active

    Publication No.: US20080301090A1

    Publication date: 2008-12-04

    Application No.: US11755972

    Filing date: 2007-05-31

    IPC classification: G06F7/00

    CPC classification: G06F17/30867

    Abstract: The present invention provides for the detection of abnormal user behavior for a query session of an electronic search engine. A query session is initiated upon receipt of a user search request that includes one or more search terms. The search engine, in accordance with known search technology, generates a search results page that includes various hyperlinks, including, for example, web content hyperlinks, page navigation hyperlinks, and advertising hyperlinks. Tracking user activities generates the clickstream associated with the search results page. The present invention determines a probability score for the clickstream, and this score is then normalized. A comparison of the normalized probability score with other normalized probability scores for similar query sessions determines the normalcy of the query session.

    Value maximizing recommendation systems
    9.
    Type: Granted patent
    Status: Active

    Publication No.: US08583502B2

    Publication date: 2013-11-12

    Application No.: US12838169

    Filing date: 2010-07-16

    IPC classification: G06Q30/00

    Abstract: A server determines a plurality of immediate candidate items for a first web page to recommend to a user. For each particular immediate candidate item of the plurality of immediate candidate items, the server determines a separate sequence of two or more subsequent possible candidate items for subsequent web pages to recommend to the user in the event that the user selects the particular immediate candidate item. Further, the server selects a particular immediate candidate item from the plurality of immediate candidate items for the first web page to recommend to the user. The first web page that recommends the plurality of immediate candidate items is generated and sent over the Internet to the user.

    SESSION BASED CLICK FEATURES FOR RECENCY RANKING
    10.
    Type: Patent application
    Status: Active

    Publication No.: US20110231380A1

    Publication date: 2011-09-22

    Application No.: US12725357

    Filing date: 2010-03-16

    IPC classification: G06F17/30

    CPC classification: G06F17/30864

    Abstract: In one embodiment, access one or more query chains, wherein each one of the query chains comprises two or more search queries, {q1, . . . , qn}, which are recency-sensitive, are related to the same subject matter, and are issued to a search engine sequentially, and actual click-through information associated with each one of the query chains; and smooth each one of the query chains using the actual click-through information associated with the query chain. To smooth one of the query chains comprises, for each one of search queries, qj, in the query chain, where 2≦j≦n, if one of the network resources identified for qj has actually been clicked in connection with qj by the corresponding one network user, then presume that the one network resource has been clicked in connection with one or more search queries, qk, in the query chain, where 1≦k
