Real time implicit user modeling for personalized search
    1.
    发明授权
    Real time implicit user modeling for personalized search 有权
    用于个性化搜索的实时隐式用户建模

    公开(公告)号:US08442973B2

    公开(公告)日:2013-05-14

    申请号:US11743076

    申请日:2007-05-01

    IPC分类号: G06F7/00 G06F17/00

    CPC分类号: G06F17/30554 G06F17/30867

    摘要: A method and apparatus for utilizing user behavior to immediately modify sets of search results so that the most relevant documents are moved to the top. In one embodiment of the invention, behavior data, which can come from virtually any activity, is used to infer the user's intent. The updated inferred implicit user model is then exploited immediately by re-ranking the set of matched documents to best reflect the information need of the user. The system updates the user model and immediately re-ranks documents at every opportunity in order to constantly provide the most optimal results. In another embodiment, the system determines, based on the similarity of results sets, if the current query belongs in the same information session as one or more previous queries. If so, the current query is expanded with additional keywords in order to improve the targeting of the results.

    摘要翻译: 一种用于利用用户行为立即修改搜索结果集的方法和装置,使得最相关的文档被移动到顶部。 在本发明的一个实施例中,可以使用几乎任何活动的行为数据来推断用户的意图。 然后通过重新排列匹配文档的集合来立即利用更新的推断的隐式用户模型,以最好地反映用户的信息需求。 系统更新用户模型,并在每个机会立即重新排列文档,以不断提供最佳结果。 在另一个实施例中,系统基于结果集的相似性来确定当前查询是否属于与一个或多个先前查询相同的信息会话。 如果是,则使用其他关键字扩展当前查询,以改进结果的定位。

    Method and apparatus for profile score threshold setting and updating

    公开(公告)号:US06587850B2

    公开(公告)日:2003-07-01

    申请号:US10162804

    申请日:2002-06-05

    申请人: Chengxiang Zhai

    发明人: Chengxiang Zhai

    IPC分类号: G06F1730

    摘要: A novel approach for filtering documents involves the use of delivery ratio threshold setting technique to set an initial profile score threshold and the use of beta-gamma regulation for dynamic threshold updating. A group of documents is scored pursuant to a user profile. The score for each document is indicative of the relevance of the corresponding document to the user profile. The score can be compared with a profile score threshold to decide if the document should be accepted or rejected. According to one aspect of the invention, the initial threshold is set to a score threshold that approximates an expected ratio of acceptable documents calibrated with respect to a set of reference documents. According to another aspect of the invention, the score threshold can be updated based on the accumulated example documents, user's relevance judgment, and the user's utility function. The accumulated example documents are first scored against a profile and a ranked list of scored documents is obtained. Each position at the ranked list corresponds to a candidate score threshold as well as a utility value computed based on the relevance status of the example documents. From these candidate threshold points, an optimal utility threshold and a zero utility threshold are determined. Using the optimal utility threshold and the zero utility threshold, a new utility threshold is calculated by interpolating between estimates of the optimal utility threshold and the zero utility threshold. This new utility threshold is used for subsequent information retrieval and filtering.

    Method and apparatus for profile score threshold setting and updating
    3.
    发明授权
    Method and apparatus for profile score threshold setting and updating 失效
    配置文件分数阈值设置和更新的方法和装置

    公开(公告)号:US06463434B2

    公开(公告)日:2002-10-08

    申请号:US10023918

    申请日:2001-12-17

    申请人: Chengxiang Zhai

    发明人: Chengxiang Zhai

    IPC分类号: G06F1730

    摘要: A novel approach for filtering documents involves the use of delivery ratio threshold setting technique to set an initial profile score threshold and the use of beta-gamma regulation for dynamic threshold updating. A group of documents is scored pursuant to a user profile. The score for each document is indicative of the relevance of the corresponding document to the user profile. The score can be compared with a profile score threshold to decide if the document should be accepted or rejected. According to one aspect of the invention, the initial threshold is set to a score threshold that approximates an expected ratio of acceptable documents calibrated with respect to a set of reference documents. According to another aspect of the invention, the score threshold can be updated based on the accumulated example documents, user's relevance judgment, and the user's utility function. The accumulated example documents are first scored against a profile and a ranked list of scored documents is obtained. Each position at the ranked list corresponds to a candidate score threshold as well as a utility value computed based on the relevance status of the example documents. From these candidate threshold points, an optimal utility threshold and a zero utility threshold are determined. Using the optimal utility threshold and the zero utility threshold, a new utility threshold is calculated by interpolating between estimates of the optimal utility threshold and the zero utility threshold. This new utility threshold is used for subsequent information retrieval and filtering.

    摘要翻译: 用于过滤文档的新颖方法涉及使用传送比阈值设置技术来设置初始简档分数阈值以及使用β-gamma调节来进行动态阈值更新。 根据用户配置文件对一组文档进行评分。 每个文档的分数表示相应文档与用户简档的相关性。 可以将得分与简档分数阈值进行比较,以确定文档是否应被接受或拒绝。 根据本发明的一个方面,将初始阈值设置为近似相对于一组参考文档校准的可接受文档的预期比率的分数阈值。 根据本发明的另一方面,可以基于累积的示例文档,用户的相关性判断和用户的效用函数来更新得分阈值。 累积的示例文档首先对配置文件进行评分,并获得打分文档的排名列表。 排名列表中的每个位置对应于候选分数阈值以及基于示例文档的相关性状态计算的效用值。 从这些候选阈值点,确定最佳效用阈值和零效用阈值。 使用最优效用阈值和零效用阈值,通过在最优效用阈值和零效用阈值的估计之间进行内插来计算新的效用阈值。 这个新的实用阈值用于后续的信息检索和过滤。

    REAL TIME IMPLICIT USER MODELING FOR PERSONALIZED SEARCH
    4.
    发明申请
    REAL TIME IMPLICIT USER MODELING FOR PERSONALIZED SEARCH 有权
    用于个性化搜索的实时隐含用户建模

    公开(公告)号:US20080114751A1

    公开(公告)日:2008-05-15

    申请号:US11743076

    申请日:2007-05-01

    IPC分类号: G06F7/06

    CPC分类号: G06F17/30554 G06F17/30867

    摘要: A method and apparatus for utilizing user behavior to immediately modify sets of search results so that the most relevant documents are moved to the top. In one embodiment of the invention, behavior data, which can come from virtually any activity, is used to infer the user's intent. The updated inferred implicit user model is then exploited immediately by re-ranking the set of matched documents to best reflect the information need of the user. The system updates the user model and immediately re-ranks documents at every opportunity in order to constantly provide the most optimal results. In another embodiment, the system determines, based on the similarity of results sets, if the current query belongs in the same information session as one or more previous queries. If so, the current query is expanded with additional keywords in order to improve the targeting of the results.

    摘要翻译: 一种用于利用用户行为立即修改搜索结果集的方法和装置,使得最相关的文档被移动到顶部。 在本发明的一个实施例中,可以使用几乎任何活动的行为数据来推断用户的意图。 然后通过重新排列匹配文档的集合来立即利用更新的推断的隐式用户模型,以最好地反映用户的信息需求。 系统更新用户模型,并在每个机会立即重新排列文档,以不断提供最佳结果。 在另一个实施例中,系统基于结果集的相似性来确定当前查询是否属于与一个或多个先前查询相同的信息会话。 如果是,则使用其他关键字扩展当前查询,以改进结果的定位。

    Method and apparatus for profile score threshold setting and updating

    公开(公告)号:US06430559B1

    公开(公告)日:2002-08-06

    申请号:US09432005

    申请日:1999-11-02

    申请人: Chengxiang Zhai

    发明人: Chengxiang Zhai

    IPC分类号: G06F1730

    摘要: A novel approach for filtering documents involves the use of delivery ratio threshold setting technique to set an initial profile score threshold and the use of beta-gamma regulation for dynamic threshold updating. A group of documents is scored pursuant to a user profile. The score for each document is indicative of the relevance of the corresponding document to the user profile. The score can be compared with a profile score threshold to decide if the document should be accepted or rejected. According to one aspect of the invention, the initial threshold is set to a score threshold that approximates an expected ratio of acceptable documents calibrated with respect to a set of reference documents. According to another aspect of the invention, the score threshold can be updated based on the accumulated example documents, user's relevance judgment, and the user's utility function. The accumulated example documents are first scored against a profile and a ranked list of scored documents is obtained. Each position at the ranked list corresponds to a candidate score threshold as well as a utility value computed based on the relevance status of the example documents. From these candidate threshold points, an optimal utility threshold and a zero utility threshold are determined. Using the optimal utility threshold and the zero utility threshold, a new utility threshold is calculated by interpolating between estimates of the optimal utility threshold and the zero utility threshold. This new utility threshold is used for subsequent information retrieval and filtering.

    Method and apparatus for profile score threshold setting and updating

    公开(公告)号:US06535876B2

    公开(公告)日:2003-03-18

    申请号:US10162829

    申请日:2002-06-05

    申请人: Chengxiang Zhai

    发明人: Chengxiang Zhai

    IPC分类号: G06F1730

    摘要: A novel approach for filtering documents involves the use of delivery ratio threshold setting technique to set an initial profile score threshold and the use of beta-gamma regulation for dynamic threshold updating. A group of documents is scored pursuant to a user profile. The score for each document is indicative of the relevance of the corresponding document to the user profile. The score can be compared with a profile score threshold to decide if the document should be accepted or rejected. According to one aspect of the invention, the initial threshold is set to a score threshold that approximates an expected ratio of acceptable documents calibrated with respect to a set of reference documents. According to another aspect of the invention, the score threshold can be updated based on the accumulated example documents, user's relevance judgment, and the user's utility function. The accumulated example documents are first scored against a profile and a ranked list of scored documents is obtained. Each position at the ranked list corresponds to a candidate score threshold as well as a utility value computed based on the relevance status of the example documents. From these candidate threshold points, an optimal utility threshold and a zero utility threshold are determined. Using the optimal utility threshold and the zero utility threshold, a new utility threshold is calculated by interpolating between estimates of the optimal utility threshold and the zero utility threshold. This new utility threshold is used for subsequent information retrieval and filtering.