发明授权
- 专利标题: Bootstrap and adapt a document search engine
- 专利标题(中): 引导和调整文档搜索引擎
-
申请号: US12726358申请日: 2010-03-18
-
公开(公告)号: US08527534B2公开(公告)日: 2013-09-03
- 发明人: Kuansan Wang , Bo-June Hsu , Xiaolong Li
- 申请人: Kuansan Wang , Bo-June Hsu , Xiaolong Li
- 申请人地址: US WA Redmond
- 专利权人: Microsoft Corporation
- 当前专利权人: Microsoft Corporation
- 当前专利权人地址: US WA Redmond
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
Architecture that employs a modeling technique based on language modeling to estimate a probability of a document matching the user need as expressed in the query. The modeling technique is based on the data mining results that various portions of a document (e.g., body, title, URL, anchor text, user queries) use different styles of human languages. Thus, the results based on a language can be adapted individually to match the language of query. Since the approach is based on adaptation, the framework also provides a natural means to progressively revise the model as user data are collected. Different styles of languages in a document can be recognized and adapted individually. Background language models are also employed that offer a fallback approach in case the document has incomplete fields of data, and can utilize topical or semantic hierarchy of the knowledge domain.
公开/授权文献
- US20110231394A1 BOOTSTRAP AND ADAPT A DOCUMENT SEARCH ENGINE 公开/授权日:2011-09-22
信息查询