发明授权
- 专利标题: Pseudo-anchor text extraction
- 专利标题(中): 伪锚文本提取
-
申请号: US12697056申请日: 2010-01-29
-
公开(公告)号: US08073838B2公开(公告)日: 2011-12-06
- 发明人: Shuming Shi , Ji-Rong Wen , Mingjie Zhu , Fei Xing , Zaiqing Nie
- 申请人: Shuming Shi , Ji-Rong Wen , Mingjie Zhu , Fei Xing , Zaiqing Nie
- 申请人地址: US WA Redmond
- 专利权人: Microsoft Corporation
- 当前专利权人: Microsoft Corporation
- 当前专利权人地址: US WA Redmond
- 代理机构: Lee & Hayes, PLLC
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
A search method uses pseudo-anchor text associated with search objects to improve search performance. The pseudo-anchor text may be extracted in combination with an identifier of the search objects (such as a pseudo-URL) from a digital corpus such as a collection of documents. Pseudo-anchor texts for each object are preferably extracted from candidate anchor blocks using a machine learning based approach. The pseudo-anchor texts are made available for searching and used to help rank the objects in a search result to improve search performance. The method may be used in vertical search of objects such as published articles, products and images that lack explicit URLs and anchor text information.
公开/授权文献
- US20100145956A1 PSEUDO-ANCHOR TEXT EXTRACTION 公开/授权日:2010-06-10
信息查询