发明申请
- 专利标题: Content-Based Information Retrieval
- 专利标题(中): 基于内容的信息检索
-
申请号: US12417511申请日: 2009-04-02
-
公开(公告)号: US20100257202A1公开(公告)日: 2010-10-07
- 发明人: Martin Szummer , Andrew Fitzgibbon , Lorenzo Torresani
- 申请人: Martin Szummer , Andrew Fitzgibbon , Lorenzo Torresani
- 申请人地址: US WA Redmond
- 专利权人: Microsoft Corporation
- 当前专利权人: Microsoft Corporation
- 当前专利权人地址: US WA Redmond
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
摘要:
Content-based information retrieval is described. In an example, a query item such as an image, document, email or other item is presented and items with similar content are retrieved from a database of items. In an example, each time a query is presented, a classifier is formed based on that query and using a training set of items. For example, the classifier is formed in real-time and is formed in such a way that a limit on the proportion of the items in the database that will be retrieved is set. In an embodiment, the query item is analyzed to identify tokens in that item and subsets of those tokens are selected to form the classifier. For example, the subsets of tokens are combined using Boolean operators in a manner which is efficient for searching on particular types of database.
公开/授权文献
- US08346800B2 Content-based information retrieval 公开/授权日:2013-01-01
信息查询