-
Publication No.: US20110004573A1
Publication date: 2011-01-06
Application No.: US12497467
Filing date: 2009-07-02
IPC classification: G06F15/18
CPC classification: G06N99/005, G06F17/30707
Abstract: Systems, methods and articles of manufacture are disclosed for identifying a training document for a content classifier. One or more thresholds may be defined for designating a document as a training document for a content classifier. A plurality of documents may be evaluated to compute a score for each respective document. The score may represent suitability of a document for training the content classifier with respect to a category. The score may be computed based on content of the plurality of documents, metadata of the plurality of documents, link structure of the plurality of documents, user feedback (e.g., user supplied document tags) received for the plurality of documents, and document metrics received for the plurality of documents. Based on the computed scores, a training document may be selected. The content classifier may be trained using the selected training document.
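The scoring and selection steps described in this abstract could be sketched roughly as follows. This is a minimal illustration, not the method claimed in the application: the abstract does not say how the five signals are combined, so the weighted sum, the `WEIGHTS` values, the `Document` fields, and the function names are all assumptions made for the example. Each signal is assumed to be already normalized to the range 0 to 1 for a single category.

```python
from dataclasses import dataclass


@dataclass
class Document:
    """One candidate document with per-signal scores for a single category.

    All fields are hypothetical; the abstract names the signals but not
    how they are measured. Each value is assumed to lie in [0, 1].
    """
    doc_id: str
    content: float    # evidence from the document text
    metadata: float   # evidence from document metadata
    links: float      # evidence from link structure
    feedback: float   # evidence from user feedback, e.g. user-supplied tags
    metrics: float    # evidence from document metrics


# Assumed weights for combining the signals; not taken from the source.
WEIGHTS = {
    "content": 0.4,
    "metadata": 0.1,
    "links": 0.1,
    "feedback": 0.3,
    "metrics": 0.1,
}


def suitability(doc, weights=WEIGHTS):
    """Combine the per-signal scores into one suitability score (weighted sum)."""
    return sum(w * getattr(doc, name) for name, w in weights.items())


def select_training_documents(docs, threshold):
    """Return documents whose score meets the threshold, best first."""
    scored = [(suitability(d), d) for d in docs]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [d for score, d in scored if score >= threshold]
```

A caller would pick a threshold per category and pass the selected documents to whatever classifier training routine is in use; the threshold trades training-set size against label quality.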
-
Publication No.: US20110004609A1
Publication date: 2011-01-06
Application No.: US12497463
Filing date: 2009-07-02
CPC classification: G06F17/30707, G06F17/30648, G06F17/30867, G06Q10/10
Abstract: Systems, methods and articles of manufacture are disclosed for generating search results based on user feedback. A request may be received to generate search results retrieved using a search string. The request may include user feedback for one or more selected documents of the search results. Improved search results may be generated based on the search results and the feedback for one or more selected documents of the search results. The improved search results may be output to a graphical display device.
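One simple way to picture feedback-driven improvement of search results is a re-ranking pass over the original result list. The sketch below is only an assumption-laden illustration: the abstract does not state how feedback changes the results, so the additive `boost`, the +1/-1 feedback encoding, and the `rerank` function name are hypothetical.

```python
def rerank(results, feedback, boost=0.5):
    """Re-rank search results using per-document user feedback.

    results:  list of (doc_id, base_score) pairs from the original search.
    feedback: dict mapping doc_id to +1 (relevant) or -1 (not relevant);
              documents without feedback keep their base score.
    boost:    assumed size of the adjustment applied per unit of feedback.
    """
    adjusted = [
        (doc_id, score + boost * feedback.get(doc_id, 0))
        for doc_id, score in results
    ]
    # Highest adjusted score first; sorted() is stable, so ties keep
    # their original relative order.
    return sorted(adjusted, key=lambda pair: pair[1], reverse=True)
```

For example, with results `[("d1", 0.9), ("d2", 0.8), ("d3", 0.7)]` and feedback `{"d1": -1, "d3": 1}`, the document marked relevant moves to the top and the one marked not relevant drops to the bottom.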
-