- 专利标题: Techniques for generating a topic model
-
申请号: US16445256申请日: 2019-06-19
-
公开(公告)号: US11914966B2公开(公告)日: 2024-02-27
- 发明人: Esther Goldbraich
- 申请人: International Business Machines Corporation
- 申请人地址: US NY Armonk
- 专利权人: International Business Machines Corporation
- 当前专利权人: International Business Machines Corporation
- 当前专利权人地址: US NY Armonk
- 代理商 Caleb D. Wilkes
- 主分类号: G06F40/40
- IPC分类号: G06F40/40 ; G06N20/00 ; G06F16/28 ; H04L51/52
摘要:
In some examples, a system for generating a topic model includes a processor that can process a set of documents to generate training data, wherein each document in the set of documents is associated with one or more users. The processor can also generate a plurality of topic models using the training data, such that each topic model includes a different number of topics. The processor can also generate an evaluation score for each of the topic models based on information about the users associated with the documents included in the training data. The evaluation score describes a percentage of topics that exhibit a specified level of interest from a specified number of users. The processor can also identify a final topic model based on the evaluation scores and store the final topic model to be used in natural language processing.
信息查询