-
公开(公告)号:US20200233872A1
公开(公告)日:2020-07-23
申请号:US16251464
申请日:2019-01-18
发明人: John G. Vergo , Anuradha Bhamidipaty , Justin Platz , Alan M. Webb , Jeffrey Owen Kephart , Danny Soroker , Daniel M. Gruen , Julie Macnaught , Michael Abraham Tanenblatt , Siva Sankalp Patel
IPC分类号: G06F16/2457 , G06Q10/06
摘要: A similarity determination method, system, and computer program product, including using a description of companies for making a list of query entities, calculating a set of similar companies for each company on the list of query entities, employing a voting scheme to rank the results of the calculating, ordering a final set of the results based on the voting scheme and presenting them back to the user as a first ranked list, iteratively repeating the calculating by adding a second set of new companies and recalculating a second ranked list of recommended companies based on the updated query list, combining the first ranked list and the second ranked into a single set of companies of a combined list while remembering which of the first ranked list and the second ranked list from which each company originated, and visualizing the combined list based on which original list the companies came from. The technique can be extended to an arbitrary number of lists.
-
公开(公告)号:US20160217200A1
公开(公告)日:2016-07-28
申请号:US15045331
申请日:2016-02-17
发明人: Sara H. Basson , Kember A.-R. Forcke , Richard T. Goodwin , Kaan K. Katircioglu , Meir M. Laker , Jonathan Lenchner , Pietro Mazzoleni , Nitinchandra R. Nayak , John G. Vergo , Wlodek W. Zadrozny
CPC分类号: G06F16/285 , G06F16/245 , G06F16/24573 , G06F16/355 , G06F16/93 , G06F17/241
摘要: A model of a domain is received, wherein the model has a plurality of elements. A corpus of select documents covering the plurality of elements of the model is also received. A plurality of select topics is generated from the corpus of select documents. Topics of an additional document are compared to the plurality of select topics to calculate a distance between the topics of the additional document and the plurality of select topics. Upon the distance meeting a threshold value, a new corpus is generated to include the additional document. The new document is annotated with the plurality of elements of the model.
-
公开(公告)号:US20150347467A1
公开(公告)日:2015-12-03
申请号:US14287474
申请日:2014-05-27
发明人: Sara H. Basson , Kember A.-R. Forcke , Richard T. Goodwin , Kaan K. Katircioglu , Meir M. Laker , Jonathan Lenchner , Pietro Mazzoleni , Nitinchandra R. Nayak , John G. Vergo , Wlodek W. Zadrozny
IPC分类号: G06F17/30
CPC分类号: G06F16/285 , G06F16/245 , G06F16/24573 , G06F16/355 , G06F16/93 , G06F17/241
摘要: A model of a domain is received, wherein the model has a plurality of elements. A corpus of select documents covering the plurality of elements of the model is also received. A plurality of select topics is generated from the corpus of select documents. Topics of an additional document are compared to the plurality of select topics to calculate a distance between the topics of the additional document and the plurality of select topics. Upon the distance meeting a threshold value, a new corpus is generated to include the additional document. The new document is annotated with the plurality of elements of the model.
摘要翻译: 接收到域的模型,其中模型具有多个元素。 也接收到覆盖模型的多个元素的选择文档的语料库。 从选择文档的语料库生成多个选择主题。 将附加文档的主题与多个选择主题进行比较,以计算附加文档的主题与多个选择主题之间的距离。 当距离达到阈值时,生成新的语料库以包括附加文档。 新文档用模型的多个元素注释。
-
公开(公告)号:US10671601B2
公开(公告)日:2020-06-02
申请号:US14562987
申请日:2014-12-08
发明人: Sara H. Basson , Kember A.-R. Forcke , Richard T. Goodwin , Kaan K. Katircioglu , Meir M. Laker , Pietro Mazzoleni , Nitinchandra R. Nayak , John G. Vergo
IPC分类号: G06F7/00 , G06F17/00 , G06F16/242 , G06Q30/00
摘要: Receiving a first model associated with a user, a generic model of a generic domain, and a specific domain having an associated domain-specific corpus. A first set of query terms based on elements of the first model, and a second set of query terms based on elements of the generic model, are determined. A third set of query terms is generated based on the first and second sets of query terms. The domain specific corpus is queried using the third set of query terms, and a domain specific model is generated based on results of the querying.
-
公开(公告)号:US20160162538A1
公开(公告)日:2016-06-09
申请号:US14562987
申请日:2014-12-08
发明人: Sara H. Basson , Kember A.-R. Forcke , Richard T. Goodwin , Kaan K. Katircioglu , Meir M. Laker , Pietro Mazzoleni , Nitinchandra R. Nayak , John G. Vergo
IPC分类号: G06F17/30
CPC分类号: G06F16/2425 , G06Q30/00
摘要: Receiving a first model associated with a user, a generic model of a generic domain, and a specific domain having an associated domain-specific corpus. A first set of query terms based on elements of the first model, and a second set of query terms based on elements of the generic model, are determined. A third set of query terms is generated based on the first and second sets of query terms. The domain specific corpus is queried using the third set of query terms, and a domain specific model is generated based on results of the querying.
摘要翻译: 接收与用户相关联的第一模型,通用域的通用模型以及具有关联的域特定语料库的特定域。 确定基于第一模型的元素的第一组查询项和基于通用模型的元素的第二组查询项。 基于第一组和第二组查询项生成第三组查询项。 使用第三组查询项查询域特定语料库,并根据查询结果生成域特定模型。
-
-
-
-