Adjustment of document relationship graphs

    公开(公告)号:US10255272B2

    公开(公告)日:2019-04-09

    申请号:US15472977

    申请日:2017-03-29

    申请人: Quid, Inc.

    IPC分类号: G06F17/27 G06F17/30

    摘要: Provided is a process of modifying semantic similarity graphs representative of pair-wise similarity between documents in a corpus, the method comprising obtaining a semantic similarity graph that comprises more than 500 nodes and more than 1000 weighted edges, each node representing a document of a corpus, and each edge weight indicating an amount of similarity between a pair of documents corresponding to the respective nodes connected by the respective edge; obtaining an n-gram indicating that edge weights affected by the n-gram are to be increased or decreased; expanding the n-gram to produce a set of expansion n-grams; adjusting edge weights of edges between pairs of documents in which members of the expanded n-gram set co-occur.

    Adjustment of document relationship graphs

    公开(公告)号:US09645999B1

    公开(公告)日:2017-05-09

    申请号:US15226093

    申请日:2016-08-02

    申请人: Quid, Inc.

    IPC分类号: G06F17/27 G06F17/30

    摘要: Provided is a process of modifying semantic similarity graphs representative of pair-wise similarity between documents in a corpus, the method comprising obtaining a semantic similarity graph that comprises more than 500 nodes and more than 1000 weighted edges, each node representing a document of a corpus, and each edge weight indicating an amount of similarity between a pair of documents corresponding to the respective nodes connected by the respective edge; obtaining an n-gram indicating that edge weights affected by the n-gram are to be increased or decreased; expanding the n-gram to produce a set of expansion n-grams; adjusting edge weights of edges between pairs of documents in which members of the expanded n-gram set co-occur.

    Summarizing collections of documents

    公开(公告)号:US09767165B1

    公开(公告)日:2017-09-19

    申请号:US15206703

    申请日:2016-07-11

    申请人: Quid, Inc.

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30867 G06F17/30719

    摘要: Provided is a process including: obtaining a corpus having a plurality of document collections, each of which is associated with features; for a given document collection, computing a pertinence score for each feature; ranking the features based on the features' pertinence scores; selecting a first set of features based on a first coverage score thereof and a threshold; re-ranking the first set of features based on the features' relevance to the document collection; and selecting a second set of features from the first set of features based on a second coverage score thereof and the threshold, the second set of features being used for summarizing the document collection.

    Topic-influenced document relationship graphs

    公开(公告)号:US09715495B1

    公开(公告)日:2017-07-25

    申请号:US15379724

    申请日:2016-12-15

    申请人: Quid, Inc.

    IPC分类号: G06F17/30 G06F7/00 G06F17/27

    摘要: Provided is a process of enhancing or suppressing measures of relationships between documents based on the relationships arising from text pertaining to selected topics, the process including: obtaining a corpus of documents; obtaining a set of topics by self-extracting topics according to the tokens present in text of the documents or manual provision; receiving a selected topic among the set of topics, the selection indicating that relationships between the documents are to be enhanced or suppressed in virtue of the relationships arising from text relating to the selected topic; forming a relationship graph of the documents, wherein: the relationships between pairs of the documents are determined based on co-occurrence of n-grams in the pairs of the documents, and wherein the relationships are enhanced or suppressed in response to co-occurring n-grams being in the respective set of n-grams of the selected topic.

    Topic-influenced document relationship graphs

    公开(公告)号:US10769381B2

    公开(公告)日:2020-09-08

    申请号:US15629001

    申请日:2017-06-21

    申请人: Quid, Inc.

    摘要: Provided is a process of enhancing or suppressing measures of relationships between documents based on the relationships arising from text pertaining to selected topics, the process including: obtaining a corpus of documents; obtaining a set of topics by self-extracting topics according to the tokens present in text of the documents or manual provision; receiving a selected topic among the set of topics, the selection indicating that relationships between the documents are to be enhanced or suppressed in virtue of the relationships arising from text relating to the selected topic; forming a relationship graph of the documents, wherein: the relationships between pairs of the documents are determined based on co-occurrence of n-grams in the pairs of the documents, and wherein the relationships are enhanced or suppressed in response to co-occurring n-grams being in the respective set of n-grams of the selected topic.

    TOPIC-INFLUENCED DOCUMENT RELATIONSHIP GRAPHS

    公开(公告)号:US20180173699A1

    公开(公告)日:2018-06-21

    申请号:US15629001

    申请日:2017-06-21

    申请人: Quid, Inc.

    IPC分类号: G06F17/27 G06F17/30

    摘要: Provided is a process of enhancing or suppressing measures of relationships between documents based on the relationships arising from text pertaining to selected topics, the process including: obtaining a corpus of documents; obtaining a set of topics by self-extracting topics according to the tokens present in text of the documents or manual provision; receiving a selected topic among the set of topics, the selection indicating that relationships between the documents are to be enhanced or suppressed in virtue of the relationships arising from text relating to the selected topic; forming a relationship graph of the documents, wherein: the relationships between pairs of the documents are determined based on co-occurrence of n-grams in the pairs of the documents, and wherein the relationships are enhanced or suppressed in response to co-occurring n-grams being in the respective set of n-grams of the selected topic.

    ADJUSTMENT OF DOCUMENT RELATIONSHIP GRAPHS
    7.
    发明申请

    公开(公告)号:US20180039620A1

    公开(公告)日:2018-02-08

    申请号:US15472977

    申请日:2017-03-29

    申请人: Quid, Inc.

    IPC分类号: G06F17/27 G06F17/30

    摘要: Provided is a process of modifying semantic similarity graphs representative of pair-wise similarity between documents in a corpus, the method comprising obtaining a semantic similarity graph that comprises more than 500 nodes and more than 1000 weighted edges, each node representing a document of a corpus, and each edge weight indicating an amount of similarity between a pair of documents corresponding to the respective nodes connected by the respective edge; obtaining an n-gram indicating that edge weights affected by the n-gram are to be increased or decreased; expanding the n-gram to produce a set of expansion n-grams; adjusting edge weights of edges between pairs of documents in which members of the expanded n-gram set co-occur.