Document search system and document search method

    公开(公告)号:US12169515B2

    公开(公告)日:2024-12-17

    申请号:US17612248

    申请日:2020-05-11

    Abstract: A document search system that enables efficient document search regardless of the ability of a user is achieved. Document search is performed using a document search system in which database document data is stored. After first document data and second document data are input to the document search system, the document search system extracts a plurality of terms from the first document data. The extraction of the terms is performed using morphological analysis, for example. Next, the extracted terms are weighted on the basis of the second document data. For example, texts included in a document represented by the second document data are classified into first and second texts. Among the terms extracted from the first document data, the weight of the term included in the first text is set larger than the weights of the other terms. The classification of the texts can be performed in accordance with a rule basis or using machine learning. After that, the similarity of the database document data to the first document data is calculated on the basis of the weighted term.

    Document retrieval system and method for retrieving document

    公开(公告)号:US12086181B2

    公开(公告)日:2024-09-10

    申请号:US17791316

    申请日:2020-12-28

    Abstract: A document retrieval system retrieving a document with the concept of the document taken into account is provided. The system includes a processing portion and the processing portion creates a retrieval graph from a retrieval composition. The retrieval graph includes first to m-th retrieval local graphs (m is an integer of greater than or equal to 1), and the retrieval local graphs are each constituted by two nodes and one edge. The processing portion performs retrieval of first to m-th sentences on a reference document. The i-th sentence (i is an integer of greater than or equal to 1 and less than or equal to m) includes one of the two nodes in the i-th retrieval local graph or a related term or a hyponym of the one of the two nodes; the other of the two nodes in the i-th retrieval local graph or a related term or a hyponym of the other of the two nodes; and the edge in the i-th retrieval local graph or a related term or a hyponym of the edge. A mark is assigned to the score of the reference document in accordance with the number of sentences included in the reference document among the first to m-th sentences.

Patent Agency Ranking