DOCUMENT RETRIEVAL DEVICE
    1.
    发明公开

    公开(公告)号:US20230359653A1

    公开(公告)日:2023-11-09

    申请号:US18245061

    申请日:2021-09-07

    CPC classification number: G06F16/332 G06F16/338

    Abstract: To improve accuracy in a document search for a document that includes a typographical error. A document retrieval device according to one embodiment of the present invention includes a misidentification table storing a correctly identified character string and a misidentified character string. The document retrieval device includes a document searcher. The document searcher obtains a search character string, and retrieves the search character string from both a document and a character string that is obtained by changing the misidentified character string included in the document to the correctly identified character string.

    DOCUMENT RETRIEVAL APPARATUS, DOCUMENT RETRIEVAL SYSTEM, DOCUMENT RETRIEVAL PROGRAM, AND DOCUMENT RETRIEVAL METHOD

    公开(公告)号:US20220019581A1

    公开(公告)日:2022-01-20

    申请号:US17310439

    申请日:2020-02-10

    Abstract: A document retrieval apparatus includes an input reception unit configured to receive an input of a keyword, a document acquisition unit configured to acquire an author's name and a document file from a digital document database which stores document files of text data obtained by performing a character recognition process with respect to document image data of handwritten documents, and names of authors who wrote the handwritten documents, a keyword acquisition unit configured to reference an associating keyword database which stores information associating the authors' names, keywords, and associating keywords, and acquire an associating keyword of the input keyword, from the input keyword received by the input reception unit and the author's name acquired by the document acquisition unit, a document search unit configured to search the document file acquired by the document acquisition unit, using the input keyword and the acquired associating keyword, and a search result output unit configured to output a search result of the document search unit.

Patent Agency Ranking