Apparatus and method for searching information based on Wikipedia's contents

    公开(公告)号:US10037381B2

    公开(公告)日:2018-07-31

    申请号:US14260828

    申请日:2014-04-24

    CPC classification number: G06F16/951 G06F16/3331

    Abstract: The present invention is to provide an apparatus for searching information based on Wikipedia's contents comprising: a document converting part extracting fulltext documents, section title documents, info-box documents, category documents and definition statement documents from Wikipedia original documents and generating at least one of Wikipedia documents for questions and answers; a document indexing part analyzing the Wikipedia document for questions and answers, extracting POS-based index terms from the Wikipedia document for questions and answers, and generating a Wikipedia document index for questions and answers; a question analyzing part receiving a natural language question, analyzing a question pattern, an answer pattern and a question focus from the natural language question, and extracting document search keywords; a document searching part performing document search by using the document search keywords from the Wikipedia document index for questions and answers and generating document search result from each Wikipedia document index for questions and answers; an answer extracting part extracting first answers by using information about the question pattern, the answer pattern and the question focus from the document search result; and an answer integrating part integrating and prioritizing the first answer and generating a second answer.

Patent Agency Ranking