Abstract:
The present invention provides an apparatus for searching information based on Wikipedia content, comprising: a document converting part that extracts full-text documents, section-title documents, infobox documents, category documents, and definition-statement documents from original Wikipedia documents and generates at least one Wikipedia document for questions and answers; a document indexing part that analyzes each Wikipedia document for questions and answers, extracts part-of-speech (POS) based index terms from it, and generates a Wikipedia document index for questions and answers; a question analyzing part that receives a natural language question, analyzes a question pattern, an answer pattern, and a question focus from the natural language question, and extracts document search keywords; a document searching part that searches each Wikipedia document index for questions and answers using the document search keywords and generates a document search result from each index; an answer extracting part that extracts first answers from the document search results by using the question pattern, the answer pattern, and the question focus; and an answer integrating part that integrates and prioritizes the first answers to generate a second answer.
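The flow of parts described above can be illustrated with a minimal sketch. It is not the claimed implementation: every function name, the toy article data, and the rule table for answer patterns are assumptions made for illustration. In particular, a stopword filter stands in for POS-based index term extraction, article titles stand in for extracted answers, and the answer pattern is computed but not used to filter candidates.

```python
import re
from collections import defaultdict

# Hypothetical names for the five document types listed in the abstract.
DOC_TYPES = ("fulltext", "section_title", "infobox", "category", "definition")

# Assumption: a small stopword list replaces real POS tagging.
STOPWORDS = {"a", "an", "and", "the", "is", "was", "of", "in", "on",
             "who", "what", "when", "where", "which", "did", "does"}


def index_terms(text):
    """Stand-in for POS-based index terms: lowercase tokens minus stopwords."""
    return [t for t in re.findall(r"[a-z0-9]+", text.lower())
            if t not in STOPWORDS]


def build_indexes(articles):
    """Converting + indexing parts: one inverted index per document type."""
    indexes = {doc_type: defaultdict(set) for doc_type in DOC_TYPES}
    for article in articles:
        for doc_type in DOC_TYPES:
            for term in index_terms(article.get(doc_type, "")):
                indexes[doc_type][term].add(article["title"])
    return indexes


def analyze_question(question):
    """Question analyzing part: toy answer-pattern rule plus search keywords."""
    words = question.lower().split()
    first_word = words[0] if words else ""
    patterns = {"who": "PERSON", "when": "DATE", "where": "LOCATION"}
    return {
        "answer_pattern": patterns.get(first_word, "OTHER"),
        "keywords": index_terms(question),
    }


def search(indexes, keywords):
    """Searching part: one result per index, scored by matching keywords."""
    results = {}
    for doc_type, index in indexes.items():
        scores = defaultdict(int)
        for keyword in keywords:
            for title in index.get(keyword, ()):
                scores[title] += 1
        results[doc_type] = dict(scores)
    return results


def integrate(results):
    """Extracting + integrating parts: sum per-index scores and rank titles."""
    totals = defaultdict(int)
    for scores in results.values():
        for title, score in scores.items():
            totals[title] += score
    return sorted(totals.items(), key=lambda item: (-item[1], item[0]))


# Toy input, invented for this sketch.
articles = [
    {"title": "Alan Turing",
     "fulltext": "Alan Turing proposed the Turing machine.",
     "category": "mathematicians computer scientists",
     "definition": "Alan Turing was an English mathematician and computer scientist."},
    {"title": "Ada Lovelace",
     "fulltext": "Ada Lovelace wrote notes on the Analytical Engine.",
     "category": "mathematicians",
     "definition": "Ada Lovelace was an English mathematician."},
]

analysis = analyze_question("Who proposed the Turing machine?")
ranked = integrate(search(build_indexes(articles), analysis["keywords"]))
print(analysis["answer_pattern"], ranked)
```

Keeping a separate index per document type mirrors the abstract's statement that a search result is generated from each index, so the integration step can later weight, say, definition-statement matches differently from full-text matches. The sketch weights all indexes equally.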