摘要:
A document processing apparatus 200 has a processor that executes programs, and a memory that stores the programs to be executed by the processor. The document processing apparatus 200 links a certain character array in a document with a character array located to a right side thereof from the certain character array or a region including the certain character array towards the right side thereof and below, and generating a network for multiple hypothetical document structures by linking the certain character array to a character array located therebelow.
摘要:
Sentence polarity determination is used to assess whether a sentence is an affirmative expression or a negative expression, and is applied for reputation analysis, etc. Polarity determination determines whether an input sentence is affirmative or negative. When some subject is being talked about, it is sometimes desired to determine whether what is being referred to in the sentence is affirmative or negative, rather than the polarity of the sentence per se. The present invention provides a method for determining the polarity of the sentence by applying a recursive polarity rule based on a dependency structure of the sentence, taking into consideration the portion of the sentence that is being referred to. Use of a recursive rule makes it possible to prevent the number of rules from becoming huge, and thereby to perform efficient polarity determination in terms of memory amount and calculation amount. The length of the dependency needed for polarity determination can also be efficiently controlled.
摘要:
The problem solved by this invention is to convert text information in a geology report to numerical values which reflects geological characteristics of a well's subsurface. Prior art referred above cannot be applicable to this problem. Since text information in the geology report is in the natural language form. This information is not widely used in this industry, due to the fact that the text information can be hardly extracted and summarized into numerical values and integrated into current physical geology models or statistical models. This invention makes the text information in geology report, which is often in a natural language form, easier to be integrated into current geology physical models or statistical models. Also, the numerical values extracted from the geology report can be integrated with other kinds of data, such as seismic data and well-logging data, to obtain more accurate and comprehensive analysis results.