摘要:
A language analysis apparatus of the invention includes division rules, each of which is classified into one of levels according to the degree of risk of causing analysis accuracy problems when applied; a division point candidate generation unit 21 which, when a character string whose length is greater than the predetermined maximum input length is input, generates division point candidates for the input character string by applying the division rules sequentially one by one in the ascending order of the level of risk of causing problems; a division point adjustment unit 22 which, when the length of a division unit candidate obtained by the division point candidate generated by the division point candidate generation unit 21 is less than the maximum input length, selects a combination of division points from among the division point candidates obtained by applying division rules of the same level while ensuring that each division unit is not greater in length than the maximum input length; and a division unit 23 which divides an input character string at the division point determined by the division point adjustment unit.
摘要:
A natural-language processing system (10) includes a registration-candidate storage section (32) that stores therein registration-candidate dictionary data, a judgment means (22) that compares input data against the registration-candidate dictionary data to thereby judge whether or not the input data includes a word corresponding to the registration-candidate dictionary data, an inquiry means (23) that inquires to a user whether or not corresponding dictionary data is to be registered in a dictionary storage section (31) to accept a user's instruction if it is judged that a corresponding word exists, a dictionary registration means (24) that registers the corresponding dictionary data in the dictionary storage section based on the input instruction, and a natural language processing means (25) that executes a natural-language processing onto the input data by using the dictionary data registered in the dictionary storage section.
摘要:
A language processing device includes first analysis unit 21 that subjects a natural language sentence containing a polysemic word and other words to a predetermined analysis and outputs a plurality of analysis results for the natural language sentence according to a plurality of meanings of the polysemic word, second analysis unit 23 that performs a particular analysis on the analysis results outputted from first analysis unit 21, and employs one of the analysis results, and generation unit 244 that generates a deletion rule for deleting one or more unnecessary analysis results of the first analysis unit 21 which has been deleted from the analysis results outputted from first analysis unit 21 but employed by second analysis unit 23, according to the analysis results outputted from the first analysis unit 21 and the employment result of second analysis unit 23.
摘要:
Disclosed is an information providing system comprising a receiving unit that receives an information request from a requester, a data storage unit that stores data, a detection processing unit that analyzes the content of the information request and extracts provision candidate data corresponding to the information request from the data storage unit, a responder output device to which the content of the information request and the provision candidate data are output, a responder input device that receives instruction information on whether or not the provision candidate data is to be provided, a response control unit that determines whether or not there is providable data based on the received instruction information and the provision candidate data, and an answer generating unit that generates answer data using the decision result by the response control unit.
摘要:
Disclosed is an information providing system comprising a receiving unit that receives an information request from a requester, a data storage unit that stores data, a detection processing unit that analyzes the content of the information request and extracts provision candidate data corresponding to the information request from the data storage unit, a responder output device to which the content of the information request and the provision candidate data are output, a responder input device that receives instruction information on whether or not the provision candidate data is to be provided, a response control unit that determines whether or not there is providable data based on the received instruction information and the provision candidate data, and an answer generating unit that generates answer data using the decision result by the response control unit.
摘要:
A language processing device includes first analysis unit 21 that subjects a natural language sentence containing a polysemic word and other words to a predetermined analysis and outputs a plurality of analysis results for the natural language sentence according to a plurality of meanings of the polysemic word, second analysis unit 23 that performs a particular analysis on the analysis results outputted from first analysis unit 21, and employs one of the analysis results, and generation unit 244 that generates a deletion rule for deleting one or more unnecessary analysis results of the first analysis unit 21 which has been deleted from the analysis results outputted from first analysis unit 21 but employed by second analysis unit 23, according to the analysis results outputted from the first analysis unit 21 and the employment result of second analysis unit 23.
摘要:
A natural-language processing system includes a registration-candidate storage section that stores therein registration-candidate dictionary data, a judgment means that compares input data against the registration-candidate dictionary data to thereby judge whether or not the input data includes a word corresponding to the registration-candidate dictionary data, an inquiry means that inquires to a user whether or not corresponding dictionary data is to be registered in a dictionary storage section to accept a user's instruction if it is judged that a corresponding word exists, a dictionary registration means that registers the corresponding dictionary data in the dictionary storage section based on the input instruction, and a natural-language processing means that executes a natural-language processing onto the input data by using the dictionary data registered in the dictionary storage section.
摘要:
A language analysis apparatus of the invention includes division rules, each of which is classified into one of levels according to the degree of risk of causing analysis accuracy problems when applied; a division point candidate generation unit 21 which, when a character string whose length is greater than the predetermined maximum input length is input, generates division point candidates for the input character string by applying the division rules sequentially one by one in the ascending order of the level of risk of causing problems; a division point adjustment unit 22 which, when the length of a division unit candidate obtained by the division point candidate generated by the division point candidate generation unit 21 is less than the maximum input length, selects a combination of division points from among the division point candidates obtained by applying division rules of the same level while ensuring that each division unit is not greater in length than the maximum input length; and a division unit 23 which divides an input character string at the division point determined by the division point adjustment unit.
摘要:
A preprocessing unit (3) identifies an original text input from an input unit (1) as one of a plurality of types of basic element functions defined in advance as basic element functions that construct a description format of a claim, and outputs the original text in basic element functions. A translation unit (4) changes a translation manner in accordance with the type of basic element function of the original text output the preprocessing unit (3). This enables to appropriately translate a claim.
摘要:
There is provided a dictionary registration system which makes it possible to register a word into a user dictionary while minimizing an adverse effect that the word may have on natural language processing, if any. The dictionary registration system performs natural language processing by using a user dictionary, and includes a data processing apparatus that performs the natural language processing by managing and using the user dictionary and a storage apparatus that retains system dictionary information and user dictionary information for use in the natural language processing. The storage apparatus includes the system dictionary information for use in the natural language processing, and the user dictionary. The data processing apparatus includes: a word information registering init that registers information on an input word into the user dictionary; a difference creating unit that creates differences in a result of processing between a first result of processing when the natural language processing is performed, by using the system dictionary, information and a second result of processing when the natural language processing is performed by using the system dictionary information and the user dictionary information; a correct-incorrect accepting unit that accepts correct-incorrect judgments as to whether changes from the first result of processing to the second result of processing are correct or incorrect, the changes corresponding to the differences created by the difference creating unit; and dictionary registration unit that registers registration information on the accepted word into the user dictionary along with part or all of pairs of the correct-incorrect judgments accepted and input sentences from which the differences given the respective correct-incorrect judgments are created.