摘要:
A method for automatically detecting errors in machine translation using a parallel corpus includes analyzing morphemes of a target language sentence in the parallel corpus and a machine-translated target language sentence, corresponding to a source language sentence, to classify the morphemes into words; aligning by words and decoding, respectively, a group of the source language sentence and the machine-translated target language sentence, and a group of the source language sentence and the target language sentence in the parallel corpus; classifying by types errors in the machine-translated target language sentence by making a comparison, word by word, between the decoded target language sentence in the parallel corpus and the decoded machine-translated target language sentence; and computing error information in the machine-translated target language sentence by examining a frequency of occurrence of the classified error types.
摘要:
A machine-translation apparatus using multi-level verbal-phrase patterns includes: a simple sentence generation unit for generating an input simple sentence; a basic verbal-phrase pattern-matching unit for trying a match of a semantic code of each case component of the input simple sentence with basic verbal-phrase patterns; a default verbal-phrase pattern matching unit for trying a match of a size and case prepositions of the input simple sentence with default verbal-phrase patterns having a verb identical to that of the input simple sentence; a default word-order matching unit for trying a match of a word-order of the input simple sentence with default word-order verbal-phrase patterns having a case component structure identical to that of the input simple sentence; and a default preposition matching unit for generating a target sentence of an input sentence with default preposition patterns having a context identical to that of the input simple sentence.
摘要:
The present invention relates to a method and apparatus for constructing translation knowledge to be used in a translator. According to the invention, a source-language sentence and a target-language sentence are converted by receiving the source-language sentence and the target-language sentence corresponding to the source-language sentence and attaching a prototype, a part-of-speech, relative position information, and syntactic information in a base phrase to each morpheme of the source-language sentence and the target-language sentence. Then, word alignment and syntactic alignment are performed in the converted source-language sentence and target-language sentence, thereby extracting translation knowledge on words and syntaxes, translation knowledge on a subcategory of a bilingual inflected-word, and translation knowledge on a bilingual sentence pattern based on the results of the word and syntactic alignment.
摘要:
The method for retrieving a similar sentence to a source sentence inputted by a user through a translation memory in a translation aid system is provided. An inverted file of an index word and a translation memory from a parallel corpus are constituted. Candidate sentences having a high similarity are filtered by comparing the source sentence provided by the user with sentences of the constituted translation memory. A source sentence and a corresponding target sentence are outputted in the order of similarity by calculating similarity between the filtered candidate sentences and the source sentence.