Abstract:
Disclosed are an apparatus and a method for constructing a verb phrase translation pattern using a bilingual parallel corpus. The apparatus and method recognize a predicate and an argument using a syntax analysis result and a word alignment result of a source sentence from a plurality of bilingual parallel corpora, extract translation pattern candidates and their occurrence frequencies using the recognized predicate and argument, generate a basic verb phrase translation pattern by verifying the translation pattern candidates, and generalize the basic verb phrase translation pattern to generate a general verb phrase pattern. The patterns can thereby be applied to various language pairs, errors in the verb phrase translation pattern are minimized, and an appropriate generalization level is determined using a co-occurrence frequency of the predicate and the argument of the verb phrase translation pattern and a translation probability of the predicate.
Abstract:
An apparatus and a method for constructing a verb phrase translation pattern using a bilingual parallel corpus. Each of the apparatus and the method recognizes a predicate and an argument using a syntax analysis result and a word alignment result of a source sentence from a plurality of bilingual parallel corpora, extracts translation pattern candidates and their occurrence frequencies using the recognized predicate and argument, generates a basic verb phrase translation pattern by verifying the translation pattern candidates, and generalizes the basic verb phrase translation pattern to generate a general verb phrase pattern. The patterns can thereby be applied to various language pairs, errors in the verb phrase translation pattern are minimized, and an appropriate generalization level is determined using a co-occurrence frequency of the predicate and the argument of the verb phrase translation pattern and a translation probability of the predicate.
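The extract, verify, and generalize steps described in this abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the toy tuples (with romanized placeholder target words), the function names, the `min_count` frequency check used as "verification", and the rule that a predicate is generalized when one target predicate dominates its translations (`min_prob`) and it co-occurs with several distinct arguments (`min_args`).

```python
from collections import Counter, defaultdict

# Hypothetical toy input: (source predicate, source argument,
# target predicate, target argument), as if already recovered from a
# syntactic parse of the source sentence plus word alignment links.
aligned_pairs = (
    [("take", "medicine", "meokda", "yak")] * 3
    + [("take", "bus", "tada", "beoseu")] * 2
    + [("take", "photo", "jjikda", "sajin")] * 1
    + [("open", "door", "yeolda", "mun")] * 2
    + [("open", "window", "yeolda", "changmun")] * 2
)

def count_candidates(pairs):
    """Count each lexicalized pattern candidate and its occurrence frequency."""
    return Counter(((sp, sa), (tp, ta)) for sp, sa, tp, ta in pairs)

def verify(candidates, min_count=2):
    """Keep candidates seen at least min_count times (assumed verification rule)."""
    return {pat: n for pat, n in candidates.items() if n >= min_count}

def generalize(basic, min_prob=0.8, min_args=2):
    """Replace the argument with a wildcard when the predicate translation is
    reliable and the predicate co-occurs with enough distinct arguments."""
    by_pred = defaultdict(Counter)   # source predicate -> target predicate counts
    args = defaultdict(set)          # (source pred, target pred) -> source arguments
    for ((sp, sa), (tp, _ta)), n in basic.items():
        by_pred[sp][tp] += n
        args[(sp, tp)].add(sa)

    result = {}
    for ((sp, sa), (tp, ta)), n in basic.items():
        prob = by_pred[sp][tp] / sum(by_pred[sp].values())
        if prob >= min_prob and len(args[(sp, tp)]) >= min_args:
            key = ((sp, "*"), (tp, "*"))          # generalized pattern
            result[key] = result.get(key, 0) + n
        else:
            result[((sp, sa), (tp, ta))] = n      # stay lexicalized
    return result

basic_patterns = verify(count_candidates(aligned_pairs))
general_patterns = generalize(basic_patterns)
```

In this toy data, "open" always maps to the same target predicate across two arguments, so it is generalized, while "take" has no dominant translation and its patterns stay tied to specific arguments.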
Abstract:
The present disclosure relates to an apparatus and a method for recognizing an idiomatic expression using phrase alignment of a bilingual parallel corpus. More particularly, the apparatus and method extract candidate idiomatic expressions using phrase alignment information of the bilingual parallel corpus and measure an idiomatic expression index for every extracted candidate in order to recognize the candidate as an idiomatic expression, thereby resolving errors in measuring the translational entropy of a word and in extracting the representative translated word of the word, and improving the accuracy of idiomatic expression recognition.
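The abstract does not define the idiomatic expression index, so the sketch below covers only the two quantities it names: the translational entropy of a word and its representative translated word. It assumes the usual Shannon entropy over the empirical distribution of aligned translations; the toy alignments (with romanized placeholder target words) and the function names are hypothetical.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy word alignments: (source word, aligned target word).
alignments = (
    [("bank", "eunhaeng")] * 2
    + [("bank", "gangdug")] * 2
    + [("water", "mul")] * 4
)

def translation_counts(pairs):
    """Group aligned target words by source word."""
    table = defaultdict(Counter)
    for src, tgt in pairs:
        table[src][tgt] += 1
    return table

def translational_entropy(counts):
    """Shannon entropy (in bits) of the translation distribution of one word."""
    total = sum(counts.values())
    return -sum((n / total) * math.log2(n / total) for n in counts.values())

def representative_translation(counts):
    """Most frequent aligned translation of one word."""
    return counts.most_common(1)[0][0]

table = translation_counts(alignments)
entropy_bank = translational_entropy(table["bank"])
entropy_water = translational_entropy(table["water"])
rep_water = representative_translation(table["water"])
```

A word that is translated consistently has low entropy, while a word whose translations are spread over several targets has high entropy.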
Abstract:
The present invention relates to statistical machine translation (SMT), and provides a machine translation device and a machine translation method that extract syntax conversion knowledge and word translation knowledge from a parallel corpus together with their respective conversion probabilities, acquire a generation probability for the target language from a monolingual corpus, build a weighted translation model by learning each item of conversion knowledge and each probability with a translation model learning device, and generate a target sentence by applying the translation model to a source sentence input in real time through the decoding processes of a syntax converter and a word translator, thereby resolving disadvantages of existing phrase-based SMT and syntax-based SMT and combining their advantages.
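One common way to realize a weighted translation model in SMT is a log-linear combination of component probabilities. The sketch below assumes that form, which the abstract does not confirm. The feature names, candidate labels, probabilities, and weights are all hypothetical toy values.

```python
import math

def loglinear_score(features, weights):
    """Weighted sum of log-probabilities for one translation candidate."""
    return sum(weights[name] * math.log(features[name]) for name in weights)

def best_candidate(candidates, weights):
    """Pick the candidate with the highest log-linear score."""
    return max(candidates, key=lambda c: loglinear_score(candidates[c], weights))

# Hypothetical candidates with three assumed component probabilities:
# syntax conversion, word translation, and target-language generation.
candidates = {
    "candidate_a": {"syntax": 0.5, "word": 0.4, "lm": 0.1},
    "candidate_b": {"syntax": 0.3, "word": 0.3, "lm": 0.5},
}

equal_weights = {"syntax": 1.0, "word": 1.0, "lm": 1.0}
no_lm_weights = {"syntax": 1.0, "word": 1.0, "lm": 0.0}

choice_equal = best_candidate(candidates, equal_weights)
choice_no_lm = best_candidate(candidates, no_lm_weights)
```

Changing the weights changes which candidate wins, which is why such weights are typically learned rather than fixed by hand.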