摘要:
Disclosure relates to a statistical machine translation field, and more particularly to a method and a device for filtering a translation rule and generating a target word in a hierarchical phrase-based statistical machine translation. The method and device filters a translation rule using a relaxed-well-formed dependency structure and generates a target word by referring to a head word of a source word in a hierarchical phrase-based statistical machine translation. The disclosure improves a translation performance while reducing a number of translation rules, in comparison with a hierarchical phrase-based original translation rule table.
摘要:
Disclosed are a joint decoding apparatus and a joint decoding method that joins a tokenization process and a translation process. More particularly, the present disclosure can generate all available candidate tokens, reduce translation errors, and obtain an optimal translation result by jointly conducting a decoding by simultaneously conducting the tokenization process and the translation process for an input character sequence while decoding.
摘要:
Disclosed are a joint decoding apparatus and a joint decoding method that joins a tokenization process and a translation process. More particularly, the present disclosure can generate all available candidate tokens, reduce translation errors, and obtain an optimal translation result by jointly conducting a decoding by simultaneously conducting the tokenization process and the translation process for an input character sequence while decoding.
摘要:
The present invention relates to the use of a plurality of dependency trees in tree-based statistical machine translation and proposes a dependency forest to effectively process the plurality of dependency trees. The present invention can improve a translation capability by generating a translation rule and a dependency language model by using the dependency forest and applying the generated translation rule and dependency language model when a source language text is converted to a target language text.
摘要:
Disclosed all an apparatus and a method for constructing a verb phrase translation pattern using a bilingual parallel corpus which recognizes a predicate and an argument using a syntax analysis result and a word alignment result of a source sentence from a plurality of bilingual parallel corpus, extracts a translation pattern candidate and an occurrence frequency using the recognized predicate and argument, and then generates a basic verb phrase translation pattern by verifying the translation pattern candidate, and generalizes the generated basic verb phrase translation pattern to generate a general verb phrase pattern so as to be applied to various language pairs and minimize an error in the verb phrase translation pattern and determine an appropriate generalization level using a co-occurrence frequency of a predicate and an argument of the verb phrase translation pattern and a translation probability of the predicate.
摘要:
The present disclosure relates to an apparatus and a method for recognizing an idiomatic expression using phrase alignment of a bilingual parallel corpus, and more particularly, to an apparatus and a method for recognizing an idiomatic expression using phrase alignment of a parallel corpus which extract a candidate idiomatic expression using phrase alignment information of a bilingual parallel corpus and measure an idiomatic expression index for every extracted candidate idiomatic expression to recognize the candidate idiomatic expression as an idiomatic expression to resolve errors in measuring a translational entropy of a word and extracting a representative translated word of the word and improve the accuracy of the idiomatic expression recognition.
摘要:
The present invention relates to statistical machine translation, and provides a machine translation device and a machine translation method that acquire a creation probability for a target language from a single corpus while extracting respective conversion probabilities by extracting syntax conversion knowledge and word translation knowledge from a parallel corpus, model a weighted translation model by allowing each of the conversion knowledge and each of the probabilities to learn using a translation model learning device, and generate a target sentence through decoding processes of a syntax converter and a word translator by applying the translation model to a source sentence input in real time, thereby resolving disadvantages of the existing phrase-based SMT and syntax-based SMT and combining advantages thereof.
摘要:
The present invention relates to statistical machine translation, and provides a machine translation device and a machine translation method that acquire a creation probability for a target language from a single corpus while extracting respective conversion probabilities by extracting syntax conversion knowledge and word translation knowledge from a parallel corpus, model a weighted translation model by allowing each of the conversion knowledge and each of the probabilities to learn using a translation model learning device, and generate a target sentence through decoding processes of a syntax converter and a word translator by applying the translation model to a source sentence input in real time, thereby resolving disadvantages of the existing phrase-based SMT and syntax-based SMT and combining advantages thereof.
摘要:
An apparatus and a method for constructing a verb phrase translation pattern using a bilingual parallel corpus. Each of the apparatus or method recognizes a predicate and an argument using a syntax analysis result and a word alignment result of a source sentence from a plurality of bilingual parallel corpus, extracts a translation pattern candidate and an occurrence frequency using the recognized predicate and argument, and then generates a basic verb phrase translation pattern by verifying the translation pattern candidate, and generalizes the generated basic verb phrase translation pattern to generate a general verb phrase pattern so as to be applied to various language pairs and minimize an error in the verb phrase translation pattern and determine an appropriate generalization level using a co-occurrence frequency of a predicate and an argument of the verb phrase translation pattern and a translation probability of the predicate.
摘要:
Provided are a device and method for interactive machine translation. The device includes a machine translation engine having a morphological/syntactic analyzer for analyzing morphemes and sentences of an original text and generating original text analysis information, and a translation generator for generating a translation and translation generation information on the basis of the original text analysis information, and a user interface module for displaying sentence structures of the original text and the translation, and a relationship between the original text and the translation to a user on the basis of the original text analysis information and the translation generation information, and for receiving corrections to the original text or the translation from the user. The device and method provide a user interface whereby the user can effectively recognize and correct a mistranslated part and a cause of the mistranslation, and rapidly provides a re-translated result according to the correction. Thus, it is possible to perform high-quality translation which satisfies the user.