摘要:
The present invention relates to a method and apparatus for constructing translation knowledge to be used in a translator. According to the invention, a source-language sentence and a target-language sentence are converted by receiving the source-language sentence and the target-language sentence corresponding to the source-language sentence and attaching a prototype, a part-of-speech, relative position information, and syntactic information in a base phrase to each morpheme of the source-language sentence and the target-language sentence. Then, word alignment and syntactic alignment are performed in the converted source-language sentence and target-language sentence, thereby extracting translation knowledge on words and syntaxes, translation knowledge on a subcategory of a bilingual inflected-word, and translation knowledge on a bilingual sentence pattern based on the results of the word and syntactic alignment.
摘要:
The present invention relates to a method and apparatus for constructing translation knowledge to be used in a translator. According to the invention, a source-language sentence and a target-language sentence are converted by receiving the source-language sentence and the target-language sentence corresponding to the source-language sentence and attaching a prototype, a part-of-speech, relative position information, and syntactic information in a base phrase to each morpheme of the source-language sentence and the target-language sentence. Then, word alignment and syntactic alignment are performed in the converted source-language sentence and target-language sentence, thereby extracting translation knowledge on words and syntaxes, translation knowledge on a subcategory of a bilingual inflected-word, and translation knowledge on a bilingual sentence pattern based on the results of the word and syntactic alignment.
摘要:
Disclosed is a machine translation method for a PDF file. A machine translation device extracts source language text and non-text from the input source language PDF file through image transformation, corrects the extracted source language text by using the source language text extracted from text information, restores a part that is contextually separated by the non-text from among the extracted source language text, generates a source language XML/HTML file by rearranging the extracted text and non-text so as to satisfy the contextual flow of the source language PDF file, separates source language text from a tag of the source language XML/HTML file, generates target language text by using translation knowledge and a transformation engine specified for the technical field corresponding to the source language PDF file, inserts the translated target language text other than source language text into XML/HTML file, and transforms the generated target language XML/HTML file into a target language PDF file to be output.
摘要:
Disclosed is a machine translation method for a PDF file. A machine translation device extracts source language text and non-text from the input source language PDF file through image transformation, corrects the extracted source language text by using the source language text extracted from text information, restores a part that is contextually separated by the non-text from among the extracted source language text, generates a source language XML/HTML file by rearranging the extracted text and non-text so as to satisfy the contextual flow of the source language PDF file, separates source language text from a tag of the source language XML/HTML file, generates target language text by using translation knowledge and a transformation engine specified for the technical field corresponding to the source language PDF file, inserts the translated target language text other than source language text into XML/HTML file, and transforms the generated target language XML/HTML file into a target language PDF file to be output.
摘要:
A user-interactive automatic translation device for a mobile device, includes: a camera image controller for converting an image captured by a camera into a digital image; an image character recognition controller for user-interactively selecting a character string region to be translated from the digital image, performing a character recognition function on the selected character string region based on an optical character reader (OCR) function and character recognition information to generate a text string; and user-interactively correcting errors included in the text string. Further, the user-interactive automatic translation device includes a text transmission controller for transmitting the error-corrected text string; and an automatic translation controller for receiving the error-corrected text string from the text transmission controller, performing a morpheme analysis, tagging, a structure analysis, a structure conversion, and a vocabulary conversion on the text string to generate a translation of the text string based on a grammar of a target language.
摘要:
A user-interactive automatic translation device for a mobile device, includes: a camera image controller for converting an image captured by a camera into a digital image; an image character recognition controller for user-interactively selecting a character string region to be translated from the digital image, performing a character recognition function on the selected character string region based on an optical character reader (OCR) function and character recognition information to generate a text string; and user-interactively correcting errors included in the text string. Further, the user-interactive automatic translation device includes a text transmission controller for transmitting the error-corrected text string; and an automatic translation controller for receiving the error-corrected text string from the text transmission controller, performing a morpheme analysis, tagging, a structure analysis, a structure conversion, and a vocabulary conversion on the text string to generate a translation of the text string based on a grammar of a target language.
摘要:
The method for retrieving a similar sentence to a source sentence inputted by a user through a translation memory in a translation aid system is provided. An inverted file of an index word and a translation memory from a parallel corpus are constituted. Candidate sentences having a high similarity are filtered by comparing the source sentence provided by the user with sentences of the constituted translation memory. A source sentence and a corresponding target sentence are outputted in the order of similarity by calculating similarity between the filtered candidate sentences and the source sentence.