Granted invention patent
- Patent title: Machine translation method for PDF file
- Patent title (Chinese): PDF文件的机器翻译方法
- Application number: US12155131
- Filing date: 2008-05-29
- Publication (announcement) number: US08108202B2
- Publication (announcement) date: 2012-01-31
- Inventors: Oh Woog Kwon, Sung Kwon Choi, Ki Young Lee, Yoon-Hyung Roh, Young Kil Kim, Chang Hyun Kim, Young-Ae Seo, Seong Il Yang, Young-Sook Hwang, Chang-Hao Yin, Eun jin Park
- Applicants: Oh Woog Kwon, Sung Kwon Choi, Ki Young Lee, Yoon-Hyung Roh, Young Kil Kim, Chang Hyun Kim, Young-Ae Seo, Seong Il Yang, Young-Sook Hwang, Chang-Hao Yin, Eun jin Park
- Applicant address: KR Daejeon
- Patentee: Electronics and Telecommunications Research Institute
- Current patentee: Electronics and Telecommunications Research Institute
- Current patentee address: KR Daejeon
- Agency: Staas & Halsey LLP
- Priority: KR10-2007-0075581 (2007-07-27)
- Main classification: G06F17/28
- IPC classification: G06F17/28
Abstract:
Disclosed is a machine translation method for a PDF file. A machine translation device extracts source language text and non-text from the input source language PDF file through image transformation, then corrects the extracted source language text by using the source language text extracted from the file's text information. It restores any part of the extracted source language text that is contextually separated by the non-text, and generates a source language XML/HTML file by rearranging the extracted text and non-text so as to satisfy the contextual flow of the source language PDF file. The device then separates the source language text from the tags of the source language XML/HTML file and generates target language text by using translation knowledge and a transformation engine specified for the technical field corresponding to the source language PDF file. Finally, it inserts the translated target language text into the XML/HTML file in place of the source language text, and transforms the resulting target language XML/HTML file into a target language PDF file to be output.
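The tag-separation and reinsertion steps in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the function names `translate_markup` and `toy_translate`, the English-to-French glossary, and the simple regular expression used to find tags are all assumptions made for illustration, and the regular expression does not handle comments, scripts, or attribute values containing `>`.

```python
import re
from typing import Callable

# Naive tag matcher (assumption): anything between '<' and the next '>'.
TAG_PATTERN = re.compile(r"(<[^>]+>)")


def translate_markup(markup: str, translate: Callable[[str], str]) -> str:
    """Translate only the text between tags, leaving every tag untouched."""
    out = []
    # Splitting on a capturing group keeps the tags in the result list.
    for piece in TAG_PATTERN.split(markup):
        core = piece.strip()
        if TAG_PATTERN.fullmatch(piece) or not core:
            out.append(piece)  # a tag, or whitespace-only text: copy as-is
            continue
        # Preserve the whitespace that surrounded the text segment.
        lead = piece[: len(piece) - len(piece.lstrip())]
        trail = piece[len(piece.rstrip()):]
        out.append(lead + translate(core) + trail)
    return "".join(out)


# Hypothetical stand-in for the domain-specific translation engine;
# the language pair is arbitrary and chosen only for the demo.
TOY_GLOSSARY = {"Hello": "Bonjour", "World": "Monde"}


def toy_translate(text: str) -> str:
    return " ".join(TOY_GLOSSARY.get(word, word) for word in text.split())


source = "<p>Hello <b>World</b></p>"
print(translate_markup(source, toy_translate))
```

Because the tags pass through unchanged, the layout information they carry survives translation, which is what allows the target language XML/HTML file to be converted back into a PDF with the same structure.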
Publication/grant documents
- US20090030671A1 Machine translation method for PDF file, publication/grant date: 2009-01-29