发明申请
US20060288281A1 Method of determining unicode values corresponding to the text in digital documents
有权
确定与数字文档中的文本对应的unicode值的方法
- 专利标题: Method of determining unicode values corresponding to the text in digital documents
- 专利标题(中): 确定与数字文档中的文本对应的unicode值的方法
-
申请号: US11447826申请日: 2006-06-06
-
公开(公告)号: US20060288281A1公开(公告)日: 2006-12-21
- 发明人: Thomas Merz , Kurt Stutzer
- 申请人: Thomas Merz , Kurt Stutzer
- 优先权: EP05013373.5 20050621
- 主分类号: G06F17/00
- IPC分类号: G06F17/00
摘要:
A method of determining Unicode values corresponding to the text in digital documents includes: providing a digital document containing information related to the text in the document, the information including at least one set of data selected from the group consisting of: the numerical character code comprised by a single byte value or a sequence of multiple bytes, the glyph name corresponding to the character code for simple fonts, the code-to-Unicode mapping provided by a ToUnicode CMap, and font outline data embedded in the document; obtaining the information related to the text from the document; and determining the Unicode values corresponding to a specific code of a specific font on a per-glyph basis by executing a cascade of determination steps for each code separately, the cascade being executed in a predetermined sequence using different sources of information.
公开/授权文献
信息查询