发明公开
EP0621541A2 Method and apparatus for automatic language determination 失效
用于自动语音识别方法和装置。

  • 专利标题: Method and apparatus for automatic language determination
  • 专利标题(中): 用于自动语音识别方法和装置。
  • 申请号: EP94302734.2
    申请日: 1994-04-18
  • 公开(公告)号: EP0621541A2
    公开(公告)日: 1994-10-26
  • 发明人: Spitz, A. Lawrence
  • 申请人: XEROX CORPORATION
  • 申请人地址: Xerox Square Rochester New York 14644 US
  • 专利权人: XEROX CORPORATION
  • 当前专利权人: XEROX CORPORATION
  • 当前专利权人地址: Xerox Square Rochester New York 14644 US
  • 代理机构: Johnson, Reginald George
  • 优先权: US47673 19930419
  • 主分类号: G06F15/20
  • IPC分类号: G06F15/20 G06K9/72
Method and apparatus for automatic language determination
摘要:
An automatic language determining apparatus automatically determines the particular Asian language of the text image of a document when the gross script-type is known to be, or is determined to be, an Asian script-type. A connected component generating means (28) generates connected components from the pixels comprising the text image. A character cell generating means generates a character cell surrounding at least one connected component. An optical density determining means determines the optical density, in absolute numbers or percentage of pixels, of the pixels within each character cell. A script feature determining means first generates a histogram, then converts, by linear discriminate analysis, the histogram to a point in a new coordinate space. A language determining means (36) compares the determined point of the text portion in the new coordinate space to predetermined regimes in the new coordinate space corresponding to at least one Asian language to determine the particular Asian language of the text image.
公开/授权文献
信息查询
0/0