发明授权
US07552051B2 Method and apparatus for mapping multiword expressions to identifiers using finite-state networks
有权
使用有限状态网络将多字表达式映射到标识符的方法和装置
- 专利标题: Method and apparatus for mapping multiword expressions to identifiers using finite-state networks
- 专利标题(中): 使用有限状态网络将多字表达式映射到标识符的方法和装置
-
申请号: US10248058申请日: 2002-12-13
-
公开(公告)号: US07552051B2公开(公告)日: 2009-06-23
- 发明人: Caroline Privault , Herve Poirier
- 申请人: Caroline Privault , Herve Poirier
- 申请人地址: US CT Norwalk
- 专利权人: Xerox Corporation
- 当前专利权人: Xerox Corporation
- 当前专利权人地址: US CT Norwalk
- 代理机构: Fay Sharpe LLP
- 主分类号: G10L15/04
- IPC分类号: G10L15/04 ; G06F17/27
摘要:
Multiword expressions are mapped to identifiers using finite-state networks. Each of a plurality of multiword expressions is encoded into a regular expression. Each regular expression encodes a base form common to a plurality of derivative forms defined by ones of the multiword expressions. Each of the plurality of regular expressions is compiled with factorization into a set of finite-state networks. A union of the finite-state networks in the set of finite-state networks is performed to define a multiword finite-state network and a set of subnets. The multiword finite-state network and the set of subnets are traversed to identify a path corresponding to one of the plurality of multiword expressions, wherein only transitions originating from the multiword finite-state network are accounted for to ascertain a path number identifying a base form of the one of the plurality of multiword expressions.
公开/授权文献
信息查询