Granted invention patent
US07966174B1 Automatic clustering of tokens from a corpus for grammar acquisition (Status: in force)

Automatic clustering of tokens from a corpus for grammar acquisition
Abstract:
A system for recognizing patterns is disclosed. Grammar learning from a corpus includes generating a frequency vector for each non-context token in the corpus, based on counted occurrences of a predetermined relationship of the non-context tokens to identified context tokens. Clusters are grown from the frequency vectors according to a lexical correlation or a cluster tree among the non-context tokens. The cluster tree is used for pattern recognition.
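
The two steps named in the abstract (frequency vectors, then cluster growth) can be illustrated with a minimal Python sketch. It is not the patented method. It makes several assumptions that the abstract does not state: the context tokens are supplied by the caller, the "predetermined relationship" is taken to be immediate left or right adjacency, cosine similarity stands in for the lexical correlation, and the cluster tree is built by greedy pairwise merging. All function names and the toy corpus are hypothetical.

```python
from collections import Counter
from itertools import combinations
from math import sqrt


def frequency_vectors(sentences, context_tokens, offsets=(-1, 1)):
    """For each non-context token, count how often each context token
    occurs at each relative offset (assumed relationship: adjacency)."""
    vectors = {}
    for sent in sentences:
        for i, tok in enumerate(sent):
            if tok in context_tokens:
                continue
            vec = vectors.setdefault(tok, Counter())
            for off in offsets:
                j = i + off
                if 0 <= j < len(sent) and sent[j] in context_tokens:
                    vec[(sent[j], off)] += 1
    return vectors


def cosine(a, b):
    """Cosine similarity of two sparse count vectors (assumed stand-in
    for the lexical correlation mentioned in the abstract)."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    na = sqrt(sum(v * v for v in a.values()))
    nb = sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def cluster_tree(vectors):
    """Greedily merge the two most similar clusters until one remains.
    Returns a nested tuple whose leaves are tokens, or None if empty."""
    if not vectors:
        return None
    nodes = {tok: Counter(vec) for tok, vec in vectors.items()}
    while len(nodes) > 1:
        # Sort by repr so ties are broken deterministically.
        keys = sorted(nodes, key=repr)
        a, b = max(combinations(keys, 2),
                   key=lambda p: cosine(nodes[p[0]], nodes[p[1]]))
        # A merged cluster's vector is the sum of its members' counts.
        nodes[(a, b)] = nodes.pop(a) + nodes.pop(b)
    return next(iter(nodes))


# Toy corpus (hypothetical data, for illustration only).
sentences = [
    ["the", "cat", "sat", "on", "the", "mat"],
    ["the", "dog", "sat", "on", "the", "rug"],
]
context = {"the", "on"}
vectors = frequency_vectors(sentences, context)
tree = cluster_tree(vectors)
```

In this toy corpus, "cat", "dog", "mat" and "rug" all follow "the" and so have identical vectors, while "sat" only precedes "on" and joins the tree last. A real system would need a principled way to choose context tokens and a stopping criterion for merging, neither of which is described in the abstract.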