Invention Application
US20050060150A1 Unsupervised training for overlapping ambiguity resolution in word segmentation 审中-公开
用于重叠模糊度分辨率的无监督训练

Unsupervised training for overlapping ambiguity resolution in word segmentation
Abstract:
A method for resolving overlapping ambiguity strings in unsegmented languages such as Chinese. The methodology includes segmenting sentences into two possible segmentations and recognizing overlapping ambiguity strings in the sentences. One of the two possible segmentations is selected as a function of probability information. The probability information is derived from unsupervised training data. A method of constructing a knowledge base containing probability information needed to select one of the segmentation is also provided.
Information query
Patent Agency Ranking
0/0