Invention Grant
- Patent Title: Sequence classification for machine translation
- Patent Title (Chinese): 机器翻译序列分类 (Sequence classification for machine translation)
- Application No.: US11647080
- Application Date: 2006-12-28
- Publication No.: US07783473B2
- Publication Date: 2010-08-24
- Inventor: Srinivas Bangalore, Patrick Haffner, Stephan Kanthak
- Applicant: Srinivas Bangalore, Patrick Haffner, Stephan Kanthak
- Applicant Address: US NV Reno
- Assignee: AT&T Intellectual Property II, L.P.
- Current Assignee: AT&T Intellectual Property II, L.P.
- Current Assignee Address: US NV Reno
- Agent: Ronald D. Slusky
- Main IPC: G06F17/28
- IPC: G06F17/28; G10L21/00

Abstract:
Classification of sequences, such as the translation of natural language sentences, is carried out using an independence assumption. The independence assumption is an assumption that the probability of a correct translation of a source sentence word into a particular target sentence word is independent of the translation of other words in the sentence. Although this assumption is not a correct one, a high level of word translation accuracy is nonetheless achieved. In particular, discriminative training is used to develop models for each target vocabulary word based on a set of features of the corresponding source word in training sentences, with at least one of those features relating to the context of the source word. Each model comprises a weight vector for the corresponding target vocabulary word. The weights comprising the vectors are associated with respective ones of the features; each weight is a measure of the extent to which the presence of that feature for the source word makes it more probable that the target word in question is the correct one.
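The scoring scheme described in the abstract can be illustrated with a minimal sketch. It assumes binary features (the source word plus its left and right neighbours as context), one weight vector per target vocabulary word, and a softmax over the summed weights to turn scores into probabilities. The feature templates, the softmax normalization, the function names, and the hand-set toy weights are all assumptions made for illustration; the abstract specifies none of them, and the discriminative training step that would normally learn the weights is not shown.

```python
import math

def extract_features(source_words, i):
    """Binary features for the source word at position i.

    Hypothetical feature templates: the word itself plus its immediate
    left and right neighbours, which supply the context.
    """
    prev_word = source_words[i - 1] if i > 0 else "<s>"
    next_word = source_words[i + 1] if i + 1 < len(source_words) else "</s>"
    return {
        f"word={source_words[i]}",
        f"prev={prev_word}",
        f"next={next_word}",
    }

def score(weights, target_word, features):
    """Sum the target word's weights over the features that are present."""
    return sum(weights[target_word].get(f, 0.0) for f in features)

def translate_word(weights, source_words, i):
    """Pick the most probable target word for position i.

    The choice depends only on features of the source sentence, never on
    the target words chosen at other positions (independence assumption).
    """
    features = extract_features(source_words, i)
    scores = {t: score(weights, t, features) for t in weights}
    # Softmax normalization (an assumption of this sketch).
    top = max(scores.values())
    exps = {t: math.exp(s - top) for t, s in scores.items()}
    total = sum(exps.values())
    probs = {t: e / total for t, e in exps.items()}
    best = max(probs, key=probs.get)
    return best, probs

def translate_sentence(weights, source_words):
    """Translate each source word independently of the others."""
    return [translate_word(weights, source_words, i)[0]
            for i in range(len(source_words))]

# Hand-set toy weights, one vector per target vocabulary word.
# In the described approach these would come from discriminative training.
WEIGHTS = {
    "the":     {"word=el": 3.0},
    "bank":    {"word=banco": 1.0, "next=central": 2.0},
    "bench":   {"word=banco": 1.0, "prev=el": 0.5},
    "central": {"word=central": 3.0},
}

print(translate_sentence(WEIGHTS, ["el", "banco", "central"]))
print(translate_sentence(WEIGHTS, ["el", "banco"]))
```

With these toy weights the context feature `next=central` pushes "banco" toward "bank", while in the shorter sentence the score for "bench" is higher. Word reordering in the target sentence is outside the scope of this sketch.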
Public/Granted literature
- US20080162111A1: Sequence classification for machine translation (Public/Granted day: 2008-07-03)