Method and apparatus for distribution-based language model adaptation

Patent application

US20060009965A1 Method and apparatus for distribution-based language model adaptation (in force)

Patent title: Method and apparatus for distribution-based language model adaptation
Application number: US11225543

Filing date: 2005-09-13
Publication number: US20060009965A1

Publication date: 2006-01-12
Inventors: Jianfeng Gao, Mingjing Li
Applicants: Jianfeng Gao, Mingjing Li
Applicant address: US WA Redmond
Patentee: Microsoft Corporation
Current patentee: Microsoft Corporation
Current patentee address: US WA Redmond
Main classification: G06F17/27
IPC classification: G06F17/27

Abstract:

A method and apparatus are provided for adapting a language model to a task-specific domain. Under the method and apparatus, the relative frequency of n-grams in a small training set (i.e., a task-specific training data set) and the relative frequency of n-grams in a large training set (i.e., an out-of-domain training data set) are used to weight a distribution count of n-grams in the large training set. The weighted distributions are then used to form a modified language model by identifying probabilities for n-grams from the weighted distributions.
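
The abstract does not state the weighting formula, so the following Python sketch is only one possible reading of it, not the patented method. It assumes the weight for each n-gram is the ratio of its relative frequency in the small set to its relative frequency in the large set, with a small floor so that n-grams unseen in the small set keep a nonzero count. The function names (`adapt_counts`, `conditional_probs`), the `floor` parameter, and the unsmoothed maximum-likelihood estimate are all hypothetical choices made for illustration.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return the list of n-grams (as tuples) in a token sequence."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def relative_freq(counts):
    """Convert raw n-gram counts to relative frequencies."""
    total = sum(counts.values())
    return {gram: c / total for gram, c in counts.items()}


def adapt_counts(small_tokens, large_tokens, n=2, floor=1e-3):
    """Weight large-set n-gram counts by a relative-frequency ratio.

    ASSUMPTION: weight = rel_freq_small / rel_freq_large, floored at `floor`.
    The abstract only says both relative frequencies are used for weighting.
    """
    small = Counter(ngrams(small_tokens, n))
    large = Counter(ngrams(large_tokens, n))
    rf_small = relative_freq(small)
    rf_large = relative_freq(large)
    weighted = {}
    for gram, count in large.items():
        ratio = rf_small.get(gram, 0.0) / rf_large[gram]
        weighted[gram] = count * max(ratio, floor)
    return weighted


def conditional_probs(weighted):
    """Estimate P(last word | history) from weighted counts (no smoothing)."""
    history_totals = Counter()
    for gram, w in weighted.items():
        history_totals[gram[:-1]] += w
    return {gram: w / history_totals[gram[:-1]] for gram, w in weighted.items()}


if __name__ == "__main__":
    small = "a b a b".split()          # stand-in for task-specific data
    large = "a b a c a b a c".split()  # stand-in for out-of-domain data
    probs = conditional_probs(adapt_counts(small, large, n=2))
    for gram, p in sorted(probs.items()):
        print(gram, round(p, 4))
```

In this toy input, the bigram ("a", "b") occurs in both sets while ("a", "c") occurs only in the large set, so the weighted counts shift probability mass toward ("a", "b"). A practical system would also need smoothing and backoff, which are omitted here.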

Published/granted documents

US07254529B2 Method and apparatus for distribution-based language model adaptation. Publication/grant date: 2007-08-07
