发明授权
- 专利标题: Adaptive construction of a statistical language model
- 专利标题(中): 统计语言模型的自适应构建
-
申请号: US12684749申请日: 2010-01-08
-
公开(公告)号: US08577670B2公开(公告)日: 2013-11-05
- 发明人: Kuansan Wang , Xiaolong Li , Jiangbo Miao , Frederic H. Behr, Jr.
- 申请人: Kuansan Wang , Xiaolong Li , Jiangbo Miao , Frederic H. Behr, Jr.
- 申请人地址: US WA Redmond
- 专利权人: Microsoft Corporation
- 当前专利权人: Microsoft Corporation
- 当前专利权人地址: US WA Redmond
- 主分类号: G06F17/27
- IPC分类号: G06F17/27
摘要:
A statistical language model (SLM) may be iteratively refined by considering N-gram counts in new data, and blending the information contained in the new data with the existing SLM. A first group of documents is evaluated to determine the probabilities associated with the different N-grams observed in the documents. An SLM is constructed based on these probabilities. A second group of documents is then evaluated to determine the probabilities associated with each N-gram in that second group. The existing SLM is then evaluated to determine how well it explains the probabilities in the second group of documents, and a weighting parameter is calculated from that evaluation. Using the weighting parameter, a new SLM is then constructed as a weighted average of the existing SLM and the new probabilities.
公开/授权文献
- US20110172988A1 ADAPTIVE CONSTRUCTION OF A STATISTICAL LANGUAGE MODEL 公开/授权日:2011-07-14
信息查询