Granted invention patent
US08175878B1 Representing n-gram language models for compact storage and fast retrieval (in force)

Abstract:
Systems, methods, and apparatuses, including computer program products, are provided for representing language models. In some implementations, a computer-implemented method is provided. The method includes generating a compact language model, which includes receiving a collection of n-grams from a corpus, each n-gram of the collection having a corresponding first probability of occurring in the corpus, and generating a trie representing the collection of n-grams. The method also includes using the language model to identify a second probability of a particular string of words occurring.
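The two steps named in the abstract (building a trie from n-grams with probabilities, then using it to score a string of words) can be illustrated with a minimal Python sketch. This is not the patented encoding: it uses a plain dictionary-based trie and makes no attempt at compact storage. The names `NGramTrie`, `build_trie`, and `string_logprob` are hypothetical, as are the sample probabilities. The sketch also assumes that each stored value is the conditional probability of an n-gram's last word given its preceding words, and that unseen contexts simply fall back to a shorter n-gram without backoff weights.

```python
import math


class NGramTrie:
    """Dictionary-based trie; a node holds a log-probability if its path is a stored n-gram."""

    def __init__(self):
        self.children = {}    # word -> child NGramTrie
        self.logprob = None   # None means "no n-gram ends here"

    def insert(self, ngram, prob):
        """Store one n-gram (a sequence of words) with its probability."""
        node = self
        for word in ngram:
            node = node.children.setdefault(word, NGramTrie())
        node.logprob = math.log(prob)

    def lookup(self, ngram):
        """Return the stored log-probability, or None if the n-gram is absent."""
        node = self
        for word in ngram:
            node = node.children.get(word)
            if node is None:
                return None
        return node.logprob


def build_trie(ngram_probs):
    """Build a trie from a mapping of n-gram tuples to probabilities."""
    trie = NGramTrie()
    for ngram, prob in ngram_probs.items():
        trie.insert(ngram, prob)
    return trie


def string_logprob(trie, words, order=3, unknown_logprob=math.log(1e-6)):
    """Score a word string by summing per-word log-probabilities.

    Assumption: for each word, the longest stored n-gram ending at that word
    is used; shorter contexts are tried next, with no backoff weights.
    """
    total = 0.0
    for i in range(len(words)):
        for start in range(max(0, i - order + 1), i + 1):
            logprob = trie.lookup(tuple(words[start:i + 1]))
            if logprob is not None:
                total += logprob
                break
        else:
            # No n-gram ending at this word is stored at any length.
            total += unknown_logprob
    return total


# Hypothetical sample data, for illustration only.
sample = {("the",): 0.5, ("cat",): 0.25, ("the", "cat"): 0.5}
trie = build_trie(sample)
score = string_logprob(trie, ["the", "cat"], order=2)
print(math.exp(score))
```

In this sketch "the" is scored with its unigram entry and "cat" with the stored bigram ("the", "cat"), so the string probability is the product of those two values. A production language model would normally add backoff weights and a quantized, array-based trie layout, neither of which is shown here.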