发明授权
US08001128B2 Selection of a set of optimal n-grams for indexing string data in a DBMS system under space constraints introduced by the system 失效
在系统引入的空间约束下,选择一组用于在DBMS系统中索引字符串数据的最佳n-gram

Selection of a set of optimal n-grams for indexing string data in a DBMS system under space constraints introduced by the system
摘要:
The present invention provides a computer-readable medium and system for selecting a set of n-grams for indexing string data in a DBMS system. Aspects of the invention include providing a set of candidate n-grams, each n-gram comprising a sequence of characters; identifying sample queries having character strings containing the candidate n-grams; and based on the set of candidate n-grams, the sample queries, database records, and an n-gram space constraint, automatically selecting, given the space constraint, a minimal set of an n-grams from the set of candidate n-grams that minimizes the number of false hits for the set of sample queries had the sample queries been executed against the database records.
信息查询
0/0