-
Publication No.: US20130060780A1
Publication Date: 2013-03-07
Application No.: US13224327
Filing Date: 2011-09-02
Applicant: Tirthankar Lahiri, Chi-Kim Hoang, Dina Thomas, Kirk Meredith Edson, Subhradyuti Sarkar, Mark McAuliffe, Marie-Anne Neimat, Chih-Ping Wang
Inventor: Tirthankar Lahiri, Chi-Kim Hoang, Dina Thomas, Kirk Meredith Edson, Subhradyuti Sarkar, Mark McAuliffe, Marie-Anne Neimat, Chih-Ping Wang
IPC Classification: G06F17/30
Abstract: In column domain dictionary compression, column values in one or more columns are tokenized by a single dictionary. The domain of the dictionary is the entire set of columns. A dictionary may not only map a token to a tokenized value, but also to a count (“token count”) of the number of occurrences of the token and corresponding tokenized value in the dictionary's domain. Such information may be used to compute queries on the base table.
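The idea in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the patented implementation: the class name `ColumnDomainDictionary`, the helper `compress`, the integer token scheme, and the sample table are all hypothetical. It shows one dictionary shared by two columns, with a per-token count that answers an occurrence query without scanning the table.

```python
class ColumnDomainDictionary:
    """Hypothetical sketch: one dictionary shared by every column in its domain."""

    def __init__(self):
        self.token_of = {}   # tokenized value -> token
        self.value_of = []   # token -> tokenized value (token is the list index)
        self.count_of = []   # token -> occurrences across all domain columns

    def encode(self, value):
        """Return the token for a value, creating it if needed, and count the occurrence."""
        token = self.token_of.get(value)
        if token is None:
            token = len(self.value_of)
            self.token_of[value] = token
            self.value_of.append(value)
            self.count_of.append(0)
        self.count_of[token] += 1
        return token

    def decode(self, token):
        """Map a token back to the original column value."""
        return self.value_of[token]

    def count(self, value):
        """Token count: occurrences of a value in the whole domain, read from the dictionary."""
        token = self.token_of.get(value)
        return 0 if token is None else self.count_of[token]


def compress(table, domain_columns, dictionary):
    """Replace values in the domain columns with tokens; leave other columns untouched."""
    return [
        {col: (dictionary.encode(val) if col in domain_columns else val)
         for col, val in row.items()}
        for row in table
    ]


# Made-up sample data: two columns drawing on the same set of city names.
rows = [
    {"id": 1, "ship_city": "Austin", "bill_city": "Boston"},
    {"id": 2, "ship_city": "Boston", "bill_city": "Boston"},
    {"id": 3, "ship_city": "Austin", "bill_city": "Denver"},
]
dictionary = ColumnDomainDictionary()
compressed = compress(rows, {"ship_city", "bill_city"}, dictionary)

# Answered from the dictionary alone, without rereading the table.
boston_occurrences = dictionary.count("Boston")
```

Sharing one dictionary means "Boston" is stored once regardless of which column it appears in, and the count covers both columns together. A query that needs per-column counts would still have to read the tokenized column.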
-
Publication No.: US10756759B2
Publication Date: 2020-08-25
Application No.: US13224327
Filing Date: 2011-09-02
Applicant: Tirthankar Lahiri, Chi-Kim Hoang, Dina Thomas, Kirk Meredith Edson, Subhradyuti Sarkar, Mark McAuliffe, Marie-Anne Neimat, Chih-Ping Wang
Inventor: Tirthankar Lahiri, Chi-Kim Hoang, Dina Thomas, Kirk Meredith Edson, Subhradyuti Sarkar, Mark McAuliffe, Marie-Anne Neimat, Chih-Ping Wang
Abstract: In column domain dictionary compression, column values in one or more columns are tokenized by a single dictionary. The domain of the dictionary is the entire set of columns. A dictionary may not only map a token to a tokenized value, but also to a count (“token count”) of the number of occurrences of the token and corresponding tokenized value in the dictionary's domain. Such information may be used to compute queries on the base table.
-