-
1.
Publication No.: US08452737B2
Publication date: 2013-05-28
Application No.: US13347367
Filing date: 2012-01-10
IPC classification: G06F17/30
CPC classification: G06F17/30501 , G06F17/30315 , H03M7/30 , H03M7/48
Abstract: The subject disclosure relates to column-based data encoding where raw data to be compressed is organized by columns, and then, as first and second layers of reduction of the data size, dictionary encoding and/or value encoding are applied to the data as organized by columns, to create integer sequences that correspond to the columns. Next, a hybrid greedy run-length encoding and bit packing compression algorithm further compacts the data according to an analysis of bit savings. Synergy of the hybrid data reduction techniques in concert with the column-based organization, coupled with gains in scanning and querying efficiency owing to the representation of the compact data, results in substantially improved data compression at a fraction of the cost of conventional systems.
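The layering described in this abstract (dictionary encoding of a column into integers, then a choice between run-length encoding and bit packing based on bit savings) can be illustrated with a minimal Python sketch. This is not the patented algorithm: the function names, the fixed 32-bit cost assumed for a stored run length (`count_bits`), and the per-run greedy decision are all assumptions made for illustration, and no value encoding or row reordering is attempted.

```python
def dictionary_encode(column):
    """Map each distinct value to a small integer ID in first-seen order."""
    ids = {}
    encoded = [ids.setdefault(value, len(ids)) for value in column]
    return encoded, list(ids)  # list index == ID


def bits_needed(max_id):
    """Bits required to store any ID in the range 0..max_id."""
    return max(1, max_id.bit_length())


def hybrid_encode(ids, count_bits=32):
    """Greedy pass over one column of IDs.

    A run is stored as RLE only when that costs fewer bits than bit packing
    the same values. count_bits is a hypothetical cost of one run length.
    """
    if not ids:
        return 1, []
    width = bits_needed(max(ids))
    segments, literal = [], []
    i = 0
    while i < len(ids):
        j = i
        while j < len(ids) and ids[j] == ids[i]:
            j += 1
        run = j - i
        # Bit packing costs run * width bits; RLE costs width + count_bits.
        if run * width > width + count_bits:
            if literal:
                segments.append(("packed", literal))
                literal = []
            segments.append(("rle", ids[i], run))
        else:
            literal.extend(ids[i:j])
        i = j
    if literal:
        segments.append(("packed", literal))
    return width, segments


def hybrid_decode(segments, dictionary):
    """Rebuild the original column from the segments and the dictionary."""
    out = []
    for segment in segments:
        if segment[0] == "rle":
            out.extend([dictionary[segment[1]]] * segment[2])
        else:
            out.extend(dictionary[i] for i in segment[1])
    return out


column = ["US"] * 40 + ["CA", "MX", "CA"]
ids, dictionary = dictionary_encode(column)
width, segments = hybrid_encode(ids)
assert hybrid_decode(segments, dictionary) == column
```

In this sketch the "packed" segments are kept as plain lists; a real implementation would write each ID using `width` bits.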
-
2.
Publication No.: US20090012919A1
Publication date: 2009-01-08
Application No.: US11772480
Filing date: 2007-07-02
IPC classification: G06F15/18
CPC classification: G06F17/30592
Abstract: Systems and methodologies for identification of factors that cause significant shifts in transactions in a relational store and/or OLAP environment. Transactions are grouped into significant categories defined across the whole data space, to detect interesting subspaces of transactions. Subsequently, subspaces that show strong variance between two slices can be selected, followed by grouping the subspaces in sub-reports to measure the coverage for each sub-report. A final report can then be generated that contains a list of sub-reports detected in the previous acts.
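As a rough illustration of the steps in this abstract (group transactions into subspaces, compare two slices, keep the subspaces that shift strongly, and measure how much of the overall change each one covers), the following Python sketch may help. The abstract does not define its variance or coverage measures, so the relative-change test, the `threshold` parameter, the coverage formula, and all names here are assumptions.

```python
from collections import Counter


def shifted_subspaces(slice_a, slice_b, keys, threshold=0.5):
    """Return (subspace, count_a, count_b, coverage) for strongly shifted subspaces.

    A subspace is one combination of values for the attributes in keys.
    coverage is that subspace's share of the total absolute change in counts.
    """
    count_a = Counter(tuple(t[k] for k in keys) for t in slice_a)
    count_b = Counter(tuple(t[k] for k in keys) for t in slice_b)
    subspaces = sorted(set(count_a) | set(count_b))
    total_change = sum(abs(count_b[s] - count_a[s]) for s in subspaces)

    report = []
    for s in subspaces:
        a, b = count_a[s], count_b[s]
        change = abs(b - a)
        relative = change / max(a, b)  # max(a, b) >= 1 for any seen subspace
        if relative >= threshold:
            coverage = change / total_change if total_change else 0.0
            report.append((s, a, b, coverage))
    return report


# Hypothetical data: one slice per period, one dict per transaction.
last_month = [{"region": "EU", "product": "A"}] * 10 + [{"region": "US", "product": "A"}] * 10
this_month = [{"region": "EU", "product": "A"}] * 9 + [{"region": "US", "product": "A"}] * 2

for row in shifted_subspaces(last_month, this_month, keys=("region", "product")):
    print(row)
```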
-
3.
Publication No.: US20100030796A1
Publication date: 2010-02-04
Application No.: US12270873
Filing date: 2008-11-14
IPC classification: G06F17/00
CPC classification: G06F17/30501 , G06F17/30315 , H03M7/30 , H03M7/48
Abstract: The subject disclosure relates to column-based data encoding where raw data to be compressed is organized by columns, and then, as first and second layers of reduction of the data size, dictionary encoding and/or value encoding are applied to the data as organized by columns, to create integer sequences that correspond to the columns. Next, a hybrid greedy run-length encoding and bit packing compression algorithm further compacts the data according to an analysis of bit savings. Synergy of the hybrid data reduction techniques in concert with the column-based organization, coupled with gains in scanning and querying efficiency owing to the representation of the compact data, results in substantially improved data compression at a fraction of the cost of conventional systems.
-
4.
Publication No.: US20120109910A1
Publication date: 2012-05-03
Application No.: US13347367
Filing date: 2012-01-10
IPC classification: G06F17/30
CPC classification: G06F17/30501 , G06F17/30315 , H03M7/30 , H03M7/48
Abstract: The subject disclosure relates to column-based data encoding where raw data to be compressed is organized by columns, and then, as first and second layers of reduction of the data size, dictionary encoding and/or value encoding are applied to the data as organized by columns, to create integer sequences that correspond to the columns. Next, a hybrid greedy run-length encoding and bit packing compression algorithm further compacts the data according to an analysis of bit savings. Synergy of the hybrid data reduction techniques in concert with the column-based organization, coupled with gains in scanning and querying efficiency owing to the representation of the compact data, results in substantially improved data compression at a fraction of the cost of conventional systems.
-
5.
Publication No.: US08108361B2
Publication date: 2012-01-31
Application No.: US12270873
Filing date: 2008-11-14
IPC classification: G06F17/30
CPC classification: G06F17/30501 , G06F17/30315 , H03M7/30 , H03M7/48
Abstract: The subject disclosure relates to column-based data encoding where raw data to be compressed is organized by columns, and then, as first and second layers of reduction of the data size, dictionary encoding and/or value encoding are applied to the data as organized by columns, to create integer sequences that correspond to the columns. Next, a hybrid greedy run-length encoding and bit packing compression algorithm further compacts the data according to an analysis of bit savings. Synergy of the hybrid data reduction techniques in concert with the column-based organization, coupled with gains in scanning and querying efficiency owing to the representation of the compact data, results in substantially improved data compression at a fraction of the cost of conventional systems.
-
6.
Publication No.: US07899776B2
Publication date: 2011-03-01
Application No.: US11772480
Filing date: 2007-07-02
CPC classification: G06F17/30592
Abstract: Systems and methodologies for identification of factors that cause significant shifts in transactions in a relational store and/or OLAP environment. Transactions are grouped into significant categories defined across the whole data space, to detect interesting subspaces of transactions. Subsequently, subspaces that show strong variance between two slices can be selected, followed by grouping the subspaces in sub-reports to measure the coverage for each sub-report. A final report can then be generated that contains a list of sub-reports detected in the previous acts.
-
7.
Publication No.: US07689703B2
Publication date: 2010-03-30
Application No.: US11069342
Filing date: 2005-03-01
Applicant: Mosha Pasumansky , Marius Dumitru , Adrian Dumitrascu , Cristian Petculescu , Akshai M. Mirchandani , Paul J. Sanders , Thulusalamatom Krishnamurthi Anand , Richard R. Tkachuk , Raman S. Iyer , Thomas P. Conlon , Alexander Berger , Sergei Gringauze , Ioan Bogdan Crivat , C. James MacLennan , Rong J. Guan
Inventor: Mosha Pasumansky , Marius Dumitru , Adrian Dumitrascu , Cristian Petculescu , Akshai M. Mirchandani , Paul J. Sanders , Thulusalamatom Krishnamurthi Anand , Richard R. Tkachuk , Raman S. Iyer , Thomas P. Conlon , Alexander Berger , Sergei Gringauze , Ioan Bogdan Crivat , C. James MacLennan , Rong J. Guan
CPC classification: G06F17/30893
Abstract: The subject invention relates to systems and methods that extend the network data access capabilities of mark-up language protocols. In one aspect, a network data transfer system is provided. The system includes a protocol component that employs a computerized mark-up language to facilitate data interactions between network components, whereby the data interactions were previously limited or based on a statement command associated with the markup language. An extension component operates with the protocol component to support the data transactions, where the extension component supplies at least one other command from the statement command to facilitate the data interactions.
-
8.
Publication No.: US20080235180A1
Publication date: 2008-09-25
Application No.: US11688409
Filing date: 2007-03-20
Applicant: T.K. Anand , Paul J. Sanders , Richard R. Tkachuk , Cristian Petculescu , Chu Xu , Akshai M. Mirchandani , Valeri Kim , Andriy Garbuzov , C. James MacLennan , Marius Dumitru , Ioan Bogdan Crivat
Inventor: T.K. Anand , Paul J. Sanders , Richard R. Tkachuk , Cristian Petculescu , Chu Xu , Akshai M. Mirchandani , Valeri Kim , Andriy Garbuzov , C. James MacLennan , Marius Dumitru , Ioan Bogdan Crivat
IPC classification: G06F7/00
CPC classification: G06F17/30592 , G06F17/30306
Abstract: Systems and methods that supply extensibility mechanisms for analysis services, via a plug-in component that enables additional functionalities. The plug-in component provides additional custom logic for the analysis services unified dimensional model (UDM). Accordingly, server functionalities can be extended in an agile manner, and without a requirement for a new release, for example.
-
9.
Publication No.: US07886289B2
Publication date: 2011-02-08
Application No.: US11688409
Filing date: 2007-03-20
Applicant: Thulusalamatom K. Anand , Paul J. Sanders , Richard R. Tkachuk , Cristian Petculescu , Chu Xu , Akshai M. Mirchandani , Valeri Kim , Andriy Garbuzov , C. James MacLennan , Marius Dumitru , Ioan Bogdan Crivat
Inventor: Thulusalamatom K. Anand , Paul J. Sanders , Richard R. Tkachuk , Cristian Petculescu , Chu Xu , Akshai M. Mirchandani , Valeri Kim , Andriy Garbuzov , C. James MacLennan , Marius Dumitru , Ioan Bogdan Crivat
CPC classification: G06F17/30592 , G06F17/30306
Abstract: Systems and methods that supply extensibility mechanisms for analysis services, via a plug-in component that enables additional functionalities. The plug-in component provides additional custom logic for the analysis services unified dimensional model (UDM). Accordingly, server functionalities can be extended in an agile manner, and without a requirement for a new release, for example.
-
10.
Publication No.: US20090319330A1
Publication date: 2009-12-24
Application No.: US12141105
Filing date: 2008-06-18
IPC classification: G06Q10/00
CPC classification: G06Q30/02 , G06Q10/0639 , G06Q30/0202
Abstract: Various technologies and techniques are disclosed for calculating and evaluating the behavior of recommendation systems. Accuracy measures are computed for a plurality of items in a real recommendation system, an ideal recommendation system, and a popularity-based baseline recommendation system. The accuracy measures for the plurality of items are presented to a user so the user can evaluate a performance of the real recommendation system in comparison to the ideal recommendation system and the popularity-based baseline recommendation system. The accuracy measures can be presented in an interactive graph.
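The comparison described in this abstract (a real recommender measured against an ideal one and a popularity-based baseline) can be sketched in Python as follows. The abstract does not name its accuracy measure, so hit rate at k is used here as an assumed stand-in; the function names, the one-held-out-item-per-user layout, and the sample data are likewise hypothetical.

```python
from collections import Counter


def hit_rate(recommend, holdout, k):
    """Fraction of held-out (user, item) pairs found in the user's top-k list."""
    hits = sum(1 for user, item in holdout if item in recommend(user)[:k])
    return hits / len(holdout)


def compare(real_recommend, training, holdout, ks=(1, 2, 3)):
    """Return rows of (k, real, ideal, popularity_baseline) hit rates.

    Assumes exactly one held-out item per user.
    """
    # Baseline: every user gets the globally most popular training items.
    popular = [item for item, _ in Counter(i for _, i in training).most_common()]
    # Ideal: a recommender that already knows each user's held-out item.
    truth = dict(holdout)
    rows = []
    for k in ks:
        rows.append((
            k,
            hit_rate(real_recommend, holdout, k),
            hit_rate(lambda user: [truth[user]], holdout, k),
            hit_rate(lambda user: popular, holdout, k),
        ))
    return rows


# Hypothetical data: past interactions, held-out interactions, and the
# ranked output of the system under evaluation.
training = [("u1", "a"), ("u2", "a"), ("u3", "b"), ("u1", "c")]
holdout = [("u1", "b"), ("u2", "c"), ("u3", "a")]
ranked = {"u1": ["b", "a"], "u2": ["a", "b"], "u3": ["c", "a"]}

for row in compare(ranked.get, training, holdout, ks=(1, 2)):
    print(row)
```

The rows could feed a chart with k on one axis and the three hit rates as separate series, which is one plausible reading of the interactive graph mentioned in the abstract.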