Faster access for compressed time series data: the block index

    Publication No.: US11263196B2

    Publication date: 2022-03-01

    Application No.: US16358598

    Filing date: 2019-03-19

    Applicant: SAP SE

    Abstract: A system and method for faster access for compressed time series data. A set of blocks is generated based on a table stored in a database of a data platform. The table stores data associated with multiple sources of data provided as consecutive values, each block containing index vectors having a range of the consecutive values. A block index is generated for each block having a field start vector representing a starting position of the block relative to the range of consecutive values, and a starting value vector representing a value of the block at the starting position. The field start vector of the block index is accessed to obtain the starting position of a field corresponding to a first block and to the range of the consecutive values of the first block. The starting value vector is then determined from the block index to determine an end and a length of the field of the first block.
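
    Illustrative sketch of the block index idea described in the abstract above. It is a simplified reading, not the patented implementation: the class name `BlockIndex`, the methods `field` and `find`, and the assumption that blocks are non-empty and sorted in ascending order across the whole table are all hypothetical.

```python
from bisect import bisect_right


class BlockIndex:
    """Hypothetical block index over blocks concatenated into one vector."""

    def __init__(self, blocks):
        # Assumption: every block is non-empty and values ascend across blocks.
        self.values = []          # all blocks stored back to back
        self.field_start = []     # starting position of each block
        self.starting_value = []  # value found at each starting position
        for block in blocks:
            self.field_start.append(len(self.values))
            self.starting_value.append(block[0])
            self.values.extend(block)
        # Sentinel so the end of the last block can be read the same way.
        self.field_start.append(len(self.values))

    def field(self, block_no):
        """Return (start, end, length) of a block's field; end is exclusive."""
        start = self.field_start[block_no]
        end = self.field_start[block_no + 1]
        return start, end, end - start

    def find(self, value):
        """Return the position of value, scanning only the block that can hold it."""
        block_no = bisect_right(self.starting_value, value) - 1
        if block_no < 0:
            return None
        start, end, _ = self.field(block_no)
        for pos in range(start, end):
            if self.values[pos] == value:
                return pos
        return None


index = BlockIndex([[10, 11, 12], [20, 21], [30, 31, 32, 33]])
```

    In this sketch a lookup touches the two small index vectors first and then scans a single block, rather than the whole column.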

    Compressing time stamp columns
    Granted invention patent

    Publication No.: US11386104B2

    Publication date: 2022-07-12

    Application No.: US16661993

    Filing date: 2019-10-23

    Applicant: SAP SE

    IPC classification: G06F16/2458 G06F16/22

    Abstract: Disclosed is a system and method for improving database memory consumption and performance using compression of time stamp columns. A number of time stamps of a time series is received. The time stamps have a start time, and are separated by an equal increment of time that defines an interval. The start time and interval are stored in a dictionary of a column store of a database. An index is generated in the column store of the database, the index having a number of index vectors. Using the index vectors, each time stamp of the number of time stamps can be calculated from the start time stored in the dictionary and the position in the time series based on the interval stored in the dictionary.
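
    Illustrative sketch of the calculation the abstract above describes: with equidistant time stamps, only the start time and the interval need to be stored, and each time stamp follows from its position. The class `EquidistantTimestamps` and its method names are hypothetical and stand in for the dictionary and index vectors of a real column store.

```python
from datetime import datetime, timedelta


class EquidistantTimestamps:
    """Hypothetical column that stores a start time and interval only."""

    def __init__(self, start, interval, count):
        self.start = start        # first time stamp of the series
        self.interval = interval  # fixed spacing between time stamps
        self.count = count        # number of time stamps in the series

    def __getitem__(self, position):
        # time stamp = start + position * interval
        if not 0 <= position < self.count:
            raise IndexError(position)
        return self.start + position * self.interval

    def position_of(self, timestamp):
        # Inverse mapping: only time stamps on the interval grid exist.
        position, remainder = divmod(timestamp - self.start, self.interval)
        if remainder or not 0 <= position < self.count:
            raise KeyError(timestamp)
        return position


# One day of 15-minute readings: 96 time stamps held as two dictionary values.
column = EquidistantTimestamps(datetime(2019, 10, 23), timedelta(minutes=15), 96)
```

    Here `column[4]` evaluates to 01:00 on the start day, and `position_of` maps that time stamp back to position 4.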

    Multi-representation storage of time series data

    Publication No.: US10558678B2

    Publication date: 2020-02-11

    Application No.: US16124126

    Filing date: 2018-09-06

    Applicant: SAP SE

    Abstract: A flexible approach for storing time series data utilizes multiple representations in order to achieve optimization amongst various dimensions such as covered time period, accuracy, compression model, and storage medium. A first aspect selectively provides different representations for various portions of a time series along the time axis (horizontal). In this horizontal aspect, a first compression model may be applied to store data occurring during one period in a time series, while a different compression model is applied to store data during another period. In a separate (vertical) aspect, time series data over a same time period may be saved in multiple representations using different compression models. While increasing demands on memory, such vertical storage can afford efficient access for specific purposes (e.g., analytical). Storing time series data according to horizontal and/or vertical distributions may be useful for applications such as data aging and the optimization of operator execution patterns.
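
    Illustrative sketch of the horizontal and vertical aspects described in the abstract above. The two "compression models" used here (raw storage and a lossy per-bucket mean) are assumptions chosen for brevity, and `Segment`, `raw`, `bucket_mean`, and `read` are hypothetical names.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Segment:
    start: int     # first position covered (inclusive)
    end: int       # last position covered (exclusive)
    width: int     # positions represented by one payload entry
    payload: list  # stored values


def raw(values, start):
    """Lossless representation: one stored entry per position."""
    return Segment(start, start + len(values), 1, list(values))


def bucket_mean(values, start, width):
    """Lossy representation: one mean per bucket of `width` positions."""
    payload = [mean(values[i:i + width]) for i in range(0, len(values), width)]
    return Segment(start, start + len(values), width, payload)


def read(segments, position):
    """Read a position from whichever segment covers it."""
    for segment in segments:
        if segment.start <= position < segment.end:
            return segment.payload[(position - segment.start) // segment.width]
    raise IndexError(position)


series = [1, 3, 5, 7, 10, 11, 12, 13]

# Horizontal: the older period is stored coarsely, the recent period exactly.
horizontal = [bucket_mean(series[:4], 0, 2), raw(series[4:], 4)]

# Vertical: the same period is stored twice, under two different models.
vertical = [raw(series, 0), bucket_mean(series, 0, 4)]
```

    The horizontal list trades accuracy for space on older data, while the vertical list spends extra memory so that an analytical reader can use the coarse copy.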

    Multi-representation Storage of Time Series Data

    Publication No.: US20190005102A1

    Publication date: 2019-01-03

    Application No.: US16124126

    Filing date: 2018-09-06

    Applicant: SAP SE

    IPC classification: G06F17/30 G06F3/06

    Abstract: A flexible approach for storing time series data utilizes multiple representations in order to achieve optimization amongst various dimensions such as covered time period, accuracy, compression model, and storage medium. A first aspect selectively provides different representations for various portions of a time series along the time axis (horizontal). In this horizontal aspect, a first compression model may be applied to store data occurring during one period in a time series, while a different compression model is applied to store data during another period. In a separate (vertical) aspect, time series data over a same time period may be saved in multiple representations using different compression models. While increasing demands on memory, such vertical storage can afford efficient access for specific purposes (e.g., analytical). Storing time series data according to horizontal and/or vertical distributions may be useful for applications such as data aging and the optimization of operator execution patterns.

    Multi-representation storage of time series data

    Publication No.: US10095757B2

    Publication date: 2018-10-09

    Application No.: US14961502

    Filing date: 2015-12-07

    Applicant: SAP SE

    IPC classification: G06F17/00 G06F17/30 G06F3/06

    Abstract: A flexible approach for storing time series data utilizes multiple representations in order to achieve optimization amongst various dimensions such as covered time period, accuracy, compression model, and storage medium. A first aspect selectively provides different representations for various portions of a time series along the time axis (horizontal). In this horizontal aspect, a first compression model may be applied to store data occurring during one period in a time series, while a different compression model is applied to store data during another period. In a separate (vertical) aspect, time series data over a same time period may be saved in multiple representations using different compression models. While increasing demands on memory, such vertical storage can afford efficient access for specific purposes (e.g., analytical). Storing time series data according to horizontal and/or vertical distributions may be useful for applications such as data aging and the optimization of operator execution patterns.

    Faster access for compressed time series data: the block index

    Publication No.: US11892999B2

    Publication date: 2024-02-06

    Application No.: US17579336

    Filing date: 2022-01-19

    Applicant: SAP SE

    Abstract: A system and method for faster access for compressed time series data. A set of blocks is generated based on a table stored in a database of a data platform. The table stores data associated with multiple sources of data provided as consecutive values, each block containing index vectors having a range of the consecutive values. A block index is generated for each block having a field start vector representing a starting position of the block relative to the range of consecutive values, and a starting value vector representing a value of the block at the starting position. The field start vector of the block index is accessed to obtain the starting position of a field corresponding to a first block and to the range of the consecutive values of the first block. The starting value vector is then determined from the block index to determine an end and a length of the field of the first block.

    Advisor generating multi-representations of time series data

    Publication No.: US10685306B2

    Publication date: 2020-06-16

    Application No.: US14961456

    Filing date: 2015-12-07

    Applicant: SAP SE

    Abstract: An advisor creates configurations for generating multi-representations of time series data based upon detected characteristics such as length, interval, minimums, data types, etc., as well as configurations provided by a user. In an offline mode the advisor may further consider a previous time series workload. In an online mode the advisor may adapt multi-representation configurations with respect to ongoing changes in a current time series workload. The advisor may reference a cost model including values quantifying various dimensions (e.g., compression technique, accuracy, covered time period, storage medium, memory consumption, speed) of the multi-representations for optimization purposes. Configurations created by the advisor may be input to a storage engine to generate and store the multi-representations according to goals for data aging, operation execution pattern optimization, and ease of access to time series data located in hot zones. The advisor may be implemented with an engine of an in-memory database.

    Multi-representation Storage of Time Series Data

    Publication No.: US20170161340A1

    Publication date: 2017-06-08

    Application No.: US14961502

    Filing date: 2015-12-07

    Applicant: SAP SE

    IPC classification: G06F17/30

    Abstract: A flexible approach for storing time series data utilizes multiple representations in order to achieve optimization amongst various dimensions such as covered time period, accuracy, compression model, and storage medium. A first aspect selectively provides different representations for various portions of a time series along the time axis (horizontal). In this horizontal aspect, a first compression model may be applied to store data occurring during one period in a time series, while a different compression model is applied to store data during another period. In a separate (vertical) aspect, time series data over a same time period may be saved in multiple representations using different compression models. While increasing demands on memory, such vertical storage can afford efficient access for specific purposes (e.g., analytical). Storing time series data according to horizontal and/or vertical distributions may be useful for applications such as data aging and the optimization of operator execution patterns.

    Aggregating database entries by hashing
    Invention application (under examination, published)

    Publication No.: US20160350394A1

    Publication date: 2016-12-01

    Application No.: US14726251

    Filing date: 2015-05-29

    Applicant: SAP SE

    IPC classification: G06F17/30

    Abstract: Aggregating input into hashtables using just-in-time compilation of compilable code in response to a database query. Compilable code can be generated that is configured to cause a programmable processor to produce one or more hashmaps based on the input database. The one or more hashmaps can correspond to each individual thread from the input. The compilable code can be configured to cause the one or more processors to insert the hashmaps into a scheduler. Compilable code can be generated that is configured to: aggregate elements from the one or more hashmaps into buckets of elements having the same partition identity; rehash the buckets of elements having the same partition identity to reduce the number of groups within the bucket; and facilitate the merger of all non-empty elements from each target-partition into a merged-partition.
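
    Illustrative sketch of the partition-and-merge flow described in the abstract above. It covers only the hashing steps: there is no just-in-time compilation and no scheduler, the per-thread inputs are simulated as plain lists processed one after another, and SUM is assumed as the aggregate. The names `local_aggregate`, `partition_of`, and `merge` are hypothetical.

```python
from collections import defaultdict


def local_aggregate(rows):
    """Build one hash map for one thread's share of the input (SUM per key)."""
    table = defaultdict(int)
    for key, value in rows:
        table[key] += value
    return table


def partition_of(key, partitions):
    """Assign a key to a partition; equal keys always share a partition."""
    return hash(key) % partitions


def merge(local_tables, partitions):
    """Bucket entries by partition, re-aggregate per bucket, then combine."""
    buckets = [defaultdict(int) for _ in range(partitions)]
    for table in local_tables:
        for key, value in table.items():
            # Rehashing into the bucket collapses duplicate groups per key.
            buckets[partition_of(key, partitions)][key] += value
    merged = {}
    for bucket in buckets:
        # Buckets hold disjoint keys, so a plain update is a safe merge.
        merged.update(bucket)
    return merged


chunks = [[("a", 1), ("b", 2), ("a", 3)], [("b", 4), ("c", 5)]]
result = merge([local_aggregate(chunk) for chunk in chunks], partitions=4)
```

    Because the buckets hold disjoint key sets, each one could in principle be merged independently of the others, which is what makes the partitioning step useful for parallel execution.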

    Forecast-model-aware data storage for time series data

    Publication No.: US10445399B2

    Publication date: 2019-10-15

    Application No.: US14284936

    Filing date: 2014-05-22

    Applicant: SAP SE

    Abstract: A system includes multiple memory modules arranged and configured to store data and at least one processor that is operably coupled to the memory modules. The at least one processor is arranged and configured to select an access pattern of a forecast model, determine a storage layout model based on the identified access pattern of the forecast model, and store values in an order defined by the storage layout model using at least one of the memory modules. The order of the stored values enables sequential access to the stored values for use in the forecast model. Implementations of one or more features of the system may be performed by a computer-implemented method and/or a computer program product.
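
    Illustrative sketch of matching a storage order to a forecast model's access pattern, as the abstract above describes. The seasonal access pattern (a model that reads every value sharing the same phase of a season) is an assumed example, since the abstract does not name a specific pattern, and `seasonal_layout` and `read_phase` are hypothetical helpers.

```python
def seasonal_layout(values, season_length):
    """Reorder values so those sharing a seasonal phase are stored contiguously."""
    return [
        values[i]
        for phase in range(season_length)
        for i in range(phase, len(values), season_length)
    ]


def read_phase(layout, season_length, phase):
    """Read all values of one phase as a single sequential slice."""
    full, extra = divmod(len(layout), season_length)
    # Earlier phases each hold `full` values, plus one more for the first `extra`.
    start = phase * full + min(phase, extra)
    length = full + (1 if phase < extra else 0)
    return layout[start:start + length]


values = list(range(10))  # time-ordered observations 0..9
layout = seasonal_layout(values, season_length=4)
```

    In time order, the values of phase 1 sit four positions apart; in the reordered layout they are adjacent, so the assumed seasonal model reads them with one sequential scan.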