-
公开(公告)号:US11263175B2
公开(公告)日:2022-03-01
申请号:US17039584
申请日:2020-09-30
申请人: SAS Institute Inc.
IPC分类号: G06F16/13 , G06F9/50 , G06F3/06 , G06F12/02 , G06F21/60 , G06F16/27 , G06F16/182 , G06F16/22
摘要: An apparatus includes a processor to: within each reading thread, retrieve a data set part and corresponding part metadata from storage device(s), analyze row group metadata for each row group within the data set part to identify candidate row group(s) meeting specified criteria, and store the candidate row group(s) and corresponding row group metadata within a data buffer of a queue; operate the queue as a FIFO buffer; within each provision thread, retrieve one of multiple row groups and corresponding metadata from within the data buffer, use information in the metadata to identify rows meeting the criteria, and provide those rows to the requesting device or an application; and in response to each instance of storage of a data set part within a data buffer of the queue, analyze the availability of storage space and/or of processing resources to determine whether to dynamically adjust the quantity of reading threads.
-
公开(公告)号:US20210026805A1
公开(公告)日:2021-01-28
申请号:US17039314
申请日:2020-09-30
申请人: SAS Institute Inc.
IPC分类号: G06F16/13 , G06F3/06 , G06F9/50 , G06F21/60 , G06F12/02 , G06F16/27 , G06F16/182 , G06F16/22
摘要: An apparatus includes a processor to: instantiate data buffers of a queue, reading threads, and provision threads; within each reading thread, use an identifier provided in a data buffer of the queue to retrieve the corresponding data set part and part metadata from storage device(s), and store both within the data buffer; operate the queue as a (FIFO) buffer; within each provision thread, retrieve a row group from among multiple row groups and corresponding metadata from within the data buffer, use information in the metadata to decompress at least one column, and provide the data values of the row group to the requesting device or an application routine; and in response to each instance of storage of a data set part within a data buffer of the queue, analyze the availability of storage space and/or of processing resources to determine whether to dynamically adjust the quantity of reading threads.
-
公开(公告)号:US09977807B1
公开(公告)日:2018-05-22
申请号:US15838211
申请日:2017-12-11
申请人: SAS Institute Inc.
CPC分类号: G06F17/30327 , G06F7/02 , G06F7/08 , G06F7/20 , G06F9/5027 , G06F9/5072 , G06F17/30289 , G06F17/30321 , G06F17/3033 , G06F17/30345 , G06F17/30371 , G06F17/30424 , G06F17/30575 , G06F17/30725 , G06F17/30949 , G06F17/30961
摘要: An apparatus including a processor to: receive search criteria including a data value; in response to receiving the search criteria, generate a hash value from the data value of the search criteria, and for each data cell of a super cell, compare the hash value to hash values within a hash values vector in the corresponding cell index to determine whether the data cell includes at least one data record meeting the search criteria, and in response to determining that the data cell includes at least one of such data record, search the data records to identify one or more data records meeting the search criteria; and in response to identifying at least one data record within at least one data cell of the super cell meeting the search criteria, provide results data indicative of the super cell including at least one of such data record.
-
公开(公告)号:US10303670B2
公开(公告)日:2019-05-28
申请号:US15984706
申请日:2018-05-21
申请人: SAS Institute Inc.
IPC分类号: G06F17/30 , G06F16/22 , G06F9/50 , G06F16/21 , G06F16/23 , G06F16/27 , G06F16/245 , G06F16/38 , G06F16/901 , G06F7/08 , G06F7/20 , G06F7/02
摘要: An apparatus including a processor to index data records within a data cell, wherein for each data record, the processor retrieves data values from first and second data fields; determines whether the first and second data fields store unique data values; in response to the first data field storing a unique data value, adds an identifier of the data record to a first unique values index, in response to the second data field storing a unique data value, adds the identifier to a second unique values index, wherein identifiers of data records within the unique values indexes are ordered based on corresponding unique data values; and generates an indication of ranges of data values of the first and second data fields to enable a determination of whether a data value specified in search criteria is present within at least the data cell.
-
公开(公告)号:US11347686B2
公开(公告)日:2022-05-31
申请号:US17039314
申请日:2020-09-30
申请人: SAS Institute Inc.
IPC分类号: G06F16/13 , G06F9/50 , G06F3/06 , G06F21/60 , G06F12/02 , G06F16/27 , G06F16/182 , G06F16/22
摘要: An apparatus includes a processor to: instantiate data buffers of a queue, reading threads, and provision threads; within each reading thread, use an identifier provided in a data buffer of the queue to retrieve the corresponding data set part and part metadata from storage device(s), and store both within the data buffer; operate the queue as a (FIFO) buffer; within each provision thread, retrieve a row group from among multiple row groups and corresponding metadata from within the data buffer, use information in the metadata to decompress at least one column, and provide the data values of the row group to the requesting device or an application routine; and in response to each instance of storage of a data set part within a data buffer of the queue, analyze the availability of storage space and/or of processing resources to determine whether to dynamically adjust the quantity of reading threads.
-
公开(公告)号:US20180276259A1
公开(公告)日:2018-09-27
申请号:US15984706
申请日:2018-05-21
申请人: SAS Institute Inc.
CPC分类号: G06F17/30321 , G06F7/02 , G06F7/08 , G06F7/20 , G06F9/5027 , G06F9/5072 , G06F17/30289 , G06F17/30327 , G06F17/3033 , G06F17/30345 , G06F17/30371 , G06F17/30424 , G06F17/30575 , G06F17/30725 , G06F17/30949 , G06F17/30961
摘要: An apparatus including a processor to index data records within a data cell, wherein for each data record, the processor retrieves data values from first and second data fields; determines whether the first and second data fields store unique data values; in response to the first data field storing a unique data value, adds an identifier of the data record to a first unique values index, in response to the second data field storing a unique data value, adds the identifier to a second unique values index, wherein identifiers of data records within the unique values indexes are ordered based on corresponding unique data values; and generates an indication of ranges of data values of the first and second data fields to enable a determination of whether a data value specified in search criteria is present within at least the data cell.
-
公开(公告)号:US10013441B1
公开(公告)日:2018-07-03
申请号:US15838195
申请日:2017-12-11
申请人: SAS Institute Inc.
CPC分类号: G06F16/2228 , G06F7/02 , G06F7/08 , G06F7/20 , G06F9/5027 , G06F9/5072 , G06F16/21 , G06F16/2246 , G06F16/2255 , G06F16/23 , G06F16/2365 , G06F16/245 , G06F16/27 , G06F16/381 , G06F16/9014 , G06F16/9027
摘要: An apparatus including a processor to: index multiple data records within a data cell by first and second data fields in a single read pass through the data cell; wherein for each data record within the first data cell, the processor is to retrieve data values from the first and second data fields, search a first binary tree to determine whether the data value from the first data field comprises a unique value, and add the data value to the first binary tree if it is unique, and search a second binary tree to determine whether the data value from the second data field comprises a unique value, and add the data value to the second binary tree if it is unique; and generate a first and second unique values indexes of identifiers of the data records associated with the unique data values within the first and second binary trees.
-
公开(公告)号:US20210026806A1
公开(公告)日:2021-01-28
申请号:US17039584
申请日:2020-09-30
申请人: SAS Institute Inc.
IPC分类号: G06F16/13 , G06F3/06 , G06F9/50 , G06F21/60 , G06F12/02 , G06F16/27 , G06F16/182 , G06F16/22
摘要: An apparatus includes a processor to: within each reading thread, retrieve a data set part and corresponding part metadata from storage device(s), analyze row group metadata for each row group within the data set part to identify candidate row group(s) meeting specified criteria, and store the candidate row group(s) and corresponding row group metadata within a data buffer of a queue; operate the queue as a FIFO buffer; within each provision thread, retrieve one of multiple row groups and corresponding metadata from within the data buffer, use information in the metadata to identify rows meeting the criteria, and provide those rows to the requesting device or an application; and in response to each instance of storage of a data set part within a data buffer of the queue, analyze the availability of storage space and/or of processing resources to determine whether to dynamically adjust the quantity of reading threads.
-
公开(公告)号:US10002146B1
公开(公告)日:2018-06-19
申请号:US15838175
申请日:2017-12-11
申请人: SAS Institute Inc.
CPC分类号: G06F16/2228 , G06F7/02 , G06F7/08 , G06F7/20 , G06F9/5027 , G06F9/5072 , G06F16/21 , G06F16/2246 , G06F16/2255 , G06F16/23 , G06F16/2365 , G06F16/245 , G06F16/27 , G06F16/381 , G06F16/9014 , G06F16/9027
摘要: An apparatus including a processor to receive search criteria including a data value for a search within a data field; in response to the receipt of the query instructions, and for each data cell within a super cell, perform the specified search by comparing the data value to ranges of values indicated in a corresponding cell index to determine whether the data cell includes a data record meeting the search criteria, and in response to a determination that the data cell includes such a data record, use a unique values index in the cell index to search the data records of the data cell to identify one or more data records meeting the search criteria; and in response to identifying at least one data record meeting the search criteria, provide an indication that at least the data cell includes at least one data record meeting the search criteria.
-
公开(公告)号:US09977805B1
公开(公告)日:2018-05-22
申请号:US15838110
申请日:2017-12-11
申请人: SAS Institute Inc.
CPC分类号: G06F17/30327 , G06F7/02 , G06F7/08 , G06F7/20 , G06F9/5027 , G06F9/5072 , G06F17/30289 , G06F17/30321 , G06F17/3033 , G06F17/30345 , G06F17/30371 , G06F17/30424 , G06F17/30575 , G06F17/30725 , G06F17/30949 , G06F17/30961
摘要: An apparatus including a processor to index data records within a data cell, wherein for each data record, the processor retrieves data values from first and second data fields; determines whether the first and second data fields store unique data values; in response to the first data field storing a unique data value, adds an identifier of the data record to a first unique values index, in response to the second data field storing a unique data value, adds the identifier to a second unique values index, wherein identifiers of data records within the unique values indexes are ordered based on corresponding unique data values; and generates an indication of ranges of data values of the first and second data fields to enable a determination of whether a data value specified in search criteria is present within at least the data cell.
-
-
-
-
-
-
-
-
-