Invention Grant
- Patent Title: Lazy reassembling of semi-structured data
-
Application No.: US17814110Application Date: 2022-07-21
-
Publication No.: US11567939B2Publication Date: 2023-01-31
- Inventor: Mahmud Allahverdiyev , Selcuk Aya , Bowei Chen , Ismail Oukid
- Applicant: Snowflake Inc.
- Applicant Address: US MT Bozeman
- Assignee: Snowflake Inc.
- Current Assignee: Snowflake Inc.
- Current Assignee Address: US MT Bozeman
- Agency: Schwegman Lundberg & Woessner, P.A.
- Main IPC: G06F16/24
- IPC: G06F16/24 ; G06F16/2455 ; G06F16/9035 ; G06F16/28 ; G06F17/18 ; G06F16/22

Abstract:
A pruning index is generated for a source table organized into a set of batch units. The source table comprises a column of semi-structured data. The pruning index comprises a set of filters that index distinct values in each column of the source table. Rather than reassembling an entire tree structure of the semi-structured data prior to indexing, the generating of the pruning index comprises traversing a reassembly hook object that represents a first portion of the semi-structured data that is subcolumnarized and traversing a residual object that represents a second portion of the semi-structured data that is not subcolumnarized. The reassembly hook object is traversed to identify values corresponding to the first portion of the semi-structured data and the residual object is traversed to identify values corresponding to the second portion. The pruning index is stored with an association with the source table.
Public/Granted literature
- US20220358128A1 LAZY REASSEMBLING OF SEMI-STRUCTURED DATA Public/Granted day:2022-11-10
Information query