Invention Application
- Patent Title: DETECTING DATA SKEW IN A JOIN OPERATION
-
Application No.: US17502685Application Date: 2021-10-15
-
Publication No.: US20220035814A1Publication Date: 2022-02-03
- Inventor: Florian Andreas Funke , Thierry Cruanes , Benoit Dageville , Marcin Zukowski
- Applicant: Snowflake Inc.
- Applicant Address: US MT Bozeman
- Assignee: Snowflake Inc.
- Current Assignee: Snowflake Inc.
- Current Assignee Address: US MT Bozeman
- Main IPC: G06F16/2453
- IPC: G06F16/2453 ; G06F16/22 ; G06F16/2455

Abstract:
Systems, methods, and devices, for managing data skew during a join operation are disclosed. A method includes computing a hash value for a join operation and detecting data skew on a probe side of the join operation at a runtime of the join operation using a lightweight sketch data structure. The method includes identifying a frequent probe-side join key on the probe side of the join operation during a probe phase of the join operation. The method includes identifying a frequent build-side row having a build-side join key corresponding with the frequent probe-side join key. The method includes asynchronously distributing the frequent build-side row to one or more remote servers.
Public/Granted literature
- US11347738B2 Detecting data skew in a join operation Public/Granted day:2022-05-31
Information query