专利检索 ap:("International Business Machines Corporation") AND inv:"Lukasz Gaza" 第 2 页

11.

发明授权
Optimization of a plurality of table processing operations in a massive parallel processing environment 有权

公开(公告)号：US09922083B2

公开(公告)日：2018-03-20

申请号：US14731456

申请日：2015-06-05

申请人： International Business Machines Corporation

发明人： Lukasz Gaza , Artur M. Gruszecki , Tomasz Kazalski , Konrad K. Skibski , Tomasz Stradomski

IPC分类号： G06F17/30

CPC分类号： G06F17/30445 , G06F17/30486

摘要： A computer-implemented method for partitioning data for a query operation of one table of the database system is provided. The computer-implemented method comprises estimating a value distribution of the attribute in the result table based on a first value distribution of the attribute in the first column of the first table. The computer-implemented method further comprises determining boundaries for partitioning ranges of the attribute, based on the estimated value distribution, wherein the partitioning ranges correspond to a same number of rows of the result table. The computer-implemented method further comprises partitioning the first table with processing nodes of the query operation, based on the determined boundaries of partitioning ranges.

12.

发明申请
PARALLEL PREPARATION OF A QUERY EXECUTION PLAN IN A MASSIVELY PARALLEL PROCESSING ENVIRONMENT BASED ON GLOBAL AND LOW-LEVEL STATISTICS 审中-公开

公开(公告)号：US20170147640A1

公开(公告)日：2017-05-25

申请号：US14948483

申请日：2015-11-23

申请人： International Business Machines Corporation

发明人： Lukasz Gaza , Artur M. Gruszecki , Tomasz Kazalski , Konrad K. Skibski , Tomasz K. Stradomski

IPC分类号： G06F17/30

CPC分类号： G06F17/30463

摘要： In an approach to preparing a query execution plan, a host node receives a query implicating one or more data tables. The host node broadcasts one or more implicated data tables to one or more processing nodes. The host node receives a set of node-specific query execution plans and execution cost estimates associated with each of the node-specific query execution plans, which have been prepared in parallel based on global statistics and node-specific low level statistics. The host node selects an optimal query execution plan based on minimized execution cost.

13.

发明申请
OPTIMIZATION OF A PLURALITY OF TABLE PROCESSING OPERATIONS IN A MASSIVE PARALLEL PROCESSING ENVIRONMENT 审中-公开

公开(公告)号：US20160098447A1

公开(公告)日：2016-04-07

申请号：US14505715

申请日：2014-10-03

申请人： International Business Machines Corporation

发明人： Lukasz Gaza , Artur M. Gruszecki , Tomasz Kazalski , Konrad K. Skibski , Tomasz Stradomski

IPC分类号： G06F17/30

CPC分类号： G06F17/30445 , G06F17/30486

摘要： A computer-implemented method for partitioning data for a query operation of one table of the database system is provided. The computer-implemented method comprises estimating a value distribution of the attribute in the result table based on a first value distribution of the attribute in the first column of the first table. The computer-implemented method further comprises determining boundaries for partitioning ranges of the attribute, based on the estimated value distribution, wherein the partitioning ranges correspond to a same number of rows of the result table. The computer-implemented method further comprises partitioning the first table with processing nodes of the query operation, based on the determined boundaries of partitioning ranges.

14.

发明授权
Joining two data tables on a join attribute 有权

公开(公告)号：US11163769B2

公开(公告)日：2021-11-02

申请号：US16443958

申请日：2019-06-18

申请人： International Business Machines Corporation

发明人： Michal Bodziony , Konrad K. Skibski , Tomasz Kazalski , Artur M. Gruszecki , Lukasz Gaza

IPC分类号： G06F16/00 , G06F16/2453

摘要： A computer-implemented method for joining two data tables on a join attribute, where the data tables have at least a first and a second attribute and the second attribute is the join attribute. The method provides a function for associating a computing node to a given record. The function may be used to determine the associated computing node. The records of the two data tables may be distributed to the respective determined computing nodes. The relationship between the values of the first and second attributes may be modelled using a predefined dataset. For each record of the two data tables the values of the first attribute may be re-determined using the corresponding values of the second attribute. The function may be used to re-determine the associated computing node.

15.

发明申请
EARLY DIAGNOSIS OF HARDWARE, SOFTWARE OR CONFIGURATION PROBLEMS IN DATA WAREHOUSE SYSTEM UTILIZING GROUPING OF QUERIES BASED ON QUERY PARAMETERS 审中-公开

公开(公告)号：US20190340050A1

公开(公告)日：2019-11-07

申请号：US16507771

申请日：2019-07-10

申请人： International Business Machines Corporation

发明人： Lukasz Gaza , Artur M. Gruszecki , Tomasz Kazalski , Bartlomiej T. Malecki , Konrad K. Skibski , Tomasz Stradomski

IPC分类号： G06F11/07 , G06F16/21 , G06F16/2457 , G06F16/2455 , G06F16/25 , G06F11/34

摘要： A method, system and computer program product for providing early diagnosis of hardware, software or configuration problems in a data warehouse system. A received query is parsed to determine the properties of the query. The query may then be joined to existing groups of queries if those groups have shared properties of the query. After executing the query according to an execution plan, results from the execution of the query is received, which may include problem(s) that occurred during execution of the query. For those problems that reach a pre-defined threshold of becoming a “group problem” in those groups joined by the query, the problem is reported to the end user concerning those groups where the problem exceeds the pre-defined threshold. In this manner, an early diagnosis of the problems in the data warehouse system that can cause delay and failure of the processing of queries is able to occur.

16.

发明授权
Method for processing a database query 有权

公开(公告)号：US09953065B2

公开(公告)日：2018-04-24

申请号：US14621466

申请日：2015-02-13

申请人： International Business Machines Corporation

发明人： Lukasz Gaza , Artur M. Gruszecki , Tomasz Kazalski , Konrad K. Skibski , Tomasz Stradomski

IPC分类号： G06F17/30 , G06F15/16

CPC分类号： G06F17/30536 , G06F17/30424 , G06F17/30864

摘要： The invention relates to a computer-implemented method for processing a query in a database, the query comprising a search value. The database comprises a plurality of datasets the datasets comprising entries, wherein distance statistics are assigned to the datasets. The distance statistics describe the minimum and maximum distance between the values of the entries of a dataset of the plurality of datasets and a reference value. The method comprises determining the distance between the search value and the reference value, said determination resulting in a search distance, determining a subset of datasets from the plurality of datasets for which the search distance is within the limits given by the minimum and maximum distances described by the respective distance statistics, and searching for the search value in the subset of datasets.

17.

发明申请
EFFICIENT PROCESSING OF DATA EXTENTS 审中-公开

公开(公告)号：US20180060386A1

公开(公告)日：2018-03-01

申请号：US15249509

申请日：2016-08-29

申请人： INTERNATIONAL BUSINESS MACHINES CORPORATION

发明人： Michal Bodziony , Andreas Brodt , Lukasz Gaza , Artur M. Gruszecki , Tomasz Kazalski , Konrad K. Skibski

IPC分类号： G06F17/30

CPC分类号： G06F17/30448 , G06F17/30395

摘要： The present disclosure relates to a computer-implemented method, computer program product, and computer system, for optimization of query processing a set of data extents on which a table is stored. Attribute value information may be maintained for each data extent. The attribute value information indicate as ranges the minimum and maximum values of an attribute of the entries stored in the respective extent. A first metric of a first data extent of the set may determine splitting the first data extent into sub-extents increases query processing efficiency. A second metric of a second data extent and a third data extent may determine merging the second data extent and the third data extent increases query processing efficiency.

18.

发明授权
Directed backup for massively parallel processing databases 有权

公开(公告)号：US09785515B2

公开(公告)日：2017-10-10

申请号：US14614847

申请日：2015-02-05

申请人： International Business Machines Corporation

发明人： Lukasz Gaza , Artur M. Gruszecki , Tomasz Kazalski , Konrad K. Skibski , Tomasz Stradomski

IPC分类号： G06F17/30 , G06F11/14 , H04L29/08

CPC分类号： G06F11/1458 , G06F11/1448 , G06F11/1464 , G06F11/1469 , G06F17/30073 , G06F2201/80 , H04L67/10 , H04L67/1095

摘要： Creating a data backup of data on a first computer system to restore to a second computer system, each of the first and second computer system including one or more nodes, each node configured to manage a subset of the data. Receiving, by the first computer system, identification of data to back up and node configuration information for the second computer system. Creating, by the first computer system, a backup of the data from the one or more nodes of the first computer system, configured in accordance with the node configuration information of the second computer system, such that the backed up data is directly manageable by the one or more nodes of the second computer system.

19.

发明申请
EARLY DIAGNOSIS OF HARDWARE, SOFTWARE OR CONFIGURATION PROBLEMS IN DATA WAREHOUSE SYSTEM UTILIZING GROUPING OF QUERIES BASED ON QUERY PARAMETERS 有权

公开(公告)号：US20170269982A1

公开(公告)日：2017-09-21

申请号：US15617201

申请日：2017-06-08

申请人： International Business Machines Corporation

发明人： Lukasz Gaza , Artur M. Gruszecki , Tomasz Kazalski , Bartlomiej T. Malecki , Konrad K. Skibski , Tomasz Stradomski

IPC分类号： G06F11/07 , G06F17/30

摘要： A method, system and computer program product for providing early diagnosis of hardware, software or configuration problems in a data warehouse system. A received query is parsed to determine the properties of the query. The query may then be joined to existing groups of queries if those groups have shared properties of the query. After executing the query according to an execution plan, results from the execution of the query is received, which may include problem(s) that occurred during execution of the query. For those problems that reach a pre-defined threshold of becoming a “group problem” in those groups joined by the query, the problem is reported to the end user concerning those groups where the problem exceeds the pre-defined threshold. In this manner, an early diagnosis of the problems in the data warehouse system that can cause delay and failure of the processing of queries is able to occur.

20.

发明授权
Avoidance of intermediate data skew in a massive parallel processing environment 有权
标题翻译：避免在大规模并行处理环境中的中间数据偏移

公开(公告)号：US09569493B2

公开(公告)日：2017-02-14

申请号：US14144893

申请日：2013-12-31

申请人： International Business Machines Corporation

发明人： Lukasz Gaza , Artur M. Gruszecki , Tomasz Kazalski , Grzegorz S. Milka , Konrad K. Skibski , Tomasz Stradomski

IPC分类号： G06F17/30

CPC分类号： G06F17/30466 , G06F17/3033 , G06F17/30469 , G06F17/30498 , G06F17/30501

摘要： A computer-implemented method for minimizing join operation processing time within a database system based on estimated joined table spread of the database system has been provided. The computer-implemented method includes, estimating value distribution of data in a joined table, wherein the joined table is a result of join operation between two instances of tables of a database system. The computer-implemented method further includes determining boundaries for partitioning at least one range of attributes of the estimated value distribution, wherein the boundaries for partitioning at least one range of attributes of the estimated value distribution corresponds to a same number of rows of the joined table. The computer-implemented method further includes determining at least one assignment of the determined partition of the at least one range of attributes to processing units of the database system.

摘要翻译： 已经提供了一种基于数据库系统的估计连接表扩展来最小化数据库系统内的连接操作处理时间的计算机实现的方法。计算机实现的方法包括：估计联接表中的数据的值分布，其中所连接的表是数据库系统的两个表的实例之间的连接操作的结果。计算机实现的方法还包括确定用于划分估计值分布的属性的至少一个范围的边界，其中用于划分估计值分布的至少一个属性范围的边界对应于所连接的表的相同数量的行。计算机实现的方法还包括确定至少一个属性范围的所确定的分区的至少一个分配到数据库系统的处理单元。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类