-
公开(公告)号:US11487776B2
公开(公告)日:2022-11-01
申请号:US16801201
申请日:2020-02-26
发明人: Michal Bodziony , Lukasz Stanislaw Studzienny , Andrzej Laskawiec , Monika Piatek , Marcin Luczynski , Marcin Filip , Tomasz Zatorski
摘要: Managing ETL (extract-transform-load) operation execution by receiving an updated ETL (extract-transform-load) job flow, determining affected stages of a current ETL job flow, sending a message to stop data flow for all stages of the current ETL job flow, sending a switch message along the current ETL job flow through the affected stages, receiving notice of receipt of the switch message downstream from the affected stages of the current ETL job flow, moving an affected stage of the current ETL job flow from a first ETL section to a second ETL section; and resuming data flow according to the updated ETL job flow.
-
公开(公告)号:US11157645B2
公开(公告)日:2021-10-26
申请号:US16177810
申请日:2018-11-01
发明人: Monika Piatek , Michal Bodziony , Marcin Filip , Andrzej Laskawiec , Marcin Luczynski , Lukasz S. Studzienny , Tomasz Zatorski
IPC分类号: G06F21/62
摘要: A method of masking data includes disabling, by one or more processors of a computer system, operation of a data structure that includes at least one data index. The method includes masking, by the one or more processors of the computer system, the data structure in a table with an isomorphic function, and enabling, by the one or more processors of the computer system, operation of the data structure.
-
公开(公告)号:US11995047B2
公开(公告)日:2024-05-28
申请号:US16846078
申请日:2020-04-10
发明人: Marcin Filip , Michal Bodziony , Marcin Luczynski , Andrzej Laskawiec , Lukasz Stanislaw Studzienny , Monika Piatek , Tomasz Zatorski
IPC分类号: G06F16/21 , G06F16/245 , G06F16/27 , G06F16/28 , G06F21/62
CPC分类号: G06F16/213 , G06F16/245 , G06F16/27 , G06F16/289 , G06F21/6227
摘要: Methods, computer program products, and systems are presented. The method computer program products, and systems can include, for instance: running an application on one or more computing node, the application providing user access to a database; receiving by the application registration data from a user for registering the user into a service; in response to the receiving the registration data from the user, generating a user schema that defines access rights of the user to the database, wherein the generated user schema includes a set of shared data structure objects instantiated in a storage system associated to the application, and a set of private data structure objects.
-
4.
公开(公告)号:US11036698B2
公开(公告)日:2021-06-15
申请号:US16211902
申请日:2018-12-06
发明人: Andrzej Laskawiec , Michal Bodziony , Lukasz S. Studzienny , Marcin Luczynski , Marcin Filip , Monika Piatek , Tomasz Zatorski
摘要: Methods and systems for moving a dataset from a relational database to a non-relational database are disclosed. A method includes: copying, by a computing device, source datafiles from a relational database to each of a plurality of region servers associated with a non-relational database; generating, by the computing device, at least one table in the non-relational database from the copied source datafiles; and accessing, by the computing device, the at least one table in the non-relational database by reading raw data stored in a relational database format from the copied source datafiles on one of the plurality of region servers associated with the non-relational database.
-
公开(公告)号:US20210263944A1
公开(公告)日:2021-08-26
申请号:US16801201
申请日:2020-02-26
发明人: Michal Bodziony , Lukasz Stanislaw Studzienny , Andrzej Laskawiec , Monika Piatek , Marcin Luczynski , Marcin Filip , Tomasz Zatorski
摘要: Managing ETL (extract-transform-load) operation execution by receiving an updated ETL (extract-transform-load) job flow, determining affected stages of a current ETL job flow, sending a message to stop data flow for all stages of the current ETL job flow, sending a switch message along the current ETL job flow through the affected stages, receiving notice of receipt of the switch message downstream from the affected stages of the current ETL job flow, moving an affected stage of the current ETL job flow from a first ETL section to a second ETL section; and resuming data flow according to the updated ETL job flow.
-
公开(公告)号:US11030212B2
公开(公告)日:2021-06-08
申请号:US16123568
申请日:2018-09-06
发明人: Michal Bodziony , Marcin Filip , Monika Piatek , Lukasz S. Studzienny , Andrzej Laskawiec , Tomasz Zatorski , Marcin Luczynski
IPC分类号: G06F16/25 , G06F16/248 , G06F16/22 , G06F21/62
摘要: A computer-implemented method includes: receiving, by a computer device, a database query of a database from a database client; determining, by the computer device and based on the database query, a to-be-masked portion of data; sending, by the computer device, the database query to a federated database system for the to-be-masked portion of the data; receiving, by the computer device and in response to the sending, masked resultant data from the federated database system; and forwarding, by the computer device, the masked resultant data to the database client. The to-be-masked portion of the data is masked in at least one view of the database residing on the federated database system.
-
公开(公告)号:US11093642B2
公开(公告)日:2021-08-17
申请号:US16238913
申请日:2019-01-03
发明人: Michal Bodziony , Lukasz S. Studzienny , Monika Piatek , Marcin Filip , Andrzej Laskawiec , Marcin Luczynski , Tomasz Zatorski
IPC分类号: H04L29/06 , G06F21/62 , G06F16/27 , G06F16/9535
摘要: A method, computer program product, and a system where a processor(s) intercepts a query from a client targeting a database. The processor(s) analyzes the query to determine contextual data relevant to the query and determines, based on a data masking policy, if a portion of results responsive to the intercepted query should be provided to the client as masked data. When the processor(s) determines that the portion of the results should be provided to the client as masked data, the processor(s) determines computing resources managing data comprising the results responsive to the intercepted query. The processor(s) executes the query on the computing resources by obtaining the masked data via coprocessors deployed on the computing resources, where the coprocessors mask the portion of the results. The processor(s) returns query results comprising the portion of the results masked by the coprocessors, to the client.
-
公开(公告)号:US20210089508A1
公开(公告)日:2021-03-25
申请号:US16580533
申请日:2019-09-24
发明人: Andrzej Laskawiec , Michal Bodziony , Marcin Filip , Lukasz Stanislaw Studzienny , Tomasz Zatorski , Marcin Luczynski , Monika Piatek
IPC分类号: G06F16/22 , G06F16/215 , G06F16/23 , G06F11/34
摘要: Aspects of the present invention disclose a method for ordering rows of a table based on access statistics. The method includes one or more processors adding a respective tracking mechanism to the attribute of each respective instance of the set of rows of the table of the database. The method further includes adding a tracking mechanism to the respective instances of the set of rows. The method further includes gathering access data of the respective tracking mechanism of the attribute of each respective instance of the set of rows. The method further includes determining each respective instance of an attribute value that corresponds to the access data of the respective tracking mechanism of each respective instance of the set of rows. The method further includes ordering each respective instance of the set of rows of the table based at least in part on the respective instances of the attribute values.
-
公开(公告)号:US10922312B2
公开(公告)日:2021-02-16
申请号:US16139472
申请日:2018-09-24
发明人: Andrzej Laskawiec , Michal Bodziony , Marcin Filip , Lukasz S. Studzienny , Marcin Luczynski , Monika Piatek , Tomasz Zatorski
IPC分类号: G06F16/2453 , G06F16/25 , G06F16/22
摘要: Optimization of job execution includes generating a hash tree for an initial dataset, executing a data processing job against the initial dataset, the job including a set of calculations against data of the records of the initial dataset, generating, based on a modification to the initial dataset to produce a modified dataset, a hash tree for the modified dataset, comparing the hash tree for the initial dataset to the hash tree for the modified dataset, and identifying one or more blocks of the modified dataset that are updated compared to the initial dataset, and re-executing the job against the modified dataset. The re-executing performs only those calculations that use data of the records of the one or more blocks that are updated compared to the initial dataset, and does not perform other calculations that do not use data of the records of the one or more blocks.
-
公开(公告)号:US11604776B2
公开(公告)日:2023-03-14
申请号:US16839200
申请日:2020-04-03
发明人: Andrzej Laskawiec , Monika Piatek , Lukasz Stanislaw Studzienny , Marcin Filip , Marcin Luczynski , Michal Bodziony , Tomasz Zatorski
IPC分类号: G06F16/00 , G06F16/215 , G06F16/28 , G06F16/242
摘要: A computer-implemented method for unambiguously identifying entities in a database system may be provided. The method comprises storing data items as records with different attributes in a table of a database, storing naming rules for selected combinations of the attributes of the data items, and prioritizing the naming rules. The method also comprises determining a hash value for each of the selected combinations of the attributes of the data items, and identifying duplicate data items using the determined hash values and the prioritized naming rules.
-
-
-
-
-
-
-
-
-