-
Publication No.: US20240346051A1
Publication Date: 2024-10-17
Application No.: US18496543
Filing Date: 2023-10-27
Inventors: Marshall A. Isman, Adam Weiss, Jonah Egenolf, Robert Parks, John MacLean, Richard Mellon, Dusan Radivojevic, Paul Veiser, Mazin Woodrow Khader
CPC Classification: G06F16/288, G06F3/048, G06F9/4494, G06F9/451
Abstract: A method implemented by a data processing system for enabling a system to pipeline or otherwise process data in conformance with specified criteria by providing a graphical user interface for selecting data to be processed, determining metadata of selected data, and, based on the metadata, automatically processing the selected data in conformance with the specified criteria.
-
Publication No.: US20240346019A1
Publication Date: 2024-10-17
Application No.: US18415067
Filing Date: 2024-01-17
IPC Classification: G06F16/242, G06F9/54, G06F16/248
CPC Classification: G06F16/2425, G06F9/54, G06F16/248
Abstract: Techniques for obtaining information about data entity instances managed by a data processing system using at least one data store. The techniques include obtaining a query comprising a first portion comprising information for identifying instances of a first data entity stored in at least one data store; and a second portion indicating at least one attribute of the first data entity; generating, from the query, a plurality of executable queries including a first set of one or more executable queries and a second set of one or more executable queries, the generating comprising: generating, using the first portion, the first set of executable queries for identifying instances of the first data entity, and generating, using the second portion, the second set of executable queries for obtaining attribute values for instances of the first data entity; and executing the plurality of executable queries to obtain results for the query.
-
Publication No.: US20240265028A1
Publication Date: 2024-08-08
Application No.: US18486300
Filing Date: 2023-10-13
CPC Classification: G06F16/258, H04L67/55
Abstract: A method for performing real-time segmentation by updating a wide record based on receipt of real-time data, wherein an item of real-time data represents a transaction, detecting that the updated wide record satisfies criteria for performing real-time segmentation, and performing real-time segmentation on the updated, wide record, wherein real-time is relative to when a transaction represented in the updated wide record occurs.
-
Publication No.: US12038921B2
Publication Date: 2024-07-16
Application No.: US17012633
Filing Date: 2020-09-04
Inventors: Garth Dickie, Ian Schechter
IPC Classification: G06F16/2452, G06F16/242, G06F16/2455
CPC Classification: G06F16/24526, G06F16/2448, G06F16/24553, G06F16/24564
Abstract: A method includes executing a program that specifies operations and accessing a translation file that includes instructions for translating the language of the program into a language of a database. The translation file specifies operations in the language of the program that are supported by the database and the semantic meaning of the supported operations in the language of the database. Operations of the program that are unsupported by the database are processed by the program. Operations of the program that are supported by the database are determined from the translation file, and a portion of the program representing the supported operations is translated, using the translation file, into the language of the database and transmitted to the database. Data resulting from execution, within the database, of the translated portion of the program representing the operations that are supported by the database is received by the program.
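As a rough illustration of the pushdown idea in this abstract, the sketch below translates the leading supported operations of a small pipeline into SQL and runs the rest in the program. It is not the patented implementation: the `TRANSLATION` table stands in for the translation file, and the operation names (`filter_gt`, `project`, `upper_first`), the helper functions, and the use of SQLite are all assumptions made for the example.

```python
import sqlite3

# Hypothetical "translation file": program operation -> SQL template.
# Only operations listed here are treated as supported by the database.
# Templates are filled by plain string formatting, which is acceptable for
# an illustration but unsafe with untrusted input.
TRANSLATION = {
    "filter_gt": "SELECT * FROM ({src}) WHERE {col} > {val}",
    "project": "SELECT {cols} FROM ({src})",
}

# Hypothetical operations the database does not support; the program runs them.
LOCAL_OPS = {
    "upper_first": lambda rows, _args: [(str(r[0]).upper(),) + tuple(r[1:]) for r in rows],
}


def split_and_translate(table, ops):
    """Translate the leading run of supported ops into a single SQL string.

    Returns (sql, remaining_ops). Translation stops at the first unsupported
    op so that the original order of operations is preserved.
    """
    sql = f"SELECT * FROM {table}"
    for i, (name, args) in enumerate(ops):
        template = TRANSLATION.get(name)
        if template is None:
            return sql, ops[i:]
        sql = template.format(src=sql, **args)
    return sql, []


def run(conn, table, ops):
    """Send the translated portion to the database, then finish locally."""
    sql, remaining = split_and_translate(table, ops)
    rows = conn.execute(sql).fetchall()
    for name, args in remaining:
        rows = LOCAL_OPS[name](rows, args)
    return sql, rows
```

In this sketch, every operation after the first unsupported one runs locally, even if the table could translate it. A fuller design might reorder operations when that is semantically safe.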
-
Publication No.: US12032631B2
Publication Date: 2024-07-09
Application No.: US15993284
Filing Date: 2018-05-30
Inventors: Garth Allen Dickie
IPC Classification: G06F16/901, G06F16/23, G06F16/242
CPC Classification: G06F16/9024, G06F16/2379, G06F16/2433
Abstract: At least one non-transitory computer-readable storage medium storing processor-executable instructions that, when executed by at least one computer hardware processor, cause the at least one computer hardware processor to perform: obtaining an automatically generated initial dataflow graph, the initial dataflow graph comprising a first plurality of nodes representing a first plurality of data processing operations and a first plurality of links representing flows of data among nodes in the first plurality of nodes; and generating an updated dataflow graph by iteratively applying dataflow graph optimization rules to update the initial dataflow graph, the updated dataflow graph comprising a second plurality of nodes representing a second plurality of data processing operations and a second plurality of links representing flows of data among nodes in the second plurality of nodes.
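The iterative rule application described in this abstract can be pictured as a fixed-point loop over rewrite rules. The sketch below is a minimal illustration under assumed conventions, not the claimed method: the graph is a dict of node ids mapped to `(operation, argument)` pairs plus a set of `(source, destination)` links, and the two rules (dropping `identity` nodes and dropping a sort that directly repeats its sole predecessor's sort) are hypothetical examples.

```python
def bypass(nodes, links, nid):
    """Remove node nid and connect each of its predecessors to each successor."""
    preds = [s for s, d in links if d == nid]
    succs = [d for s, d in links if s == nid]
    new_links = {(s, d) for s, d in links if nid not in (s, d)}
    new_links |= {(p, q) for p in preds for q in succs}
    new_nodes = {k: v for k, v in nodes.items() if k != nid}
    return new_nodes, new_links


def remove_identity(nodes, links):
    """Rule: a node whose operation is 'identity' does no work and can be dropped."""
    for nid, (op, _arg) in nodes.items():
        if op == "identity":
            return bypass(nodes, links, nid)
    return None  # rule does not apply


def remove_redundant_sort(nodes, links):
    """Rule: a sort fed only by an identical sort is redundant."""
    for s, d in links:
        preds_of_d = [p for p, q in links if q == d]
        if len(preds_of_d) == 1 and nodes[s][0] == "sort" and nodes[s] == nodes[d]:
            return bypass(nodes, links, d)
    return None  # rule does not apply


def optimize(nodes, links, rules):
    """Apply the rules repeatedly until a full pass changes nothing."""
    changed = True
    while changed:
        changed = False
        for rule in rules:
            result = rule(nodes, links)
            if result is not None:
                nodes, links = result
                changed = True
    return nodes, links
```

Each rule here removes exactly one node when it fires, so the loop terminates. Note that removing an `identity` node can expose two adjacent sorts that only the second rule can then simplify, which is why the rules are applied iteratively rather than once.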
-
Publication No.: US11921710B2
Publication Date: 2024-03-05
Application No.: US17587181
Filing Date: 2022-01-28
IPC Classification: G06F16/242, G06F9/54, G06F16/248
CPC Classification: G06F16/2425, G06F9/54, G06F16/248
Abstract: Techniques for obtaining information about data entity instances managed by a data processing system using at least one data store. The techniques include obtaining a query comprising a first portion comprising information for identifying instances of a first data entity stored in at least one data store; and a second portion indicating at least one attribute of the first data entity; generating, from the query, a plurality of executable queries including a first set of one or more executable queries and a second set of one or more executable queries, the generating comprising: generating, using the first portion, the first set of executable queries for identifying instances of the first data entity, and generating, using the second portion, the second set of executable queries for obtaining attribute values for instances of the first data entity; and executing the plurality of executable queries to obtain results for the query.
-
Publication No.: US11893036B2
Publication Date: 2024-02-06
Application No.: US16517320
Filing Date: 2019-07-19
Inventors: Andrew Blom, Darren Miller, Marshall A. Isman
IPC Classification: G06F7/00, G06F17/00, G06F16/25, G06F16/901, G06F8/34, H04L67/565
CPC Classification: G06F16/254, G06F8/34, G06F16/258, G06F16/9024, H04L67/565
Abstract: A method for generating an executable application to transform and load data into a structured dataset includes receiving a metadata file that specifies values for parameters for structuring data feeds, received from a networked data source, into a structured database. The metadata file specifies logical rules for transforming the data feeds. The values of the parameters and the logical rules for transforming the plurality of the data feeds are validated to ensure logical consistency for each data feed. Data rules are generated that specify standards for transforming each data feed in accordance with the validated values of the parameters and logical rules. The executable application is generated that is configured to receive source data comprising a data feed from one or more data sources and transform the source data into structured data that satisfies the one or more standards for the structured data record in compliance with the data rules.
-
Publication No.: US20230281022A1
Publication Date: 2023-09-07
Application No.: US18316333
Filing Date: 2023-05-12
Inventors: Oded Ravid, Trevor Murphy
IPC Classification: G06F9/448, G06F16/901, G06F16/2455, G06F16/178, G06F9/445
CPC Classification: G06F9/4494, G06F16/9024, G06F16/24568, G06F16/1794, G06F9/44505, G06F8/433
Abstract: A method is described for processing keyed data items that are each associated with a value of a key, the keyed data items being from a plurality of distinct data streams, the processing including collecting the keyed data items, determining, based on contents of at least one of the keyed data items, satisfaction of one or more specified conditions for execution of one or more actions and causing execution of at least one of the one or more actions responsive to the determining.
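One simple way to picture the collect, check, and act flow in this abstract is shown below. It is only a sketch under stated assumptions: items arrive as `(stream, key, payload)` tuples already merged into a single arrival order, each key triggers the action at most once, and the function names and the example condition are hypothetical rather than taken from the patent.

```python
from collections import defaultdict


def process_keyed_items(items, condition, action):
    """Collect items per key and run the action once a key's condition holds.

    items: iterable of (stream_name, key, payload) tuples in arrival order.
    condition: callable taking the entries collected so far for one key.
    action: callable taking (key, entries); its return value is recorded.
    """
    collected = defaultdict(list)  # key -> [(stream_name, payload), ...]
    done = set()                   # keys whose action has already run
    results = []
    for stream, key, payload in items:
        collected[key].append((stream, payload))
        if key not in done and condition(collected[key]):
            results.append(action(key, collected[key]))
            done.add(key)
    return results


def has_order_and_payment(entries):
    """Example condition: the key has been seen on both assumed streams."""
    return {"orders", "payments"} <= {stream for stream, _payload in entries}
```

For example, calling `process_keyed_items` with `has_order_and_payment` and an action that returns a shipping instruction would fire only for keys that appear on both the `orders` and `payments` streams.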
-
Publication No.: US20230112179A1
Publication Date: 2023-04-13
Application No.: US17704469
Filing Date: 2022-03-25
Inventors: John Joyce, Marshall A. Isman, Edward Alan Bach
Abstract: A method for updating a computer program includes receiving a computer program hosted on and configured to be executed by a first computing system. The method includes analyzing the computer program to obtain characterization of a lineage, an architecture, and an operation of the computer program. The lineage includes relationships among elements of the computer program, the architecture includes a characteristic of the data source, the data target, and one or more processors configured to process the data contained in data records, and the operation includes processes that are executed to process the data from the data records. The method includes receiving a characterization of an update to be made to the computer program, in which when the computer program is modified according to the update, at least some of the modified computer program is configured to be hosted on and executed by a second computing system; and modifying the computer program to implement the update to generate the modified computer program.
-
Publication No.: US20220391257A1
Publication Date: 2022-12-08
Application No.: US17831978
Filing Date: 2022-06-03
Inventors: Stephen J. Schmidt, David Ricardo, Craig W. Stanfill, Jacky Cheuk Kei Chu, Ephraim Meriwether Vishniac
IPC Classification: G06F9/50
Abstract: A method for performing a distributed computation on a computing system using computational resources dynamically allocated using a computational resource manager includes storing information specifying quantities of computational resources associated with respective ones of a number of program portions of the program, where the program portions perform successive transformations of data and each program portion uses computational resources granted by the computational resource manager enabling computation associated with that program portion to be performed in the computing system, requesting a first quantity of computational resources associated with a first program portion of the number of program portions from the computational resource manager, receiving a second quantity of computational resources from the computational resource manager, less than the requested first quantity of computational resources, performing computation associated with the first portion of the program using the second quantity of computational resources, while performing the computation associated with the first portion of the program using the second quantity of computational resources, receiving an additional quantity of computational resources from the computational resource manager, and performing an additional computation associated with the first portion of the program using the additional quantity of computational resources while performing the computation associated with the first portion using the second quantity of computational resources.
-