SYSTEMS AND METHODS FOR ACCESSING DATA ENTITIES MANAGED BY A DATA PROCESSING SYSTEM

    公开(公告)号:US20240346019A1

    公开(公告)日:2024-10-17

    申请号:US18415067

    申请日:2024-01-17

    摘要: Techniques for obtaining information about data entity instances managed by a data processing system using at least one data store. The techniques include obtaining a query comprising a first portion comprising information for identifying instances of a first data entity stored in at least one data store; and a second portion indicating at least one attribute of the first data entity; generating, from the query, a plurality of executable queries including a first set of one or more executable queries and a second set of one or more executable queries, the generating comprising: generating, using the first portion, the first set of executable queries for identifying instances of the first data entity, and generating, using the second portion, the second set of executable queries for obtaining attribute values for instances of the first data entity; and executing the plurality of executable queries to obtain results for the query.

    Transforming operations of a computer program for execution at a database

    公开(公告)号:US12038921B2

    公开(公告)日:2024-07-16

    申请号:US17012633

    申请日:2020-09-04

    摘要: A method includes executing a program that specifies operations and accessing a translation file that includes instructions for translating the language of the program into a language of a database. The translation file specifies operations in the language of the program that are supported by the database and the semantic meaning of the supported operations in the language of the database. Operations of the program that are unsupported by the database are processed by the program. Operations of the program that are supported by the database are determined from the translation file, and a portion of the program representing the supported operations is translated, using the translation file, into the language of the database and transmitted to the database. Data resulting from execution, within the database, of the translated portion of the program representing the operations that are supported by the database is received by the program.

    Systems and methods for dataflow graph optimization

    公开(公告)号:US12032631B2

    公开(公告)日:2024-07-09

    申请号:US15993284

    申请日:2018-05-30

    摘要: At least one non-transitory computer-readable storage medium storing processor-executable instructions that, when executed by at least one computer hardware processor, cause the at least one computer hardware processor to perform: obtaining an automatically generated initial dataflow graph, the initial dataflow graph comprising a first plurality of nodes representing a first plurality of data processing operations and a first plurality of links representing flows of data among nodes in the first plurality of nodes; and generating an updated dataflow graph by iteratively applying dataflow graph optimization rules to update the initial dataflow graph, the updated dataflow graph comprising a second plurality of nodes representing a second plurality of data processing operations and a second plurality of links representing flows of data among nodes in the second plurality of nodes.

    Systems and methods for accessing data entities managed by a data processing system

    公开(公告)号:US11921710B2

    公开(公告)日:2024-03-05

    申请号:US17587181

    申请日:2022-01-28

    摘要: Techniques for obtaining information about data entity instances managed by a data processing system using at least one data store. The techniques include obtaining a query comprising a first portion comprising information for identifying instances of a first data entity stored in at least one data store; and a second portion indicating at least one attribute of the first data entity; generating, from the query, a plurality of executable queries including a first set of one or more executable queries and a second set of one or more executable queries, the generating comprising: generating, using the first portion, the first set of executable queries for identifying instances of the first data entity, and generating, using the second portion, the second set of executable queries for obtaining attribute values for instances of the first data entity; and executing the plurality of executable queries to obtain results for the query.

    Publishing to a data warehouse
    7.
    发明授权

    公开(公告)号:US11893036B2

    公开(公告)日:2024-02-06

    申请号:US16517320

    申请日:2019-07-19

    摘要: A method for generating an executable application to transform and load data into a structured dataset includes receiving a metadata file that specifies values for parameters for structuring data feeds, received from a networked data source, into a structured database. The metadata file specifies logical rules for transforming the data feeds. The values of the parameters and the logical rules for transforming the plurality of the data feeds are validated to ensure logical consistency for each data feed. Data rules are generated that specify standards for transforming each data feed in accordance with the validated values of the parameters and logical rules. The executable application is generated that is configured to receive source data comprising a data feed from one or more data sources and transform the source data into structured data that satisfies the one or more standards for the structured data record in compliance with the data rules.

    AUTOMATED MODIFICATION OF COMPUTER PROGRAMS

    公开(公告)号:US20230112179A1

    公开(公告)日:2023-04-13

    申请号:US17704469

    申请日:2022-03-25

    摘要: A method for updating a computer program includes receiving a computer program hosted on and configured to be executed by a first computing system. The method includes analyzing the computer program to obtain characterization of a lineage, an architecture, and an operation of the computer program. The lineage includes relationships among elements of the computer program, the architecture includes a characteristic of the data source, the data target, and one or more processors configured to process the data contained in data records, and the operation includes processes that are executed to process the data from the data records. The method includes receiving a characterization of an update to be made to the computer program, in which when the computer program is modified according to the update, at least some of the modified computer program is configured to be hosted on and executed by a second computing system; and modifying the computer program to implement the update to generate the modified computer program.

    Dynamic Computational Resource Allocation

    公开(公告)号:US20220391257A1

    公开(公告)日:2022-12-08

    申请号:US17831978

    申请日:2022-06-03

    IPC分类号: G06F9/50

    摘要: A method for performing a distributed computation on a computing system using computational resources dynamically allocated using a computational resource manager includes storing information specifying quantities of computational resources associated with respective ones of a number of program portions of the program, where the program portions perform successive transformations of data and each program portion uses computational resources granted by the computational resource manager enabling computation associated with that program portion to be performed in the computing system, requesting a first quantity of computational resources associated with a first program portion of the number of program portions from the computational resource manager, receiving a second quantity of computational resources from the computational resource manager, less than the requested first quantity of computational resources, performing computation associated with the first portion of the program using the second quantity of computational resources, while performing the computation associated with the first portion of the program using the second quantity of computational resources, receiving an additional quantity of computational resources from the computational resource manager, and performing an additional computation associated with the first portion of the program using the additional quantity of computational resources while performing the computation associated with the first portion using the second quantity of computational resources.