OPERATIONALIZING METADATA
    4.
    发明公开

    公开(公告)号:US20240070163A1

    公开(公告)日:2024-02-29

    申请号:US18104066

    申请日:2023-01-31

    CPC classification number: G06F16/254 G06F16/26 G06F16/9024

    Abstract: A method for using a metadata model to perform operations on data items, with the metadata model including parent nodes and child nodes connected by edges, with the parent nodes specifying logical metadata and the child nodes specifying physical metadata representing the data items, and with the edges specifying relationships between the nodes. The method includes: identifying a given data item and physical metadata of that given data item, accessing the metadata model, identifying, in the metadata model, a child node representing the physical metadata of the given data item, traversing one or more edges in the metadata model to identify parent nodes of the child node, determining, from logical metadata associated with the identified parent nodes, one or more operations to be performed on the given data item, applying the one or more operations to the given data item to transform the data item, and storing the transformed data item.

    DATAFLOW GRAPH DATASETS
    5.
    发明公开

    公开(公告)号:US20230359668A1

    公开(公告)日:2023-11-09

    申请号:US18114212

    申请日:2023-02-24

    CPC classification number: G06F16/9024

    Abstract: Described herein are techniques, performed by a data processing system, for enabling efficient development of software application programs in a dynamic environment with multiple datasets by generating entries in a dataset catalog to provide a software application program with access to output data dynamically generated by dataflow graphs, the entries associated with respective software application programs developed as dataflow graphs. The techniques include identifying a subgraph, wherein, when the subgraph is executed, the subgraph generates output data by applying one or more data processing operations to data obtained from one or more data sources; creating, in the dataset catalog, a new entry associated with the identified subgraph, the new entry associated with information indicating nodes, links, and configuration parameters of the identified subgraph; and configuring the dataset catalog to enable access to the new entry, in the dataset catalog, associated with the identified subgraph.

    Transforming a specification into a persistent computer program

    公开(公告)号:US11423083B2

    公开(公告)日:2022-08-23

    申请号:US15795917

    申请日:2017-10-27

    Abstract: A method performed by a computer system including: accessing a specification that specifies a plurality of modules to be implemented by the computer program for processing the one or more values of the one or more fields in the structured data item; transforming the specification into the computer program that implements the plurality of modules, wherein the transforming includes: for each of one or more first modules of the plurality of modules: identifying one or more second modules of the plurality of modules that each receive input that is at least partly based on an output of the first module; and formatting an output data format of the first module such that the first module outputs only one or more values of one or more fields of the structured data item.

    Logical Access for Previewing Expanded View Datasets

    公开(公告)号:US20240320224A1

    公开(公告)日:2024-09-26

    申请号:US18492904

    申请日:2023-10-24

    CPC classification number: G06F16/24568 G06F16/24542 G06F16/2457

    Abstract: A method implemented by a data processing system for: enabling a user to preview attributes of fields of an expanded view of a base dataset and to specify one or more of the fields to use in downstream data processing and generating a dataset that includes the one or more of the fields from the preview specified to be used in the downstream data processing, with the generated dataset having increased efficiency with respect to speed and data memory, relative to an efficiency of generating a dataset including all the fields of the expanded view when only the specified one or more of the fields are used in the downstream data processing.

    TRANSFORMING A SPECIFICATION INTO A PERSISTENT COMPUTER PROGRAM

    公开(公告)号:US20220342935A1

    公开(公告)日:2022-10-27

    申请号:US17858605

    申请日:2022-07-06

    Abstract: A method performed by a computer system including: accessing a specification that specifies a plurality of modules to be implemented by the computer program for processing the one or more values of the one or more fields in the structured data item; transforming the specification into the computer program that implements the plurality of modules, wherein the transforming includes: for each of one or more first modules of the plurality of modules: identifying one or more second modules of the plurality of modules that each receive input that is at least partly based on an output of the first module; and formatting an output data format of the first module such that the first module outputs only one or more values of one or more fields of the structured data item.

    GENERATION OF OPTIMIZED LOGIC FROM A SCHEMA

    公开(公告)号:US20220147529A1

    公开(公告)日:2022-05-12

    申请号:US17558097

    申请日:2021-12-21

    Abstract: A method includes accessing a schema that specifies relationships among datasets, computations on the datasets, or transformations of the datasets, selecting a dataset from among the datasets, and identifying, from the schema, other datasets that are related to the selected dataset. Attributes of the datasets are identified, and logical data representing the identified attributes and relationships among the attributes is generated. The logical data is provided to a development environment, which provides access to portions of the logical data representing the identified attributes. A specification that specifies at least one of the identified attributes in performing an operation is received from the development environment. Based on the specification and the relationships among the identified attributes represented by the logical data, a computer program is generated to perform the operation by accessing, from storage, at least one dataset having the at least one of the attributes specified in the specification.

Patent Agency Ranking