DETERMINING LINEAGE INFORMATION FOR DATA RECORDS

    公开(公告)号:US20230091775A1

    公开(公告)日:2023-03-23

    申请号:US17479849

    申请日:2021-09-20

    Abstract: A computer-based system may be configured to collect metadata for each source and target defined for a data pipeline and formatting information (e.g., schemas, transformations, etc.) associated with each entity and field. During the definition of the pipeline, how the data will end up in the target may be defined, for example, by a user of the computer-based system via a GUI/interface and/or the like. Information (e.g., modification information, etc.) describing how the data will end up in the target may be defined, stored, and accessed to determine and/or track over which fields and entities are affected by the user-defined mutations and over which schemas. Lineage information (e.g., a genealogical tree, data lineage tracing, etc.) describing a data, version, and transformation may be generated and used to determine a source for a data record, how changes to the data record are related, how lineage evolved, and/or the like.

Patent Agency Ranking