FILTERING DATA LINEAGE DIAGRAMS
    2.
    发明申请
    FILTERING DATA LINEAGE DIAGRAMS 审中-公开
    过滤数据线图

    公开(公告)号:WO2016130615A1

    公开(公告)日:2016-08-18

    申请号:PCT/US2016/017246

    申请日:2016-02-10

    CPC classification number: G06F17/30601 G06F17/30958

    Abstract: Managing lineage information includes processing a request for a representation of data lineage for a first node of a number of nodes (102, 104, 106). The processing includes determining an association between the first node and at least a first tag identifier of a number of tag identifiers, and determining a first subset of at least one and fewer than all of a number of possible tag values for the first tag identifier, and traversing nodes along a first lineage path of directed links from the first node to determine a data lineage for the first node. Determining the data lineage includes, for each traversed node (350) determining whether to add (356) the traversed node to the data lineage or to exclude (360) the traversed node from the data lineage based at least in part on any tag identifiers or tag values associated with the traversed node.

    Abstract translation: 管理谱系信息包括处理对多个节点(102,104,106)的第一节点的数据谱系的表示的请求。 所述处理包括确定所述第一节点与多个标签标识符的至少第一标签标识符之间的关联,以及确定所述第一标签标识符的至少一个且少于所有可能的标签值的第一子集, 以及沿着来自第一节点的定向链路的第一沿袭路径遍历节点以确定第一节点的数据沿袭。 确定数据谱系对于每个经过的节点(350)包括确定是否将所遍历的节点(356)添加到数据谱系中,或者至少部分地基于任何标签标识符来排除(360)所遍历的节点与数据谱系 与所遍历的节点相关联的标签值。

    TECHNIQUES FOR MANAGING DATA IN A DATA PROCESSING SYSTEM USING DATA ENTITIES AND INHERITANCE

    公开(公告)号:WO2022165123A1

    公开(公告)日:2022-08-04

    申请号:PCT/US2022/014232

    申请日:2022-01-28

    Abstract: Techniques for storing data entities by a data processing system are described herein. The data processing system may store a plurality of data entity instances generated using a plurality of data entities. The plurality of data entity instances may include a first data entity instance generated using a first data entity and a second data entity instance generated using a second data entity. The first data entity instance may include a first attribute that is configured to inherit its value from a second attribute of the second data entity instance. The data processing system may provide the inherited value of the second attribute of the second data entity instance as the value of the first attribute of the first data entity instance.

    SYSTEMS AND METHODS FOR DETERMINING RELATIONSHIPS AMONG DATA ELEMENTS
    4.
    发明申请
    SYSTEMS AND METHODS FOR DETERMINING RELATIONSHIPS AMONG DATA ELEMENTS 审中-公开
    确定数据元素之间关系的系统和方法

    公开(公告)号:WO2018089633A1

    公开(公告)日:2018-05-17

    申请号:PCT/US2017/060860

    申请日:2017-11-09

    Abstract: A data processing system configured to perform: obtaining a first data lineage representing relationships among physical data elements, the first data lineage being generated at least in part by performing at least one of: (a) analyzing source code of at least one computer program configured to access the physical data elements; and (b) analyzing information obtained during runtime of the at least one computer program; obtaining, based on user input, a second data lineage representing relationships among business data elements; obtaining an association between at least some of the physical data elements of the first data lineage and at least some of the business data elements of the second data lineage; and generating, based on the association between the physical data elements and the business data elements, an indication of agreement or discrepancy between the first data lineage and the second data lineage.

    Abstract translation: 数据处理系统,被配置为执行:获得表示物理数据元素之间的关系的第一数据沿袭,所述第一数据沿袭至少部分通过执行以下中的至少一个来生成:(a)分析源 至少一个计算机程序的代码,其被配置为访问所述物理数据元素; 和(b)分析在所述至少一个计算机程序的运行时期间获得的信息; 基于用户输入获得表示业务数据元素之间的关系的第二数据沿袭; 获得第一数据沿袭的至少一些物理数据元素与第二数据沿袭的至少一些商业数据元素之间的关联; 以及基于物理数据元素与商业数据元素之间的关联,生成第一数据系列与第二数据系列之间的一致或差异的指示。

    FINITE STATE MACHINES FOR IMPLEMENTING WORKFLOWS FOR DATA OBJECTS MANAGED BY A DATA PROCESSING SYSTEM

    公开(公告)号:WO2020154400A1

    公开(公告)日:2020-07-30

    申请号:PCT/US2020/014607

    申请日:2020-01-22

    Abstract: Techniques for using finite state machines (FSMs) to implement workflows in a data processing system comprising at least one data store storing data objects and a workflow management system (WMS). The WMS is configured to perform: determining a current value of an attribute of a first data object by accessing the current value in the at least one data store; identifying, using the current value and metadata specifying relationships among at least some of the data objects, an actor authorized to perform a workflow task for the first data object; generating a GUI through which the actor can provide the input that the workflow task is to be performed; and in response to receiving, from the actor and through the GUI, input specifying that the workflow task is to be performed: performing the workflow task; and updating the current workflow state of the first FSM to a second workflow state.

    SYSTEM FOR METADATA MANAGEMENT
    6.
    发明申请
    SYSTEM FOR METADATA MANAGEMENT 审中-公开
    元数据管理系统

    公开(公告)号:WO2014151631A1

    公开(公告)日:2014-09-25

    申请号:PCT/US2014/026133

    申请日:2014-03-13

    CPC classification number: G06F17/30994 G06F17/30309

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for metadata management. One of the methods includes receiving user input selecting a first node. The method includes receiving a first data lineage of a first object, the first object having a type, the first data lineage describing relationships between the first object and one or more datasets or transforms. The method includes receiving user input selecting a second node. The method includes receiving a second data lineage of a second object, the second object having the same type as the first object. The method includes performing a comparison of the first node and the first data lineage to the second node and the second data lineage. The method includes generating a report based on the comparison.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于元数据管理。 方法之一包括接收选择第一节点的用户输入。 该方法包括接收第一对象的第一数据谱系,第一对象具有类型,描述第一对象与一个或多个数据集或变换之间的关系的第一数据谱系。 该方法包括接收选择第二节点的用户输入。 所述方法包括接收第二对象的第二数据谱系,所述第二对象具有与所述第一对象相同的类型。 该方法包括执行第一节点和第一数据谱系与第二节点和第二数据谱系的比较。 该方法包括基于比较生成报告。

    METADATA-DRIVEN DATA INGESTION
    7.
    发明申请

    公开(公告)号:WO2023044445A1

    公开(公告)日:2023-03-23

    申请号:PCT/US2022/076595

    申请日:2022-09-16

    Abstract: An electronic system for increasing the speed of preparing data with a specified data quality for storage by automatically identifying for a user, with minimal user input, common contexts among (i) fields in disparate datasets, and (ii) names the user has specified as potentially describing the fields, and by using those common contexts to govern the disparate datasets prior to storage to ensure the specified data quality.

    GENERATING, ACCESSING, AND DISPLAYING LINEAGE METADATA

    公开(公告)号:WO2018102691A1

    公开(公告)日:2018-06-07

    申请号:PCT/US2017/064227

    申请日:2017-12-01

    Abstract: Among other things, we describe a method of receiving a portion of metadata from a data source, the portion of metadata describing nodes and edges; generating instances of a data structure representing the portion of metadata, at least one instance of the data structure including an identification value that identifies a corresponding node, one or more property values representing respective properties of the corresponding node, and one or more pointers to respective identification values, each pointer representing an edge associated with a node identified by the corresponding respective identification value; storing the instances of the data structure in random access memory; receiving a query that includes an identification of at least one particular element of data; and using at least one instance of the data structure to cause a display of a computer system to display a representation of lineage of the particular element of data.

    BUILDING REPORTS
    9.
    发明申请
    BUILDING REPORTS 审中-公开
    建筑报告

    公开(公告)号:WO2016100641A1

    公开(公告)日:2016-06-23

    申请号:PCT/US2015/066335

    申请日:2015-12-17

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for building reports. One of the methods includes creating a model based on relational structured data, the structured data including data structures, each data structure having data elements, each data element having fields, each field having a name. The method includes generating a hierarchy of objects in model, the hierarchy organizing objects the with respect to a starting object according to relationship fields on the objects. The method includes generating a user interface including elements for one or more of the objects in the hierarchy, wherein the user interface enables a user to create a report and filter the report using the new name. The method includes receiving a user selection of an element from the elements. The method also includes generating a report.

    Abstract translation: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于构建报告。 方法之一包括基于关系结构化数据创建模型,结构化数据包括数据结构,每个数据结构都具有数据元素,每个数据元素都有字段,每个字段都有一个名称。 该方法包括在模型中生成对象的层次结构,根据对象上的关系字段,层次结构相对于起始对象组织对象。 该方法包括生成包括用于层次结构中的一个或多个对象的元素的用户界面,其中用户界面使得用户能够创建报告并使用新名称过滤报告。 该方法包括从元素接收用户对元素的选择。 该方法还包括生成报告。

    FILTERING DATA LINEAGE DIAGRAMS
    10.
    发明申请
    FILTERING DATA LINEAGE DIAGRAMS 审中-公开
    过滤数据线图

    公开(公告)号:WO2016130626A1

    公开(公告)日:2016-08-18

    申请号:PCT/US2016/017263

    申请日:2016-02-10

    Abstract: Managing lineage information includes processing a specification of a directed graph to associate nodes (102, 104, 106) with information for processing requests for a representation of data lineage. The processing includes: identifying a first set of one or more nodes (1362, 1364, 1366) of the directed graph corresponding to normalizing data elements being stored in a data store and de-normalizing data elements being retrieved from the data store; and associating a first plurality of nodes (1370, 1372, 1374) connected to the first set of one or more nodes and a second plurality of nodes (1376, 1378, 1380) connected to the first set of one or more nodes with at least one tag identifier having a plurality of possible tag values, where the number of possible tag values is at least as large as the number of data elements being normalized, and where nodes representing different data elements in a de-normalized record are associated with different values of the tag identifier.

    Abstract translation: 管理谱系信息包括处理有向图的规范以将节点(102,104,106)与用于处理对数据谱系的表示的请求的信息相关联。 该处理包括:识别对应于正在存储在数据存储器中的数据元素归一化的有向图的一个或多个节点(1362,1364,1366)的第一集合,并且对从数据存储器检索的数据元素进行去规范化; 以及将连接到所述第一组一个或多个节点的第一多个节点(1370,1372,1374)和连接到所述第一组一个或多个节点的第二多个节点(1376,1378,1380)相关联,至少 一个标签标识符具有多个可能的标签值,其中可能的标签值的数量至少与被归一化的数据元素的数量一样大,并且其中表示去归一化记录中的不同数据元素的节点与不同的值相关联 的标签标识符。

Patent Agency Ranking