LOCALIZED LINK FORMATION TO PERFORM IMPLICITLY FEDERATED QUERIES USING EXTENDED COMPUTERIZED QUERY LANGUAGE SYNTAX

    Publication No.: US20190050459A1

    Publication date: 2019-02-14

    Application No.: US16036836

    Filing date: 2018-07-16

    IPC classification: G06F17/30

    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and more specifically, to a computing and data storage platform configured to provide one or more computerized tools that facilitate development and management of data projects, including implementation of localized link identifiers to perform implicitly federated queries using, in some examples, extended computerized query language syntax to analyze multiple tabular data arrangements in data-driven collaborative projects. For example, a method may include importing a dataset into a data project, identifying a remote link identifier associated with a remote data source at which the dataset is stored, transforming the remote dataset identifier to form data representing a link identifier, and presenting in a data project user interface the link identifier as associated with a local namespace associated with the data project to perform implicit query federation using, for example, an extended multi-table syntax.
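
The transformation step in the abstract above can be illustrated with a minimal sketch. It is not the patented implementation: the `LinkIdentifier` type, the `to_link_identifier` and `resolve` helpers, and the example URL are all hypothetical names chosen for illustration. The sketch assumes that a local name can be derived from the last path segment of the remote location, and that a query written against the local name is later resolved back to the remote source.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass(frozen=True)
class LinkIdentifier:
    """Local alias for a dataset stored elsewhere (hypothetical type)."""
    local_namespace: str  # namespace of the data project
    local_name: str       # short name shown in the project user interface
    remote_url: str       # where the dataset is actually stored


def to_link_identifier(remote_url: str, project_namespace: str) -> LinkIdentifier:
    """Derive a project-local link identifier from a remote dataset URL."""
    path = urlparse(remote_url).path.rstrip("/")
    # Assumption: the last path segment, minus any file extension,
    # serves as the local name.
    stem = path.rsplit("/", 1)[-1].rsplit(".", 1)[0]
    return LinkIdentifier(project_namespace, stem, remote_url)


def resolve(table_ref: str, links: dict[str, LinkIdentifier]) -> str:
    """Map a local table name used in a query to its remote location."""
    return links[table_ref].remote_url


# Hypothetical example URL and project namespace.
link = to_link_identifier("https://example.org/owner/sales/orders.csv", "my-project")
links = {link.local_name: link}
```

With this mapping in place, a query that names only `orders` could be rewritten to target the remote source without the user spelling out the federation, which is one plausible reading of "implicit query federation".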

    LAYERED DATA GENERATION AND DATA REMEDIATION TO FACILITATE FORMATION OF INTERRELATED DATA IN A SYSTEM OF NETWORKED COLLABORATIVE DATASETS

    Publication No.: US20190050445A1

    Publication date: 2019-02-14

    Application No.: US15927004

    Filing date: 2018-03-20

    IPC classification: G06F17/30 G06K9/62

    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby logic is configured to remediate anomalies in a dataset originating in a first format prior to enrichment and conversion into a second format that facilitates forming a collaborative dataset and, for example, interrelations among a system of networked collaborative datasets, whereby, at least in some implementations, data interrelations between different formats may be disposed in one or more data layers (e.g., layered data files and/or data arrangements). In some examples, a method may include analyzing data to detect a non-compliant data attribute, detecting a condition based on the non-compliant data attribute, invoking an action to modify a subset of data, and generating a graph data arrangement linkable to other graph data arrangements to form a collaborative dataset.
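
The detect, act, and convert sequence in the abstract above can be sketched as follows. This is an illustration under stated assumptions, not the method claimed in the application: "non-compliant" is taken to mean a value that cannot be parsed as a number, the remedial action is simply to null that value, and the graph data arrangement is approximated by a list of subject-predicate-object triples. The function names and the sample rows are hypothetical.

```python
def remediate(rows, column):
    """Flag rows whose `column` value is not numeric and null that value."""
    cleaned, flagged = [], []
    for i, row in enumerate(rows):
        try:
            float(row[column])                      # compliance check (assumed rule)
            cleaned.append(dict(row))
        except (TypeError, ValueError):
            flagged.append(i)                       # condition detected
            cleaned.append({**row, column: None})   # action: modify the data subset
    return cleaned, flagged


def to_triples(rows, subject_prefix):
    """Convert tabular rows into (subject, predicate, object) triples."""
    triples = []
    for i, row in enumerate(rows):
        subject = f"{subject_prefix}/row/{i}"
        for key, value in row.items():
            if value is not None:                   # nulled values yield no triple
                triples.append((subject, key, value))
    return triples


# Hypothetical sample data in the "first format" (tabular rows).
rows = [{"city": "Austin", "pop": "950000"}, {"city": "Waco", "pop": "n/a"}]
cleaned, flagged = remediate(rows, "pop")
triples = to_triples(cleaned, "dataset:cities")
```

Because every triple carries an addressable subject, triples produced from different datasets could be linked on shared subjects or objects, which is the sense in which the output is "linkable" here.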

    TRANSMUTING DATA ASSOCIATIONS AMONG DATA ARRANGEMENTS TO FACILITATE DATA OPERATIONS IN A SYSTEM OF NETWORKED COLLABORATIVE DATASETS

    Publication No.: US20190042606A1

    Publication date: 2019-02-07

    Application No.: US15943629

    Filing date: 2018-04-02

    IPC classification: G06F17/30

    Abstract: Various embodiments relate generally to data science and data analysis and computer software and systems to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform configured to transmute associations between data arrangements of different formats or different data models to facilitate data operations, such as queries, configured to enhance, for example, an ingested dataset via transmuted associations as, for example, interrelations among a system of networked collaborative datasets. For example, a method may include identifying a referential indicator, determining an association with a value representative of the referential indicator to an equivalent value representative of another referential indicator associated with a different dataset, transmuting the association to form a transmuted association as a link between the value and the equivalent value, and integrating the link into an ingested data arrangement.
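
One way to picture the linking step in the abstract above is the sketch below. It assumes, purely for illustration, that a referential indicator is a column whose values refer to entries in another dataset, and that two values are "equivalent" when they match after trimming whitespace and ignoring case. The function name, the `sameAs` predicate label, and the sample data are hypothetical and do not come from the application.

```python
def transmute_associations(ingested_rows, column, other_rows, other_key):
    """Link values in `column` to equivalent key values in another dataset."""
    # Index the other dataset by a normalized form of its key values.
    index = {str(r[other_key]).strip().lower(): r[other_key] for r in other_rows}
    links = []
    for row in ingested_rows:
        normalized = str(row[column]).strip().lower()
        if normalized in index:
            # The link records the ingested value and its equivalent value.
            links.append((row[column], "sameAs", index[normalized]))
    return links


# Hypothetical sample data.
ingested = [{"state": "tx"}, {"state": "ZZ"}]
reference = [{"code": "TX"}, {"code": "CA"}]
links = transmute_associations(ingested, "state", reference, "code")
```

The resulting links could then be stored alongside the ingested rows so that a later query can follow them into the other dataset.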

    DATA INGESTION TO GENERATE LAYERED DATASET INTERRELATIONS TO FORM A SYSTEM OF NETWORKED COLLABORATIVE DATASETS

    Publication No.: US20180314705A1

    Publication date: 2018-11-01

    Application No.: US15926999

    Filing date: 2018-03-20

    IPC classification: G06F17/30

    Abstract: Various embodiments relate generally to data science and data analysis, and computer software and systems to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby data ingestion is performed to form data representing layered data files and data arrangements to facilitate, for example, interrelations among a system of networked collaborative datasets. In some examples, a method may include forming a first layer data file and a second layer data file, assigning addressable identifiers to uniquely identify units of data and data units to facilitate the linking of data, and implementing selectively one or more of a unit of data and a data unit as a function of a context of a data access request for a collaborative dataset.
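
The layered arrangement in the abstract above admits several readings. The sketch below takes one of them as an assumption: the first layer holds values exactly as ingested, the second layer holds typed values derived from them, both layers share the same addressable identifiers, and the context of a request decides which layer answers it. The address scheme, the `"raw"` and `"typed"` context names, and the helper functions are hypothetical.

```python
def build_layers(rows):
    """Form two layers keyed by the same addressable identifiers."""
    layer1, layer2 = {}, {}
    for i, row in enumerate(rows):
        for column, value in row.items():
            address = f"row/{i}/{column}"   # identifier for one unit of data
            layer1[address] = value         # first layer: value as ingested
            try:
                layer2[address] = int(value)    # second layer: typed value
            except (TypeError, ValueError):
                layer2[address] = value         # keep the raw value if not an integer
    return layer1, layer2


def read(address, context, layer1, layer2):
    """Select the layer to answer from based on the request context."""
    return (layer2 if context == "typed" else layer1)[address]


# Hypothetical sample data.
layer1, layer2 = build_layers([{"city": "Austin", "pop": "950000"}])
```

Keeping the raw layer intact means the typed layer can be regenerated or extended later without re-ingesting the source.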