SYSTEM AND METHOD FOR DECOMPOSITION OF CODE GENERATION INTO SEPARATE PHYSICAL UNITS THOUGH EXECUTION UNITS
    11.
    发明申请
    SYSTEM AND METHOD FOR DECOMPOSITION OF CODE GENERATION INTO SEPARATE PHYSICAL UNITS THOUGH EXECUTION UNITS 有权
    通过执行单位将代码分解成单独的物理单位的系统和方法

    公开(公告)号:US20140344310A1

    公开(公告)日:2014-11-20

    申请号:US14077140

    申请日:2013-11-11

    Abstract: In various embodiments, a data integration system is disclosed which enables users to create a logical design which is platform and technology independent. The user can create a logical design that defines, at a high level, how a user wants data to flow between sources and targets. The tool can analyze the logical design, in view of the user's infrastructure, and create a physical design. The logical design can include a plurality of components corresponding to each source and target in the design, as well as operations such as joins or filters, and access points. Each component when transferred to the physical design generates code to perform operations on the data. Depending on the underlying technology (e.g., SQL Server, Oracle, Hadoop, etc.) and the language used (SQL, pig, etc.) the code generated by each component may be different.

    Abstract translation: 在各种实施例中,公开了一种数据集成系统,其使得用户能够创建平台和技术无关的逻辑设计。 用户可以创建逻辑设计,从而在高级别定义用户如何在数据源和目标之间传输数据。 该工具可以根据用户的基础设施分析逻辑设计,并创建物理设计。 逻辑设计可以包括与设计中的每个源和目标对应的多个组件,以及诸如连接或过滤器以及接入点的操作。 转移到物理设计中的每个组件生成代码以对数据执行操作。 根据基础技术(例如,SQL Server,Oracle,Hadoop等)和所使用的语言(SQL,pig等),每个组件生成的代码可能不同。

    TECHNIQUES FOR EFFICIENT COMPUTE RESOURCE HARVESTING

    公开(公告)号:US20250147942A1

    公开(公告)日:2025-05-08

    申请号:US19013493

    申请日:2025-01-08

    Abstract: The present disclosure relates to a system and techniques for resolving dangling references resulting from a dependency relationship between computing resource objects uncovered during a harvesting process. The techniques include, adding a computing resource object from a catalog of computing resource objects to a computing resource collection for a client and identifying one or more dependencies for the computing resource object. The techniques further include determining at least one unresolved dependency from the one or more dependencies, the at least one unresolved dependency including a second dependency on a second computing resource object outside of the computing resource collection. The techniques further include resolving the at least one unresolved dependency after the second computing resource object associated with the unresolved dependency has been added to the computing resource collection.

    GENERATING EXTERNAL IDENTIFIERS FOR DATA ENTITIES USING A DATA CATALOG SYSTEM

    公开(公告)号:US20220114163A1

    公开(公告)日:2022-04-14

    申请号:US17069778

    申请日:2020-10-13

    Abstract: A data catalog system is disclosed that provides capabilities for uniquely identifying and retrieving data entities stored in diverse data sources managed by an organization. The data catalog system includes capabilities for generating a unique external identifier for a data entity (e.g., a data asset or a data object) by identifying a set of immutable configuration parameters associated with the data asset and identifying a set of data object attributes that uniquely identify data objects within the data asset. The generated unique external identifiers are stored as part of the metadata harvested by the data catalog system. The external identifiers are used to enforce a single representation of the data assets and the data objects in the data catalog system. The external object identifiers are used to perform data lookups and reconcile states of data entities during the metadata harvesting process.

    TECHNIQUES FOR EFFICIENT COMPUTE RESOURCE HARVESTING

    公开(公告)号:US20220100568A1

    公开(公告)日:2022-03-31

    申请号:US17036345

    申请日:2020-09-29

    Abstract: The present disclosure relates to a system and techniques for resolving dangling references resulting from a dependency relationship between computing resource objects uncovered during a harvesting process. In embodiments, a harvester application adds computing resource objects associated with a client to a resource collection as those computing resource objects are identified. Dependencies are identified as each computing resource object is added to the resource collection, which are resolved only if the computing resource objects associated with those dependencies have already been added to the resource collection. If the computing resource objects associated with the dependencies have not already been added to the resource collection, then the dependency is added to an observer pool. Observer modules are configured to check each computing resource object as it is processed during the harvest process in order to match those computing resource objects to unresolved dependencies.

Patent Agency Ranking