-
31.
公开(公告)号:US12282757B2
公开(公告)日:2025-04-22
申请号:US17490186
申请日:2021-09-30
Applicant: Oracle International Corporation
IPC: G06F16/00 , G06F3/042 , G06F3/0482 , G06F3/06 , G06F8/10 , G06F8/34 , G06F8/41 , G06F16/14 , G06F16/21 , G06F16/23 , G06F16/25 , G06F16/435 , G06F17/18 , G06F40/30 , G06N5/022 , G06N5/04 , G06N5/046 , G06N20/00 , G06Q10/0637 , G06F9/50
Abstract: In accordance with various embodiments, described herein is a system (Data Artificial Intelligence system, Data AI system), for use with a data integration or other computing environment, that leverages machine learning (ML, DataFlow Machine Learning, DFML), for use in managing a flow of data (dataflow, DF), and building complex dataflow software applications (dataflow applications, pipelines). In accordance with an embodiment, the system can provide support for auto-mapping of complex data structures, datasets or entities, between one or more sources or targets of data, referred to herein in some embodiments as HUBs. The auto-mapping can be driven by a metadata, schema, and statistical profiling of a dataset; and used to map a source dataset or entity associated with an input HUB, to a target dataset or entity or vice versa, to produce an output data prepared in a format or organization (projection) for use with one or more output HUBs.
-
公开(公告)号:US12235818B2
公开(公告)日:2025-02-25
申请号:US18545765
申请日:2023-12-19
Applicant: Oracle International Corporation
Inventor: Ganesh Seetharaman , Robert Costin Velisar , Yuen Sheung Chan
Abstract: The present disclosure relates to a system and techniques for resolving dangling references resulting from a dependency relationship between computing resource objects uncovered during a harvesting process. In embodiments, a harvester application adds computing resource objects associated with a client to a resource collection as those computing resource objects are identified. Dependencies are identified as each computing resource object is added to the resource collection, which are resolved only if the computing resource objects associated with those dependencies have already been added to the resource collection. If the computing resource objects associated with the dependencies have not already been added to the resource collection, then the dependency is added to an observer pool. Observer modules are configured to check each computing resource object as it is processed during the harvest process in order to match those computing resource objects to unresolved dependencies.
-
公开(公告)号:US11966384B2
公开(公告)日:2024-04-23
申请号:US17069778
申请日:2020-10-13
Applicant: Oracle International Corporation
Inventor: Ganesh Seetharaman , Robert Costin Velisar , Yuda Dai , Yuen Sheung Chan
IPC: G06F16/22 , G06F16/23 , G06F16/958
CPC classification number: G06F16/2379 , G06F16/2228 , G06F16/2365 , G06F16/972
Abstract: A data catalog system is disclosed that provides capabilities for uniquely identifying and retrieving data entities stored in diverse data sources managed by an organization. The data catalog system includes capabilities for generating a unique external identifier for a data entity (e.g., a data asset or a data object) by identifying a set of immutable configuration parameters associated with the data asset and identifying a set of data object attributes that uniquely identify data objects within the data asset. The generated unique external identifiers are stored as part of the metadata harvested by the data catalog system. The external identifiers are used to enforce a single representation of the data assets and the data objects in the data catalog system. The external object identifiers are used to perform data lookups and reconcile states of data entities during the metadata harvesting process.
-
公开(公告)号:US20240126736A1
公开(公告)日:2024-04-18
申请号:US18545765
申请日:2023-12-19
Applicant: Oracle International Corporation
Inventor: Ganesh Seetharaman , Robert Costin Velisar , Yuen Sheung Chan
IPC: G06F16/22
CPC classification number: G06F16/2228
Abstract: The present disclosure relates to a system and techniques for resolving dangling references resulting from a dependency relationship between computing resource objects uncovered during a harvesting process. In embodiments, a harvester application adds computing resource objects associated with a client to a resource collection as those computing resource objects are identified. Dependencies are identified as each computing resource object is added to the resource collection, which are resolved only if the computing resource objects associated with those dependencies have already been added to the resource collection. If the computing resource objects associated with the dependencies have not already been added to the resource collection, then the dependency is added to an observer pool. Observer modules are configured to check each computing resource object as it is processed during the harvest process in order to match those computing resource objects to unresolved dependencies.
-
公开(公告)号:US11531675B1
公开(公告)日:2022-12-20
申请号:US17379860
申请日:2021-07-19
Applicant: Oracle International Corporation
Inventor: Gopal Srinivasa Raghavan , Abhiram Madhukar Gujjewar , Ganesh Seetharaman , Jai Motwani , Sayon Dutta , Rajat Mahajan , Manasjyoti Sharma
IPC: G06F17/00 , G06F7/00 , G06F16/2457 , G06F16/22 , G06N20/00 , G06F16/248
Abstract: A machine-learning model may be previously trained with a supervised learning algorithm to identify whether a pair of labels provided as input are similar. A locality sensitive hashing forest (LSH) may be generated for the set of candidate labels. When a user later identifies an input label (e.g., by search query, by interface selection, etc.) the input label may be used to query the LSH forest to identify a subset of the candidate labels. This subset may be used to generate respective pairs comprising the input label, one of the subset candidate labels, and a corresponding feature set generated for the pair. This data may be provided to the model to identify a degree to which the pair of labels are similar. The user may be provided one or more recommendations including similar terms identified from the model's output.
-
公开(公告)号:US11526338B2
公开(公告)日:2022-12-13
申请号:US16921533
申请日:2020-07-06
Applicant: ORACLE INTERNATIONAL CORPORATION
IPC: G06F9/44 , G06F3/0484 , G06F8/41 , G06N20/00 , G06F16/14 , G06F16/21 , G06F16/25 , G06F16/435 , G06F16/23 , G06N5/04 , G06Q10/06 , G06F40/30 , G06F3/0482 , G06F17/18 , G06N5/02 , G06F8/10 , G06F8/34 , G06F3/042 , G06F3/06 , G06F9/50
Abstract: In accordance with various embodiments, described herein is a system (Data Artificial Intelligence system, Data AI system), for use with a data integration or other computing environment, that leverages machine learning (ML, DataFlow Machine Learning, DFML), for use in managing a flow of data (dataflow, DF), and building complex dataflow software applications (dataflow applications, pipelines). In accordance with an embodiment, the system can provide a service to recommend actions and transformations, on an input data, based on patterns identified from the functional decomposition of a data flow for a software application, including determining possible transformations of the data flow in subsequent applications. Data flows can be decomposed into a model describing transformations of data, predicates, and business rules applied to the data, and attributes used in the data flows.
-
公开(公告)号:US20220114168A1
公开(公告)日:2022-04-14
申请号:US17217881
申请日:2021-03-30
Applicant: Oracle International Corporation
Inventor: Ganesh Seetharaman , Robert Velisar , Geoffrey William Watters , Yuda Dai
IPC: G06F16/245 , G06F16/248 , G06F16/22 , G06F16/23
Abstract: Systems, devices, and methods discussed herein are directed to utilizing patterns and logical entities to identify and maintain relationships between data assets. In some embodiments, a query comprising a logical entity qualifier, one or more pattern identifiers that indicate a pattern, and a data set identifier may be received. The pattern is executed against a data set corresponding to the data set identifier and one or more logical entities are generated based on this execution. A logical entity may be a label that represents a set of one or more data assets in a data set. Assets that share a label can share attributes that are described by the label. The label corresponding to each logical entity may be presented, where each label represents a different set of data assets which share a common trait. In some embodiments, the user may define a pattern by which commonality may be assessed.
-
38.
公开(公告)号:US10776086B2
公开(公告)日:2020-09-15
申请号:US15683563
申请日:2017-08-22
Applicant: ORACLE INTERNATIONAL CORPORATION
IPC: G06F7/00 , G06F8/41 , G06N20/00 , G06F16/14 , G06F16/21 , G06F16/25 , G06F16/435 , G06F16/23 , G06N5/04 , G06Q10/06 , G06F40/30 , G06F3/0482 , G06F17/18 , G06N5/02 , G06F8/10 , G06F8/34 , G06F3/042 , G06F3/06 , G06F9/50
Abstract: In accordance with various embodiments, described herein is a system (Data Artificial Intelligence system, Data AI system), for use with a data integration or other computing environment, that leverages machine learning (ML, DataFlow Machine Learning, DFML), for use in managing a flow of data (dataflow, DF), and building complex dataflow software applications (dataflow applications, pipelines). In accordance with an embodiment, the system provides a programmatic interface, referred to herein in some embodiments as a foreign function interface, by which a user or third-party can define a service, functional and business types, semantic actions, and patterns or predefined complex data flows based on functional and business types, in a declarative manner, to extend the functionality of the system.
-
39.
公开(公告)号:US20200241854A1
公开(公告)日:2020-07-30
申请号:US16845738
申请日:2020-04-10
Applicant: ORACLE INTERNATIONAL CORPORATION
IPC: G06F8/41 , G06N20/00 , G06F16/14 , G06F16/21 , G06F16/25 , G06F16/435 , G06F16/23 , G06N5/04 , G06Q10/06 , G06F40/30 , G06F3/0482 , G06F17/18 , G06N5/02 , G06F8/10 , G06F8/34 , G06F3/042 , G06F3/06
Abstract: In accordance with various embodiments, described herein is a system (Data Artificial Intelligence system, Data AI system), for use with a data integration or other computing environment, that leverages machine learning (ML, DataFlow Machine Learning, DFML), for use in managing a flow of data (dataflow, DF), and building complex dataflow software applications (dataflow applications, pipelines). In accordance with an embodiment, the system can perform an ontology analysis of a schema definition, to determine the types of data, and datasets or entities, associated with that schema; and generate, or update, a model from a reference schema that includes an ontology defined based on relationships between datasets or entities, and their attributes. A reference HUB including one or more schemas can be used to analyze data flows, and further classify or make recommendations such as, for example, transformations enrichments, filtering, or cross-entity data fusion of an input data.
-
40.
公开(公告)号:US20180052870A1
公开(公告)日:2018-02-22
申请号:US15683559
申请日:2017-08-22
Applicant: ORACLE INTERNATIONAL CORPORATION
CPC classification number: G06F8/433 , G06F3/0428 , G06F3/0482 , G06F3/0649 , G06F8/10 , G06F8/34 , G06F8/4452 , G06F9/5061 , G06F16/144 , G06F16/211 , G06F16/2322 , G06F16/2358 , G06F16/254 , G06F16/435 , G06F17/18 , G06F17/2785 , G06N5/022 , G06N5/04 , G06N5/046 , G06N20/00 , G06Q10/0637
Abstract: In accordance with various embodiments, described herein is a system (Data Artificial Intelligence system, Data AI system), for use with a data integration or other computing environment, that leverages machine learning (ML, DataFlow Machine Learning, DFML), for use in managing a flow of data (dataflow, DF), and building complex dataflow software applications (dataflow applications, pipelines). In accordance with an embodiment, the system can perform an ontology analysis of a schema definition, to determine the types of data, and datasets or entities, associated with that schema; and generate, or update, a model from a reference schema that includes an ontology defined based on relationships between datasets or entities, and their attributes. A reference HUB including one or more schemas can be used to analyze data flows, and further classify or make recommendations such as, for example, transformations enrichments, filtering, or cross-entity data fusion of an input data.
-
-
-
-
-
-
-
-
-