-
公开(公告)号:US11455283B2
公开(公告)日:2022-09-27
申请号:US16848463
申请日:2020-04-14
Applicant: SAP SE
Inventor: Jan Portisch , Ronald Boehle , Volker Saggau , Sandra Bracholdt
Abstract: Techniques and solutions are described for determining a set of elements of a second set that may correspond to a given element of a first set of elements. The elements can be, in specific examples, components of a database system, such as tables (or entities), attributes, or records. Significance metric values are calculated for elements in the first and second sets. The significance metric values can be a number of records in an entity or a number of read or write access operations for an entity or for a record of an entity. A significance metric value for the given element can be used at least in part to select elements of the second set as potential match candidates, based at least in part on significance metric values for elements of the second set. Selecting elements can include selecting elements based on a window of elements of the second set or a range of significance metric values.
-
公开(公告)号:US11449529B2
公开(公告)日:2022-09-20
申请号:US16926097
申请日:2020-07-10
Applicant: SAP SE
Inventor: Sandra Bracholdt , Volker Saggau , Jan Portisch
IPC: G06F16/00 , G06F16/28 , G06F16/26 , G06F16/21 , G06F3/04842
Abstract: Techniques and solutions are described for determining, evaluating, and selecting data retrieval pathways. A plurality of database objects, such as tables, can be retrieved, and relationships between them determined. The relationships are between various attributes of the database objects. A plurality of pathways are determined between at least two target database objects, where the path determination does not consider attribute-level relationship information. After a path is determined, the relationship-level information can be added. A summary of the path, and relationship information, can be displayed to a user. A user can select paths to remove from the display, and can indicate one or more database objects that are required for a valid path, one or more database objects that are prohibited from valid paths, or combinations thereof. A user can select a path, and a query language statement to implement the path can be generated and displayed to the user.
-
公开(公告)号:US11436244B2
公开(公告)日:2022-09-06
申请号:US17160153
申请日:2021-01-27
Applicant: SAP SE
Inventor: Jan Portisch , Sandra Bracholdt , Oliver Huth , Volker Saggau
IPC: G06F16/24 , G06F16/2458
Abstract: A computer-implemented method can receive a source table containing data tuples and a source schema defining attributes of the data tuples, and match the source schema to an ontology of a knowledge graph. The knowledge graph can include a plurality of instances and the ontology defines properties of the plurality of instances. The computer-implemented method can link the data tuples to respective instances in the knowledge graph, and identifying non-matching properties of the respective instances, wherein the non-matching properties are defined in the ontology and not matched to the source schema. The computer-implemented method can obtain property values associated with the non-matching properties from the knowledge graph, and add one or more of the non-matching properties and the associated property values to respective data tuples of the source table.
-
公开(公告)号:US20220004532A1
公开(公告)日:2022-01-06
申请号:US16920260
申请日:2020-07-02
Applicant: SAP SE
Inventor: Jan Portisch , Volker Saggau , Sandra Bracholdt
IPC: G06F16/22
Abstract: A variety of scenarios for generating mock data based on original data are described. Characteristics of the original data can be reflected in the mock data without having access to the actual original data. Instead, statistical artifacts representing the original data can be used to instantiate reconstructed database tables. In relational database scenarios, foreign key relationships can be preserved even if the actual values are not known. The reconstructed database tables can be used as mock data in a variety of circumstances, including testing, and is especially useful to recreate characteristics of the original data when the original data cannot be made available for size, confidentiality, or privacy reasons.
-
公开(公告)号:US20210240675A1
公开(公告)日:2021-08-05
申请号:US16780481
申请日:2020-02-03
Applicant: SAP SE
Inventor: Jan Portisch , Bjoern Pantel , Benjamin Merkle , Stephan Schub , Sandra Bracholdt , Roland Kolata , Michael Hladik , Christoph Meinel
Abstract: Techniques and solutions are described for converting data models between formats, such as between a conceptual data model and a physical data model for a database system, or between a conceptual data model and artefacts to be implemented in the database system. The conceptual data model is annotated with annotations that allow the physical data model or database artefacts to be automatically generated from the conceptual data model. The annotations can reflect relationships between entity types in the physical data model, such as inheritance relationships, header/item relationships, or one-to-one cardinality relationships. Annotations can also indicate attributes that should be added to entity types in the conceptual data model, such as attributes for versioning or data governance, that may not be used in the conceptual data model. Annotations can be used to determine how entity types in the conceptual data model will be denormalized in the physical data model.
-
公开(公告)号:US20210073655A1
公开(公告)日:2021-03-11
申请号:US16567470
申请日:2019-09-11
Applicant: SAP SE
Inventor: Jan Portisch , Ronald Boehle , Volker Saggau , Sandra Bracholdt
IPC: G06N5/02 , G06F16/901
Abstract: Smart rule development and rule mining functionality is provided herein. Rule mining for use in rule development can include generating logic statement proposals, rule deduplication, and rule template generation. Rule mining can include accessing a rule set to analyze the rule set against an input logic statement to identify existing rules which match at least in part the input logic statement. Rule deduplication can include returning exact rule matches to replace the input logic statement. Proposing logic statements can include returning logically related rules from rules found that include the input logic statement. Generating rule templates can include returning a template based on the entire rule(s) which includes the input logic statement. Ranking scores can be calculated for returned rules, whether for deduplication, proposals, or template generation. The scores can be based on statistical information for the rules, such as usage of the rule or coverage of the rule.
-
公开(公告)号:US20210012219A1
公开(公告)日:2021-01-14
申请号:US16507465
申请日:2019-07-10
Applicant: SAP SE
Inventor: Jan Portisch , Sandra Bracholdt , Michael Monych , Volker Saggau
Abstract: Automatic rule generation is provided herein for generating data mapping, data transformation, or process flow rules or logic statements. The rules may be generated based on a field or attribute, and may be further based on a partial rule or one or more existing rules, or a combination thereof. Proposed rules may be generated based on analysis of a data set, including identifying possible values for the attribute and to calculate scores for the possible values. A score may be the probability of the value based on the data set. The data set may be cleaned or scrubbed based on the partial rule or existing rules. The proposed rules may be provided to a user, or may be automatically selected. Rule generation may include constraint checking. Constraint checking may include detecting empty data sets or detecting when two rules are not mutually exclusive.
-
公开(公告)号:US10726052B2
公开(公告)日:2020-07-28
申请号:US16027010
申请日:2018-07-03
Applicant: SAP SE
Inventor: Sandra Bracholdt , Volker Saggau , Jan Portisch
IPC: G06F16/00 , G06F16/28 , G06F16/26 , G06F16/21 , G06F3/0484
Abstract: Techniques and solutions are described for determining, evaluating, and selecting data retrieval pathways. A plurality of database objects, such as tables, can be retrieved, and relationships between them determined. The relationships are between various attributes of the database objects. A plurality of pathways are determined between at least two target database objects, where the path determination does not consider attribute-level relationship information. After a path is determined, the relationship-level information can be added. A summary of the path, and relationship information, can be displayed to a user. A user can select paths to remove from the display, and can indicate one or more database objects that are required for a valid path, one or more database objects that are prohibited from valid paths, or combinations thereof. A user can select a path, and a query language statement to implement the path can be generated and displayed to the user.
-
公开(公告)号:US20160357787A1
公开(公告)日:2016-12-08
申请号:US14732155
申请日:2015-06-05
Applicant: SAP SE
Inventor: Roland Kolata , Sandra Bracholdt , Volker Saggau
IPC: G06F17/30 , G06F3/0484
CPC classification number: G06F16/211
Abstract: According to an embodiment of the present disclosures, systems, methods, and non-transitory computer-readable mediums having program instructions thereon, provide for a framework for automatically transferring metadata from different sources and different components of a computing platform. Further, the framework also provides for the seamless inclusion of additional metadata sources from the computing platform. The framework also provides for visualizing (e.g., modeling) the extracted metadata with a graphical user interface software application.
Abstract translation: 根据本公开的实施例,其上具有程序指令的系统,方法和非暂时性计算机可读介质提供了用于自动从不同来源和计算平台的不同组件传送元数据的框架。 此外,该框架还提供了从计算平台无缝包含附加元数据源。 该框架还提供用图形用户界面软件应用程序可视化(例如,建模)所提取的元数据。
-
公开(公告)号:US20250131031A1
公开(公告)日:2025-04-24
申请号:US18382992
申请日:2023-10-23
Applicant: SAP SE
Inventor: Jan Portisch , Sandra Bracholdt
IPC: G06F16/36 , G06F16/901
Abstract: Class definitions for an ontology of a domain are determined using a materialized instance graph, where the ontology is used for semantic query execution, automated analytical reasoning, or for machine learning. A plurality of instances graphs for a respective plurality of domain instances are received. A materialized instance graph is generated from the plurality of instance graphs. One or more communities represented in the materialized instance graph are determined. Properties associated with respective communities of the one or more communities are determined. Class definitions are generated, where a class corresponds to a community of the one or more communities and at least a portion of properties associated with the community. Class definitions are assigned to the ontology for the domain.
-
-
-
-
-
-
-
-
-