METHOD AND SYSTEM FOR TRAINING A MACHINE LEARNING MODEL

    公开(公告)号:US20230046653A1

    公开(公告)日:2023-02-16

    申请号:US17879178

    申请日:2022-08-02

    IPC分类号: G06N20/00 G06N5/02 G06N5/04

    摘要: An initially trained machine learning model is used by an active learning module to generate candidate triples, which are fed into an expert system for verification. As a result, the expert system outputs novel facts that are used for retraining the machine learning model. This approach consolidates expert systems with machine learning through iterations of an active learning loop, by bringing the two paradigms together, which is in general difficult because training of a neural network (machine learning) requires differentiable functions and rules (used by expert systems) tend not to be differentiable. The method and system provide a data augmentation strategy where the expert system acts as an oracle and outputs the novel facts, which provide labels for the candidate triples. The novel facts provide critical information from the oracle that is injected into the machine learning model at the retraining stage, thus allowing to increase its generalization performance.

    METHOD AND SYSTEM FOR RESTORING CONSISTENCY OF A DIGITAL TWIN DATABASE

    公开(公告)号:US20240143623A1

    公开(公告)日:2024-05-02

    申请号:US18494627

    申请日:2023-10-25

    IPC分类号: G06F16/27

    CPC分类号: G06F16/273

    摘要: To restore consistency of a digital twin database, identifiers with metadata imported from various data sources are processed by an encoder, which computes latent representations of the identifiers that are compared by an efficient similarity metric. If the respective similarity score exceeds a threshold, a match is detected between the identifiers. In that case, the digital twin database is updated by aligning the first identifier and the second identifier. This matching algorithm for equipment identifiers updates the digital twin data automatically and continuously by aligning identifiers which refer to the same piece of equipment. The updates flow directly into the digital twin database, thereby removing the manual effort. Using approximate nearest neighbor methods is highly efficient, especially for large plants. The encoder is implemented as an autoencoder which relies only on unlabeled training data. This unsupervised approach is more suitable for industrial scenarios where labeled data is expensive to create.

    Control apparatus of an automation system

    公开(公告)号:US10545967B2

    公开(公告)日:2020-01-28

    申请号:US15513272

    申请日:2014-09-25

    摘要: A control apparatus of an automation system, the control apparatus includes a database adapted to store time series data in a historian data source and adapted to store events derived from the time series data based on event detection rules in an event data source, wherein a semantic data or event query received by the control apparatus is mapped to a corresponding data source of the database to retrieve the queried data or event which are contextualized using an ontological context model of the automation system stored in the database and output by control apparatus in a semantic format is provided.

    METHOD AND SYSTEM FOR AUTOMATED COLUMN TYPE ANNOTATION

    公开(公告)号:US20230418802A1

    公开(公告)日:2023-12-28

    申请号:US18338302

    申请日:2023-06-20

    IPC分类号: G06F16/22 G06F16/21

    CPC分类号: G06F16/2282 G06F16/211

    摘要: A solution for automated column type annotation maps each column contained in a table to a column annotation class. A pre-processor transforms the table into a numerical tensor representation by outputting a sequence of cell tokens for each cell in the table. A table encoder encodes the sequences of cell tokens and a column annotation label for each column into body cell embeddings. A body pooling component processes the body cell embeddings to provide column representations. A classifier classifies the column representations to provide for each column, confidence scores for each column annotation class. The method concludes with comparing the highest confidence score for each column with a threshold, and, if the highest confidence score for each column is above the threshold, annotating each column with the respective column annotation class.

    METHOD AND SYSTEM FOR SEMI-AUTOMATIC COMPLETION OF AN ENGINEERING PROJECT

    公开(公告)号:US20230273573A1

    公开(公告)日:2023-08-31

    申请号:US18113224

    申请日:2023-02-23

    IPC分类号: G05B13/02

    CPC分类号: G05B13/027

    摘要: A database stores a set of items, with each item having technical attributes, and with each item representing a module that can be used in an engineering project of a first user, u1. A feature encoder embeds each item based on its technical attributes into a low-dimensional vector space. Then, in a second step, a graph neural network pools over these item embeddings to compute an updated user embedding for the first user A decoder mapping then addresses the recommendation task by outputting recommendation scores for each item. That means, heuristically speaking, that the method and system lift the recommendation task to the level of technical attributes to overcome the sparsity problem caused by item sets that are not overlapping between user groups. Thus, when matching similar users, the method does not rely on users configuring exactly the same modules but rather on configured modules that are similar from a technical point of view.