Deep group disentangled embedding and network weight generation for visual inspection

    公开(公告)号:US11087174B2

    公开(公告)日:2021-08-10

    申请号:US16580497

    申请日:2019-09-24

    Abstract: A method is provided for visual inspection. The method includes learning, by a processor, group disentangled visual feature embedding vectors of input images. The input images include defective objects and defect-free objects. The method further includes generating, by the processor using a weight generation network, classification weights from visual features and semantic descriptions. Both the visual features and the semantic descriptions are for predicting defective and defect-free labels. The method also includes calculating, by the processor, a cosine similarity score between the classification weights and the group disentangled visual feature embedding vectors. The method additionally includes episodically training, by the processor, the weight generation network on the input images to update parameters of the weight generation network. The method further includes generating, by the processor using the trained weight generation network, a prediction of a test image as including any of defective objects and defect-free objects.

    CONTROLLED TEXT GENERATION WITH SUPERVISED REPRESENTATION DISENTANGLEMENT AND MUTUAL INFORMATION MINIMIZATION

    公开(公告)号:US20210174213A1

    公开(公告)日:2021-06-10

    申请号:US17115464

    申请日:2020-12-08

    Abstract: A computer-implemented method is provided for disentangled data generation. The method includes accessing, by a bidirectional Long Short-Term Memory (LSTM) with a multi-head attention mechanism, a dataset including a plurality of pairs each formed from a given one of a plurality of input text structures and given one of a plurality of style labels for the plurality of input text structures. The method further includes training the bidirectional LSTM as an encoder to disentangle a sequential text input into disentangled representations comprising a content embedding and a style embedding based on a subset of the dataset. The method also includes training a unidirectional LSTM as a decoder to generate a next text structure prediction for the sequential text input based on previously generated text structure information and a current word, from a disentangled representation with the content embedding and the style embedding.

    DEEP GROUP DISENTANGLED EMBEDDING AND NETWORK WEIGHT GENERATION FOR VISUAL INSPECTION

    公开(公告)号:US20200097771A1

    公开(公告)日:2020-03-26

    申请号:US16580497

    申请日:2019-09-24

    Abstract: A method is provided for visual inspection. The method includes learning, by a processor, group disentangled visual feature embedding vectors of input images. The input images include defective objects and defect-free objects. The method further includes generating, by the processor using a weight generation network, classification weights from visual features and semantic descriptions. Both the visual features and the semantic descriptions are for predicting defective and defect-free labels. The method also includes calculating, by the processor, a cosine similarity score between the classification weights and the group disentangled visual feature embedding vectors. The method additionally includes episodically training, by the processor, the weight generation network on the input images to update parameters of the weight generation network. The method further includes generating, by the processor using the trained weight generation network, a prediction of a test image as including any of defective objects and defect-free objects.

    Large margin high-order deep learning with auxiliary tasks for video-based anomaly detection

    公开(公告)号:US10402653B2

    公开(公告)日:2019-09-03

    申请号:US15380014

    申请日:2016-12-15

    Abstract: A computer-implemented method and system are provided for video-based anomaly detection. The method includes forming, by a processor, a Deep High-Order Convolutional Neural Network (DHOCNN)-based model having a one-class Support Vector Machine (SVM) as a loss layer of the DHOCNN-based model. An objective of the SVM is configured to perform the video-based anomaly detection. The method further includes generating, by the processor, one or more predictions of an impending anomaly based on the high-order deep learning based model applied to an input image. The method also includes initiating, by the processor, an action to a hardware device to mitigate expected harm to at least one item selected from the group consisting of the hardware device, another hardware device related to the hardware device, and a person related to the hardware device.

    Fast distributed nonnegative matrix factorization and completion for big data analytics

    公开(公告)号:US10304008B2

    公开(公告)日:2019-05-28

    申请号:US15063236

    申请日:2016-03-07

    Abstract: Systems and methods are disclosed for operating a machine, by receiving training data from one or more sensors; training a machine learning module with the training data by: partitioning a data matrix into smaller submatrices to process in parallel and optimized for each processing node; for each submatrix, performing a greedy search for rank-one solutions; using alternating direction method of multipliers (ADMM) to ensure consistency over different data blocks; and controlling one or more actuators using live data and the learned module during operation.

    Adaptive Convolutional Neural Knowledge Graph Learning System Leveraging Entity Descriptions

    公开(公告)号:US20190122111A1

    公开(公告)日:2019-04-25

    申请号:US16168244

    申请日:2018-10-23

    Abstract: Systems and methods for predicting new relationships in the knowledge graph, including embedding a partial triplet including a head entity description and a relationship or a tail entity description to produce a separate vector for each of the head, relationship, and tail. The vectors for the head entity, relationship, and tail entity can be combined into a first matrix, and adaptive kernels generated from the entity descriptions can be applied to the matrix through convolutions to produce a second matrix having a different dimension from the first matrix. An activation function can be applied to the second matrix to obtain non-negative feature maps, and max-pooling can be used over the feature maps to get subsamples. A fixed length vector, Z, flattens the subsampling feature maps into a feature vector, and a linear mapping method is used to map the feature vectors into a prediction score.

Patent Agency Ranking