SYSTEM AND METHOD FOR BI-DIRECTIONAL TRANSLATION USING SUM-PRODUCT NETWORKS

    公开(公告)号:US20210390269A1

    公开(公告)日:2021-12-16

    申请号:US16900481

    申请日:2020-06-12

    IPC分类号: G06F40/58 G06N3/04 G06N3/08

    摘要: A method and machine translation system for bi-directional translation of textual sequences between a first language and a second language are described. The machine translation system includes a first autoencoder configured to receive a vector representation of a first textual sequence in the first language and encode the vector representation of the first textual sequence into a first sentence embedding. The machine translation system also includes a sum-product network (SPN) configured to receive the first sentence embedding and generate a second sentence embedding by maximizing a first conditional probability of the second sentence embedding given the first sentence embedding and a second autoencoder receiving the second sentence embedding, the second autoencoder being trained to decode the second sentence embedding into a vector representation of a second textual sequence in the second language.

    Semi-supervised regression with generative adversarial networks

    公开(公告)号:US11003995B2

    公开(公告)日:2021-05-11

    申请号:US15789518

    申请日:2017-10-20

    IPC分类号: G06N3/04 G06N3/08

    摘要: Method and system for performing semi-supervised regression with a generative adversarial network (GAN) that includes a generator comprising a first neural network and a discriminator comprising a second neural network, comprising: outputting, from the first neural network, generated samples derived from a random noise vector; inputting, to the second neural network, the generated samples, a plurality of labelled training samples, and a plurality of unlabelled training samples; and outputting, from the second neural network, a predicted continuous label for each of a plurality of the generated samples and unlabelled samples.

    System and method for bi-directional translation using sum-product networks

    公开(公告)号:US11586833B2

    公开(公告)日:2023-02-21

    申请号:US16900481

    申请日:2020-06-12

    IPC分类号: G06F40/58 G06N3/08 G06N3/04

    摘要: A method and machine translation system for bi-directional translation of textual sequences between a first language and a second language are described. The machine translation system includes a first autoencoder configured to receive a vector representation of a first textual sequence in the first language and encode the vector representation of the first textual sequence into a first sentence embedding. The machine translation system also includes a sum-product network (SPN) configured to receive the first sentence embedding and generate a second sentence embedding by maximizing a first conditional probability of the second sentence embedding given the first sentence embedding and a second autoencoder receiving the second sentence embedding, the second autoencoder being trained to decode the second sentence embedding into a vector representation of a second textual sequence in the second language.

    TRANSFORMER-BASED AUTOMATIC SPEECH RECOGNITION SYSTEM INCORPORATING TIME-REDUCTION LAYER

    公开(公告)号:US20220122590A1

    公开(公告)日:2022-04-21

    申请号:US17076794

    申请日:2020-10-21

    IPC分类号: G10L15/16 G10L15/06

    摘要: Computer implemented method and system for automatic speech recognition. A first speech sequence is processed, using a time reduction operation of an encoder NN, into a second speech sequence that comprises a second set of speech frame feature vectors that each concatenate information from a respective plurality of speech frame feature vectors included in the first set, wherein the second speech sequence includes fewer speech frame feature vectors than the first speech sequence. The second speech sequence is transformed, using a self-attention operation of the encoder NN, into a third speech sequence that comprises a third set of speech frame feature vectors. The third speech sequence is processed, using a probability operation of the encoder NN, to predict a sequence of first labels corresponding to the third set of speech frame feature vectors. The third speech sequence is also processed using a decoder NN to predict a sequence of second labels corresponding to the third set of speech frame feature vectors.

    SELF-TRAINING METHOD AND SYSTEM FOR SEMI-SUPERVISED LEARNING WITH GENERATIVE ADVERSARIAL NETWORKS

    公开(公告)号:US20190122120A1

    公开(公告)日:2019-04-25

    申请号:US15789628

    申请日:2017-10-20

    IPC分类号: G06N3/08 G06N3/04

    摘要: A method and system for augmenting a training dataset for a generative adversarial network (GAN). The training dataset includes labelled data samples and unlabelled data samples. The method includes: receiving generated samples generated using a first neural network of the GAN and the unlabelled samples of training dataset; determining a decision value for a sample from a decision function, wherein the sample is a generated sample of the generated samples or an unlabelled sample of the unlabelled samples of the training dataset; comparing the decision value to a threshold; in response to determining that the decision value exceeds the threshold: predicting a label for a sample; assigning the label to the sample; and augmenting the training dataset to include the sample with the assigned label as a labelled sample.

    METHODS, DEVICES AND MEDIA FOR IMPROVING KNOWLEDGE DISTILLATION USING INTERMEDIATE REPRESENTATIONS

    公开(公告)号:US20220335303A1

    公开(公告)日:2022-10-20

    申请号:US17233323

    申请日:2021-04-16

    IPC分类号: G06N3/08 G06N3/04

    摘要: Methods, devices and processor-readable media for knowledge distillation using intermediate representations are described. A student model is trained using a Dropout-KD approach in which intermediate layer selection is performed efficiently such that the skip, search, and overfitting problems in intermediate layer KD may be solved. Teacher intermediate layers are selected randomly at each training epoch, with the layer order preserved to avoid breaking information flow. Over the course of multiple training epochs, all of the teacher intermediate layers are used for knowledge distillation. A min-max data augmentation method is also described based on the intermediate layer selection of the Dropout-KD training method.

    Self-training method and system for semi-supervised learning with generative adversarial networks

    公开(公告)号:US11120337B2

    公开(公告)日:2021-09-14

    申请号:US15789628

    申请日:2017-10-20

    IPC分类号: G06N3/04 G06N3/08

    摘要: A method and system for augmenting a training dataset for a generative adversarial network (GAN). The training dataset includes labelled data samples and unlabelled data samples. The method includes: receiving generated samples generated using a first neural network of the GAN and the unlabelled samples of training dataset; determining a decision value for a sample from a decision function, wherein the sample is a generated sample of the generated samples or an unlabelled sample of the unlabelled samples of the training dataset; comparing the decision value to a threshold; in response to determining that the decision value exceeds the threshold: predicting a label for a sample; assigning the label to the sample; and augmenting the training dataset to include the sample with the assigned label as a labelled sample.

    Transformer-based automatic speech recognition system incorporating time-reduction layer

    公开(公告)号:US11715461B2

    公开(公告)日:2023-08-01

    申请号:US17076794

    申请日:2020-10-21

    IPC分类号: G10L15/16 G10L15/06

    CPC分类号: G10L15/16 G10L15/063

    摘要: Computer implemented method and system for automatic speech recognition. A first speech sequence is processed, using a time reduction operation of an encoder NN, into a second speech sequence comprising a second set of speech frame feature vectors that each concatenate information from a respective plurality of speech frame feature vectors included in the first set and includes fewer speech frame feature vectors than the first speech sequence. The second speech sequence is transformed, using a self-attention operation of the encoder NN, into a third speech sequence comprising a third set of speech frame feature vectors. The third speech sequence is processed using a probability operation of the encoder NN, to predict a sequence of first labels corresponding to the third set of speech frame feature vectors, and using a decoder NN to predict a sequence of second labels corresponding to the third set of speech frame feature vectors.