Fusion of neural networks
    3.
    发明授权

    公开(公告)号:US11574181B2

    公开(公告)日:2023-02-07

    申请号:US16406426

    申请日:2019-05-08

    IPC分类号: G06N3/08 G06N3/04

    摘要: Fusion of neural networks is performed by obtaining a first neural network and a second neural network. The first and the second neural networks are the result of a parent neural network subjected to different training. A similarity score is calculated of a first component of the first neural network and a corresponding second component of the second neural network. An interpolation weight is determined for the first and the second components by using the similarity score. A neural network parameter of the first component is updated based on the interpolation weight and a corresponding neural network parameter of the second component to obtain a fused neural network.

    ALTERNATIVE SOFT LABEL GENERATION

    公开(公告)号:US20220188622A1

    公开(公告)日:2022-06-16

    申请号:US17118139

    申请日:2020-12-10

    摘要: An approach to identifying alternate soft labels for training a student model may be provided. A teaching model may generate a soft label for a labeled training data. The training data can be an acoustic file for speech or a spoken natural language. A pool of soft labels previously generated by teacher models can be searched at the label level to identify soft labels that are similar to the generated soft label. The similar soft labels can have similar length or sequence at the word phoneme, and/or state level. The identified similar soft labels can be used in conjunction with the generated soft label to train a student model.

    Training data modification for training model

    公开(公告)号:US11011156B2

    公开(公告)日:2021-05-18

    申请号:US16381426

    申请日:2019-04-11

    发明人: Gakuto Kurata

    摘要: A computer-implemented method for training a model is disclosed. The model is capable of retaining a history of one or more preceding elements and has a direction of prediction. The method includes obtaining a training sequence of elements. The method also includes splitting the training sequence into a plurality of parts. The method further includes selecting one part of the plurality of the parts depending on the direction of the model to generate a modified training data. The method includes further training the model using the modified training data.

    Implementing a classification model for recognition processing

    公开(公告)号:US10990902B2

    公开(公告)日:2021-04-27

    申请号:US16582343

    申请日:2019-09-25

    发明人: Gakuto Kurata

    摘要: A method, system, and computer program product for learning a recognition model for recognition processing. The method includes preparing one or more examples for learning, each of which includes an input segment, an additional segment adjacent to the input segment and an assigned label. The input segment and the additional segment are extracted from an original training data. A classification model is trained, using the input segment and the additional segment in the examples, to initialize parameters of the classification model so that extended segments including the input segment and the additional segment are reconstructed from the input segment. Then, the classification model is tuned to predict a target label, using the input segment and the assigned label in the examples, based on the initialized parameters. At least a portion of the obtained classification model is included in the recognition model.

    NEURAL NETWORK FOR CHEMICAL COMPOUNDS

    公开(公告)号:US20210110240A1

    公开(公告)日:2021-04-15

    申请号:US17132631

    申请日:2020-12-23

    IPC分类号: G06N3/04 G16C99/00 G16C20/70

    摘要: A computer implemented method for training a neural network to capture a structural feature specific to a set of chemical compounds is disclosed. In the method, the computer system reads an expression describing a structure of the chemical compound for each chemical compound in the set and enumerates one or more combinations of a position and a type of a structural element appearing in the expression for each chemical compound in the set. The computer system also generates training data based on the one or more enumerated combinations for each chemical compound in the set. The training data includes one or more values with a length, each of which indicates whether or not a corresponding type of the structural element appears at a corresponding position for each combination. Furthermore, the computer system trains the neural network based on the training data for the set of the chemical compounds.

    ALIGNING SPIKE TIMING OF MODELS
    10.
    发明申请

    公开(公告)号:US20210082399A1

    公开(公告)日:2021-03-18

    申请号:US16570022

    申请日:2019-09-13

    摘要: A technique for aligning spike timing of models is disclosed. A first model having a first architecture trained with a set of training samples is generated. Each training sample includes an input sequence of observations and an output sequence of symbols having different length from the input sequence. Then, one or more second models are trained with the trained first model by minimizing a guide loss jointly with a normal loss for each second model and a sequence recognition task is performed using the one or more second models. The guide loss evaluates dissimilarity in spike timing between the trained first model and each second model being trained.