METHOD, ELECTRONIC DEVICE, AND STORAGE MEDIUM FOR DISTILLING MODEL

    公开(公告)号:US20210383233A1

    公开(公告)日:2021-12-09

    申请号:US17101748

    申请日:2020-11-23

    Abstract: The disclosure discloses a method for distilling a model, an electronic device, and a storage medium, and relates to the field of deep learning technologies. A teacher model and a student model are obtained. The second intermediate fully connected layer is transformed into an enlarged fully connected layer and a reduced fully connected layer based on a first data processing capacity of a first intermediate fully connected layer of the teacher model and a second data processing capacity of a second intermediate fully connected layer of the student model. The second intermediate fully connected layer is replaced with the enlarged fully connected layer and the reduced fully connected layer to generate a training student model. The training student model is distilled based on the teacher model.

    ARTIFICIAL INTELLIGENCE BASED METHOD AND APPARATUS FOR CHECKING TEXT

    公开(公告)号:US20180349350A1

    公开(公告)日:2018-12-06

    申请号:US15921386

    申请日:2018-03-14

    Abstract: This disclosure discloses an artificial intelligence based method and apparatus for checking a text. An embodiment of the method comprises: lexing a first to-be-checked text and a second to-be-checked text respectively, determining word vectors of the lexed words to generate a first word vector sequence and a second word vector sequence; inputting the first word vector sequence and the second word vector sequence respectively into a pre-trained convolutional neural network containing at least one multi-scale convolutional layer, identifying vector sequences in a plurality of vector sequences outputted by a last multi-scale convolutional layer as eigenvector sequences, to obtain eigenvector sequence groups respectively corresponding to the texts; combining eigenvector sequences in each eigenvector sequence group to generate a combined eigenvector sequence; and analyzing the generated combined eigenvector sequences to determine whether the first text and the second text pass a similarity check. The embodiment improves the flexibility in checking a text.

    METHOD AND APPARATUS FOR CORRECTING CHARACTER ERRORS, ELECTRONIC DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20210390248A1

    公开(公告)日:2021-12-16

    申请号:US16950975

    申请日:2020-11-18

    Abstract: A method and apparatus for correcting character errors, an electronic device and a storage medium are disclosed, which relates to the natural language processing field and the deep learning field. The method may include: for a character to be processed, acquiring the score of each character in a pre-constructed vocabulary, the score being a score of the reasonability of the character in the vocabulary at the position of the character to be processed; selecting top K characters as candidates of the character to be processed, K being a positive integer greater than one; selecting an optimal candidate from the K candidates; and replacing the character to be processed with the optimal candidate if the optimal candidate is different from the character to be processed. With the solution of the present application, the accuracy of an error correction result, or the like, may be improved.

    METHOD AND APPARATUS FOR GENERATING DIALOGUE MODEL

    公开(公告)号:US20210200957A1

    公开(公告)日:2021-07-01

    申请号:US16895297

    申请日:2020-06-08

    Abstract: Embodiments of the present disclosure relate to a method and apparatus for generating a dialogue model. The method may include: acquiring a corpus sample set, a corpus sample including input information and target response information; classifying corpus samples in the corpus sample set, setting discrete hidden variables for the corpus samples based on a classification result to generate a training sample set, a training sample including the input information, the target response information, and a discrete hidden variable; and training a preset neural network using the training sample set to obtain the dialogue model, the dialogue model being used to represent a corresponding relationship between inputted input information and outputted target response information.

Patent Agency Ranking