-
1.
公开(公告)号:US20230386450A1
公开(公告)日:2023-11-30
申请号:US18303394
申请日:2023-04-19
Applicant: Samsung Electronics Co., Ltd.
Inventor: Brendon Christopher Beachy Eby , Suhel Jaber , Sai Ajay Modukuri , Omar Abdelwahab , Ankit Goyal
IPC: G10L15/06 , G10L15/183
CPC classification number: G10L15/063 , G10L2015/0636 , G10L15/183
Abstract: A method includes determining, using at least one processing device of an electronic device, a target embedding vector for each class of a plurality of classes. The method also includes generating, using the at least one processing device, an utterance embedding vector using a pre-trained language model, where the utterance embedding vector represents an input utterance associated with an expected class. The method further includes obtaining, using the at least one processing device, a predicted class associated with the input utterance based on distances of the utterance embedding vector to spatial parameters representing the plurality of classes, where the spatial parameter of each class is based on the target embedding vector associated with that class. In addition, the method includes updating, using the at least one processing device, parameters of the language model based on a difference between the predicted class and the expected class.
-
公开(公告)号:US20230385546A1
公开(公告)日:2023-11-30
申请号:US18315931
申请日:2023-05-11
Applicant: Samsung Electronics Co., Ltd.
Inventor: Brendon Christopher Beachy Eby , Suhel Jaber , Sai Ajay Modukuri , Omar Abdelwahab , Ankit Goyal
IPC: G06F40/284
CPC classification number: G06F40/284
Abstract: A method includes receiving an input utterance that is a continuation of a previous utterance. The method also includes, using a trained Siamese network, determining input utterance embeddings representing tokens from the input utterance, pooling the input utterance embeddings with a context token embedding representing a class associated with the previous utterance to generate a representative input utterance embedding, and determining a representative embedding associated with each of multiple possible classes. Each possible class is associated with first and second threshold boundaries. The method further includes, using the trained Siamese network, determining a similarity score for each possible class based on a distance between the representative input utterance embedding and a selected threshold boundary of the representative embedding for that possible class and identifying a class for the input utterance based on the determined similarity scores. In addition, the method includes performing an action corresponding to the identified class.
-
3.
公开(公告)号:US20240054342A1
公开(公告)日:2024-02-15
申请号:US18336687
申请日:2023-06-16
Applicant: Samsung Electronics Co., Ltd.
Inventor: Suhel Jaber , Brendon Christopher Beachy Eby , Sai Ajay Modukuri
IPC: G06N3/08 , G06N3/048 , G06N3/0455
CPC classification number: G06N3/08 , G06N3/048 , G06N3/0455
Abstract: A method includes obtaining an input containing multiple tokens. The method also includes processing the input using a machine learning model. Processing the input includes performing attention over both (i) multiple dimensions of the tokens contained in the input and (ii) multiple dimensions of embedding vectors used to represent the tokens contained in the input so that different dimensions of each of at least some of the tokens are weighted differently. In addition, the method includes generating an output embedding vector for a query token of the multiple tokens based on the attention.
-
公开(公告)号:US20240020477A1
公开(公告)日:2024-01-18
申请号:US18306881
申请日:2023-04-25
Applicant: Samsung Electronics Co., Ltd.
Inventor: Sai Ajay Modukuri , Brendon Christopher Beachy Eby , Suhel Jaber
IPC: G06F40/284 , G06F40/40
CPC classification number: G06F40/284 , G06F40/40
Abstract: A method includes providing embedding vectors representing tokens in an input to a transformer comprising multiple transformer layers arranged in a sequence, each transformer layer having a residual connection to each previous transformer layer. The method also includes, for each transformer layer, determining, for a first token, an input embedding vector based on a combination of output embedding vectors from previous transformer layers. The method further includes, for each transformer layer, processing, for the first token, the input embedding vector to generate an output embedding vector to be provided to each subsequent transformer layer.
-
-
-