-
Publication No.: US20210343277A1
Publication Date: 2021-11-04
Application No.: US17160278
Filing Date: 2021-01-27
Applicant: Samsung Electronics Co., Ltd.
Inventor: Suhel Jaber, Anil Yadav, Melvin Lobo, Sukrat Gupta
Abstract: An electronic device includes an audio sensor, a memory, and at least one processor coupled to the audio sensor and the memory. The at least one processor is configured to receive, via the audio sensor, an audio input. The at least one processor is further configured to perform, using an automatic speech recognition (ASR) model and an entity prediction model, out-of-vocabulary prediction of an entity. The at least one processor is further configured to receive an ASR hypothesis including the predicted entity. The at least one processor is further configured to output text including the predicted entity.
-
Publication No.: US20230385546A1
Publication Date: 2023-11-30
Application No.: US18315931
Filing Date: 2023-05-11
Applicant: Samsung Electronics Co., Ltd.
Inventor: Brendon Christopher Beachy Eby, Suhel Jaber, Sai Ajay Modukuri, Omar Abdelwahab, Ankit Goyal
IPC: G06F40/284
CPC classification number: G06F40/284
Abstract: A method includes receiving an input utterance that is a continuation of a previous utterance. The method also includes, using a trained Siamese network, determining input utterance embeddings representing tokens from the input utterance, pooling the input utterance embeddings with a context token embedding representing a class associated with the previous utterance to generate a representative input utterance embedding, and determining a representative embedding associated with each of multiple possible classes. Each possible class is associated with first and second threshold boundaries. The method further includes, using the trained Siamese network, determining a similarity score for each possible class based on a distance between the representative input utterance embedding and a selected threshold boundary of the representative embedding for that possible class and identifying a class for the input utterance based on the determined similarity scores. In addition, the method includes performing an action corresponding to the identified class.
-
Publication No.: US20240370701A1
Publication Date: 2024-11-07
Application No.: US18582349
Filing Date: 2024-02-20
Applicant: Samsung Electronics Co., Ltd.
Inventor: Suhel Jaber, Julia Isabel White
IPC: G06N3/0455
Abstract: A method includes receiving an input by a self-attention machine learning model and generating a set of queries using the input. This method also includes generating at least one of two sets of keys using the input and two sets of values using the input. This method also includes determining an output of the self-attention machine learning model using the two sets of keys, the two sets of values, or both. Another method includes identifying a query position for the set of queries, identifying a key position for the two sets of keys, and when the query position is determined to be equal to the key position, calculating an attention score using a first set of the two sets of keys, or, when the query position is determined to be unequal to the key position, calculating the attention score using a second set of the two sets of keys.
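The position-dependent key selection described in this abstract can be sketched as follows. This is only an illustration under stated assumptions, not the claimed implementation: the function name `dual_key_attention`, the scaled dot-product scoring, the softmax normalization, and the use of a single set of values are all choices made here for brevity and are not taken from the patent text.

```python
import math


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dual_key_attention(queries, keys_same, keys_other, values):
    """Hypothetical sketch: attention with two sets of keys.

    For query position i and key position j, the score uses keys_same[j]
    when i == j and keys_other[j] otherwise. Scaling by sqrt(dim) and the
    softmax are assumptions borrowed from standard attention.
    """
    scale = math.sqrt(len(queries[0]))
    outputs = []
    for i, q in enumerate(queries):
        scores = []
        for j in range(len(queries)):
            key = keys_same[j] if i == j else keys_other[j]
            scores.append(dot(q, key) / scale)
        weights = softmax(scores)
        # Weighted sum of the value vectors, one output per query position.
        out = [sum(w * v[d] for w, v in zip(weights, values))
               for d in range(len(values[0]))]
        outputs.append(out)
    return outputs


if __name__ == "__main__":
    q = [[1.0, 0.0], [0.0, 1.0]]
    k_same = [[10.0, 0.0], [0.0, 10.0]]
    k_other = [[0.0, 0.0], [0.0, 0.0]]
    v = [[1.0, 0.0], [0.0, 1.0]]
    print(dual_key_attention(q, k_same, k_other, v))
```

In the demo, the first key set carries large values and the second is all zeros, so each position attends almost entirely to itself. A variant with two sets of values would select the value vector with the same `i == j` test.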
-
Publication No.: US20240020477A1
Publication Date: 2024-01-18
Application No.: US18306881
Filing Date: 2023-04-25
Applicant: Samsung Electronics Co., Ltd.
Inventor: Sai Ajay Modukuri, Brendon Christopher Beachy Eby, Suhel Jaber
IPC: G06F40/284, G06F40/40
CPC classification number: G06F40/284, G06F40/40
Abstract: A method includes providing embedding vectors representing tokens in an input to a transformer comprising multiple transformer layers arranged in a sequence, each transformer layer having a residual connection to each previous transformer layer. The method also includes, for each transformer layer, determining, for a first token, an input embedding vector based on a combination of output embedding vectors from previous transformer layers. The method further includes, for each transformer layer, processing, for the first token, the input embedding vector to generate an output embedding vector to be provided to each subsequent transformer layer.
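The layer wiring in this abstract, where each layer's input combines the outputs of all earlier layers, can be sketched for a single token as below. The sketch rests on assumptions: the name `densely_connected_forward` is hypothetical, the initial embedding is treated as the first "previous output", the combination is a plain element-wise mean, and the layers are stand-in callables rather than real transformer layers. The abstract does not specify any of these details.

```python
def mean_combine(vectors):
    """Element-wise mean of equal-length vectors (an assumed combination rule)."""
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def densely_connected_forward(embedding, layers, combine=mean_combine):
    """Hypothetical sketch of a stack with residual links to every earlier layer.

    embedding: the token's initial embedding vector (list of floats).
    layers:    callables mapping a vector to a vector, standing in for
               transformer layers.
    combine:   how earlier outputs are merged into the next layer's input.
    """
    outputs = [embedding]  # assumption: the embedding counts as the first output
    for layer in layers:
        layer_input = combine(outputs)      # built from all previous outputs
        outputs.append(layer(layer_input))  # made available to later layers
    return outputs[-1]


if __name__ == "__main__":
    toy_layers = [
        lambda v: [x + 1.0 for x in v],
        lambda v: [x * 2.0 for x in v],
    ]
    print(densely_connected_forward([1.0, 2.0], toy_layers))
```

A learned weighted sum could replace `mean_combine` by passing a different `combine` callable.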
-
Publication No.: US20230386450A1
Publication Date: 2023-11-30
Application No.: US18303394
Filing Date: 2023-04-19
Applicant: Samsung Electronics Co., Ltd.
Inventor: Brendon Christopher Beachy Eby, Suhel Jaber, Sai Ajay Modukuri, Omar Abdelwahab, Ankit Goyal
IPC: G10L15/06, G10L15/183
CPC classification number: G10L15/063, G10L2015/0636, G10L15/183
Abstract: A method includes determining, using at least one processing device of an electronic device, a target embedding vector for each class of a plurality of classes. The method also includes generating, using the at least one processing device, an utterance embedding vector using a pre-trained language model, where the utterance embedding vector represents an input utterance associated with an expected class. The method further includes obtaining, using the at least one processing device, a predicted class associated with the input utterance based on distances of the utterance embedding vector to spatial parameters representing the plurality of classes, where the spatial parameter of each class is based on the target embedding vector associated with that class. In addition, the method includes updating, using the at least one processing device, parameters of the language model based on a difference between the predicted class and the expected class.
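The distance-based prediction step in this abstract can be illustrated with a nearest-target lookup. This is a minimal sketch under assumptions: the names `predict_class` and `target_vecs` are hypothetical, Euclidean distance is an assumed metric, and each class's spatial parameter is taken to be its target embedding vector itself. The language model that produces the utterance embedding and the parameter update are not shown.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def predict_class(utterance_vec, target_vecs):
    """Hypothetical sketch: pick the class whose target vector is closest.

    utterance_vec: embedding of the input utterance (list of floats).
    target_vecs:   dict mapping class name to its target embedding vector.
    """
    return min(target_vecs,
               key=lambda name: euclidean(utterance_vec, target_vecs[name]))


if __name__ == "__main__":
    targets = {"weather": [1.0, 0.0], "music": [0.0, 1.0]}
    predicted = predict_class([0.9, 0.2], targets)
    expected = "weather"
    # A training loop would update the language model whenever these differ.
    print(predicted, predicted == expected)
```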
-
Publication No.: US12020693B2
Publication Date: 2024-06-25
Application No.: US17160278
Filing Date: 2021-01-27
Applicant: Samsung Electronics Co., Ltd.
Inventor: Suhel Jaber, Anil Yadav, Melvin Lobo, Sukrat Gupta
Abstract: An electronic device includes an audio sensor, a memory, and at least one processor coupled to the audio sensor and the memory. The at least one processor is configured to receive, via the audio sensor, an audio input. The at least one processor is further configured to perform, using an automatic speech recognition (ASR) model and an entity prediction model, out-of-vocabulary prediction of an entity. The at least one processor is further configured to receive an ASR hypothesis including the predicted entity. The at least one processor is further configured to output text including the predicted entity.
-
Publication No.: US20240054342A1
Publication Date: 2024-02-15
Application No.: US18336687
Filing Date: 2023-06-16
Applicant: Samsung Electronics Co., Ltd.
Inventor: Suhel Jaber, Brendon Christopher Beachy Eby, Sai Ajay Modukuri
IPC: G06N3/08, G06N3/048, G06N3/0455
CPC classification number: G06N3/08, G06N3/048, G06N3/0455
Abstract: A method includes obtaining an input containing multiple tokens. The method also includes processing the input using a machine learning model. Processing the input includes performing attention over both (i) multiple dimensions of the tokens contained in the input and (ii) multiple dimensions of embedding vectors used to represent the tokens contained in the input so that different dimensions of each of at least some of the tokens are weighted differently. In addition, the method includes generating an output embedding vector for a query token of the multiple tokens based on the attention.