-
公开(公告)号:US20220293088A1
公开(公告)日:2022-09-15
申请号:US17499072
申请日:2021-10-12
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Sivakumar BALASUBRAMANIAN , Gowtham SRINIVASAN , Srinivasa Rao PONAKALA , Anil Sunder YADAV , Aditya Jajodia
Abstract: A method of generating a trained trigger word detection model includes training an auxiliary model, based on an auxiliary task, to concentrate on one or more utterances and/or learn context of the one or more utterances using generic single word and/or phrase training data: and obtaining a trigger word detection model by retraining one or more final layers of the auxiliary model, which is weighted based on the auxiliary task, based on a trigger word detection task that detects one or more trigger words. The retraining uses training data specific to the one or more trigger words.
-
公开(公告)号:US20250149031A1
公开(公告)日:2025-05-08
申请号:US18816659
申请日:2024-08-27
Applicant: Samsung Electronics Co., Ltd.
Inventor: Aditya Jajodia , Akash Sahoo , Patrick Hegarty , Divya Neelagiri , Vijendra Raj Apsingekar
IPC: G10L15/197 , G10L13/02 , G10L15/06
Abstract: A method includes identifying, using an automated speech recognition (ASR) system, at least one named entity hypothesis from at least one audio input. The method also can include providing, using the ASR system, the identified at least one named entity to a large language model (LLM). The method also can include generating a prompt using an automated prompt generator. The method also can include processing, using the LLM, the identified at least one named entity hypothesis and the prompt to generate updated named entity recognition data. The method also can include providing the updated named entity recognition data back to the ASR system.
-
3.
公开(公告)号:US20240304179A1
公开(公告)日:2024-09-12
申请号:US18596406
申请日:2024-03-05
Applicant: Samsung Electronics Co., Ltd.
Inventor: Euisung Kim , Aditya Jajodia , Cindy Sushen Tseng , Divya Neelagiri , Taeyeon Ki , Vijendra Raj Apsingekar
CPC classification number: G10L15/063 , G10L25/30
Abstract: A method includes receiving, by an automatic speech recognition (ASR)-based spoken language understanding (SLU) model, an input utterance using an audio input device. The method also includes, for each token of the input utterance, generating, using a shared ASR encoder of the ASR-based SLU model, an acoustic representation of acoustic features of the token (the shared ASR encoder including a first adapter layer); determining, using an ASR decoder of the ASR-based SLU model, a text representation of the token using the acoustic representation and any previous tokens (the ASR decoder including a second adapter layer); combining, using a fusion model of the ASR-based SLU model, the text representation and the acoustic representation to generate a joint representation, and determining, using an SLU decoder of the ASR-based SLU model, a semantic label associated with the token based on the joint representation and any previous semantic labels.
-
公开(公告)号:US12236939B2
公开(公告)日:2025-02-25
申请号:US17499072
申请日:2021-10-12
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Sivakumar Balasubramanian , Gowtham Srinivasan , Srinivasa Rao Ponakala , Anil Sunder Yadav , Aditya Jajodia
Abstract: A method of generating a trained trigger word detection model includes training an auxiliary model, based on an auxiliary task, to concentrate on one or more utterances and/or learn context of the one or more utterances using generic single word and/or phrase training data; and obtaining a trigger word detection model by retraining one or more final layers of the auxiliary model, which is weighted based on the auxiliary task, based on a trigger word detection task that detects one or more trigger words. The retraining uses training data specific to the one or more trigger words.
-
-
-