-
公开(公告)号:US11322136B2
公开(公告)日:2022-05-03
申请号:US16731488
申请日:2019-12-31
Applicant: Samsung Electronics Co., Ltd.
Inventor: Vijendra R. Apsingekar , Pu Song , Mohammad M. Moazzami , Asif Ali
Abstract: A method includes performing, using at least one processor, feature extraction of input audio data to identify extracted features associated with the input audio data. The method also includes detecting, using the at least one processor, a language associated with each of multiple portions of the input audio data by processing the extracted features using a plurality of language models, where each language model is associated with a different language. In addition, the method includes directing, using the at least one processor, each portion of the input audio data to one of a plurality of automatic speech recognition (ASR) models based on the language associated with the portion of the input audio data.
-
公开(公告)号:US20200219492A1
公开(公告)日:2020-07-09
申请号:US16731488
申请日:2019-12-31
Applicant: Samsung Electronics Co., Ltd.
Inventor: Vijendra R. Apsingekar , Pu Song , Mohammad M. Moazzami , Asif Ali
Abstract: A method includes performing, using at least one processor, feature extraction of input audio data to identify extracted features associated with the input audio data. The method also includes detecting, using the at least one processor, a language associated with each of multiple portions of the input audio data by processing the extracted features using a plurality of language models, where each language model is associated with a different language. In addition, the method includes directing, using the at least one processor, each portion of the input audio data to one of a plurality of automatic speech recognition (ASR) models based on the language associated with the portion of the input audio data.
-
公开(公告)号:US20190279618A1
公开(公告)日:2019-09-12
申请号:US16227209
申请日:2018-12-20
Applicant: Samsung Electronics Co., Ltd
Inventor: Anil Yadav , Abdul Rafay Khalid , Alireza Dirafzoon , Mohammad Mahdi Moazzami , Pu Song , Zheng Zhou
IPC: G10L15/197 , G10L15/22 , G10L15/30
Abstract: A method, an electronic device, and computer readable medium is provided. The method includes identifying a set of observable features associated with one or more users. The method also includes generating latent features from the set of observable features. The method additionally includes sorting the latent features into one or more clusters. Each of the one or more clusters represents verbal utterances of a group of users that share a portion of the latent features. The method further includes generating a language model that corresponds to a specific cluster of the one or more clusters. The language model represents a probability ranking of the verbal utterances that are associated with the group of users of the specific cluster.
-
公开(公告)号:US11967315B2
公开(公告)日:2024-04-23
申请号:US17660335
申请日:2022-04-22
Applicant: Samsung Electronics Co., Ltd.
Inventor: Vijendra R. Apsingekar , Pu Song , Mohammad M. Moazzami , Asif Ali
CPC classification number: G10L15/197 , G06N3/08 , G06N7/01 , G10L15/005 , G10L15/02 , G10L15/16 , G10L15/22 , G10L2015/223
Abstract: A method includes performing, using at least one processor, feature extraction of input audio data to identify extracted features associated with the input audio data. The method also includes detecting, using the at least one processor, a language associated with the input audio data by processing the extracted features using a plurality of language models, where each language model is associated with a different language. The method further includes directing, using the at least one processor, the input audio data to one of a plurality of automatic speech recognition (ASR) models based on the language associated with the input audio data.
-
公开(公告)号:US20220246143A1
公开(公告)日:2022-08-04
申请号:US17660335
申请日:2022-04-22
Applicant: Samsung Electronics Co., Ltd.
Inventor: Vijendra R. Apsingekar , Pu Song , Mohammad M. Moazzami , Asif Ali
Abstract: A method includes performing, using at least one processor, feature extraction of input audio data to identify extracted features associated with the input audio data. The method also includes detecting, using the at least one processor, a language associated with the input audio data by processing the extracted features using a plurality of language models, where each language model is associated with a different language. The method further includes directing, using the at least one processor, the input audio data to one of a plurality of automatic speech recognition (ASR) models based on the language associated with the input audio data.
-
公开(公告)号:US11106868B2
公开(公告)日:2021-08-31
申请号:US16227209
申请日:2018-12-20
Applicant: Samsung Electronics Co., Ltd
Inventor: Anil Yadav , Abdul Rafay Khalid , Alireza Dirafzoon , Mohammad Mahdi Moazzami , Pu Song , Zheng Zhou
IPC: G06F40/279 , G10L15/22 , G10L15/183 , G06F40/20 , G10L15/197 , G10L15/30
Abstract: A method, an electronic device, and computer readable medium is provided. The method includes identifying a set of observable features associated with one or more users. The method also includes generating latent features from the set of observable features. The method additionally includes sorting the latent features into one or more clusters. Each of the one or more clusters represents verbal utterances of a group of users that share a portion of the latent features. The method further includes generating a language model that corresponds to a specific cluster of the one or more clusters. The language model represents a probability ranking of the verbal utterances that are associated with the group of users of the specific cluster.
-
-
-
-
-