Patent search ap:("Samsung Electronics Co. Ltd.") AND inv:"Pu Song"

1.

Granted patent
System and method for multi-spoken language detection (Active)

Publication number: US11322136B2

Publication date: 2022-05-03

Application number: US16731488

Filing date: 2019-12-31

Applicant: Samsung Electronics Co., Ltd.

Inventor: Vijendra R. Apsingekar, Pu Song, Mohammad M. Moazzami, Asif Ali

IPC: G10L15/00, G10L15/197, G10L15/16, G10L15/02, G06N3/08, G06N7/00, G10L15/22

Abstract: A method includes performing, using at least one processor, feature extraction of input audio data to identify extracted features associated with the input audio data. The method also includes detecting, using the at least one processor, a language associated with each of multiple portions of the input audio data by processing the extracted features using a plurality of language models, where each language model is associated with a different language. In addition, the method includes directing, using the at least one processor, each portion of the input audio data to one of a plurality of automatic speech recognition (ASR) models based on the language associated with the portion of the input audio data.
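
The flow described in this abstract (extract features, score each portion against one language model per language, then direct the portion to the matching ASR model) can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the patent: the two-value toy features, the scoring lambdas standing in for language models, and the placeholder ASR functions are assumptions chosen only to keep the example runnable.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Features = Tuple[float, float]


def extract_features(portion: Sequence[float]) -> Features:
    """Toy feature extraction: mean and mean absolute value of the samples."""
    n = len(portion)
    mean = sum(portion) / n
    mean_abs = sum(abs(x) for x in portion) / n
    return (mean, mean_abs)


def detect_language(
    features: Features,
    language_models: Dict[str, Callable[[Features], float]],
) -> str:
    """Return the language whose model gives the highest score."""
    return max(language_models, key=lambda lang: language_models[lang](features))


def route_portions(
    portions: Sequence[Sequence[float]],
    language_models: Dict[str, Callable[[Features], float]],
    asr_models: Dict[str, Callable[[Sequence[float]], str]],
) -> List[Tuple[str, str]]:
    """Detect a language per portion and send that portion to its ASR model."""
    results = []
    for portion in portions:
        lang = detect_language(extract_features(portion), language_models)
        results.append((lang, asr_models[lang](portion)))
    return results


# Hypothetical stand-ins: each "language model" scores how close the
# mean absolute value is to an arbitrary target picked for this demo.
LANGUAGE_MODELS = {
    "en": lambda f: -abs(f[1] - 0.2),
    "ko": lambda f: -abs(f[1] - 0.8),
}

# Hypothetical stand-ins for per-language ASR models.
ASR_MODELS = {
    "en": lambda portion: f"[en ASR] {len(portion)} samples",
    "ko": lambda portion: f"[ko ASR] {len(portion)} samples",
}

portions = [[0.2, -0.2, 0.2], [0.8, -0.8, 0.8, -0.8]]
routed = route_portions(portions, LANGUAGE_MODELS, ASR_MODELS)
for lang, transcript in routed:
    print(lang, transcript)
```

A real system would presumably replace the toy features with acoustic features and the lambdas with trained models; the sketch only shows the per-portion detect-then-route control flow.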

2.

Patent application
SYSTEM AND METHOD FOR MULTI-SPOKEN LANGUAGE DETECTION (Under examination, published)

Publication number: US20200219492A1

Publication date: 2020-07-09

Application number: US16731488

Filing date: 2019-12-31

Applicant: Samsung Electronics Co., Ltd.

Inventor: Vijendra R. Apsingekar, Pu Song, Mohammad M. Moazzami, Asif Ali

IPC: G10L15/197, G10L15/16, G10L15/00, G10L15/22, G10L15/02, G06N3/08, G06N7/00

Abstract: A method includes performing, using at least one processor, feature extraction of input audio data to identify extracted features associated with the input audio data. The method also includes detecting, using the at least one processor, a language associated with each of multiple portions of the input audio data by processing the extracted features using a plurality of language models, where each language model is associated with a different language. In addition, the method includes directing, using the at least one processor, each portion of the input audio data to one of a plurality of automatic speech recognition (ASR) models based on the language associated with the portion of the input audio data.

3.

Patent application
SYSTEM AND METHOD FOR LANGUAGE MODEL PERSONALIZATION (Under examination, published)

Publication number: US20190279618A1

Publication date: 2019-09-12

Application number: US16227209

Filing date: 2018-12-20

Applicant: Samsung Electronics Co., Ltd

Inventor: Anil Yadav, Abdul Rafay Khalid, Alireza Dirafzoon, Mohammad Mahdi Moazzami, Pu Song, Zheng Zhou

IPC: G10L15/197, G10L15/22, G10L15/30

Abstract: A method, an electronic device, and computer readable medium is provided. The method includes identifying a set of observable features associated with one or more users. The method also includes generating latent features from the set of observable features. The method additionally includes sorting the latent features into one or more clusters. Each of the one or more clusters represents verbal utterances of a group of users that share a portion of the latent features. The method further includes generating a language model that corresponds to a specific cluster of the one or more clusters. The language model represents a probability ranking of the verbal utterances that are associated with the group of users of the specific cluster.
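
The steps in this abstract (observable features, latent features, clusters, then one language model per cluster expressed as a probability ranking of utterances) can be sketched as follows. This is an illustration under stated assumptions, not the patented method: the latent feature is simply the mean of the observable features, clustering is a single nearest-centroid assignment against fixed centroids, and the "language model" is a relative-frequency ranking of whole utterances. All names and data are hypothetical.

```python
from collections import Counter, defaultdict
from typing import Dict, List, Sequence, Tuple


def latent_feature(observable: Sequence[float]) -> float:
    """Assumed stand-in for latent-feature generation: mean of the inputs."""
    return sum(observable) / len(observable)


def assign_cluster(latent: float, centroids: Sequence[float]) -> int:
    """Assign a latent value to the index of the nearest centroid."""
    return min(range(len(centroids)), key=lambda i: abs(latent - centroids[i]))


def build_cluster_models(
    users: Sequence[dict],
    centroids: Sequence[float],
) -> Dict[int, List[Tuple[str, float]]]:
    """Group users' utterances by cluster and rank them by probability."""
    utterances_by_cluster: Dict[int, List[str]] = defaultdict(list)
    for user in users:
        cluster = assign_cluster(latent_feature(user["features"]), centroids)
        utterances_by_cluster[cluster].extend(user["utterances"])

    models = {}
    for cluster, utterances in utterances_by_cluster.items():
        counts = Counter(utterances)
        total = sum(counts.values())
        # most_common() orders by count, so the list is a probability ranking.
        models[cluster] = [(u, n / total) for u, n in counts.most_common()]
    return models


# Hypothetical users with made-up observable features and utterances.
users = [
    {"features": [0.1, 0.3], "utterances": ["play music", "play music", "set alarm"]},
    {"features": [0.2, 0.2], "utterances": ["play music"]},
    {"features": [0.9, 0.7], "utterances": ["call mom", "call mom", "send text"]},
]
centroids = [0.25, 0.75]  # assumed fixed centroids for the demo

models = build_cluster_models(users, centroids)
for cluster, ranking in sorted(models.items()):
    print(cluster, ranking)
```

The first two users fall into one cluster and share a ranking, while the third gets a separate one, which mirrors the idea of a model per group of users that share latent features.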

4.

Granted patent
System and method for multi-spoken language detection (Active)

Publication number: US11967315B2

Publication date: 2024-04-23

Application number: US17660335

Filing date: 2022-04-22

Applicant: Samsung Electronics Co., Ltd.

Inventor: Vijendra R. Apsingekar, Pu Song, Mohammad M. Moazzami, Asif Ali

IPC: G10L15/00, G06N3/08, G06N7/01, G10L15/02, G10L15/16, G10L15/197, G10L15/22

CPC classification number: G10L15/197, G06N3/08, G06N7/01, G10L15/005, G10L15/02, G10L15/16, G10L15/22, G10L2015/223

Abstract: A method includes performing, using at least one processor, feature extraction of input audio data to identify extracted features associated with the input audio data. The method also includes detecting, using the at least one processor, a language associated with the input audio data by processing the extracted features using a plurality of language models, where each language model is associated with a different language. The method further includes directing, using the at least one processor, the input audio data to one of a plurality of automatic speech recognition (ASR) models based on the language associated with the input audio data.

5.

Patent application
SYSTEM AND METHOD FOR MULTI-SPOKEN LANGUAGE DETECTION (Active)

Publication number: US20220246143A1

Publication date: 2022-08-04

Application number: US17660335

Filing date: 2022-04-22

Applicant: Samsung Electronics Co., Ltd.

Inventor: Vijendra R. Apsingekar, Pu Song, Mohammad M. Moazzami, Asif Ali

IPC: G10L15/197, G10L15/16, G10L15/00, G10L15/02, G06N3/08, G06N7/00, G10L15/22

Abstract: A method includes performing, using at least one processor, feature extraction of input audio data to identify extracted features associated with the input audio data. The method also includes detecting, using the at least one processor, a language associated with the input audio data by processing the extracted features using a plurality of language models, where each language model is associated with a different language. The method further includes directing, using the at least one processor, the input audio data to one of a plurality of automatic speech recognition (ASR) models based on the language associated with the input audio data.

6.

Granted patent
System and method for language model personalization (Active)

Publication number: US11106868B2

Publication date: 2021-08-31

Application number: US16227209

Filing date: 2018-12-20

Applicant: Samsung Electronics Co., Ltd

Inventor: Anil Yadav, Abdul Rafay Khalid, Alireza Dirafzoon, Mohammad Mahdi Moazzami, Pu Song, Zheng Zhou

IPC: G06F40/279, G10L15/22, G10L15/183, G06F40/20, G10L15/197, G10L15/30

Abstract: A method, an electronic device, and computer readable medium is provided. The method includes identifying a set of observable features associated with one or more users. The method also includes generating latent features from the set of observable features. The method additionally includes sorting the latent features into one or more clusters. Each of the one or more clusters represents verbal utterances of a group of users that share a portion of the latent features. The method further includes generating a language model that corresponds to a specific cluster of the one or more clusters. The language model represents a probability ranking of the verbal utterances that are associated with the group of users of the specific cluster.
