-
1.
公开(公告)号:US20240054999A1
公开(公告)日:2024-02-15
申请号:US18297509
申请日:2023-04-07
Applicant: Samsung Electronics Co., Ltd.
Inventor: Cindy Sushen Tseng , Srinivasa Rao Ponakala , Myungjong Kim , Taeyeon Ki , Vijendra Raj Apsingekar
CPC classification number: G10L15/22 , G10L15/08 , G10L25/51 , G10L2015/223 , G10L2015/088
Abstract: A method includes obtaining an audio input and a location associated with an electronic device. The method also includes generating an audio embedding associated with the audio input. The method further includes determining a first difference between the audio embedding associated with the audio input and an audio embedding associated with a known user. The method also includes determining a second difference between the location associated with the electronic device and a known location associated with the known user. The method further includes generating, using a false trigger mitigation (FTM) system, a probability of the audio input including a false trigger for automatic speech recognition based on the audio input, the first difference, and the second difference. In addition, the method includes determining whether to perform automatic speech recognition based on the probability.
-
公开(公告)号:US20230419979A1
公开(公告)日:2023-12-28
申请号:US18046041
申请日:2022-10-12
Applicant: Samsung Electronics Co., Ltd.
Inventor: Myungjong Kim , Taeyeon Ki , Vijendra Raj Apsingekar , Sungjae Park , SeungBeom Ryu , Hyuk Oh
IPC: G10L21/028 , G10L17/06 , G10L17/02
CPC classification number: G10L21/028 , G10L17/06 , G10L17/02
Abstract: A method includes obtaining at least a portion of an audio stream containing speech activity. At least the portion of the audio stream includes multiple segments. The method also includes, for each of the multiple segments, generating an embedding vector that represents the segment. The method further includes, within each of multiple local windows, clustering the embedding vectors into one or more clusters to perform speaker identification. Different clusters correspond to different speakers. The method also includes presenting at least one first sequence of speaker identities based on the speaker identification performed for the local windows. The method further includes, within each of multiple global windows, clustering the embedding vectors into one or more clusters to perform speaker identification. Each global window includes two or more local windows. In addition, the method includes presenting at least one second sequence of speaker identities based on the speaker identification performed for the global windows.
-
3.
公开(公告)号:US12087307B2
公开(公告)日:2024-09-10
申请号:US17538604
申请日:2021-11-30
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Myungjong Kim , Vijendra Raj Apsingekar , Aviral Anshu , Taeyeon Ki
IPC: G10L17/06 , G10L17/02 , G10L17/18 , G10L21/0272 , G10L21/0308
CPC classification number: G10L17/06 , G10L17/02 , G10L17/18 , G10L21/0272 , G10L21/0308
Abstract: An apparatus for processing speech data may include a processor configured to: separate an input speech into speech signals; identify a bandwidth of each of the speech signals; extract speaker embeddings from the speech signals based on the bandwidth of each of the speech signals, using at least one neural network configured to receive the speech signals and output the speaker embeddings; and cluster the speaker embeddings into one or more speaker clusters, each speaker cluster corresponding to a speaker identity.
-
公开(公告)号:US20230419962A1
公开(公告)日:2023-12-28
申请号:US18047609
申请日:2022-10-18
Applicant: Samsung Electronics Co., Ltd.
Inventor: Myungjong Kim , Taeyeon Ki , Cindy Sushen Tseng , Srinivasa Rao Ponakala , Vijendra Raj Apsingekar
CPC classification number: G10L15/22 , G10L2015/088 , G10L15/08
Abstract: A method includes obtaining audio data and identifying an utterance of a wake word or phrase in the audio data. The method also includes generating an embedding vector based on the utterance from the audio data and accessing a set of previously-generated vectors representing previous utterances of the wake word or phrase. The method further includes performing clustering on the embedding vector and the set of previously-generated vectors to identify a cluster including the embedding vector, where the identified cluster is associated with a speaker. The method also includes updating a speaker vector associated with the speaker based on the embedding vector and determining, using a speaker verification model, a similarity score between the updated speaker vector and the embedding vector. In addition, the method includes determining, based on the similarity score, whether a speaker providing the utterance matches the speaker associated with the identified cluster.
-
公开(公告)号:US20230117535A1
公开(公告)日:2023-04-20
申请号:US17502838
申请日:2021-10-15
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Vijendra Raj Apsingekar , Myungjong Kim , Anil Yadav
IPC: G10L15/02 , G10L25/30 , G06F40/279 , G10L15/26
Abstract: A method and system are provided. The method includes receiving an audio input, in response to the audio input being unrecognized by an audio recognition model, identifying contextual information, determining whether the contextual information corresponds to the audio input, and in response to determining that the contextual information corresponds to the audio input, causing training of a neural network associated with the audio recognition model based on the contextual information and the audio input.
-
公开(公告)号:US09694235B2
公开(公告)日:2017-07-04
申请号:US14324898
申请日:2014-07-07
Applicant: Samsung Electronics Co., Ltd.
Inventor: Heungno Oh , Myungjong Kim , Jongchan Kim , Jongpil Kim , Juyeon Kim , Eunnam Song , Jounggeun Jo
IPC: A63B24/00 , A63B22/06 , A63B71/06 , G06F3/01 , A63F13/90 , A63F13/245 , A63F13/816 , A63B22/00 , A63B71/00
CPC classification number: A63B22/0605 , A63B22/0023 , A63B24/0087 , A63B71/0622 , A63B2024/009 , A63B2024/0096 , A63B2071/0081 , A63B2071/0638 , A63B2071/0666 , A63B2220/78 , A63B2220/805 , A63B2225/20 , A63B2225/50 , A63B2230/06 , A63F13/245 , A63F13/816 , A63F13/90 , G06F3/011 , G06F2203/012
Abstract: A virtual biking system and a virtual hiking method by which a user can bike indoors while simulating actually being outdoors are provided. The virtual hiking system includes a Personal Computer (PC) configured to display a simulation screen, a motion platform configured to move corresponding to a state of a road of the simulation screen displayed on the PC, and a bicycle fixed onto the motion platform such that the simulation screen is changed corresponding to a movement and a speed of a handle.
-
-
-
-
-