Publication No.: US11626104B2
Publication date: 2023-04-11
Application No.: US17115158
Filing date: 2020-12-08
Inventors: Soo Jin Park, Sunkuk Moon, Lae-Hoon Kim, Erik Visser
IPC classification: G10L17/00, G10L15/07, G06F1/3231, G10L15/04, G10L15/16
Abstract: A device includes processors configured to determine, in a first power mode, whether an audio stream corresponds to speech of at least two talkers. The processors are configured to, based on determining that the audio stream corresponds to speech of at least two talkers, analyze, in a second power mode, audio feature data of the audio stream to generate a segmentation result. The processors are configured to perform a comparison of a plurality of user speech profiles to an audio feature data set of a plurality of audio feature data sets of a talker-homogenous audio segment to determine whether the audio feature data set matches any of the user speech profiles. The processors are configured to, based on determining that the audio feature data set does not match any of the plurality of user speech profiles, generate a user speech profile based on the plurality of audio feature data sets.
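
The control flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Everything named here is hypothetical: the `ProfileStore` class, the cosine-similarity match, the `threshold` value, and the choice to build a new profile by averaging a segment's feature data sets are all assumptions made for illustration. The multi-talker decision and the segmentation result are taken as inputs rather than computed.

```python
import math
from dataclasses import dataclass, field
from typing import List, Optional, Sequence, Tuple

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def mean_vector(vectors: Sequence[Vector]) -> List[float]:
    """Element-wise mean of a non-empty list of equal-length vectors."""
    count = len(vectors)
    return [sum(column) / count for column in zip(*vectors)]


@dataclass
class ProfileStore:
    """Hypothetical store of user speech profiles (one vector per user)."""
    threshold: float = 0.8  # assumed match threshold, not from the source
    profiles: List[List[float]] = field(default_factory=list)

    def match(self, feature_set: Vector) -> Optional[int]:
        """Return the index of the best profile at or above threshold, else None."""
        best_index, best_score = None, self.threshold
        for index, profile in enumerate(self.profiles):
            score = cosine_similarity(feature_set, profile)
            if score >= best_score:
                best_index, best_score = index, score
        return best_index

    def match_or_enroll(self, segment: Sequence[Vector]) -> Tuple[int, bool]:
        """Compare one feature data set of a talker-homogeneous segment against
        the stored profiles; if none matches, generate a new profile from all
        of the segment's feature data sets. Returns (profile index, enrolled)."""
        index = self.match(segment[0])
        if index is not None:
            return index, False
        self.profiles.append(mean_vector(segment))
        return len(self.profiles) - 1, True


def process_stream(multi_talker_detected: bool,
                   segments: Sequence[Sequence[Vector]],
                   store: ProfileStore) -> List[Tuple[int, bool]]:
    """Gate the costlier analysis on the first-stage decision.

    multi_talker_detected stands in for the first-power-mode check;
    segments stands in for the second-power-mode segmentation result.
    """
    if not multi_talker_detected:
        return []  # remain in the first power mode; nothing to analyze
    return [store.match_or_enroll(segment) for segment in segments]
```

For example, with a `ProfileStore` that starts empty, passing two segments from different talkers followed by a repeat of the first talker would enroll two profiles and then match the third segment to the first profile instead of creating another.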