-
公开(公告)号:US20240355335A1
公开(公告)日:2024-10-24
申请号:US18685019
申请日:2022-11-08
发明人: Xianliang Wang , Hongbin Suo
CPC分类号: G10L17/06 , G10L15/04 , G10L15/063 , G10L17/04 , G10L2015/0631
摘要: The present disclosure relates to an audio signal processing method and apparatus, a device and a storage medium. The present disclosure performs a segmenting processing on an audio signal to obtain multiple audio segments, performs a clustering processing on the multiple audio segments according to feature information of each audio segment in the multiple audio segments to obtain one or more first sets, determines a first cluster center of each first set according to the feature information of the audio segment included in each first set, and performs a clustering processing on the multiple audio segments according to the first cluster center of each first set to obtain one or more second sets, where audio segments in a same second set corresponding to a same role label. In this way, an accuracy of an unsupervised role separation based on a single channel speech is improved.