- 专利标题: Multi-person speech separation method and apparatus using a generative adversarial network model
-
申请号: US17023829申请日: 2020-09-17
-
公开(公告)号: US11450337B2公开(公告)日: 2022-09-20
- 发明人: Lianwu Chen , Meng Yu , Yanmin Qian , Dan Su , Dong Yu
- 申请人: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
- 申请人地址: CN Shenzhen
- 专利权人: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
- 当前专利权人: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
- 当前专利权人地址: CN Shenzhen
- 代理机构: Anova Law Group, PLLC
- 优先权: CN201810904488.9 20180809
- 主分类号: G10L21/0272
- IPC分类号: G10L21/0272 ; G06N3/04 ; G06N3/08 ; G10L25/30 ; G10L25/51
摘要:
A multi-person speech separation method is provided for a terminal. The method includes extracting a hybrid speech feature from a hybrid speech signal requiring separation, N human voices being mixed in the hybrid speech signal, N being a positive integer greater than or equal to 2; extracting a masking coefficient of the hybrid speech feature by using a generative adversarial network (GAN) model, to obtain a masking matrix corresponding to the N human voices, wherein the GAN model comprises a generative network model and an adversarial network model; and performing a speech separation on the masking matrix corresponding to the N human voices and the hybrid speech signal by using the GAN model, and outputting N separated speech signals corresponding to the N human voices.
公开/授权文献
- US20210005216A1 MULTI-PERSON SPEECH SEPARATION METHOD AND APPARATUS 公开/授权日:2021-01-07
信息查询