Patent search ap:("INSTITUTE OF AUTOMATION Page CHINESE ACADEMY OF SCIENCES") AND inv:"Jian Cui"

1.

发明授权
Target speaker separation system, device and storage medium 有权

公开(公告)号：US11978470B2

公开(公告)日：2024-05-07

申请号：US17980473

申请日：2022-11-03

Applicant: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES

Inventor： Jiaming Xu , Jian Cui , Bo Xu

IPC: G10L21/0272 , G10L17/02 , G10L17/04 , G10L17/06 , G10L21/028 , H04S1/00

CPC classification number: G10L21/028 , G10L17/02 , G10L17/04 , G10L17/06 , H04S1/007

Abstract: Disclosed are a target speaker separation system, an electronic device and a storage medium. The system includes: first, performing, jointly unified modeling on a plurality of cues based a masked pre-training strategy, to boost the inference capability of a model for missing cues and enhance the representation accuracy of disturbed cues; and second, constructing a hierarchical cue modulation module. A spatial cue is introduced into a primary cue modulation module for directional enhancement of a speech of a speaker; in an intermediate cue modulation module, the speech of the speaker is enhanced on the basis of temporal coherence of a dynamic cue and an auditory signal component; a steady-state cue is introduced into an advanced cue modulation module for selective filtering; and finally, the supervised learning capability of simulation data and the unsupervised learning effect of real mixed data are sufficiently utilized.

Patent Agency Ranking