-
1.
公开(公告)号:US20250054500A1
公开(公告)日:2025-02-13
申请号:US18233323
申请日:2023-08-13
Applicant: Google LLC
Inventor: Hakan Erdogan , Scott Thomas Wisdom , John Hershey , Zalán Borsos , Marco Tagliasacchi , Neil Zeghidour , Xuankai Chang
Abstract: A system and method are disclosed. Audio input comprising the mixed audio signals is received by one or more client devices. The audio input is converted into a plurality of discrete tokens. A plurality of sound sources, each corresponding to a subset of discrete tokens of a plurality of subsets of discrete tokens, is determined using a trained machine learning model.