Patent search ap:("Google LLC") AND inv:"Xuankai Chang" Page 1

1.

发明申请
USING MACHINE LEARNING AND DISCRETE TOKENS TO ESTIMATE DIFFERENT SOUND SOURCES FROM AUDIO MIXTURES 有权

公开(公告)号：US20250054500A1

公开(公告)日：2025-02-13

申请号：US18233323

申请日：2023-08-13

Applicant: Google LLC

Inventor： Hakan Erdogan , Scott Thomas Wisdom , John Hershey , Zalán Borsos , Marco Tagliasacchi , Neil Zeghidour , Xuankai Chang

IPC: G10L17/20 , G10L17/02 , G10L17/04 , G10L17/06 , G10L17/18

Abstract: A system and method are disclosed. Audio input comprising the mixed audio signals is received by one or more client devices. The audio input is converted into a plurality of discrete tokens. A plurality of sound sources, each corresponding to a subset of discrete tokens of a plurality of subsets of discrete tokens, is determined using a trained machine learning model.

Patent Agency Ranking