Systems and methods for jointly estimating sound sources and frequencies from audio
摘要:
An electronic device receives a first audio content item that includes a plurality of sound sources. The electronic device generates a representation of the first audio content item. The electronic device determines, from the representation of the first audio content item, a representation of an isolated sound source and frequency data associated with the isolated sound source. The determining includes using a neural network to jointly determine the representation of the isolated sound source and the frequency data associated with the isolated sound source.
信息查询
0/0