AUDIO CONTENT GENERATION AND CLASSIFICATION

    公开(公告)号:US20250006208A1

    公开(公告)日:2025-01-02

    申请号:US18708561

    申请日:2022-11-03

    Abstract: Some disclosed methods involve receiving audio data of at least a first audio data type and a second audio data type, including audio signals and associated spatial data indicating intended perceived spatial positions for the audio signals, determining at least a first feature type from the audio data and applying a positional encoding process to the audio data, to produce encoded audio data. The encoded audio data may include representations of at least the spatial data and the first feature type in first embedding vectors of an embedding dimension. Some methods may involve training a neural network, based on the encoded audio data, to transform audio data from an input audio data type having an input spatial data type to a transformed audio data type having a transformed spatial data type. Some methods may involve training a neural network to identify an input audio data type.

Patent Agency Ranking