Invention Grant
- Patent Title: System and method for training a transformer-in-transformer-based neural network model for audio data
-
Application No.: US17502863Application Date: 2021-10-15
-
Publication No.: US11854558B2Publication Date: 2023-12-26
- Inventor: Wei Tsung Lu , Ju-Chiang Wang , Minz Won , Keunwoo Choi , Xuchen Song
- Applicant: Lemon Inc.
- Applicant Address: KY Grand Cayman
- Assignee: Lemon Inc.
- Current Assignee: Lemon Inc.
- Current Assignee Address: KY Grand Cayman
- Agency: Faegre Drinker Biddle & Reath LLP
- Main IPC: G10L19/02
- IPC: G10L19/02 ; G10L25/30

Abstract:
Devices, systems and methods related to causing an apparatus to generate music information of audio data using a transformer-based neural network model with a multilevel transformer for audio analysis, using a spectral and a temporal transformer, are disclosed herein. The processor generates a time-frequency representation of obtained audio data to be applied as input for a transformer-based neural network model; determines spectral embeddings and first temporal embeddings of the audio data based on the time-frequency representation of the audio data; determines each vector of a second frequency class token (FCT) by passing each vector of the first FCT in the spectral embeddings through the spectral transformer; determines second temporal embeddings by adding a linear projection of the second FCT to the first temporal embeddings; determines third temporal embeddings by passing the second temporal embeddings through the temporal transformer; and generates music information based on the third temporal embeddings.
Public/Granted literature
- US20230124006A1 SYSTEM AND METHOD FOR TRAINING A TRANSFORMER-IN-TRANSFORMER-BASED NEURAL NETWORK MODEL FOR AUDIO DATA Public/Granted day:2023-04-20
Information query