Method and device for processing music file, terminal and storage medium

    公开(公告)号:US11514923B2

    公开(公告)日:2022-11-29

    申请号:US17494655

    申请日:2021-10-05

    发明人: Hequn Bai

    摘要: Provided are a method and device for processing a music file, a terminal and a storage medium. The method comprises: in response to a received sound effect adjustment instruction, acquiring a music file, the adjustment of which is indicated by the sound effect adjustment instruction; carrying out vocals and accompaniment separation on the music file to obtain vocal data and accompaniment data in the music file; carrying out first sound effect processing on the vocal data to obtain target vocal data, and carrying out second sound effect processing on the accompaniment data to obtain target accompaniment data; and synthesizing the target vocal data and the target accompaniment data to obtain a target music file.

    A DIALOG DETECTOR
    8.
    发明申请

    公开(公告)号:US20220199074A1

    公开(公告)日:2022-06-23

    申请号:US17604379

    申请日:2020-04-13

    发明人: Lie LU Xin LIU

    摘要: The present application relates to a method of extracting audio features in a dialog detector in response to an input audio signal, the method comprising dividing the input audio signal into a plurality of frames, extracting frame audio features from each frame, determining a set of context windows, each context window including a number of frames surrounding a current frame, deriving, for each context window, a relevant context audio feature for the current frame based on the frame audio features of the frames in each respective context, and concatenating each context audio feature to form a combined feature vector to represent the current frame. The context windows with the different length can improve the response speed and improve robustness.

    Method for outputting an audio signal reproducing a piece of music into an interior via an output device

    公开(公告)号:US11328741B2

    公开(公告)日:2022-05-10

    申请号:US16962987

    申请日:2018-01-18

    发明人: Daniel Kotulla

    IPC分类号: G10L25/81 G10L25/48 H04R5/04

    摘要: Method for outputting an audio signal reproducing at least part of a piece of music containing part of at least one main voice, in particular a singing voice, into an interior forming part of a passenger compartment of a motor vehicle via an audio output device having a left and a right audio output channel. The method includes providing an audio signal reproducing at least part of a piece of music containing at least one main voice, extracting an audio signal component, containing the at least one main voice, of the audio signal from the audio signal, attenuating the audio signal component containing the at least one main voice, and outputting the audio signal via the left and right audio output channels, of the audio output device, wherein the audio signal component containing the at least one main voice is output in attenuated fashion.

    SINGING VOICE SEPARATION WITH DEEP U-NET CONVOLUTIONAL NETWORKS

    公开(公告)号:US20210256994A1

    公开(公告)日:2021-08-19

    申请号:US17135119

    申请日:2020-12-28

    申请人: Spotify AB

    摘要: A system, method and computer product for training a neural network system. The method comprises applying an audio signal to the neural network system, the audio signal including a vocal component and a non-vocal component. The method also comprises comparing an output of the neural network system to a target signal, and adjusting at least one parameter of the neural network system to reduce a result of the comparing, for training the neural network system to estimate one of the vocal component and the non-vocal component. In one example embodiment, the system comprises a U-Net architecture. After training, the system can estimate vocal or instrumental components of an audio signal, depending on which type of component the system is trained to estimate.