HYBRID AUDIO SYNTHESIS USING NEURAL NETWORKS

    公开(公告)号:WO2020010338A1

    公开(公告)日:2020-01-09

    申请号:PCT/US2019/040739

    申请日:2019-07-05

    Applicant: DTS, INC.

    Abstract: A hybrid audio synthesis system and method for synthesizing hybrid audio from at least two distinct classes of sounds. Embodiments of the system and method include generating a synthesized hybrid audio clip using a generative adversarial network (GAN) that has been trained on two distinct classes of sounds. Embodiments of the system and method also include techniques for generating an extended-duration synthesized hybrid audio clip. This includes techniques for combining synthesized hybrid audio clips. In some embodiments a long short-term memory (LSTM) used to learn the best next audio input vector that is input to the trained GAN generator. In some embodiments the extended-duration synthesized hybrid audio clip is obtained using a Fourier transform of the synthesized hybrid audio clip (such as the Griffin-Lim algorithm or any variation thereof). In some embodiments cross fading is used to seamlessly stitch together the audio segments or clips.

Patent Agency Ranking