GENERATING VIDEOS USING DIFFUSION MODELS
    2.
    发明公开

    公开(公告)号:US20240338936A1

    公开(公告)日:2024-10-10

    申请号:US18296938

    申请日:2023-04-06

    Applicant: Google LLC

    CPC classification number: G06V10/82 G06V10/771 H04N7/0117 H04N7/013

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating an output video conditioned on an input. In one aspect, a method comprises receiving the input; initializing a current intermediate representation; generating an output video by updating the current intermediate representation at each of a plurality of iterations, wherein the updating comprises, at each iteration: processing an intermediate input for the iteration comprising the current intermediate representation using a diffusion model that is configured to process the intermediate input to generate a noise output; and updating the current intermediate representation using the noise output for the iteration.

Patent Agency Ranking