HIGH-RESOLUTION VIDEO GENERATION USING IMAGE DIFFUSION MODELS
Abstract:
In various examples, systems and methods are disclosed relating to aligning images into frames of a first video using at least one first temporal attention layer of a neural network model. The first video has a first spatial resolution. A second video having a second spatial resolution is generated by up-sampling the first video using at least one second temporal attention layer of an up-sampler neural network model, wherein the second spatial resolution is higher than the first spatial resolution.
Public/Granted literature
Information query
Patent Agency Ranking
0/0