High resolution real-time artistic style transfer pipeline

    公开(公告)号:US12159366B2

    公开(公告)日:2024-12-03

    申请号:US17436298

    申请日:2020-03-12

    Applicant: GOOGLE LLC

    Abstract: Systems and methods are provided for receiving at least one image and a reference image, and performing a plurality of downscaling operations having separable convolutions on the received at least one image. A plurality of residual blocks may be formed, with each residual block containing two separable convolutions of the kernel and two instance normalizations. A plurality of upscaling operations may be performed on the plurality of residual blocks, and a stylized image may be displayed based on at least the performed plurality of upscaling operations and the reference image.

    VISUAL ASSET DEVELOPMENT USING A GENERATIVE ADVERSARIAL NETWORK

    公开(公告)号:US20230215083A1

    公开(公告)日:2023-07-06

    申请号:US17928874

    申请日:2020-06-04

    Applicant: GOOGLE LLC

    CPC classification number: G06T15/205 G06T15/50 G06T15/04

    Abstract: A virtual camera captures first images of a three-dimensional (3D) digital representation of a visual asset from different perspectives and under different lighting conditions. The first images are training images that are stored in a memory. One or more processors implement a generative adversarial network (GAN) that includes a generator and a discriminator, which are implemented as different neural networks. The generator generates second images that represent variations of the visual asset concurrently with the discriminator attempting to distinguish between the first and second images. The one or more processors update a first model in the discriminator and/or a second model in the generator based on whether the discriminator successfully distinguished between the first and second images. Once trained, the generator generates images of the visual asset based on the first model, e.g., based on a label or an outline of the visual asset.

Patent Agency Ranking