CROSS-MODAL SHAPE AND COLOR MANIPULATION
    3.
    发明公开

    公开(公告)号:US20230386158A1

    公开(公告)日:2023-11-30

    申请号:US17814391

    申请日:2022-07-22

    Applicant: Snap Inc.

    CPC classification number: G06T19/20 G06T17/00 G06T2219/2012 G06T2219/2021

    Abstract: Systems, computer readable media, and methods herein describe an editing system where a three-dimensional (3D) object can be edited by editing a 2D sketch or 2D RGB views of the 3D object. The editing system uses multi-modal (MM) variational auto-decoders (VADs)(MM-VADs) that are trained with a shared latent space that enables editing 3D objects by editing 2D sketches of the 3D objects. The system determines a latent code that corresponds to an edited or sketched 2D sketch. The latent code is then used to generate a 3D object using the MM-VADs with the latent code as input. The latent space is divided into a latent space for shapes and a latent space for colors. The MM-VADs are trained with variational auto-encoders (VAE) and a ground truth.

    GENERATING 3D MODELS WITH TEXTURE

    公开(公告)号:US20250111628A1

    公开(公告)日:2025-04-03

    申请号:US18375332

    申请日:2023-09-29

    Applicant: Snap Inc.

    Abstract: An artificial intelligence (AI) network or neural network is trained to generate three-dimensional (3D) models or shapes with color from two-dimensional (2D) input images and input text describing the 3D model with color. Example methods include converting a first three-dimensional (3D) model from a first representation to a second representation, the second representation including color information for the 3D model and inputting the second representation into an encoder to generate a third representation having a lower dimension than the second representation. The method further includes inputting the third representation into a decoder to generate a fourth representation having a same dimension as the second representation and generating a second 3D model from the fourth representation. The method further includes determining losses between the first 3D model and the second 3D model and updating weights of the encoder and the decoder based on the losses.

Patent Agency Ranking