NEURAL NETWORK SYSTEMS FOR DECOMPOSING VIDEO DATA INTO LAYERED REPRESENTATIONS

    Publication No.: US20220012898A1

    Publication Date: 2022-01-13

    Application No.: US17295321

    Filing Date: 2019-11-20

    Abstract: A computer-implemented neural network system for decomposing input video data. A video data input receives a sequence of video image frames. The sequence is encoded, using a 3D spatio-temporal encoder neural network, into a set of latent variables representing a compressed version of the sequence. A 3D spatio-temporal decoder neural network processes the set of latent variables to generate two or more sets of decomposed video data; these may be stored, communicated, and/or made available to a user interface. Input video including undesired features such as reflections, shadows, and occlusions may thus be decomposed into two or more video sequences, one in which the undesired features are suppressed, and another containing the undesired features.
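
    The data flow described in the abstract (frame sequence in, compressed latent variables in the middle, two or more decomposed video sequences out) can be illustrated with a minimal shape-level sketch. This is not the patented network: the function names (`encode`, `decode`, `decompose`), the latent width, and the use of average pooling, nearest-neighbour upsampling, and random untrained weights in place of learned 3D convolutions are all assumptions made for illustration only.

```python
import numpy as np


def encode(video, w_enc):
    """Toy stand-in for a 3D spatio-temporal encoder (assumption, not the patented design).

    Averages non-overlapping 2x2x2 blocks over (time, height, width), then applies
    a per-voxel linear map over channels followed by a ReLU.
    `video` has shape (T, H, W, C) with T, H, and W all even.
    """
    t, h, w, c = video.shape
    pooled = video.reshape(t // 2, 2, h // 2, 2, w // 2, 2, c).mean(axis=(1, 3, 5))
    return np.maximum(pooled @ w_enc, 0.0)


def decode(latents, w_dec, num_layers, channels):
    """Toy stand-in for a 3D spatio-temporal decoder (assumption, not the patented design).

    Upsamples the latents by 2 along time, height, and width, then maps each voxel
    to num_layers * channels values and splits them into separate output videos.
    """
    up = latents.repeat(2, axis=0).repeat(2, axis=1).repeat(2, axis=2)
    out = up @ w_dec  # shape (T, H, W, num_layers * channels)
    return [out[..., i * channels:(i + 1) * channels] for i in range(num_layers)]


def decompose(video, num_layers=2, latent_dim=8, seed=0):
    """Run the encoder/decoder pair with random, untrained weights (hypothetical)."""
    rng = np.random.default_rng(seed)
    channels = video.shape[-1]
    w_enc = rng.normal(scale=0.1, size=(channels, latent_dim))
    w_dec = rng.normal(scale=0.1, size=(latent_dim, num_layers * channels))
    latents = encode(video, w_enc)
    return latents, decode(latents, w_dec, num_layers, channels)


if __name__ == "__main__":
    # 8 frames of 16x16 RGB noise stand in for an input video sequence.
    video = np.random.default_rng(1).random((8, 16, 16, 3))
    latents, layers = decompose(video)
    print("latents:", latents.shape)
    print("layers:", [layer.shape for layer in layers])
```

    In this sketch the latent tensor holds fewer values than the input sequence, mirroring the "compressed version" wording of the abstract, and each decoded layer has the same shape as the input video. Which layer would carry the undesired features (reflections, shadows, occlusions) depends on training, which the sketch does not attempt.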
