METHOD AND APPARATUS WITH VIDEO SEGMENTATION

    公开(公告)号:US20210150727A1

    公开(公告)日:2021-05-20

    申请号:US16900649

    申请日:2020-06-12

    Abstract: A method with video segmentation may include: acquiring, over time, a video sequence including a plurality of image frames, the plurality of image frames including a second image frame corresponding to a time t of the video sequence and a first image frame corresponding to a time t−1 before the time t; extracting a second feature vector from the second image frame; generating second hidden state information corresponding to the second image frame, based on first hidden state information corresponding to the first image frame and second fusion information in which the second feature vector is fused with information related to the second image frame stored in a memory; generating a second segmentation mask corresponding to the second image frame, based on an output vector corresponding to the second hidden state information; and outputting the second segmentation mask.

    METHOD AND DEVICE FOR REPRESENTING RENDERED SCENES

    公开(公告)号:US20240054716A1

    公开(公告)日:2024-02-15

    申请号:US18096972

    申请日:2023-01-13

    CPC classification number: G06T15/08 G06N3/08 G06T15/06

    Abstract: Disclosed are a method and device for representing rendered scenes. A data processing method of training a neural network model includes obtaining spatial information of sampling data, obtaining one or more volume-rendering parameters by inputting the spatial information of the sampling data to the neural network model, obtaining a regularization term based on a distribution of the volume-rendering parameters, performing volume rendering based on the volume-rendering parameters, and training the neural network model to minimize a loss function determined based on the regularization term and based on a difference between a ground truth image and an image that is estimated according to the volume rendering.

    Method and apparatus with depth map generation

    公开(公告)号:US12288347B2

    公开(公告)日:2025-04-29

    申请号:US17714347

    申请日:2022-04-06

    Abstract: A method and apparatus with depth map generation. The method may include generating points for a point cloud by unprojecting multi-view depth maps, of plural views, into a corresponding three-dimensional (3D) space using respective camera parameters corresponding to each view of the multi-view depth maps, extracting feature embedding vectors corresponding to the generated points, generating a two-dimensional (2D) feature map of a set view based on the extracted feature embedding vectors, generating a residual depth map using a refinement network with respect to the 2D feature map, generating a new depth map based on the residual depth map and an initial depth map, of the set view, among the multi-view depth maps.

    Method and apparatus with video segmentation

    公开(公告)号:US11321848B2

    公开(公告)日:2022-05-03

    申请号:US16900649

    申请日:2020-06-12

    Abstract: A method with video segmentation may include: acquiring, over time, a video sequence including a plurality of image frames, the plurality of image frames including a second image frame corresponding to a time t of the video sequence and a first image frame corresponding to a time t−1 before the time t; extracting a second feature vector from the second image frame; generating second hidden state information corresponding to the second image frame, based on first hidden state information corresponding to the first image frame and second fusion information in which the second feature vector is fused with information related to the second image frame stored in a memory; generating a second segmentation mask corresponding to the second image frame, based on an output vector corresponding to the second hidden state information; and outputting the second segmentation mask.

Patent Agency Ranking