LEARNING TO FUSE GEOMETRICAL AND CNN RELATIVE CAMERA POSE VIA UNCERTAINTY

    公开(公告)号:US20220148220A1

    公开(公告)日:2022-05-12

    申请号:US17519894

    申请日:2021-11-05

    Abstract: A computer-implemented method for fusing geometrical and Convolutional Neural Network (CNN) relative camera pose is provided. The method includes receiving two images having different camera poses. The method further includes inputting the two images into a geometric solver branch to return, as a first solution, an estimated camera pose and an associated pose uncertainty value determined from a Jacobian of a reproduction error function. The method also includes inputting the two images into a CNN branch to return, as a second solution, a predicted camera pose and an associated pose uncertainty value. The method additionally includes fusing, by a processor device, the first solution and the second solution in a probabilistic manner using Bayes' rule to obtain a fused pose.

    END-TO-END PARAMETRIC ROAD LAYOUT PREDICTION WITH CHEAP SUPERVISION

    公开(公告)号:US20220147746A1

    公开(公告)日:2022-05-12

    申请号:US17521193

    申请日:2021-11-08

    Abstract: A computer-implemented method for road layout prediction is provided. The method includes segmenting, by a first processor-based element, an RGB image to output pixel-level semantic segmentation results for the RGB image in a perspective view for both visible and occluded pixels in the perspective view based on contextual clues. The method further includes learning, by a second processor-based element, a mapping from the pixel-level semantic segmentation results for the RGB image in the perspective view to a top view of the RGB image using a road plane assumption. The method also includes generating, by a third processor-based element, an occlusion-aware parametric road layout prediction for road layout related attributes in the top view.

    FACE-AWARE PERSON RE-IDENTIFICATION SYSTEM

    公开(公告)号:US20220147735A1

    公开(公告)日:2022-05-12

    申请号:US17519986

    申请日:2021-11-05

    Abstract: A method for employing facial information in unsupervised person re-identification is presented. The method includes extracting, by a body feature extractor, body features from a first data stream, extracting, by a head feature extractor, head features from a second data stream, outputting a body descriptor vector from the body feature extractor, outputting a head descriptor vector from the head feature extractor, and concatenating the body descriptor vector and the head descriptor vector to enable a model to generate a descriptor vector.

    Self-supervised visual odometry framework using long-term modeling and incremental learning

    公开(公告)号:US11321853B2

    公开(公告)日:2022-05-03

    申请号:US16939604

    申请日:2020-07-27

    Abstract: A computer-implemented method for implementing a self-supervised visual odometry framework using long-term modeling includes, within a pose network of the self-supervised visual odometry framework including a plurality of pose encoders, a convolution long short-term memory (ConvLSTM) module having a first-layer ConvLSTM and a second-layer ConvLSTM, and a pose prediction layer, performing a first stage of training over a first image sequence using photometric loss, depth smoothness loss and pose cycle consistency loss, and performing a second stage of training to finetune the second-layer ConvLSTM over a second image sequence longer than the first image sequence.

    UNIVERSAL FEATURE REPRESENTATION LEARNING FOR FACE RECOGNITION

    公开(公告)号:US20210142043A1

    公开(公告)日:2021-05-13

    申请号:US17091011

    申请日:2020-11-06

    Abstract: A computer-implemented method for implementing face recognition includes receiving training data including a plurality of augmented images each corresponding to a respective one of a plurality of input images augmented by one of a plurality of variations, splitting a feature embedding generated from the training data into a plurality of sub-embeddings each associated with one of the plurality of variations, associating each of the plurality of sub-embeddings with respective ones of a plurality of confidence values, and applying a plurality of losses including a confidence-aware identification loss and a variation-decorrelation loss to the plurality of sub-embeddings and the plurality of confidence values to improve face recognition performance by learning the plurality of sub-embeddings.

Patent Agency Ranking