DIVIDE-AND-CONQUER FOR LANE-AWARE DIVERSE TRAJECTORY PREDICTION

    公开(公告)号:US20220144256A1

    公开(公告)日:2022-05-12

    申请号:US17521139

    申请日:2021-11-08

    Abstract: A method for driving path prediction is provided. The method concatenates past trajectory features and lane centerline features in a channel dimension at an agent's respective location in a top view map to obtain concatenated features thereat. The method obtains convolutional features derived from the top view map, the concatenated features, and a single representation of the training scene the vehicle and agent interactions. The method extracts hypercolumn descriptor vectors which include the convolutional features from the agent's respective location in the top view map. The method obtains primary and auxiliary trajectory predictions from the hypercolumn descriptor vectors. The method generates a respective score for each of the primary and auxiliary trajectory predictions. The method trains a vehicle trajectory prediction neural network using a reconstruction loss, a regularization loss objective, and an IOC loss objective responsive to the respective score for each of the primary and auxiliary trajectory predictions.

    MULTI-TASK LEARNING VIA GRADIENT SPLIT FOR RICH HUMAN ANALYSIS

    公开(公告)号:US20220121953A1

    公开(公告)日:2022-04-21

    申请号:US17496214

    申请日:2021-10-07

    Abstract: A method for multi-task learning via gradient split for rich human analysis is presented. The method includes extracting images from training data having a plurality of datasets, each dataset associated with one task, feeding the training data into a neural network model including a feature extractor and task-specific heads, wherein the feature extractor has a feature extractor shared component and a feature extractor task-specific component, dividing filters of deeper layers of convolutional layers of the feature extractor into N groups, N being a number of tasks, assigning one task to each group of the N groups, and manipulating gradients so that each task loss updates only one subset of filters.

    Unsupervised domain adaptation for video classification

    公开(公告)号:US11301716B2

    公开(公告)日:2022-04-12

    申请号:US16515593

    申请日:2019-07-18

    Abstract: A method is provided for unsupervised domain adaptation for video classification. The method learns a transformation for each target video clips taken from a set of target videos, responsive to original features extracted from the target video clips. The transformation corrects differences between a target domain corresponding to target video clips and a source domain corresponding to source video clips taken from a set of source videos. The method adapts the target to the source domain by applying the transformation to the original features extracted to obtain transformed features for the plurality of target video clips. The method converts the original and transformed features of same ones of the target video clips into a single classification feature for each of the target videos. The method classifies a new target video relative to the set of source videos using the single classification feature for each of the target videos.

    SELF-SUPERVISED VISUAL ODOMETRY FRAMEWORK USING LONG-TERM MODELING AND INCREMENTAL LEARNING

    公开(公告)号:US20210042937A1

    公开(公告)日:2021-02-11

    申请号:US16939604

    申请日:2020-07-27

    Abstract: A computer-implemented method for implementing a self-supervised visual odometry framework using long-term modeling includes, within a pose network of the self-supervised visual odometry framework including a plurality of pose encoders, a convolution long short-term memory (ConvLSTM) module having a first-layer ConvLSTM and a second-layer ConvLSTM, and a pose prediction layer, performing a first stage of training over a first image sequence using photometric loss, depth smoothness loss and pose cycle consistency loss, and performing a second stage of training to finetune the second-layer ConvLSTM over a second image sequence longer than the first image sequence.

    Learning good features for visual odometry

    公开(公告)号:US10852749B2

    公开(公告)日:2020-12-01

    申请号:US16100445

    申请日:2018-08-10

    Abstract: A computer-implemented method, system, and computer program product are provided for pose estimation. The method includes receiving, by a processor, a plurality of images from one or more cameras. The method also includes generating, by the processor with a feature extraction convolutional neural network (CNN), a feature map for each of the plurality of images. The method additionally includes estimating, by the processor with a feature weighting network, a score map from a pair of the feature maps. The method further includes predicting, by the processor with a pose estimation CNN, a pose from the score map and a combined feature map. The method also includes controlling an operation of a processor-based machine to change a state of the processor-based machine, responsive to the pose.

    CAMERA SELF-CALIBRATION NETWORK
    19.
    发明申请

    公开(公告)号:US20200234467A1

    公开(公告)日:2020-07-23

    申请号:US16736451

    申请日:2020-01-07

    Abstract: Systems and methods for camera self-calibration are provided. The method includes receiving real uncalibrated images, and estimating, using a camera self-calibration network, multiple predicted camera parameters corresponding to the real uncalibrated images. Deep supervision is implemented based on a dependence order between the plurality of predicted camera parameters to place supervision signals across multiple layers according to the dependence order. The method also includes determining calibrated images using the real uncalibrated images and the predicted camera parameters.

    Siamese reconstruction convolutional neural network for pose-invariant face recognition

    公开(公告)号:US10474883B2

    公开(公告)日:2019-11-12

    申请号:US15803292

    申请日:2017-11-03

    Abstract: A computer-implemented method, system, and computer program product is provided for pose-invariant facial recognition. The method includes generating, by a processor using a recognition neural network, a rich feature embedding for identity information and non-identity information for each of one or more images. The method also includes generating, by the processor using a Siamese reconstruction network, one or more pose-invariant features by employing the rich feature embedding for identity information and non-identity information. The method additionally includes identifying, by the processor, a user by employing the one or more pose-invariant features. The method further includes controlling an operation of a processor-based machine to change a state of the processor-based machine, responsive to the identified user in the one or more images.

Patent Agency Ranking