Techniques for interactive image segmentation networks

    公开(公告)号:US12112482B2

    公开(公告)日:2024-10-08

    申请号:US17161139

    申请日:2021-01-28

    CPC classification number: G06T7/11 G06N3/045 G06T2207/20081 G06T2207/20084

    Abstract: Various embodiments are generally directed to techniques for image segmentation utilizing context, such as with a machine learning (ML) model that injects context into various training stages. Many embodiments utilize one or more of an encoder-decoder model topology and select criteria and parameters in hyper-parameter optimization (HPO) to conduct the best model neural architecture search (NAS). Some embodiments are particularly directed to resizing context frames to a resolution that corresponds with a particular stage of decoding. In several embodiments, the context frames are concatenated with one or more of data from a previous decoding stage and data from a corresponding encoding stage prior to being provided as input to a next decoding stage.

    LEVERAGING EPISTEMIC CONFIDENCE FOR MULTI-MODAL FEATURE PROCESSING

    公开(公告)号:US20220382787A1

    公开(公告)日:2022-12-01

    申请号:US17816468

    申请日:2022-08-01

    Abstract: Systems, apparatuses, and methods include technology that extracts a plurality of features from the input data. The technology generates a confidence metric for the plurality of features. The confidence metric corresponds to a degree that at least one feature of the plurality of features is relevant for classification of the input data. The technology categorizes the input data into a category based on the plurality of features and the confidence metric

    Video tracking with deep Siamese networks and Bayesian optimization

    公开(公告)号:US11227179B2

    公开(公告)日:2022-01-18

    申请号:US16586671

    申请日:2019-09-27

    Abstract: An apparatus, method, system and computer readable medium for video tracking. An exemplar crop is selected to be tracked in an initial frame of a video. Bayesian optimization is applied with each subsequent frame of the video by building a surrogate model of an objective function using Gaussian Process Regression (GPR) based on similarity scores of candidate crops collected from a search space in a current frame of the video. A next candidate crop in the search space is determined using an acquisition function. The next candidate crop is compared to the exemplar crop using a Siamese neural network. Comparisons of new candidate crops to the exemplar crop are made using the Siamese neural network until the exemplar crop has been found in the current frame. The new candidate crops are selected based on an updated surrogate model.

    HIGH RESOLUTION INTERACTIVE VIDEO SEGMENTATION USING LATENT DIVERSITY DENSE FEATURE DECOMPOSITION WITH BOUNDARY LOSS

    公开(公告)号:US20210150329A1

    公开(公告)日:2021-05-20

    申请号:US16683326

    申请日:2019-11-14

    Abstract: Methods, systems and apparatuses may provide for technology that trains a neural network by inputting video data to the neural network, determining a boundary loss function for the neural network, and selecting weights for the neural network based at least in part on the boundary loss function, wherein the neural network outputs a pixel-level segmentation of one or more objects depicted in the video data. The technology may also operate the neural network by accepting video data and an initial feature set, conducting a tensor decomposition on the initial feature set to obtain a reduced feature set, and outputting a pixel-level segmentation of object(s) depicted in the video data based at least in part on the reduced feature set.

    METHODS AND APPARATUS FOR GROUND TRUTH SHIFT FEATURE RANKING

    公开(公告)号:US20240028876A1

    公开(公告)日:2024-01-25

    申请号:US18477407

    申请日:2023-09-28

    CPC classification number: G06N3/047 G06N3/084

    Abstract: Example apparatus disclosed include interface circuitry, machine readable instruction, and programmable circuitry to at least one of instantiate or execute the machine readable instructions to access source input data and target input data, identify a domain shift prediction based on at least one of a feature decorrelation of the source input data or a feature decorrelation of the target input data, the domain shift prediction a source domain prediction or a target domain prediction, initiate gradient propagation of a domain loss to determine data features for the domain shift prediction, and rank input data features for the domain shift prediction.

    UNCERTAINTY ANALYSIS OF EVIDENTIAL DEEP LEARNING NEURAL NETWORKS

    公开(公告)号:US20230136209A1

    公开(公告)日:2023-05-04

    申请号:US18148138

    申请日:2022-12-29

    Inventor: Anthony Rhodes

    Abstract: Disclosed is an example solution to analyze uncertainty of an evidential deep learning neural network with dissonance regularization and recurrent priors. An example apparatus includes processor circuitry to at least one of instantiate or execute the machine readable instructions to receive a first predicted classification of a first input of an evidential deep learning neural network (EVDL NN), identify a first uncertainty metric associated with the EVDL NN, the first uncertainty metric corresponding to the first input of the EVDL NN, calculate a first dissonance score based on the first uncertainty metric, and when the first dissonance score satisfies a threshold, assign the first predicted classification to the first input.

Patent Agency Ranking