METHODS, SYSTEMS, AND MEDIA FOR SEGMENTING IMAGES

    公开(公告)号:US20200380695A1

    公开(公告)日:2020-12-03

    申请号:US16885579

    申请日:2020-05-28

    IPC分类号: G06T7/143 G06T7/00

    摘要: Methods, systems, and media for segmenting images are provided. In some embodiments, the method comprises: generating an aggregate U-Net comprised of a plurality of U-Nets, wherein each U-Net in the plurality of U-Nets has a different depth, wherein each U-Net is comprised of a plurality of nodes Xi,j, wherein i indicates a down-sampling layer the U-Net, and wherein j indicates a convolution layer of the U-Net; training the aggregate U-Net by: for each training sample in a group of training samples, calculating, for each node in the plurality of nodes Xi,j, a feature map xi,j, wherein xi,j is based on a convolution operation performed on a down-sampling of an output from Xi−1,j when j=0, and wherein xi,j is based on a convolution operation performed on an up-sampling operation of an output from Xi+1,j−1 when j>0; and predicting a segmentation of a test image using the trained aggregate U-Net.

    SYSTEMS, METHODS, AND APPARATUSES FOR IMPLEMENTING MEDICAL IMAGE SEGMENTATION USING INTERACTIVE REFINEMENT

    公开(公告)号:US20220270357A1

    公开(公告)日:2022-08-25

    申请号:US17675929

    申请日:2022-02-18

    摘要: Described herein are means for implementing medical image segmentation using interactive refinement, in which the trained deep models are then utilized for the processing of medical imaging. For instance, an exemplary system is specially configured for operating a two-step deep learning training framework including means for receiving original input images at the deep learning training framework; means for generating an initial prediction image specifying image segmentation by processing the original input images through the base segmentation model to render the initial prediction image in the absence of user input guidance signals; means for receiving user input guidance signals indicating user-guided segmentation refinements to the initial prediction image; means for routing each of (i) the original input images, (ii) the initial prediction image, and (iii) the user input guidance signals to an InterCNN; means for generating a refined prediction image specifying refined image segmentation by processing each of the (i) the original input images, (ii) the initial prediction image, and (iii) the user input guidance signals through the InterCNN to render the refined prediction image incorporating the user input guidance signals; and means for outputting a refined segmentation mask based on application of the user input guidance signals to the deep learning training framework as a guidance signal. Other related embodiments are disclosed.

    Methods, systems, and media for segmenting images

    公开(公告)号:US11328430B2

    公开(公告)日:2022-05-10

    申请号:US16885579

    申请日:2020-05-28

    IPC分类号: G06K9/00 G06T7/143 G06T7/00

    摘要: Methods, systems, and media for segmenting images are provided. In some embodiments, the method comprises: generating an aggregate U-Net comprised of a plurality of U-Nets, wherein each U-Net in the plurality of U-Nets has a different depth, wherein each U-Net is comprised of a plurality of nodes Xi,j, wherein i indicates a down-sampling layer the U-Net, and wherein j indicates a convolution layer of the U-Net; training the aggregate U-Net by: for each training sample in a group of training samples, calculating, for each node in the plurality of nodes Xi,j, a feature map xi,j, wherein xi,j is based on a convolution operation performed on a down-sampling of an output from Xi−1,j when j=0, and wherein xi,j is based on a convolution operation performed on an up-sampling operation of an output from Xi+1,j−1 when j>0; and predicting a segmentation of a test image using the trained aggregate U-Net.

    SYSTEMS, METHODS, AND APPARATUSES FOR IMPLEMENTING A SELF-SUPERVISED CHEST X-RAY IMAGE ANALYSIS MACHINE-LEARNING MODEL UTILIZING TRANSFERABLE VISUAL WORDS

    公开(公告)号:US20210150710A1

    公开(公告)日:2021-05-20

    申请号:US17098422

    申请日:2020-11-15

    IPC分类号: G06T7/00 G06T7/73 A61B6/00

    摘要: Not only is annotating medical images tedious and time consuming, but it also demands costly, specialty-oriented expertise, which is not easily accessible. To address this challenge, a new self-supervised framework is introduced: TransVW (transferable visual words), exploiting the prowess of transfer learning with convolutional neural networks and the unsupervised nature of visual word extraction with bags of visual words, resulting in an annotation-efficient solution to medical image analysis. TransVW was evaluated using NIH ChestX-ray14 to demonstrate its annotation efficiency. When compared with training from scratch and ImageNet-based transfer learning, TransVW reduces the annotation efforts by 75% and 12%, respectively, in addition to significantly accelerating the convergence speed. More importantly, TransVW sets new records: achieving the best average AUC on all 14 diseases, the best individual AUC scores on 10 diseases, and the second best individual AUC scores on 3 diseases. This performance is unprecedented, because heretofore no self-supervised learning method has outperformed ImageNet-based transfer learning and no annotation reduction has been reported for self-supervised learning. These achievements are contributable to a simple yet powerful observation: The complex and recurring anatomical structures in medical images are natural visual words, which can be automatically extracted, serving as strong yet free supervision signals for CNNs to learn generalizable and transferable image representation via self-supervision.

    SYSTEMS, METHODS, AND APPARATUSES FOR ACCRUING AND REUSING KNOWLEDGE (ARK) FOR SUPERIOR AND ROBUST PERFORMANCE BY A TRAINED AI MODEL FOR USE WITH MEDICAL IMAGE CLASSIFICATION

    公开(公告)号:US20240339200A1

    公开(公告)日:2024-10-10

    申请号:US18627831

    申请日:2024-04-05

    摘要: Exemplary systems include means for receiving medical image data at the system from a plurality of datasets provided via publicly available sources; evaluating the medical image data for the presence of expert notation embedded within the medical image data; determining the expert notations embedded within the medical image data are formatted using inconsistent and heterogeneous labeling across the plurality of datasets; generating an interim AI model by applying a task head classifier to learn the annotations of the expert notations embedded within the medical image data to generate an interim AI model; scaling the interim AI model having the learned annotations of the expert notations embedded therein to additional tasks by applying multi-task heads using cyclical pre-training of the interim AI model trained previously to generate task-specific AI models, with each respective task-specific AI model having differently configured task-specific learning objectives; training a pre-trained AI model specially configured for an application-specific target task by applying task re-visitation training forcing the pre-trained AI model being trained to re-visit all tasks in each round of training and forcing the pre-trained AI model being trained to re-use all accrued knowledge to improve learning by the pre-trained AI model being trained against the current application-specific target task for which the pre-trained AI model is being trained.