MODEL TRAINING METHOD AND APPARATUS, PEDESTRIAN RE-IDENTIFICATION METHOD AND APPARATUS, AND ELECTRONIC DEVICE

    公开(公告)号:US20240221346A1

    公开(公告)日:2024-07-04

    申请号:US17800880

    申请日:2022-01-29

    CPC classification number: G06V10/44 G06T9/00 G06V10/761 G06V10/762 G06V10/806

    Abstract: The present disclosure provides a model training method and apparatus, a pedestrian re-identification method and apparatus, and an electronic device, and relates to the field of artificial intelligence, and specifically to computer vision and deep learning technologies, which can be applied to smart city scenarios. A specific implementation solution is: performing, by using a first encoder, feature extraction on a first pedestrian image and a second pedestrian image in a sample dataset, to obtain an image feature of the first pedestrian image and an image feature of the second pedestrian image; fusing the image feature of the first pedestrian image and the image feature of the second pedestrian image, to obtain a fused feature; performing, by using a first decoder, feature decoding on the fused feature, to obtain a third pedestrian image; and determining the third pedestrian image as a negative sample image of the first pedestrian image, and using the first pedestrian image and the negative sample image to train a first preset model to convergence, to obtain a pedestrian re-identification model. The embodiments of the present disclosure can improve the effect of the model in distinguishing between pedestrians with similar appearances but different identities.

    Model Determination Method and Electronic Device

    公开(公告)号:US20230124389A1

    公开(公告)日:2023-04-20

    申请号:US17887690

    申请日:2022-08-15

    Abstract: A model determination method and electronic device is provided, and relates to the technical field of artificial intelligence and, in particular, to the field of computer visions and deep learning, and can be applied to image processing, image identification and other scenarios. A specific implementation solution includes an image sample and a text sample are acquired, wherein text data in the text sample is used for performing text description to target image data in the image sample; at least one image feature in the image sample is stored to a first queue, and at least text feature in the text sample is stored to a second queue; the first queue and the second queue are trained to obtain a first target model; and the first target model is determined as an initialization model for a second target model.

    METHOD FOR TRAINING MULTI-MODAL DATA MATCHING DEGREE CALCULATION MODEL, METHOD FOR CALCULATING MULTI-MODAL DATA MATCHING DEGREE, AND RELATED APPARATUSES

    公开(公告)号:US20230215136A1

    公开(公告)日:2023-07-06

    申请号:US18113826

    申请日:2023-02-24

    CPC classification number: G06V10/761 G06V10/7715

    Abstract: The present disclosure provides a method and apparatus for training a multi-modal data matching degree calculation model, a method and apparatus for calculating a multi-modal data matching degree, an electronic device, a computer readable storage medium and a computer program product, and relates to the field of artificial intelligence technology such as deep learning, image processing and computer vision. The method comprises: acquiring first sample data and second sample data that are different in modalities; constructing a contrastive learning loss function comprising a semantic perplexity parameter, the semantic perplexity parameter being determined based on a semantic feature distance between the first sample data and the second sample data; and training, by using the contrastive learning loss function, an initial multi-modal data matching degree calculation model through a contrastive learning approach, to obtain a target multi-modal data matching degree calculation model.

Patent Agency Ranking