METHOD OF PROCESSING VIDEO, METHOD OF QUERING VIDEO, AND METHOD OF TRAINING MODEL

    公开(公告)号:US20230130006A1

    公开(公告)日:2023-04-27

    申请号:US18145724

    申请日:2022-12-22

    Abstract: The present application provides a method of processing a video, a method of querying a video, and a method of training a video processing model. A specific implementation solution of the method of processing the video includes: extracting, for a video to be processed, a plurality of video features under a plurality of receptive fields; extracting a local feature of the video to be processed according to a video feature under a target receptive field in the plurality of receptive fields; obtaining a global feature of the video to be processed according to a video feature under a largest receptive field in the plurality of receptive fields; and merging the local feature and the global feature to obtain a target feature of the video to be processed.

    VEHICLE RE-IDENTIFICATION METHOD, APPARATUS, DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20210192214A1

    公开(公告)日:2021-06-24

    申请号:US17164681

    申请日:2021-02-01

    Abstract: The present application discloses a vehicle re-identification method and apparatus, a device and a storage medium, which relates to the field of computer vision, intelligent search, deep learning and intelligent transportation. The specific implementation scheme is: receiving a re-identification request from a terminal device, the re-identification request including a first image of a first vehicle shot by a first camera and information of the first camera; acquiring a first feature of the first vehicle and a first head orientation of the first vehicle according to the first image; determining a second image of the first vehicle from images of multiple vehicles according to the first feature, multiple second features extracted based on the images of the multiple vehicles in an image database, the first head orientation of the first vehicle, and the information of the first camera; and transmitting the second image to the terminal device.

    METHOD AND APPARATUS FOR PRE-TRAINING SEMANTIC REPRESENTATION MODEL AND ELECTRONIC DEVICE

    公开(公告)号:US20230147550A1

    公开(公告)日:2023-05-11

    申请号:US18051594

    申请日:2022-11-01

    CPC classification number: G06V10/774 G06V20/41 G06F40/30 G06V30/19147

    Abstract: A method for pre-training a semantic representation model includes: for each video-text pair in pre-training data, determining a mask image sequence, a mask character sequence, and a mask image-character sequence of the video-text pair; determining a plurality of feature sequences and mask position prediction results respectively corresponding to the plurality of feature sequences by inputting the mask image sequence, the mask character sequence, and the mask image-character sequence into an initial semantic representation model; and building a loss function based on the plurality of feature sequences, the mask position prediction results respectively corresponding to the plurality of feature sequences and true mask position results, and adjusting coefficients of the semantic representation model to realize training.

    TRAINING METHOD, METHOD OF DETECTING TARGET IMAGE, ELECTRONIC DEVICE AND MEDIUM

    公开(公告)号:US20220392101A1

    公开(公告)日:2022-12-08

    申请号:US17887740

    申请日:2022-08-15

    Abstract: A training method, a method of detecting a target image, an electronic device and a medium, which relate to the field of artificial intelligence technology, and in particular to fields of computer vision and deep learning. The method can include: generating an expanded sample image set for a target scene by using a mask image set and an initial sample image set, wherein the mask image set is acquired by parsing a predetermined image set, a target object in the target scene is interfered by another object or the target object in the target scene is cut off, and an image in the predetermined image set includes the target object in the target scene or the another object; and training, by using the initial sample image set and the expanded sample image set, a detection model for detecting the target object.

    IMAGE CLASSIFICATION METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20220027611A1

    公开(公告)日:2022-01-27

    申请号:US17498226

    申请日:2021-10-11

    Abstract: Provided are an image classification method and apparatus, an electronic device and a storage medium, relating to the field of artificial intelligence and, in particular, to computer vision and deep learning. The method includes inputting a to-be-classified document image into a pretrained neural network and obtaining a feature submap of each text box of the to-be-classified document image by use of the neural network; inputting the feature submap of each text box, a semantic feature corresponding to preobtained text information of each text box and a position feature corresponding to preobtained position information of each text box into a pretrained multimodal feature fusion model and fusing, by use of the multimodal feature fusion model, the three into a multimodal feature corresponding to each text box; and classifying the to-be-classified document image based on the multimodal feature corresponding to each text box.

    OBJECT DETECTING METHOD, ELECTRONIC DEVICE AND STORAGE MEDIUM

    公开(公告)号:US20230027813A1

    公开(公告)日:2023-01-26

    申请号:US17936570

    申请日:2022-09-29

    Abstract: An object detecting method includes: obtaining an object image of an object; obtaining an object feature map by performing feature extraction on the object image; obtaining decoded features by performing feature mapping on the object feature map by adopting a mapping network of an object recognition model; obtaining positions of prediction boxes by inputting the decoded features into a first prediction layer of the object recognition model to perform object regression prediction; and obtaining classes of objects within the prediction boxes by inputting the decoded features into a second prediction layer of the object recognition model to perform object class prediction.

    METHOD FOR TRAINING IMAGE RECOGNITION MODEL BASED ON SEMANTIC ENHANCEMENT

    公开(公告)号:US20220392205A1

    公开(公告)日:2022-12-08

    申请号:US17892669

    申请日:2022-08-22

    Abstract: Embodiments of the present disclosure provide a method and apparatus for training an image recognition model based on a semantic enhancement, a method and apparatus for recognizing an image, an electronic device, and a computer readable storage medium. The method for training an image recognition model based on a semantic enhancement comprises: extracting, from an inputted first image being unannotated and having no textual description, a first feature representation of the first image; calculating a first loss function based on the first feature representation; extracting, from an inputted second image being unannotated and having an original textual description, a second feature representation of the second image; calculating a second loss function based on the second feature representation, and training an image recognition model based on a fusion of the first loss function and the second loss function.

    MODEL TRAINING METHOD AND APPARATUS, KEYPOINT POSITIONING METHOD AND APPARATUS, DEVICE AND MEDIUM

    公开(公告)号:US20220139061A1

    公开(公告)日:2022-05-05

    申请号:US17576198

    申请日:2022-01-14

    Abstract: Provided are a training method and apparatus for a human keypoint positioning model, a human keypoint positioning method and apparatus, a device, a medium and a program product. The training method includes determining an initial positioned point of each of keypoints; acquiring N candidate points of each keypoint according to a position of the initial positioned point; extracting a first feature image, and forming N sets of graph structure feature data according to the first feature image and the N candidate points; performing graph convolution on the N sets of graph structure feature data to obtain N sets of offsets; correcting initial positioned points of all the keypoints to obtain N sets of current positioning results; and calculating each set of loss values according to labeled true values of all the keypoints and each set of current positioning results, and performing supervised training on the positioning model.

Patent Agency Ranking