PERSON SEARCH METHOD BASED ON PERSON RE-IDENTIFICATION DRIVEN LOCALIZATION REFINEMENT

    Publication No.: US20210365743A1

    Publication Date: 2021-11-25

    Application No.: US17253124

    Application Date: 2020-06-23

    Abstract: The invention discloses a person search method based on person re-identification driven localization refinement. On the one hand, a region of interest (ROI) conversion module converts the original input image into a small image corresponding to the ROI, which avoids the conflict that arises when the person re-identification network and the detection network share part of their features. On the other hand, the loss of the person re-identification network can be propagated back to the detection network as a gradient through the ROI conversion module, so that this loss supervises the detection bounding boxes output by the detection network. The adjusted bounding boxes effectively remove background interference, contain more useful attribute information, and are better suited to person search, so that person search accuracy is greatly improved.
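
The abstract does not say how the ROI conversion module is built. The sketch below shows one common way to crop a box so that the output varies smoothly with the box coordinates, which is the property a gradient needs in order to reach the detection network. It uses bilinear resampling; this choice, the function names `bilinear` and `roi_crop`, and the `(x1, y1, x2, y2)` box layout are all assumptions made for illustration and are not taken from the patent.

```python
import math

def bilinear(img, y, x):
    """Sample a 2-D grid (list of rows) at a fractional (y, x) position."""
    h, w = len(img), len(img[0])
    # Clamp so that the four neighbouring pixels always exist.
    y = min(max(y, 0.0), h - 1.0)
    x = min(max(x, 0.0), w - 1.0)
    y0, x0 = int(math.floor(y)), int(math.floor(x))
    y1, x1 = min(y0 + 1, h - 1), min(x0 + 1, w - 1)
    dy, dx = y - y0, x - x0
    top = img[y0][x0] * (1 - dx) + img[y0][x1] * dx
    bot = img[y1][x0] * (1 - dx) + img[y1][x1] * dx
    return top * (1 - dy) + bot * dy

def roi_crop(img, box, out_h, out_w):
    """Resample the box (x1, y1, x2, y2) into a fixed out_h x out_w image.

    Hypothetical stand-in for the ROI conversion module: every sample
    position is a linear function of the box coordinates, so the output
    changes continuously when the box moves.
    """
    x1, y1, x2, y2 = box
    out = []
    for i in range(out_h):
        ty = i / (out_h - 1) if out_h > 1 else 0.5
        row = []
        for j in range(out_w):
            tx = j / (out_w - 1) if out_w > 1 else 0.5
            row.append(bilinear(img, y1 + ty * (y2 - y1), x1 + tx * (x2 - x1)))
        out.append(row)
    return out

# Toy image whose pixel value is x + 10 * y.
image = [[x + 10 * y for x in range(5)] for y in range(5)]
crop = roi_crop(image, (1.0, 1.0, 3.0, 3.0), 2, 2)
shifted = roi_crop(image, (1.5, 1.0, 3.0, 3.0), 2, 2)
```

Moving the left edge of the box by half a pixel changes the cropped values by a proportional amount instead of snapping to a different pixel. In an automatic differentiation framework, the same construction would let a loss computed on the crop produce a gradient with respect to the box coordinates.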

    SPATIOTEMPORAL ACTION DETECTION METHOD

    Publication No.: US20210248378A1

    Publication Date: 2021-08-12

    Application No.: US16965015

    Application Date: 2020-01-07

    Abstract: A spatiotemporal action detection method includes performing object detection on all frames of a sample video to obtain a candidate object set; calculating the optical flow between all frames of the sample video to obtain a motion set; constructing a spatiotemporal convolution-deconvolution network with an added object attention mechanism and a motion attention mechanism; performing spatiotemporal convolution on each time segment of the sample video and then adding a corresponding sparse variable and a sparse constraint to obtain a network structure S; training the network structure S with an objective function based on a cross-entropy classification loss and a sparse constraint loss; and calculating the action category and sparse coefficient corresponding to each time segment of a test video to obtain the spatiotemporal location of the object action.
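
The training objective described above combines a cross-entropy classification loss with a loss on the sparse constraint. A minimal sketch of such an objective follows. The abstract does not give the form of the sparse constraint, so the L1 penalty, the weight `lam`, and the function names are assumptions for illustration only.

```python
import math

def cross_entropy(logits, label):
    """Cross-entropy of one sample, computed with a stable log-sum-exp."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]

def total_loss(logits, label, sparse_coeffs, lam=0.1):
    """Classification loss plus an assumed L1 penalty on per-segment coefficients.

    sparse_coeffs holds one coefficient per time segment (hypothetical layout);
    lam is a hypothetical weight that trades accuracy against sparsity.
    """
    l1 = sum(abs(c) for c in sparse_coeffs)
    return cross_entropy(logits, label) + lam * l1

# Two classes with equal scores, three time segments.
loss = total_loss([0.0, 0.0], 0, [1.0, -2.0, 0.0], lam=0.5)
```

Under this assumption, the penalty pushes the coefficients of most time segments toward zero, so the segments that keep a large coefficient are the ones the network relies on for the predicted action.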
