Method and device for encoding or decoding based on inter-frame prediction

    公开(公告)号:US12206892B2

    公开(公告)日:2025-01-21

    申请号:US17768212

    申请日:2019-10-25

    Abstract: A method and a device for encoding or decoding based on an inter-frame prediction. The method includes steps of: determining a temporal motion vector prediction value of a to-be-processed coding unit, where the temporal motion vector prediction value is a temporal motion vector prediction value of a sub-block, a temporal motion vector of which is obtainable through prediction, in sub-blocks adjacent to the to-be-processed coding unit and/or sub-blocks in the to-be-processed coding unit; determining a motion vector residual prediction value of the to-be-processed coding unit according to the temporal motion vector prediction value; determining a motion vector of a sub-block in the to-be-processed coding unit according to the temporal motion vector prediction value and the motion vector residual prediction value and performing a motion compensation according to the motion vector of the sub-block in the to-be-processed coding unit to determine a prediction block of the to-be-processed coding unit.

    Back-propagation image visual saliency detection method based on depth image mining

    公开(公告)号:US11227178B2

    公开(公告)日:2022-01-18

    申请号:US16336737

    申请日:2017-11-24

    Abstract: A back-propagation significance detection method based on depth map mining, comprising: for an input image Io, at a preprocessing phase, obtaining a depth image Id and an image Cb with four background corners removed of the image Io; at a first processing phase, carrying out positioning detection on a significant region of the image by means of the obtained image Cb with four background corners removed and the obtained depth image Id to obtain the preliminary detection result S1 of a significant object in the image; then carrying out depth mining on a plurality of processing phases of the depth image Id to obtain corresponding significance detection results; and then optimizing the significance detection result mined in each processing phase by means of a back-propagation mechanism to obtain a final significance detection result map. The method can improve the detection accuracy of the significance object.

    Method of using deep discriminate network model for person re-identification in image or video

    公开(公告)号:US11100370B2

    公开(公告)日:2021-08-24

    申请号:US16622929

    申请日:2018-01-23

    Abstract: Disclosed is a deep discriminative network for person re-identification in an image or a video. Concatenation are carried out on different input images on a color channel by constructing a deep discriminative network, and an obtained splicing result is defined as an original difference space of different images. The original difference space is sent into a convolutional network. The network outputs the similarity between two input images by learning difference information in the original difference space, thereby realizing person re-identification. The features of an individual image are not learnt, and concatenation are carried out on input images on a color channel at the beginning, and difference information is learnt on an original space of the images by using a designed network. By introducing an Inception module and embedding the same into a model, the learning ability of a network can be improved, and a better differentiation effect can be achieved.

    Method for detecting pedestrians in image by using Gaussian penalty

    公开(公告)号:US11030444B2

    公开(公告)日:2021-06-08

    申请号:US16621385

    申请日:2017-11-24

    Abstract: Disclosed is a method for detecting pedestrians in an image by using Gaussian penalty. Initial pedestrian boundary box is screened using a Gaussian penalty, to improve the pedestrian detection performance, especially sheltered pedestrians in an image. The method includes acquiring a training data set, a test data set and pedestrian labels of a pedestrian detection image; using the training data set for training to obtain a detection model by using a pedestrian detection method, and acquiring initial pedestrian boundary box and confidence degrees and coordinates thereof; performing Gaussian penalty on the confidence degrees of the pedestrian boundary box, to obtain confidence degree of the pedestrian boundary box after the penalty; and obtaining final pedestrian boundary boxes by screening the pedestrian boundary boxes. Thus, repeated boundary boxes of a single pedestrian are removed while reserving boundary boxes of sheltered pedestrians, thereby realizing the detection of the pedestrians in an image.

    Image deblurring method based on light streak information in an image

    公开(公告)号:US10755390B2

    公开(公告)日:2020-08-25

    申请号:US16098732

    申请日:2016-07-15

    Abstract: An image deblurring method based on light streak information in an image is provided, wherein shape information of a blur kernel is obtained based on a light streak in a motion blur image and image restoration is constrained by combining the shape information, a natural image and the blur kernel to thereby obtain an accurate blur kernel and a high-quality restored image. The method specifically comprises: selecting an optimum image patch including an optimum light streak; extracting shape information of a blur kernel from the optimum image patch including the optimum light streak; performing blur kernel estimation to obtain the final blur kernel; performing non-blind deconvolution and restoring a sharp restored image as a final deblurred image. The present disclosure establishes a blurry image test set of captured images including light streaks and a method to obtain an accurate blur kernel and a high quality restore image.

    Method of inter-frame prediction for video encoding and decoding

    公开(公告)号:US10425656B2

    公开(公告)日:2019-09-24

    申请号:US15746932

    申请日:2016-01-19

    Abstract: A video encoding and decoding method, and its inter-frame prediction method, device and system thereof are disclosed. The inter-frame prediction method includes obtaining a motion vector of the current image block and related spatial position of a current pixel, obtaining a motion vector of the current pixel according to the motion vector of the current image block and the related spatial position of the current pixel; and obtaining a predicted value of the current pixel according to the motion vector of the current pixel. The method considers both the motion vector of the current image block and the related spatial position information of the current pixel during inter-frame prediction. The method can accommodate lens distortion characteristics of different images and zoom-in/zoom-out produced when the object moves in pictures, thereby improving the calculation accuracy of pixels' motion vectors, and improving inter-frame prediction performance and compression efficiency in video encoding and decoding.

    Video background removal method
    9.
    发明授权

    公开(公告)号:US10297016B2

    公开(公告)日:2019-05-21

    申请号:US16089493

    申请日:2017-01-05

    Abstract: Disclosed is a video background removal method, which relates to the technical field of video analysis, and in particular to a background removal method based on an image block, a Gaussian mixture model and a random process. Firstly, the concept of blocks is defined, and a foreground and a background are determined by means of comparing a difference between blocks; a threshold value is automatically adjusted by using a Gaussian mixture model, and at the same time, the background is updated by using the idea of random process; and finally, an experiment is made on a BMC dataset, and a result shows that this method surpasses most of the current advanced algorithms, and the accuracy is very high. This method has wide applicability, can be applied to monitor video background subtraction, and is applied very importantly in the field of video analysis.

    METHOD FOR ACCELERATING A CDVS EXTRACTION PROCESS BASED ON A GPGPU PLATFORM

    公开(公告)号:US20190139186A1

    公开(公告)日:2019-05-09

    申请号:US16307329

    申请日:2016-12-05

    Abstract: Embodiments of the present disclosure provide a method for accelerating CDVS extraction process based on a GPGPU platform, wherein for the stages of feature detection and local descriptor computation of the CDVS extraction process, operation logics and parallelism strategies of respective inter-pixel parallelism sub-procedures and respective inter-feature point parallelism sub-procedures are implemented by leveraging an OpenCL general-purpose parallelism programming framework, and acceleration is achieved by leveraging a GPU's parallelism computation capability; including: partitioning computing tasks for a GPU and a CPU; reconstructing an image scale pyramid storage model; assigning parallelism strategies to respective sub-procedures for the GPU; and applying local memory to mitigate the access bottleneck. The technical solution of the present disclosure may accelerate the CDVS extraction process and significantly enhances the extraction performance.

Patent Agency Ranking