Boundary filtering method for intra prediction

    公开(公告)号:US11381812B2

    公开(公告)日:2022-07-05

    申请号:US16755350

    申请日:2018-09-25

    Abstract: Disclosed is a boundary filtering method for intra prediction, relating to the video encoding technology filed. Whether boundary filtering is performed on an intra prediction block or not is adaptively selected by means of a rate distortion optimization decision; during filtering, a filter coefficient exponentially attenuated relative to distance to boundary is adopted to perform filtering on the first N rows or the first N columns of the intra prediction block by means of an intra prediction block filter, and different filtering strengths are used according to different sizes of the prediction blocks. Therefore, the boundary distortion problem of intra prediction block is solved, the intra prediction precision is improved, and the encoding efficiency of intra prediction block is increased; and the practicability and the robustness of the boundary filtering technology are improved.

    Video action detection method based on convolutional neural network

    公开(公告)号:US11379711B2

    公开(公告)日:2022-07-05

    申请号:US16414783

    申请日:2017-08-16

    Abstract: A video action detection method based on a convolutional neural network (CNN) is disclosed in the field of computer vision recognition technologies. A temporal-spatial pyramid pooling layer is added to a network structure, which eliminates limitations on input by a network, speeds up training and detection, and improves performance of video action classification and time location. The disclosed convolutional neural network includes a convolutional layer, a common pooling layer, a temporal-spatial pyramid pooling layer and a full connection layer. The outputs of the convolutional neural network include a category classification output layer and a time localization calculation result output layer. The disclosed method does not require down-sampling to obtain video clips of different durations, but instead utilizes direct input of the whole video at once, improving efficiency. Moreover, the network is trained by using video clips of the same frequency without increasing differences within a category, thus reducing the learning burden of the network, achieving faster model convergence and better detection.

    MCMC framework-based sub-hypergraph matching method and device

    公开(公告)号:US11347979B2

    公开(公告)日:2022-05-31

    申请号:US16079660

    申请日:2016-03-10

    Abstract: A method and a device for MCMC framework-based sub-hypergraph matching are provided. Matching of object features is performed by constructing sub-hypergraphs. In a large number of actual images and videos, objects vary constantly, and contain various noise points as well as other interference factors, which makes image object matching and searching very difficult. Perform object feature matching by representing the appearance and positions of objects by sub-hypergraphs allows for faster and more accurate image matching. Furthermore, a sub-hypergraph has several advantages over a graph or a hypergraph: on one hand, a sub-hypergraph has more geometric information (e.g. angle transformation, rotation, scale, etc.) than a graph, and has a lower degree of difficulty and better extensibility than a hypergraph. On the other hand, the disclosed method and device have stronger capabilities to resist interference and good robustness, and are adaptable to more complex settings, especially with outliers.

    Main viewpoint-based panoramic video mapping method

    公开(公告)号:US11301953B2

    公开(公告)日:2022-04-12

    申请号:US16650141

    申请日:2018-05-29

    Abstract: Disclosed are a panoramic video asymmetrical mapping method and a corresponding inverse mapping method that include mapping a spherical surface corresponding to a panoramic image or video A onto a two-dimensional image or video B, projecting the spherical surface onto an isosceles quadrangular pyramid with a square bottom plane, and further projecting the isosceles quadrangular pyramid onto a planar surface, using isometric projection on a main viewpoint region in the projection and using a relatively high sampling density to ensure that the video quality of the region of the main viewpoint is high, while using a relatively low sample density for non-main viewpoint regions so as to reduce bit rate. The panoramic video asymmetrical inverse mapping technique provides a method for mapping from a planar surface to a spherical surface, and a planar surface video may be mapped back to a spherical surface for rendering and viewing.

    Point cloud attribute compression method based on deleting 0 elements in quantisation matrix

    公开(公告)号:US11216985B2

    公开(公告)日:2022-01-04

    申请号:US17045894

    申请日:2018-05-15

    Abstract: Disclosed in the present invention is a point cloud attribution compression method based on deleting 0 elements in a quantisation matrix, including optimizing a traversal sequence for a quantisation matrix and deleting the 0 elements at the end of the data stream. The present invention may use seven types of traversal sequences at the encoding end of the point cloud attribute compression, such that the distribution of the 0 elements in the data stream may be more concentrated at the end thereof. The 0 elements at the end of the data stream may be deleted, removing redundant information and reducing the quantity of data to be entropy encoded. At the decoding end, the point cloud geometric information may be incorporated to supplement the deleted 0 elements and the quantisation matrix may be restored according to the traversal sequence, thereby improving compression performance without introducing new errors.

    Hierarchical division-based point cloud attribute compression method

    公开(公告)号:US11004240B2

    公开(公告)日:2021-05-11

    申请号:US16626907

    申请日:2018-05-15

    Abstract: Disclosed is a hierarchical division-based point cloud attribute compression method. For point cloud attribute information, a new hierarchical division based coding scheme is proposed, wherein a frame of point cloud is adaptively divided into a “stripe-macroblock-block” hierarchical structure according to the spatial position and color distribution of the point cloud, and stripes are coded independently from one another, increasing the coding efficiency, enhancing the fault tolerance of a system and improving the performance of point cloud attribute compression. The method comprises: (1) inputting a point cloud; (2) division of a k-dimension (KD) tree structure of the point cloud; (3) continuity analysis of point cloud attribute information; (4) stripe division of the point cloud; (5) division of macroblocks and coding blocks of the point cloud; and (6) intra-frame prediction, transformation, quantification and entropy coding based on a block structure.

    Method and a device for extracting local features of a three-dimensional point cloud

    公开(公告)号:US10339409B2

    公开(公告)日:2019-07-02

    申请号:US15575897

    申请日:2015-06-18

    Abstract: A method and a device for extracting local features of a 3D point cloud are disclosed. Angle information and the concavo-convex information about a feature point to be extracted and a point of an adjacent body element are calculated based on a local reference system corresponding to the points of each body element. The feature relation between the two points can be calculated accurately. The property of invariance in translation and rotation is possessed. Since concavo-convex information about a local point cloud is contained during extraction, the inaccurate extraction caused by ignoring concavo-convex ambiguity in previous 3D local feature description is resolved. During normalization processing, exponential normalization processing and second-normal-form normalization are adopted, which solves the problem of inaccurate similarity calculation caused by a circumstance that a few elements in a vector are too large or too small during feature extraction, thus improving accuracy of extracted three-dimensional local features.

Patent Agency Ranking