3D point cloud compression system based on multi-scale structured dictionary learning

    公开(公告)号:US11836954B2

    公开(公告)日:2023-12-05

    申请号:US18182401

    申请日:2023-03-13

    IPC分类号: G06T9/00 G06T9/40

    CPC分类号: G06T9/001 G06T9/40

    摘要: In a 3D point cloud compression system based on multi-scale structured dictionary learning, a point cloud data partition module outputs a voxel set and a set of blocks of voxels of different scales. A geometric information encoding module outputs an encoded geometric information bit stream. A geometric information decoding module outputs decoded geometric information. An attribute signal encoding module outputs a sparse coding coefficient matrix and a learned multi-scale structured dictionary. An attribute signal compression module outputs a compressed attribute signal bit stream. An attribute signal decoding module outputs decoded attribute signals. A 3D point cloud reconstruction module completes reconstruction. The system is applicable to lossless geometric and lossy attribute compression of point cloud signals. Based on the natural hierarchical partitioning structure of point cloud signals, the system gradually improves the reconstruction quality of high-frequency details in the signals from coarse scale to fine scale, and achieves significant gains.

    Image classification method for maximizing mutual information, device, medium and system

    公开(公告)号:US12106546B2

    公开(公告)日:2024-10-01

    申请号:US18623054

    申请日:2024-04-01

    摘要: The present disclosure provides an image classification method for maximizing mutual information, device, medium and system, the method including: acquiring a training image; maximizing the mutual information between the training image and a neural network architecture, and automatically determining the network architecture and parameter of the neural network; and processing image data to be classified using the obtained neural network to obtain an image classification result. According to the present disclosure, the network architecture and parameter of the neutral network are automatically designed and determined by maximizing the mutual information based on given image data without burdensome manual design and saving human and computational resource consumption. The present disclosure can automatically design and obtain a neural network-based image classification method in a very short time, and at the same time can achieve higher image classification accuracy.

    Method and system for bit rate control and version selection for dynamic adaptive video streaming media

    公开(公告)号:US10778982B2

    公开(公告)日:2020-09-15

    申请号:US16297717

    申请日:2019-03-11

    摘要: The disclosure provides a method and system for encoding bit rate control and version selection for a dynamic adaptive video streaming media. The method adopts a dynamic adaptive streaming media encoding technology to encode each original video into a plurality of versions with different bit rates at a server and determines video version subsets to be encoded by the original videos and specific encoding parameters of each video version by taking an encoding complexity-bit rate-distortion model for different original video contents, constraints on an encoding bit rate and a computing resource of the video server, network connection conditions of different users and a video-on-demand probability distribution into consideration, and finally, the video server outputs an optimal video version set through encoding, so as to maximize the overall quality of videos watched by users.

    Image processing method, system, device and storage medium

    公开(公告)号:US11995801B2

    公开(公告)日:2024-05-28

    申请号:US18472248

    申请日:2023-09-22

    IPC分类号: G06T5/60 G06T5/00 G06T5/70

    摘要: An image processing method for sparse image reconstruction, image denoising, compressed sensing image reconstruction or image restoration, comprising: establishing a general linear optimization inverse problem under the 1-norm constraint of a sparse signal; establishing a differentiable deep network model based on convex combination to solve the problem on the basis of standard or learned iterative soft shrinkage thresholding algorithm; and introducing a deep neural network of arbitrary structure into the solving step to accelerate the solving step and reducing a number of iterations needed to reach a convergence. The present disclosure combines the traditional iterative optimization algorithm with the deep neural network of arbitrary structure to improve the image reconstruction performance and ensure fast convergence to meet the current needs of sparse image reconstruction.