METHOD, DEVICE, AND MEDIUM FOR VIDEO PROCESSING

    公开(公告)号:US20240244272A1

    公开(公告)日:2024-07-18

    申请号:US18622817

    申请日:2024-03-29

    申请人: Bytedance Inc.

    IPC分类号: H04N19/90 H04N19/176

    CPC分类号: H04N19/90 H04N19/176

    摘要: Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: obtaining a first machine learning (ML) model for processing a video, wherein the first ML model is trained based on one or more second ML models; and performing, according to the first ML model, a conversion between a current video block of the video and a bitstream of the video.

    Decoding 1D-barcodes in digital capture systems

    公开(公告)号:US11954930B2

    公开(公告)日:2024-04-09

    申请号:US17666401

    申请日:2022-02-07

    摘要: The present disclosure relates to advanced image signal processing technology including: i) rapid localization for machine-readable indicia including, e.g., 1-D and 2-D barcodes; and ii) barcode reading and decoders. One claim recites: an image processing method comprising: obtaining 2-dimensional (2D) image data representing a 1-dimensional (1D) barcode within a first image area; generating a plurality of scanlines across the first image area; for each of the plurality of scanlines, synchronizing the scanline, including decoding an initial set of numerical digits represented by the scanline, in which said synchronizing provides a scale estimate for the scanline; using a path decoder to decode remaining numerical digits within the scanline, the path decoder decoding multiple numerical digits in groups, in which the scale estimate is adapted as the remaining numerical digits are decoded; and providing decoded numerical digits as an identifier represented by the scanline. Of course, other combinations and claims are described within the present disclosure.

    PARALLELIZED VIDEO DECODING USING A NEURAL NETWORK

    公开(公告)号:US20230283790A1

    公开(公告)日:2023-09-07

    申请号:US18016568

    申请日:2021-07-13

    申请人: FONDATION B-COM

    IPC分类号: H04N19/436 H04N19/90

    CPC分类号: H04N19/436 H04N19/90

    摘要: In a method for decoding a data stream by way of an electronic device (10) including a processor (14), and a parallelized processing unit (16) designed to perform a plurality of operations of the same type in parallel at a given time, the data stream includes a first dataset (Fet) and a second dataset (Fnn) representative of audio or video content. The decoding method includes the processor (14) processing data from the first dataset (Fet), obtaining the audio or video content by processing (E70) data from the second dataset (Fnn) using a process depending at least partially on the data from the first set (Fet) and using an artificial neural network (18) implemented by the parallelized processing unit (16).

    Machine learning video processing systems and methods

    公开(公告)号:US11616960B2

    公开(公告)日:2023-03-28

    申请号:US17210478

    申请日:2021-03-23

    申请人: Apple Inc.

    摘要: System and method for improving video encoding and/or video decoding. In embodiments, a video encoding pipeline includes a main encoding pipeline that compresses source image data corresponding with an image frame by processing the source image data based at least in part on encoding parameters to generate encoded image data. Additionally the video encoding pipeline includes a machine learning block communicatively coupled to the main encoding pipeline, in which the machine learning block analyzes content of the image frame by processing the source image data based at least in part on machine learning parameters implemented in the machine learning block when the machine learning block is enabled by the encoding parameters; and the video encoding pipeline adaptively adjusts the encoding parameters based at least in part on the content expected to be present in the image frame to facilitate improving encoding efficiency.

    SYSTEMS AND METHODS FOR SPATIAL PREDICTION

    公开(公告)号:US20220408109A1

    公开(公告)日:2022-12-22

    申请号:US17896350

    申请日:2022-08-26

    申请人: VID SCALE, INC.

    摘要: Systems, methods, and instrumentalities are disclosed relating to intra prediction of a video signal based on mode-dependent subsampling. A block of coefficients associated with a first sub block of a video block, one or more blocks of coefficients associated with one or more remaining sub blocks of the video block, and an indication of a prediction mode for the video block may be received. One or more interpolating techniques, a predicted first sub block, and the predicted sub blocks of the one or more remaining sub blocks may be determined. A reconstructed first sub block and one or more reconstructed remaining sub blocks may be generated. A reconstructed video block may be formed based on the prediction mode, the reconstructed first sub block, and the one or more reconstructed remaining sub blocks.