-
公开(公告)号:US20240244272A1
公开(公告)日:2024-07-18
申请号:US18622817
申请日:2024-03-29
申请人: Bytedance Inc.
IPC分类号: H04N19/90 , H04N19/176
CPC分类号: H04N19/90 , H04N19/176
摘要: Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: obtaining a first machine learning (ML) model for processing a video, wherein the first ML model is trained based on one or more second ML models; and performing, according to the first ML model, a conversion between a current video block of the video and a bitstream of the video.
-
公开(公告)号:US12028540B2
公开(公告)日:2024-07-02
申请号:US17844152
申请日:2022-06-20
IPC分类号: H04N19/42 , H04N19/167 , H04N19/503 , H04N19/593 , H04N19/90
CPC分类号: H04N19/42 , H04N19/167 , H04N19/503 , H04N19/593 , H04N19/90
摘要: A video file is detected by a computer system. The video file that is to be provided to one or more client devices. The video file contains a video stream that includes a plurality of video images. A first video image of the plurality of video images is reconstructed based on a first machine learning technique. The first machine learning technique is based on one or more video images that occur temporally before the first video image in the video stream. A reconstruction status of the first video image of the plurality of video images is identified based on the video file and based on a second machine learning technique. An altered video file is generated in response to the reconstruction status and based on the video file.
-
3.
公开(公告)号:US20240187640A1
公开(公告)日:2024-06-06
申请号:US18281844
申请日:2022-03-16
申请人: VID SCALE, INC.
发明人: Fabien RACAPE , Jean BEGAINT , Simon FELTMAN , Akshay PUSHPARAJA
IPC分类号: H04N19/537 , H04N19/177 , H04N19/184 , H04N19/90
CPC分类号: H04N19/537 , H04N19/177 , H04N19/184 , H04N19/90
摘要: Video encoding and decoding is implemented with auto encoders using luminance information to derive motion information for chrominance prediction. In one embodiment conditional convolutions are used to encode motion flow information. A current condition, for example, GOP structure, is used as input to a succession of fully connected layers to implement the conditional convolution. In a related embodiment, more than one reference frame is used to encode motion flow information.
-
公开(公告)号:US11954930B2
公开(公告)日:2024-04-09
申请号:US17666401
申请日:2022-02-07
申请人: Digimarc Corporation
发明人: Brett A. Bradley , Tomas Filler , Vojtech Holub
CPC分类号: G06V30/224 , G06F18/24 , G06T7/0012 , G06V10/42 , H04N19/44 , H04N19/90 , H04N19/93
摘要: The present disclosure relates to advanced image signal processing technology including: i) rapid localization for machine-readable indicia including, e.g., 1-D and 2-D barcodes; and ii) barcode reading and decoders. One claim recites: an image processing method comprising: obtaining 2-dimensional (2D) image data representing a 1-dimensional (1D) barcode within a first image area; generating a plurality of scanlines across the first image area; for each of the plurality of scanlines, synchronizing the scanline, including decoding an initial set of numerical digits represented by the scanline, in which said synchronizing provides a scale estimate for the scanline; using a path decoder to decode remaining numerical digits within the scanline, the path decoder decoding multiple numerical digits in groups, in which the scale estimate is adapted as the remaining numerical digits are decoded; and providing decoded numerical digits as an identifier represented by the scanline. Of course, other combinations and claims are described within the present disclosure.
-
公开(公告)号:US20230412825A1
公开(公告)日:2023-12-21
申请号:US17844152
申请日:2022-06-20
IPC分类号: H04N19/42 , H04N19/167 , H04N19/90 , H04N19/593 , H04N19/503
CPC分类号: H04N19/42 , H04N19/167 , H04N19/90 , H04N19/593 , H04N19/503
摘要: A video file is detected by a computer system. The video file that is to be provided to one or more client devices. The video file contains a video stream that includes a plurality of video images. A first video image of the plurality of video images is reconstructed based on a first machine learning technique. The first machine learning technique is based on one or more video images that occur temporally before the first video image in the video stream. A reconstruction status of the first video image of the plurality of video images is identified based on the video file and based on a second machine learning technique. An altered video file is generated in response to the reconstruction status and based on the video file.
-
公开(公告)号:US20230379466A1
公开(公告)日:2023-11-23
申请号:US18359384
申请日:2023-07-26
申请人: Electronics and Telecommunications Research Institute , Industry Academy Cooperation Foundation of Sejong University
发明人: Sung Chang LIM , Jung Won KANG , Hyun Suk KO , Jin Ho LEE , Ha Hyun LEE , Dong San JUN , Seung Hyun CHO , Hui Yong KIM , Jin Soo CHOI , Yung Lyul LEE , Nam Uk KIM , Jun Woo CHOI
IPC分类号: H04N19/122 , H04N19/90 , H04N19/132 , H04N19/70 , H04N19/625 , H04N19/103 , H04N19/107 , H04N19/186 , H04N19/176 , H04N19/61
CPC分类号: H04N19/122 , H04N19/90 , H04N19/132 , H04N19/70 , H04N19/625 , H04N19/103 , H04N19/107 , H04N19/186 , H04N19/176 , H04N19/61
摘要: The present invention relates to a method and apparatus for encoding and decoding a video image based on transform. The method for decoding a video includes: determining a transform mode of a current block; inverse-transforming residual data of the current block according to the transform mode of the current block; and rearranging the inverse-transformed residual data of the current block according to the transform mode of the current block, wherein the transform mode includes at least one of SDST (Shuffling Discrete Sine Transform), SDCT (Shuffling Discrete cosine Transform), DST (Discrete Sine Transform) or DCT (Discrete Cosine Transform).
-
公开(公告)号:US20230283790A1
公开(公告)日:2023-09-07
申请号:US18016568
申请日:2021-07-13
申请人: FONDATION B-COM
发明人: Félix HENRY , Gordon CLARE
IPC分类号: H04N19/436 , H04N19/90
CPC分类号: H04N19/436 , H04N19/90
摘要: In a method for decoding a data stream by way of an electronic device (10) including a processor (14), and a parallelized processing unit (16) designed to perform a plurality of operations of the same type in parallel at a given time, the data stream includes a first dataset (Fet) and a second dataset (Fnn) representative of audio or video content. The decoding method includes the processor (14) processing data from the first dataset (Fet), obtaining the audio or video content by processing (E70) data from the second dataset (Fnn) using a process depending at least partially on the data from the first set (Fet) and using an artificial neural network (18) implemented by the parallelized processing unit (16).
-
公开(公告)号:US11616960B2
公开(公告)日:2023-03-28
申请号:US17210478
申请日:2021-03-23
申请人: Apple Inc.
发明人: Jim C. Chou , Alexandros Tourapis
IPC分类号: H04N19/159 , H04N19/154 , H04N19/59 , H04N19/86 , H04N19/46 , G06T9/00 , G06N3/04 , G06N3/08 , H04N19/102 , H04N19/117 , H04N19/189 , H04N19/132 , H04N19/90 , H04N19/136 , H04N19/172 , H04N19/436
摘要: System and method for improving video encoding and/or video decoding. In embodiments, a video encoding pipeline includes a main encoding pipeline that compresses source image data corresponding with an image frame by processing the source image data based at least in part on encoding parameters to generate encoded image data. Additionally the video encoding pipeline includes a machine learning block communicatively coupled to the main encoding pipeline, in which the machine learning block analyzes content of the image frame by processing the source image data based at least in part on machine learning parameters implemented in the machine learning block when the machine learning block is enabled by the encoding parameters; and the video encoding pipeline adaptively adjusts the encoding parameters based at least in part on the content expected to be present in the image frame to facilitate improving encoding efficiency.
-
公开(公告)号:US20220408109A1
公开(公告)日:2022-12-22
申请号:US17896350
申请日:2022-08-26
申请人: VID SCALE, INC.
IPC分类号: H04N19/52 , H04N19/176 , H04N19/119 , H04N19/46 , H04N19/593 , H04N19/11 , H04N19/124 , H04N19/14 , H04N19/80 , H04N19/90 , H04N19/60 , H04N19/59 , H04N19/182
摘要: Systems, methods, and instrumentalities are disclosed relating to intra prediction of a video signal based on mode-dependent subsampling. A block of coefficients associated with a first sub block of a video block, one or more blocks of coefficients associated with one or more remaining sub blocks of the video block, and an indication of a prediction mode for the video block may be received. One or more interpolating techniques, a predicted first sub block, and the predicted sub blocks of the one or more remaining sub blocks may be determined. A reconstructed first sub block and one or more reconstructed remaining sub blocks may be generated. A reconstructed video block may be formed based on the prediction mode, the reconstructed first sub block, and the one or more reconstructed remaining sub blocks.
-
公开(公告)号:US11533514B2
公开(公告)日:2022-12-20
申请号:US16911775
申请日:2020-06-25
发明人: Chi Wang , Pongsak Lasang , Toshiyasu Sugio , Tatsuya Koyama
摘要: An encoding method according to the present disclosure includes: inputting three-dimensional data including three-dimensional coordinate data to a deep neural network (DNN); encoding the three-dimensional data by the DNN to generate encoded three-dimensional data; and outputting the encoded three-dimensional data.
-
-
-
-
-
-
-
-
-