专利检索 ipc:H04N19/90 第 1 页

1.

发明公开
METHOD, DEVICE, AND MEDIUM FOR VIDEO PROCESSING 审中-公开

公开(公告)号：US20240244272A1

公开(公告)日：2024-07-18

申请号：US18622817

申请日：2024-03-29

申请人： Bytedance Inc.

发明人： Yue LI , Kai ZHANG , Li ZHANG

IPC分类号： H04N19/90 , H04N19/176

CPC分类号： H04N19/90 , H04N19/176

摘要： Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: obtaining a first machine learning (ML) model for processing a video, wherein the first ML model is trained based on one or more second ML models; and performing, according to the first ML model, a conversion between a current video block of the video and a bitstream of the video.

2.

发明授权
Video size reduction by reconstruction 有权

公开(公告)号：US12028540B2

公开(公告)日：2024-07-02

申请号：US17844152

申请日：2022-06-20

申请人： INTERNATIONAL BUSINESS MACHINES CORPORATION

发明人： Sathya Santhar , Sarbajit K. Rakshit , Sridevi Kannan , Samuel Mathew Jawaharlal

IPC分类号： H04N19/42 , H04N19/167 , H04N19/503 , H04N19/593 , H04N19/90

CPC分类号： H04N19/42 , H04N19/167 , H04N19/503 , H04N19/593 , H04N19/90

摘要： A video file is detected by a computer system. The video file that is to be provided to one or more client devices. The video file contains a video stream that includes a plurality of video images. A first video image of the plurality of video images is reconstructed based on a first machine learning technique. The first machine learning technique is based on one or more video images that occur temporally before the first video image in the video stream. A reconstruction status of the first video image of the plurality of video images is identified based on the video file and based on a second machine learning technique. An altered video file is generated in response to the reconstruction status and based on the video file.

3.

发明公开
TEMPORAL STRUCTURE-BASED CONDITIONAL CONVOLUTIONAL NEURAL NETWORKS FOR VIDEO COMPRESSION 审中-公开

公开(公告)号：US20240187640A1

公开(公告)日：2024-06-06

申请号：US18281844

申请日：2022-03-16

申请人： VID SCALE, INC.

发明人： Fabien RACAPE , Jean BEGAINT , Simon FELTMAN , Akshay PUSHPARAJA

IPC分类号： H04N19/537 , H04N19/177 , H04N19/184 , H04N19/90

CPC分类号： H04N19/537 , H04N19/177 , H04N19/184 , H04N19/90

摘要： Video encoding and decoding is implemented with auto encoders using luminance information to derive motion information for chrominance prediction. In one embodiment conditional convolutions are used to encode motion flow information. A current condition, for example, GOP structure, is used as input to a succession of fully connected layers to implement the conditional convolution. In a related embodiment, more than one reference frame is used to encode motion flow information.

4.

发明授权
Decoding 1D-barcodes in digital capture systems 有权

公开(公告)号：US11954930B2

公开(公告)日：2024-04-09

申请号：US17666401

申请日：2022-02-07

申请人： Digimarc Corporation

发明人： Brett A. Bradley , Tomas Filler , Vojtech Holub

IPC分类号： G06V30/224 , G06F18/24 , G06T7/00 , G06V10/42 , H04N19/44 , H04N19/90 , H04N19/93

CPC分类号： G06V30/224 , G06F18/24 , G06T7/0012 , G06V10/42 , H04N19/44 , H04N19/90 , H04N19/93

摘要： The present disclosure relates to advanced image signal processing technology including: i) rapid localization for machine-readable indicia including, e.g., 1-D and 2-D barcodes; and ii) barcode reading and decoders. One claim recites: an image processing method comprising: obtaining 2-dimensional (2D) image data representing a 1-dimensional (1D) barcode within a first image area; generating a plurality of scanlines across the first image area; for each of the plurality of scanlines, synchronizing the scanline, including decoding an initial set of numerical digits represented by the scanline, in which said synchronizing provides a scale estimate for the scanline; using a path decoder to decode remaining numerical digits within the scanline, the path decoder decoding multiple numerical digits in groups, in which the scale estimate is adapted as the remaining numerical digits are decoded; and providing decoded numerical digits as an identifier represented by the scanline. Of course, other combinations and claims are described within the present disclosure.

5.

发明公开
VIDEO SIZE REDUCTION BY RECONSTRUCTION 审中-公开

公开(公告)号：US20230412825A1

公开(公告)日：2023-12-21

申请号：US17844152

申请日：2022-06-20

申请人： INTERNATIONAL BUSINESS MACHINES CORPORATION

发明人： Sathya Santhar , Sarbajit K. Rakshit , Sridevi Kannan , Samuel Mathew Jawaharlal

IPC分类号： H04N19/42 , H04N19/167 , H04N19/90 , H04N19/593 , H04N19/503

CPC分类号： H04N19/42 , H04N19/167 , H04N19/90 , H04N19/593 , H04N19/503

摘要： A video file is detected by a computer system. The video file that is to be provided to one or more client devices. The video file contains a video stream that includes a plurality of video images. A first video image of the plurality of video images is reconstructed based on a first machine learning technique. The first machine learning technique is based on one or more video images that occur temporally before the first video image in the video stream. A reconstruction status of the first video image of the plurality of video images is identified based on the video file and based on a second machine learning technique. An altered video file is generated in response to the reconstruction status and based on the video file.

6.

发明公开
METHOD AND APPARATUS FOR TRANSFORM-BASED IMAGE ENCODING/DECODING 审中-公开

公开(公告)号：US20230379466A1

公开(公告)日：2023-11-23

申请号：US18359384

申请日：2023-07-26

申请人： Electronics and Telecommunications Research Institute , Industry Academy Cooperation Foundation of Sejong University

发明人： Sung Chang LIM , Jung Won KANG , Hyun Suk KO , Jin Ho LEE , Ha Hyun LEE , Dong San JUN , Seung Hyun CHO , Hui Yong KIM , Jin Soo CHOI , Yung Lyul LEE , Nam Uk KIM , Jun Woo CHOI

IPC分类号： H04N19/122 , H04N19/90 , H04N19/132 , H04N19/70 , H04N19/625 , H04N19/103 , H04N19/107 , H04N19/186 , H04N19/176 , H04N19/61

CPC分类号： H04N19/122 , H04N19/90 , H04N19/132 , H04N19/70 , H04N19/625 , H04N19/103 , H04N19/107 , H04N19/186 , H04N19/176 , H04N19/61

摘要： The present invention relates to a method and apparatus for encoding and decoding a video image based on transform. The method for decoding a video includes: determining a transform mode of a current block; inverse-transforming residual data of the current block according to the transform mode of the current block; and rearranging the inverse-transformed residual data of the current block according to the transform mode of the current block, wherein the transform mode includes at least one of SDST (Shuffling Discrete Sine Transform), SDCT (Shuffling Discrete cosine Transform), DST (Discrete Sine Transform) or DCT (Discrete Cosine Transform).

7.

发明公开
PARALLELIZED VIDEO DECODING USING A NEURAL NETWORK 审中-公开

公开(公告)号：US20230283790A1

公开(公告)日：2023-09-07

申请号：US18016568

申请日：2021-07-13

申请人： FONDATION B-COM

发明人： Félix HENRY , Gordon CLARE

IPC分类号： H04N19/436 , H04N19/90

CPC分类号： H04N19/436 , H04N19/90

摘要： In a method for decoding a data stream by way of an electronic device (10) including a processor (14), and a parallelized processing unit (16) designed to perform a plurality of operations of the same type in parallel at a given time, the data stream includes a first dataset (Fet) and a second dataset (Fnn) representative of audio or video content. The decoding method includes the processor (14) processing data from the first dataset (Fet), obtaining the audio or video content by processing (E70) data from the second dataset (Fnn) using a process depending at least partially on the data from the first set (Fet) and using an artificial neural network (18) implemented by the parallelized processing unit (16).

8.

发明授权
Machine learning video processing systems and methods 有权

公开(公告)号：US11616960B2

公开(公告)日：2023-03-28

申请号：US17210478

申请日：2021-03-23

申请人： Apple Inc.

发明人： Jim C. Chou , Alexandros Tourapis

IPC分类号： H04N19/159 , H04N19/154 , H04N19/59 , H04N19/86 , H04N19/46 , G06T9/00 , G06N3/04 , G06N3/08 , H04N19/102 , H04N19/117 , H04N19/189 , H04N19/132 , H04N19/90 , H04N19/136 , H04N19/172 , H04N19/436

摘要： System and method for improving video encoding and/or video decoding. In embodiments, a video encoding pipeline includes a main encoding pipeline that compresses source image data corresponding with an image frame by processing the source image data based at least in part on encoding parameters to generate encoded image data. Additionally the video encoding pipeline includes a machine learning block communicatively coupled to the main encoding pipeline, in which the machine learning block analyzes content of the image frame by processing the source image data based at least in part on machine learning parameters implemented in the machine learning block when the machine learning block is enabled by the encoding parameters; and the video encoding pipeline adaptively adjusts the encoding parameters based at least in part on the content expected to be present in the image frame to facilitate improving encoding efficiency.

9.

发明申请
SYSTEMS AND METHODS FOR SPATIAL PREDICTION 有权

公开(公告)号：US20220408109A1

公开(公告)日：2022-12-22

申请号：US17896350

申请日：2022-08-26

申请人： VID SCALE, INC.

发明人： Yan Ye , Qian Chen , Jie Dong

IPC分类号： H04N19/52 , H04N19/176 , H04N19/119 , H04N19/46 , H04N19/593 , H04N19/11 , H04N19/124 , H04N19/14 , H04N19/80 , H04N19/90 , H04N19/60 , H04N19/59 , H04N19/182

摘要： Systems, methods, and instrumentalities are disclosed relating to intra prediction of a video signal based on mode-dependent subsampling. A block of coefficients associated with a first sub block of a video block, one or more blocks of coefficients associated with one or more remaining sub blocks of the video block, and an indication of a prediction mode for the video block may be received. One or more interpolating techniques, a predicted first sub block, and the predicted sub blocks of the one or more remaining sub blocks may be determined. A reconstructed first sub block and one or more reconstructed remaining sub blocks may be generated. A reconstructed video block may be formed based on the prediction mode, the reconstructed first sub block, and the one or more reconstructed remaining sub blocks.

10.

发明授权
Encoding method, decoding method, information processing method, encoding device, decoding device, and information processing system 有权

公开(公告)号：US11533514B2

公开(公告)日：2022-12-20

申请号：US16911775

申请日：2020-06-25

申请人： Panasonic Intellectual Property Corporation of America

发明人： Chi Wang , Pongsak Lasang , Toshiyasu Sugio , Tatsuya Koyama

IPC分类号： H04N19/90 , G06N3/02

摘要： An encoding method according to the present disclosure includes: inputting three-dimensional data including three-dimensional coordinate data to a deep neural network (DNN); encoding the three-dimensional data by the DNN to generate encoded three-dimensional data; and outputting the encoded three-dimensional data.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类