-
公开(公告)号:US20240244272A1
公开(公告)日:2024-07-18
申请号:US18622817
申请日:2024-03-29
Applicant: Bytedance Inc.
IPC: H04N19/90 , H04N19/176
CPC classification number: H04N19/90 , H04N19/176
Abstract: Embodiments of the present disclosure provide a solution for video processing. A method for video processing is proposed. The method comprises: obtaining a first machine learning (ML) model for processing a video, wherein the first ML model is trained based on one or more second ML models; and performing, according to the first ML model, a conversion between a current video block of the video and a bitstream of the video.
-
公开(公告)号:US12028540B2
公开(公告)日:2024-07-02
申请号:US17844152
申请日:2022-06-20
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Sathya Santhar , Sarbajit K. Rakshit , Sridevi Kannan , Samuel Mathew Jawaharlal
IPC: H04N19/42 , H04N19/167 , H04N19/503 , H04N19/593 , H04N19/90
CPC classification number: H04N19/42 , H04N19/167 , H04N19/503 , H04N19/593 , H04N19/90
Abstract: A video file is detected by a computer system. The video file that is to be provided to one or more client devices. The video file contains a video stream that includes a plurality of video images. A first video image of the plurality of video images is reconstructed based on a first machine learning technique. The first machine learning technique is based on one or more video images that occur temporally before the first video image in the video stream. A reconstruction status of the first video image of the plurality of video images is identified based on the video file and based on a second machine learning technique. An altered video file is generated in response to the reconstruction status and based on the video file.
-
3.
公开(公告)号:US20240187640A1
公开(公告)日:2024-06-06
申请号:US18281844
申请日:2022-03-16
Applicant: VID SCALE, INC.
Inventor: Fabien RACAPE , Jean BEGAINT , Simon FELTMAN , Akshay PUSHPARAJA
IPC: H04N19/537 , H04N19/177 , H04N19/184 , H04N19/90
CPC classification number: H04N19/537 , H04N19/177 , H04N19/184 , H04N19/90
Abstract: Video encoding and decoding is implemented with auto encoders using luminance information to derive motion information for chrominance prediction. In one embodiment conditional convolutions are used to encode motion flow information. A current condition, for example, GOP structure, is used as input to a succession of fully connected layers to implement the conditional convolution. In a related embodiment, more than one reference frame is used to encode motion flow information.
-
公开(公告)号:US11954930B2
公开(公告)日:2024-04-09
申请号:US17666401
申请日:2022-02-07
Applicant: Digimarc Corporation
Inventor: Brett A. Bradley , Tomas Filler , Vojtech Holub
CPC classification number: G06V30/224 , G06F18/24 , G06T7/0012 , G06V10/42 , H04N19/44 , H04N19/90 , H04N19/93
Abstract: The present disclosure relates to advanced image signal processing technology including: i) rapid localization for machine-readable indicia including, e.g., 1-D and 2-D barcodes; and ii) barcode reading and decoders. One claim recites: an image processing method comprising: obtaining 2-dimensional (2D) image data representing a 1-dimensional (1D) barcode within a first image area; generating a plurality of scanlines across the first image area; for each of the plurality of scanlines, synchronizing the scanline, including decoding an initial set of numerical digits represented by the scanline, in which said synchronizing provides a scale estimate for the scanline; using a path decoder to decode remaining numerical digits within the scanline, the path decoder decoding multiple numerical digits in groups, in which the scale estimate is adapted as the remaining numerical digits are decoded; and providing decoded numerical digits as an identifier represented by the scanline. Of course, other combinations and claims are described within the present disclosure.
-
公开(公告)号:US20230412825A1
公开(公告)日:2023-12-21
申请号:US17844152
申请日:2022-06-20
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Sathya Santhar , Sarbajit K. Rakshit , Sridevi Kannan , Samuel Mathew Jawaharlal
IPC: H04N19/42 , H04N19/167 , H04N19/90 , H04N19/593 , H04N19/503
CPC classification number: H04N19/42 , H04N19/167 , H04N19/90 , H04N19/593 , H04N19/503
Abstract: A video file is detected by a computer system. The video file that is to be provided to one or more client devices. The video file contains a video stream that includes a plurality of video images. A first video image of the plurality of video images is reconstructed based on a first machine learning technique. The first machine learning technique is based on one or more video images that occur temporally before the first video image in the video stream. A reconstruction status of the first video image of the plurality of video images is identified based on the video file and based on a second machine learning technique. An altered video file is generated in response to the reconstruction status and based on the video file.
-
公开(公告)号:US20230379466A1
公开(公告)日:2023-11-23
申请号:US18359384
申请日:2023-07-26
Applicant: Electronics and Telecommunications Research Institute , Industry Academy Cooperation Foundation of Sejong University
Inventor: Sung Chang LIM , Jung Won KANG , Hyun Suk KO , Jin Ho LEE , Ha Hyun LEE , Dong San JUN , Seung Hyun CHO , Hui Yong KIM , Jin Soo CHOI , Yung Lyul LEE , Nam Uk KIM , Jun Woo CHOI
IPC: H04N19/122 , H04N19/90 , H04N19/132 , H04N19/70 , H04N19/625 , H04N19/103 , H04N19/107 , H04N19/186 , H04N19/176 , H04N19/61
CPC classification number: H04N19/122 , H04N19/90 , H04N19/132 , H04N19/70 , H04N19/625 , H04N19/103 , H04N19/107 , H04N19/186 , H04N19/176 , H04N19/61
Abstract: The present invention relates to a method and apparatus for encoding and decoding a video image based on transform. The method for decoding a video includes: determining a transform mode of a current block; inverse-transforming residual data of the current block according to the transform mode of the current block; and rearranging the inverse-transformed residual data of the current block according to the transform mode of the current block, wherein the transform mode includes at least one of SDST (Shuffling Discrete Sine Transform), SDCT (Shuffling Discrete cosine Transform), DST (Discrete Sine Transform) or DCT (Discrete Cosine Transform).
-
公开(公告)号:US20230283790A1
公开(公告)日:2023-09-07
申请号:US18016568
申请日:2021-07-13
Applicant: FONDATION B-COM
Inventor: Félix HENRY , Gordon CLARE
IPC: H04N19/436 , H04N19/90
CPC classification number: H04N19/436 , H04N19/90
Abstract: In a method for decoding a data stream by way of an electronic device (10) including a processor (14), and a parallelized processing unit (16) designed to perform a plurality of operations of the same type in parallel at a given time, the data stream includes a first dataset (Fet) and a second dataset (Fnn) representative of audio or video content. The decoding method includes the processor (14) processing data from the first dataset (Fet), obtaining the audio or video content by processing (E70) data from the second dataset (Fnn) using a process depending at least partially on the data from the first set (Fet) and using an artificial neural network (18) implemented by the parallelized processing unit (16).
-
公开(公告)号:US20190200028A1
公开(公告)日:2019-06-27
申请号:US16293153
申请日:2019-03-05
Applicant: Sun Patent Trust
Inventor: Kengo TERADA , Youji SHIBAHARA , Kyoko TANIKAWA , Hisao SASAI , Toshiyasu SUGIO , Toru MATSUNOBU
IPC: H04N19/196 , H04N19/463 , H04N19/18 , H04N19/90 , H04N19/124 , H04N19/176 , H04N19/136
CPC classification number: H04N19/196 , H04N19/124 , H04N19/136 , H04N19/176 , H04N19/18 , H04N19/463 , H04N19/70 , H04N19/90
Abstract: An image coding method includes: coding (i) coefficient information, (ii) a first flag indicating whether to quantize one or more blocks using quantization, (iii) a second flag indicating whether the plurality of quantization matrices are included in a sequence parameter set, and (iv) a third flag indicating whether the plurality of quantization matrices are included in a picture parameter set; and quantizing the plurality of coefficients, wherein when the one or more blocks are quantized using a plurality of default matrices, the following are coded in the coding: (i) the first flag indicating that the one or more blocks are quantized using the plurality of quantization matrices, (ii) the second flag indicating that the plurality of quantization matrices are not included in the sequence parameter set, and (iii) the third flag indicating that the plurality of quantization matrices are not included in the picture parameter set.
-
公开(公告)号:US20190199904A1
公开(公告)日:2019-06-27
申请号:US16228999
申请日:2018-12-21
Applicant: CANON KABUSHIKI KAISHA
Inventor: Tatsuya Nishiguchi , Emi Kondo , Tomoaki Takahashi , Yosuke Takagi
CPC classification number: H04N5/2351 , G06F3/14 , G06T5/007 , G06T2207/20208 , G09G2340/02 , G09G2340/0428 , G09G2340/06 , H04N5/2355 , H04N5/355 , H04N5/93 , H04N19/90 , H04N19/98
Abstract: An electronic apparatus includes: a converting unit that converts a first type of image into a converted image; a connecting unit that connects with an external apparatus; a setting unit that sets a connection mode; and a control unit that controls so that in a case where the connection is in a first connection mode, the first type of image is outputted from the connecting unit without converting the first type of image by the converting unit, and in a case where the connection is in a second connection mode, the first type of image is converted by the converting unit, and is outputted from the connecting unit, wherein source of the image to be outputted from the connecting unit can be switched from a second type of image to the first type of image, while maintaining the connection in the second connection mode.
-
公开(公告)号:US20190052913A1
公开(公告)日:2019-02-14
申请号:US15672689
申请日:2017-08-09
Applicant: Vital Images, Inc.
Inventor: William D. Hachfeld
IPC: H04N19/93 , H04N19/172 , H04N19/91
CPC classification number: H04N19/93 , G16H30/20 , H04N19/172 , H04N19/176 , H04N19/184 , H04N19/59 , H04N19/593 , H04N19/80 , H04N19/90 , H04N19/91
Abstract: Techniques and configurations for compression of image data in a progressive, lossless manner are disclosed. In an example, three-dimensional medical images may be compressed and decompressed with high-speed operations, through a compression technique performed on a cube (chunk) of voxels that includes generating a subsampled or filtered cube of voxels, and generating and optimizing a delta data set between the cube of voxels and the subsampled cube of voxels. This optimized delta data set is operable with a decompression technique to losslessly recreate the cube of voxels. Further, the compression technique may be progressively performed with multiple iterations, to allow multiple lower resolution versions of the images prior to loading or receiving the entire compressed data that is reconstructable in a lossless form. Use of this technique may result in dramatically reduced time to first image when visualizing 3D images and performing image data transfers.
-
-
-
-
-
-
-
-
-