-
公开(公告)号:US20240340446A1
公开(公告)日:2024-10-10
申请号:US18573838
申请日:2022-06-30
发明人: Kenneth ANDERSSON , Jacob STRÖM , Ruoyang YU
IPC分类号: H04N19/59 , G06V10/74 , H04N19/117 , H04N19/159 , H04N19/176 , H04N19/177 , H04N19/184
CPC分类号: H04N19/59 , G06V10/761 , H04N19/117 , H04N19/159 , H04N19/176 , H04N19/177 , H04N19/184
摘要: Methods and devices for determining video picture resolution. A first source picture is obtained and a first reduced resolution picture is generated based on the first source picture. A first similarity metric is determined for the first reduced resolution picture and the first source picture. A picture resolution is selected based at least in part on the first similarity metric.
-
公开(公告)号:US12113964B2
公开(公告)日:2024-10-08
申请号:US18346697
申请日:2023-07-03
发明人: Seungsoo Jeong , Gahyun Ryu , Minsoo Park , Minwoo Park , Kiho Choi , Narae Choi , Woongil Choi , Anish Tamse , Yinji Piao
IPC分类号: H04N19/105 , H04N19/159 , H04N19/176 , H04N19/177 , H04N19/91
CPC分类号: H04N19/105 , H04N19/159 , H04N19/176 , H04N19/177 , H04N19/91
摘要: Disclosed is an image decoding method according to an embodiment, the image decoding method including: obtaining a first reference block and a second reference block, for bi-directional prediction of a current block; obtaining, from a bitstream, weight information for combining the first reference block with the second reference block; performing entropy decoding on the weight information to obtain a weight index; combining the first reference block with the second reference block according to a candidate value indicated by the weight index among candidate values included in a weight candidate group; and reconstructing the current block based on a result of the combining, wherein a first binary value corresponding to the weight index is entropy-decoded based on a context model, and the remaining binary value corresponding to the weight index is entropy-decoded by a bypass method.
-
公开(公告)号:US12108054B2
公开(公告)日:2024-10-01
申请号:US18179166
申请日:2023-03-06
发明人: Gary J. Sullivan , You Zhou , Chih-Lung Lin
IPC分类号: H04N19/174 , H04N19/105 , H04N19/109 , H04N19/136 , H04N19/139 , H04N19/142 , H04N19/147 , H04N19/17 , H04N19/177 , H04N19/179 , H04N19/513 , H04N19/52 , H04N19/523
CPC分类号: H04N19/174 , H04N19/105 , H04N19/109 , H04N19/136 , H04N19/139 , H04N19/142 , H04N19/147 , H04N19/17 , H04N19/177 , H04N19/179 , H04N19/52 , H04N19/521 , H04N19/523
摘要: Approaches to selection of motion vector (“MV”) precision during video encoding are presented. These approaches can facilitate compression that is effective in terms of rate-distortion performance and/or computational efficiency. For example, a video encoder determines an MV precision for a unit of video from among multiple MV precisions, which include one or more fractional-sample MV precisions and integer-sample MV precision. The video encoder can identify a set of MV values having a fractional-sample MV precision, then select the MV precision for the unit based at least in part on prevalence of MV values (within the set) having a fractional part of zero. Or, the video encoder can perform rate-distortion analysis, where the rate-distortion analysis is biased towards the integer-sample MV precision. Or, the video encoder can collect information about the video and select the MV precision for the unit based at least in part on the collected information.
-
公开(公告)号:US20240314315A1
公开(公告)日:2024-09-19
申请号:US18670611
申请日:2024-05-21
发明人: David M. Baylon , Zhouye Gu , Ajay Luthra , Koohyar Minoo , Yue Yu
IPC分类号: H04N19/124 , G06T5/90 , H04N19/174 , H04N19/177 , H04N19/186 , H04N19/44 , H04N19/70 , H04N19/98
CPC分类号: H04N19/124 , G06T5/90 , H04N19/174 , H04N19/177 , H04N19/186 , H04N19/44 , H04N19/70 , H04N19/98 , G06T2207/20208
摘要: A system and method for regenerating high dynamic range (HDR) video data from encoded video data, extracts, from the encoded video data, a self-referential metadata structure specifying a video data reshaping transfer function. The video data reshaping transfer function is regenerated using data from the metadata structure and the extracted reshaping transfer function is used to generate the HDR video data by applying decoded video data values to the reshaping transfer function.
-
公开(公告)号:US20240292025A1
公开(公告)日:2024-08-29
申请号:US18424332
申请日:2024-01-26
发明人: Thomas WIEGAND , Karsten MUELLER , Philipp MERKLE
IPC分类号: H04N19/597 , H04N13/00 , H04N13/111 , H04N19/105 , H04N19/109 , H04N19/11 , H04N19/124 , H04N19/17 , H04N19/172 , H04N19/177 , H04N19/30 , H04N19/46 , H04N19/513 , H04N19/567 , H04N19/61 , H04N19/80
CPC分类号: H04N19/597 , H04N13/111 , H04N19/105 , H04N19/109 , H04N19/11 , H04N19/124 , H04N19/17 , H04N19/172 , H04N19/177 , H04N19/30 , H04N19/46 , H04N19/513 , H04N19/521 , H04N19/567 , H04N19/61 , H04N19/80 , H04N2013/0081
摘要: Hybrid video decoder supporting intermediate view synthesis of an intermediate view video from a first- and a second-view video which are predictively coded into a multi-view data signal with frames of the second-view video being spatially subdivided into sub-regions and the multi-view data signal having a prediction mode is provided, having: an extractor configured to respectively extract, from the multi-view data signal, for sub-regions of the frames of the second-view video, a disparity vector and a prediction residual; a predictive reconstructor configured to reconstruct the sub-regions of the frames of the second-view video, by generating a prediction from a reconstructed version of a portion of frames of the first-view video using the disparity vectors and a prediction residual for the respective sub-regions; and an intermediate view synthesizer configured to reconstruct first portions of the intermediate view video.
-
6.
公开(公告)号:US20240251097A1
公开(公告)日:2024-07-25
申请号:US18623198
申请日:2024-04-01
发明人: Luong Pham Van , Adarsh Krishnan Ramasubramonian , Bappaditya Ray , Geert Van der Auwera , Marta Karczewicz
IPC分类号: H04N19/527 , H04N19/124 , H04N19/139 , H04N19/177 , H04N19/597
CPC分类号: H04N19/527 , H04N19/124 , H04N19/139 , H04N19/177 , H04N19/597
摘要: A device to code a point cloud data that includes a memory configured to store data representing points of a point cloud, and one or more processors implemented in circuitry and configured to: determine height values of points in a point cloud; code a data structure including data that represents a top threshold and a bottom threshold; classify points having height values between the top threshold and the bottom threshold into the set of ground points; classify points having height values above the top threshold or below the bottom threshold into the set of object points. The one or more processors code the ground points and the object points according to the classifications. The one or more processors code a geometry data unit header that includes data that overrides or refines the data of the data structure for the at least one of the top threshold or the bottom threshold.
-
公开(公告)号:US20240251086A1
公开(公告)日:2024-07-25
申请号:US18628462
申请日:2024-04-05
发明人: Kwan-Jung OH , Gwangsoon Lee , Euee S. JANG
IPC分类号: H04N19/136 , H04N19/177
CPC分类号: H04N19/136 , H04N19/177
摘要: A method of encoding an immersive video according to the present disclosure includes determining whether an input image is a first type, converting the input image into the first type when the input image is a second type different from the first type, encoding a converted image, and generating metadata for the encoded image.
-
公开(公告)号:US20240236363A1
公开(公告)日:2024-07-11
申请号:US18617538
申请日:2024-03-26
发明人: Cheolkon JUNG , Qipu QIN
IPC分类号: H04N19/573 , H04N19/105 , H04N19/114 , H04N19/124 , H04N19/139 , H04N19/172 , H04N19/177 , H04N19/42 , H04N19/52 , H04N19/70
CPC分类号: H04N19/573 , H04N19/105 , H04N19/114 , H04N19/124 , H04N19/139 , H04N19/172 , H04N19/177 , H04N19/42 , H04N19/52 , H04N19/70
摘要: Methods and a decoder for video processing are provided. The method includes: a current picture is received; and a group of pictures associated with the current picture is determined. The group of pictures includes a first key picture at a first time prior to the current picture and a second key picture at a second time later than the current picture. The method includes: first and second reference pictures are generated based on the first and second key pictures. Based on the first and second reference pictures bi-directional predictive pictures in the group of pictures are determined. A motion estimation process based on the current picture and the bi-directional predictive pictures is then performed to generate motion information of the current picture.
-
公开(公告)号:US20240214590A1
公开(公告)日:2024-06-27
申请号:US18392409
申请日:2023-12-21
申请人: 3649954 Canada Inc.
发明人: Frédéric Giasson
IPC分类号: H04N19/40 , H04N19/177
CPC分类号: H04N19/40 , H04N19/177
摘要: A versatile high-throughput multimedia transcoding station, serving a plurality of multimedia sources, employs transcoding resources including a pool of decoders, a pool of signal-adaptors, and a pool of encoders operating concurrently to realize low-latency transcoding of high-flow-rate multimedia streams. A multimedia stream contains a video stream organized into source groups-of-pictures (GOPs). Upon receiving a transcoding request indicating characteristics of a source multimedia stream and desired characteristics of a destination multimedia stream, an orchestrator rapidly allocates a resource for each GOP and coordinates activation of a content-processing assembly which encompasses the transcoding resources and means for distributing each GOP to compatible resources. The orchestrator assembly monitors progress of GOPs' processing and, when needed under high workload fluctuation, instructs a multimedia source to pause transmission. Each of the decoders, signal adaptors, and encoders comprises a respective hardware processor coupled to a memory device storing software instructions and a buffer holding intermediate data.
-
10.
公开(公告)号:US20240196000A1
公开(公告)日:2024-06-13
申请号:US18493874
申请日:2023-10-25
发明人: KIYOSHI IWABUCHI , KOJI OKAWA
IPC分类号: H04N19/46 , G06V20/40 , H04N19/142 , H04N19/167 , H04N19/172 , H04N19/177 , H04N19/23
CPC分类号: H04N19/46 , G06V20/41 , H04N19/142 , H04N19/167 , H04N19/172 , H04N19/177 , H04N19/23 , G06V2201/10
摘要: A video distribution apparatus includes an object detection unit configured to execute object detection processing with respect to a video indicated by video data obtained by the image capturing unit; metadata addition unit configured to add information related to an object obtained by the object detection unit, as metadata, to encoded data of a corresponding frame of the video, and a control unit configured to control the metadata addition unit based on a result of the detection processing executed by the object detection unit, wherein in a case where the result of the detection processing executed by the object detection unit indicates a disappearance of the object, the control unit controls the metadata addition unit to add metadata indicating the disappearance of the object also to a frame that follows the corresponding frame and satisfies a predetermined condition.
-
-
-
-
-
-
-
-
-