-
Publication No.: US12177445B2
Publication date: 2024-12-24
Application No.: US18214311
Application date: 2023-06-26
Applicant: Microsoft Technology Licensing, LLC
Inventor: Daniel Dinu , Juan Carlos Arevalo Baeza , Barry Friemel , William Chen
IPC: H04N19/13 , G06T1/20 , H04N19/105 , H04N19/112 , H04N19/124 , H04N19/15 , H04N19/159 , H04N19/186 , H04N19/31 , H04N19/42 , H04N19/43 , H04N19/436 , H04N19/44 , H04N19/46 , H04N19/51 , H04N19/593 , H04N19/61 , H04N19/82 , H04N19/89 , H04N19/91 , H04N19/137 , H04N19/16 , H04N19/172 , H04N19/174 , H04N19/176 , H04N19/184
Abstract: Video decoding innovations for multithreading implementations and graphics processor unit (“GPU”) implementations are described. For example, for multithreaded decoding, a decoder uses innovations in the areas of layered data structures, picture extent discovery, a picture command queue, and/or task scheduling for multithreading. Or, for a GPU implementation, a decoder uses innovations in the areas of inverse transforms, inverse quantization, fractional interpolation, intra prediction using waves, loop filtering using waves, memory usage and/or performance-adaptive loop filtering. Innovations are also described in the areas of error handling and recovery, determination of neighbor availability for operations such as context modeling and intra prediction, CABAC decoding, computation of collocated information for direct mode macroblocks in B slices, reduction of memory consumption, implementation of trick play modes, and picture dropping for quality adjustment.
-
Publication No.: US12131408B2
Publication date: 2024-10-29
Application No.: US18545848
Application date: 2023-12-19
Inventor: Jinyuan Wu , Jingye Wang , Jingjing Zhuge , Qi Li
IPC: G06T11/00 , G06F3/048 , G06T7/00 , G06T7/90 , G06T11/60 , G06T15/40 , G06T15/50 , H04N19/16 , H04N19/46 , H04N21/23 , H04N21/65
CPC classification number: G06T11/001 , G06T7/0002 , G06T7/90 , G06T2207/10024
Abstract: An image rendering method and apparatus, a device, and a medium. The method includes: determining a pre-captured initial rendering video stream for a background image; obtaining initial rendering coordinates of each rendering pixel in each initial rendering image frame in the video stream, and obtaining a rendering moment of each initial rendering image frame in the initial rendering video stream; determining the amount of noise of each rendering pixel at the corresponding rendering moment according to the corresponding initial rendering coordinates, a preset noise texture, and the corresponding rendering moment, and obtaining target rendering coordinates of each rendering pixel according to the corresponding amount of noise; and updating the initial rendering coordinates of each rendering pixel according to the corresponding target rendering coordinates to obtain a target rendering image, and rendering the background image according to a target rendering video stream composed of all target rendering images.
-
Publication No.: US11902542B2
Publication date: 2024-02-13
Application No.: US17947263
Application date: 2022-09-19
Applicant: Microsoft Technology Licensing, LLC
Inventor: Gary J. Sullivan , Yongjun Wu
IPC: H04N19/172 , H04N19/136 , H04N19/16 , H04N19/46 , H04N19/70
CPC classification number: H04N19/172 , H04N19/136 , H04N19/16 , H04N19/46 , H04N19/70
Abstract: This application relates to video encoding and decoding, and specifically to tools and techniques for using and providing supplemental enhancement information in bitstreams. Among other things, the detailed description presents innovations for bitstreams having supplemental enhancement information (SEI). In particular embodiments, the SEI message includes picture source data (e.g., data indicating whether the associated picture is a progressive scan picture or an interlaced scan picture and/or data indicating whether the associated picture is a duplicate picture). The SEI message can also express a confidence level of the encoder's relative confidence in the accuracy of this picture source data. A decoder can use the confidence level indication to determine whether the decoder should separately identify the picture as progressive or interlaced and/or a duplicate picture or honor the picture source scanning information in the SEI as it is.
-
Publication No.: US11902505B2
Publication date: 2024-02-13
Application No.: US18106025
Application date: 2023-02-06
Inventor: Kiyofumi Abe , Kazuhito Kimura , Hideyuki Ohgose , Hiroshi Arakawa , Koji Arimura
IPC: H04N19/102 , H04N19/172 , H04N19/46 , H04N19/112 , H04N19/16 , H04N19/70 , H04N19/61 , H04N19/577
CPC classification number: H04N19/102 , H04N19/112 , H04N19/16 , H04N19/172 , H04N19/46 , H04N19/577 , H04N19/61 , H04N19/70
Abstract: A video decoding device processes a frame as a picture when video in the progressive format is input, and processes a field as a picture when video in the interlaced format is input. The video decoding device performs display control corresponding to the format of either kind of video by analyzing display control information in a display control information analyzer. The display control information includes sequence unit display control information, which is used in common in the display process of all pictures that belong to a sequence to be decoded, and picture unit display control information, which is used individually in the display process of a picture to be decoded. A second code string analyzer acquires each of the sequence unit display control information and the picture unit display control information from an extended information area in units of pictures.
-
Publication No.: US20230077410A1
Publication date: 2023-03-16
Application No.: US17932531
Application date: 2022-09-15
Applicant: Apple Inc.
Inventor: Matthew J. Yaeger , David M. Cole , Benjamin D. Buckner
IPC: H04N19/167 , H04N19/16 , H04N13/161
Abstract: Encoding an image stream may include receiving an image stream with an original image resolution; generating a plurality of copies of the image stream with the original image resolution; encoding, for each copy of the plurality of copies of the image stream, the copy of the image stream to generate an encoded copy of the image stream, wherein the encoded copy of the image stream comprises a first region having a first image resolution and a second region having a second image resolution, wherein each encoded copy of the plurality of encoded copies of the image stream has a different first region, and providing, to a playback device, at least one encoded copy of the plurality of encoded copies of the image stream.
-
Publication No.: US20230017315A1
Publication date: 2023-01-19
Application No.: US17947263
Application date: 2022-09-19
Applicant: Microsoft Technology Licensing, LLC
Inventor: Gary J. Sullivan , Yongjun Wu
IPC: H04N19/172 , H04N19/136 , H04N19/16 , H04N19/46
Abstract: This application relates to video encoding and decoding, and specifically to tools and techniques for using and providing supplemental enhancement information in bitstreams. Among other things, the detailed description presents innovations for bitstreams having supplemental enhancement information (SEI). In particular embodiments, the SEI message includes picture source data (e.g., data indicating whether the associated picture is a progressive scan picture or an interlaced scan picture and/or data indicating whether the associated picture is a duplicate picture). The SEI message can also express a confidence level of the encoder's relative confidence in the accuracy of this picture source data. A decoder can use the confidence level indication to determine whether the decoder should separately identify the picture as progressive or interlaced and/or a duplicate picture or honor the picture source scanning information in the SEI as it is.
-
Publication No.: US11539909B2
Publication date: 2022-12-27
Application No.: US17301102
Application date: 2021-03-25
Applicant: Axis AB
Inventor: Axel Keskikangas , Song Yuan , John Rehn
IPC: H04N5/45 , H04N19/159 , H04N19/16 , H04N19/177 , H04N5/232 , H04N5/262 , H04N5/272 , H04N7/01
Abstract: Methods and apparatus, including computer program products, for controlling a Pan-Tilt-Zoom (PTZ) camera. A video stream is provided by the PTZ camera. The video stream includes a main image and a smaller picture-in-picture overlay. The main image and the overlay show the same view. The overlay has both a higher frame rate and a lower resolution compared to the main image. A movement control instruction is received, by the PTZ camera, in response to an evaluation of at least one of the main image and the overlay.
-
Publication No.: US11475602B2
Publication date: 2022-10-18
Application No.: US17086399
Application date: 2020-10-31
Applicant: PANAMORPH, INC.
Inventor: Shawn L. Kelly
IPC: G06T9/00 , G06T5/00 , G06T5/20 , H04N19/112 , H04N19/132 , H04N19/16 , H04N19/80 , H04N19/85 , G06T5/10
Abstract: Initial low-quality images of a progressively-displayed high-definition image are masked with corresponding progressively-revealing mask filters or masking algorithms to realistically obscure such low quality and therefore to provide a realistically appearing progressive presentation of the high-definition image.
-
Publication No.: US20220159304A1
Publication date: 2022-05-19
Application No.: US17591259
Application date: 2022-02-02
Inventor: Masato OHKAWA , Hideo SAITOU , Kiyofumi ABE , Takahiro NISHI , Tadamasa TOMA , Yusuke KATO
IPC: H04N19/60 , H04N19/30 , H04N19/186 , H04N19/12 , H04N19/17 , H04N19/184 , H04N19/119 , H04N19/112 , H04N19/16
Abstract: Provided is an encoder including: circuitry; and memory coupled to the circuitry. In operation, the circuitry: performs a mapping process of Luma Mapping with Chroma Scaling (LMCS) for transforming a first pixel value space applied to a luma display image signal into a second pixel value space applied to a luma encoding process signal, using line segments forming a transform curve, each of which corresponds to a different one of sections obtained by partitioning the first pixel value space; and encodes an image, and in the performing of the LMCS, the circuitry determines the transform curve so that among boundary values in the second pixel value space, a first value obtained by dividing a boundary value by a base width defined according to a bit depth of the image is not equal to a second value obtained by dividing another boundary value by the base width.
-
Publication No.: US11315011B2
Publication date: 2022-04-26
Application No.: US15844449
Application date: 2017-12-15
Applicant: WaveOne Inc.
Inventor: Oren Rippel , Lubomir Bourdev , Carissa Lew , Sanjay Nair
IPC: G06N3/08 , G06N3/04 , G06N20/00 , G06K9/62 , G06K9/42 , G06K9/46 , H04N19/12 , H04N19/16 , H04N19/17 , H04N19/19 , H04N19/91 , H04N19/44 , G06K9/66 , G06T5/00 , H04N19/13 , H04N19/14 , H04N19/18 , H04N19/48 , H04N19/15 , H04N19/33 , G06K9/00 , H04N19/126 , H04N19/167 , H04N19/172 , H04N19/196 , H04N19/149 , H04N19/154
Abstract: The compression system trains a machine-learned encoder and decoder through an autoencoder architecture. The encoder can be deployed by a sender system to encode content for transmission to a receiver system, and the decoder can be deployed by the receiver system to decode the encoded content and reconstruct the original content. The encoder is coupled to receive content and output a tensor as a compact representation of the content. The content may be, for example, images, videos, or text. The decoder is coupled to receive a tensor representing content and output a reconstructed version of the content. The compression system trains the autoencoder with a discriminator to reduce compression artifacts in the reconstructed content. The discriminator is coupled to receive one or more items of input content and to output a discrimination prediction that indicates whether the input content is the original or the reconstructed version of the content.