-
41.
Publication No.: US11902543B2
Publication date: 2024-02-13
Application No.: US17947272
Filing date: 2022-09-19
Applicant: Microsoft Technology Licensing, LLC
Inventor: Gary J. Sullivan , Yongjun Wu
IPC: H04N19/172 , H04N19/136 , H04N19/16 , H04N19/46 , H04N19/70
CPC classification number: H04N19/172 , H04N19/136 , H04N19/16 , H04N19/46 , H04N19/70
Abstract: This application relates to video encoding and decoding, and specifically to tools and techniques for using and providing supplemental enhancement information in bitstreams. Among other things, the detailed description presents innovations for bitstreams having supplemental enhancement information (SEI). In particular embodiments, the SEI message includes picture source data (e.g., data indicating whether the associated picture is a progressive scan picture or an interlaced scan picture and/or data indicating whether the associated picture is a duplicate picture). The SEI message can also express a confidence level of the encoder's relative confidence in the accuracy of this picture source data. A decoder can use the confidence level indication to determine whether the decoder should separately identify the picture as progressive or interlaced and/or a duplicate picture or honor the picture source scanning information in the SEI as it is.
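Illustrative sketch (not taken from the patent): the decoder-side decision described in the abstract could look roughly like the following. The field names, the 0 to 3 confidence scale, and the `min_confidence` threshold are all assumptions made for this example, not the syntax of any codec standard.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable


class ScanType(Enum):
    PROGRESSIVE = "progressive"
    INTERLACED = "interlaced"
    UNKNOWN = "unknown"


@dataclass
class PictureSourceSei:
    """Hypothetical container for picture source data carried in an SEI message."""
    scan_type: ScanType
    duplicate: bool
    confidence: int  # assumed scale: 0 (no confidence) to 3 (high confidence)


def resolve_scan_type(
    sei: PictureSourceSei,
    detect: Callable[[], ScanType],
    min_confidence: int = 2,  # assumed threshold, chosen only for illustration
) -> ScanType:
    """Honor the signaled scan type when the encoder is confident enough;
    otherwise fall back to the decoder's own detection."""
    if sei.scan_type is not ScanType.UNKNOWN and sei.confidence >= min_confidence:
        return sei.scan_type
    return detect()
```

With a high-confidence message the signaled value is used as is; with a low-confidence message the decoder's own detector (passed in as `detect`) decides.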
-
Publication No.: US20230164349A1
Publication date: 2023-05-25
Application No.: US18158295
Filing date: 2023-01-23
Applicant: Microsoft Technology Licensing, LLC
Inventor: Lihua Zhu , Yongjun Wu , Gary J. Sullivan
IPC: H04N19/52 , H04N19/119 , H04N19/11 , H04N19/61 , H04N19/70 , H04N19/593 , H04N19/162 , H04N19/167 , H04N19/51 , H04N19/105 , H04N19/159 , H04N19/176
CPC classification number: H04N19/52 , H04N19/119 , H04N19/11 , H04N19/61 , H04N19/70 , H04N19/593 , H04N19/162 , H04N19/167 , H04N19/51 , H04N19/105 , H04N19/159 , H04N19/176
Abstract: Innovations in encoder-side search ranges having horizontal bias or vertical bias are described herein. For example, a video encoder determines a block vector (“BV”) for a current block of a picture, performs intra prediction for the current block using the BV, and encodes the BV. The BV indicates a displacement to a region within the picture. When determining the BV, the encoder checks a constraint that the region is within a BV search range having a horizontal bias or vertical bias. The encoder can select the BV search range from among multiple available BV search ranges, e.g., depending at least in part on BV values of one or more previous blocks, which can be tracked in a histogram data structure.
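Illustrative sketch (not taken from the patent): one way to picture the range selection is a histogram of how previous block vectors were oriented, followed by a constraint check against the chosen range. The range extents, the tie-breaking rule, and the simplification of checking only the displacement (rather than the whole referenced region) are assumptions made for this example.

```python
from collections import Counter

# Hypothetical search ranges, given as (max |dx|, max |dy|) in samples.
SEARCH_RANGES = {
    "horizontal": (64, 16),  # wider than tall: horizontal bias
    "vertical": (16, 64),    # taller than wide: vertical bias
}


def select_search_range(previous_bvs):
    """Pick a biased search range from a histogram of previous BV orientations.

    Ties (including an empty history) default to the horizontal range,
    which is an arbitrary choice for this sketch.
    """
    histogram = Counter(
        "horizontal" if abs(dx) >= abs(dy) else "vertical"
        for dx, dy in previous_bvs
    )
    bias = "vertical" if histogram["vertical"] > histogram["horizontal"] else "horizontal"
    return bias, SEARCH_RANGES[bias]


def bv_within_range(bv, search_range):
    """Check the constraint that a candidate BV stays inside the search range."""
    dx, dy = bv
    max_dx, max_dy = search_range
    return abs(dx) <= max_dx and abs(dy) <= max_dy
```

For example, a history dominated by mostly horizontal displacements selects the wide range, so a candidate such as `(-40, -4)` passes the check while `(-4, -40)` does not.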
-
Publication No.: US20230038995A1
Publication date: 2023-02-09
Application No.: US17581446
Filing date: 2022-01-21
Applicant: Microsoft Technology Licensing, LLC
Inventor: Lihua Zhu , Yongjun Wu , Gary J. Sullivan
IPC: H04N19/51 , H04N19/167 , H04N19/162 , H04N19/159 , H04N19/176
Abstract: Innovations in encoder-side search ranges having horizontal bias or vertical bias are described herein. For example, a video encoder determines a block vector (“BV”) for a current block of a picture, performs intra prediction for the current block using the BV, and encodes the BV. The BV indicates a displacement to a region within the picture. When determining the BV, the encoder checks a constraint that the region is within a BV search range having a horizontal bias or vertical bias. The encoder can select the BV search range from among multiple available BV search ranges, e.g., depending at least in part on BV values of one or more previous blocks, which can be tracked in a histogram data structure.
-
44.
Publication No.: US20230015071A1
Publication date: 2023-01-19
Application No.: US17947272
Filing date: 2022-09-19
Applicant: Microsoft Technology Licensing, LLC
Inventor: Gary J. Sullivan , Yongjun Wu
IPC: H04N19/172 , H04N19/136 , H04N19/16 , H04N19/46
Abstract: This application relates to video encoding and decoding, and specifically to tools and techniques for using and providing supplemental enhancement information in bitstreams. Among other things, the detailed description presents innovations for bitstreams having supplemental enhancement information (SEI). In particular embodiments, the SEI message includes picture source data (e.g., data indicating whether the associated picture is a progressive scan picture or an interlaced scan picture and/or data indicating whether the associated picture is a duplicate picture). The SEI message can also express a confidence level of the encoder's relative confidence in the accuracy of this picture source data. A decoder can use the confidence level indication to determine whether the decoder should separately identify the picture as progressive or interlaced and/or a duplicate picture or honor the picture source scanning information in the SEI as it is.
-
Publication No.: US20220329852A1
Publication date: 2022-10-13
Application No.: US17850622
Filing date: 2022-06-27
Applicant: Microsoft Technology Licensing, LLC
Inventor: Gary J. Sullivan , Yongjun Wu
IPC: H04N19/58 , H04N19/70 , H04N19/46 , H04N19/423 , H04N19/573
Abstract: Innovations for signaling state of a decoded picture buffer (“DPB”) and reference picture lists (“RPLs”). In example implementations, rather than rely on internal state of a decoder to manage and update DPB and RPLs, state information about the DPB and RPLs is explicitly signaled. This permits a decoder to determine which pictures are expected to be available for reference from the signaled state information. For example, an encoder determines state information that identifies which pictures are available for use as reference pictures (optionally considering feedback information from a decoder about which pictures are available). The encoder sets syntax elements that represent the state information. In doing so, the encoder sets identifying information for a long-term reference picture (“LTRP”), where the identifying information is a value of picture order count least significant bits for the LTRP. The encoder then outputs the syntax elements as part of a bitstream.
-
Publication No.: US11115691B2
Publication date: 2021-09-07
Application No.: US16899214
Filing date: 2020-06-11
Applicant: Microsoft Technology Licensing, LLC
Inventor: Yongjun Wu , Naveen Thumpudi , Shyam Sadhwani
IPC: H04N21/2383 , H04N21/235 , H04N21/438 , H04N21/84 , H04N21/2343 , H04N21/236 , H04N21/4402
Abstract: A media processing tool adds custom data to an elementary media bitstream or media container. The custom data indicates nominal range of samples of media content, but the meaning of the custom data is not defined in the codec format or media container format. For example, the custom data indicates the nominal range is full range or limited range. For playback, a media processing tool parses the custom data and determines an indication of media content type. A rendering engine performs color conversion operations whose logic changes based at least in part on the media content type. In this way, a codec format or media container format can in effect be extended to support full nominal range media content as well as limited nominal range media content, and hence preserve full or correct color fidelity, while maintaining backward compatibility and conformance with the codec format or media container format.
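Illustrative sketch (not taken from the patent): the range-dependent conversion logic can be pictured with 8-bit luma, where limited range conventionally spans 16 to 235 and full range spans 0 to 255. The `nominal_range` string stands in for the parsed custom data and is an assumption of this example; the patent does not specify this representation.

```python
def luma_to_full_range(y: int, nominal_range: str) -> int:
    """Map an 8-bit luma sample to full range (0..255) for rendering.

    nominal_range is a hypothetical value parsed from the custom data:
    "full" samples pass through, "limited" samples (16..235) are expanded.
    """
    if nominal_range == "full":
        return y
    if nominal_range == "limited":
        scaled = round((y - 16) * 255 / 219)
        return max(0, min(255, scaled))  # clip out-of-range samples
    raise ValueError(f"unknown nominal range: {nominal_range!r}")
```

Treating full-range content as limited range would stretch it a second time and clip shadows and highlights, which is the kind of color fidelity loss the signaled range is meant to avoid.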
-
Publication No.: US10257421B2
Publication date: 2019-04-09
Application No.: US15378942
Filing date: 2016-12-14
Applicant: Microsoft Technology Licensing, LLC
Inventor: Yongjun Wu , Nikola Borisov , Weidong Zhao , Shyam Sadhwani , Naveen Thumpudi
Abstract: Video image stabilization provides better performance on a generic platform for computing devices by evaluating available multimedia digital signal processing components, and selecting the available components to utilize according to a hierarchy structure for video stabilization performance for processing parts of the video stabilization. The video stabilization has improved motion vector estimation that employs refinement motion vector searching according to a pyramid block structure relationship starting from a downsampled resolution version of the video frames. The video stabilization also improves global motion transform estimation by performing a random sample consensus approach for processing the local motion vectors, and selection criteria for motion vector reliability. The video stabilization achieves the removal of hand shakiness smoothly by real-time one-pass or off-line two-pass temporal smoothing with error detection and correction.
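Illustrative sketch (not taken from the patent): the random sample consensus step can be pictured with a deliberately simplified motion model. Here the global motion is assumed to be a pure translation, whereas the abstract refers to a global motion transform in general; the tolerance, iteration count, and seed are likewise assumptions of this example.

```python
import random


def ransac_translation(vectors, tolerance=1.0, iterations=50, seed=0):
    """Estimate a global translation from local motion vectors.

    Repeatedly picks one vector as a candidate model, counts the vectors
    that agree with it within `tolerance`, and averages the largest
    agreeing set. Outliers (e.g. moving foreground objects) are ignored.
    """
    if not vectors:
        raise ValueError("need at least one motion vector")
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    best_inliers = []
    for _ in range(iterations):
        cx, cy = rng.choice(vectors)
        inliers = [
            (dx, dy)
            for dx, dy in vectors
            if abs(dx - cx) <= tolerance and abs(dy - cy) <= tolerance
        ]
        if len(inliers) > len(best_inliers):
            best_inliers = inliers
    count = len(best_inliers)
    mean_dx = sum(dx for dx, _ in best_inliers) / count
    mean_dy = sum(dy for _, dy in best_inliers) / count
    return mean_dx, mean_dy
```

A plain average over all local vectors would be pulled toward the outliers; the consensus step keeps only the vectors that agree with the dominant motion before averaging.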
-
Publication No.: US10200707B2
Publication date: 2019-02-05
Application No.: US14927127
Filing date: 2015-10-29
Applicant: Microsoft Technology Licensing, LLC
Inventor: Yongjun Wu , Shyam Sadhwani , Bakkama Srinath Reddy
IPC: H04N19/44 , H04N19/423
Abstract: Aspects extend to methods, systems, and computer program products for video bit stream decoding. Aspects include flexible definition and detection of surface alignment requirements for decoding hardware. Surface alignment requirements can be handled by render cropping (e.g., cropping at a video output device), through adjustment and modification of original syntax values in a video bit stream and relaxed media type negotiation in a software (host) decoder. Resolution changes can be hidden with the aligned surface allocation when applicable. Performance can be improved and power consumption reduced by using hidden resolution changes.
-
Publication No.: US10129566B2
Publication date: 2018-11-13
Application No.: US14720722
Filing date: 2015-05-22
Applicant: Microsoft Technology Licensing, LLC
Inventor: Shyam Sadhwani , Yongjun Wu , Wenbo Zhang , Sudhakar V. Prabhu
IPC: H04N7/12 , H04N19/70 , H04N21/854 , H04N19/46 , H04N19/127 , H04N19/44
Abstract: Disclosed herein are innovations in decoding compressed video media data. The disclosed innovations facilitate decoding operations with improved computational efficiency, faster speeds, reduced power, reduced memory usage, and/or reduced latency. In one embodiment, for example, an encoded bitstream of video media data is input from an external video content provider, the encoded bitstream being encoded according to a video codec standard. A decoder is then configured to decode the encoded bitstream based at least in part on supplemental information that identifies a property of the encoded bitstream but that is supplemental to the encoded bitstream (e.g., supplemental information that is not part of the encoded bitstream or its associated media container and that is specific (or related) to the application for which the bitstream is used and/or the standard by which the bitstream is encoded and/or encrypted).
-
50.
Publication No.: US20180234698A1
Publication date: 2018-08-16
Application No.: US15952796
Filing date: 2018-04-13
Applicant: Microsoft Technology Licensing, LLC
Inventor: Gary J. Sullivan , Yongjun Wu
IPC: H04N19/58 , H04N19/423 , H04N19/573 , H04N19/46 , H04N19/70
CPC classification number: H04N19/58 , H04N19/423 , H04N19/46 , H04N19/573 , H04N19/70
Abstract: Innovations for signaling state of a decoded picture buffer (“DPB”) and reference picture lists (“RPLs”). In example implementations, rather than rely on internal state of a decoder to manage and update DPB and RPLs, state information about the DPB and RPLs is explicitly signaled. This permits a decoder to determine which pictures are expected to be available for reference from the signaled state information. For example, an encoder determines state information that identifies which pictures are available for use as reference pictures (optionally considering feedback information from a decoder about which pictures are available). The encoder sets syntax elements that represent the state information. In doing so, the encoder sets identifying information for a long-term reference picture (“LTRP”), where the identifying information is a value of picture order count least significant bits for the LTRP. The encoder then outputs the syntax elements as part of a bitstream.
-