-
公开(公告)号:US20230396812A1
公开(公告)日:2023-12-07
申请号:US18451933
申请日:2023-08-18
Inventor: Kai Zhang , Li Zhang , Zhipin Deng , Na Zhang , Yang Wang
IPC: H04N19/96 , H04N19/119 , H04N19/176 , H04N19/70
CPC classification number: H04N19/96 , H04N19/70 , H04N19/176 , H04N19/119
Abstract: A mechanism for processing video data is disclosed. An Unsymmetric Binary Tree (UBT) partition of a parent block is determined to create two sub-blocks with different dimensions. At least one of the sub-blocks comprises a side that is non-dyadic. A conversion is performed between a visual media data and a bitstream based on the sub-blocks.
-
公开(公告)号:US20230396779A1
公开(公告)日:2023-12-07
申请号:US18453453
申请日:2023-08-22
Inventor: Kai Zhang , Li Zhang , Zhipin Deng , Na Zhang , Yang Wang
IPC: H04N19/176 , H04N19/119 , H04N19/60
CPC classification number: H04N19/176 , H04N19/119 , H04N19/60
Abstract: A method for processing video data is disclosed. The method includes determining a width (w) and a height (h) of a coding group based on a width (W) and a height (H) of a block that is non-dyadic and contains residual, processing the residual of the block in unit of the coding group, and performing a conversion between a visual media data and a bitstream according to a rule based on the processing.
-
公开(公告)号:US20230396764A1
公开(公告)日:2023-12-07
申请号:US18451942
申请日:2023-08-18
Inventor: Kai Zhang , Li Zhang , Zhipin Deng , Na Zhang , Yang Wang
IPC: H04N19/12 , H04N7/01 , H04N19/176 , H04N19/18 , H04N19/625 , H04N19/157
CPC classification number: H04N19/12 , H04N7/01 , H04N19/176 , H04N19/18 , H04N19/625 , H04N19/157
Abstract: A mechanism for processing video data is disclosed. A Unsymmetric Binary Tree (UBT) partition of a parent block is determined to create two sub-blocks with different dimensions. At least one of the sub-blocks includes a side that is non-dyadic. A conversion is performed between a visual media data and a bitstream based on the sub-blocks.
-
公开(公告)号:US20230345032A1
公开(公告)日:2023-10-26
申请号:US18343307
申请日:2023-06-28
Inventor: Ye-Kui Wang , Yang Wang , Li Zhang
IPC: H04N19/172 , H04N19/136 , H04N19/105 , H04N19/70 , H04N19/184 , H04N19/503 , H04N19/109
CPC classification number: H04N19/503 , H04N19/105 , H04N19/109 , H04N19/136 , H04N19/172 , H04N19/184 , H04N19/70
Abstract: A mechanism for processing video data is disclosed. An indication is signaled. The indication indicates whether a picture following a dependent random access point (DRAP) picture in decoding order and preceding the DRAP picture in output order is permitted to refer to a reference picture positioned prior to the DRAP picture in decoding order for inter prediction. A conversion is performed between a visual media data and a bitstream based on the indication.
-
公开(公告)号:US20230345025A1
公开(公告)日:2023-10-26
申请号:US18339012
申请日:2023-06-21
Inventor: Ye-Kui Wang , Kai Zhang , Li Zhang , Yang Wang , Jizheng Xu , Zhipin Deng
Abstract: A mechanism for processing video data is disclosed. Video decoder initialization information is signaled between an encoder to a decoder. The video decoder initialization information contains a range of initialization parameters for a plurality of video units when the plurality of video units are coded according to a same video codec and a same profile. A conversion is performed between a visual media data and a visual media data file based on the range of initialization parameters.
-
公开(公告)号:US11776535B2
公开(公告)日:2023-10-03
申请号:US17885965
申请日:2022-08-11
IPC: G10L15/18 , G10L15/183
CPC classification number: G10L15/1815 , G10L15/183
Abstract: A semantic understanding method and apparatus, and a device and a storage medium are provided. The method includes: acquiring a recognition character string that matches speech information; acquiring, from an entity vocabulary library, at least one entity vocabulary respectively corresponding to each recognition character in the recognition character string; and according to a situation of each entity vocabulary hitting the recognition character string, determining a matching entity vocabulary as a semantic understanding result of the speech information. By means of the method, insofar as a completely matching entity vocabulary is not acquired, a matching entity vocabulary can still be determined according to an entity vocabulary library, and semantic information of speech is thus accurately understood; and the method also has relatively high fault tolerance for situations such as wrong words, added words, and omitted words, such that the semantic understanding accuracy of speech information is improved.
-
公开(公告)号:US20230300337A1
公开(公告)日:2023-09-21
申请号:US18322461
申请日:2023-05-23
Inventor: Zhipin Deng , Li Zhang , Hongbin Liu , Kai Zhang , Jizheng Xu , Yang Wang , Yue Wang
IPC: H04N19/132 , H04N19/105 , H04N19/137 , H04N19/176 , H04N19/70
CPC classification number: H04N19/132 , H04N19/105 , H04N19/137 , H04N19/176 , H04N19/70
Abstract: A method of processing video data is described. The method includes performing a conversion between a current video block of a video and a bitstream of the video. A geometric partitioning mode index for the current video block is coded in the bitstream and a binarization of the geometric partitioning mode index is performed according to a rule. The geometric partitioning mode index specifies a geometric splitting shape of a geometric partitioning mode applied to the current video block. The rule specifies that the geometric partitioning mode index is coded with a fixed-length binarization.
-
公开(公告)号:US20230179766A1
公开(公告)日:2023-06-08
申请号:US17977630
申请日:2022-10-31
Inventor: Yang Wang , Li Zhang , Zhipin Deng , Kai Zhang , Hongbin Liu
IPC: H04N19/119 , H04N19/70 , H04N19/176
CPC classification number: H04N19/119 , H04N19/70 , H04N19/176
Abstract: Systems, methods and apparatus for video processing are described. The video processing may include video encoding, video decoding or video transcoding. One example method of video processing includes performing a conversion between a current block of a video and a bitstream of the video according to a rule. The rule specifies that selection of a context for coding a syntax element specifying whether the block is split horizontally or vertically is based on a number of allowed vertical splits and a number of allowed horizontal splits. The number of allowed vertical splits includes a number of allowed binary vertical splits and a number of allowed ternary vertical splits, and the number of allowed horizontal splits includes a number of allowed binary horizontal splits and a number of allowed ternary horizontal splits.
-
公开(公告)号:US11653002B2
公开(公告)日:2023-05-16
申请号:US17541092
申请日:2021-12-02
Inventor: Li Zhang , Kai Zhang , Hongbin Liu , Yang Wang , Yue Wang
IPC: H04N19/52 , H04N19/137 , H04N19/105 , H04N19/176 , H04N19/132
CPC classification number: H04N19/137 , H04N19/105 , H04N19/132 , H04N19/176
Abstract: A method of video processing includes determining, for a conversion between a current block of a video and a bitstream representation of the video, an operation associated with a list of motion candidates based on a condition related to a characteristic of the current block. The list of motion candidates is constructed for a coding technique or based on information from previously processed blocks of the video. The method also includes performing the conversion based on the determining.
-
公开(公告)号:US20230075048A1
公开(公告)日:2023-03-09
申请号:US17968536
申请日:2022-10-18
Inventor: Yang Wang , Li Zhang , Kai Zhang , Hongbin Liu , Yue Wang
IPC: H04N19/132 , H04N19/105 , H04N19/186 , H04N19/176
Abstract: A method of video processing is provided to include determining, for a conversion between a video block of a video and a bitstream of the video, a parameter of a cross-component linear model (CCLM) for the video block according to a rule, and performing the conversion based on the determining, and wherein the rule specifies to use a variable representing a neighbouring luma sample in the determining of the parameter of the CCLM only in case that the variable has a certain value.
-
-
-
-
-
-
-
-
-