Usage of templates for decoder-side intra mode derivation

    公开(公告)号:US11388421B1

    公开(公告)日:2022-07-12

    申请号:US17148383

    申请日:2021-01-13

    Applicant: Lemon Inc.

    Abstract: Example implementations include a method, apparatus and computer-readable medium of video processing, including constructing, during a conversion between a current video block of a video and a bitstream of the video, at least one template set for the current video block from a plurality of sub-templates. The one or more sub-templates may be selected from a plurality of sub-templates including: a left sub-template, an above sub-template, a right-above sub-template, a left-below sub-template, and a left-above sub-template. The implementations further include deriving at least one intra-prediction mode (IPM) based on cost calculations. The implementations include determining, based on the at least one IPM, a final predictor of the current video block. The implementations include performing the conversion based on the final predictor.

    ADAPTATION PARAMETER SET STORAGE IN VIDEO CODING

    公开(公告)号:US20220109848A1

    公开(公告)日:2022-04-07

    申请号:US17475826

    申请日:2021-09-15

    Applicant: Lemon Inc.

    Inventor: Ye-Kui WANG

    Abstract: Systems, methods and apparatus for encoding or decoding a file format that stores one or more images are described. One example method includes performing a conversion between a visual media file and a bitstream of a visual media data according to a format rule, where the format rule specifies that first adaptation parameter set network abstraction layer units are disallowed from being simultaneously stored in in the visual media file in (1) any one or both of samples of video coding layer tracks or sample entries of the video coding layer tracks, and (2) samples of non-video coding layer tracks, where the video coding layer tracks are tracks containing video coding layer network abstraction layer units, and where the first adaptation parameter set network abstraction layer units includes luma mapping with chroma scaling parameters for a video stream and scaling list parameters for the video stream.

    CHROMA FORMAT AND BIT DEPTH INDICATION IN CODED VIDEO

    公开(公告)号:US20220103865A1

    公开(公告)日:2022-03-31

    申请号:US17476809

    申请日:2021-09-16

    Applicant: Lemon Inc.

    Inventor: Ye-Kui WANG

    Abstract: Systems, methods and apparatus for processing visual media data are described. One example method includes performing a conversion between visual media data and a visual media file including one or more tracks storing one or more bitstreams of the visual media data according to a format rule; wherein the format rule specifies whether a first element indicative of whether a track contains a bitstream corresponding to a specific output layer set controls whether a second element indicative of a chroma format of the track and/or a third element indictive of a bit depth information of the track is included in a configuration record of the track.

    SUBPICTURE ENTITY GROUP SIGNALING IN CODED VIDEO

    公开(公告)号:US20220086497A1

    公开(公告)日:2022-03-17

    申请号:US17476178

    申请日:2021-09-15

    Applicant: Lemon Inc.

    Inventor: Ye-Kui WANG

    Abstract: Systems, methods and apparatus for encoding or decoding a file format that stores one or more images are described. One example method includes performing a conversion between a visual media file and a bitstream of a visual media data according to a format rule, wherein the format rule specifies a characteristic of a syntax element in the visual media file, and wherein the format rule specifies that the syntax element that has a value indicative of a level identification is coded in any one or both of a subpicture common group box or a subpicture multiple groups box using eight bits.

    DECODING CAPABILITY INFORMATION STORAGE IN VIDEO CODING

    公开(公告)号:US20220086446A1

    公开(公告)日:2022-03-17

    申请号:US17475774

    申请日:2021-09-15

    Applicant: Lemon Inc.

    Inventor: Ye-Kui WANG

    Abstract: Systems, methods and apparatus for encoding or decoding a file format that stores one or more images are described. One example method includes performing a conversion between a visual media file and a bitstream of a visual media data according to a format rule, where the format rule specifies that a type of a sample entry determines whether decoding capability information network abstraction layer units are included in either the sample entry of a video track in the visual media file or in a sample of the video track and the sample entry of the video track in the visual media file.

    VERSATILE VIDEO CODING TRACK CODING
    156.
    发明申请

    公开(公告)号:US20220086430A1

    公开(公告)日:2022-03-17

    申请号:US17475719

    申请日:2021-09-15

    Applicant: Lemon Inc.

    Inventor: Ye-Kui WANG

    Abstract: Systems, methods and apparatus for encoding or decoding a file format that stores one or more images are described. One example method includes performing a conversion between a visual media file and a bitstream of a visual media data according to a format rule, where the format rule specifies a condition that controls whether an information item is included in a non-video coding layer track of the visual media file, and where a presence of the non-video coding layer track in the visual media file is indicated by a specific track reference in a video coding layer track of the visual media file.

    GENERATING AUDIO REPRESENTATIONS USING MACHINE LEARNING MODEL

    公开(公告)号:US20250140242A1

    公开(公告)日:2025-05-01

    申请号:US18385749

    申请日:2023-10-31

    Applicant: Lemon Inc.

    Abstract: The present disclosure describes techniques for generating audio representations using a machine learning model. A machine learning model is pre-trained using unlabeled audio data. The pre-training enables the machine learning model to recognize audio patterns and generate initial audio representations. The machine learning model is refined by a task-specific fine-tuning process using labeled data. The task-specific fine-tuning process incorporates multi-task learning heads to optimize the machine learning model. The task-specific fine-tuning process enables the machine learning model to be specialized in specific audio tasks and generate continuous audio representations. The continuous audio representations retain acoustic nuances and subtleties of audio signals. The machine learning model is configured and enabled to generate quantized audio representations by incorporating vector quantization to the task-specific fine-tuning process.

    VIDEO GENERATION METHOD, AND TRAINING METHOD FOR VIDEO GENERATION MODEL

    公开(公告)号:US20250131613A1

    公开(公告)日:2025-04-24

    申请号:US18834154

    申请日:2022-12-15

    Applicant: Lemon Inc.

    Abstract: Provided in the embodiments of the present disclosure are a video generation method, and a training method for a video generation model. The video generation method includes: acquiring a first video, wherein the first video includes a first object image; and inputting the first video into a pre-trained video generation model to obtain a second video, wherein the video generation model is obtained by means of performing training on the basis of a target image and a plurality of sample image pairs obtained from a plurality of first sample images, an object image in the second video is generated on the basis of a preset animal image in the target image and the first object image, and a background image of the second video is generated on the basis of a first background image of the first video.

Patent Agency Ranking