-
公开(公告)号:US11388421B1
公开(公告)日:2022-07-12
申请号:US17148383
申请日:2021-01-13
Applicant: Lemon Inc.
Inventor: Yang Wang , Kai Zhang , Li Zhang , Yuwen He , Hongbin Liu
IPC: H04N19/159 , H04N19/70 , H04N19/176 , H04N19/147
Abstract: Example implementations include a method, apparatus and computer-readable medium of video processing, including constructing, during a conversion between a current video block of a video and a bitstream of the video, at least one template set for the current video block from a plurality of sub-templates. The one or more sub-templates may be selected from a plurality of sub-templates including: a left sub-template, an above sub-template, a right-above sub-template, a left-below sub-template, and a left-above sub-template. The implementations further include deriving at least one intra-prediction mode (IPM) based on cost calculations. The implementations include determining, based on the at least one IPM, a final predictor of the current video block. The implementations include performing the conversion based on the final predictor.
-
公开(公告)号:US20220109848A1
公开(公告)日:2022-04-07
申请号:US17475826
申请日:2021-09-15
Applicant: Lemon Inc.
Inventor: Ye-Kui WANG
IPC: H04N19/132 , H04N19/117 , H04N19/30 , H04N19/169
Abstract: Systems, methods and apparatus for encoding or decoding a file format that stores one or more images are described. One example method includes performing a conversion between a visual media file and a bitstream of a visual media data according to a format rule, where the format rule specifies that first adaptation parameter set network abstraction layer units are disallowed from being simultaneously stored in in the visual media file in (1) any one or both of samples of video coding layer tracks or sample entries of the video coding layer tracks, and (2) samples of non-video coding layer tracks, where the video coding layer tracks are tracks containing video coding layer network abstraction layer units, and where the first adaptation parameter set network abstraction layer units includes luma mapping with chroma scaling parameters for a video stream and scaling list parameters for the video stream.
-
公开(公告)号:US20220103865A1
公开(公告)日:2022-03-31
申请号:US17476809
申请日:2021-09-16
Applicant: Lemon Inc.
Inventor: Ye-Kui WANG
IPC: H04N19/70 , H04N19/132 , H04N19/105 , H04N19/186 , H04N19/169
Abstract: Systems, methods and apparatus for processing visual media data are described. One example method includes performing a conversion between visual media data and a visual media file including one or more tracks storing one or more bitstreams of the visual media data according to a format rule; wherein the format rule specifies whether a first element indicative of whether a track contains a bitstream corresponding to a specific output layer set controls whether a second element indicative of a chroma format of the track and/or a third element indictive of a bit depth information of the track is included in a configuration record of the track.
-
公开(公告)号:US20220086497A1
公开(公告)日:2022-03-17
申请号:US17476178
申请日:2021-09-15
Applicant: Lemon Inc.
Inventor: Ye-Kui WANG
Abstract: Systems, methods and apparatus for encoding or decoding a file format that stores one or more images are described. One example method includes performing a conversion between a visual media file and a bitstream of a visual media data according to a format rule, wherein the format rule specifies a characteristic of a syntax element in the visual media file, and wherein the format rule specifies that the syntax element that has a value indicative of a level identification is coded in any one or both of a subpicture common group box or a subpicture multiple groups box using eight bits.
-
公开(公告)号:US20220086446A1
公开(公告)日:2022-03-17
申请号:US17475774
申请日:2021-09-15
Applicant: Lemon Inc.
Inventor: Ye-Kui WANG
IPC: H04N19/132 , H04N19/513 , H04N19/159 , H04N19/169 , H04N21/2343 , H04N21/234
Abstract: Systems, methods and apparatus for encoding or decoding a file format that stores one or more images are described. One example method includes performing a conversion between a visual media file and a bitstream of a visual media data according to a format rule, where the format rule specifies that a type of a sample entry determines whether decoding capability information network abstraction layer units are included in either the sample entry of a video track in the visual media file or in a sample of the video track and the sample entry of the video track in the visual media file.
-
公开(公告)号:US20220086430A1
公开(公告)日:2022-03-17
申请号:US17475719
申请日:2021-09-15
Applicant: Lemon Inc.
Inventor: Ye-Kui WANG
IPC: H04N19/105 , H04N19/132 , H04N19/169 , H04N19/172
Abstract: Systems, methods and apparatus for encoding or decoding a file format that stores one or more images are described. One example method includes performing a conversion between a visual media file and a bitstream of a visual media data according to a format rule, where the format rule specifies a condition that controls whether an information item is included in a non-video coding layer track of the visual media file, and where a presence of the non-video coding layer track in the visual media file is indicated by a specific track reference in a video coding layer track of the visual media file.
-
公开(公告)号:US20250159311A1
公开(公告)日:2025-05-15
申请号:US18730721
申请日:2023-01-03
Applicant: Lemon Inc.
Inventor: Jingjie CHEN , Tianyang XU , Siyao YANG , Weikai LI , Zihao CHEN , Changhao OU , Yixin ZHAO , Sang Hyup LEE , Yiling CHEN , Yi YUE , Jie LIAO , Shengchuan SHI , Zixiong ZHANG , Quan WANG , Jian SUN , Aoyu WANG
IPC: H04N21/81 , G06T19/20 , H04N5/262 , H04N21/472
Abstract: Provided in the present disclosure are a video generation method, apparatus and device, and a storage medium. The method includes: when a trigger operation for a preset effect entry has been received, displaying, on a photographic page, a preset effect resource for a target POI which corresponds to a first user, wherein the preset effect resource comprises at least one first virtual object, and each first virtual object is displayed on the photographic page in a manner of moving towards the target POI; generating a second virtual object on the basis of a virtual object generation control on the photographic page; when a trigger operation for the second virtual object has been received, displaying the second virtual object on the photographic page in a manner of moving towards the target POI; and then generating a resulting video on the basis of the photographic page.
-
公开(公告)号:US20250156656A1
公开(公告)日:2025-05-15
申请号:US18941556
申请日:2024-11-08
Applicant: Lemon Inc. , Beijing Youzhuju Network Technology Co., Ltd.
Inventor: Zhichao Huang , Rong Ye , Yu Ting Ko , Qianqian Dong , Shanbo Cheng , Mingxuan Wang , Hang Li
Abstract: Embodiments of the present disclosure relate to a speech translation method, an apparatus, an electronic device, and a medium. The method includes generating a speech representation corresponding to a source-language audio based on the audio. The method also includes obtaining prompt content related to a target language. In addition, the method also includes generating a target-language text corresponding to the audio based on the speech representation and the prompt content.
-
公开(公告)号:US20250140242A1
公开(公告)日:2025-05-01
申请号:US18385749
申请日:2023-10-31
Applicant: Lemon Inc.
Inventor: Zongyu Yin , Qingqing Huang , Janne Jayne Harm Renee Spijkervet
Abstract: The present disclosure describes techniques for generating audio representations using a machine learning model. A machine learning model is pre-trained using unlabeled audio data. The pre-training enables the machine learning model to recognize audio patterns and generate initial audio representations. The machine learning model is refined by a task-specific fine-tuning process using labeled data. The task-specific fine-tuning process incorporates multi-task learning heads to optimize the machine learning model. The task-specific fine-tuning process enables the machine learning model to be specialized in specific audio tasks and generate continuous audio representations. The continuous audio representations retain acoustic nuances and subtleties of audio signals. The machine learning model is configured and enabled to generate quantized audio representations by incorporating vector quantization to the task-specific fine-tuning process.
-
公开(公告)号:US20250131613A1
公开(公告)日:2025-04-24
申请号:US18834154
申请日:2022-12-15
Applicant: Lemon Inc.
Inventor: Yizhe ZHU , Bingchen LIU , Xiao YANG
Abstract: Provided in the embodiments of the present disclosure are a video generation method, and a training method for a video generation model. The video generation method includes: acquiring a first video, wherein the first video includes a first object image; and inputting the first video into a pre-trained video generation model to obtain a second video, wherein the video generation model is obtained by means of performing training on the basis of a target image and a plurality of sample image pairs obtained from a plurality of first sample images, an object image in the second video is generated on the basis of a preset animal image in the target image and the first object image, and a background image of the second video is generated on the basis of a first background image of the first video.
-
-
-
-
-
-
-
-
-