-
公开(公告)号:US20220286799A1
公开(公告)日:2022-09-08
申请号:US17728037
申请日:2022-04-25
Applicant: Samsung Electronics Co., Ltd.
Inventor: Tammy LEE , Sangchul KO , Kyungrae KIM , Sunmin KIM , Jungkyu KIM , Woohyun NAM , Yoonjae SON , Hyunkwon CHUNG , Sunghee HWANG
IPC: H04S3/00 , G10L19/008
Abstract: According to various embodiments of the disclosure, an audio processing apparatus includes at least one processor configured to execute one or more instructions to obtain a second audio signal down-mixed from at least one first audio signal, obtain information related to error removal for the at least one first audio signal, de-mix the at least one first audio signal from the down-mixed second audio signal, and reconstruct the at least one first audio signal by applying the information related to the error removal for the at least one first audio signal to the at least one first audio signal de-mixed from the second audio signal. The information related to the error removal having been generated using at least one of an original signal power of the at least one first audio signal or a second signal power of the at least one first audio signal after decoding.
-
2.
公开(公告)号:US20230281755A1
公开(公告)日:2023-09-07
申请号:US18195182
申请日:2023-05-09
Applicant: SAMSUNG ELECTRONICS CO, LTD.
Inventor: Heechul YANG , Inhak NA , Hyunkwon CHUNG
CPC classification number: G06T3/4046 , G06T5/006 , G06T2207/20024
Abstract: An artificial intelligence (AI) encoding apparatus, including a memory configured to store instructions; and at least one processor configured to execute the instructions to: obtain an original image, previously-encoded frame information, and network environment information; obtain deblocking filter setting information, based on the original image, the previously-encoded frame information, and the network environment information; perform deblocking filtering to the original image, based on the deblocking filter setting information to obtain a deblocking-filtered original image; obtain an AI-downscaled first image by providing the deblocking-filtered original image a downscaling deep neural network (DNN); generate image data by performing first encoding on the AI-downscaled first image; and transmit the deblocking filter setting information, AI data including information related to the AI downscaling, and the image data
-
公开(公告)号:US20250056178A1
公开(公告)日:2025-02-13
申请号:US18929050
申请日:2024-10-28
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Sunghee HWANG , Sangchul KO , Kyungae KIM , Jungkyu KIM , Woohyun NAM , Yoonjae SON , Kyunggeun LEE , Tammy LEE , Hyunkwon CHUNG
IPC: H04S7/00
Abstract: Disclosed are an audio processing apparatus and method including obtaining at least one substream and additional information by parsing a bitstream, obtaining at least one audio signal of at least one channel group (CG) by decompressing the at least one substream, and obtaining a multi-channel audio signal by de-mixing the at least one audio signal of the at least one CG, based on the additional information. The additional information includes a weight index offset identified based on an energy value of a height channel of the multi-channel audio signal and an energy value of a surround channel of the multi-channel audio signal.
-
公开(公告)号:US20220386055A1
公开(公告)日:2022-12-01
申请号:US17749840
申请日:2022-05-20
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Yoonjae SON , Sangchul KO , Woohyun NAM , Kyungrae KIM , Jungkyu KIM , Tammy LEE , Hyunkwon CHUNG , Sunghee HWANG
Abstract: An apparatus for processing audio includes at least one processor configured to obtain a down-mixed audio signal from a bitstream, to obtain down-mixing-related information from the bitstream, to de-mix the down-mixing-related information by using down-mixing-related information, and to reconstruct an audio signal including at least one frame based on the de-mixed audio signal. The down-mixing-related information is information generated in units of frames by using an audio scene type.
-
5.
公开(公告)号:US20240105205A1
公开(公告)日:2024-03-28
申请号:US18225406
申请日:2023-07-24
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Woohyun NAM , Kyungrae KIM , Jungkyu KIM , Sangchul KO , Yoonjae SON , Tammy LEE , Hyunkwon CHUNG , Sunghee HWANG
IPC: G10L25/57 , G10L15/25 , G11B27/031
CPC classification number: G10L25/57 , G10L15/25 , G11B27/031
Abstract: A method of matching a voice for each object included in a video, includes: separating a plurality of voices in a video; determining a dissimilarity between the plurality of voices; selecting a partial duration in an entire duration of the video as a matching duration, based on the dissimilarity between the plurality of voices; matching, within the matching duration, the plurality of voices with a plurality of objects in the video respectively, based on mouth movements of the plurality of objects; and matching the plurality of voices with the plurality of objects respectively in the entire duration of the video, based on results of the matching between the plurality of voices and the plurality of objects within the matching duration.
-
公开(公告)号:US20230238003A1
公开(公告)日:2023-07-27
申请号:US18127374
申请日:2023-03-28
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Woohyun NAM , Yoonjae SON , Hyunkwon CHUNG , Sunghee HWANG
IPC: G10L19/008
CPC classification number: G10L19/008
Abstract: An audio signal processing apparatus is configured to: transform a first audio signal includes n channels to generate a first audio data in a frequency domain, generate a frequency feature signal for each channel from the first audio data in the frequency domain, based on a first deep neural network (DNN), generate a second audio signal includes m channels from the first audio signal, based on a second DNN, and generate an output audio signal by encoding the second audio signal and the frequency feature signal. The first audio signal is a high order ambisonic signal includes a zeroth order signal and a plurality of first order signals. The second audio signal includes a mono signal or a stereo signal. m is smaller than n.
-
公开(公告)号:US20220246158A1
公开(公告)日:2022-08-04
申请号:US17722569
申请日:2022-04-18
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Woohyun NAM , Sangchul KO , Kyungrae KIM , Jungkyu KIM , Yoonjae SON , Tammy LEE , Hyunkwon CHUNG , Sunghee HWANG
IPC: G10L19/008 , H04S3/00 , G06N3/08
Abstract: An audio processing apparatus may obtain second audio signals corresponding to channels included in a second channel group from first audio signals corresponding to channels included in a first channel group, downsample at least one third audio signal corresponding to at least one channel identified based on a correlation with the second channel group from among the channels included in the first channel group, by using an artificial intelligence (AI) model, and generate a bitstream including the second audio signals corresponding to the channels included in the second channel group and the downsampled at least one third audio signal. The first channel group includes a channel group of an original audio signal, and the second channel group is constructed by combining at least two channels from among the channels included in the first channel group.
-
-
-
-
-
-