-
1.
公开(公告)号:US20240105205A1
公开(公告)日:2024-03-28
申请号:US18225406
申请日:2023-07-24
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Woohyun NAM , Kyungrae KIM , Jungkyu KIM , Sangchul KO , Yoonjae SON , Tammy LEE , Hyunkwon CHUNG , Sunghee HWANG
IPC: G10L25/57 , G10L15/25 , G11B27/031
CPC classification number: G10L25/57 , G10L15/25 , G11B27/031
Abstract: A method of matching a voice for each object included in a video, includes: separating a plurality of voices in a video; determining a dissimilarity between the plurality of voices; selecting a partial duration in an entire duration of the video as a matching duration, based on the dissimilarity between the plurality of voices; matching, within the matching duration, the plurality of voices with a plurality of objects in the video respectively, based on mouth movements of the plurality of objects; and matching the plurality of voices with the plurality of objects respectively in the entire duration of the video, based on results of the matching between the plurality of voices and the plurality of objects within the matching duration.
-
公开(公告)号:US20230238003A1
公开(公告)日:2023-07-27
申请号:US18127374
申请日:2023-03-28
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Woohyun NAM , Yoonjae SON , Hyunkwon CHUNG , Sunghee HWANG
IPC: G10L19/008
CPC classification number: G10L19/008
Abstract: An audio signal processing apparatus is configured to: transform a first audio signal includes n channels to generate a first audio data in a frequency domain, generate a frequency feature signal for each channel from the first audio data in the frequency domain, based on a first deep neural network (DNN), generate a second audio signal includes m channels from the first audio signal, based on a second DNN, and generate an output audio signal by encoding the second audio signal and the frequency feature signal. The first audio signal is a high order ambisonic signal includes a zeroth order signal and a plurality of first order signals. The second audio signal includes a mono signal or a stereo signal. m is smaller than n.
-
公开(公告)号:US20220246158A1
公开(公告)日:2022-08-04
申请号:US17722569
申请日:2022-04-18
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Woohyun NAM , Sangchul KO , Kyungrae KIM , Jungkyu KIM , Yoonjae SON , Tammy LEE , Hyunkwon CHUNG , Sunghee HWANG
IPC: G10L19/008 , H04S3/00 , G06N3/08
Abstract: An audio processing apparatus may obtain second audio signals corresponding to channels included in a second channel group from first audio signals corresponding to channels included in a first channel group, downsample at least one third audio signal corresponding to at least one channel identified based on a correlation with the second channel group from among the channels included in the first channel group, by using an artificial intelligence (AI) model, and generate a bitstream including the second audio signals corresponding to the channels included in the second channel group and the downsampled at least one third audio signal. The first channel group includes a channel group of an original audio signal, and the second channel group is constructed by combining at least two channels from among the channels included in the first channel group.
-
公开(公告)号:US20240127847A1
公开(公告)日:2024-04-18
申请号:US18380929
申请日:2023-10-17
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Kyungrae KIM , Woohyun NAM , Jungkyu KIM , Deokjun EOM
Abstract: An electronic device for processing a video including an image signal and a mixed audio signal, includes: a memory configured to store at least one program for processing the video; and at least one processor configured to: generate, from the image signal and the mixed audio signal, audio-related information indicating a degree of overlap in a plurality of sound sources included in the mixed audio signal by using a first artificial intelligence (AI) model; and separate at least one of the plurality of sound sources included in the mixed audio signal from the mixed audio signal, by applying the audio-related information to a second AI model.
-
公开(公告)号:US20210218878A1
公开(公告)日:2021-07-15
申请号:US17019948
申请日:2020-09-14
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Yangwook KIM , Woohyun NAM , Seungho LEE
Abstract: Disclosed is an electronic apparatus. The electronic apparatus includes: a camera; a first microphone; a second microphone; and a processor configured to: control the electronic apparatus to obtain first sound data through the first microphone, control the electronic apparatus to obtain second sound data through the second microphone, identify an object corresponding to the first sound data and the second sound data in image data obtained through the camera, obtain position information of the identified object from the image data, and change volume information of at least one of the first sound data or the second sound data based on the obtained position information.
-
公开(公告)号:US20210021953A1
公开(公告)日:2021-01-21
申请号:US16847947
申请日:2020-04-14
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Yangwook KIM , Seungho LEE , Woohyun NAM
Abstract: An electronic apparatus is provided. The electronic apparatus includes a camera, a processor and a memory configured to store at least one instruction executable by the processor where and the processor is configured to input audio data to an artificial intelligence model corresponding to user information, and obtain output audio data from the artificial intelligence model, and the artificial intelligence model is a model learned based on first learning audio data obtained by recording a sound source with a first recording device, second learning audio data obtained by recording the sound source with a second recording device, and information on a recording device for obtaining the second learning audio data, and the second learning audio data is binaural audio data.
-
公开(公告)号:US20230360665A1
公开(公告)日:2023-11-09
申请号:US18195121
申请日:2023-05-09
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Kyungrae KIM , Woohyun NAM
Abstract: An audio processing method includes obtaining a first audio signal corresponding to a first frame; extracting a first feature vector by inputting the first audio signal to a first neural network; obtaining a temporal correlation vector representing a similarity between the first feature vector and at least one second feature vector extracted from at least one second audio signal corresponding to at least one second frame that is temporally before the first frame; and classifying a scene of the first audio signal by inputting the first feature vector, the at least one second feature vector, and the temporal correlation vector to a second neural network.
-
公开(公告)号:US20220329966A1
公开(公告)日:2022-10-13
申请号:US17851795
申请日:2022-06-28
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Inventor: Yangwook KIM , Seungho LEE , Woohyun NAM
Abstract: An electronic apparatus is provided. The electronic apparatus includes a camera, a processor and a memory configured to store at least one instruction executable by the processor where and the processor is configured to input audio data to an artificial intelligence model corresponding to user information, and obtain output audio data from the artificial intelligence model, and the artificial intelligence model is a model learned based on first learning audio data obtained by recording a sound source with a first recording device, second learning audio data obtained by recording the sound source with a second recording device, and information on a recording device for obtaining the second learning audio data, and the second learning audio data is binaural audio data.
-
公开(公告)号:US20220286799A1
公开(公告)日:2022-09-08
申请号:US17728037
申请日:2022-04-25
Applicant: Samsung Electronics Co., Ltd.
Inventor: Tammy LEE , Sangchul KO , Kyungrae KIM , Sunmin KIM , Jungkyu KIM , Woohyun NAM , Yoonjae SON , Hyunkwon CHUNG , Sunghee HWANG
IPC: H04S3/00 , G10L19/008
Abstract: According to various embodiments of the disclosure, an audio processing apparatus includes at least one processor configured to execute one or more instructions to obtain a second audio signal down-mixed from at least one first audio signal, obtain information related to error removal for the at least one first audio signal, de-mix the at least one first audio signal from the down-mixed second audio signal, and reconstruct the at least one first audio signal by applying the information related to the error removal for the at least one first audio signal to the at least one first audio signal de-mixed from the second audio signal. The information related to the error removal having been generated using at least one of an original signal power of the at least one first audio signal or a second signal power of the at least one first audio signal after decoding.
-
公开(公告)号:US20200213736A1
公开(公告)日:2020-07-02
申请号:US16727346
申请日:2019-12-26
Applicant: Samsung Electronics Co., Ltd.
Inventor: Yangwook KIM , Woohyun NAM , Seungho LEE
Abstract: Provided are an artificial intelligence (AI) system using a machine learning algorithm such as deep learning and an application of the AI system. Provided is a method of converting sound data, the method including acquiring binaural sound data; converting the binaural sound data by using a pre-generated training network model, based on a parameter indicating a context at a time of acquiring the binaural sound data; and outputting the converted binaural sound data.
-
-
-
-
-
-
-
-
-