Abstract:
A method of generating a context-customized digital twin is provided. The method includes receiving pieces of basic data by using a data receiver, classifying the pieces of basic data into a plurality of layers and tagging the classified pieces of basic data with tags by using a preprocessor, generating context-customized digital twin models from the pieces of basic data corresponding to tags selected from among the tags by using a twin model generator, and storing the tags, the tagged pieces of basic data, and the context-customized digital twin models in a storage.
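The classify, tag, select, and store flow can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the record fields, the `layer_of` and `tags_of` callbacks, and the idea that a "model" is simply a per-layer grouping of the selected data. The abstract does not specify any of these.

```python
from collections import defaultdict

def preprocess(records, layer_of, tags_of):
    """Classify each record into a layer and attach a set of tags.

    layer_of and tags_of are hypothetical callbacks supplied by the caller.
    """
    return [
        {"data": rec, "layer": layer_of(rec), "tags": set(tags_of(rec))}
        for rec in records
    ]

def generate_twin_model(tagged, selected_tags):
    """Group the data whose tags intersect selected_tags by layer.

    A real twin model generator would build a simulation or analytic model;
    the grouping here only stands in for that step.
    """
    wanted = set(selected_tags)
    model = defaultdict(list)
    for item in tagged:
        if item["tags"] & wanted:
            model[item["layer"]].append(item["data"])
    return dict(model)

# Hypothetical basic data.
records = [
    {"id": 1, "kind": "sensor", "site": "plant-A"},
    {"id": 2, "kind": "geometry", "site": "plant-A"},
    {"id": 3, "kind": "sensor", "site": "plant-B"},
]
tagged = preprocess(records, lambda r: r["kind"], lambda r: [r["site"]])
model = generate_twin_model(tagged, ["plant-A"])

# The "storage" is a plain dict holding tags, tagged data, and models.
storage = {
    "tags": sorted({t for item in tagged for t in item["tags"]}),
    "tagged_data": tagged,
    "models": {"plant-A": model},
}
```

Selecting a different tag set yields a different model from the same stored data, which is one plausible reading of "context-customized".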
Abstract:
An audio signal encoding/decoding device and method using a filter bank are disclosed. The audio signal encoding method includes generating a plurality of first audio signals by performing filtering on an input audio signal using an analysis filter bank, generating a plurality of second audio signals by performing downsampling on the first audio signals, and outputting a bitstream by encoding and quantizing the second audio signals.
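A minimal sketch of the filter, downsample, and quantize chain follows. It assumes a two-band Haar-style analysis filter bank and a uniform quantizer; the abstract does not name the filters, the number of bands, or the quantizer, and the entropy-coding step that would turn the indices into an actual bitstream is omitted.

```python
def fir_filter(x, h):
    """Causal FIR filter: y[n] = sum_k h[k] * x[n - k], zero before the start."""
    return [
        sum(h[k] * x[n - k] for k in range(len(h)) if n - k >= 0)
        for n in range(len(x))
    ]

def filter_bank_encode(x, step=0.25):
    """Return quantizer indices for the low and high bands of x."""
    # First audio signals: outputs of the (assumed) two-band analysis bank.
    low = fir_filter(x, [0.5, 0.5])
    high = fir_filter(x, [0.5, -0.5])
    # Second audio signals: downsample by 2, keeping the odd-indexed samples
    # so that each kept value covers one full pair of input samples.
    low_ds, high_ds = low[1::2], high[1::2]
    # Uniform quantization to integer indices (assumed step size).
    return ([round(v / step) for v in low_ds],
            [round(v / step) for v in high_ds])

low_idx, high_idx = filter_bank_encode([1.0, 1.0, 2.0, 0.0])
# low_idx == [4, 4], high_idx == [0, -4]
```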
Abstract:
A system and method for synchronizing an audio signal and a video signal are provided. A decoding method in the system may include decoding an audio signal and a video signal received from an encoding apparatus, extracting first unique information of the audio signal from the decoded video signal, generating second unique information of the audio signal based on the decoded audio signal, determining a delay between the audio signal and the video signal by comparing the first unique information to the second unique information, and synchronizing the audio signal and the video signal based on the delay. The first unique information may be generated based on an audio signal that is not encoded by the encoding apparatus, and may be inserted into the video signal.
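The delay determination step can be sketched as a lag search between two fingerprint sequences. The sketch assumes the "unique information" is a per-frame numeric fingerprint (for example a frame energy) and that the match criterion is mean absolute difference; neither is stated in the abstract, and `estimate_delay` is a hypothetical helper name.

```python
def estimate_delay(first_info, second_info, max_lag):
    """Return the lag d (in frames) for which second_info[i + d] best
    matches first_info[i], searching d in [-max_lag, max_lag]."""
    best_lag, best_cost = 0, float("inf")
    for lag in range(-max_lag, max_lag + 1):
        pairs = [
            (first_info[i], second_info[i + lag])
            for i in range(len(first_info))
            if 0 <= i + lag < len(second_info)
        ]
        if not pairs:
            continue  # no overlap at this lag
        cost = sum(abs(a - b) for a, b in pairs) / len(pairs)
        if cost < best_cost:
            best_lag, best_cost = lag, cost
    return best_lag

# Fingerprint carried in the video signal (from the un-encoded audio).
first = [1, 5, 2, 8, 3, 7, 4, 6]
# Fingerprint recomputed from decoded audio that arrives two frames late.
second = [0, 0] + first
delay = estimate_delay(first, second, max_lag=4)  # 2
```

A positive result means the audio lags the video by that many frames, so the video would be held back (or the audio advanced) by the same amount. With very short sequences, large lags are scored on few overlapping frames, so a practical version would likely require a minimum overlap.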
Abstract:
The encoding method includes computing the first feature information of an input signal using a recurrent encoding model, quantizing the first feature information and producing the first feature bitstream, computing the first output signal from the quantized first feature information using a recurrent decoding model, computing the second feature information of the input signal using a nonrecurrent encoding model, quantizing the second feature information and producing the second feature bitstream, computing the second output signal from the quantized second feature information using a nonrecurrent decoding model, determining an encoding mode based on the input signal, the first and second output signals, and the first and second feature bitstreams, and outputting an overall bitstream by multiplexing an encoding mode bit and one of the first feature bitstream and the second feature bitstream depending on the encoding mode.
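The mode decision and multiplexing steps can be sketched as follows. The abstract says only that the mode is determined from the input signal, the two output signals, and the two feature bitstreams; the rate-distortion cost used here (mean squared error plus a weighted bit count) and the weight `lam` are assumptions, and bitstreams are represented as strings of "0" and "1" for readability.

```python
def mse(a, b):
    """Mean squared error between two equal-length signals."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)

def select_and_multiplex(x, out1, bits1, out2, bits2, lam=0.01):
    """Pick the cheaper mode under an assumed cost D + lam * R and
    prepend a one-bit mode flag to the chosen feature bitstream."""
    cost1 = mse(x, out1) + lam * len(bits1)  # recurrent path
    cost2 = mse(x, out2) + lam * len(bits2)  # nonrecurrent path
    mode = 0 if cost1 <= cost2 else 1
    return str(mode) + (bits1 if mode == 0 else bits2)

x = [1.0, 2.0, 3.0]
overall = select_and_multiplex(
    x,
    out1=[1.0, 2.0, 3.0], bits1="10101010",  # exact but 8 bits
    out2=[1.0, 2.0, 2.0], bits2="1100",      # lossy but 4 bits
)
# overall == "010101010": mode bit 0 followed by the first feature bitstream
```

Raising `lam` shifts the decision toward the shorter bitstream, which is the usual trade-off such a mode switch exposes.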
Abstract:
The inventive concept relates to an audio coding method to which CNN-based frequency spectrum recovery is applied. An encoder transmits only a part of the frequency spectral coefficients generated in transform coding to a decoder, and the decoder recovers the frequency spectral coefficients that were not transmitted. Furthermore, the signs of the frequency spectral coefficients are transmitted from the encoder to the decoder according to a sign transmission rule.
Abstract:
A method and apparatus for widening a viewing angle in a video conferencing system are provided. The method for widening a viewing angle in a video conferencing system includes: generating reference data from images of a video conference participant captured by a camera included in the video conferencing system; generating movement data based on the video conference participant's movements sensed by the camera; extracting a first control parameter by comparing the reference data with the movement data; transmitting the first control parameter to the other end of the conference; receiving a second control parameter generated at the other end of the conference; and controlling the camera by the second control parameter.
Abstract:
The present invention provides a scalable digital twin system structure and a scalable digital twin service method that are capable of, based on a digital twin, performing real-time control of the real world while providing the information a user needs to determine the optimal countermeasure for solving real-world problems in stages, thereby helping to solve real-world problems rapidly. In order to respond preemptively to real-world problems by providing decision support information with improved reliability according to a timeline based on a digital twin of a scalable structure, the operation of the digital twin is divided into several stages according to complexity, and the result of each stage is transferred to an application service and to the next stage, so that higher stages provide more reliable results.
Abstract:
A method and apparatus for performing binaural rendering of an audio signal are provided. The method includes identifying an input signal that is based on an object, and metadata that includes distance information indicating a distance to the object, generating a binaural filter that is based on the metadata, using a binaural room impulse response, obtaining a binaural filter to which a low-pass filter (LPF) is applied, using a frequency response control that is based on the distance information, and generating a binaural-rendered output signal by performing a convolution of the input signal and the binaural filter to which the LPF is applied.
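The rendering chain can be sketched with plain convolutions. The sketch assumes the binaural filter is the binaural room impulse response itself, that the LPF is a moving-average kernel, and that a larger distance maps to a longer kernel (stronger low-pass); the abstract does not specify the filter design or the distance mapping, so `lpf_for_distance` is purely hypothetical.

```python
def convolve(x, h):
    """Full linear convolution of two sequences."""
    y = [0.0] * (len(x) + len(h) - 1)
    for i, xv in enumerate(x):
        for j, hv in enumerate(h):
            y[i + j] += xv * hv
    return y

def lpf_for_distance(distance_m):
    """Hypothetical mapping: one extra moving-average tap per metre."""
    taps = 1 + int(distance_m)
    return [1.0 / taps] * taps

def render_binaural(x, brir_left, brir_right, distance_m):
    """Apply the distance-dependent LPF to each ear's filter, then
    convolve the object signal with the filtered responses."""
    lpf = lpf_for_distance(distance_m)
    left_filter = convolve(brir_left, lpf)
    right_filter = convolve(brir_right, lpf)
    return convolve(x, left_filter), convolve(x, right_filter)

# Toy single-tap responses stand in for measured BRIRs.
left, right = render_binaural([1.0, 0.0], [1.0], [0.5], distance_m=1.0)
# left == [0.5, 0.5, 0.0], right == [0.25, 0.25, 0.0]
```

Applying the LPF to the filter rather than to the input signal follows the order given in the abstract; because convolution is associative, the output is the same either way.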
Abstract:
An audio signal encoding and decoding method using a neural network model, a method of training the neural network model, and an encoder and decoder performing the methods are disclosed. The encoding method includes computing the first feature information of an input signal using a recurrent encoding model, computing an output signal from the first feature information using a recurrent decoding model, calculating a residual signal by subtracting the output signal from the input signal, computing the second feature information of the residual signal using a nonrecurrent encoding model, and converting the first feature information and the second feature information to a bitstream.
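The two-stage residual structure can be sketched with simple stand-ins. A coarse uniform quantizer replaces the recurrent encoding and decoding models, and a fine uniform quantizer replaces the nonrecurrent encoding model; these substitutions and the step sizes are assumptions chosen only to show the data flow, not the neural networks the abstract describes.

```python
def quantize(signal, step):
    """Uniform quantization to integer indices."""
    return [round(v / step) for v in signal]

def dequantize(indices, step):
    """Map integer indices back to signal values."""
    return [i * step for i in indices]

def residual_encode(x, coarse_step=1.0, fine_step=0.25):
    # Stage 1 (stand-in for the recurrent models): first feature
    # information and the output signal reconstructed from it.
    first = quantize(x, coarse_step)
    output = dequantize(first, coarse_step)
    # Residual signal: input minus the stage-1 output.
    residual = [a - b for a, b in zip(x, output)]
    # Stage 2 (stand-in for the nonrecurrent model) codes the residual.
    second = quantize(residual, fine_step)
    return first, second

def residual_decode(first, second, coarse_step=1.0, fine_step=0.25):
    """Sum the two stage reconstructions."""
    return [
        a + b
        for a, b in zip(dequantize(first, coarse_step),
                        dequantize(second, fine_step))
    ]

first, second = residual_encode([0.75, -1.25, 2.0])
restored = residual_decode(first, second)
# first == [1, -1, 2], second == [-1, -1, 0], restored == [0.75, -1.25, 2.0]
```

The second stage only has to represent what the first stage missed, which is why the residual indices stay small.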
Abstract:
A method of predicting a channel parameter of an original signal from a downmix signal is disclosed. The method may include generating, based on the downmix signal of the original signal, an input feature map to be used to predict the channel parameter of the original signal, determining an output feature map including a predicted parameter to be used to predict the channel parameter by applying the input feature map to a neural network, generating a label map including information associated with the channel parameter of the original signal, and predicting the channel parameter of the original signal by comparing the output feature map and the label map.