摘要:
A three-dimensional audio signal processing apparatus using a Head Related Transfer Function (HRTF) includes an audio decoder for decoding audio data to restore original audio signals and a three-dimensional audio generator for generating three-dimensional signals corresponding to the audio signals restored by using the HRTF modeled according to physical characteristics of an user, wherein the HRTF modeled according to physical characteristics of an user is an individualized HRTF.
摘要:
Provided are a multi-object audio encoding and decoding method and an apparatus thereof. The multi-object encoding method includes generating a down-mix signal and a residual signal by down-mixing a foreground audio object and a background audio object, and generating a bitstream including the down-mix signal and the residual signal.
摘要:
Video data encoding and decoding methods and apparatuses are provided. In the video data encoding and decoding methods, codes books are provided to an encoder and a decoder. In the encoder, an index corresponding to a vector that is most similar to a current vector of an input moving picture among the vectors of the code book is encoded. In the decoder, the index is decoded. Accordingly, it is possible to increase compression ratio and reduce calculation complexity.
摘要:
Provided is a method and apparatus for generating a side information bitstream of a multi-object audio signal. The apparatus for generating a side information bitstream of a multi-object audio signal includes a spatial cue information input unit configured to receive spatial cue information generated in an encoder of the multi-object audio signal, a preset information input unit configured to receive preset information for the multi-object audio signal, and a side information bitstream generator configured to generate the side information bitstream based on the spatial cue information and the preset information. The side information bitstream includes a header region and a frame region, and the preset information is included in the frame region.
摘要:
Provided is a method and apparatus for generating a side information bitstream of a multi-object audio signal. The apparatus for generating a side information bitstream of a multi-object audio signal includes a spatial cue information input unit configured to receive spatial cue information generated in an encoder of the multi-object audio signal, a preset information input unit configured to receive preset information for the multi-object audio signal, and a side information bitstream generator configured to generate the side information bitstream based on the spatial cue information and the preset information. The side information bitstream includes a header region and a frame region, and the preset information is included in the frame region.
摘要:
Provided are a method of generating and playing an object-based audio content that may effectively store preset information about an object-based audio content, and a computer-readable recording medium for storing data having a file format structure for an object-based audio service. The method of generating the object-based audio content may include: receiving a plurality of audio objects (310) generating at least one preset using the plurality of audio objects (320) and storing a preset parameter with respect to an attribute of the at least preset and the plurality of audio objects (330). The preset parameter may be stored in a form of a box that is defined in a media file format about the object-based audio content. Through this, it is possible to effectively store a preset about a plurality of audio objects.
摘要:
Provided is an apparatus of separating a musical sound source, which may re-construct mixed signals into target sound sources and other sound sources directly using sound source information performed using a predetermined musical instrument when the sound source information is present, thereby more effectively separating sound sources included in the mixed signal. The apparatus may include a Nonnegative Matrix Partial Co-Factorization (NMPCF) analysis unit to perform an NMPCF analysis on a mixed signal and a predetermined sound source signal using a sound source separation model, and to obtain a plurality of entity matrices based on the analysis result, and a target instrument signal separating unit to separate, from the mixed signal, a target instrument signal corresponding to the predetermined sound source signal by calculating an inner product between the plurality of entity matrices.
摘要:
Provided are a method of generating and playing an object-based audio content that may effectively store preset information about an object-based audio content, and a computer-readable recording medium for storing data having a file format structure for an object-based audio service. The method of generating the object-based audio content may include: receiving a plurality of audio objects (310) generating at least one preset using the plurality of audio objects (320) and storing a preset parameter with respect to an attribute of the at least preset and the plurality of audio objects (330). The preset parameter may be stored in a form of a box that is defined in a media file format about the object-based audio content. Through this, it is possible to effectively store a preset about a plurality of audio objects.
摘要:
Provided are an apparatus and method of separating, from a mixed signal, a sound source generated using a rhythm musical instrument based on characteristics of the rhythm musical instrument repeated in an aspect of time. The apparatus may include a separation unit to separate a plurality of mixed signals into a plurality of segments, a Nonnegative Matrix Partial Co-Factorization (NMPCF) analysis unit to perform an NMPCF analysis on the plurality of segments, and to obtain a plurality of entity matrices based on the analysis result, a target instrument signal separating unit to separate, from the mixed signals, a target instrument signal, by calculating an inner product between the plurality of entity matrices, and a signal association unit to associate the target instrument signals separated from each of the plurality of segments.
摘要:
Disclosed is an audio object editing apparatus of a multi-object audio coding apparatus. The audio object editing apparatus of the multi-object audio coding apparatus may include an object information extracting unit to receive an object bit stream and to extract object information from the object bit stream, a downmix processing unit to receive a downmix signal, and to control the downmix signal using object editing information and the object information, and a bit stream processing unit to edit the object information according to the object editing information, and to generate a controlled object bit stream based on the edited object information.