Abstract:
The present disclosure relates to an information processing apparatus and an information processing method which are capable of improving an efficiency of acquiring a predetermined type of audio data among a plurality of types of audio data. Audio data of a predetermined track is acquired in a file in which a plurality of types of audio data are divided into a plurality of tracks depending on the types and the tracks are arranged. The present disclosure is applicable to, for example, an information processing system including a file generation device that generates a file, a Web server that records a file generated by the file generation device, and a video playback terminal that plays back a file.
Abstract:
The present technology relates to a device, a method, and a program for expanding a frequency band, which are capable of obtaining high-quality sound with a small processing amount. A low band extraction band-pass filter processing unit passes a predetermined band of a low band of an input signal and generates a low band sub band signal. A band-pass filter calculation circuit calculates band-pass filter coefficients of band-pass filters having sub bands of high bands as a pass band based on an estimate value of high band sub band power, and an addition unit obtains one filter coefficient by adding the band-pass filter coefficients. A poly-phase configuration level adjustment filter performs up-sampling and level adjustment by performing filtering on a flattened signal obtained from a low band sub band signal using the filter coefficient obtained by the addition unit, and generates a high band signal. An addition unit obtains an output signal by adding the high band signal to the low band signal. The present technology can be applied to a frequency band expanding device.
Abstract:
An input unit receives input of an assumed listening position of sound of an object, which is a sound source, and outputs assumed listening position information indicating the assumed listening position. A position information correction unit corrects position information of each object on the basis of the assumed listening position information to obtain corrected position information. A gain/frequency characteristic correction unit performs gain correction and frequency characteristic correction on a waveform signal of an object on the basis of the position information and the corrected position information. A spatial acoustic characteristic addition unit further adds a spatial acoustic characteristic to the waveform signal resulting from the gain correction and the frequency characteristic correction on the basis of the position information of the object and the assumed listening position information. The present technology is applicable to an audio processing device.
Abstract:
An input unit receives input of an assumed listening position of sound of an object, which is a sound source, and outputs assumed listening position information indicating the assumed listening position. A position information correction unit corrects position information of each object on the basis of the assumed listening position information to obtain corrected position information. A gain/frequency characteristic correction unit performs gain correction and frequency characteristic correction on a waveform signal of an object on the basis of the position information and the corrected position information. A spatial acoustic characteristic addition unit further adds a spatial acoustic characteristic to the waveform signal resulting from the gain correction and the frequency characteristic correction on the basis of the position information of the object and the assumed listening position information. The present technology is applicable to an audio processing device.
Abstract:
An audio signal processing apparatus that includes a band division section, an analysis section, a gain adjustment amount calculation section, and a gain adjustment section. The band division section is configured to generate a resonance in-band signal by band division on an audio input signal. The analysis section is configured to extract an amount of features from each of the resonance in-band signal and the input signal. The gain adjustment amount calculation section is configured to calculate a gain adjustment amount for a resonance frequency band in the input signal, the gain adjustment amount being calculated based on the amount of features of the resonance in-band signal and the amount of features of the input signal. The gain adjustment section is configured to perform a gain adjustment on the resonance frequency band in the input signal based on the gain adjustment amount.
Abstract:
The present technology relates to an information processing device, an information processing method, and a program that enable reduction of an amount of data to be transmitted in transmission of data of a plurality of audio objects. An information processing device according to one aspect of the present technology: combines audio objects with sounds that are undistinguishable at a predetermined supposed listening position among a plurality of audio objects for the predetermined supposed listening position among a plurality of supposed listening positions; and transmits data of a combined audio object obtained by the combination, along with data of other audio objects with sounds that are distinguishable at the predetermined supposed listening position. The present technology can be applied to a device that can process object-based audio data.
Abstract:
The present disclosure relates to a file generation device and a file generation method which enable acquisition of a video stream having an optimum bit rate when acquiring an audio stream encoded by a lossless compression technique and a video stream. An MPD file generation unit generates AveBandwidth and DurationForAveBandwidth representing the bit rate of an audio stream encoded by a lossless DSD technique. The present disclosure can be applied to, for example, a file generation device or the like that generates a segment file of moving image content by a technique conforming to MPEG-DASH.
Abstract:
The present technology relates to an audio signal output device and a method, an encoding device and a method, a decoding device and a method, and a program for realizing audio reproduction with a more realistic feeling.In a case where an audio signal generated to be output as a sound from an ideal speaker that is a virtual speaker placed in an ideal position is input, the distance between the ideal speaker and a real reproduction speaker is determined. Gain adjustment is then performed on the audio signal with the gain corresponding to the determined distance, the audio signal subjected to the gain adjustment is reproduced by the reproduction speaker. Accordingly, even in a case where there is a difference in position between the ideal speaker and the reproduction speaker, audio reproduction with a more realistic feeling can be realized. The present technology can be applied to reproduction devices.
Abstract:
The present technology relates to a signal processing apparatus and method, and a program that are capable of reproducing sound at an optional listening position with a high sense of reality. The signal processing apparatus includes a rendering unit that generates reproduction data of sound at an optional listening position in a target space on the basis of recording signals of microphones attached to a plurality of moving bodies in the target space. The present technology can be applied to a reproduction apparatus.
Abstract:
The present technology relates a reproduction apparatus, a reproduction method, an information processing apparatus, an information processing method, and a program which can realize reproduction of highly flexible audio data while reflecting the intention of a content creator. A reproduction apparatus according to one aspect of the present technology acquires content including audio data of each of audio objects and rendering parameters of the audio data for each of a plurality of presumed listening positions, renders the audio data on the basis of the rendering parameters for a selected predetermined presumed listening position, and outputs an audio signal. The present technology can be applied to an apparatus that can reproduce object-based audio data.