Abstract:
An information processing device including an acquisition unit that acquires a sound collection result of a sound from each of one or more sound sources obtained by a sound collection portion of which positional information indicating at least one of a position and a direction is changed and an estimation unit that estimates a direction of each of the one or more sound sources on a basis of a change in a frequency of a sound collected by the sound collection portion in association with a change in the positional information of the sound collection portion.
Abstract:
A sound field control device according to the present disclosure includes a display target object position information acquisition unit for acquiring position information of a viewer from information obtained by imaging, and a virtual sound source position control unit for controlling a virtual sound source position on the basis of the position information. Thus, it becomes possible to optimally adjust virtual sound source reproduction in consideration of size or orientation of a head. Accordingly, it becomes possible to provide a sound field without unnatural feeling to viewers.
Abstract:
Provided is a signal processing apparatus that includes a voice quality conversion unit that converts acoustic data of any sound of an input sound source to acoustic data of voice quality of a target sound source different from the input sound source on the basis of a voice quality converter parameter obtained by training using acoustic data for each of one or more sound sources as training data, the acoustic data being different from parallel data or clean data.
Abstract:
Provided is a signal processing apparatus including a feature detection unit configured to detect, from an input signal, a detection signal including at least one of audience-generated-sound likelihood and music likelihood, a reverberation adding unit configured to add long or short reverberations to the input signal based on a detected tone being moderate or ordinary tone respectively, and a vicinity-sound generation unit configured to generate vicinity sound based on the detection signal.
Abstract:
Provided is an audio processing device including a narration canceling section configured to generate a narration canceling signal by removing a narration component from an input signal, and a reverberation adding section configured to add a reverberation effect to the narration canceling signal.
Abstract:
Provided is an information processing device including: an audio signal output unit that causes measuring audio in an inaudible band to be output from a speaker; and a viewing position computation unit that computes a viewing position of a user based on the measuring audio picked up by a microphone.
Abstract:
Provided is an information processing device and a method that enable extraction of a desired object from a moving image with sound. The information processing device includes an image object detection unit that detects an image object on the basis of a moving image with sound, a sound object detection unit that detects a sound object on the basis of the moving image with sound, and a sound image object detection unit that detects a sound image object on the basis of a detection result of the image object and a detection result of the sound object.
Abstract:
Provided is an audio processing device including a narration canceling section configured to generate a narration canceling signal by removing a narration component from an input signal, and a reverberation adding section configured to add a reverberation effect to the narration canceling signal.
Abstract:
Provided is a sound source separation device that includes a combining unit that combines a first sound source separation signal of a predetermined sound source, the first sound source separation signal being separated from a mixed sound signal by a first sound source separation system, with a second sound source separation signal of the sound source, the second sound source separation signal being separated from the mixed sound signal by a second sound source separation system that differs in separation performance from the first sound source separation system in predetermined units of time, and that outputs a sound source separation signal obtained by the combination.
Abstract:
A sound field control device according to the present disclosure includes a display target object position information acquisition unit for acquiring position information of a viewer from information obtained by imaging, and a virtual sound source position control unit for controlling a virtual sound source position on the basis of the position information. Thus, it becomes possible to optimally adjust virtual sound source reproduction in consideration of size or orientation of a head. Accordingly, it becomes possible to provide a sound field without unnatural feeling to viewers.