摘要:
In an audio signal processing apparatus, a generation section (10) generates an audio signal (Sa) representing a voice. A distribution section (20) distributes the audio signal generated by the generation section to a first channel (Sa1) and a second channel (Sa2), respectively. A delay section (30) delays the audio signal of the first channel relative to the audio signal of the second channel for creating a phase difference between the audio signal of the first channel and the audio signal of the second channel such that the created phase difference has a duration corresponding to either an added value of a first duration which is approximately one half of a period of the audio signal generated by the generation section and a second duration which is set shorter than the first duration, or a difference value of the first duration and the second duration. An addition section adds (50) the delayed audio signal of the first channel (Sc1) and the audio signal of the second channel with one another, and outputs the added audio signal (Sout) which represents natural voice with various characteristics.
摘要:
An apparatus is constructed for converting an input voice signal into an output voice signal according to a target voice signal. In the apparatus, an input device provides the input voice signal composed of original sinusoidal components and original residual components other than the original sinusoidal components. An extracting device extracts original attribute data from at least the sinusoidal components of the input voice signal. The original attribute data is characteristic of the input voice signal. A synthesizing device synthesizes new attribute data based on both of the original attribute data derived from the input voice signal and target attribute data being characteristic of the target voice signal composed of target sinusoidal components and target residual components other than the sinusoidal components. The target attribute data is derived from at least the target sinusoidal components. An output device operates based on the new attribute data and either of the original residual component and the target residual component for producing the output voice signal.