Abstract:
In a speech synthesizer apparatus, a weighting coefficient training controller calculates acoustic distances in second acoustic feature parameters between one target phoneme from the same phoneme and the phoneme candidates other than the target phoneme based on first acoustic feature parameters and prosodic feature parameters, and determines weighting coefficient vectors for respective target phonemes defining degrees of contribution to the second acoustic feature parameters for respective phoneme candidates by executing a predetermined statistical analysis therefor. Then, a speech unit selector searches for a combination of phoneme candidates which corresponds to a phoneme sequence of an input sentence and which minimizes a cost including a target cost representing approximate costs between a target phoneme and the phoneme candidates and a concatenation cost representing approximate costs between two phoneme candidates to be adjacently concatenated, and outputs index information on the searched-out combination of phoneme candidates. Further, a speech synthesizer synthesizes a speech signal corresponding to the input phoneme sequence by sequentially reading out speech segments of speech waveform signals corresponding to the index information and concatenating the read-out speech segments of the speech waveform signals.
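The search performed by the speech unit selector can be illustrated with a small dynamic-programming sketch. This is not the method claimed in the abstract, only one common way to minimize a sum of target costs and concatenation costs over a sequence of candidate lists. The function name `select_units` and the two cost callables `target_cost` and `concat_cost` are hypothetical and would be supplied by the caller, for example as weighted feature distances.

```python
from typing import Callable, List, Sequence, Tuple


def select_units(
    candidates: Sequence[Sequence[int]],
    target_cost: Callable[[int, int], float],
    concat_cost: Callable[[int, int], float],
) -> Tuple[List[int], float]:
    """Return (unit indices, total cost) for the cheapest candidate sequence.

    candidates[t] holds the database indices of the candidates for position t.
    target_cost(t, u) and concat_cost(prev_u, u) are hypothetical callables
    standing in for the target and concatenation costs named in the abstract.
    """
    if not candidates or any(len(c) == 0 for c in candidates):
        raise ValueError("every position needs at least one candidate")

    # best[t][j]: cheapest cost of any path that ends in candidates[t][j]
    best = [[target_cost(0, u) for u in candidates[0]]]
    # back[t][j]: position in candidates[t - 1] of the best predecessor
    back = [[-1] * len(candidates[0])]

    for t in range(1, len(candidates)):
        row, ptr = [], []
        for u in candidates[t]:
            costs = [
                best[t - 1][i] + concat_cost(p, u)
                for i, p in enumerate(candidates[t - 1])
            ]
            i_min = min(range(len(costs)), key=costs.__getitem__)
            row.append(costs[i_min] + target_cost(t, u))
            ptr.append(i_min)
        best.append(row)
        back.append(ptr)

    # Trace back from the cheapest final candidate to recover the index list.
    j = min(range(len(best[-1])), key=best[-1].__getitem__)
    total = best[-1][j]
    path = []
    for t in range(len(candidates) - 1, -1, -1):
        path.append(candidates[t][j])
        j = back[t][j]
    path.reverse()
    return path, total
```

The returned index list plays the role of the index information that the synthesizer would use to read out and concatenate waveform segments.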
Abstract:
A circuit breaker includes a housing, a rigid circuit board, and a flexible circuit board. The rigid circuit board is enclosed within the housing and has a main surface for supporting an electronic component. The flexible circuit board has a first end that is directly attached to an edge of the rigid circuit board.
Abstract:
In one embodiment of the present invention, an action agenda determining apparatus is provided for determining an agenda of action to be taken with reference to the surrounding situation. The action agenda determining apparatus includes a matching model storage unit for storing an action agenda determining model that has learned in advance the relation between a time sequence of prescribed feature information related to human motion extracted from surrounding images and the action agenda to be taken, and a model reference unit for forming the time sequence of prescribed feature information from the surrounding motion images and referring to the action agenda determining model stored in the matching model storage unit to determine the action agenda to be taken. Sound may be included as part of the feature information.
Abstract:
An apparatus enabling automatic determination of a portion that reliably represents a feature of a speech waveform includes: an acoustic/prosodic analysis unit (92) for calculating, from data, the distribution on a time axis of the energy of a prescribed frequency range of the speech waveform, and for extracting, among various syllables of the speech waveform, a range that is generated stably, based on the distribution and the pitch of the speech waveform; a cepstral analysis unit (94) for estimating, based on the spectral distribution of the speech waveform on the time axis, a range of the speech waveform whose change is well controlled by a speaker; and a pseudo-syllabic center extracting unit (96) for extracting, as a highly reliable portion of the speech waveform, the range that has been estimated both to be stably generated and to have its change well controlled by the speaker.
Abstract:
An apparatus enabling automatic determination of a portion that reliably represents a feature of a speech waveform includes: an acoustic/prosodic analysis unit for calculating, from data, the distribution on a time axis of the energy of a prescribed frequency range of the speech waveform, and for extracting, among various syllables of the speech waveform, a range that is generated stably, based on the distribution and the pitch of the speech waveform; a cepstral analysis unit for estimating, based on the spectral distribution of the speech waveform on the time axis, a range of the speech waveform whose change is well controlled by a speaker; and a pseudo-syllabic center extracting unit for extracting, as a highly reliable portion of the speech waveform, the range that has been estimated both to be stably generated and to have its change well controlled by the speaker.
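The last step, keeping only the range that both analysis units agree on, can be sketched as an intersection of two interval lists. This is an illustrative assumption about how the two estimates might be combined, not the procedure stated in the abstract. The name `intersect_ranges` is hypothetical, intervals are assumed to be `(start, end)` pairs in seconds, and the intervals within each list are assumed not to overlap one another.

```python
from typing import List, Tuple

Interval = Tuple[float, float]  # (start, end) in seconds, with start < end


def intersect_ranges(
    stable: List[Interval], controlled: List[Interval]
) -> List[Interval]:
    """Return the time ranges present in both interval lists.

    `stable` stands for the ranges judged to be stably generated and
    `controlled` for the ranges judged to be well controlled by the speaker.
    Both lists are assumed to contain non-overlapping intervals.
    """
    a, b = sorted(stable), sorted(controlled)
    i = j = 0
    out: List[Interval] = []
    while i < len(a) and j < len(b):
        start = max(a[i][0], b[j][0])
        end = min(a[i][1], b[j][1])
        if start < end:
            # The two current intervals overlap on (start, end).
            out.append((start, end))
        # Advance whichever interval finishes first.
        if a[i][1] < b[j][1]:
            i += 1
        else:
            j += 1
    return out
```

Each returned interval would then be a candidate for the high-reliability portion of the waveform.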
Abstract:
A speech processing apparatus includes a statistics collecting module operable to collect, for each of prescribed utterance units of speech in a training speech corpus, a prescribed type of acoustic feature and statistical information on a plurality of paralinguistic information labels selected by a plurality of listeners for the speech corresponding to the utterance unit; and a training apparatus trained by supervised machine training, using said prescribed acoustic feature as input data and the statistical information as answer data, to output, for each of said plurality of paralinguistic information labels, the probability of allocation of the label to a given acoustic feature, the probabilities forming a paralinguistic information vector.
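As a rough illustration of the answer data, the listener selections for one utterance unit can be turned into a vector of per-label fractions. The abstract does not say how the statistics are computed, so normalizing by the number of listeners is an assumption here, and the function name `label_statistics` and the label names used with it are hypothetical.

```python
from collections import Counter
from typing import List, Sequence


def label_statistics(votes: Sequence[str], labels: Sequence[str]) -> List[float]:
    """Return the fraction of listener votes given to each label.

    `votes` holds one selected label per listener for a single utterance
    unit; `labels` fixes the order of the entries in the returned vector.
    Normalizing by the vote count is an assumption, not taken from the source.
    """
    if not votes:
        raise ValueError("at least one listener vote is required")
    counts = Counter(votes)
    unknown = set(counts) - set(labels)
    if unknown:
        raise ValueError(f"votes contain labels not in the label set: {sorted(unknown)}")
    return [counts[label] / len(votes) for label in labels]
```

A vector built this way could serve as the supervised target paired with the acoustic feature of the same utterance unit.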