摘要:
The present invention is a speech synthesizer that generates speech data of text including a fixed part and a variable part, in combination with recorded speech and rule-based synthetic speech. The speech synthesizer is a high-quality one in which recorded speech and synthetic speech are concatenated with the discontinuity of timbres and prosodies not perceived. The speech synthesizer includes: a recorded speech database that previously stores recorded speech data including a recorded fixed part; a rule-based synthesizer that generates rule-based synthetic speech data including a variable part and at least part of the fixed part, from received text; a concatenation boundary calculator that a concatenation boundary position in a region in which the recorded speech data and the rule-based synthetic speech data overlap, based on acoustic characteristics of the recorded speech data and the rule-based synthetic speech data that correspond to the text; a concatenative synthesizer that generates synthetic speech data corresponding to the text by concatenating the recorded speech data and the rule-based synthetic speech data that are segmented in the concatenation boundary position.
摘要:
The present invention is a speech synthesizer that generates speech data of text including a fixed part and a variable part, in combination with recorded speech and rule-based synthetic speech. The speech synthesizer is a high-quality one in which recorded speech and synthetic speech are concatenated with the discontinuity of timbres and prosodies not perceived. The speech synthesizer includes: a recorded speech database that previously stores recorded speech data including a recorded fixed part; a rule-based synthesizer that generates rule-based synthetic speech data including a variable part and at least part of the fixed part, from received text; a concatenation boundary calculator that a concatenation boundary position in a region in which the recorded speech data and the rule-based synthetic speech data overlap, based on acoustic characteristics of the recorded speech data and the rule-based synthetic speech data that correspond to the text; a concatenative synthesizer that generates synthetic speech data corresponding to the text by concatenating the recorded speech data and the rule-based synthetic speech data that are segmented in the concatenation boundary position.
摘要:
Included in a speech synthesizer, a natural language processing unit divides text data, input from a text input unit, into a plurality of components (particularly, words). An importance prediction unit estimates an importance level of each component according to the degree of how much each component contributes to understanding when a listener hears synthesized speech. Then, the speech synthesizer determines a processing load based on the device state when executing synthesis processing and the importance level. Included in the speech synthesizer, a synthesizing control unit and a wave generation unit reduce the processing time for a phoneme with a low importance level by curtailing its processing load (relatively degrading its sound quality), allocate a part of the processing time, made available by this reduction, to the processing time of a phoneme with a high importance level, and generates synthesized speech in which important words are easily audible.
摘要:
When braking of a motion of a part of a first robot is assumed to be started at points in time, a first stop position of the first robot part is estimated at each point in time. When braking of a motion of a part of a second robot is assumed to be started at the points in time, an estimated second stop position of the second robot part is obtained at each point in time. When it is determined that the first stop position of the first robot part at one of the points in time and either the actual position or the second stop position of the second robot part for each interval at the one of the points in time are contained in the shared workspace, the first robot part is braked.
摘要:
In a robot, a first determining unit determines whether there is an interference region in which a first occupation region and a second occupation region are at least partially overlapped with each other. A second determining determines whether a second movable part of another robot is at least partially located in the interference region based on an actual position of the second movable part. A stopping unit begins stopping, at a predetermined timing, movement of the first movable part if it is determined that there is the interference region, and that the second inovable part is at least partially located in the interference region. The predetermined timing is determined based on a positional relationship between an actual position of the first movable part and the interference region.
摘要:
A stereotypical sentence is synthesized into a voice of an arbitrary speech style. A third party is able to prepare prosody data and a user of a terminal device having a voice synthesizing part can acquire the prosody data. The voice synthesizing method determines a voice-contents identifier to point to a type of voice contents of a stereotypical sentence, prepares a speech style dictionary including speech style and prosody data which correspond to the voice-contents identifier, selects prosody data of the synthesized voice to be generated from the speech style dictionary, and adds the selected prosody data to a voice synthesizer 13 as voice-synthesizer driving data to thereby perform voice synthesis with a specific speech style. Thus, a voice of a stereotypical sentence can be synthesized with an arbitrary speech style.
摘要:
A user of a hands-free phone cannot determine with which speech quality the distant side of communication is listened to, and does not know which action to be required to improve the speech quality. In the case of the present invention, a function of presenting an action to be implemented by the user to improve the speech quality at the opposite party is mounted on the hands-free terminal. For the presentation here, it is assumed that the hands-free terminal according to the present invention has a function of estimating the speech quality at the distant side, a function of estimating an action to be implemented by the user to improve the estimated speech quality, and a function of presenting the estimated action to the user.
摘要:
In a robot, a first determining unit determines whether there is an interference region in which a first occupation region and a second occupation region are at least partially overlapped with each other. A second determining determines whether a second movable part of another robot is at least partially located in the interference region based on an actual position of the second movable part. A stopping unit begins stopping, at a predetermined timing, movement of the first movable part if it is determined that there is the interference region, and that the second movable part is at least partially located in the interference region. The predetermined timing is determined based on a positional relationship between an actual position of the first movable part and the interference region.
摘要:
When braking of a motion of a part of a first robot is assumed to be started at points in time, a first stop position of the first robot part is estimated at each point in time. When braking of a motion of a part of a second robot is assumed to be started at the points in time, an estimated second stop position of the second robot part is obtained at each point in time. When it is determined that the first stop position of the first robot part at one of the points in time and either the actual position or the second stop position of the second robot part for each interval at the one of the points in time are contained in the shared workspace, the first robot part is braked.
摘要:
Disclosed here is an information providing system for mobile objects. The system obtains information of a moving purpose of the user of each mobile object to provide the user with information matching with the moving purpose. The information providing system comprises route type determining apparatus for determining the type of the current route on which the user is moving, provided information selecting unit for selecting information to be provided to the user according to the route type determined by the route type determining apparatus, and provided information presenting means for presenting the information selected by the provided information selecting unit to the user. Therefore, the user can receive more proper information according to the user's moving purpose at that time, thereby the convenience of the user is improved while the information provider can improve the advertisement effect by transmitting information to more proper users.