摘要:
In one embodiment of a controllable prosody re-estimation system, a TTS/STS engine consists of a prosody prediction/estimation module, a prosody re-estimation module and a speech synthesis module. The prosody prediction/estimation module generates predicted or estimated prosody information. And then the prosody re-estimation module re-estimates the predicted or estimated prosody information and produces new prosody information, according to a set of controllable parameters provided by a controllable prosody parameter interface. The new prosody information is provided to the speech synthesis module to produce a synthesized speech.
摘要:
In one embodiment of a controllable prosody re-estimation system, a TTS/STS engine consists of a prosody prediction/estimation module, a prosody re-estimation module and a speech synthesis module. The prosody prediction/estimation module generates predicted or estimated prosody information. And then the prosody re-estimation module re-estimates the predicted or estimated prosody information and produces new prosody information, according to a set of controllable parameters provided by a controllable prosody parameter interface. The new prosody information is provided to the speech synthesis module to produce a synthesized speech.
摘要:
A multi-lingual text-to-speech system and method processes a text to be synthesized via an acoustic-prosodic model selection module and an acoustic-prosodic model mergence module, and obtains a phonetic unit transformation table. In an online phase, the acoustic-prosodic model selection module, according to the text and a phonetic unit transcription corresponding to the text, uses at least a set controllable accent weighting parameter to select a transformation combination and find a second and a first acoustic-prosodic models. The acoustic-prosodic model mergence module merges the two acoustic-prosodic models into a merged acoustic-prosodic model, according to the at least a controllable accent weighting parameter, processes all transformations in the transformation combination and generates a merged acoustic-prosodic model sequence. A speech synthesizer and the merged acoustic-prosodic model sequence are further applied to synthesize the text into an L1-accent L2 speech.
摘要:
A system for leaving and transmitting speech messages automatically analyzes input speech of at least a reminder, fetches a plurality of tag informations, and transmits speech message to at least a message receiver, according to the transmit criterions of the reminder. A command or message parser parses the tag informations at least including at least a reminder ID, at least a transmitted command and at least a speech message. The tag informations are sent to a message composer for being synthesized into a transmitted message. A transmitting controller controls a device switch according to the reminder ID and the transmitted command, to allow the transmitted message send to the message receiver via a transmitting device.
摘要:
A method and a system for executing correlative services are provided. In the method and the system, an event type corresponding to an input message is determined through semantic analysis. After collecting the necessary execution information of the event type according to the input message, a user database, or by inquiring the user or another system, the system automatically executes various correlative services of the event type. Therefore, the system can help users to execute correlative services more correctly and more efficiently.
摘要:
A method and a system for executing correlative services are provided. In the method and the system, an event type corresponding to an input message is determined through semantic analysis. After collecting the necessary execution information of the event type according to the input message, a user database, or by inquiring the user or another system, the system automatically executes various correlative services of the event type. Therefore, the system can help users to execute correlative services more correctly and more efficiently.
摘要:
A telematics apparatus for providing driving assistance, and a system and a method are provided. The telematics apparatus receives position information indicating a current position of the telematics apparatus. The telematics apparatus transmits a request signal to the server. According to the request signal, the server obtains the position information, time information indicating a current time of the telematics apparatus, and identification information identifying a user of the telematics apparatus. The telematics apparatus displays driving assistance information received from the server which generates the driving assistance information according to the identification information, the position information, and the time information by searching through a route usage history of a plurality of routes and referring to a plurality of reference values of the routes. The reference value of each route indicates the telematics apparatus user's familiarity with that route.
摘要:
A system and method for providing mobile information, a server and a portable device therein are provided. The server comprises an intelligent download manager and the portable device comprises a file browse manager. The intelligent download manager determines a downloaded file update rule and a file browse rule according to any combination of a document attribute, a browse record, and a document preference. The files to be downloaded to the portable device can be determined automatically according to the downloaded file update rule. The file browse manager provides an intelligent browse mode related to the browse sequence of the downloaded files according to the file browse rule. Therefore, information really interesting to the user can be stored in the limited space by the present invention and the user can access the information quickly and efficiently.
摘要:
An automatic speech segmentation and verification system and method is disclosed, which has a known text script and a recorded speech corpus corresponding to the known text script. A speech unit segmentor segments the recorded speech corpus into N test speech unit segments referring to the phonetic information of the known text script. Then, a segmental verifier is applied to obtain a confidence measure of syllable segmentation for verifying the correctness of the cutting points of test speech unit segments. A phonetic verifier obtains a confidence measure of syllable verification by using verification models for verifying whether the recorded speech corpus is correctly recorded. Finally, a speech unit inspector integrates the confidence measure of syllable segmentation and the confidence measure of syllable verification to determine whether the test speech unit segment is accepted or not.