摘要:
A text-to-speech (TTS) engine combines recorded speech with synthesized speech from a TTS synthesizer based on text input. The TTS engine receives the text input and identifies the domain for the speech (e.g. navigation, dialing, . . . ). The identified domain is used in selecting domain specific speech recordings (e.g. pre-recorded static phrases such as “turn left”, “turn right” . . . ) from the input text. The speech recordings are obtained based on the static phrases for the domain that are identified from the input text. The TTS engine blends the static phrases with the TTS output to smooth the acoustic trajectory of the input text. The prosody of the static phrases is used to create similar prosody in the TTS output.
摘要:
A text-to-speech (TTS) engine combines recorded speech with synthesized speech from a TTS synthesizer based on text input. The TTS engine receives the text input and identifies the domain for the speech (e.g. navigation, dialing, . . . ). The identified domain is used in selecting domain specific speech recordings (e.g. pre-recorded static phrases such as “turn left”, “turn right” . . . ) from the input text. The speech recordings are obtained based on the static phrases for the domain that are identified from the input text. The TTS engine blends the static phrases with the TTS output to smooth the acoustic trajectory of the input text. The prosody of the static phrases is used to create similar prosody in the TTS output.
摘要:
Techniques to create and share custom voice fonts are described. An apparatus may include a preprocessing component to receive voice audio data and a corresponding text script from a client and to process the voice audio data to produce prosody labels and a rich script. The apparatus may further include a verification component to automatically verify the voice audio data and the text script. The apparatus may further include a training component to train a custom voice font from the verified voice audio data and rich script and to generate custom voice font data usable by the TTS component. Other embodiments are described and claimed.
摘要:
Techniques to create and share custom voice fonts are described. An apparatus may include a preprocessing component to receive voice audio data and a corresponding text script from a client and to process the voice audio data to produce prosody labels and a rich script. The apparatus may further include a verification component to automatically verify the voice audio data and the text script. The apparatus may further include a training component to train a custom voice font from the verified voice audio data and rich script and to generate custom voice font data usable by the TTS component. Other embodiments are described and claimed.
摘要:
A spontaneous-extending and anti-rotation scoliosis correcting system comprises pedicle screws and a plurality of correcting rods locked with the pedicle screws. Each correcting rod includes at least one sleeve and at least one inserting rod which can be inserted into the sleeve. The inner wall of the sleeve and the inserting rod are the same in shape and are in clearance fit. A positioning mechanism for restricting the relative rotation of the inserting rod with respect to the sleeve is arranged on a matching surface between the inserting rod and the sleeve. The scoliosis correcting system has the benefits of ensuring the lateral stability and the anti-rotation function for scoliosis correction; having the performance of spontaneous extending along the growth direction of the spine; and ensuring both the short-term operating effect and the long-term curative effect.
摘要:
Mechanisms for performing database queries are provided. With these mechanisms, in response to a query request, a query plan intended for minimum query response time and a query plan intended for minimum query total time for the query request are obtained execution of the minimum query response time query plan and the minimum query total time query plan is started. Before the execution of the minimum query total time query plan reaches a specified point, an initial query result obtained from the execution of the minimum query response time query plan is output. In response to the execution of the minimum query total time query plan reaching the specified point, continuing the execution of the minimum query total time query plan to output remaining query results.
摘要:
The invention relates to a novel class of 2,4-diamino-6,7-dihydro-5H-pyrrolo[2,3]pyrimidine derivatives as a FAK and/or Pyk2 inhibitor, to a process for their preparation, and to a composition thereof, as well as to use of the compounds for the inhibiting FAK and/or Pyk2 and method for the treatment of a FAK and/or Pyk2 mediated disorder or disease.
摘要:
Mechanisms for performing database queries are provided. With these mechanisms, in response to a query request, a query plan intended for minimum query response time and a query plan intended for minimum query total time for the query request are obtained execution of the minimum query response time query plan and the minimum query total time query plan is started. Before the execution of the minimum query total time query plan reaches a specified point, an initial query result obtained from the execution of the minimum query response time query plan is output. In response to the execution of the minimum query total time query plan reaching the specified point, continuing the execution of the minimum query total time query plan to output remaining query results.
摘要:
The invention relates to a novel class of 2,4-diamino-6,7-dihydro-5H-pyrrolo[2,3]pyrimidine derivatives as a FAK and/or Pyk2 inhibitor, to a process for their preparation, and to a composition thereof, as well as to use of the compounds for the inhibiting FAK and/or Pyk2 and method for the treatment of a FAK and/or Pyk2 mediated disorder or disease.
摘要:
Dynamic features are utilized with CRFs to handle long-distance dependencies of output labels. The dynamic features present a probability distribution involved in explicit distance from/to a special output label that is pre-defined according to each application scenario. Besides the number of units in the segment (from the previous special output label to the current unit), the dynamic features may also include the sum of any basic features of units in the segment. Since the added dynamic features are involved in the distance from the previous specific label, the searching lattice associated with Viterbi searching is expanded to distinguish the nodes with various distances. The dynamic features may be used in a variety of different applications, such as Natural Language Processing, Text-To-Speech and Automatic Speech Recognition. For example, the dynamic features may be used to assist in prosodic break and pause prediction.