摘要:
In a speech recognizing apparatus, a grammatical qualification of a proposed speech recognition result candidate is judged without using a grammatical rule. The speech recognizing apparatus for performing sentence/speech recognition is comprised of an analyzing unit for acoustically analyzing speech inputted therein to extract a feature parameter of the inputted speech; a recognizing unit for recognizing the inputted speech based upon the feature parameter outputted from said analyzing unit to thereby a plurality of proposed recognition result candidates; an example data base for storing therein a plurality of examples; and an example retrieving unit for calculating a resemblance degree between each of said plurality of proposed recognition result candidates and each of the plural examples stored in the example data base and for obtaining the speech recognition result based on said calculated resemblance degree.
摘要:
An apparatus and a method for processing a natural language arranged so as to improve the speech recognition rate. In an example search section, the degree of similarity between each of a plurality of examples of the actual use of the language stored in an example data base and each of a plurality of probable recognition results output from a recognition section, and one of the examples corresponding to the highest degree of similarity is selected. A final speech recognition result is obtained by using the selected example. The example search section calculates the degree of similarity by weighting the degree of similarity on the basis of a context according to at least one of the examples previously selected.
摘要:
A translation apparatus and a translation method arranged to facilitate the operation of inputting a speech and to obtain a correct translation. When a speech in Japanese is input to a microphone by a user, it is recognized in a speech recognition section and one or more words constituting the speech are output to a system control section. The system control section searches Japanese sentences stored in a first language sentence storage section to find one of them most similar to a combination of one or more words output from the speech recognition section. The Japanese sentence thereby found is output through an output section. If this Japanese sentence is correct, the user operates one of control keys. The system control section then searches English sentences stored in a second language sentence storage section to find one of them corresponding to a translation of the Japanese sentence output as a search result, and outputs the English sentence thereby found through the output section.
摘要:
A book database stores at least phonetic signal information including phoneme information and rhythm information as document data, a central system transmits phonetic signal information stored on the book database to a terminal and the terminal receives the phonetic signal information is then carried out at the terminal and the document is then recited via synthesized sounds.
摘要:
A book database stores at least phonetic signal information including phoneme information and rhythm information as document data, a central system transmits phonetic signal information stored on the book database to a terminal and the terminal receives the phonetic signal information is then carried out at the terminal and the document is then recited via synthesized sounds.
摘要:
A mapping determination method for obtaining mapping F from an N-dimensional metric vector space .OMEGA..sub.N to an M-dimensional metric vector space .OMEGA..sub.M has the following steps to get the optimal mapping quickly and positively. In the first step, complete, periodic, L.sub.m basic functions g.sub.m (X) according to the distribution of samples classified into Q categories on the N-dimensional metric vector space .OMEGA..sub.N are set. In the second step, a function f.sub.m (X) indicating the m-th component of the mapping F is expressed with the linear sum of the functions g.sub.m (X) and L.sub.m coefficients c.sub.m. The third step provides Q teacher vectors T.sub.q =(t.sub.q.1, t.sub.q.2, t.sub.q.3, . . . , t.sub.q.M) (where q=1, 2, . . . , Q) for the categories on the M-dimensional metric vector space .OMEGA..sub.M, calculates the specified estimation function J, and obtains the coefficients c.sub.m which minimize the estimation function J. In the fourth step, the coefficients c.sub.m obtained in the third step are stored in memory.
摘要:
A voice recognition device according to the present invention including a voice analyzer for acoustically analyzing voice every predetermined frame unit to extract a feature vector X, a converter for subjecting the feature vector X output from the analyzer to a predetermined conversion process, and a voice recognizer for recognizing the voice on the basis of a new feature vector output from the converter, wherein the converter conducts the predetermined conversion processing according to a mapping F from an N-dimensional vector space .OMEGA..sub.N to an M-dimensional vector space .OMEGA..sub.M, the feature vector X is a vector on the N-dimensional vector space .OMEGA..sub.N and the function f.sub.m (X) of an m-th component of the mapping F is represented by the following linear summation of the products of functions g.sub.m.sup.k (X) and coefficients c.sub.m.sup.k of L.sub.m : ##EQU1## Each function g.sub.m.sup.k (X) may be set to a monomial.
摘要:
A map determination method and apparatus for calculating the coefficients to give a minimum evaluation function quickly and reliably where a map is expressed as the linear sum of a function g.sub.i (X) and a coefficient c.sub.i while a map for transforming a N-dimensional vector (x.sub.0, x.sub.1, x.sub.2, x.sub.3) to a M-dimensional vector y is being decided. The coefficient ci for the map is obtained by giving a learning sample and a teaching sample, obtaining an evaluation function and solving a simultaneous linear equation for which the partial differential is zero.
摘要:
A code conversion table, in which a code of a voice with noise added thereto and a code of a voice without noise are associated with each other in terms of probability, is referred to in a code converter. Using the code converter, a code is obtained in a vector quantizer by vector-quantizing cepstrum coefficients extracted from the voice with noise added thereto, and is converted into a code of a voice obtained by suppressing the noise in the voice with noise added thereto. Linear predictive coefficients are obtained from the code, and the voice signal is reproduced in a synthesis filter according to the linear predictive coefficients.
摘要:
A pattern matching multiplies distances between an input pattern and standard patterns by distance-scale correcting weight coefficients respectively prepared for the standard patterns. This yields a distance corresponding to a category shape of a standard pattern and enhance an input pattern recognition ratio. A pattern recognition apparatus stores distance-scale correcting weight coefficients respectively prepared for all standard patterns in storage means. Distances between stored standard patterns and an input pattern generated by input pattern generating means are calculated. The calculated distance is multiplied by a weight coefficient stored in the storage means in accordance with the standard pattern used for calculating the distance from the input pattern. This structure reduces the number of templates when providing the standard patterns as multiple templates and, at the same time, enhances the input pattern recognition ratio.