摘要:
An auto-correlation unit for pattern recognition to obtain auto-correlation functions as for sampled signals is disclosed N pieces of sample values X.sub.n (n=0 to N-1)are extracted from a series of the sample values expressed with an accuracy of optional multi-bits and the auto-correlation coefficients of these N pieces of the sample values obtained. The sum of sample values X.sub.n and X.sub.n-.tau. (.tau.=0 to P) are calculated. The squared value (X.sub.n +X.sub.n-.tau.).sup.2 of the added results is previously memorized in a ROM while being addressed by the resultant sum. The auto-correlation coefficients are available by feeding the output (X.sub.n +X.sub.n-.tau.).sup.2 of the ROM and executing calculation as defined by the following equation: ##EQU1##
摘要:
In a speech recognition system, similarity calculations between speech feature patterns are reduced by stopping similarity calculations for any one reference pattern when a frame in the pattern fails to exceed a corresponding similarity threshold.
摘要:
In a speech analysis/recognition system, the input utterance is also directly registered (stored) for later readout verification of correct start and end control signals.
摘要:
Speech recognition using self-correlation (auto-correlation) parameters and time normalization is improved by first making a preliminary decision using lower-order (zeroth to third) self-correlation coefficients derived from the self-correlation function of order m: ##EQU1## where .phi.(0) is the power, .phi.(1) is the linear or first order coefficient, and so on.
摘要:
Speech recognition executes the final matching operation for each characteristic pattern according to the preliminarily selected order prior to the final matching, where any further calculation in reference to such a characteristic pattern is not executed after a specific condition has been satisfied during the matching calculation, and as result, calculation requirements can be drastically minimized so that the speech recognition can eventually be implemented at higher speed and yet under a sharply reduced cost.
摘要:
Inputted voice signals are analyzed in units of syllables. Each syllable is compared with standard syllables preregistered in a memory and thereby recognized but corrections may be made on erroneous recognitions by referencing a dictionary or entering a command. Each standard pattern is associated with phonological information on the neighborhood in a voice signal from where it was extracted and may be updated by another pattern having the same phonological information. Temporal sequences of correct and erroneous recognitions of individual syllables as well as whole syllables are stored and referenced in determining whether a standard pattern should be updated. A maximum pattern number may be set for each syllable.
摘要:
A correlation function computing device capable of computing a selfcorrelation function or the like is constructed to comprise a window function memorizing circuit, an A/D converting circuit for converting time series of a voice signal into digital values, a multiplier for calculating products of the window functions and the digitalized time series of the voice signal and also products required for the computation of the correlation function, a memory circuit for storing the products of the window functions and the digitalized time series of the voice signal delivered from the multiplier, a selector circuit for transferring data supplied to the multiplier, an adder for obtaining a product-sum in a divided manner, a memory circuit of a first-in and first-out type for storing the product-sum dividedly obtained in the adder, and a control circuit for controlling all of the aforementioned circuits.
摘要:
In a coordinate detection period, a first electrode drive circuit sequentially selects source electrodes Sn of the TFT LCD panel to connect the same to an x-signal current amplifying circuit, and to apply an AC voltage to the non-selected electrodes. The x-signal current amplifying circuit detects a detection current flowing through the selected electrode, the magnitude of which corresponds to a distance to a finger (or a conductor pen), to obtain an x-coordinate detection voltage. A second electrode drive circuit and a y-signal current amplifying circuit operate in similar fashion to obtain a y-coordinate detection voltage. A coordinate detection circuit then obtains the x- and y-coordinate values of the designated position based on the x-coordinate detection voltage and the y-coordinate detection voltage, thus enabling pointing by a codeless pen and a finger.
摘要:
A minimum distance storage device for storing distances obtained in pattern recognition as arranged in the ascending order easily generates addresses with no need for multiloop control and at the same time of detects a minimal distance of the same category. A read-out distance read from a result memory and temporarily held in a first register, and an input distance input from a distance calculating portion and temporarily held in a second register, are compared with each other by a second selector. The read-out distance, if smaller than the input distance, is written in the same place where it was stored in the result memory. If the read-out distance is larger than the input distance, the latter is written in the place where the read-out distance was stored and, then, the larger read-out distance and other read-out distances are written in places moved to the right each by one in sequence in the result memory.
摘要:
An acoustic processing unit for a speech recognition system is characterized in that the parameters are extracted from the speech data, including correlation functions, the zero crossing number of the original waveforms, the zero crossing number of the differential waveforms, and the average level of the waveforms. A suitable threshold is selected from a plurality of thresholds preliminarily stored, depending on the inputted speech volume level. The inputted speech volume level of the speaker is detected so that feedback of the detected volume level is obtained. The selected threshold set is then compared with each of the parameters to thereby make the phonemic classification. Because the plurality of threshold sets are thus automatically selected depending on the inputted speech volume level, flexible phonemic classification can be obtained to exactly detect the speech sections.