Speech recognition apparatus capable of discriminating between similar
acoustic features of speech
    1.
    Granted invention patent (status: expired)

    Publication No.: US4998280A

    Publication date: 1991-03-05

    Application No.: US409991

    Filing date: 1989-09-19

    IPC classification: G10L15/00 G10L15/08 G10L7/08

    CPC classification: G10L15/08

    Abstract: A speech recognition apparatus includes a memory that stores, for each feature specific to a particular phoneme, the name and the procedure of a process performed to search for whether a feature specific to a certain type of speech is present in a feature vector series. The memory also stores a table of the names of the processes performed for all the categories of speech to be recognized. The information stored in the memory is used to discriminate between two categories and provides ways of interpreting the results of the process. The recognition processing that performs the discrimination is carried out in accordance with the information stored in the table.

    Noise reduction system using neural network
    2.
    Granted invention patent (status: expired)

    Publication No.: US5185848A

    Publication date: 1993-02-09

    Application No.: US448949

    Filing date: 1989-12-12

    Abstract: A noise reduction system used for transmission and/or recognition of speech includes a speech analyzer that analyzes a noisy speech input signal, converting it into feature vectors such as autocorrelation coefficients, and a neural network that receives the feature vectors of the noisy speech signal as its input. The neural network outputs a codebook index identifying the prototype vectors that correspond to a noise-free equivalent of the noisy speech input signal. Feature vectors of speech are read out from the codebook on the basis of the index delivered by the neural network, and the speech input is reproduced from those feature vectors.
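
The lookup step described in this abstract can be sketched as follows. This is only a minimal illustration, not the patented implementation: the trained neural network is replaced by a hypothetical nearest-prototype function (`nearest_prototype_predictor`), and the names `denoise_features` and `codebook` are assumptions introduced here.

```python
from typing import Callable, List, Sequence

Vector = Sequence[float]


def nearest_prototype_predictor(codebook: Sequence[Vector]) -> Callable[[Vector], int]:
    """Stand-in for the trained neural network (assumption).

    Returns a function that maps a noisy feature vector to the index of
    the closest codebook prototype by squared Euclidean distance.
    """
    def predict(frame: Vector) -> int:
        return min(
            range(len(codebook)),
            key=lambda i: sum((a - b) ** 2 for a, b in zip(frame, codebook[i])),
        )
    return predict


def denoise_features(
    noisy_frames: Sequence[Vector],
    index_predictor: Callable[[Vector], int],
    codebook: Sequence[Vector],
) -> List[List[float]]:
    """Replace each noisy feature vector with the prototype read from the codebook."""
    return [list(codebook[index_predictor(frame)]) for frame in noisy_frames]
```

Any callable that maps a feature vector to an index could be passed as `index_predictor`; the nearest-prototype rule is used here only so that the sketch runs without a trained model.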

    Speech recognition apparatus using neural network and fuzzy logic
    3.
    Granted invention patent (status: expired)

    Publication No.: US5040215A

    Publication date: 1991-08-13

    Application No.: US400342

    Filing date: 1989-08-30

    CPC classification: G10L15/16 Y10S706/90

    Abstract: A speech recognition apparatus has a speech input unit for inputting speech; a speech analysis unit for analyzing the input speech to output a time series of feature vectors; a candidate selection unit that receives the time series of feature vectors from the speech analysis unit and selects a plurality of recognition-result candidates from the speech categories; and a discrimination processing unit for discriminating among the selected candidates to obtain a final recognition result. The discrimination processing unit includes three components: a pair generation unit for generating all two-candidate combinations of the n candidates selected by the candidate selection unit; a pair discrimination unit for deciding, for each of the nC2 combinations (or pairs), which candidate of the pair is more certain, on the basis of the extracted acoustic features intrinsic to each candidate speech; and a final decision unit for collecting the pair discrimination results obtained from the pair discrimination unit for all nC2 pairs to decide the final result. The pair discrimination unit handles the extracted acoustic features intrinsic to each candidate speech as fuzzy information and performs the discrimination on the basis of fuzzy logic algorithms, and the final decision unit likewise aggregates the pair results on the basis of fuzzy logic algorithms.
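
The pair generation, pair discrimination, and final decision stages can be sketched roughly as below. The abstract does not say which fuzzy operators are used, so this sketch assumes the common choices of `min` for fuzzy AND and `1 - mu` for the complement; `pairwise_fuzzy_decision` and `pair_membership` are hypothetical names introduced for illustration.

```python
from itertools import combinations
from typing import Callable, Dict, Sequence, Tuple


def pairwise_fuzzy_decision(
    candidates: Sequence[str],
    pair_membership: Callable[[str, str], float],
) -> Tuple[str, Dict[str, float]]:
    """Pick the candidate best supported across all n*(n-1)/2 pairs.

    pair_membership(a, b) is assumed to return a degree in [0, 1] that
    candidate a is more certain than candidate b.
    """
    support = {c: 1.0 for c in candidates}
    # Pair generation: every two-candidate combination of the n candidates.
    for a, b in combinations(candidates, 2):
        mu = pair_membership(a, b)
        # Final decision (assumed operators): fuzzy AND as min,
        # complement as 1 - mu for the other member of the pair.
        support[a] = min(support[a], mu)
        support[b] = min(support[b], 1.0 - mu)
    best = max(candidates, key=lambda c: support[c])
    return best, support
```

With three candidates the function evaluates three pairs; a candidate wins only if it holds up in every pair it takes part in, which is why a single weak pairwise result lowers its overall support.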

    Speech recognition apparatus using neural network and fuzzy logic
    4.
    Granted invention patent (status: expired)

    Publication No.: US5179624A

    Publication date: 1993-01-12

    Application No.: US727089

    Filing date: 1991-07-09

    IPC classification: G10L15/16

    CPC classification: G10L15/16 Y10S706/90

    Abstract: A speech recognition apparatus has: a speech input unit for inputting speech; a speech analysis unit for analyzing the input speech to output a time series of feature vectors; a candidate selection unit that receives the time series of feature vectors from the speech analysis unit and selects a plurality of recognition-result candidates from the speech categories; and a discrimination processing unit for discriminating among the selected candidates to obtain a final recognition result. The discrimination processing unit includes three components: a pair generation unit for generating all two-candidate combinations of the n candidates selected by the candidate selection unit; a pair discrimination unit for deciding, for each of the nC2 combinations (or pairs), which candidate of the pair is more certain, on the basis of the extracted acoustic features intrinsic to each candidate speech; and a final decision unit for collecting the pair discrimination results obtained from the pair discrimination unit for all nC2 pairs to decide the final result. The pair discrimination unit handles the extracted acoustic features intrinsic to each candidate speech as fuzzy information and performs the discrimination on the basis of fuzzy logic algorithms, and the final decision unit likewise aggregates the pair results on the basis of fuzzy logic algorithms.

    Method for production of speech reference templates
    5.
    Granted invention patent (status: expired)

    Publication No.: US4590605A

    Publication date: 1986-05-20

    Application No.: US449660

    Filing date: 1982-12-14

    IPC classification: G10L11/00 G10L15/06 G10L9/02

    CPC classification: G10L15/063

    Abstract: In this speech recognition system, a set of templates for each phoneme includes clusters of speech patterns based on two speech features: "physical" features (formant spectra of men versus women) and "utterance" features (unvoiced vowels and nasalization), derived from a plurality of reference speakers.

    Speech recognition method and device
    6.
    Granted invention patent (status: expired)

    Publication No.: US4426551A

    Publication date: 1984-01-17

    Application No.: US208251

    Filing date: 1980-11-19

    CPC classification: G10L15/00

    Abstract: Speech sounds are recognized using a reduced number of speech parameter elements, e.g., five correlation coefficients rather than sixteen spectral coefficients. The five correlation coefficients are derived by comparing the spectral coefficients of unknown or standard sounds against the spectral coefficients of five highly separable vowel-like sounds. The unknown-sound correlation coefficients are then compared with the standard-sound coefficients for recognition.
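
The reduction from sixteen spectral coefficients to five correlation coefficients can be illustrated with the sketch below. The abstract does not define the correlation measure or the final comparison, so this example assumes the Pearson correlation and a squared Euclidean distance; the function names and the `standards` dictionary are hypothetical.

```python
import math
from typing import Dict, List, Sequence


def correlation(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation between two spectra (assumed measure)."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = math.sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))
    return num / den if den else 0.0


def reduce_features(spectrum: Sequence[float],
                    references: Sequence[Sequence[float]]) -> List[float]:
    """Map a spectrum to one correlation coefficient per reference sound."""
    return [correlation(spectrum, ref) for ref in references]


def recognize(spectrum: Sequence[float],
              references: Sequence[Sequence[float]],
              standards: Dict[str, Sequence[float]]) -> str:
    """Return the standard sound whose reduced features are closest."""
    feat = reduce_features(spectrum, references)
    return min(
        standards,
        key=lambda name: sum((f - s) ** 2 for f, s in zip(feat, standards[name])),
    )
```

Here `references` would hold the spectra of the five vowel-like sounds, and `standards` would hold the already-reduced five-element features of each standard sound, so matching operates on five numbers per sound instead of sixteen.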

    Character voice communication system
    7.
    Granted invention patent (status: expired)

    Publication No.: US4975957A

    Publication date: 1990-12-04

    Application No.: US343892

    Filing date: 1989-04-24

    Abstract: A character voice communication system is provided in which a high-efficiency voice coding system, for encoding and transmitting speech information at high efficiency, and a voice character input/output system, for converting speech information into character information or receiving character information and transmitting speech or character information, are organically integrated. A speech analyzer and a speech synthesizer are shared by both the voice coding and the voice character input/output systems. Communication apparatus is also provided which allows mutual conversion between speech signals and character codes.

    Speech detecting method
    8.
    Granted invention patent (status: expired)

    Publication No.: US4401849A

    Publication date: 1983-08-30

    Application No.: US227677

    Filing date: 1981-01-23

    CPC classification: G10L25/00

    Abstract: A speech signal is judged to be present if the total signal power is above a first threshold and if either the low- or the high-frequency components exceed thresholds as a large fraction of the total power. Total power is calculated as the zero-order auto-correlation coefficient, and the fractional power of the frequency components is calculated as the first-order partial auto-correlation coefficient.
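
A rough sketch of this decision rule follows. It relies on the fact that the first-order partial auto-correlation coefficient equals r1/r0 up to a sign convention: values near +1 indicate that low-frequency energy dominates, and values near -1 indicate that high-frequency energy dominates. The threshold parameters and function names are assumptions, not values from the patent.

```python
from typing import Sequence


def autocorr(frame: Sequence[float], lag: int) -> float:
    """Short-time auto-correlation of one frame at the given lag."""
    return sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))


def is_speech(frame: Sequence[float],
              power_threshold: float,
              low_freq_k1: float,
              high_freq_k1: float) -> bool:
    """Decide speech presence for one frame.

    power_threshold: minimum zero-order auto-correlation (total power).
    low_freq_k1:  k1 at or above this means low frequencies dominate.
    high_freq_k1: k1 at or below this means high frequencies dominate.
    All three thresholds are hypothetical tuning parameters.
    """
    r0 = autocorr(frame, 0)
    if r0 <= power_threshold:
        return False
    k1 = autocorr(frame, 1) / r0  # first-order partial auto-correlation
    return k1 >= low_freq_k1 or k1 <= high_freq_k1
```

A loud frame whose energy is spread evenly across the band gives k1 close to zero and is rejected, which is how the rule separates broadband noise from voiced sounds (k1 near +1) and fricatives (k1 near -1).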

    Speech recognition method
    9.
    Granted invention patent (status: expired)

    Publication No.: US4718095A

    Publication date: 1988-01-05

    Application No.: US554960

    Filing date: 1983-11-25

    CPC classification: G10L15/12 G10L15/00

    Abstract: A speech recognition method improves the accuracy of recognition of input speech and is capable of operating in real time. This is accomplished by generating from the input speech signal a difference signal which indicates, for each frame, whether the speech power of the input speech is increasing or decreasing. The similarity between the input speech and a standard pattern is then calculated for each frame, and the similarity calculation is corrected on the basis of the generated difference signal and a difference signal relating to the standard pattern obtained from storage. The input speech and the standard pattern are then matched using the corrected similarity, and the input speech is recognized from the result of this matching. Thus, a spectrum matching distance weighted by the power information of speech can be obtained in real time.
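
One hypothetical way to realize the corrected similarity is sketched below. The abstract does not give the correction formula or the matching algorithm, so this example assumes a fixed penalty added when the power trends of the two frames disagree, and uses dynamic time warping (suggested only by the G10L15/12 classification) for the matching. All names and the `penalty` parameter are assumptions.

```python
from typing import List, Sequence


def power_trend(powers: Sequence[float]) -> List[int]:
    """Difference signal: +1 if power rises, -1 if it falls, 0 otherwise."""
    trend = [0]  # the first frame has no predecessor
    for prev, cur in zip(powers, powers[1:]):
        trend.append((cur > prev) - (cur < prev))
    return trend


def corrected_distance(spec_a: Sequence[float], spec_b: Sequence[float],
                       trend_a: int, trend_b: int, penalty: float) -> float:
    """Spectral distance plus a penalty when the power trends disagree."""
    dist = sum(abs(a - b) for a, b in zip(spec_a, spec_b))
    return dist + (penalty if trend_a != trend_b else 0.0)


def dtw_distance(in_spec: Sequence[Sequence[float]], in_power: Sequence[float],
                 ref_spec: Sequence[Sequence[float]], ref_power: Sequence[float],
                 penalty: float = 1.0) -> float:
    """Match input against a standard pattern using the corrected distance."""
    in_trend, ref_trend = power_trend(in_power), power_trend(ref_power)
    n, m = len(in_spec), len(ref_spec)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = corrected_distance(in_spec[i - 1], ref_spec[j - 1],
                                   in_trend[i - 1], ref_trend[j - 1], penalty)
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]
```

In this sketch the trend of the standard pattern is recomputed from `ref_power` on each call; a real-time system would more plausibly store it alongside the pattern, as the abstract implies.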
