专利检索 ap:("Mitsuru Otsuka" OR "Yasunori Ohora" OR "Takashi Aso" OR "Toshiaki Fukada") AND inv:"Yasunori Ohora" 第 1 页

1.

发明授权
Document inputting method and apparatus and speech outputting apparatus 失效
标题翻译：文件输入方法和装置和语音输出装置

公开(公告)号：US5809467A

公开(公告)日：1998-09-15

申请号：US923939

申请日：1997-09-05

申请人： Mitsuru Otsuka , Yasunori Ohora , Takashi Aso , Toshiyuki Noguchi , Toshiaki Fukada

发明人： Mitsuru Otsuka , Yasunori Ohora , Takashi Aso , Toshiyuki Noguchi , Toshiaki Fukada

IPC分类号： G06F3/16 , G06F17/21 , G06F17/22 , G10L13/08 , G10L5/02

CPC分类号： G10L13/08 , G10L13/04

摘要： A document inputting apparatus or speech outputting apparatus inputs and displays document data, specifies accent information, pronunciation information and syllable-length information of words or characters of the document data. The apparatus displays the document data in accordance with the specified information so that information such as the accent positions or accent intensities can be recognized. Thus formed document data is stored in a memory with the accent information, the pronunciation information or the syllable-length information. Upon reading the document data from the memory and outputting it as speech, the specified information is referred to for speech synthesizing, thus outputting speech corresponding to the correct pronunciation.

摘要翻译： 文档输入装置或语音输出装置输入和显示文档数据，指定文档数据的单词或字符的重音信息，发音信息和音节长度信息。该装置根据指定的信息显示文档数据，以便能够识别诸如重音位置或重音强度之类的信息。这样形成的文档数据被存储在具有重音信息，发音信息或音节长度信息的存储器中。在从存储器读取文档数据并将其作为语音输出时，参考用于语音合成的指定信息，从而输出与正确发音相对应的语音。

2.

发明授权
Speech synthesis apparatus and method for causing a computer to perform speech synthesis by calculating product of parameters for a speech waveform and a read waveform generation matrix 失效
标题翻译：语音合成装置和方法，用于使计算机通过计算语音波形和读取波形生成矩阵的参数的乘积来执行语音合成

公开(公告)号：US5745651A

公开(公告)日：1998-04-28

申请号：US452545

申请日：1995-05-30

申请人： Mitsuru Otsuka , Yasunori Ohora , Takashi Aso , Toshiaki Fukada

发明人： Mitsuru Otsuka , Yasunori Ohora , Takashi Aso , Toshiaki Fukada

IPC分类号： G10L13/00 , G10L13/02 , G10L13/04 , G10L13/06 , G10L13/08 , G10L3/02

CPC分类号： G10L13/033 , G10L13/08

摘要： A speech synthesis method and a speech synthesis apparatus includes a system for synthesis by rule that prevents the quality of synthesized speech from deteriorating and for reducing the number of calculations that are required for the generation of a speech waveform. The speech synthesis apparatus includes a character series input section, for inputting a character series as phonetic text, a pitch waveform generator, for generating a pitch waveform by calculating a product of a matrix, which has been acquired for each pitch, and the character series, which is input by the character series input section, and a device for connecting pitch waveforms that are generated by the pitch waveform generator and for providing a speech waveform. The calculation method for the generation of such a pitch waveform provides a great reduction in the number of calculations that are required. In addition, in the calculation for the generation of a pitch waveform, a function that determines a frequency response is employed to convert a spectral envelope, which is obtained from a parameter, so that the timbres of synthesized speech can be changed without parameter operations.

摘要翻译： 语音合成方法和语音合成装置包括用于合成规则的系统，该系统防止合成语音的质量恶化，并减少产生语音波形所需的计算次数。语音合成装置包括：字符串输入部，用于输入作为语音文本的字符串;音调波形发生器，用于通过计算已经针对每个音调获取的矩阵的乘积和字符串来产生音调波形，由字符串输入部输入，以及用于连接由音调波形发生器产生的音调波形并用于提供语音波形的装置。用于产生这种音调波形的计算方法大大减少了所需的计算次数。此外，在产生音调波形的计算中，采用确定频率响应的函数来转换从参数获得的频谱包络，使得可以在没有参数操作的情况下改变合成语音的音色。

3.

发明授权
Speech synthesis apparatus and method for synthesizing speech from a character series comprising a text and pitch information 失效
标题翻译：用于从包括文本和音调信息的字符串中合成语音的语音合成装置和方法

公开(公告)号：US5745650A

公开(公告)日：1998-04-28

申请号：US448982

申请日：1995-05-24

申请人： Mitsuru Otsuka , Yasunori Ohora , Takashi Aso , Toshiaki Fukada

发明人： Mitsuru Otsuka , Yasunori Ohora , Takashi Aso , Toshiaki Fukada

IPC分类号： G10L11/06 , G10L13/00 , G10L13/04 , G10L13/06 , G10L13/08 , G10L9/04

CPC分类号： G10L13/10 , G10L13/04 , G10L25/93

摘要： A speech synthesis method and apparatus for synthesizing speech from a character series comprising a text and pitch information. The apparatus includes a parameter generator for generating power spectrum envelopes as parameters of a speech waveform to be synthesized representing the input text in accordance with the input character series. The apparatus also includes a pitch waveform generator for generating pitch waveforms whose period equals the pitch specified by the pitch information. The pitch waveform generator generates the pitch waveforms from the input pitch information and the power spectrum envelopes generated by the parameter generator. Also provided is a speech waveform output device for outputting the speech waveform obtained by connecting the generated pitch waveforms.

摘要翻译： 一种用于从包括文本和音调信息的字符系列合成语音的语音合成方法和装置。该装置包括参数发生器，用于根据输入的字符系列，产生功率谱包络作为要合成的语音波形的参数，表示输入文本。该装置还包括用于产生音调波形的音调波形发生器，其音调波形的周期等于由音调信息指定的音调。音调波形发生器根据输入音调信息和由参数发生器产生的功率谱包络产生音调波形。还提供了用于输出通过连接所生成的音调波形而获得的语音波形的语音波形输出装置。

4.

发明授权
Data processing method and apparatus for generating sound signals representing music and speech in a multimedia apparatus 失效
标题翻译：一种用于在多媒体装置中产生表示音乐和语音的声音信号的数据处理方法和装置

公开(公告)号：US5806039A

公开(公告)日：1998-09-08

申请号：US859092

申请日：1997-05-20

申请人： Toshiaki Fukada , Yasunori Ohora , Takashi Aso , Mitsuru Otsuka

发明人： Toshiaki Fukada , Yasunori Ohora , Takashi Aso , Mitsuru Otsuka

IPC分类号： G10K15/04 , G10H1/00 , G10L13/08 , G11B27/34 , G11B31/00 , G10L5/02

CPC分类号： G10H1/0008 , G10H1/0041 , G10L13/04 , G10L13/08

摘要： A data processing apparatus for synchronized audiovisual output has synchronizing signal bits which are assigned to bits of each sound data, represented by a 16-bit PCM code. A predetermined bit of the assigned bits having the least influence upon the human auditory sense is extracted as a synchronizing signal bit for synchronization of the image data output and sound output.

摘要翻译： 用于同步视听输出的数据处理装置具有分配给由16位PCM代码表示的每个声音数据的位的同步信号位。作为用于图像数据输出和声音输出同步的同步信号比特，提取对人类听觉影响最小的分配比特的预定比特。

5.

发明授权
Method and apparatus for processing speech information using a phoneme environment 失效
标题翻译：使用音素环境处理语音信息的方法和装置

公开(公告)号：US5845047A

公开(公告)日：1998-12-01

申请号：US406487

申请日：1995-03-20

申请人： Toshiaki Fukada , Yasunori Ohora , Yasuhiro Komori , Takashi Aso

发明人： Toshiaki Fukada , Yasunori Ohora , Yasuhiro Komori , Takashi Aso

IPC分类号： G10L15/06 , G10L13/04 , G10L13/08 , G10L15/02 , G10L15/14 , G10L5/00

CPC分类号： G10L15/02 , G10L13/04

摘要： A speech information processing apparatus includes a statistical processing unit for extracting features by performing statistical processing of a feature file formed by extracting features of speech, such as the fundamental frequency and its variations, and the power and its variations of speech, from a speech file, and a label file in which a phoneme environment, comprising the accent type, the number of moras, the mora position, phonemes and the like, is considered, and a pitch pattern forming unit for forming a pitch pattern, in which phoneme environment is considered, based on the result of the statistical processing.

摘要翻译： 语音信息处理装置包括：统计处理单元，用于通过从语音文件中提取特征文件的特征文件来提取特征，所述特征文件通过从基本频率及其变化提取语音的特征，以及语音的功率及其变化，以及其中考虑包括重音类型，莫尔斯数，莫尔斯位置，音素等的音素环境的标签文件，以及用于形成音调模式的音调模式形成单元，其中音素环境是根据统计处理结果考虑。

6.

发明授权
Speech synthesis apparatus and method 失效
标题翻译：语音合成装置及方法

公开(公告)号：US6021388A

公开(公告)日：2000-02-01

申请号：US995152

申请日：1997-12-19

申请人： Mitsuru Otsuka , Yasunori Ohora , Takashi Aso , Yasuo Okutani

发明人： Mitsuru Otsuka , Yasunori Ohora , Takashi Aso , Yasuo Okutani

IPC分类号： G10L11/04 , G10L13/00 , G10L13/06 , G10L7/02

CPC分类号： G10L13/07

摘要： A speech synthesis apparatus for outputting synthesized speech on the basis of a parameter sequence of a speech waveform includes a parameter generation unit which generates a parameter sequence for speech synthesis on the basis of a character sequence input by a character sequence input unit, and stores the generated parameter sequence in a parameter storage unit. A waveform generation unit is also provided that generates pitch waveforms each for one pitch period on the basis of synthesis parameters and pitch scales included in the parameter sequence, and generates a speech waveform by connecting the generated pitch waveforms in accordance with frame lengths set by a frame length setting unit.

摘要翻译： 一种用于根据语音波形的参数序列输出合成语音的语音合成装置，包括参数生成单元，该参数生成单元根据由字符序列输入单元输入的字符序列生成用于语音合成的参数序列，并存储在参数存储单元中生成参数序列。还提供了一种波形生成单元，其基于包括在参数序列中的合成参数和音调标度生成针对一个音调周期的音调波形，并且通过根据由参数序列设置的帧长度连接所生成的音调波形来生成语音波形帧长设定单位。

7.

发明授权
Syllable-beat-point synchronized rule-based speech synthesis from coded utterance-speed-independent phoneme combination parameters 失效
标题翻译：基于编码话音速度独立音素组合参数的音节点同步规则语音合成

公开(公告)号：US5682502A

公开(公告)日：1997-10-28

申请号：US490140

申请日：1995-06-14

申请人： Mitsuru Ohtsuka , Yasunori Ohora , Takashi Asou , Takeshi Fujita , Toshiaki Fukada

发明人： Mitsuru Ohtsuka , Yasunori Ohora , Takashi Asou , Takeshi Fujita , Toshiaki Fukada

IPC分类号： G10L13/08 , G10L13/06 , G10L21/04 , G10L5/04

CPC分类号： G10L13/06 , G10L21/04

摘要： In a speech synthesizer, each frame for generating a speech waveform has an expansion degree to which the frame is expanded or compressed in accordance with the production speed of synthetic speech. In accordance with the set speech production speed, the time interval between beat synchronization points is determined on the basis of the speed of the speech to be produced, and the time length of each frame present between the beat synchronization points is determined on the basis of the expansion degree of the frame. Parameters for producing a speech waveform in each frame are properly generated by the time length determined for the frame. In the speech synthesizer for outputting a speech signal by coupling phonemes constituted by one or a plurality of frames having phoneme vowel-consonant combination parameters (VcV, cV, or V) of the speech waveform, the number of frames can be held constant regardless of a change in the speech production speed. This prevents degradation in the tone quality or a variation in the processing quantity resulting from a change in the speech production speed.

摘要翻译： 在语音合成器中，用于产生语音波形的每个帧具有根据合成语音的生产速度来扩展或压缩帧的扩展度。根据设定的语音生成速度，基于要产生的语音的速度来确定拍子同步点之间的时间间隔，并且基于在拍子同步点之间存在的每个帧的时间长度框架的扩展程度。用于产生每帧中的语音波形的参数通过为帧确定的时间长度适当地产生。在语音合成器中，通过将由语音波形的音素元音辅音组合参数（VcV，cV或V）组成的一个或多个帧构成的音素耦合到语音信号中，可以将帧数保持不变，而不管演讲生产速度的变化。这防止了语音质量的劣化或由语音生成速度的变化导致的处理量的变化。

8.

发明授权
Method and apparatus for speech processing 失效
标题翻译：用于语音处理的方法和装置

公开(公告)号：US5633984A

公开(公告)日：1997-05-27

申请号：US439652

申请日：1995-05-12

申请人： Takashi Aso , Yasunori Ohora , Takeshi Fujita

发明人： Takashi Aso , Yasunori Ohora , Takeshi Fujita

IPC分类号： G10L19/00 , G10L13/06 , G10L5/02 , G10L3/02 , G10L9/00

CPC分类号： G10L13/06

摘要： An apparatus and method for processing vocal information includes an extractor for extracting a plurality of spectrum information from parameters for vocal information, a vector quantizer for vector-quantizing the extracted spectrum information and for producing a plurality of parameter patterns therefrom, a memory for storing the plurality of parameter patterns so obtained, and a memory for storing positional information indicating the positions at which the plurality of parameter patterns are stored and for storing code information specifying parameter patterns and corresponding to the positional information. The parameter patterns and code information can be used to synthesize speech. Because a small number of parameter patterns are used, only a small memory capacity is needed and efficient processing of vocal information can be performed.

摘要翻译： 一种用于处理声乐信息的装置和方法，包括从用于声音信息的参数中提取多个频谱信息的提取器，用于对所提取的频谱信息进行矢量量化并用于产生多个参数模式的矢量量化器，用于存储以及存储器，用于存储指示存储多个参数模式的位置的位置信息，并存储指定参数模式的代码信息并对应于位置信息。参数模式和代码信息可用于合成语音。因为使用少量的参数模式，所以只需要较小的存储容量，并且可以执行声音信息的有效处理。

9.

发明授权
Speech synthesis apparatus and method 失效
标题翻译：语音合成设备和方法

公开(公告)号：US5220629A

公开(公告)日：1993-06-15

申请号：US608757

申请日：1990-11-05

申请人： Tetsuo Kosaka , Atsushi Sakurai , Junichi Tamura , Yasunori Ohora , Takeshi Fujita , Takashi Aso , Katsuhiko Kawasaki

发明人： Tetsuo Kosaka , Atsushi Sakurai , Junichi Tamura , Yasunori Ohora , Takeshi Fujita , Takashi Aso , Katsuhiko Kawasaki

IPC分类号： G10L13/07

CPC分类号： G10L13/07

摘要： A method and apparatus for reading out a feature parameter and a driver sound source stored in a VCV (vowel-consonant-vowel) speech segment file, sequentially connecting the readout parameter and the readout sound source information in accordance with a predetermined rule, and supplying connected data to a speech synthesizer, thereby generating a speech output, includes a memory for storing the average power of each vowel, and a power controller for controlling the apparatus to normalize a VCV speech segment so that powers at both ends of each VCV segment coincide with the average power of each vowel.

10.

发明授权
Speech synthesizer and method for synthesizing speech for superposing and adding a waveform onto a waveform obtained by delaying a previously obtained waveform 失效
标题翻译：语音合成器和用于将通过延迟先前获得的波形获得的波形叠加和添加波形的语音合成的方法

公开(公告)号：US5381514A

公开(公告)日：1995-01-10

申请号：US101621

申请日：1992-12-23

申请人： Takashi Aso , Yasunori Ohora

发明人： Takashi Aso , Yasunori Ohora

IPC分类号： G10L13/06 , G10L13/00 , G10L13/04 , H03K3/84 , G10L9/00

CPC分类号： G10L13/04 , H03K3/84

摘要： A speech synthesizer includes a first indicator for indicating the amplitude of a waveform by using a random number, a second indicator for indicating the superposition period for waveforms by using a random number, a waveform generator for generating first and second waveforms having an amplitude indicated by the first indicator, and a waveform superposition device for synthesizing an unvoiced speech waveform by superposing the second waveform generated by the waveform generator onto a waveform obtained by delaying the first waveform by a superposition period indicated by the second indication means. The speech synthesizer is capable of making the frequency characteristic of unvoiced speech analogous to that of white noise, and generating synthesized speech which is natural and analogous to an actual human voice.

摘要翻译： 语音合成器包括用于通过使用随机数指示波形幅度的第一指示符，用于通过使用随机数指示波形的叠加周期的第二指示符，用于产生具有由第一指示符和波形叠加装置，用于通过将由波形发生器产生的第二波形叠加在由第二指示装置指示的叠加周期延迟第一波形而获得的波形上，合成清音语音波形。语音合成器能够使无声语音的频率特性类似于白噪声的频率特性，并且产生自然而且类似于实际人声的合成语音。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类