专利检索 ap:("Atsushi Imai" OR "Nobumasa Seiyama" OR "Tohru Takagi") AND inv:"Nobumasa Seiyama" 第 1 页

1.

发明申请
SPEECH SPEED CONVERSION FACTOR DETERMINING DEVICE, SPEECH SPEED CONVERSION DEVICE, PROGRAM, AND STORAGE MEDIUM 有权
标题翻译：语音速度转换因子确定设备，语音速度转换设备，程序和存储介质

公开(公告)号：US20130325456A1

公开(公告)日：2013-12-05

申请号：US13981950

申请日：2012-01-27

申请人： Tohru Takagi , Atsushi Imai , Nobumasa Seiyama , Reiko Saitou

发明人： Tohru Takagi , Atsushi Imai , Nobumasa Seiyama , Reiko Saitou

IPC分类号： G10L21/043

CPC分类号： G10L21/043 , G10L21/04 , G10L25/78 , G10L2025/783 , G10L2025/906

摘要： A speech speed conversion factor determining device has a physical index calculation unit including a sound/silence judgment unit that distinguishes between sound and silent intervals of an input signal, a fundamental frequency calculation unit that calculates a fundamental frequency of the signal in the sound intervals and determines stable and unstable intervals, a frequency smoothing unit that smoothes the fundamental frequency in the stable intervals, a pseudo fundamental frequency calculation unit that calculates, for the intervals, a pseudo fundamental frequency by interpolation , and a fundamental frequency general shape connection unit that connects the smoothed and pseudo frequencies to obtain sampled values of a general shape of the frequency, such that the sampled values are output as an index, based on which conversion factor are calculated.

摘要翻译： 语音速度转换因子确定装置具有物理指标计算单元，该物理指标计算单元包括区分输入信号的声音和静音间隔的声音/静音判断单元，计算声音间隔中的信号的基频的基频计算单元，确定稳定和不稳定的间隔，频率平滑单元，其在稳定间隔中平滑基频;伪基频计算单元，其通过插值来计算间隔的伪基频;以及基频通用形连接单元，其连接平滑和伪频率以获得频率的一般形状的采样值，使得基于哪个转换因子被计算，采样值作为索引输出。

2.

发明授权
Audio processing method, audio processing apparatus, and recording reproduction apparatus capable of outputting voice having regular pitch regardless of reproduction speed 有权
标题翻译：音频处理方法，音频处理装置和能够输出具有规则音调的语音的记录再现装置，而与再现速度无关

公开(公告)号：US06360198B1

公开(公告)日：2002-03-19

申请号：US09297730

申请日：1999-05-06

申请人： Atsushi Imai , Nobumasa Seiyama , Tohru Takagi

发明人： Atsushi Imai , Nobumasa Seiyama , Tohru Takagi

IPC分类号： G01L2104

CPC分类号： G11B20/10527 , G11B27/005 , G11B2220/90

摘要： A reproduction part reproduced at a changeable speed ratio r. An A/D conversion part A/D converts, based on sampling frequency fi, an audio signal reproduced at a speed different from that upon recording. A block data division part divides audio data based on an attribute possessed by the audio data. An audio data connection part successively interpolates or thins out the divided audio data based on a ratio of 1/r. A D/A conversion part D/A converts the interpolated or thinned-out audio data based on sampling frequency fo. If a relation of fi/fo=r/c is satisfied, the audio signal is outputted as a sound of high quality constantly synchronized with an image signal and having a pitch which does not change irrespective of the changeable speed ratio r at which the image signal is reproduced.

摘要翻译： 以可变速比r再现的再现部分。 A / D转换部分A / D基于采样频率fi转换以与记录时不同的速度再现的音频信号。块数据分割部根据音频数据拥有的属性来分割音频数据。音频数据连接部分基于1 / r的比率连续地内插或分出分割的音频数据。 D / A转换部分D / A基于采样频率fo转换内插或稀疏音频数据。如果满足fi / fo = r / c的关系，则音频信号被输出为与图像信号不断同步的高质量的声音，并且具有不改变的音调，而不管图像的可变速比r如何信号被复制。

3.

发明授权
Speech speed conversion factor determining device, speech speed conversion device, program, and storage medium 有权
标题翻译：语音速度转换因子确定装置，语音速度转换装置，程序和存储介质

公开(公告)号：US09129609B2

公开(公告)日：2015-09-08

申请号：US13981950

申请日：2012-01-27

申请人： Tohru Takagi , Atsushi Imai , Nobumasa Seiyama , Reiko Saitou

发明人： Tohru Takagi , Atsushi Imai , Nobumasa Seiyama , Reiko Saitou

IPC分类号： G10L19/06 , G10L21/043 , G10L21/04 , G10L25/78 , G10L25/90

CPC分类号： G10L21/043 , G10L21/04 , G10L25/78 , G10L2025/783 , G10L2025/906

摘要： A speech speed conversion factor determining device has a physical index calculation unit including a sound/silence judgment unit that distinguishes between sound and silent intervals of an input signal, a fundamental frequency calculation unit that calculates a fundamental frequency of the signal in the sound intervals and determines stable and unstable intervals, a frequency smoothing unit that smoothes the fundamental frequency in the stable intervals, a pseudo fundamental frequency calculation unit that calculates, for the intervals, a pseudo fundamental frequency by interpolation , and a fundamental frequency general shape connection unit that connects the smoothed and pseudo frequencies to obtain sampled values of a general shape of the frequency, such that the sampled values are output as an index, based on which conversion factor are calculated.

摘要翻译： 语音速度转换因子确定装置具有物理指标计算单元，该物理指标计算单元包括区分输入信号的声音和静音间隔的声音/静音判断单元，计算声音间隔中的信号的基频的基频计算单元，确定稳定和不稳定的间隔，频率平滑单元，其在稳定间隔中平滑基频;伪基频计算单元，其通过插值来计算间隔的伪基频;以及基频通用形连接单元，其连接平滑和伪频率以获得频率的一般形状的采样值，使得基于哪个转换因子被计算，采样值作为索引输出。

4.

发明授权
Adaptive speech rate conversion without extension of input data duration, using speech interval detection 有权

公开(公告)号：US06374213B1

公开(公告)日：2002-04-16

申请号：US09781634

申请日：2001-02-12

申请人： Atsushi Imai , Nobumasa Seiyama , Tohru Takagi

发明人： Atsushi Imai , Nobumasa Seiyama , Tohru Takagi

IPC分类号： G10L1102

摘要： Frame power of an input signal is calculated to discriminate speech frame intervals from non-speech intervals, by thresholding current frame power using an adaptive speech-detection threshold based on the past maximum frame power value and the difference between past maximum and the minimum frame power values, adaptively updated using a predetermined number of frames prior to the current one.

5.

发明授权
Adaptive speech rate conversion without extension of input data duration, using speech interval detection 有权
标题翻译：自适应语音速率转换，不扩展输入数据持续时间，使用语音间隔检测

公开(公告)号：US06236970B1

公开(公告)日：2001-05-22

申请号：US09202867

申请日：1998-12-22

申请人： Atsushi Imai , Nobumasa Seiyama , Tohru Takagi

发明人： Atsushi Imai , Nobumasa Seiyama , Tohru Takagi

IPC分类号： G10L2104

CPC分类号： G10L25/78 , G10L2025/786

摘要： A speech-rate converter slowing down input speech regularly monitors the data length of the input speech and the previously estimated extended output data length for the current rate scaling factor, computing new output data length estimates. The conversion rate is adaptively modified depending on the time lag between input and output speech so as to make input and output data lengths consistent without skipping any spoken input portions. Input signal power is monitored to discriminate speech and non-speech intervals, and the portions of input non-speech intervals exceeding a conversion-rate-dependent duration are deleted.

摘要翻译： 降低输入语音的语音速率转换器定期监视输入语音的数据长度和当前速率缩放因子的先前估计的扩展输出数据长度，计算新的输出数据长度估计。根据输入和输出语音之间的时间间隔自适应地修改转换速率，以使输入和输出数据长度保持一致，而不会跳过任何口头输入部分。监视输入信号功率以区分语音和非语音间隔，并且删除超过转换速率相关持续时间的输入非语音间隔的部分。

6.

发明授权
Method and device for instantly changing the speed of a speech 有权
标题翻译：立即改变语音速度的方法和装置

公开(公告)号：US06205420B1

公开(公告)日：2001-03-20

申请号：US09180429

申请日：1998-11-06

申请人： Tohru Takagi , Nobumasa Seiyama , Atsushi Imai , Akio Ando

发明人： Tohru Takagi , Nobumasa Seiyama , Atsushi Imai , Akio Ando

IPC分类号： G10L2100

CPC分类号： G10L21/04

摘要： An analysis processor applies an analysis process to input speech data thereby to obtain block lengths for respective attributes of voiced sound, voiceless sound and silence. A block data splitter splits the input speech data into blocks having the block lengths dependent on the respective attributes. A block data memory sequentially stores speech data split by the block data splitter as block speech data and the block lengths. A connection data generator generates connection data for connecting the adjacent block speech data each other at every moment by using the block speech data. A connection data storing portion sequentially stores the connection data. A connection order generator generates block connection order of the block speech data and the connection data at every moment according to at least the block lengths output sequentially from the block data storing portion and extension scaling factors in time for the respective attributes. A speech data connector connects sequentially the block speech data and the connection data based on the block connection order. Accordingly, the speed of output speech can be instantly changed in response to an instruction of an operator.

摘要翻译： 分析处理器应用分析处理来输入语音数据，从而获得有声声音，无声音和静音的各个属性的块长度。块数据分离器将输入语音数据分割成具有取决于相应属性的块长度的块。块数据存储器顺序地存储由块数据分割器分割的语音数据作为块语音数据和块长度。连接数据发生器通过使用块语音数据产生用于在每一时刻彼此相邻的块语音数据连接的连接数据。连接数据存储部分依次存储连接数据。连接顺序发生器至少根据块数据存储部分顺序地输出的块长度和相应属性的时间上的扩展缩放因子，在每个时刻产生块语音数据和连接数据的块连接顺序。语音数据连接器基于块连接顺序顺序地连接块语音数据和连接数据。因此，可以响应于操作者的指令立即改变输出语音的速度。

7.

发明授权
Method and apparatus for hearing assistance with speech speed control function 失效
标题翻译：用于语音速度控制功能的助听器的方法和装置

公开(公告)号：US5305420A

公开(公告)日：1994-04-19

申请号：US950411

申请日：1992-09-22

申请人： Akira Nakamura , Ryou Ikezawa , Nobumasa Seiyama , Tohru Takagi , Eiichi Miyasaka

发明人： Akira Nakamura , Ryou Ikezawa , Nobumasa Seiyama , Tohru Takagi , Eiichi Miyasaka

IPC分类号： G09B19/04 , G09B21/00 , G10L21/04 , H04R25/00 , G10L9/02

CPC分类号： G09B21/009 , G09B19/04 , G10L21/04 , H04R25/505 , H04R2225/43

摘要： A method and an apparatus for hearing assistance, capable of compensating the lowering of the speech recognition ability related to the deterioration of the auditory sense center. The input speech is divided into voiced speech sections, unvoiced speech sections, and silent sections, of which the voiced speech sections and the silent sections are appropriately extended/contracted while the unvoiced speech sections are left unchanged, and then these sections are combined in an identical order as in the input speech, so as to obtain output speech which is easier to listen for a listener with a handicapped hearing ability. Also, only the silent sections other than the punctuation silent sections for pauses due to punctuation between sentences can be contracted and the speech speed for each of the voiced speech sections can be adjusted, and then the adjusted voiced speech sections, the unvoiced speech sections, the punctuation silent sections and the contracted silent sections can be combined in an identical order as in the input speech, in order to realize the real time hearing assistance without extending the speech utterance period.

摘要翻译： 一种用于听力辅助的方法和装置，其能够补偿与听觉中心的恶化相关的语音识别能力的降低。输入语音被分为有声语音部分，无声语音部分和静音部分，其中有声语音部分和无声部分被适当地扩展/收缩，而清音语音部分保持不变，然后将这些部分组合成与输入语音相同的顺序，以便获得更容易听到具有残疾听力的听众的输出语音。此外，只有在句子之间的标点符号之外的仅用于暂停的标点符号的无声部分才能被收缩，并且可以调整每个有声语音部分的语音速度，然后调整的有声语音部分，无声语音部分，可以以与输入语音中相同的顺序组合标点无声部分和合约无声部分，以便在不延长语音发音周期的情况下实现实时听力辅助。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类