专利检索 ap:("Kaoru Satoh" OR "Toshiyuki Morii" OR "Hiroyuki Ehara") AND inv:"Hiroyuki Ehara" 第 1 页

1.

发明授权
Vector quantization apparatus, vector dequantization apparatus, and the methods 有权
标题翻译：矢量量化装置，矢量逆量化装置及方法

公开(公告)号：US08438020B2

公开(公告)日：2013-05-07

申请号：US12682086

申请日：2008-10-10

申请人： Kaoru Satoh , Toshiyuki Morii , Hiroyuki Ehara

发明人： Kaoru Satoh , Toshiyuki Morii , Hiroyuki Ehara

IPC分类号： G10L21/00

CPC分类号： G10L19/032 , G10L19/07 , G10L19/18 , G10L2019/0005

摘要： A vector quantizer which improves the accuracy of vector quantization in switching over a vector quantization codebook on a first stage depending on the type of feature having the correlation with a quantization target vector. In the vector quantizer, a classifier generates classification information representing a type of narrowband LSP vector having the correlation with wideband LSP (Line Spectral Pairs) of the plural types. A first codebook selects one sub-codebook corresponding to the classification information as a codebook used for the quantization of the first stage from plural sub-codebooks corresponding to each of the types of narrowband LSP vectors. A multiplier multiplies the quantization residual vector of the first stage inputted from an adder by a scaling factor corresponding to the classification information of plural scaling factors stored in a scaling factor determiner and outputs it to an adder as the quantization target of a second stage.

摘要翻译： 矢量量化器，其根据与量化目标矢量具有相关性的特征的类型，提高在第一级切换矢量量化码本时的矢量量化的精度。在矢量量化器中，分类器生成表示与多种类型的宽带LSP（线谱对）具有相关性的窄带LSP矢量的类型的分类信息。第一码本从对应于每种类型的窄带LSP矢量的多个子码本中选择与分类信息相对应的一个子码本作为用于第一级的量化的码本。乘法器将从加法器输入的第一级的量化残差矢量乘以与存储在缩放因子确定器中的多个缩放因子的分类信息相对应的缩放因子，并将其作为第二级的量化对象输出到加法器。

2.

发明申请
TONE DETERMINATION DEVICE AND TONE DETERMINATION METHOD 审中-公开
标题翻译：音调测定装置和音调测定方法

公开(公告)号：US20110301946A1

公开(公告)日：2011-12-08

申请号：US13202170

申请日：2010-02-26

申请人： Kaoru Satoh , Toshiyuki Morii , Hiroyuki Ehara

发明人： Kaoru Satoh , Toshiyuki Morii , Hiroyuki Ehara

IPC分类号： G10L19/00

CPC分类号： H04Q1/46

摘要： Disclosed is a tone determination device that determines the tonality of an input signal using correlations between the frequency components of a current frame with the frequency components of the preceding frame, such that the tone determination device is able to decrease the calculation complexity. In the device, a vector coupling unit (104) couples some of the SDFT coefficients of the preceding frame with some of the down-sampled SDFT coefficients of the preceding frame to generate new SDFT coefficients, and also couples some of the SDFT coefficients of the current frame with some of the down-sampled SDFT coefficients of the current frame to generate new SDFT coefficients. A correlation analysis unit (105) finds correlations for the SDFT coefficients between frames, and also finds the power of the current frame for each specific band. A band determination unit (106) determines the band with the greatest power and outputs the location information for the determined band as shift information, and a tone determination unit (107) determines the tonality of the input signal according to the values of the correlations input from the correlation analysis unit (105).

摘要翻译： 公开了一种音调确定装置，其使用当前帧的频率分量与前一帧的频率分量之间的相关性来确定输入信号的音调，使得音调确定装置能够降低计算复杂度。在装置中，矢量耦合单元（104）将前一帧的一些SDFT系数与先前帧的下采样SDFT系数中的一些相耦合，以产生新的SDFT系数，并且还耦合一些SDFT系数当前帧与当前帧的一些下采样SDFT系数，以产生新的SDFT系数。相关分析单元（105）找出帧之间的SDFT系数的相关性，并且还找到每个特定频带的当前帧的功率。频带确定单元（106）确定具有最大功率的频带，并输出所确定频带的位置信息作为偏移信息，并且音调确定单元（107）根据相关输入值确定输入信号的音调（105）。

3.

发明申请
VECTOR QUANTIZER, VECTOR INVERSE QUANTIZER, AND THE METHODS 有权
标题翻译：矢量量化器，矢量反相量子和方法

公开(公告)号：US20100211398A1

公开(公告)日：2010-08-19

申请号：US12682086

申请日：2008-10-10

申请人： Kaoru Satoh , Toshiyuki Morii , Hiroyuki Ehara

发明人： Kaoru Satoh , Toshiyuki Morii , Hiroyuki Ehara

IPC分类号： G10L19/00

CPC分类号： G10L19/032 , G10L19/07 , G10L19/18 , G10L2019/0005

摘要： A vector quantizer which improves the accuracy of vector quantization in switching over a vector quantization codebook on a first stage depending on the type of feature having the correlation with a quantization target vector. In the vector quantizer, a classifier (101) generates classification information representing a type of narrowband LSP vector having the correlation with wideband LSP (Line Spectral Pairs) out of the plural types. A first codebook (103) selects one sub-codebook corresponding to the classification information as a codebook used for the quantization of the first stage from plural sub-codebooks (CBa1 to CBan) corresponding to each of the types of narrowband LSP vectors. A multiplier (107) multiplies the quantization residual vector of the first stage inputted from an adder (104) by a scaling factor corresponding to the classification information out of plural scaling factors stored in a scaling factor determining section (106) and outputs it to an adder (109) as the quantization target of a second stage.

摘要翻译： 矢量量化器，其根据与量化目标矢量具有相关性的特征的类型，提高在第一级切换矢量量化码本时的矢量量化的精度。在矢量量化器中，分类器（101）生成表示与多种类型中的宽带LSP（线谱对）相关的窄带LSP矢量的类型的分类信息。第一码本（103）从与各种窄带LSP矢量对应的多个子码本（CBa1〜CBan）中选择与分类信息对应的一个子码本作为用于第一级量化的码本。乘法器（107）将从加法器（104）输入的第一级的量化残差矢量与存储在缩放因子确定部分（106）中的多个缩放因子中的分类信息相对应的缩放因子相乘，并将其输出到加法器（109）作为第二级的量化目标。

4.

发明授权
Wide-band encoding device, wide-band LSP prediction device, band scalable encoding device, wide-band encoding method 有权
标题翻译：宽带编码装置，宽带LSP预测装置，频带可伸缩编码装置，宽带编码方法

公开(公告)号：US08229749B2

公开(公告)日：2012-07-24

申请号：US11721358

申请日：2005-12-09

申请人： Hiroyuki Ehara , Koji Yoshida , Toshiyuki Morii

发明人： Hiroyuki Ehara , Koji Yoshida , Toshiyuki Morii

IPC分类号： G10L13/00 , G10L19/00 , G10L21/04

CPC分类号： G10L19/24 , G10L19/07 , G10L21/038

摘要： There is provided a wide-band LSP prediction device and others capable of predicting a wide-band LSP from a narrow-band LSP with a high quantization efficiency and a high accuracy while suppressing the size of a conversion table correlating the narrow-band LSP to the wide-band LSP. In this device, a non-linear prediction unit (102) performs non-linear prediction by using a converted wide-band LSP inputted from a narrow-band/wide-band conversion unit (101) and inputs the non-linear prediction result to an amplifier (103). The converted wide-band LSP is inputted to an amplifier (104). An adder (122) adds multiplication results (vectors) inputted from the amplifiers (103, 104).

摘要翻译： 提供了一种宽带LSP预测装置和其他能够在抑制与窄带LSP相关联的转换表的大小的同时以高量化效率和高精度预测来自窄带LSP的宽带LSP的方法宽带LSP。在该装置中，非线性预测单元（102）通过使用从窄带/宽带转换单元（101）输入的转换宽带LSP进行非线性预测，并将非线性预测结果输入到放大器（103）。转换的宽带LSP被输入到放大器（104）。加法器（122）将从放大器（103,104）输入的相乘结果相加。

5.

发明授权
CELP speech decoder modifying an input vector with a fixed waveform to transform a waveform of the input vector 有权
标题翻译： CELP语音解码器用固定波形修改输入向量，以变换输入矢量的波形

公开(公告)号：US08036887B2

公开(公告)日：2011-10-11

申请号：US12781049

申请日：2010-05-17

申请人： Kazutoshi Yasunaga , Toshiyuki Morii , Hiroyuki Ehara

发明人： Kazutoshi Yasunaga , Toshiyuki Morii , Hiroyuki Ehara

IPC分类号： G10L19/12

CPC分类号： G10L19/135 , G10L19/12 , G10L2019/0007 , G10L2019/0013

摘要： A CELP speech decoder includes an adaptive codebook that generates an adaptive code vector and a random codebook that generates a random code vector. The random codebook includes an input vector provider that provides an input vector including at least one pulse, each pulse having a position and a polarity, a fixed waveform storage that stores at least one fixed waveform, and a selector that selects at least one of a first process and a second process based on a value of an adaptive codebook gain. The random codebook further includes a convolution section that generates the random code vector by convoluting the at least one fixed waveform with the input vector when the first process is selected. A synthesis filter outputs synthesized speech by performing linear prediction coefficient synthesis on a signal based on the adaptive code vector and the random code vector.

摘要翻译： CELP语音解码器包括产生自适应码矢量的自适应码本和产生随机码矢量的随机码本。所述随机码本包括输入向量提供者，所述输入向量提供器提供包括至少一个脉冲，每个具有位置和极性的脉冲，存储至少一个固定波形的固定波形存储器的输入向量，以及选择器，第一处理和基于自适应码本增益的值的第二处理。所述随机码本还包括卷积部分，其通过在选择所述第一处理时将所述至少一个固定波形与所述输入矢量进行卷积来生成所述随机码矢量。合成滤波器通过对基于自适应码矢量和随机码矢量的信号执行线性预测系数合成来输出合成语音。

6.

发明授权
Excitation vector generator and a method for generating an excitation vector including a convolution system 有权
标题翻译：励磁矢量发生器和用于产生包括卷积系统的激励矢量的方法

公开(公告)号：US06947889B2

公开(公告)日：2005-09-20

申请号：US09843939

申请日：2001-04-30

申请人： Kazutoshi Yasunaga , Toshiyuki Morii , Hiroyuki Ehara

发明人： Kazutoshi Yasunaga , Toshiyuki Morii , Hiroyuki Ehara

IPC分类号： G10L19/00 , G10L19/08 , G10L19/12 , G10L19/135 , G10L19/14 , G10L21/00 , G10I19/14

CPC分类号： G10L19/135 , G10L19/12 , G10L2019/0007 , G10L2019/0013

摘要： A random code vector reading section and a random codebook of a conventional CELP type speech coder/decoder are respectively replaced with an oscillator for outputting different vector streams in accordance with values of input seeds, and a seed storage section for storing a plurality of seeds. This makes it unnecessary to store fixed vectors as they are in a fixed codebook (ROM), thereby considerably reducing the memory capacity.

摘要翻译： 传统CELP型语音编码器/解码器的随机码矢量读取部分和随机码本分别被替换为根据输入种子的值输出不同矢量流的振荡器和用于存储多个种子的种子存储部分。这使得不必像固定码本（ROM）一样存储固定向量，从而显着地降低了存储器容量。

7.

发明申请
Excitation vector generator, speech coder and speech decoder 有权
标题翻译：激励矢量发生器，语音编码器和语音解码器

公开(公告)号：US20050203736A1

公开(公告)日：2005-09-15

申请号：US11126171

申请日：2005-05-11

申请人： Kazutoshi Yasunaga , Toshiyuki Morii , Hiroyuki Ehara

发明人： Kazutoshi Yasunaga , Toshiyuki Morii , Hiroyuki Ehara

IPC分类号： G10L19/00 , G10L19/08 , G10L19/12 , G10L19/135 , G10L19/14 , G10L21/00

CPC分类号： G10L19/135 , G10L19/12 , G10L2019/0007 , G10L2019/0013

摘要： A noise canceller removes a noise component from an input speech signal. The noise canceller includes a noise cancellation coefficient adjuster that adjusts a noise cancellation coefficient to determine an amount of noise cancellation. A noise spectrum storage device stores an estimated noise spectrum. A noise estimator estimates a noise spectrum by comparing an input spectrum with a noise spectrum stored in the noise spectrum storage device. A noise canceling/spectrum compensator subtracts the noise spectrum stored in the noise spectrum storage device from the input spectrum based on a coefficient acquired by the noise cancellation coefficient adjuster.

摘要翻译： 噪声消除器从输入语音信号中去除噪声分量。噪声消除器包括噪声消除系数调节器，其调整噪声消除系数以确定噪声消除量。噪声频谱存储装置存储估计的噪声谱。噪声估计器通过将输入频谱与存储在噪声频谱存储装置中的噪声谱进行比较来估计噪声谱。噪声消除/频谱补偿器基于由噪声消除系数调整器获取的系数从输入频谱中减去存储在噪声频谱存储装置中的噪声谱。

8.

发明授权
Excitation vector generator, speech coder and speech decoder 有权
标题翻译：激励矢量发生器，语音编码器和语音解码器

公开(公告)号：US06757650B2

公开(公告)日：2004-06-29

申请号：US09855708

申请日：2001-05-16

申请人： Kazutoshi Yasunaga , Toshiyuki Morii , Taisuke Watanabe , Hiroyuki Ehara

发明人： Kazutoshi Yasunaga , Toshiyuki Morii , Taisuke Watanabe , Hiroyuki Ehara

IPC分类号： G10L1904

CPC分类号： G10L19/135 , G10L19/12 , G10L2019/0007 , G10L2019/0013

摘要： A random code vector reading section and a random codebook of a conventional CELP type speech coder/decoder are respectively replaced with an oscillator for outputting different vector streams in accordance with values of input seeds, and a seed storage section for storing a plurality of seeds. This makes it unnecessary to store fixed vectors as they are in a fixed codebook (ROM), thereby considerably reducing the memory capacity.

摘要翻译： 传统CELP型语音编码器/解码器的随机码矢量读取部分和随机码本分别被替换为根据输入种子的值输出不同矢量流的振荡器和用于存储多个种子的种子存储部分。这使得不必像固定码本（ROM）一样存储固定向量，从而显着地降低了存储器容量。

9.

发明授权
Quantizer, encoder, and the methods thereof 有权
标题翻译：量化器，编码器及其方法

公开(公告)号：US08473288B2

公开(公告)日：2013-06-25

申请号：US12990697

申请日：2009-06-18

申请人： Toshiyuki Morii , Hiroyuki Ehara , Koji Yoshida

发明人： Toshiyuki Morii , Hiroyuki Ehara , Koji Yoshida

IPC分类号： G10L19/00

CPC分类号： G10L19/008 , G10L25/27

摘要： Disclosed are a quantizer, encoder, and the methods thereof, wherein the computational load is reduced when the values related to the transform coefficients of the principal component analysis transform are quantized when a principal component analysis transform is applied to code stereo. A quantizer includes a power correlation calculator which calculates the power of the left channel signal, the power of the right channel signal, and the correlation between the left channel signal and the right channel signal; an intermediate value calculator which calculates the intermediate value which is the difference between left channel signal the power and the right channel signal power; a codebook which holds a plurality of sets of the coefficients related to the transform coefficients of the principal component analysis transform and the code; and a quantizer which calculates the sum of the first multiplication result obtained by multiplying the coefficient by the correlation value and the second multiplication result obtained by multiplying the coefficient by the intermediate value as the cost function E, selects the coefficients where the cost function E becomes the maximum, and fetches the code related to the selected coefficients as the quantized code.

摘要翻译： 公开了一种量化器，编码器及其方法，其中当将主分量分析变换应用于代码立体声时，当与主成分分析变换的变换系数相关的值被量化时，计算负荷减小。量化器包括功率相关计算器，其计算左声道信号的功率，右声道信号的功率，以及左声道信号和右声道信号之间的相关性; 中间值计算器，其计算作为左声道信号的功率和右声道信号功率之间的差的中间值; 码本，其保存与主成分分析变换和代码的变换系数相关的多个系数集合; 以及量化器，其计算通过将系数乘以相关值而获得的第一相乘结果与通过将系数乘以中间值而获得的第二乘法结果作为成本函数E的和，选择成本函数E变为最大值，并且将与所选择的系数相关的代码作为量化代码获取。

10.

发明授权
Speech encoding apparatus and speech encoding method 有权
标题翻译：语音编码装置和语音编码方法

公开(公告)号：US08239191B2

公开(公告)日：2012-08-07

申请号：US12440661

申请日：2007-09-14

申请人： Hiroyuki Ehara , Toshiyuki Morii , Koji Yoshida

发明人： Hiroyuki Ehara , Toshiyuki Morii , Koji Yoshida

IPC分类号： G10L19/00 , G10L21/02

CPC分类号： G10L19/265 , G10L19/08

摘要： Disclosed is an audio encoding device capable of adjusting a spectrum inclination of a quantized noise without changing the Formant weight. The device includes: an HPF (131) which extracts a high-frequency component of the frequency region from an input audio signal; a high-frequency energy level calculation unit (132) which calculates an energy level of the high-frequency component in a frame unit; an LPF (133) which extracts a low-frequency component of the frequency region from the input audio signal; a low-energy level calculation unit (134) which calculates an energy level of a low-frequency component in a frame unit; an inclination correction coefficient calculation unit (141) multiplies the difference between SNR of the high-frequency component and SNR of the low-frequency component inputted from an adder (140) by a constant and adds a bias component to the product so as to calculate an inclination correction coefficient ?3. The inclination correction coefficient is used for adjusting the spectrum inclination of a quantized noise.

摘要翻译： 公开了能够在不改变共振峰重量的情况下调整量化噪声的频谱倾斜度的音频编码装置。该装置包括：HPF（131），其从输入音频信号中提取频率区域的高频分量; 高频能量计算单元，其计算帧单位中的高频分量的能级; LPF（133），其从所述输入音频信号中提取所述频率区域的低频分量; 低能量计算单元（134），其计算帧单位中的低频分量的能级; 倾斜校正系数计算单元（141）将从加法器（140）输入的低频分量的SNR与高频成分的SNR之差乘以常数，并将偏置分量加到乘积上，以计算倾斜校正系数α3。倾斜校正系数用于调整量化噪声的频谱倾斜度。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类