Patent search: ap:("Hiroyuki Ehara" OR "Toshiyuki Morii" OR "Koji Yoshida") AND inv:"Hiroyuki Ehara" (page 1)

1.

Granted patent
Wide-band encoding device, wide-band LSP prediction device, band scalable encoding device, wide-band encoding method (status: in force)

Publication No.: US08229749B2

Publication date: 2012-07-24

Application No.: US11721358

Filing date: 2005-12-09

Applicants: Hiroyuki Ehara, Koji Yoshida, Toshiyuki Morii

Inventors: Hiroyuki Ehara, Koji Yoshida, Toshiyuki Morii

IPC classification: G10L13/00, G10L19/00, G10L21/04

CPC classification: G10L19/24, G10L19/07, G10L21/038

Abstract: There is provided a wide-band LSP prediction device and others capable of predicting a wide-band LSP from a narrow-band LSP with high quantization efficiency and high accuracy while suppressing the size of a conversion table correlating the narrow-band LSP to the wide-band LSP. In this device, a non-linear prediction unit (102) performs non-linear prediction by using a converted wide-band LSP inputted from a narrow-band/wide-band conversion unit (101) and inputs the non-linear prediction result to an amplifier (103). The converted wide-band LSP is inputted to an amplifier (104). An adder (122) adds the multiplication results (vectors) inputted from the amplifiers (103, 104).
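
The signal flow in this abstract (two amplifiers feeding an adder) amounts to a weighted sum of two vectors. A minimal Python sketch follows, assuming hypothetical names throughout: the abstract does not say how the non-linear prediction unit (102) works or what the amplifier gains are, so `nonlinear_predict`, `gain_nonlinear` and `gain_converted` are placeholders supplied by the caller.

```python
from typing import Callable, List, Sequence


def predict_wideband_lsp(
    converted: Sequence[float],
    nonlinear_predict: Callable[[Sequence[float]], Sequence[float]],
    gain_nonlinear: float,
    gain_converted: float,
) -> List[float]:
    """Combine a non-linear prediction with the converted wide-band LSP.

    converted         -- output of the narrow-band/wide-band conversion (101)
    nonlinear_predict -- stand-in for the non-linear prediction unit (102);
                         its real form is not given in the abstract
    gain_nonlinear    -- assumed gain of amplifier (103)
    gain_converted    -- assumed gain of amplifier (104)
    """
    predicted = nonlinear_predict(converted)
    if len(predicted) != len(converted):
        raise ValueError("prediction and converted LSP must have equal length")
    # Adder (122): element-wise sum of the two scaled vectors.
    return [
        gain_nonlinear * p + gain_converted * c
        for p, c in zip(predicted, converted)
    ]


if __name__ == "__main__":
    # Toy predictor used only for illustration.
    print(predict_wideband_lsp([1.0, 2.0], lambda v: [x * x for x in v], 0.5, 0.5))
```

Passing the predictor in as a function keeps the sketch independent of whatever non-linear mapping the patent actually claims.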

2.

Granted patent
Quantizer, encoder, and the methods thereof (status: in force)

Publication No.: US08473288B2

Publication date: 2013-06-25

Application No.: US12990697

Filing date: 2009-06-18

Applicants: Toshiyuki Morii, Hiroyuki Ehara, Koji Yoshida

Inventors: Toshiyuki Morii, Hiroyuki Ehara, Koji Yoshida

IPC classification: G10L19/00

CPC classification: G10L19/008, G10L25/27

Abstract: Disclosed are a quantizer, encoder, and the methods thereof, wherein the computational load is reduced when the values related to the transform coefficients of the principal component analysis transform are quantized when a principal component analysis transform is applied to code stereo signals. A quantizer includes a power correlation calculator which calculates the power of the left channel signal, the power of the right channel signal, and the correlation between the left channel signal and the right channel signal; an intermediate value calculator which calculates the intermediate value, which is the difference between the left channel signal power and the right channel signal power; a codebook which holds a plurality of sets of the coefficients related to the transform coefficients of the principal component analysis transform and the code; and a quantizer which calculates, as the cost function E, the sum of the first multiplication result obtained by multiplying the coefficient by the correlation value and the second multiplication result obtained by multiplying the coefficient by the intermediate value, selects the coefficients where the cost function E becomes the maximum, and fetches the code related to the selected coefficients as the quantized code.
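
The search described in this abstract can be sketched as a loop over codebook entries that keeps the entry with the largest cost E. The Python below is a minimal illustration only. It assumes each codebook entry is a pair of coefficients, with the first multiplied by the correlation and the second by the intermediate value; the function name `search_codebook` and the toy codebook values are hypothetical and not taken from the patent.

```python
from typing import List, Sequence, Tuple


def search_codebook(
    left: Sequence[float],
    right: Sequence[float],
    codebook: Sequence[Tuple[float, float]],
) -> int:
    """Return the index (code) of the entry that maximizes the cost E."""
    if not codebook:
        raise ValueError("codebook must not be empty")
    # Power correlation calculator: channel powers and cross-correlation.
    power_left = sum(x * x for x in left)
    power_right = sum(x * x for x in right)
    correlation = sum(l * r for l, r in zip(left, right))
    # Intermediate value calculator: difference of the two powers.
    intermediate = power_left - power_right

    best_code = 0
    best_cost = float("-inf")
    for code, (coef_corr, coef_inter) in enumerate(codebook):
        # Cost E: two multiplications and one addition per entry
        # (the coefficient-to-term pairing is an assumption).
        cost = coef_corr * correlation + coef_inter * intermediate
        if cost > best_cost:
            best_code, best_cost = code, cost
    return best_code


if __name__ == "__main__":
    toy_codebook: List[Tuple[float, float]] = [(0.0, 1.0), (1.0, 0.0)]
    print(search_codebook([1.0, 0.0], [1.0, 0.0], toy_codebook))
```

Because the powers and the correlation are computed once per frame, each codebook entry costs only two multiplications and one addition, which is consistent with the reduced computational load the abstract claims.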

3.

Granted patent
Speech encoding apparatus and speech encoding method (status: in force)

Publication No.: US08239191B2

Publication date: 2012-08-07

Application No.: US12440661

Filing date: 2007-09-14

Applicants: Hiroyuki Ehara, Toshiyuki Morii, Koji Yoshida

Inventors: Hiroyuki Ehara, Toshiyuki Morii, Koji Yoshida

IPC classification: G10L19/00, G10L21/02

CPC classification: G10L19/265, G10L19/08

Abstract: Disclosed is an audio encoding device capable of adjusting a spectrum inclination of a quantized noise without changing the formant weight. The device includes: an HPF (131) which extracts a high-frequency component of the frequency region from an input audio signal; a high-frequency energy level calculation unit (132) which calculates an energy level of the high-frequency component in a frame unit; an LPF (133) which extracts a low-frequency component of the frequency region from the input audio signal; a low-frequency energy level calculation unit (134) which calculates an energy level of the low-frequency component in a frame unit; and an inclination correction coefficient calculation unit (141) which multiplies the difference between the SNR of the high-frequency component and the SNR of the low-frequency component, inputted from an adder (140), by a constant and adds a bias component to the product so as to calculate an inclination correction coefficient γ3. The inclination correction coefficient is used for adjusting the spectrum inclination of a quantized noise.
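
The last step of this abstract is a linear mapping from an SNR difference to the inclination correction coefficient. A minimal Python sketch follows, assuming SNR values in dB and subtracting the high-band SNR from the low-band SNR; the abstract fixes neither the units nor the order of subtraction, and the parameter names `scale` and `bias` are hypothetical.

```python
def tilt_correction_coefficient(
    snr_high_db: float,
    snr_low_db: float,
    scale: float,
    bias: float,
) -> float:
    """Map the per-frame band SNR difference to a correction coefficient.

    snr_high_db -- SNR of the high-frequency component (assumed to be in dB)
    snr_low_db  -- SNR of the low-frequency component (assumed to be in dB)
    scale       -- the constant multiplier (hypothetical name)
    bias        -- the bias component added to the product
    """
    # Adder (140): difference of the two SNRs. The order of subtraction
    # is an assumption, since the abstract does not pin it down.
    snr_difference = snr_low_db - snr_high_db
    # Calculation unit (141): scale the difference and add the bias.
    return scale * snr_difference + bias


if __name__ == "__main__":
    print(tilt_correction_coefficient(10.0, 20.0, 0.5, 0.25))
```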

4.

Patent application
WIDE-BAND ENCODING DEVICE, WIDE-BAND LSP PREDICTION DEVICE, BAND SCALABLE ENCODING DEVICE, WIDE-BAND ENCODING METHOD (status: in force)

Publication No.: US20090292537A1

Publication date: 2009-11-26

Application No.: US11721358

Filing date: 2005-12-09

Applicants: Hiroyuki Ehara, Koji Yoshida, Toshiyuki Morii

Inventors: Hiroyuki Ehara, Koji Yoshida, Toshiyuki Morii

IPC classification: G10L19/04

CPC classification: G10L19/24, G10L19/07, G10L21/038

Abstract: There is provided a wide-band LSP prediction device and others capable of predicting a wide-band LSP from a narrow-band LSP with high quantization efficiency and high accuracy while suppressing the size of a conversion table correlating the narrow-band LSP to the wide-band LSP. In this device, a non-linear prediction unit (102) performs non-linear prediction by using a converted wide-band LSP inputted from a narrow-band/wide-band conversion unit (101) and inputs the non-linear prediction result to an amplifier (103). The converted wide-band LSP is inputted to an amplifier (104). An adder (122) adds the multiplication results (vectors) inputted from the amplifiers (103, 104).

5.

Patent application
SPEECH ENCODING APPARATUS AND SPEECH ENCODING METHOD (status: in force)

Publication No.: US20090265167A1

Publication date: 2009-10-22

Application No.: US12440661

Filing date: 2007-09-14

Applicants: Hiroyuki Ehara, Toshiyuki Morii, Koji Yoshida

Inventors: Hiroyuki Ehara, Toshiyuki Morii, Koji Yoshida

IPC classification: G10L19/08, G10L19/14

CPC classification: G10L19/265, G10L19/08

Abstract: Disclosed is an audio encoding device capable of adjusting a spectrum inclination of a quantized noise without changing the formant weight. The device includes: an HPF (131) which extracts a high-frequency component of the frequency region from an input audio signal; a high-frequency energy level calculation unit (132) which calculates an energy level of the high-frequency component in a frame unit; an LPF (133) which extracts a low-frequency component of the frequency region from the input audio signal; a low-frequency energy level calculation unit (134) which calculates an energy level of the low-frequency component in a frame unit; and an inclination correction coefficient calculation unit (141) which multiplies the difference between the SNR of the high-frequency component and the SNR of the low-frequency component, inputted from an adder (140), by a constant and adds a bias component to the product so as to calculate an inclination correction coefficient γ3. The inclination correction coefficient is used for adjusting the spectrum inclination of a quantized noise.

6.

Patent application
QUANTIZER, ENCODER, AND THE METHODS THEREOF (status: in force)

Publication No.: US20110125495A1

Publication date: 2011-05-26

Application No.: US12990697

Filing date: 2009-06-18

Applicants: Toshiyuki Morii, Hiroyuki Ehara, Koji Yoshida

Inventors: Toshiyuki Morii, Hiroyuki Ehara, Koji Yoshida

IPC classification: G10L19/00

CPC classification: G10L19/008, G10L25/27

Abstract: Disclosed are a quantizer, encoder, and the methods thereof, wherein the computational load is reduced when the values related to the transform coefficients of the principal component analysis transform are quantized when a principal component analysis transform is applied to code stereo signals. A quantizer (110) is comprised of a power correlation calculation unit (111) which calculates the power (C11) of the left channel signal, the power (C22) of the right channel signal, and the correlation (C12) between the left channel signal and the right channel signal; an intermediate value calculation unit (112) which calculates the intermediate value (C1122), which is the difference between the power (C11) and the power (C22); a codebook (113) which holds a plurality of sets of the coefficients ?1,n, ?2,n related to the transform coefficients of the principal component analysis transform and the code; and a quantizer (114) which calculates, as the cost function E, the sum of the first multiplication result obtained by multiplying the coefficient ?1,n by the correlation value C12 and the second multiplication result obtained by multiplying the coefficient ?1,n by the intermediate value C1122, selects the coefficients ?1,n, ?2,n where the cost function E becomes the maximum, and fetches the code related to the selected coefficients ?1,n, ?2,n as the quantized code.

7.

Granted patent
CELP speech decoder modifying an input vector with a fixed waveform to transform a waveform of the input vector (status: in force)

Publication No.: US08036887B2

Publication date: 2011-10-11

Application No.: US12781049

Filing date: 2010-05-17

Applicants: Kazutoshi Yasunaga, Toshiyuki Morii, Hiroyuki Ehara

Inventors: Kazutoshi Yasunaga, Toshiyuki Morii, Hiroyuki Ehara

IPC classification: G10L19/12

CPC classification: G10L19/135, G10L19/12, G10L2019/0007, G10L2019/0013

Abstract: A CELP speech decoder includes an adaptive codebook that generates an adaptive code vector and a random codebook that generates a random code vector. The random codebook includes an input vector provider that provides an input vector including at least one pulse, each pulse having a position and a polarity, a fixed waveform storage that stores at least one fixed waveform, and a selector that selects at least one of a first process and a second process based on a value of an adaptive codebook gain. The random codebook further includes a convolution section that generates the random code vector by convoluting the at least one fixed waveform with the input vector when the first process is selected. A synthesis filter outputs synthesized speech by performing linear prediction coefficient synthesis on a signal based on the adaptive code vector and the random code vector.
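
The convolution step of the first process can be sketched as follows. This Python example is illustrative only: it builds a pulse vector from (position, polarity) pairs and convolves it with one fixed waveform, truncating the result to the vector length. The truncation, the function names, and the sample values are assumptions. The selector driven by the adaptive codebook gain and the second process are not sketched, because the abstract does not describe them.

```python
from typing import List, Sequence, Tuple


def pulse_vector(length: int, pulses: Sequence[Tuple[int, int]]) -> List[float]:
    """Build an input vector from (position, polarity) pulses."""
    vector = [0.0] * length
    for position, polarity in pulses:
        if not 0 <= position < length:
            raise ValueError("pulse position outside the vector")
        if polarity not in (1, -1):
            raise ValueError("polarity must be +1 or -1")
        vector[position] += float(polarity)
    return vector


def convolve_truncated(vector: Sequence[float], waveform: Sequence[float]) -> List[float]:
    """Convolve the pulse vector with a fixed waveform, keeping len(vector) samples."""
    length = len(vector)
    output = [0.0] * length
    for i, amplitude in enumerate(vector):
        if amplitude == 0.0:
            continue  # only pulse positions contribute
        for j, sample in enumerate(waveform):
            if i + j >= length:
                break  # drop samples that fall past the end of the vector
            output[i + j] += amplitude * sample
    return output


if __name__ == "__main__":
    vec = pulse_vector(5, [(1, 1), (3, -1)])
    print(convolve_truncated(vec, [1.0, 0.5]))
```

Each pulse places a signed copy of the fixed waveform at its position, so a sparse pulse vector yields a code vector shaped by the stored waveform.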

8.

Granted patent
Excitation vector generator and a method for generating an excitation vector including a convolution system (status: in force)

Publication No.: US06947889B2

Publication date: 2005-09-20

Application No.: US09843939

Filing date: 2001-04-30

Applicants: Kazutoshi Yasunaga, Toshiyuki Morii, Hiroyuki Ehara

Inventors: Kazutoshi Yasunaga, Toshiyuki Morii, Hiroyuki Ehara

IPC classification: G10L19/00, G10L19/08, G10L19/12, G10L19/135, G10L19/14, G10L21/00

CPC classification: G10L19/135, G10L19/12, G10L2019/0007, G10L2019/0013

Abstract: A random code vector reading section and a random codebook of a conventional CELP type speech coder/decoder are respectively replaced with an oscillator for outputting different vector streams in accordance with values of input seeds, and a seed storage section for storing a plurality of seeds. This makes it unnecessary to store fixed vectors as they are in a fixed codebook (ROM), thereby considerably reducing the memory capacity.
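
The memory saving described here comes from storing a few seeds instead of whole vectors and regenerating each vector on demand. The Python sketch below illustrates that idea with a linear congruential generator as a stand-in oscillator. The abstract does not say what kind of oscillator the patent uses, so the generator, its constants, the seed values, and all names are assumptions.

```python
from typing import List

# Linear congruential generator constants (a common textbook choice),
# used here only as a stand-in for the patent's unspecified oscillator.
_LCG_A = 1664525
_LCG_C = 1013904223
_LCG_M = 2 ** 32

# Hypothetical seed storage: a handful of integers replaces a ROM of vectors.
SEED_STORAGE = [11, 42, 2024]


def oscillate(seed: int, length: int) -> List[float]:
    """Deterministically expand one seed into a vector of values in [-1, 1)."""
    state = seed % _LCG_M
    vector = []
    for _ in range(length):
        state = (_LCG_A * state + _LCG_C) % _LCG_M
        vector.append(state / (_LCG_M / 2) - 1.0)
    return vector


def excitation_candidates(length: int) -> List[List[float]]:
    """Regenerate one candidate vector per stored seed."""
    return [oscillate(seed, length) for seed in SEED_STORAGE]


if __name__ == "__main__":
    for candidate in excitation_candidates(4):
        print(candidate)
```

Because the same seed always yields the same vector, coder and decoder can agree on a codebook entry by exchanging only the seed index.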

9.

Patent application
Excitation vector generator, speech coder and speech decoder (status: in force)

Publication No.: US20050203736A1

Publication date: 2005-09-15

Application No.: US11126171

Filing date: 2005-05-11

Applicants: Kazutoshi Yasunaga, Toshiyuki Morii, Hiroyuki Ehara

Inventors: Kazutoshi Yasunaga, Toshiyuki Morii, Hiroyuki Ehara

IPC classification: G10L19/00, G10L19/08, G10L19/12, G10L19/135, G10L19/14, G10L21/00

CPC classification: G10L19/135, G10L19/12, G10L2019/0007, G10L2019/0013

Abstract: A noise canceller removes a noise component from an input speech signal. The noise canceller includes a noise cancellation coefficient adjuster that adjusts a noise cancellation coefficient to determine an amount of noise cancellation. A noise spectrum storage device stores an estimated noise spectrum. A noise estimator estimates a noise spectrum by comparing an input spectrum with a noise spectrum stored in the noise spectrum storage device. A noise canceling/spectrum compensator subtracts the noise spectrum stored in the noise spectrum storage device from the input spectrum based on a coefficient acquired by the noise cancellation coefficient adjuster.
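
The subtraction step in this abstract resembles conventional spectral subtraction. A minimal Python sketch of that step alone follows, assuming magnitude spectra held as lists of floats and a floor at zero for bins where the scaled noise exceeds the input. The floor, the function name, and the sample values are assumptions, and the noise estimator and coefficient adjuster are not sketched.

```python
from typing import List, Sequence


def subtract_noise(
    input_spectrum: Sequence[float],
    stored_noise: Sequence[float],
    cancellation_coefficient: float,
) -> List[float]:
    """Subtract the scaled stored noise spectrum from the input spectrum."""
    if len(input_spectrum) != len(stored_noise):
        raise ValueError("spectra must have the same number of bins")
    cleaned = []
    for observed, noise in zip(input_spectrum, stored_noise):
        residual = observed - cancellation_coefficient * noise
        # Assumed compensation: clamp negative results to zero.
        cleaned.append(max(residual, 0.0))
    return cleaned


if __name__ == "__main__":
    print(subtract_noise([4.0, 1.0], [1.0, 2.0], 1.5))
```

A larger cancellation coefficient removes more noise at the cost of more bins being clamped, which is the trade-off the coefficient adjuster in the abstract controls.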

10.

Granted patent
Excitation vector generator, speech coder and speech decoder (status: in force)

Publication No.: US06757650B2

Publication date: 2004-06-29

Application No.: US09855708

Filing date: 2001-05-16

Applicants: Kazutoshi Yasunaga, Toshiyuki Morii, Taisuke Watanabe, Hiroyuki Ehara

Inventors: Kazutoshi Yasunaga, Toshiyuki Morii, Taisuke Watanabe, Hiroyuki Ehara

IPC classification: G10L19/04

CPC classification: G10L19/135, G10L19/12, G10L2019/0007, G10L2019/0013

Abstract: A random code vector reading section and a random codebook of a conventional CELP type speech coder/decoder are respectively replaced with an oscillator for outputting different vector streams in accordance with values of input seeds, and a seed storage section for storing a plurality of seeds. This makes it unnecessary to store fixed vectors as they are in a fixed codebook (ROM), thereby considerably reducing the memory capacity.
