Patent search ap:"Pengjun Huang" Page 3

21.

Patent application
Low-complexity encoding/decoding of quantized MDCT spectrum in scalable speech and audio codecs (in force)

Publication number: US20090234644A1

Publication date: 2009-09-17

Application number: US12255604

Application date: 2008-10-21

Applicant: Yuriy Reznik, Pengjun Huang

Inventor: Yuriy Reznik, Pengjun Huang

IPC: G10L19/02, G10L19/00

CPC classification number: G10L19/24, G10L19/038

Abstract: A scalable speech and audio codec is provided that implements combinatorial spectrum encoding. A residual signal is obtained from a Code Excited Linear Prediction (CELP)-based encoding layer, where the residual signal is a difference between an original audio signal and a reconstructed version of the original audio signal. The residual signal is transformed at a Discrete Cosine Transform (DCT)-type transform layer to obtain a corresponding transform spectrum having a plurality of spectral lines. The transform spectrum spectral lines are transformed using a combinatorial position coding technique. The combinatorial position coding technique includes generating a lexicographical index for a selected subset of spectral lines, where each lexicographical index represents one of a plurality of possible binary strings representing the positions of the selected subset of spectral lines. The lexicographical index represents non-zero spectral lines in a binary string in fewer bits than the length of the binary string.
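
The lexicographical indexing described in this abstract can be illustrated with generic enumerative coding. The sketch below is not taken from the patent: the function names (`position_index`, `positions_from_index`, `index_bits`) are hypothetical, and it assumes the number of non-zero lines k is known to the decoder.

```python
from math import comb

def position_index(bits):
    """Lexicographic rank of `bits` among all strings of equal length and weight."""
    n, ones_left, rank = len(bits), sum(bits), 0
    for i, b in enumerate(bits):
        if b:
            # Skip every string that matches so far but has a 0 at position i.
            rank += comb(n - i - 1, ones_left)
            ones_left -= 1
    return rank

def positions_from_index(rank, n, k):
    """Inverse mapping: rebuild the length-n string with k ones from its rank."""
    bits, ones_left = [], k
    for i in range(n):
        zeros_here = comb(n - i - 1, ones_left)  # strings with a 0 at position i
        if ones_left > 0 and rank >= zeros_here:
            bits.append(1)
            rank -= zeros_here
            ones_left -= 1
        else:
            bits.append(0)
    return bits

def index_bits(n, k):
    """Bits needed to store any rank in the range 0 .. C(n, k) - 1."""
    return (comb(n, k) - 1).bit_length()
```

For example, under these assumptions a 16-line band with 3 non-zero lines has C(16, 3) = 560 possible position patterns, so `index_bits(16, 3)` gives 10 bits instead of the 16 bits of the raw binary string.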


22.

Patent application
Method and apparatus for predictively quantizing voiced speech (in force)

Publication number: US20080312917A1

Publication date: 2008-12-18

Application number: US12190524

Application date: 2008-08-12

Applicant: Arasanipalai K. Ananthapadmanabhan, Sharath Manjunath, Pengjun Huang, Eddie-Lun Tik Choy, Andrew P. DeJaco

Inventor: Arasanipalai K. Ananthapadmanabhan, Sharath Manjunath, Pengjun Huang, Eddie-Lun Tik Choy, Andrew P. DeJaco

IPC: G10L19/00

CPC classification number: G10L19/04, G10L19/0204, G10L19/032, G10L19/08, G10L19/097, G10L19/26, G10L25/12

Abstract: A method and apparatus for predictively quantizing voiced speech includes a parameter generator and a quantizer. The parameter generator is configured to extract parameters from frames of predictive speech such as voiced speech, and to transform the extracted information to a frequency-domain representation. The quantizer is configured to subtract a weighted sum of the parameters for previous frames from the parameter for the current frame. The quantizer is configured to quantize the difference value. A prototype extractor may be added to first extract a pitch period prototype to be processed by the parameter generator.
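
The quantizer step in this abstract (subtract a weighted sum of previous-frame parameters, then quantize the difference) can be sketched as follows. This is an illustrative sketch only: the uniform scalar quantizer, the use of previously reconstructed values as the prediction history, and the names `predictive_quantize`, `predictive_dequantize`, `weights`, and `step` are all assumptions, not details from the patent.

```python
def predictive_quantize(params, weights, step):
    """Quantize each frame's parameter as a difference from a weighted-sum prediction.

    Assumption: the prediction uses previously *reconstructed* values so that
    encoder and decoder stay in sync; history starts at zero.
    """
    history = [0.0] * len(weights)           # most recent frame first
    indices, reconstructed = [], []
    for x in params:
        prediction = sum(w * h for w, h in zip(weights, history))
        idx = round((x - prediction) / step)  # quantize the difference value
        xq = prediction + idx * step          # what the decoder will rebuild
        indices.append(idx)
        reconstructed.append(xq)
        history = [xq] + history[:-1]
    return indices, reconstructed

def predictive_dequantize(indices, weights, step):
    """Decoder side: rebuild the parameters from the difference indices alone."""
    history = [0.0] * len(weights)
    out = []
    for idx in indices:
        prediction = sum(w * h for w, h in zip(weights, history))
        xq = prediction + idx * step
        out.append(xq)
        history = [xq] + history[:-1]
    return out
```

When successive frames are similar, as is typical of voiced speech, the difference values stay small, so they can be coded with fewer levels than the raw parameters would need.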


23.

Patent application
Method and apparatus for high performance low bit-rate coding of unvoiced speech (in force)

Publication number: US20050143980A1

Publication date: 2005-06-30

Application number: US11066356

Application date: 2005-02-24

Applicant: Pengjun Huang

Inventor: Pengjun Huang

IPC: G10L19/04, G10L19/18, H03M7/30, G10L11/04

CPC classification number: G10L19/12, G10L19/083, G10L19/18, G10L25/93

Abstract: A low-bit-rate coding technique for unvoiced segments of speech, without loss of quality compared to the conventional Code Excited Linear Prediction (CELP) method operating at a much higher bit rate. A set of gains are derived from a residual signal after whitening the speech signal by a linear prediction filter. These gains are then quantized and applied to a randomly generated sparse excitation. The excitation is filtered, and its spectral characteristics are analyzed and compared to the spectral characteristics of the original residual signal. Based on this analysis, a filter is chosen to shape the spectral characteristics of the excitation to achieve optimal performance.

24.

Granted patent
System and method of an in-band modem for data communications over digital wireless communication networks (in force)

Publication number: US08958441B2

Publication date: 2015-02-17

Application number: US12477608

Application date: 2009-06-03

Applicant: Marc W. Werner, Christian Pietsch, Christian Sgraja, Wolfgang Granzow, Nikolai K N Leung, Christoph A. Joetten, Pengjun Huang

Inventor: Marc W. Werner, Christian Pietsch, Christian Sgraja, Wolfgang Granzow, Nikolai K N Leung, Christoph A. Joetten, Pengjun Huang

IPC: H04W72/00, H04L1/00, H04L1/18, H04L1/16

CPC classification number: H04L1/0003, H04L1/1607, H04L1/1858, H04L1/189

Abstract: A system is provided for transmitting information through a speech codec (in-band) such as found in a wireless communication network. A modulator transforms the data into a spectrally noise-like signal based on the mapping of a shaped pulse to predetermined positions within a modulation frame, and the signal is efficiently encoded by a speech codec. A synchronization sequence provides modulation frame timing at the receiver and is detected based on analysis of a correlation peak pattern. A request/response protocol provides reliable transfer of data using message redundancy, retransmission, and/or robust modulation modes dependent on the communication channel conditions.

25.

Granted patent
System and method of an in-band modem for data communications over digital wireless communication networks (in force)

Publication number: US08725502B2

Publication date: 2014-05-13

Application number: US12477561

Application date: 2009-06-03

Applicant: Christian Pietsch, Georg Frank, Christian Sgraja, Pengjun Huang, Christoph A. Joetten, Marc W. Werner, Wolfgang Granzow

Inventor: Christian Pietsch, Georg Frank, Christian Sgraja, Pengjun Huang, Christoph A. Joetten, Marc W. Werner, Wolfgang Granzow

IPC: G10L19/008

CPC classification number: G10L19/008, G10L19/00, G10L19/018, H04J3/0611, H04L25/4902

Abstract: A system is provided for transmitting information through a speech codec (in-band) such as found in a wireless communication network. A modulator transforms the data into a spectrally noise-like signal based on the mapping of a shaped pulse to predetermined positions within a modulation frame, and the signal is efficiently encoded by a speech codec. A synchronization sequence provides modulation frame timing at the receiver and is detected based on analysis of a correlation peak pattern. A request/response protocol provides reliable transfer of data using message redundancy, retransmission, and/or robust modulation modes dependent on the communication channel conditions.


26.

Granted patent
Method and apparatus for predictively quantizing voiced speech (in force)

Publication number: US08660840B2

Publication date: 2014-02-25

Application number: US12190524

Application date: 2008-08-12

Applicant: Arasanipalai K. Ananthapadmanabhan, Sharath Manjunath, Pengjun Huang, Eddie-Lun Tik Choy, Andrew P. DeJaco

Inventor: Arasanipalai K. Ananthapadmanabhan, Sharath Manjunath, Pengjun Huang, Eddie-Lun Tik Choy, Andrew P. DeJaco

IPC: G06F15/00, G10L21/00, G10L21/02, G10L19/00

CPC classification number: G10L19/04, G10L19/0204, G10L19/032, G10L19/08, G10L19/097, G10L19/26, G10L25/12

Abstract: A method and apparatus for predictively quantizing voiced speech includes a parameter generator and a quantizer. The parameter generator is configured to extract parameters from frames of predictive speech such as voiced speech, and to transform the extracted information to a frequency-domain representation. The quantizer is configured to subtract a weighted sum of the parameters for previous frames from the parameter for the current frame. The quantizer is configured to quantize the difference value. A prototype extractor may be added to first extract a pitch period prototype to be processed by the parameter generator.


27.

Granted patent
Method and apparatus for improved detection of rate errors in variable rate receivers (in force)

Publication number: US08243695B2

Publication date: 2012-08-14

Application number: US12537906

Application date: 2009-08-07

Applicant: Khaled H. El-Maleh, Eddie-Lun Tik Choy, Arasanipalai K. Ananthapadmanabhan, Andrew P. DeJaco, Pengjun Huang

Inventor: Khaled H. El-Maleh, Eddie-Lun Tik Choy, Arasanipalai K. Ananthapadmanabhan, Andrew P. DeJaco, Pengjun Huang

IPC: H04B7/216, G10L19/12, H04L27/06

CPC classification number: H04L1/08, H04L1/0046, H04L1/201

Abstract: A system and method for detection of rate determination algorithm errors in variable rate communications system receivers. The disclosed embodiments prevent rate determination algorithm errors from causing audible artifacts such as screeches or beeps. The disclosed system and method detects frames with incorrectly determined data rates and performs frame erasure processing and/or memory state clean up to prevent propagation of distortion across multiple frames. Frames with incorrectly determined data rates are detected by checking illegal rate transitions, reserved bits, validating unused filter type bit combinations and analyzing relationships between fixed code-book gains and linear prediction coefficient gains.


28.

Patent application
Method and apparatus for vector quantization codebook search (under examination, published)

Publication number: US20100174539A1

Publication date: 2010-07-08

Application number: US12349327

Application date: 2009-01-06

Applicant: Rama Muralidhara Reddy Nandhimandalam, Pengjun Huang

Inventor: Rama Muralidhara Reddy Nandhimandalam, Pengjun Huang

IPC: G10L19/12

CPC classification number: G10L19/038

Abstract: A vector quantization codebook search method and apparatus use support vector machines (“SVMs”) to compute a hyperplane, where the hyperplane is used to separate codebook elements into a plurality of bins. During execution, a controller determines which of the plurality of bins contains a desired codebook element, and then searches the determined bin. Codebook search complexity is reduced and an exhaustive codebook search is selectively avoided.
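
The binned search described in this abstract can be sketched with a single hyperplane. The abstract says the hyperplane is computed with an SVM; the sketch below skips that training step and simply assumes a hyperplane `(w, b)` is already given. All names (`side`, `build_bins`, `binned_search`) are hypothetical, and searching only one bin is not guaranteed to return the true nearest codevector.

```python
def side(v, w, b):
    """True if vector v lies on the non-negative side of the hyperplane w.v + b = 0."""
    return sum(wi * vi for wi, vi in zip(w, v)) + b >= 0

def build_bins(codebook, w, b):
    """Split codebook indices into two bins according to the hyperplane (done offline)."""
    bins = {True: [], False: []}
    for i, c in enumerate(codebook):
        bins[side(c, w, b)].append(i)
    return bins

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def binned_search(x, codebook, bins, w, b):
    """Search only the bin on x's side; fall back to the full codebook if it is empty."""
    candidates = bins[side(x, w, b)] or range(len(codebook))
    return min(candidates, key=lambda i: sq_dist(x, codebook[i]))
```

With two roughly equal bins, each search examines about half of the codebook, which is the complexity reduction the abstract refers to.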


29.

Granted patent
Method and apparatus for improved detection of rate errors in variable rate receivers (in force)

Publication number: US07590096B2

Publication date: 2009-09-15

Application number: US10938445

Application date: 2004-09-09

Applicant: Khaled H. El-Maleh, Eddie-Lun Tik Choy, Arasanipalai K. Ananthapadmanabhan, Andrew P. DeJaco, Pengjun Huang

Inventor: Khaled H. El-Maleh, Eddie-Lun Tik Choy, Arasanipalai K. Ananthapadmanabhan, Andrew P. DeJaco, Pengjun Huang

IPC: H04B7/216, G10L19/12, H04L27/06

CPC classification number: H04L1/08, H04L1/0046, H04L1/201

Abstract: A system and method for detection of rate determination algorithm errors in variable rate communications system receivers. The disclosed embodiments prevent rate determination algorithm errors from causing audible artifacts such as screeches or beeps. The disclosed system and method detects frames with incorrectly determined data rates and performs frame erasure processing and/or memory state clean up to prevent propagation of distortion across multiple frames. Frames with incorrectly determined data rates are detected by checking illegal rate transitions, reserved bits, validating unused filter type bit combinations and analyzing relationships between fixed code-book gains and linear prediction coefficient gains.

30.

Granted patent
Method and apparatus for robust speech classification (in force)

Publication number: US07472059B2

Publication date: 2008-12-30

Application number: US09733740

Application date: 2000-12-08

Applicant: Pengjun Huang

Inventor: Pengjun Huang

IPC: G10L19/00, G10L11/06

CPC classification number: G10L25/93, G10L19/025, G10L19/22, G10L25/78

Abstract: A speech classification technique for robust classification of varying modes of speech to enable maximum performance of multi-mode variable bit rate encoding techniques. A speech classifier accurately classifies a high percentage of speech segments for encoding at minimal bit rates, meeting lower bit rate requirements. Highly accurate speech classification produces a lower average encoded bit rate, and higher quality decoded speech. The speech classifier considers a maximum number of parameters for each frame of speech, producing numerous and accurate speech mode classifications for each frame. The speech classifier correctly classifies numerous modes of speech under varying environmental conditions. The speech classifier inputs classification parameters from external components, generates internal classification parameters from the input parameters, sets a Normalized Auto-correlation Coefficient Function threshold and selects a parameter analyzer according to the signal environment, and then analyzes the parameters to produce a speech mode classification.

