Method and system for segmenting phonemes from voice signals
    4.
    Granted patent (in force)

    Publication No.: US08849662B2

    Publication date: 2014-09-30

    Application No.: US11646911

    Filing date: 2006-12-28

    Applicant: Hyun-Soo Kim

    Inventor: Hyun-Soo Kim

    CPC classification: G10L15/04 G10L2015/025

    Abstract: A method and a system for segmenting phonemes from voice signals are disclosed. To segment phonemes accurately, a histogram showing the peak distribution corresponding to an order is formed by using a high-order concept, and the boundary marking the starting point and ending point of each phoneme is determined by calculating a peak statistic based on the histogram. The method can markedly reduce the amount of calculation and can be applied to sound signal systems that perform sound coding, sound recognition, sound synthesis, sound reinforcement, etc.
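
    The abstract does not define the order of a peak or the peak statistic, so the following is only a minimal sketch under stated assumptions: order-1 peaks are taken to be local maxima of the signal, order-n peaks are local maxima of the sequence of order-(n-1) peak heights, the histogram counts peaks per fixed-length frame, and a boundary is flagged where that count jumps. The names `peaks_of_order`, `peak_histogram`, `boundaries`, `frame_len`, and `threshold` are hypothetical and are not taken from the patent.

```python
from collections import Counter


def local_maxima(values):
    """Return indices i where values[i] is strictly greater than both neighbours."""
    return [
        i
        for i in range(1, len(values) - 1)
        if values[i - 1] < values[i] > values[i + 1]
    ]


def peaks_of_order(signal, order):
    """Return sample indices of the peaks of the given order (order >= 1).

    Assumption: order-1 peaks are local maxima of the signal, and order-n
    peaks are local maxima of the sequence of order-(n-1) peak heights.
    """
    indices = list(range(len(signal)))
    for _ in range(order):
        heights = [signal[i] for i in indices]
        indices = [indices[j] for j in local_maxima(heights)]
    return indices


def peak_histogram(signal, order, frame_len):
    """Count the peaks of the given order that fall in each consecutive frame."""
    counts = Counter(i // frame_len for i in peaks_of_order(signal, order))
    n_frames = (len(signal) + frame_len - 1) // frame_len
    return [counts.get(f, 0) for f in range(n_frames)]


def boundaries(hist, threshold):
    """Flag frame indices where the peak count changes by at least `threshold`.

    Assumption: this jump in count stands in for the patent's peak statistic.
    """
    return [
        f for f in range(1, len(hist)) if abs(hist[f] - hist[f - 1]) >= threshold
    ]
```

    Counting peaks per frame needs only comparisons and integer increments, which is in line with the abstract's claim of a reduced amount of calculation.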

    Sound source separation method and system using beamforming technique
    5.
    Granted patent (in force)

    Publication No.: US08577677B2

    Publication date: 2013-11-05

    Application No.: US12460473

    Filing date: 2009-07-20

    IPC classification: G10L21/02

    Abstract: A system and method for sound source separation using a beamforming technique. The sound source separation system includes a windowing processor, a DFT transformer, a transfer function estimator, and a noise estimator. It also includes a voice signal extractor that cancels, from the integrated voice signals, every individual voice signal except the one to be extracted. It further includes a voice signal detector that cancels the noise part provided by the noise estimator from the transfer function of the individual voice signal to be detected and extracts a noise-canceled individual voice signal. Even when two or more sound sources are input simultaneously, they can be separated from each other and stored and managed separately, or an initial sound source can be stored and managed.

    Method and apparatus for estimating noise by using harmonics of voice signal
    7.
    Granted patent (in force)

    Publication No.: US08135586B2

    Publication date: 2012-03-13

    Application No.: US12053144

    Filing date: 2008-03-21

    IPC classification: G10L19/10

    CPC classification: G10L21/0208

    Abstract: Disclosed are a method and an apparatus for estimating the noise included in a sound signal during sound signal processing. The method includes estimating harmonic components in a frame of an input sound signal; computing a Voice Presence Probability (VPP) for the frame from the estimated harmonic components; determining, depending on the computed VPP, a weight for the equation used to estimate a noise spectrum; and estimating and updating the noise spectrum using the determined weight and that equation.
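
    The abstract does not give the estimation equation or the weighting rule. As a hedged illustration only, the sketch below assumes a common recursive-averaging form, N_new[k] = a * N_prev[k] + (1 - a) * |Y[k]|^2, with the weight a rising toward 1 as the VPP rises so that frames likely to contain voice barely change the noise estimate. The function name `update_noise_spectrum`, the parameter `alpha_min`, and the linear mapping from VPP to weight are assumptions, and the VPP itself is taken as already computed.

```python
def update_noise_spectrum(noise_prev, power_spectrum, vpp, alpha_min=0.85):
    """Return the updated noise power spectrum for one frame.

    Assumed rule (not taken from the patent):
        a = alpha_min + (1 - alpha_min) * vpp
        N_new[k] = a * N_prev[k] + (1 - a) * power_spectrum[k]
    so vpp = 1 freezes the estimate and vpp = 0 tracks the input fastest.
    """
    if not 0.0 <= vpp <= 1.0:
        raise ValueError("vpp must lie in [0, 1]")
    if len(noise_prev) != len(power_spectrum):
        raise ValueError("spectra must have the same number of bins")
    a = alpha_min + (1.0 - alpha_min) * vpp
    return [a * n + (1.0 - a) * y for n, y in zip(noise_prev, power_spectrum)]
```

    Calling the function once per frame with that frame's power spectrum and VPP yields a running noise estimate.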

    Face recognition system and method based on adaptive learning
    8.
    Granted patent (in force)

    Publication No.: US08135220B2

    Publication date: 2012-03-13

    Application No.: US12115250

    Filing date: 2008-05-05

    IPC classification: G06K9/62 G06K9/00

    CPC classification: G06K9/00295

    Abstract: A face recognition system based on adaptive learning includes a specific person detection and tracking unit for detecting and tracking a specific person in a moving image. A facial feature extraction unit extracts a plurality of facial feature vectors from the detected and tracked specific person. A face recognition unit searches for a given registration model by comparing the extracted facial feature vectors with the facial feature vectors of the registration models previously stored in a user registration model database. A learning target selection unit selects, from among the extracted facial feature vectors, a facial feature vector to be added to the record of the given registration model. A registration model learning unit adds the selected facial feature vector to the record of the given registration model, thereby updating it.
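
    The recognize, select, and learn loop described above can be sketched as follows. This is an illustration under assumptions, not the patented method: the abstract does not state the similarity measure or the selection criterion, so the sketch uses cosine similarity and keeps only vectors that match the model well without nearly duplicating a stored vector. All names and thresholds (`best_model`, `select_learning_targets`, `recognize_and_learn`, `low`, `high`, `accept`) are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def best_model(db, features):
    """Return (name, score) of the registration model closest to the features.

    The score is the mean, over extracted vectors, of the best similarity
    to any vector stored in the model's record.
    """
    def score(record):
        return sum(max(cosine(f, r) for r in record) for f in features) / len(features)

    name = max(db, key=lambda n: score(db[n]))
    return name, score(db[name])


def select_learning_targets(record, features, low=0.8, high=0.98):
    """Keep vectors that match the model (>= low) but add new information (< high)."""
    return [f for f in features if low <= max(cosine(f, r) for r in record) < high]


def recognize_and_learn(db, features, accept=0.8):
    """Identify the person and append the selected vectors to that model's record."""
    if not features:
        return None
    name, score = best_model(db, features)
    if score < accept:
        return None  # no registration model is close enough
    db[name].extend(select_learning_targets(db[name], features))
    return name
```

    Here `db` maps each registered user to a non-empty list of stored feature vectors, so the record grows as new views of a recognized face arrive.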

    Method and system for aligning windows to extract peak feature from a voice signal
    9.
    Granted patent (in force)

    Publication No.: US08103512B2

    Publication date: 2012-01-24

    Application No.: US11656873

    Filing date: 2007-01-23

    Applicant: Hyun-Soo Kim

    Inventor: Hyun-Soo Kim

    IPC classification: G10L19/00

    CPC classification: G10L25/90

    Abstract: Disclosed is a method capable of adaptively aligning windows to extract features according to the types and characteristics of voice signals. To this end, window lengths are determined from the window update points of a corresponding order by employing the concept of a higher-order peak, and the windows are aligned according to those lengths. When the windows are aligned in this manner, the start and end points of each window are known, so peak feature information can be extracted and analyzed easily.
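
    As a rough sketch of the alignment step, the code below assumes that the window update points are already available as sample indices (for example, the positions of peaks of the chosen order) and that each window spans two consecutive update points. The abstract does not spell out either detail, and `align_windows` and `window_peak_features` are hypothetical names.

```python
def align_windows(update_points, signal_len):
    """Return (start, end) pairs, end exclusive, one per pair of adjacent update points.

    Assumption: each window runs from one update point to the next, so the
    window length adapts to the spacing of the update points.
    """
    points = sorted(p for p in set(update_points) if 0 <= p < signal_len)
    return list(zip(points, points[1:]))


def window_peak_features(signal, windows):
    """Report each window's length and the position and height of its largest sample."""
    features = []
    for start, end in windows:
        segment = signal[start:end]
        offset = max(range(len(segment)), key=segment.__getitem__)
        features.append(
            {
                "start": start,
                "length": end - start,
                "peak_index": start + offset,
                "peak_value": segment[offset],
            }
        )
    return features
```

    Because every window carries explicit start and end indices, per-window features can be read directly from slices of the signal.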

    Sound processing apparatus and method
    10.
    Granted patent (in force)

    Publication No.: US08073148B2

    Publication date: 2011-12-06

    Application No.: US11479472

    Filing date: 2006-06-30

    Applicant: Hyun-Soo Kim

    Inventor: Hyun-Soo Kim

    IPC classification: A61F11/06

    CPC classification: G10L21/0208

    Abstract: Disclosed are an apparatus and a method for processing signals such as sound signals. The sound processing apparatus includes a sound signal input unit for receiving sound signals, a harmonic noise separator for separating a harmonic region and a noise region from the received sound signals, a noise restraint index determination unit for determining an optimal noise restraint index k according to the system and circumstances, and a noise restrainer for restraining the separated noise region depending on the noise restraint index k so as to output noise-attenuated signals.
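
    How the index k acts on the noise region is not stated in the abstract. The sketch below assumes the simplest reading: the harmonic and noise parts are already separated, k lies in [0, 1], and the noise part is scaled by (1 - k) before the two parts are recombined. The function name `restrain_noise` and this gain rule are assumptions made for illustration.

```python
def restrain_noise(harmonic, noise, k):
    """Recombine the harmonic part with the noise part attenuated by (1 - k).

    Assumption (not from the patent): k = 0 leaves the signal unchanged and
    k = 1 removes the separated noise region entirely.
    """
    if not 0.0 <= k <= 1.0:
        raise ValueError("k must lie in [0, 1]")
    if len(harmonic) != len(noise):
        raise ValueError("harmonic and noise parts must have the same length")
    gain = 1.0 - k
    return [h + gain * n for h, n in zip(harmonic, noise)]
```

    Under this reading, choosing k for the system and circumstances amounts to trading residual noise against possible distortion from an imperfect separation.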
