-
公开(公告)号:WO2001016937A1
公开(公告)日:2001-03-08
申请号:PCT/US2000/023754
申请日:2000-08-29
Applicant: WAVEMAKERS RESEARCH, INC. , ZAKARAUSKAS, Pierre
Inventor: ZAKARAUSKAS, Pierre
IPC: G10L17/00
Abstract: A system and method to identify a sound source among a group of sound sources. The invention matches the acoustic input to a number of signal models, one per source class, and produces a goodness-of-match number for each signal model. The sound source is declared to be of the same class as that of the signal model with the best goodness-of-match if that score is sufficiently high. The data are recorded with a microphone, digitized and transformed into the frequency domain. A signal detector is applied to the transient. A harmonic detection method can be used to determine if the sound source has harmonic characteristics. If at least some part of a transient contains signal of interest, the spectrum of the signal after rescaling is compared to a set of signal models, and the input signal's parameters are fitted to the data. The average distortion is calculated to compare patterns with those of sources that used in training the signal models. Before classification can occur, a source model is trained with signal data. Each signal model is built by creating templates from input signal spectrograms when they are significantly different from existing templates. If an existing template is found that resembles the input pattern, the template is averaged with the pattern in such a way that the resulting template is the average of all the spectra that matched that template in the past.
Abstract translation: 一组识别声源的声源的系统和方法。 本发明将声输入匹配到多个信号模型,每个源类一个信号模型,并且为每个信号模型产生一个良好的匹配次数。 如果该分数足够高,则声源被声明为与具有最佳匹配度的信号模型相同的类。 用麦克风记录数据,数字化并转换到频域。 信号检测器应用于瞬态。 可以使用谐波检测方法来确定声源是否具有谐波特性。 如果瞬态的至少部分包含感兴趣的信号,则将重新缩放之后的信号的频谱与一组信号模型进行比较,并将输入信号的参数拟合到数据中。 计算平均失真以将模式与用于训练信号模型的源的模式进行比较。 在分类之前,可以用信号数据对源模型进行训练。 每个信号模型是通过从输入信号谱图创建模板构建的,当它们与现有模板显着不同时。 如果找到类似于输入模式的现有模板,则模板将以该模式进行平均,使得所得到的模板是与过去匹配该模板的所有光谱的平均值。