METHOD FOR SEARCH IN AN AUDIO DATABASE

Invention Application

WO0211123A3 METHOD FOR SEARCH IN AN AUDIO DATABASE 审中-公开

Title translation: 在音频数据库中搜索的方法

Please log in to see more content

Patent Title: METHOD FOR SEARCH IN AN AUDIO DATABASE
Patent Title (中): 在音频数据库中搜索的方法
Application No.: PCT/EP0108709

Application Date: 2001-07-26
Publication No.: WO0211123A3

Publication Date: 2002-05-30
Inventor: WANG AVERY LI-CHUN , SMITH JULIUS O III
Applicant: SHAZAM ENTERTAINMENT LTD , WANG AVERY LI CHUN , SMITH JULIUS O III
Assignee: SHAZAM ENTERTAINMENT LTD,WANG AVERY LI CHUN,SMITH JULIUS O III
Current Assignee: SHAZAM ENTERTAINMENT LTD,WANG AVERY LI CHUN,SMITH JULIUS O III
Priority: US22202300 2000-07-31; US83947601 2001-04-20
Main IPC: G10L15/10
IPC: G10L15/10 ; G06K9/00 ; G10L11/00 ; G10L15/00 ; G10L15/26 ; G06F17/30 ; G10H1/00 ; G10L15/02 ; G10L15/20

Abstract:

A method for recognizing an audio sample locates an audio file that most closely matches the audio sample from a database indexing a large set of original recordings. Each indexed audio file is represented in the database index by a set of landmark timepoints and associated fingerprints. Landmarks occur at reproductible locations within the file, while fingerprints represent features of the signal at or near the landmark timepoints. To perform recognition, landmarks and fingerprints are computed for the unknown sample and used to retrieve matching fingerprints from the database. For each file containing matching fingerprints, the landmarks are compared with landmarks of the sample at which the same fingerprints were computed. If a large number of corresponding landmarks are linearly related, i.e., if equivalent fingerprints of the sample and retrieved file have the same time evolution, then the file is identified with the sample. The method can be used for any type of sound or music, and is particularly effective for audio signals subject to linear and nonlinear distortion such as background noise, compression artifacts, or transmission dropouts. The sample can be identified in a time proportional to the logarithm of the number of entries in the database; given sufficient computational power, recognition can be performed in nearly real time as the sound is being sampled.

Abstract(Chinese):

用于识别音频样本的方法定位与索引大量原始记录的数据库最接近匹配的音频文件的音频文件。每个索引的音频文件通过一组里程碑时间点和相关指纹在数据库索引中表示。地标出现在文件内的可再生位置，而指纹表示在地标时间点处或附近的信号的特征。为了执行识别，计算未知样本的地标和指纹，并用于从数据库中检索匹配的指纹。对于包含匹配指纹的每个文件，将地标与样本的与所计算的相同指纹的地标进行比较。如果大量对应的地标是线性相关的，即如果样本和检索文件的等效指纹具有相同的时间演化，则用该样本识别该文件。该方法可以用于任何类型的声音或音乐，并且对于经历线性和非线性失真（例如背景噪声，压缩伪像或传输丢失）的音频信号特别有效。可以在与数据库中条目数的对数成比例的时间内识别样本; 给予足够的计算能力，随着声音被采样，可以几乎实时地执行识别。

Information query

Global Dossier Patent Scope Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/08	.语音分类或检索
G10L15/10	..利用未知语音与基准模板之间的距离测度或失真测度