Invention Application
- Patent Title: METHOD FOR SEARCH IN AN AUDIO DATABASE
- Patent Title (中): 在音频数据库中搜索的方法
-
Application No.: PCT/EP0108709Application Date: 2001-07-26
-
Publication No.: WO0211123A3Publication Date: 2002-05-30
- Inventor: WANG AVERY LI-CHUN , SMITH JULIUS O III
- Applicant: SHAZAM ENTERTAINMENT LTD , WANG AVERY LI CHUN , SMITH JULIUS O III
- Assignee: SHAZAM ENTERTAINMENT LTD,WANG AVERY LI CHUN,SMITH JULIUS O III
- Current Assignee: SHAZAM ENTERTAINMENT LTD,WANG AVERY LI CHUN,SMITH JULIUS O III
- Priority: US22202300 2000-07-31; US83947601 2001-04-20
- Main IPC: G10L15/10
- IPC: G10L15/10 ; G06K9/00 ; G10L11/00 ; G10L15/00 ; G10L15/26 ; G06F17/30 ; G10H1/00 ; G10L15/02 ; G10L15/20
Abstract:
A method for recognizing an audio sample locates an audio file that most closely matches the audio sample from a database indexing a large set of original recordings. Each indexed audio file is represented in the database index by a set of landmark timepoints and associated fingerprints. Landmarks occur at reproductible locations within the file, while fingerprints represent features of the signal at or near the landmark timepoints. To perform recognition, landmarks and fingerprints are computed for the unknown sample and used to retrieve matching fingerprints from the database. For each file containing matching fingerprints, the landmarks are compared with landmarks of the sample at which the same fingerprints were computed. If a large number of corresponding landmarks are linearly related, i.e., if equivalent fingerprints of the sample and retrieved file have the same time evolution, then the file is identified with the sample. The method can be used for any type of sound or music, and is particularly effective for audio signals subject to linear and nonlinear distortion such as background noise, compression artifacts, or transmission dropouts. The sample can be identified in a time proportional to the logarithm of the number of entries in the database; given sufficient computational power, recognition can be performed in nearly real time as the sound is being sampled.
Information query