System and method of lattice-based search for spoken utterance retrieval
    12.
    Invention publication
    Status: Under examination (published)

    Publication No.: EP1630705A2

    Publication date: 2006-03-01

    Application No.: EP05270042.4

    Filing date: 2005-08-22

    Applicant: AT&T Corp.

    IPC classification: G06F17/30

    Abstract: A system and method are disclosed for retrieving audio segments from a spoken document. The spoken document is preferably one with a moderate word error rate, such as a telephone call or teleconference. The method comprises converting speech associated with a spoken document into a lattice representation and indexing the lattice representation of speech. These steps are typically performed off-line. Upon receiving a query from a user, the method further comprises searching the indexed lattice representation of speech and returning retrieved audio segments from the spoken document that match the user query.
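The off-line indexing and on-line search steps in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: the toy lattice, the arc layout `(word, start, end, posterior)`, the posterior threshold, and the names `build_index` and `search` are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical toy lattice: each arc is (word, start_sec, end_sec, posterior).
# Competing hypotheses ("reset" vs. "recent") share the same time span.
LATTICES = {
    "call_01": [
        ("please", 0.0, 0.4, 0.9),
        ("reset", 0.4, 0.9, 0.6),
        ("recent", 0.4, 0.9, 0.4),
        ("password", 0.9, 1.6, 0.8),
    ],
}

def build_index(lattices):
    """Off-line step: map each word to every lattice arc where it may occur."""
    index = defaultdict(list)
    for doc_id, arcs in lattices.items():
        for word, start, end, posterior in arcs:
            index[word].append((doc_id, start, end, posterior))
    return index

def search(index, query, threshold=0.5):
    """On-line step: return matching segments, most probable first.

    The threshold is an assumed tuning knob, not taken from the abstract.
    """
    hits = [h for h in index.get(query.lower(), []) if h[3] >= threshold]
    return sorted(hits, key=lambda h: h[3], reverse=True)

INDEX = build_index(LATTICES)
```

Calling `search(INDEX, "reset")` returns the segment of `call_01` between 0.4 s and 0.9 s. Lowering the threshold also surfaces the less likely hypothesis `"recent"`, which is the point of indexing a lattice rather than only the single best transcript.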


    QUERY BY INDEFINITE EXPRESSIONS
    13.
    Invention publication
    Status: Under examination (published)

    Publication No.: EP1579343A1

    Publication date: 2005-09-28

    Application No.: EP03777128.4

    Filing date: 2003-11-27

    IPC classification: G06F17/30

    Abstract: A method and apparatus for retrieving data from a database is disclosed. A plurality of entities are stored in a first memory and information about each stored entity is stored in a second memory. Criteria in the form of at least one indefinite expression are received from a user for selecting entities from the stored entities. The received criteria are translated into terms used in the stored information. A sequence of entities is then selected based on the translated criteria.

    MUSIC DISCOVERY
    14.
    Invention publication
    Status: Under examination (published)

    Publication No.: EP3111348A4

    Publication date: 2017-11-29

    Application No.: EP15811605

    Filing date: 2015-05-28

    Applicant: SONOS INC

    Inventor: BATES PAUL

    IPC classification: G06F17/30

    Abstract: Examples described herein relate to music discovery. In one aspect, a method is provided that involves (a) receiving by a computing device an indication of a search tool from among a plurality of search tools, where each search tool of the plurality of search tools is associated with at least one respective media service, (b) receiving by the computing device an indication of a media characteristic, where the computing device receives the media characteristic via the indicated search tool, (c) selecting by the computing device one or more of the at least one respective media service that maintains media associated with the indicated media characteristic, and (d) sending by the computing device an indication of the selected one or more of the at least one respective media service.

    CADENCE AND MEDIA CONTENT PHASE ALIGNMENT
    15.
    Invention publication
    Status: Under examination (published)

    Publication No.: EP3215962A1

    Publication date: 2017-09-13

    Application No.: EP16727139.4

    Filing date: 2016-05-17

    Applicant: Spotify AB

    Inventor: JEHAN, Tristan

    IPC classification: G06F17/30

    Abstract: Systems, devices, apparatuses, components, methods, and techniques for cadence and media content phase alignment are provided. An example media-playback device includes a content output device that operates to output media content, a cadence-acquiring device, a phase-delay calibration engine, a cadence-based media content selection engine, and a phase-aligned media playback engine. The cadence-acquiring device includes a movement-determining device and a cadence-determination engine configured to determine a cadence based on movement data captured by the movement-determining device. The phase-delay calibration engine is configured to determine phase delay values for at least one cadence value. The cadence-based media content selection engine is configured to identify a media content item based on the cadence determined by the cadence-acquiring device. The phase-aligned media playback engine is configured to align the identified media content item to the repetitive-motion activity and cause the media-output device to output the aligned media content item.


    Sound data processing device and method
    16.
    Invention publication
    Status: Granted

    Publication No.: EP2602786A3

    Publication date: 2016-08-17

    Application No.: EP12196037.1

    Filing date: 2012-12-07

    Inventor: Watanabe, Daichi

    Abstract: For each of a plurality of performance parts, a database (221) stores therein a plurality of part performance data. The part performance data for each of the parts includes a sound generation pattern and tone data corresponding to the sound generation pattern. A query pattern indicative of a sound generation pattern to be made an object of search is input by a user. A search is made through the database for part performance data including a sound generation pattern matching the query pattern. In response to a user's operation, one part performance data is identified from among searched-out results from the database, and the sound generation pattern of the identified part performance data is instructed as a new query pattern (Sa8b). Then, a further search is made through the database for part performance data including a sound generation pattern matching the new query pattern. In accordance with a user's operation, one part performance data is identified from among searched-out results, and the identified part performance data is edited. The thus-edited data can be registered into the database as new part performance data.
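The two-stage search in this abstract (a query pattern, then a re-query using the pattern of a selected result) can be sketched in Python as follows. This is a sketch under stated assumptions, not the patented implementation: representing a sound generation pattern as onset positions within one measure (0.0 to 1.0), the mean-absolute-difference distance, the `max_distance` cutoff, and all names and data are hypothetical.

```python
def pattern_distance(a, b):
    """Mean absolute onset difference; incomparable patterns get infinity."""
    if len(a) != len(b) or not a:
        return float("inf")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)

# Hypothetical database: each entry pairs a name with a sound generation
# pattern, given as onset positions within one measure.
DATABASE = [
    {"name": "bass_A", "pattern": [0.0, 0.25, 0.5, 0.75]},
    {"name": "bass_B", "pattern": [0.0, 0.25, 0.5, 0.875]},
    {"name": "bass_C", "pattern": [0.0, 0.375, 0.5, 0.875]},
]

def search(database, query, max_distance=0.05):
    """Return names of entries whose pattern is close to the query, best first."""
    scored = [(pattern_distance(query, e["pattern"]), e["name"]) for e in database]
    return [name for dist, name in sorted(scored) if dist <= max_distance]

def pattern_of(database, name):
    """Look up the stored pattern of a selected search result."""
    return next(e["pattern"] for e in database if e["name"] == name)

# First search with the user's query pattern.
first = search(DATABASE, [0.0, 0.25, 0.5, 0.75])
# The user picks "bass_B"; its pattern becomes the new query pattern.
second = search(DATABASE, pattern_of(DATABASE, "bass_B"))
```

In this toy data the first search finds `bass_A` and `bass_B`, and re-querying with the pattern of `bass_B` additionally reaches `bass_C`, which was too far from the original query. That shows how the re-query step can widen the set of reachable results.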


    Methods and systems for music information management
    17.
    Invention publication
    Status: Under examination (published)

    Publication No.: EP2840516A1

    Publication date: 2015-02-25

    Application No.: EP14180676.0

    Filing date: 2014-08-12

    Applicant: HTC Corporation

    IPC classification: G06F17/30

    Abstract: Methods and systems for music information management are provided. When audio data is generated in an electronic device, a control module is notified to launch a specific application to perform a music recognition procedure for the audio data, thus to obtain music information corresponding to the audio data,


    Music information retrieval using a 3D search algorithm
    19.
    Invention publication
    Status: Under examination (published)

    Publication No.: EP1785891A1

    Publication date: 2007-05-16

    Application No.: EP05024429.2

    Filing date: 2005-11-09

    Inventor: Kemp, Thomas

    IPC classification: G06F17/30

    Abstract: The present invention generally relates to the field of content-based music information retrieval systems, in particular to a method and a query-by-humming (QbH) database system (100') for processing queries in the form of analog audio sequences which encompass recorded parts of sung, hummed or whistled tunes (102), recorded parts of a melody (300a) played on a musical instrument and/or a speaker's recorded voice (400) articulating at least one part of a song's lyrics to retrieve textual background information about a musical piece whose score is stored in an integrated database (103, 105) of said system after having analyzed and recognized said melody (300a).
    According to one embodiment of the present invention, said method is characterized by the steps of recording (S1) said analog audio sequences (102, 300a, 400), extracting (S4a) and analyzing (S4b) various acoustic-phonetic speech characteristics of the speaker's voice and pronunciation from spoken parts (400) of a recorded song's lyrics (102") and recognizing (S4c) syntax and semantics of said lyrics (102"). The method further comprises the steps of extracting (S2a), analyzing (S2b) and recognizing (S2c) musical key characteristics from the analog audio sequences (102, 300a, 400), which are given by the semitone numbers of the particular notes, the intervals and/or interval directions of the melody and the time values of the notes and pauses the rhythm of said melody is composed of, the key, beat, tempo, volume, agogics, dynamics, phrasing, articulation, timbre and instrumentation of said melody, the harmonies of accompaniment chords and/or electronic sound effects generated by said musical instrument. The invention is characterized by the step of calculating (S3a) a similarity measure indicating the similarity of melody and lyrics of the recorded audio sequence (102, 300a) compared to melody and lyrics of various music files stored in said database (103, 105) by performing a Viterbi search algorithm on a three-dimensional search space, said search space having a first dimension ( t ) for the time, a second dimension ( S ) for an appropriate coding of the acoustic-phonetic speech characteristics and a third dimension ( H ) for an appropriate coding of the musical key characteristics, and generating (S3b) a ranked list (107) of said music files.
