MUSIC SIMILARITY SYSTEMS AND METHODS USING DESCRIPTORS
    32.
    Type: Invention application
    Status: Under examination (published)

    Publication No.: US20080300702A1

    Publication date: 2008-12-04

    Application No.: US12128917

    Filing date: 2008-05-29

    IPC classification: G06F17/00

    CPC classification: G10L25/48 G06F16/683

    Abstract: Systems and methods for determining similarity between two or more audio pieces are disclosed. An illustrative method for determining musical similarities includes extracting one or more descriptors from each audio piece, generating a vector for each of the audio pieces, extracting one or more audio features from each of the audio pieces, calculating values for each audio feature, calculating a distance between a vector containing the normalized values and the vectors containing the audio pieces, and outputting a response to a user or process indicating the similarity between the audio pieces. The descriptors can be used in performing content-based audio classification and for determining similarities between music. The descriptors that can be extracted from each audio piece can include tonal descriptors, dissonance descriptors, rhythm descriptors, and spatial descriptors.
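
The vector-and-distance step in this abstract can be illustrated with a minimal sketch. The abstract does not say which normalization or distance metric is used, so min-max scaling and Euclidean distance are assumptions here, and the descriptor values and function names are hypothetical.

```python
import math

def normalize(vectors):
    """Min-max scale each feature column to [0, 1] across all pieces (assumed scheme)."""
    columns = list(zip(*vectors))
    lows = [min(c) for c in columns]
    highs = [max(c) for c in columns]
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0
         for v, lo, hi in zip(vec, lows, highs)]
        for vec in vectors
    ]

def distance(a, b):
    """Euclidean distance between two equal-length vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def most_similar(query_index, vectors):
    """Return indices of the other pieces, nearest to the query first."""
    scaled = normalize(vectors)
    query = scaled[query_index]
    ranked = sorted(
        (distance(query, vec), i)
        for i, vec in enumerate(scaled) if i != query_index
    )
    return [i for _, i in ranked]

# Hypothetical descriptor values: [tonal, dissonance, rhythm (BPM), spatial]
pieces = [
    [0.80, 0.10, 120.0, 0.5],
    [0.78, 0.12, 118.0, 0.6],
    [0.20, 0.70, 70.0, 0.1],
]
ranking = most_similar(0, pieces)
```

Scaling before measuring distance keeps a large-valued feature such as tempo from dominating the smaller-valued descriptors.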

    Music-piece processing apparatus and method
    33.
    Type: Invention application
    Status: In force (granted)

    Publication No.: US20080115658A1

    Publication date: 2008-05-22

    Application No.: US11985212

    Filing date: 2007-11-13

    IPC classification: G10H1/22

    Abstract: For each of a plurality of music pieces, a storage device stores respective tone data of a plurality of fragments of the music piece and respective musical character values of the fragments. A similarity determination section calculates a similarity index value indicative of a degree of similarity between the character values of each of the fragments of a main music piece and the character values of each individual fragment of a plurality of sub music pieces. Each of the similarity index values calculated for the fragments of each of the sub music pieces can be adjusted in accordance with a user's control. A processing section processes the tone data of each of the fragments of the main music piece on the basis of the tone data of any one of the fragments of the sub music pieces of which the similarity index value indicates sufficient similarity.
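
The fragment-matching logic described here can be sketched as follows. The abstract does not define the similarity index, the form of the user adjustment, or what counts as sufficient similarity, so the distance-based index, the per-piece weights, and the threshold below are all assumptions, as are the function names and sample values.

```python
def similarity_index(a, b):
    """Map the Euclidean distance between character-value vectors to (0, 1] (assumed index)."""
    dist = sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5
    return 1.0 / (1.0 + dist)

def pick_substitutes(main_fragments, sub_pieces, weights, threshold):
    """For each main fragment, return (piece name, fragment index) of the best
    sub-piece fragment whose weighted index reaches the threshold, else None."""
    result = []
    for main_values in main_fragments:
        best, best_score = None, threshold
        for name, fragments in sub_pieces.items():
            for idx, values in enumerate(fragments):
                # The user-controlled adjustment is modeled as a per-piece weight.
                score = similarity_index(main_values, values) * weights.get(name, 1.0)
                if score >= best_score:
                    best, best_score = (name, idx), score
        result.append(best)
    return result

# Hypothetical character values for two main fragments and two sub pieces.
main = [[1.0, 0.0], [0.0, 1.0]]
subs = {"A": [[1.0, 0.1], [5.0, 5.0]], "B": [[0.0, 0.9]]}
picks = pick_substitutes(main, subs, {"A": 1.0, "B": 1.0}, threshold=0.8)
damped = pick_substitutes(main, subs, {"B": 0.5}, threshold=0.8)
```

Lowering the weight of piece "B" in the second call removes it as a source, which is one way a user's control could steer which sub piece supplies the replacement tone data.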

    Singing voice synthesizing apparatus, singing voice synthesizing method, and program for realizing singing voice synthesizing method

    Publication No.: US07016841B2

    Publication date: 2006-03-21

    Application No.: US10034359

    Filing date: 2001-12-27

    IPC classification: G10L13/00 G10H7/00

    CPC classification: G10L13/07

    Abstract: A singing voice synthesizing apparatus is provided, which enables achievement of a natural sounding synthesized singing voice with a good level of comprehensibility. A phoneme database stores a plurality of voice fragment data formed of voice fragments each being a single phoneme or a phoneme chain of at least two concatenated phonemes, each of the plurality of voice fragment data comprising data of a deterministic component and data of a stochastic component. A readout device reads out from the phoneme database the voice fragment data corresponding to inputted lyrics. A duration time adjusting device adjusts time duration of the read-out voice fragment data so as to match a desired tempo and manner of singing. An adjusting device adjusts the deterministic component and the stochastic component of the read-out voice fragment so as to match a desired pitch. A synthesizing device synthesizes a singing sound by sequentially concatenating the voice fragment data that have been adjusted by the duration time adjusting device and the adjusting device.
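
The readout, duration-adjustment, and concatenation stages can be sketched in a few lines. This is only an illustration under stated assumptions: the greedy preference for two-phoneme chains, the nearest-index resampling, and the toy database of labeled frames are hypothetical, and the pitch adjustment of the deterministic and stochastic components is left out.

```python
def read_out(phonemes, database):
    """Greedy readout: use a two-phoneme chain when the database has one (assumed policy)."""
    units, i = [], 0
    while i < len(phonemes):
        chain = "-".join(phonemes[i:i + 2])
        if i + 1 < len(phonemes) and chain in database:
            units.append(chain)
            i += 2
        else:
            units.append(phonemes[i])
            i += 1
    return units

def stretch(frames, target_len):
    """Adjust duration by repeating or skipping frames (nearest-index resampling)."""
    n = len(frames)
    return [frames[i * n // target_len] for i in range(target_len)]

def synthesize(phonemes, database, frames_per_unit):
    """Concatenate the duration-adjusted fragments in lyric order."""
    out = []
    for unit in read_out(phonemes, database):
        out.extend(stretch(database[unit], frames_per_unit))
    return out

# Hypothetical database: each fragment is a list of frame labels.
database = {
    "s-a": ["sa1", "sa2"],
    "s": ["s1"],
    "a": ["a1"],
    "i": ["i1", "i2", "i3", "i4"],
}
song = synthesize(["s", "a", "i"], database, frames_per_unit=3)
```

In this toy run the chain "s-a" is lengthened from two frames to three, while "i" is shortened from four frames to three.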

    Singing voice synthesizing method
    35.
    Type: Granted invention patent

    Publication No.: US06992245B2

    Publication date: 2006-01-31

    Application No.: US10375420

    Filing date: 2003-02-27

    IPC classification: G10H1/06 G10H7/00

    Abstract: A frequency spectrum is detected by analyzing a frequency of a voice waveform corresponding to a voice synthesis unit formed of a phoneme or a phonemic chain. Local peaks are detected on the frequency spectrum, and spectrum distribution regions including the local peaks are designated. For each spectrum distribution region, amplitude spectrum data representing an amplitude spectrum distribution depending on a frequency axis and phase spectrum data representing a phase spectrum distribution depending on the frequency axis are generated. The amplitude spectrum data is adjusted to move the amplitude spectrum distribution represented by the amplitude spectrum data along the frequency axis based on an input note pitch, and the phase spectrum data is adjusted corresponding to the adjustment. Spectrum intensities are adjusted to follow a spectrum envelope corresponding to a desired tone color. The adjusted amplitude and phase spectrum data are converted into a synthesized voice signal.
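
The peak detection, region designation, and frequency-axis shift can be sketched on a plain list of amplitude bins. This is a simplified illustration, not the patented method: the midpoint region boundaries and the rounding of shifted peak positions are assumptions, all names are hypothetical, and the phase adjustment and spectrum-envelope correction are omitted.

```python
def local_peaks(spectrum):
    """Indices of bins that rise above the left neighbour and are not below the right one."""
    return [k for k in range(1, len(spectrum) - 1)
            if spectrum[k - 1] < spectrum[k] >= spectrum[k + 1]]

def regions(spectrum, peaks):
    """One (start, end) bin range per peak, split at midpoints between peaks (assumed rule)."""
    bounds = [0] + [(a + b + 1) // 2 for a, b in zip(peaks, peaks[1:])] + [len(spectrum)]
    return list(zip(bounds, bounds[1:]))

def shift_pitch(spectrum, ratio):
    """Move each region so that its peak lands at round(peak * ratio)."""
    peaks = local_peaks(spectrum)
    out = [0.0] * len(spectrum)
    for peak, (start, end) in zip(peaks, regions(spectrum, peaks)):
        offset = round(peak * ratio) - peak
        for k in range(start, end):
            if 0 <= k + offset < len(out):
                # Where shifted regions overlap, keep the larger amplitude.
                out[k + offset] = max(out[k + offset], spectrum[k])
    return out

# Hypothetical amplitude spectrum with peaks at bins 2 and 6.
spectrum = [0, 1, 5, 1, 0, 0, 2, 0, 0, 0, 0, 0, 0, 0, 0, 0]
shifted = shift_pitch(spectrum, ratio=2.0)
```

Shifting whole regions rather than individual bins keeps the local shape around each peak intact while the peak frequencies follow the requested pitch ratio.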

    Voice converter for assimilation by frame synthesis with temporal alignment
    36.
    Type: Granted invention patent
    Status: Lapsed

    Publication No.: US06836761B1

    Publication date: 2004-12-28

    Application No.: US09693144

    Filing date: 2000-10-20

    IPC classification: G10L13/00

    CPC classification: G10L13/033 G10L2021/0135

    Abstract: A voice converting apparatus is constructed for converting an input voice into an output voice according to a target voice. In the apparatus, a storage section provisionally stores source data, which is associated with and extracted from the target voice. An analyzing section analyzes the input voice to extract therefrom a series of input data frames representing the input voice. A producing section produces a series of target data frames representing the target voice based on the source data, while aligning the target data frames with the input data frames to secure synchronization between the target data frames and the input data frames. A synthesizing section synthesizes the output voice according to the target data frames and the input data frames.
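
The alignment and synthesis steps can be illustrated with a minimal sketch. The abstract does not specify how frames are aligned or combined, so the proportional index mapping and the linear blend controlled by `mix` are assumptions, and the frame values and function names are hypothetical.

```python
def align(target_frames, n_input):
    """Select one target frame per input frame by proportional index mapping (assumed)."""
    n_target = len(target_frames)
    return [target_frames[i * n_target // n_input] for i in range(n_input)]

def convert(input_frames, target_frames, mix=1.0):
    """Blend each input frame with its aligned target frame.
    mix=0.0 keeps the input voice, mix=1.0 yields the target voice."""
    aligned = align(target_frames, len(input_frames))
    return [
        [(1.0 - mix) * x + mix * t for x, t in zip(frame, tgt)]
        for frame, tgt in zip(input_frames, aligned)
    ]

# Hypothetical two-parameter frames: four input frames, two stored target frames.
input_frames = [[1.0, 0.0]] * 4
target_frames = [[0.0, 2.0], [0.0, 4.0]]
output = convert(input_frames, target_frames, mix=0.5)
```

Because the target series is resampled to the input length before blending, every output frame has a synchronized pair of input and target frames to draw from.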
