Estimating pitch by modeling audio as a weighted mixture of tone models for harmonic structures
    1.
    Granted patent
    Status: in force

    Publication No.: US08543387B2

    Publication date: 2013-09-24

    Application No.: US11849217

    Filing date: 2007-08-31

    IPC classification: G10L11/04 G10L19/14 G10L19/00

    Abstract: Disclosed herein is a pitch estimation apparatus and associated methods for estimating a fundamental frequency of an audio signal from a fundamental frequency probability density function by modeling the audio signal as a weighted mixture of a plurality of tone models corresponding respectively to harmonic structures of individual fundamental frequencies, so that the fundamental frequency probability density function of the audio signal is given as a distribution of respective weights of the plurality of the tone models.
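The abstract above treats the fundamental frequency probability density function as the set of weights of a mixture of tone models. The following is a minimal Python sketch of that idea under stated assumptions: the Gaussian-bump tone model, the EM-style weight update, the candidate frequency list, and all function names (`tone_model`, `estimate_weights`) are illustrative choices that do not come from the patent text.

```python
import math

def tone_model(f0, freqs, n_harmonics=4, sigma=3.0):
    """Hypothetical tone model: Gaussian bumps at integer multiples of f0,
    with amplitude 1/h for harmonic h, normalised to sum to 1 over freqs."""
    shape = [
        sum(math.exp(-0.5 * ((x - h * f0) / sigma) ** 2) / h
            for h in range(1, n_harmonics + 1))
        for x in freqs
    ]
    total = sum(shape)
    return [s / total for s in shape]

def estimate_weights(spectrum, models, n_iter=50):
    """Assumed EM-style update for the mixture weights, with the normalised
    amplitude spectrum treated as an observed probability distribution."""
    total = sum(spectrum)
    p = [s / total for s in spectrum]
    k = len(models)
    w = [1.0 / k] * k                      # start from uniform weights
    for _ in range(n_iter):
        new_w = [0.0] * k
        for x, px in enumerate(p):
            mix = sum(w[j] * models[j][x] for j in range(k))
            if mix <= 0.0:                 # no model explains this bin
                continue
            for j in range(k):
                # share of bin x attributed to model j
                new_w[j] += px * w[j] * models[j][x] / mix
        s = sum(new_w)
        w = [v / s for v in new_w]
    return w

freqs = list(range(50, 1001, 5))           # frequency bins in Hz (assumed grid)
candidates = [110.0, 165.0, 220.0, 330.0]  # candidate fundamental frequencies
models = [tone_model(f, freqs) for f in candidates]
spectrum = models[2]                       # synthetic input: a pure 220 Hz tone
weights = estimate_weights(spectrum, models)
estimated_f0 = candidates[max(range(len(weights)), key=weights.__getitem__)]
```

Here `weights` plays the role of the fundamental frequency probability density function over the candidates, and the estimate is simply the candidate with the largest weight.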

    PITCH ESTIMATION APPARATUS, PITCH ESTIMATION METHOD, AND PROGRAM
    2.
    Patent application
    Status: in force

    Publication No.: US20080262836A1

    Publication date: 2008-10-23

    Application No.: US11849217

    Filing date: 2007-08-31

    IPC classification: G10L11/04

    Abstract: In a pitch estimation apparatus, a function estimation part estimates a fundamental frequency probability density function of an audio signal by repeating a weight calculation process and an estimated shape specification process. The weight calculation process calculates a weight of each tone model of each fundamental frequency based on an estimated shape of each tone model of each fundamental frequency. The estimated shape indicates a degree of dominancy of a corresponding tone model in a total harmonic structure of the audio signal. The estimated shape specification process specifies each estimated shape of each tone model based on an amplitude spectrum of the audio signal, the harmonic structure of each tone model and the weight of each tone model. A similarity analysis part calculates a similarity index value indicating a degree of similarity between each tone model and corresponding estimated shape. A weight correction part reduces a weight of a tone model of a certain fundamental frequency having the similarity index value indicating that the tone model and the corresponding estimated shape are not similar to each other.
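The weight correction step in this abstract can be illustrated with a small Python sketch. Everything specific in it is an assumption made for illustration: the estimated shape is taken to be the share of the spectrum attributed to each tone model, cosine similarity stands in for the unspecified similarity index, and the `threshold` and `penalty` parameters are hypothetical.

```python
import math

def estimated_shapes(p, models, weights):
    """Share of the observed spectrum p attributed to each tone model."""
    shapes = [[0.0] * len(p) for _ in models]
    for x, px in enumerate(p):
        mix = sum(w * m[x] for w, m in zip(weights, models))
        if mix <= 0.0:
            continue
        for j, (w, m) in enumerate(zip(weights, models)):
            shapes[j][x] = px * w * m[x] / mix
    return shapes

def cosine_similarity(a, b):
    """Assumed similarity index; the abstract does not name a measure."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na > 0.0 and nb > 0.0 else 0.0

def correct_weights(p, models, weights, threshold=0.8, penalty=0.1):
    """Scale down the weight of any model whose estimated shape does not
    resemble the model itself, then renormalise the weights."""
    shapes = estimated_shapes(p, models, weights)
    corrected = [
        w * penalty if cosine_similarity(m, s) < threshold else w
        for w, m, s in zip(weights, models, shapes)
    ]
    total = sum(corrected)
    return [c / total for c in corrected]

# Toy example: the spectrum matches model A; model B only shares one bin.
model_a = [0.5, 0.0, 0.5, 0.0, 0.0, 0.0]
model_b = [0.0, 0.0, 0.5, 0.0, 0.5, 0.0]
spectrum = [0.5, 0.0, 0.5, 0.0, 0.0, 0.0]
corrected = correct_weights(spectrum, [model_a, model_b], [0.5, 0.5])
```

In the toy example, model B explains only one of its two peaks, so its estimated shape differs from the model and its weight is reduced relative to model A.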

    SOUND ANALYSIS APPARATUS AND PROGRAM
    3.
    Patent application
    Status: in force

    Publication No.: US20080053295A1

    Publication date: 2008-03-06

    Application No.: US11849232

    Filing date: 2007-08-31

    IPC classification: G10H7/00

    CPC classification: G10H3/125 G10H2210/066

    Abstract: A sound analysis apparatus stores sound source structure data defining a constraint on one or more of sounds that can be simultaneously generated by a sound source of an input audio signal. A form estimation part selects fundamental frequencies of one or more of sounds likely to be contained in the input audio signal with peaked weights from various fundamental frequencies during sequential updating and optimizing of weights of tone models corresponding to the various fundamental frequencies, so that the sounds of the selected fundamental frequencies satisfy the sound source structure data, and creates form data specifying the selected fundamental frequencies. A previous distribution imparting part imparts a previous distribution to the weights of the tone models corresponding to the various fundamental frequencies so as to emphasize weights corresponding to the fundamental frequencies specified by the form data created by the form estimation part.

    Sound analysis apparatus and program
    4.
    Granted patent
    Status: in force

    Publication No.: US07858869B2

    Publication date: 2010-12-28

    Application No.: US12037036

    Filing date: 2008-02-25

    IPC classification: G10H1/18 G06F17/00

    CPC classification: G10H1/361 G10H2210/066

    Abstract: A sound analysis apparatus employs tone models which are associated with various fundamental frequencies and each of which simulates a harmonic structure of a performance sound generated by a musical instrument, then defines a weighted mixture of the tone models to simulate frequency components of the performance sound, further sequentially updates and optimizes weight values of the respective tone models so that a frequency distribution of the weighted mixture of the tone models corresponds to a distribution of the frequency components of the performance sound, and estimates the fundamental frequency of the performance sound based on the optimized weight values.

    SOUND ANALYSIS APPARATUS AND PROGRAM
    5.
    Patent application
    Status: in force

    Publication No.: US20080202321A1

    Publication date: 2008-08-28

    Application No.: US12037036

    Filing date: 2008-02-25

    IPC classification: G10H7/00

    CPC classification: G10H1/361 G10H2210/066

    Abstract: A sound analysis apparatus employs tone models which are associated with various fundamental frequencies and each of which simulates a harmonic structure of a performance sound generated by a musical instrument, then defines a weighted mixture of the tone models to simulate frequency components of the performance sound, further sequentially updates and optimizes weight values of the respective tone models so that a frequency distribution of the weighted mixture of the tone models corresponds to a distribution of the frequency components of the performance sound, and estimates the fundamental frequency of the performance sound based on the optimized weight values.

    Sound analysis apparatus and program
    6.
    Granted patent
    Status: in force

    Publication No.: US07754958B2

    Publication date: 2010-07-13

    Application No.: US11849232

    Filing date: 2007-08-31

    IPC classification: G10H1/00

    CPC classification: G10H3/125 G10H2210/066

    Abstract: A sound analysis apparatus stores sound source structure data defining a constraint on one or more of sounds that can be simultaneously generated by a sound source of an input audio signal. A form estimation part selects fundamental frequencies of one or more of sounds likely to be contained in the input audio signal with peaked weights from various fundamental frequencies during sequential updating and optimizing of weights of tone models corresponding to the various fundamental frequencies, so that the sounds of the selected fundamental frequencies satisfy the sound source structure data, and creates form data specifying the selected fundamental frequencies. A previous distribution imparting part imparts a previous distribution to the weights of the tone models corresponding to the various fundamental frequencies so as to emphasize weights corresponding to the fundamental frequencies specified by the form data created by the form estimation part.

    Sound signal processing apparatus and method
    7.

    Publication No.: US08494668B2

    Publication date: 2013-07-23

    Application No.: US12378719

    Filing date: 2009-02-19

    Abstract: Character value of a sound signal is extracted for each unit portion, and degrees of similarity between the character values of the individual unit portions are calculated and arranged in a matrix configuration. The matrix has arranged in each column the degrees of similarity acquired by comparing, for each of the unit portions, the sound signal and a delayed sound signal obtained by delaying the sound signal by a time difference equal to an integral multiple of a time length of the unit portion, and it has a plurality of the columns in association with different time differences. Repetition probability is calculated for each of the columns corresponding to the different time differences in the matrix. A plurality of peaks in a distribution of the repetition probabilities are identified. The loop region in the sound signal is identified by collating a reference matrix with the degree of similarity matrix.
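The lag-indexed similarity matrix and per-column repetition probability described in this abstract can be sketched in Python as follows. The sketch makes several assumptions not stated in the abstract: feature vectors are compared with cosine similarity, the repetition probability of a column is taken to be its mean similarity, and peaks are plain local maxima. The final step of collating a reference matrix is not covered, and all names are hypothetical.

```python
import math

def cosine(a, b):
    """Assumed similarity between two per-unit feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0.0 else 0.0

def lag_columns(features, max_lag):
    """One column per time difference (lag, in unit portions): similarity of
    each unit with the unit that lies `lag` units earlier."""
    n = len(features)
    return {
        lag: [cosine(features[i], features[i - lag]) for i in range(lag, n)]
        for lag in range(1, max_lag + 1)
    }

def repetition_scores(columns):
    """Assumed repetition probability: mean similarity within each column."""
    return {lag: sum(col) / len(col) for lag, col in columns.items()}

def peak_lags(scores):
    """Lags whose score is a strict local maximum (end points are skipped)."""
    lags = sorted(scores)
    return [
        lag for prev, lag, nxt in zip(lags, lags[1:], lags[2:])
        if scores[lag] > scores[prev] and scores[lag] > scores[nxt]
    ]

# Synthetic signal: a 4-unit feature pattern repeated four times.
pattern = [(1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (0.2, 1.0)]
features = pattern * 4
scores = repetition_scores(lag_columns(features, max_lag=6))
peaks = peak_lags(scores)
best_lag = max(scores, key=scores.get)
```

For the synthetic pattern, the column for a lag of four units contains only near-perfect matches, so that lag stands out as the strongest repetition period.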

    Sound signal processing apparatus and method
    8.
    Patent application
    Status: in force

    Publication No.: US20090216354A1

    Publication date: 2009-08-27

    Application No.: US12378719

    Filing date: 2009-02-19

    IPC classification: G06F17/00

    Abstract: Character value of a sound signal is extracted for each unit portion, and degrees of similarity between the character values of the individual unit portions are calculated and arranged in a matrix configuration. The matrix has arranged in each column the degrees of similarity acquired by comparing, for each of the unit portions, the sound signal and a delayed sound signal obtained by delaying the sound signal by a time difference equal to an integral multiple of a time length of the unit portion, and it has a plurality of the columns in association with different time differences. Repetition probability is calculated for each of the columns corresponding to the different time differences in the matrix. A plurality of peaks in a distribution of the repetition probabilities are identified. The loop region in the sound signal is identified by collating a reference matrix with the degree of similarity matrix.

    Searching for a tone data set based on a degree of similarity to a rhythm pattern
    9.
    Granted patent
    Status: in force

    Publication No.: US09053696B2

    Publication date: 2015-06-09

    Application No.: US13395433

    Filing date: 2011-12-01

    IPC classification: G10H1/40

    Abstract: It is an object of the present invention to provide an improved technique for searching for a tone data set of a phrase constructed in a rhythm pattern that satisfies a predetermined condition of similarity to a rhythm pattern intended by a user. The user inputs a rhythm pattern via a rhythm input device. An input rhythm pattern storage section stores the input rhythm pattern into a RAM on the basis of clock signals output from a bar line clock output section and trigger data included in the input rhythm pattern. A rhythm pattern search section searches through a rhythm database for a tone data set presenting the highest degree of similarity to the stored input rhythm pattern. A performance processing section causes a sound output section to audibly output the searched-out tone data set.
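The search step in this abstract, finding the stored pattern most similar to the user's input rhythm, can be sketched in Python as below. The representation of a rhythm pattern as onset positions within one bar (values in [0, 1)), the nearest-onset distance measure, and the database contents are all assumptions made for illustration; the abstract does not specify how similarity is computed.

```python
def pattern_distance(a, b):
    """Assumed dissimilarity: symmetric mean distance from each onset in one
    pattern to the nearest onset in the other (smaller means more similar)."""
    def one_way(src, dst):
        return sum(min(abs(s - d) for d in dst) for s in src) / len(src)
    return one_way(a, b) + one_way(b, a)

def search(database, query):
    """Return the name of the stored pattern closest to the query."""
    return min(database, key=lambda name: pattern_distance(database[name], query))

# Hypothetical rhythm database: onset positions as fractions of a bar.
rhythm_db = {
    "four_on_floor": [0.0, 0.25, 0.5, 0.75],
    "backbeat": [0.25, 0.75],
    "offbeat_eighths": [0.125, 0.375, 0.625, 0.875],
}

# A slightly imprecise user input that is close to four even beats.
query = [0.02, 0.27, 0.49, 0.76]
best = search(rhythm_db, query)
```

A symmetric distance is used so that a sparse stored pattern whose few onsets happen to coincide with the input does not win over a pattern that matches every input onset.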

    Method, apparatus, and program for assessing similarity of performance sound
    10.
    Granted patent
    Status: in force

    Publication No.: US07659472B2

    Publication date: 2010-02-09

    Application No.: US12177398

    Filing date: 2008-07-22

    Applicant: Keita Arimoto

    Inventor: Keita Arimoto

    IPC classification: G10H7/08 A63H5/00

    Abstract: A similarity assessment apparatus is provided for assessing a performance sound based on a model performance sound. In the apparatus, a probability density function generating unit divides data of a performance sound into a sequence of frames each having a predetermined temporal length, and generates a probability density function of a fundamental frequency for each frame of the performance sound. A probability density function providing portion provides a probability density function of a fundamental frequency for each frame of the model performance sound. A similarity assessment unit compares the generated probability density function of a frame of the performance sound with the provided probability density function of a frame of the model performance sound so as to assess a similarity between the performance sound and the model performance sound.
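The frame-by-frame comparison of fundamental frequency probability density functions can be sketched in Python as follows. The abstract does not say how two density functions are compared, so the Bhattacharyya coefficient used here is an assumed stand-in, as are the averaging over frames and the function names.

```python
import math

def bhattacharyya(p, q):
    """Assumed per-frame similarity between two discrete distributions:
    1.0 for identical distributions, 0.0 when they do not overlap."""
    return sum(math.sqrt(a * b) for a, b in zip(p, q))

def performance_similarity(perf_frames, model_frames):
    """Average the per-frame similarity over aligned frames."""
    scores = [bhattacharyya(p, q) for p, q in zip(perf_frames, model_frames)]
    return sum(scores) / len(scores)

# Each frame is a distribution over three candidate fundamental frequencies.
model_frames = [[0.8, 0.2, 0.0], [0.0, 0.9, 0.1]]
close_take = [[0.8, 0.2, 0.0], [0.0, 0.9, 0.1]]   # same pitches as the model
off_take = [[0.0, 0.2, 0.8], [0.9, 0.0, 0.1]]     # mostly different pitches

close_score = performance_similarity(close_take, model_frames)
off_score = performance_similarity(off_take, model_frames)
```

Comparing whole distributions rather than single pitch estimates means a frame with several plausible fundamental frequencies can still receive partial credit.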