MUSIC SIMILARITY SYSTEMS AND METHODS USING DESCRIPTORS
    1.
    Patent application
    MUSIC SIMILARITY SYSTEMS AND METHODS USING DESCRIPTORS (status: under examination, published)

    Publication No.: US20080300702A1

    Publication date: 2008-12-04

    Application No.: US12128917

    Filing date: 2008-05-29

    IPC classification: G06F17/00

    CPC classification: G10L25/48 G06F16/683

    Abstract: Systems and methods for determining similarity between two or more audio pieces are disclosed. An illustrative method for determining musical similarities includes extracting one or more descriptors from each audio piece, generating a vector for each of the audio pieces, extracting one or more audio features from each of the audio pieces, calculating values for each audio feature, calculating a distance between a vector containing the normalized values and the vectors containing the audio pieces, and outputting a response to a user or process indicating the similarity between the audio pieces. The descriptors can be used in performing content-based audio classification and for determining similarities between music. The descriptors that can be extracted from each audio piece can include tonal descriptors, dissonance descriptors, rhythm descriptors, and spatial descriptors.

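The vector-and-distance step in the abstract above can be sketched roughly as follows. This is a minimal illustration, not the method claimed in the application: the abstract does not say how values are normalized or which distance is used, so min-max normalization and Euclidean distance are assumptions here, and the piece names and descriptor values are hypothetical.

```python
import math


def normalize_features(vectors):
    """Min-max normalize each feature column to [0, 1] across all pieces (assumed scheme)."""
    columns = list(zip(*vectors))
    lows = [min(c) for c in columns]
    highs = [max(c) for c in columns]
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0
         for v, lo, hi in zip(vec, lows, highs)]
        for vec in vectors
    ]


def euclidean_distance(a, b):
    """Euclidean distance between two equal-length vectors (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def rank_by_similarity(query_index, names, vectors):
    """Return the other pieces ordered from most to least similar to the query piece."""
    normalized = normalize_features(vectors)
    query = normalized[query_index]
    distances = [
        (euclidean_distance(query, vec), name)
        for i, (name, vec) in enumerate(zip(names, normalized))
        if i != query_index
    ]
    return [name for _, name in sorted(distances)]


# Hypothetical descriptor values per piece:
# [tonal strength, dissonance, tempo in BPM, stereo width]
names = ["piece_a", "piece_b", "piece_c"]
vectors = [
    [0.80, 0.20, 120.0, 0.50],
    [0.75, 0.25, 124.0, 0.55],
    [0.10, 0.90, 70.0, 0.10],
]
ranking = rank_by_similarity(0, names, vectors)
```

Normalizing before measuring distance keeps a wide-range feature such as tempo from dominating narrow-range ones such as dissonance.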

    Graphical Audio Signal Control
    3.
    Patent application
    Graphical Audio Signal Control (status: in force)

    Publication No.: US20120201385A1

    Publication date: 2012-08-09

    Application No.: US13367696

    Filing date: 2012-02-07

    IPC classification: H04R5/00

    Abstract: A signal processing section of a terminal converts acquired audio signals of a plurality of channels into a set of frequency spectra, calculates sound image positions corresponding to individual frequency components, and displays the calculated sound image positions on a display screen using a coordinate system whose coordinate axes are the frequency components and the sound image positions. A user-designated partial region of the coordinate system is set as a designated region, and an amplitude-level adjusting amount is set for the designated region, so that the signal processing section adjusts the amplitude levels of the frequency components that are included in the frequency spectra and fall within the designated region, converts the adjusted frequency components into audio signals, and outputs the converted audio signals.

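A rough sketch of the region-based adjustment described in the abstract above, for two channels whose spectra are already available as lists of complex bins. The abstract does not state how a sound image position is computed, so the amplitude-ratio formula below is an assumption, as are the function names and the sample values.

```python
def sound_image_position(left_bin, right_bin):
    """Assumed position measure in [-1, 1]: -1 is fully left, +1 is fully right."""
    l, r = abs(left_bin), abs(right_bin)
    total = l + r
    return 0.0 if total == 0 else (r - l) / total


def adjust_region(left, right, bin_range, pos_range, gain):
    """Scale every frequency bin whose (bin index, position) lies in the designated region."""
    lo_bin, hi_bin = bin_range
    lo_pos, hi_pos = pos_range
    out_left, out_right = list(left), list(right)
    for k, (l, r) in enumerate(zip(left, right)):
        pos = sound_image_position(l, r)
        if lo_bin <= k <= hi_bin and lo_pos <= pos <= hi_pos:
            out_left[k] = l * gain
            out_right[k] = r * gain
    return out_left, out_right


# Hypothetical 4-bin stereo spectra.
left = [1.0 + 0j, 0.5 + 0j, 0.1 + 0j, 0.5 + 0j]
right = [1.0 + 0j, 0.5 + 0j, 0.9 + 0j, 0.1 + 0j]

# Designated region: bins 0..2, positions near the center; gain 0.0 mutes them.
new_left, new_right = adjust_region(left, right, (0, 2), (-0.2, 0.2), 0.0)
```

Here bins 0 and 1 sit at the center and are muted, bin 2 is panned right and so falls outside the position range, and bin 3 lies outside the frequency range. An inverse transform back to audio signals would follow and is not shown.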

    Technique for suppressing particular audio component
    4.
    Granted patent
    Technique for suppressing particular audio component (status: in force)

    Publication No.: US09070370B2

    Publication date: 2015-06-30

    Application No.: US13284199

    Filing date: 2011-10-28

    Abstract: A coefficient train processing section, which sequentially generates per unit segment a processing coefficient train for suppressing a target component of an audio signal, includes a basic coefficient train generation section and a coefficient train processing section. The basic coefficient train generation section generates a basic coefficient train where basic coefficient values corresponding to frequencies within a particular frequency band range are each set at a suppression value that suppresses the audio signal, while coefficient values corresponding to frequencies outside the particular frequency band range are each set at a pass value that maintains the audio signal. The coefficient train processing section generates the processing coefficient train, per unit segment, by changing, to the pass value, each of the coefficient values corresponding to frequencies other than the target component among the coefficient values corresponding to the frequencies within the particular frequency band range.

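The two-stage coefficient train in the abstract above can be illustrated for a single unit segment as follows. The suppression and pass values (0.0 and 1.0), the bin indices, and the way the target bins are supplied are all assumptions for illustration; the abstract does not say how the target component is identified.

```python
SUPPRESS = 0.0  # assumed suppression value
PASS = 1.0      # assumed pass value


def basic_coefficient_train(num_bins, band):
    """Suppression value inside the band (inclusive bin range), pass value outside it."""
    lo, hi = band
    return [SUPPRESS if lo <= k <= hi else PASS for k in range(num_bins)]


def processing_coefficient_train(basic, band, target_bins):
    """Inside the band, restore the pass value for every bin that is not part of the target."""
    lo, hi = band
    return [
        PASS if lo <= k <= hi and k not in target_bins else c
        for k, c in enumerate(basic)
    ]


def apply_train(spectrum, train):
    """Multiply each spectral bin by its coefficient."""
    return [x * c for x, c in zip(spectrum, train)]


# Hypothetical segment: 8 bins, band covers bins 2..5, target component occupies bins 3 and 4.
band = (2, 5)
basic = basic_coefficient_train(8, band)
processing = processing_coefficient_train(basic, band, {3, 4})
suppressed = apply_train([1.0] * 8, processing)
```

Only the target bins inside the band keep the suppression value, so other components that share the band pass through unchanged.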

    Technique for estimating particular audio component
    5.
    Granted patent
    Technique for estimating particular audio component (status: in force)

    Publication No.: US09224406B2

    Publication date: 2015-12-29

    Application No.: US13284170

    Filing date: 2011-10-28

    IPC classification: H04R29/00 G10L25/90

    Abstract: Candidate frequencies per unit segment of an audio signal are identified. A first processing section identifies an estimated train that is a time series of candidate frequencies, each selected for a different one of the segments, arranged over a plurality of the unit segments and that has a high likelihood of corresponding to a time series of fundamental frequencies of a target component. A second processing section identifies a state train of states, each indicative of one of sound-generating and non-sound-generating states of the target component in a different one of the segments, arranged over the unit segments. Frequency information which designates, as a fundamental frequency of the target component, a candidate frequency corresponding to the unit segment in the estimated train is generated for each unit segment corresponding to the sound-generating state. Frequency information indicative of no sound generation is generated for each unit segment corresponding to the non-sound-generating state.

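One way to picture the two processing stages in the abstract above is a dynamic-programming path search over the candidates followed by a per-segment voicing decision. The sketch below is an illustration under stated assumptions, not the patented procedure: the per-candidate likelihoods, the octave-based jump penalty, and the simple likelihood threshold that stands in for the state train are all hypothetical.

```python
import math


def estimate_frequency_train(candidates, jump_penalty=1.0):
    """Pick one (frequency, likelihood) candidate per segment with a Viterbi-style search.

    candidates: one list per unit segment of (frequency_hz, likelihood) pairs.
    The score rewards likely candidates and penalizes pitch jumps (in octaves).
    """
    best = [[math.log(p) for _, p in candidates[0]]]
    back = []
    for prev, cur in zip(candidates, candidates[1:]):
        scores, pointers = [], []
        for f, p in cur:
            options = [
                best[-1][j] - jump_penalty * abs(math.log2(f / pf))
                for j, (pf, _) in enumerate(prev)
            ]
            j_best = max(range(len(options)), key=options.__getitem__)
            scores.append(options[j_best] + math.log(p))
            pointers.append(j_best)
        best.append(scores)
        back.append(pointers)
    # Trace the best path back from the last segment to the first.
    j = max(range(len(best[-1])), key=best[-1].__getitem__)
    path = [j]
    for pointers in reversed(back):
        j = pointers[j]
        path.append(j)
    path.reverse()
    return [candidates[i][j] for i, j in enumerate(path)]


def frequency_information(candidates, voicing_threshold=0.3):
    """Fundamental frequency per segment, or None where no sound generation is assumed."""
    train = estimate_frequency_train(candidates)
    return [f if p >= voicing_threshold else None for f, p in train]


# Hypothetical candidates for four unit segments.
candidates = [
    [(220.0, 0.6), (440.0, 0.4)],
    [(222.0, 0.5), (445.0, 0.5)],
    [(224.0, 0.2), (660.0, 0.25)],  # weak evidence: treated as non-sound-generating
    [(226.0, 0.7), (452.0, 0.3)],
]
info = frequency_information(candidates)
```

The abstract describes the sound-generating and non-sound-generating states as a state train identified by a second processing section; the threshold used here is only a simplified stand-in for that step.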

    Technique for Estimating Particular Audio Component
    6.
    Patent application
    Technique for Estimating Particular Audio Component (status: in force)

    Publication No.: US20120106746A1

    Publication date: 2012-05-03

    Application No.: US13284170

    Filing date: 2011-10-28

    IPC classification: H04R29/00

    Abstract: Candidate frequencies per unit segment of an audio signal are identified. A first processing section identifies an estimated train that is a time series of candidate frequencies, each selected for a different one of the segments, arranged over a plurality of the unit segments and that has a high likelihood of corresponding to a time series of fundamental frequencies of a target component. A second processing section identifies a state train of states, each indicative of one of sound-generating and non-sound-generating states of the target component in a different one of the segments, arranged over the unit segments. Frequency information which designates, as a fundamental frequency of the target component, a candidate frequency corresponding to the unit segment in the estimated train is generated for each unit segment corresponding to the sound-generating state. Frequency information indicative of no sound generation is generated for each unit segment corresponding to the non-sound-generating state.
