Extracting classifying data in music from an audio bitstream
    4.
    发明授权
    Extracting classifying data in music from an audio bitstream 失效
    从音频比特流中提取音乐中的分类数据

    公开(公告)号:US07295977B2

    公开(公告)日:2007-11-13

    申请号:US09939954

    申请日:2001-08-27

    IPC分类号: G10L15/16 G10H7/10

    CPC分类号: G10L17/02 G10L17/26 G10L25/30

    摘要: The method of the present invention utilizes machine-learning techniques, particularly Support Vector Machines in combination with a neural network, to process a unique machine-learning enabled representation of the audio bitstream. Using this method, a classifying machine is able to autonomously detect characteristics of a piece of music, such as the artist or genre, and classify it accordingly. The method includes transforming digital time-domain representation of music into a frequency-domain representation, then dividing that frequency data into time slices, and compressing it into frequency bands to form multiple learning representations of each song. The learning representations that result are processed by a group of Support Vector Machines, then by a neural network, both previously trained to distinguish among a given set of characteristics, to determine the classification.

    摘要翻译: 本发明的方法利用机器学习技术,特别是与神经网络相结合的支持向量机来处理独特的机器学习使能的音频比特流表示。 使用这种方法,分类机能够自主地检测诸如艺术家或流派之类的音乐的特征,并相应地对其进行分类。 该方法包括将音乐的数字时域表示变换为频域表示,然后将该频率数据划分为时间片,并将其压缩为频带,以形成每首歌曲的多个学习表示。 结果的学习表示由一组支持向量机处理,然后由神经网络处理,两者都被训练以区分给定的一组特征,以确定分类。

    Waveform generating device and method, and decoder
    5.
    发明申请
    Waveform generating device and method, and decoder 失效
    波形发生装置及方法及解码器

    公开(公告)号:US20050251709A1

    公开(公告)日:2005-11-10

    申请号:US10516819

    申请日:2003-06-27

    摘要: Amplitude, phase and frequency of a sine wave to be generated are calculated on the basis of feature quantity s1 delivered to feature quantity detecting means (2), and are sent to initialization means (3). The initialization means (3) calculates first two points of the sine wave to send the points thus calculated to oscillator (sine wave generating means) (4) as initial value s4. The oscillator (4) sequentially calculates values of respective sample points of waveform by using recurrence formula in accordance with initial value or values instructed from the initialization means (3) to thereby generate a sine wave signal. Thus, sine wave generation is performed without performing modulo-addressing.

    摘要翻译: 基于递送到特征量检测装置(2)的特征量s 1计算要产生的正弦波的幅度,相位和频率,并将其发送到初始化装置(3)。 初始化装置(3)计算正弦波的前两点,将由此计算的点发送到振荡器(正弦波产生装置)(4)作为初始值s 4。 振荡器(4)通过使用根据从初始化装置(3)指示的初始值或值产生正弦波信号的递归公式,顺序地计算各波形采样点的值。 因此,在不执行模寻址的情况下执行正弦波生成。

    Music tone generating method by waveform synthesis with advance
parameter computation

    公开(公告)号:US5913258A

    公开(公告)日:1999-06-15

    申请号:US32091

    申请日:1998-02-27

    申请人: Motoichi Tamura

    发明人: Motoichi Tamura

    IPC分类号: G10H1/00 G10H7/00 G10H7/10

    摘要: Musical tones are produced according to song data basically by three steps. The first step converts the song data sequentially into control parameters. The control parameters are written into a parameter memory. Then, the second step generates waveform data by using the control parameters written in the parameter memory. The generated waveform data are written into a waveform memory, while the used control parameters are erased from the parameter memory to provide a vacant area. Lastly, the third step reads the waveform data sequentially from the waveform memory to produce the musical tones. Characterizingly, the second step of generating waveform data is executed dependently on progression of the third step of reading the waveform data. Further, the first step of converting the song data is executed independently from progression of the second step of generating waveform data as long as the parameter memory has the vacant area sufficient to store the control parameters converted from the song data.

    Method and system for editing digital audio information with music-like
parameters
    8.
    发明授权
    Method and system for editing digital audio information with music-like parameters 失效
    使用类似音乐的参数编辑数字音频信息的方法和系统

    公开(公告)号:US5792971A

    公开(公告)日:1998-08-11

    申请号:US715529

    申请日:1996-09-18

    摘要: The present invention provides a method for editing digital audio information, such as musical material. Original musical parameters (302) are extracted and/or inputted from recorded original digital audio material (300). The original musical parameters (302) are then edited. The resulting edited musical parameters (304) are compared to the original musical parameters (302) to provide time varying control functions (308, 310, 312). The original digital audio material (300) is then processed with signal processing algorithms (314, 316, 318) which are controlled by the time varying control functions (308, 310, 312). This processing changes the original digital audio material (300) into new digital audio material (320) having musical characteristics which correspond to the edited musical parameters (304).

    摘要翻译: 本发明提供了一种用于编辑诸如音乐材料的数字音频信息的方法。 原始音乐参数(302)从记录的原始数字音频素材(300)提取和/或输入。 然后对原来的音乐参数(302)进行编辑。 将所得到的编辑音乐参数(304)与原始音乐参数(302)进行比较,以提供时变控制功能(308,310,312)。 然后用由时变控制功能(308,310,312)控制的信号处理算法(314,316,318)处理原始数字音频材料(300)。 该处理将原始数字音频材料(300)改变为具有与编辑的音乐参数(304)相对应的音乐特征的新数字音频材料(320)。

    Inverse transform narrow band/broad band sound synthesis
    9.
    发明授权
    Inverse transform narrow band/broad band sound synthesis 失效
    反变换窄带/宽带声合成

    公开(公告)号:US5686683A

    公开(公告)日:1997-11-11

    申请号:US551889

    申请日:1995-10-23

    申请人: Adrian Freed

    发明人: Adrian Freed

    摘要: An additive sound synthesis process for generating complex, realistic sounds is realized in a computationally efficient manner. In accordance with one aspect of the invention, polyphony is efficiently achieved by dosing the energy of a given partial between separate transform sums corresponding to different channels. In accordance with another aspect of the invention, noise is injected by randomly perturbing the phase of the sound, either on a per-partial basis or on a transform-sum basis. In the latter instance, the phase is perturbed in different regions of the spectrum to a degree determined by the amount of energy present in the respective regions of the spectrum. In accordance with yet another aspect of the invention, a transform sum representing a sound is processed in the transform domain to achieve with great economy effects achievable only at much greater expense outside the transform domain. Other transforms besides the Fourier transform may be used to advantage. For example, use of the Hartley transform produces comparable results but allows transforms to be computed at approximately twice the speed as the Fourier transform.

    摘要翻译: 用计算上有效的方式实现了用于产生复杂,逼真的声音的附加声音合成过程。 根据本发明的一个方面,通过在对应于不同信道的不同变换和之间计量给定部分的能量来有效地实现复音。 根据本发明的另一方面,噪声通过随机扰动声音的相位而被注入,无论是在每个部分的基础上还是在基于变换的基础上。 在后一种情况下,该相位在频谱的不同区域被扰动到由光谱的各个区域中存在的能量的量确定的程度。 根据本发明的另一方面,在变换域中处理表示声音的变换和,以实现仅在转换域之外以更大的费用实现的巨大的经济效应。 除了傅里叶变换之外的其他变换也可以被使用。 例如,使用Hartley变换产生可比较的结果,但允许以大约是傅立叶变换的两倍的速度来计算变换。

    Tone signal synthesizer employing a closed wave guide network
    10.
    发明授权
    Tone signal synthesizer employing a closed wave guide network 失效
    使用封闭波导网络的音频信号合成器

    公开(公告)号:US5554813A

    公开(公告)日:1996-09-10

    申请号:US436924

    申请日:1995-05-08

    摘要: In a closed wave guide network having a bidirectional signal transmitting channel section and a signal junction section, signal delay time is variably controlled by a first parameter group so as to control the resonance frequency characteristics of the wave guide network. A signal excitor is connected to the wave guide network so that an excited signal is supplied to the network. The excitation frequency of the excitor is controlled in accordance with a second parameter group. There are also provided a combination determination section which, in correspondence to the pitch of a tone to be generated, determines a combination of the first parameter group to be used in the wave guide network and the second parameter group to be used in the excitor, and a parameter generator which, in accordance with the combination determined by the combination determination section, generates and supplies individual parameters of the first and second parameter groups. The pitch of a tone to be generated is determined by a combination of the resonance characteristics of the wave guide network and the excitation frequency of the signal excitor.

    摘要翻译: 在具有双向信号发送信道部分和信号结部分的封闭波导网络中,信号延迟时间由第一参数组可变地控制,以便控制波导网络的谐振频率特性。 信号激励器连接到波导网络,使得激励信号被提供给网络。 根据第二参数组来控制激励器的激励频率。 还提供了一种组合确定部分,其对应于要生成的音调的音调,确定在波导网络中要使用的第一参数组和要在该激励器中使用的第二参数组的组合, 以及参数发生器,其根据由组合确定部确定的组合生成并提供第一和第二参数组的各个参数。 要产生的音调的音调由波导网络的谐振特性和信号激励器的激励频率的组合决定。