Measuring Waveforms With The Digital Infinite Exponential Transform
    1.
    Patent application
    Status: under examination (published)

    Publication No.: US20160112225A1

    Publication date: 2016-04-21

    Application No.: US14885693

    Filing date: 2015-10-16

    Inventor: Frederick M. SLAY

    IPC classification: H04L27/156

    Abstract: A method of selecting a signal of interest from a compound signal is performed. The method includes generating a matrix and initializing a first row of the matrix to zero. The compound signal is obtained as a digital waveform signal with a sampling rate of S samples per second. For each sample of the digital signal, a new entry is recursively computed for the matrix. For each frequency bin in the matrix (where f is the center frequency of the bin), the value in the new row is computed by multiplying the value for that bin in the previous row by the complex number $r \cdot e^{i 2\pi f / S}$ and adding the new signal sample multiplied by a real constant. The method includes identifying the signal of interest from the matrix, which eliminates the uncertainty about which frequency bin contains the signal of interest.
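
The recursion described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function names, the decay factor `r = 0.99`, the real constant `gain = 1.0`, and the rule of picking the bin with the largest magnitude in the last row are all hypothetical choices made for the example.

```python
import cmath
import math


def exponential_transform(samples, sample_rate, bin_freqs, r=0.99, gain=1.0):
    """Build the matrix row by row; each row holds one complex value per bin.

    r and gain are illustrative values (assumptions), not taken from the source.
    """
    # Per-bin complex multiplier r * e^(i*2*pi*f/S), as stated in the abstract.
    multipliers = [r * cmath.exp(2j * math.pi * f / sample_rate) for f in bin_freqs]

    row = [0j] * len(bin_freqs)  # the first row is initialized to zero
    rows = [row]
    for x in samples:
        # New value = previous value * multiplier + sample * real constant.
        row = [prev * m + gain * x for prev, m in zip(row, multipliers)]
        rows.append(row)
    return rows


def strongest_bin(rows, bin_freqs):
    """Assumed selection rule: the bin with the largest magnitude in the last row."""
    last = rows[-1]
    index = max(range(len(last)), key=lambda i: abs(last[i]))
    return bin_freqs[index]


if __name__ == "__main__":
    S = 1000.0  # samples per second
    tone = [math.cos(2 * math.pi * 100.0 * n / S) for n in range(500)]
    bins = [50.0, 100.0, 150.0, 200.0]
    matrix = exponential_transform(tone, S, bins)
    print(strongest_bin(matrix, bins))
```

Each bin behaves like a one-pole complex resonator, so a value of r closer to 1 gives a narrower bin and a longer memory of past samples.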

    AUDIO MATCHING WITH SEMANTIC AUDIO RECOGNITION AND REPORT GENERATION
    2.
    Patent application
    Status: in force

    Publication No.: US20140180674A1

    Publication date: 2014-06-26

    Application No.: US13725004

    Filing date: 2012-12-21

    Applicant: ARBITRON INC.

    IPC classification: G10L19/018

    Abstract: System, apparatus and method for determining semantic information from audio, where incoming audio is sampled and processed to extract audio features, including temporal, spectral, harmonic and rhythmic features. The extracted audio features are compared to stored audio templates that include ranges and/or values for certain features and are tagged for specific ranges and/or values. The semantic information may be associated with audio signature data. Extracted audio features that are most similar to one or more templates from the comparison are identified according to the tagged information. The tags are used to determine the semantic audio data that includes genre, instrumentation, style, acoustical dynamics, and emotive descriptor for the audio signal.

    Method and apparatus for best matching an audible query to a set of audible targets
    4.
    Granted patent
    Status: in force

    Publication No.: US08049093B2

    Publication date: 2011-11-01

    Application No.: US12649458

    Filing date: 2009-12-30

    IPC classification: G04B13/00

    Abstract: During operation, a “coarse search” stage applies variable-scale windowing on the query pitch contours to compare them with fixed-length segments of target pitch contours to find matching candidates while efficiently scanning over variable tempo differences and target locations. Because the target segments are of fixed length, this has the effect of drastically reducing the storage space required in a prior-art method. Furthermore, by breaking the query contours into parts, rhythmic inconsistencies can be more flexibly handled. Normalization is also applied to the contours to allow comparisons independent of differences in musical key. In a “fine search” stage, a “segmental” dynamic time warping (DTW) method is applied that calculates a more accurate similarity score between the query and each candidate target with more explicit consideration toward rhythmic inconsistencies.
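
Two of the ideas in this abstract, key-independent normalization and dynamic time warping, can be illustrated with a short sketch. It uses the classic whole-sequence DTW recurrence rather than the "segmental" variant named in the abstract, and it assumes that normalization means subtracting the mean pitch of each contour; both choices, along with the function names, are assumptions made for illustration.

```python
def normalize_contour(contour):
    """Subtract the mean pitch so a transposed melody yields the same contour.

    Mean subtraction is an assumed normalization; the source does not specify one.
    """
    mean = sum(contour) / len(contour)
    return [pitch - mean for pitch in contour]


def dtw_distance(a, b):
    """Classic dynamic time warping distance between two pitch contours.

    Lower values mean the contours are more similar after allowing notes
    to be stretched or compressed in time.
    """
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] = best cumulative cost of aligning a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(a[i - 1] - b[j - 1])
            cost[i][j] = step + min(
                cost[i - 1][j],      # a advances, b holds
                cost[i][j - 1],      # b advances, a holds
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


if __name__ == "__main__":
    query = normalize_contour([60, 62, 64])   # pitches as MIDI note numbers
    target = normalize_contour([67, 69, 71])  # same melody, transposed up
    print(dtw_distance(query, target))
```

A candidate found by a cheaper coarse stage would be rescored with a distance of this kind; a slower rendition of the same melody still aligns at low cost because repeated frames can map onto a single query frame.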

    Image processing apparatus, image display system, program, and storage medium
    5.
    Patent application
    Status: under examination (published)

    Publication No.: US20050031212A1

    Publication date: 2005-02-10

    Application No.: US10891591

    Filing date: 2004-07-14

    Applicant: Tooru Suino

    Inventor: Tooru Suino

    Abstract: A technique is disclosed for evaluating an audio characteristic such as singing ability, and processing an image to be displayed according to the evaluation result in a manner that can attract the interest of a user. JPEG 2000 code data of a moving image for a karaoke system, for example, are transmitted from a server to a client along with accompanying audio data, and the code data are then decoded at a decoder to form an image to be displayed. An audio signal such as the voice of the user that is input to a microphone is evaluated at an evaluation unit, and the evaluation result is transmitted to the server. Based on this evaluation result, an inter-code transform unit conducts image processing by selectively discarding codes from the code data of an image that is to be transmitted to the client.

    Inverse transform narrow band/broad band sound synthesis
    7.
    Granted patent
    Status: lapsed

    Publication No.: US5686683A

    Publication date: 1997-11-11

    Application No.: US551889

    Filing date: 1995-10-23

    Applicant: Adrian Freed

    Inventor: Adrian Freed

    Abstract: An additive sound synthesis process for generating complex, realistic sounds is realized in a computationally efficient manner. In accordance with one aspect of the invention, polyphony is efficiently achieved by dosing the energy of a given partial between separate transform sums corresponding to different channels. In accordance with another aspect of the invention, noise is injected by randomly perturbing the phase of the sound, either on a per-partial basis or on a transform-sum basis. In the latter instance, the phase is perturbed in different regions of the spectrum to a degree determined by the amount of energy present in the respective regions of the spectrum. In accordance with yet another aspect of the invention, a transform sum representing a sound is processed in the transform domain to achieve, with great economy, effects achievable only at much greater expense outside the transform domain. Other transforms besides the Fourier transform may be used to advantage. For example, use of the Hartley transform produces comparable results but allows transforms to be computed at approximately twice the speed of the Fourier transform.
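
For readers unfamiliar with the Hartley transform mentioned at the end of this abstract, the sketch below computes the discrete Hartley transform directly from its definition. It is a plain O(N²) illustration with hypothetical function names, not a fast algorithm and not the synthesis method of the patent. It shows the property that makes the transform attractive for real-valued audio: it uses only real arithmetic and, up to a factor of 1/N, is its own inverse.

```python
import math


def dht(x):
    """Discrete Hartley transform: H[k] = sum_t x[t] * (cos(a) + sin(a)),
    where a = 2*pi*k*t/N. Input and output are both real-valued."""
    n = len(x)
    out = []
    for k in range(n):
        total = 0.0
        for t, value in enumerate(x):
            angle = 2.0 * math.pi * k * t / n
            total += value * (math.cos(angle) + math.sin(angle))
        out.append(total)
    return out


def inverse_dht(h):
    """The inverse is the same transform scaled by 1/N."""
    n = len(h)
    return [value / n for value in dht(h)]


if __name__ == "__main__":
    signal = [1.0, 2.0, 3.0, 4.0]
    spectrum = dht(signal)
    print(spectrum)
    print(inverse_dht(spectrum))  # recovers the signal up to rounding error
```

Because no complex numbers are involved, a fast Hartley algorithm handles real input with roughly half the arithmetic of a complex FFT of the same length, which is consistent with the speed claim in the abstract.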

    Electronically-simulated live music
    10.
    Granted patent
    Status: lapsed

    Publication No.: US08670577B2

    Publication date: 2014-03-11

    Application No.: US12906647

    Filing date: 2010-10-18

    IPC classification: H04B1/00; G06F17/00

    Abstract: A method for producing an electronically-simulated live musical performance, the method comprising providing morph-friendly solo tracks, morphing the morph-friendly solo tracks to produce a morphed track, and post-processing the morphed track. The method may also include combining the post-processed morphed track with one or more supporting tracks to produce an acoustic image for playback.
