Audio classification by comparison of feature sections and integrated features to known references
    1.
    Granted patent
    Audio classification by comparison of feature sections and integrated features to known references (in force)

    Publication number: US08892497B2

    Publication date: 2014-11-18

    Application number: US13382362

    Filing date: 2011-03-15

    Abstract: To classify moving images using audio signals. An audio signal is acquired, and a section feature relating to an audio frequency distribution is extracted for each of a plurality of sections of predetermined length contained in the acquired audio signal. Each extracted section feature is compared with each of a set of reference section features to calculate a section similarity indicating the degree of correlation between the section feature and the reference section feature. An integrated feature relating to the plurality of sections, calculated from the section similarities of those sections, is extracted from the acquired audio signal. The extracted integrated feature is compared with each of one or more reference integrated features, and the audio signal is classified based on the comparison result. The classification result is then used for moving image classification.

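The pipeline described in this abstract (section features, section similarities, integrated feature, comparison with references) can be illustrated with a minimal Python sketch. The abstract does not specify the feature, the similarity measure, or the integration rule, so everything concrete here is an assumption made for illustration: band energies from a naive DFT as the section feature, cosine similarity as the degree of correlation, the per-reference mean similarity as the integrated feature, and all function names.

```python
import cmath
import math

def section_feature(section, n_bands=4):
    """Assumed section feature: normalized energy per frequency band (naive DFT)."""
    n = len(section)
    half = n // 2
    mags = [abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(section)))
            for k in range(half)]
    band = half // n_bands
    energies = [sum(m * m for m in mags[b * band:(b + 1) * band])
                for b in range(n_bands)]
    total = sum(energies) or 1.0
    return [e / total for e in energies]

def cosine(a, b):
    """Assumed similarity measure between two feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def integrated_feature(signal, section_len, ref_section_features):
    """Mean similarity of all sections to each reference section feature."""
    sections = [signal[i:i + section_len]
                for i in range(0, len(signal) - section_len + 1, section_len)]
    if not sections:
        raise ValueError("signal is shorter than one section")
    sims = [[cosine(section_feature(s), r) for r in ref_section_features]
            for s in sections]
    return [sum(row[j] for row in sims) / len(sims)
            for j in range(len(ref_section_features))]

def classify(signal, section_len, ref_section_features, ref_integrated):
    """Return the label whose reference integrated feature is most similar."""
    feat = integrated_feature(signal, section_len, ref_section_features)
    return max(ref_integrated, key=lambda label: cosine(feat, ref_integrated[label]))

# Usage with two hypothetical categories, "low" and "high" tones.
refs = [[1.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 1.0]]
ref_int = {"low": [1.0, 0.0], "high": [0.0, 1.0]}
low_tone = [math.sin(2 * math.pi * 1 * t / 32) for t in range(128)]
print(classify(low_tone, 32, refs, ref_int))
```

The two-stage comparison is the point of the sketch: individual sections are matched against short-term references first, and only the aggregate of those matches is compared with the category references.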

    AUDIO CLASSIFICATION DEVICE, METHOD, PROGRAM AND INTEGRATED CIRCUIT
    2.
    Patent application
    AUDIO CLASSIFICATION DEVICE, METHOD, PROGRAM AND INTEGRATED CIRCUIT (in force)

    Publication number: US20120136823A1

    Publication date: 2012-05-31

    Application number: US13382362

    Filing date: 2011-03-15

    IPC classification: G06N5/02

    Abstract: To classify moving images using audio signals. An audio signal is acquired, and a section feature relating to an audio frequency distribution is extracted for each of a plurality of sections of predetermined length contained in the acquired audio signal. Each extracted section feature is compared with each of a set of reference section features to calculate a section similarity indicating the degree of correlation between the section feature and the reference section feature. An integrated feature relating to the plurality of sections, calculated from the section similarities of those sections, is extracted from the acquired audio signal. The extracted integrated feature is compared with each of one or more reference integrated features, and the audio signal is classified based on the comparison result. The classification result is then used for moving image classification.

    PRESENTATION CONTENT GENERATION DEVICE, PRESENTATION CONTENT GENERATION METHOD, PRESENTATION CONTENT GENERATION PROGRAM, AND INTEGRATED CIRCUIT
    4.
    Patent application
    PRESENTATION CONTENT GENERATION DEVICE, PRESENTATION CONTENT GENERATION METHOD, PRESENTATION CONTENT GENERATION PROGRAM, AND INTEGRATED CIRCUIT (under examination, published)

    Publication number: US20130111373A1

    Publication date: 2013-05-02

    Application number: US13702143

    Filing date: 2011-11-21

    IPC classification: G06F3/0484

    Abstract: To provide a presentation content generation device that generates various types of presentation content by dynamically generating a template appropriate for the substance of each content set. The presentation content generation device includes an attribute information extraction unit 2 that extracts attribute information indicating image features from a content set stored in a local data storage unit 1; a design type determination unit 4 that determines a base land pattern and a color of a template based on the extracted attribute information; a selection index type determination unit 5 that, based on the extracted attribute information, selects one or more contents to be placed on the template and the respective placement positions of the selected contents on the template; and a view format conversion unit 6 that places the selected contents at the respective placement positions to generate a presentation content.

    REGION OF INTEREST IDENTIFICATION DEVICE, REGION OF INTEREST IDENTIFICATION METHOD, REGION OF INTEREST IDENTIFICATION PROGRAM, AND REGION OF INTEREST IDENTIFICATION INTEGRATED CIRCUIT
    5.
    Patent application
    REGION OF INTEREST IDENTIFICATION DEVICE, REGION OF INTEREST IDENTIFICATION METHOD, REGION OF INTEREST IDENTIFICATION PROGRAM, AND REGION OF INTEREST IDENTIFICATION INTEGRATED CIRCUIT (in force)

    Publication number: US20130108244A1

    Publication date: 2013-05-02

    Application number: US13809480

    Filing date: 2012-04-24

    IPC classification: G11B27/031

    Abstract: An interesting section identifying device identifies an interesting section of a video file based on an audio signal included in the video file, the interesting section being a section in which a user is estimated to express interest. The device includes an interesting section candidate extracting unit that extracts an interesting section candidate from the video file, the interesting section candidate being a candidate for the interesting section; a detailed structure determining unit that determines whether the interesting section candidate includes a specific detailed structure; and an interesting section identifying unit that, when the detailed structure determining unit determines that the interesting section candidate includes the detailed structure, identifies the interesting section by analyzing a specific section, the specific section including the detailed structure and being shorter than the interesting section candidate.

    ANCHOR MODEL ADAPTATION DEVICE, INTEGRATED CIRCUIT, AV (AUDIO VIDEO) DEVICE, ONLINE SELF-ADAPTATION METHOD, AND PROGRAM THEREFOR
    6.
    Patent application
    ANCHOR MODEL ADAPTATION DEVICE, INTEGRATED CIRCUIT, AV (AUDIO VIDEO) DEVICE, ONLINE SELF-ADAPTATION METHOD, AND PROGRAM THEREFOR (under examination, published)

    Publication number: US20120093327A1

    Publication date: 2012-04-19

    Application number: US13379827

    Filing date: 2011-04-19

    IPC classification: H04R29/00

    CPC classification: G10L25/57 G10L2015/0631

    Abstract: The present invention provides a device that performs online self-adaptation of anchor models for an acoustic space, and a method thereof, the anchor models being used for categorization of an AV stream, which is performed based on an audio stream in the AV stream. The device divides an input audio stream into audio segments, each estimated to have a single acoustic feature, and estimates a single probability model for each audio segment. The device then performs clustering on the estimated probability models and the probability models stored therein, thereby generating a new anchor model.

    Method and apparatus of speech recognition and speech control system using the speech recognition method
    7.
    Granted patent
    Method and apparatus of speech recognition and speech control system using the speech recognition method (in force)

    Publication number: US06308152B1

    Publication date: 2001-10-23

    Application number: US09337734

    Filing date: 1999-06-22

    IPC classification: G10L15/22

    CPC classification: G10L15/08

    Abstract: A string of acoustic feature parameters for each recognition-desired word and a string of acoustic feature parameters for each reception word are registered in advance. When an uttered word is received, a string of acoustic feature parameters is extracted from the uttered word, the string of acoustic feature parameters of the uttered word is compared with the string of acoustic feature parameters of each recognition-desired word, and a recognition-desired word recognition score indicating a similarity degree between the uttered word and each recognition-desired word is calculated. Also, a reception word recognition score indicating a similarity degree between the uttered word and each reception word is calculated. In cases where a particular recognition-desired word recognition score corresponding to a particular recognition-desired word is higher than the highest reception word recognition score, the uttered word is recognized as the particular recognition-desired word, and an operation of an electric apparatus is controlled according to the particular recognition-desired word. In contrast, in cases where a particular reception word recognition score corresponding to a particular reception word is higher than the highest recognition-desired word recognition score, the uttered word is recognized as the particular reception word and is rejected, so that the electric apparatus is not operated.

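The accept/reject rule in this abstract can be sketched in a few lines of Python. The abstract does not say how the recognition score is computed, so this sketch assumes scalar feature parameters and uses the negative dynamic time warping (DTW) distance as the similarity degree. The function names and the tie-breaking rule (ties are rejected) are likewise assumptions.

```python
def dtw_distance(a, b):
    """Classic DTW distance between two sequences of scalar features."""
    inf = float("inf")
    d = [[inf] * (len(b) + 1) for _ in range(len(a) + 1)]
    d[0][0] = 0.0
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            cost = abs(a[i - 1] - b[j - 1])
            d[i][j] = cost + min(d[i - 1][j], d[i][j - 1], d[i - 1][j - 1])
    return d[len(a)][len(b)]

def score(utterance, template):
    """Assumed recognition score: higher means more similar."""
    return -dtw_distance(utterance, template)

def recognize(utterance, desired, reception):
    """Return the recognized command word, or None if the utterance is rejected.

    desired:   dict mapping recognition-desired words to feature strings
    reception: dict mapping reception (rejection) words to feature strings
    """
    best_word = max(desired, key=lambda w: score(utterance, desired[w]))
    best_desired = score(utterance, desired[best_word])
    best_reception = max(score(utterance, t) for t in reception.values())
    # Accept only if a command word beats every reception word (ties rejected).
    return best_word if best_desired > best_reception else None

# Usage with hypothetical templates.
desired = {"on": [1, 2, 3], "off": [3, 2, 1]}
reception = {"noise": [5, 5, 5]}
print(recognize([1, 2, 2, 3], desired, reception))  # closest to "on"
print(recognize([5, 5, 4], desired, reception))     # closest to "noise", rejected
```

Registering reception words gives the recognizer an explicit alternative to match, so unrelated utterances are rejected by comparison rather than by a fixed score threshold.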

    Image classification apparatus, image classification method, program, recording medium, integrated circuit, and model creation apparatus
    8.
    Granted patent
    Image classification apparatus, image classification method, program, recording medium, integrated circuit, and model creation apparatus (in force)

    Publication number: US08953895B2

    Publication date: 2015-02-10

    Application number: US13574878

    Filing date: 2011-10-06

    IPC classification: G06K9/62 G06K9/46 G06F17/30

    CPC classification: G06F17/30247

    Abstract: The image classification apparatus extracts first features of each received image (S22) and second features of a relevant image that is relevant to each received image (S25). Subsequently, the image classification apparatus obtains a third feature by calculation using the locality of the extracted first and second features, the third feature being distinctive of a target object of each received image (S26), and creates model data based on the obtained third feature (S27).

    AUDIO PROCESSING DEVICE, AUDIO PROCESSING METHOD, PROGRAM AND INTEGRATED CIRCUIT
    9.
    Patent application
    AUDIO PROCESSING DEVICE, AUDIO PROCESSING METHOD, PROGRAM AND INTEGRATED CIRCUIT (in force)

    Publication number: US20140043543A1

    Publication date: 2014-02-13

    Application number: US14113481

    Filing date: 2013-03-11

    IPC classification: H04N5/60

    Abstract: An audio processing device, including a feature calculation unit, a boundary calculation unit and a judgment unit, detects points of change of audio features from an audio signal in an AV content. The feature calculation unit calculates, for each unit section of the audio signal, section feature data expressing features of the audio signal in the unit section. The boundary calculation unit calculates, for each target unit section among the unit sections of the audio signal, a piece of boundary information relating to at least one boundary of a similarity section. The similarity section consists of consecutive unit sections, inclusive of the target unit section, each of which has similar section feature data. The judgment unit calculates a priority for each boundary indicated by one or more of the pieces of boundary information and judges, based on the priority, whether the boundary is a scene change point.

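The boundary-voting idea in this abstract can be sketched as follows. The abstract leaves the feature data, the similarity test, and the priority formula open, so this Python sketch makes simplifying assumptions: one scalar feature per unit section, similarity defined as an absolute difference from the target section within a threshold, and priority defined as the number of target sections whose similarity section ends at a given boundary. All names are hypothetical.

```python
from collections import Counter

def similarity_section(features, target, threshold):
    """Grow a run of consecutive unit sections around `target` whose features
    stay within `threshold` of the target's feature. Returns inclusive indices."""
    start = target
    while start > 0 and abs(features[start - 1] - features[target]) <= threshold:
        start -= 1
    end = target
    while end < len(features) - 1 and abs(features[end + 1] - features[target]) <= threshold:
        end += 1
    return start, end

def scene_change_points(features, threshold, min_priority):
    """Return boundary indices judged to be scene change points.

    A boundary index b means "between unit section b-1 and unit section b".
    Its priority (assumed here) is the number of boundary votes it receives.
    """
    votes = Counter()
    for target in range(len(features)):
        start, end = similarity_section(features, target, threshold)
        if start > 0:
            votes[start] += 1      # left edge of the similarity section
        if end < len(features) - 1:
            votes[end + 1] += 1    # right edge of the similarity section
    return sorted(b for b, v in votes.items() if v >= min_priority)

# Usage: two acoustically different scenes of four unit sections each.
features = [0.0, 0.1, 0.0, 0.1, 5.0, 5.1, 5.0, 5.1]
print(scene_change_points(features, threshold=0.5, min_priority=3))
```

Because every unit section votes for the edges of its own similarity section, a genuine scene change collects votes from both sides, while an isolated outlier section produces only weakly supported boundaries that the priority threshold can filter out.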

    HEARING AID APPARATUS
    10.
    Patent application
    HEARING AID APPARATUS (in force)

    Publication number: US20120063620A1

    Publication date: 2012-03-15

    Application number: US13320613

    Filing date: 2010-06-16

    IPC classification: H04R25/00

    Abstract: A call other than a conversation partner call and various sounds are detected by input audio signals from plural microphones without deteriorating voice recognition precision. A hearing aid apparatus according to the present invention corrects a frequency characteristic of the call voice other than the conversation partner voice based on an arrival direction of the call voice other than the conversation partner voice, which is estimated based on the audio signal converted by the plural microphones, checks a call word standard pattern representing features of a phoneme and a syllabic sound based on other voice data picked up by using the microphones having one characteristic against a call voice other than the conversation partner voice in which the frequency characteristic is corrected by the frequency characteristic correction processing unit to determine whether the call voice is a call word, and forms a directivity in the direction other than the arrival direction of the voice of the conversation partner. Then, the hearing aid apparatus according to the present invention corrects the frequency characteristic of the call voice other than the conversation partner voice so as to provide the same characteristic as that of the microphones at the time of creating the audio standard pattern.
