AUTOMATED DETECTION AND FILTERING OF AUDIO ADVERTISEMENTS
    53.
    发明申请
    AUTOMATED DETECTION AND FILTERING OF AUDIO ADVERTISEMENTS 有权
    自动检测和过滤音频广告

    公开(公告)号:US20130268103A1

    公开(公告)日:2013-10-10

    申请号:US13867264

    申请日:2013-04-22

    IPC分类号: G06F17/00

    摘要: Methods, apparatuses, and media for filtering a data stream are provided. The data stream is partitioned into a plurality of data stream segments. An acoustic parameter of each of the data stream segments is measured, and it is determined whether the acoustic parameter of each of the data stream segments satisfies a predetermined condition. Extraneous segments of the data stream segments are identified in which the predetermined condition is satisfied, and it is determined whether the extraneous segments have a predetermined relationship in the data stream. The extraneous segments are deleted from the data stream to produce a filtered data stream in response to the extraneous segments having the predetermined relationship.

    摘要翻译: 提供了用于过滤数据流的方法,设备和介质。 数据流被划分成多个数据流段。 测量每个数据流段的声学参数,并且确定每个数据流段的声学参数是否满足预定条件。 识别数据流段的外部段,其中满足预定条件,并且确定外部段在数据流中是否具有预定关系。 响应于具有预定关系的外部段,从数据流中删除无关段以产生经滤波的数据流。

    Method and System For Endpoint Automatic Detection of Audio Record
    54.
    发明申请
    Method and System For Endpoint Automatic Detection of Audio Record 有权
    端点自动检测音频记录的方法和系统

    公开(公告)号:US20130197911A1

    公开(公告)日:2013-08-01

    申请号:US13878818

    申请日:2010-10-29

    IPC分类号: G10L15/26

    摘要: A method and system for endpoint automatic detection of audio record is provided. The method comprises the following steps: acquiring a audio record text and affirming the text endpoint acoustic model for the audio record text; starting acquiring the audio record data of each frame in turn from the audio record start frame in the audio record data; affirming the characteristics acoustic model of the decoding optimal path for the acquired current frame of the audio record data; comparing the characteristics acoustic model of the decoding optimal path acquired from the current frame of the audio record data with the endpoint acoustic model to determine if they are the same; if yes, updating a mute duration threshold with a second time threshold, wherein the second time threshold is less than a first time threshold. This method can improve the recognizing efficiency of the audio record endpoint.

    摘要翻译: 提供了一种端点自动检测音频记录的方法和系统。 该方法包括以下步骤:获取音频记录文本并确定音频记录文本的文本端点声学模型; 依次从音频记录数据中的音频记录开始帧获取每个帧的音频记录数据; 确定音频记录数据的获取的当前帧的解码最优路径的特征声学模型; 将从音频记录数据的当前帧获取的解码最优路径的特征声学模型与端点声学模型进行比较,以确定它们是否相同; 如果是,则用第二时间阈值更新静音持续时间阈值,其中所述第二时间阈值小于第一时间阈值。 该方法可以提高音频记录端点的识别效率。

    System for detecting speech with background voice estimates and noise estimates
    55.
    发明授权
    System for detecting speech with background voice estimates and noise estimates 有权
    用于用背景语音估计和噪声估计来检测语音的系统

    公开(公告)号:US08457961B2

    公开(公告)日:2013-06-04

    申请号:US13566603

    申请日:2012-08-03

    IPC分类号: G10L15/20 G10L15/04

    CPC分类号: G10L25/87

    摘要: A system detects a speech segment that may include unvoiced, fully voiced, or mixed voice content. The system includes a window function that passes signals within a programmed aural frequency range while substantially blocking signals above and below the programmed aural frequency range. A frequency converter converts the signals passing within the programmed aural frequency range into a plurality of frequency bins. A background voice detector estimates the strength of a background speech segment relative to the noise of selected portions of the aural spectrum. A noise estimator estimates a maximum distribution of noise to an average of an acoustic noise power of some of the plurality of frequency bins. A voice detector compares the strength of a desired speech segment to a maximum of an output of the background voice detector and an output of the noise estimator.

    摘要翻译: 系统检测可能包括清音,完全有声或混合语音内容的语音段。 该系统包括窗口功能,其在编程的听觉频率范围内传递信号,同时基本上阻挡在编程的听觉频率范围之上和之下的信号。 变频器将在编程的听觉频率范围内通过的信号转换成多个频率仓。 背景语音检测器估计背景语音段相对于听觉频谱的所选部分的噪声的强度。 噪声估计器将噪声的最大分布估计为多个频率仓中的一些频率仓的声学噪声功率的平均值。 语音检测器将所需语音段的强度与背景语音检测器的输出的最大值和噪声估计器的输出进行比较。

    REGION OF INTEREST IDENTIFICATION DEVICE, REGION OF INTEREST IDENTIFICATION METHOD, REGION OF INTEREST IDENTIFICATION PROGRAM, AND REGION OF INTEREST IDENTIFICATION INTEGRATED CIRCUIT
    56.
    发明申请
    REGION OF INTEREST IDENTIFICATION DEVICE, REGION OF INTEREST IDENTIFICATION METHOD, REGION OF INTEREST IDENTIFICATION PROGRAM, AND REGION OF INTEREST IDENTIFICATION INTEGRATED CIRCUIT 有权
    兴趣鉴别设备地区,兴趣鉴别方法区域,兴趣鉴别程序区域和兴趣识别集成电路区域

    公开(公告)号:US20130108244A1

    公开(公告)日:2013-05-02

    申请号:US13809480

    申请日:2012-04-24

    IPC分类号: G11B27/031

    摘要: An interesting section identifying device for identifying an interesting section of a video file based on an audio signal included in the video file, the interesting section being a section in which a user is estimated to express interest, includes an interesting section candidate extracting unit that extracts an interesting section candidate from the video file, the interesting section candidate being a candidate for the interesting section, a detailed structure determining unit that determines whether the interesting section candidate includes a specific detailed structure, and an interesting section identifying unit that identifies the interesting section by analyzing a specific section when the detailed structure determining unit determines that the interesting section candidate includes the detailed structure, the specific section including the detailed structure and being shorter than the interesting section candidate.

    摘要翻译: 一种有趣的部分识别装置,用于基于包括在视频文件中的音频信号识别视频文件的有趣部分,感兴趣部分是用户被估计表示兴趣的部分,包括提取的感兴趣部分候选提取单元 来自视频文件的有趣的部分候选者,感兴趣的部分候选者是感兴趣部分的候选者,确定感兴趣部分候选是否包括特定详细结构的详细结构确定单元,以及识别感兴趣部分的感兴趣部分识别单元 通过分析特定部分,当详细结构确定单元确定感兴趣部分候选包括详细结构时,具体部分包括详细结构并且短于感兴趣部分候选。

    Structure and method for echo reduction without loss of information
    58.
    发明授权
    Structure and method for echo reduction without loss of information 有权
    回波减少的结构和方法,而不会丢失信息

    公开(公告)号:US08422663B1

    公开(公告)日:2013-04-16

    申请号:US13168762

    申请日:2011-06-24

    申请人: James H. Parry

    发明人: James H. Parry

    摘要: An echo reduction method stores a received audio information stream. A sound detection flag is activated following detection of locally generated sound. Output based on the received audio information stream is muted in response to the activating the sound detection flag. Rendering status of the received audio information stream is saved, in response to the activating the sound detection flag, to reduce loss of audio information. At least a portion of the stored received audio information stream is rendered following inactivation of the sound detection flag.

    摘要翻译: 回波减少方法存储所接收的音频信息流。 检测到本地产生的声音后,声音检测标志被激活。 响应于激活声音检测标志,基于接收到的音频信息流的输出被静音。 响应于激活声音检测标志,节省了所接收的音频信息流的渲染状态,以减少音频信息的丢失。 存储的所接收的音频信息流的至少一部分在声音检测标志失效之后呈现。

    AUDIO ANALYSIS APPARATUS AND AUDIO ANALYSIS SYSTEM
    59.
    发明申请
    AUDIO ANALYSIS APPARATUS AND AUDIO ANALYSIS SYSTEM 有权
    音频分析设备和音频分析系统

    公开(公告)号:US20130080170A1

    公开(公告)日:2013-03-28

    申请号:US13412214

    申请日:2012-03-05

    IPC分类号: G10L17/00

    摘要: An audio analysis apparatus includes the following components. A main body includes a discrimination unit and a transmission unit. A strap is used for hanging the main body from a user's neck. A first audio acquisition device is provided to the strap or the main body. A second audio acquisition device is provided to the strap at a position where a distance between the second audio acquisition device and the user's mouth is smaller than the distance between the first audio acquisition device and the user's in a state where the strap is worn around the user's neck. The discrimination unit discriminates whether an acquired sound is an uttered voice of the user or of another person by comparing audio signals of the sound acquired by the first and second audio acquisition devices. The transmission unit transmits information including the discrimination result to an external apparatus.

    摘要翻译: 音频分析装置包括以下部件。 主体包括识别单元和发送单元。 用于将主体从使用者的脖子上悬挂下来。 第一音频获取装置被提供给带或主体。 第二音频采集装置被提供给带子,在第二音频采集装置和用户嘴部之间的距离小于第一音频采集装置与用户之间的距离的位置处, 用户的脖子。 识别单元通过比较由第一和第二音频获取装置获取的声音的音频信号来鉴别所获取的声音是用户或另一人的发声。 发送单元将包含鉴别结果的信息发送到外部设备。

    SYSTEM FOR DETECTING SPEECH WITH BACKGROUND VOICE ESTIMATES AND NOISE ESTIMATES
    60.
    发明申请
    SYSTEM FOR DETECTING SPEECH WITH BACKGROUND VOICE ESTIMATES AND NOISE ESTIMATES 有权
    用背景语音估计和噪声估计来检测语音的系统

    公开(公告)号:US20120303366A1

    公开(公告)日:2012-11-29

    申请号:US13566603

    申请日:2012-08-03

    IPC分类号: G10L15/20

    CPC分类号: G10L25/87

    摘要: A system detects a speech segment that may include unvoiced, fully voiced, or mixed voice content. The system includes a window function that passes signals within a programmed aural frequency range while substantially blocking signals above and below the programmed aural frequency range. A frequency converter converts the signals passing within the programmed aural frequency range into a plurality of frequency bins. A background voice detector estimates the strength of a background speech segment relative to the noise of selected portions of the aural spectrum. A noise estimator estimates a maximum distribution of noise to an average of an acoustic noise power of some of the plurality of frequency bins. A voice detector compares the strength of a desired speech segment to a maximum of an output of the background voice detector and an output of the noise estimator.

    摘要翻译: 系统检测可能包括清音,完全有声或混合语音内容的语音段。 该系统包括窗口功能,其在编程的听觉频率范围内传递信号,同时基本上阻挡在编程的听觉频率范围之上和之下的信号。 变频器将在编程的听觉频率范围内通过的信号转换成多个频率仓。 背景语音检测器估计背景语音段相对于听觉频谱的所选部分的噪声的强度。 噪声估计器将噪声的最大分布估计为多个频率仓中的一些频率仓的声学噪声功率的平均值。 语音检测器将所需语音段的强度与背景语音检测器的输出的最大值和噪声估计器的输出进行比较。