Frequency domain noise detection of audio with tone parameter

    公开(公告)号:US10089999B2

    公开(公告)日:2018-10-02

    申请号:US15380163

    申请日:2016-12-15

    Inventor: Lijing Xu

    Abstract: A noise detection method and apparatus are disclosed. The noise detection method includes: obtaining a frequency-domain energy distribution parameter of a current frame of an audio signal, and obtaining a frequency-domain energy distribution parameter; obtaining a tone parameter of the current frame, and obtaining a tone parameter; determining, according to the tone parameter of the current frame and the tone parameter of each of the frames in the preset neighboring domain range of the current frame, whether the current frame is in a speech section or a non-speech section; and determining that the current frame is speech-grade noise if the current frame is in a speech section and a quantity of frequency-domain energy distribution parameters falling within a preset speech-grade noise frequency-domain energy distribution parameter interval in all the frequency-domain energy distribution parameters is greater than or equal to a first threshold.

    Noise Detection Method and Apparatus
    2.
    发明申请

    公开(公告)号:US20170098455A1

    公开(公告)日:2017-04-06

    申请号:US15380163

    申请日:2016-12-15

    Inventor: Lijing Xu

    CPC classification number: G10L21/0232 G10L25/18 G10L25/21 G10L25/84 G10L25/90

    Abstract: A noise detection method and apparatus are disclosed. The noise detection method includes: obtaining a frequency-domain energy distribution parameter of a current frame of an audio signal, and obtaining a frequency-domain energy distribution parameter; obtaining a tone parameter of the current frame, and obtaining a tone parameter; determining, according to the tone parameter of the current frame and the tone parameter of each of the frames in the preset neighboring domain range of the current frame, whether the current frame is in a speech section or a non-speech section; and determining that the current frame is speech-grade noise if the current frame is in a speech section and a quantity of frequency-domain energy distribution parameters falling within a preset speech-grade noise frequency-domain energy distribution parameter interval in all the frequency-domain energy distribution parameters is greater than or equal to a first threshold.

    Voice Quality Monitoring Method and Apparatus
    3.
    发明申请
    Voice Quality Monitoring Method and Apparatus 审中-公开
    语音质量监测方法与设备

    公开(公告)号:US20150179187A1

    公开(公告)日:2015-06-25

    申请号:US14640354

    申请日:2015-03-06

    Abstract: A voice quality monitoring method and apparatus are provided, which solves a difficult problem of how to perform proper voice quality monitoring on a relatively long audio signal by using relatively low costs. The method includes capturing one or more voice signal segments from an input signal; performing voice segment segmentation on each voice signal segment to obtain one or more voice segments; and performing a voice quality evaluation on the voice segment to obtain a quality evaluation result according to the voice quality evaluation. Because the segmented voice segment includes only a voice signal and is shorter than the input signal, proper voice quality monitoring can be performed on a relatively long audio signal by using relatively low costs, thereby obtaining a more accurate voice quality evaluation result.

    Abstract translation: 提供了一种语音质量监测方法和装置,其解决了如何通过使用相对低的成本在相对长的音频信号上执行适当的语音质量监测的难题。 该方法包括从输入信号捕获一个或多个语音信号段; 在每个语音信号段执行语音段分段以获得一个或多个语音段; 并且对语音段执行语音质量评估,以根据语音质量评估获得质量评估结果。 因为分段语音段仅包括语音信号并且比输入信号短,所以可以通过使用相对低的成本在相对长的音频信号上执行适当的语音质量监视,从而获得更准确的语音质量评估结果。

    Method and apparatus for detecting audio signal according to frequency domain energy

    公开(公告)号:US10339956B2

    公开(公告)日:2019-07-02

    申请号:US15361597

    申请日:2016-11-28

    Inventor: Lijing Xu

    Abstract: A method and an apparatus for detecting an audio signal according to frequency domain energy is presented. The method may include receiving an audio signal frame; acquiring frequency domain energy distribution of the audio signal frame; obtaining a maximum value distribution characteristic of a frequency domain energy distribution derivative of the audio signal frame according to the frequency domain energy distribution of the audio signal frame; using the audio signal frame and each frame in a preset neighborhood range of the audio signal frame as a frame set, where the frame set includes a to-be-detected frame; and detecting the to-be-detected frame according to a maximum value distribution characteristic of a frequency domain energy distribution derivative of the frame set. In the various embodiments, detection on an audio signal can be implemented.

    Method and Apparatus for Detecting Audio Signal According to Frequency Domain Energy
    5.
    发明申请
    Method and Apparatus for Detecting Audio Signal According to Frequency Domain Energy 审中-公开
    根据频域能量检测音频信号的方法和装置

    公开(公告)号:US20170076739A1

    公开(公告)日:2017-03-16

    申请号:US15361597

    申请日:2016-11-28

    Inventor: Lijing Xu

    Abstract: A method and an apparatus for detecting an audio signal according to frequency domain energy is presented. The method may include receiving an audio signal frame; acquiring frequency domain energy distribution of the audio signal frame; obtaining a maximum value distribution characteristic of a frequency domain energy distribution derivative of the audio signal frame according to the frequency domain energy distribution of the audio signal frame; using the audio signal frame and each frame in a preset neighborhood range of the audio signal frame as a frame set, where the frame set includes a to-be-detected frame; and detecting the to-be-detected frame according to a maximum value distribution characteristic of a frequency domain energy distribution derivative of the frame set. In the various embodiments, detection on an audio signal can be implemented.

    Abstract translation: 提出了一种根据频域能量检测音频信号的方法和装置。 该方法可以包括接收音频信号帧; 获取音频信号帧的频域能量分布; 根据音频信号帧的频域能量分布获得音频信号帧的频域能量分布导数的最大值分布特性; 使用所述音频信号帧和所述音频信号帧的预设邻域范围中的每个帧作为帧集合,其中所述帧集合包括待检测帧; 以及根据所述帧组的频域能量分布导数的最大值分布特性来检测所述待检测帧。 在各种实施例中,可以实现对音频信号的检测。

    Method and Apparatus for Processing Speech Signal According to Frequency-Domain Energy
    6.
    发明申请
    Method and Apparatus for Processing Speech Signal According to Frequency-Domain Energy 审中-公开
    根据频域能量处理语音信号的方法和装置

    公开(公告)号:US20160351204A1

    公开(公告)日:2016-12-01

    申请号:US15237095

    申请日:2016-08-15

    Inventor: Lijing Xu

    CPC classification number: G10L21/0308 G10L15/04 G10L25/06 G10L25/18 G10L25/78

    Abstract: A method and an apparatus for processing a speech signal according to frequency-domain energy where the method and apparatus include receiving an original speech signal including a first speech frame and a second speech frame that are adjacent to each other, performing a Fourier transform on the first speech frame and the second speech frame, obtaining a frequency-domain energy distribution of the first speech frame and the second speech frame, obtaining a frequency-domain energy correlation coefficient, and segmenting the original speech signal according to the frequency-domain energy correlation coefficient. Hence a problem that a speech signal segmentation result has low accuracy due to a characteristic of a phoneme of a speech signal or severe impact of noise when refined speech signal segmentation is performed may be resolved.

    Abstract translation: 一种用于根据频域能量处理语音信号的方法和装置,其中所述方法和装置包括接收包括彼此相邻的包括第一语音帧和第二语音帧的原始语音信号,对其进行傅立叶变换 第一语音帧和第二语音帧,获得第一语音帧和第二语音帧的频域能量分布,获得频域能量相关系数,并根据频域能量相关分割原始语音信号 系数。 因此,可以解决语音信号分割结果由于语音信号的音素的特性或精细语音信号分割执行时的噪声的严重影响而具有低精度的问题。

    Method and apparatus for detecting voice signal
    7.
    发明授权
    Method and apparatus for detecting voice signal 有权
    用于检测语音信号的方法和装置

    公开(公告)号:US09396739B2

    公开(公告)日:2016-07-19

    申请号:US14747731

    申请日:2015-06-23

    Inventor: Lijing Xu

    CPC classification number: G10L25/78 G10L19/005 G10L25/87 G10L25/90 G10L25/93

    Abstract: The invention discloses a method including: performing in a unit of first timeframe frame length, framing on a continuous voice sample to obtain a plurality of first timeframes, detecting energy of each of the first timeframes, and determining a target first timeframe including a potential abrupt exception of a voice signal by analyzing a relationship between the energy of the plurality of first timeframes; performing, in a unit of second timeframe frame length, framing on the continuous voice sample to obtain a plurality of second timeframes, and processing each of the second timeframes to acquire a tone feature, and determining, by analyzing a tone feature of at least one of the second timeframes including at least one target second timeframe, whether the potential abrupt exception of a voice signal included in the target first timeframe included in the target second timeframe is a real abrupt exception of a voice signal.

    Abstract translation: 本发明公开了一种方法,包括:以第一时间帧长度为单位执行对连续语音样本进行成帧以获得多个第一时间帧,检测每个第一时间帧的能量,以及确定包括潜在突发性的目标第一时间帧 通过分析多个第一时间帧的能量之间的关系来排除语音信号; 以第二时间帧长度为单位,对所述连续语音样本进行成帧以获得多个第二时间帧,并且处理所述第二时间帧中的每一个以获取音调特征,以及通过分析至少一个 包括至少一个目标第二时间帧的第二时间帧,包括在目标第二时间帧中的目标第一时间帧中包括的语音信号的潜在突然异常是否是语音信号的真正突然异常。

Patent Agency Ranking