Method and Apparatus for Frame Loss Concealment in Transform Domain

    Publication No.: US20190096430A1

    Publication date: 2019-03-28

    Application No.: US15926582

    Filing date: 2018-03-20

    Applicant: ZTE CORPORATION

    Abstract: The present document discloses a method and apparatus for compensating for a lost frame in a transform domain. In one method, frequency-domain coefficients of the current lost frame are calculated from the frequency-domain coefficients of one or more frames prior to it, a frequency-time transform is performed to obtain an initially compensated signal, and waveform adjustment is then applied to obtain the compensated signal. Alternatively, the phases and amplitudes of all or part of the frequency points of the current lost frame are extrapolated from the phases and amplitudes of the corresponding frequency points of a plurality of previous frames; these yield the frequency-domain coefficients of those frequency points, and a frequency-time transform is performed to obtain the compensated signal. A judgment algorithm can select between these methods to compensate for the current lost frame, thereby achieving a better compensation effect.
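
The extrapolation alternative can be illustrated for one pair of previous frames. The abstract does not state the extrapolation formula, so this is only a sketch under an assumed rule: the phase of each frequency point keeps advancing by the step observed between the two previous frames, and the amplitude is held at its last value. The function names `extrapolate_bin` and `conceal_frame` are hypothetical, and the frequency-time transform is left out.

```python
import cmath


def extrapolate_bin(prev2: complex, prev1: complex) -> complex:
    """Predict one frequency-domain coefficient of the lost frame.

    Assumed rule (not taken from the patent): linear phase
    extrapolation from the two previous frames, amplitude held.
    """
    phase_step = cmath.phase(prev1) - cmath.phase(prev2)
    phase = cmath.phase(prev1) + phase_step
    amplitude = abs(prev1)
    # cmath.rect is periodic in the angle, so no phase unwrapping is needed.
    return cmath.rect(amplitude, phase)


def conceal_frame(frame_m2, frame_m1):
    """Extrapolate every frequency point of the lost frame.

    frame_m2 and frame_m1 hold the complex coefficients of the frame
    two steps back and the frame one step back, respectively.
    """
    return [extrapolate_bin(a, b) for a, b in zip(frame_m2, frame_m1)]
```

The returned coefficients would then go through the codec's frequency-time transform to produce the compensated signal.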

    Comparing differential ZC count to database to detect expected sound
    4.
    Granted patent
    Legal status: in force

    Publication No.: US09466288B2

    Publication date: 2016-10-11

    Application No.: US14013014

    Filing date: 2013-08-28

    Inventors: Zhenyong Zhang, Wei Ma

    IPC classification: G10L25/09 G10L15/02

    CPC classification: G10L25/48 G10L15/02 G10L25/09

    Abstract: A low power sound recognition sensor is configured to receive an analog signal that may contain a signature sound. Sparse sound parameter information is extracted from the analog signal and compared to a sound parameter reference stored locally with the sound recognition sensor to detect when the signature sound is received in the analog signal. A portion of the sparse sound parameter information is differential zero crossing (ZC) counts. Differential ZC rate may be determined by measuring a number of times the analog signal crosses a threshold value during each of a sequence of time frames to form a sequence of ZC counts and taking a difference between selected pairs of ZC counts to form a sequence of differential ZC counts.
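
The differential ZC computation described in the abstract can be sketched on a sampled signal. This is an illustration only: the patent works on an analog signal, while this sketch assumes discrete samples, fixed-length frames, and adjacent frames as the "selected pairs" (the `lag` parameter). The names `zc_counts` and `differential_zc` are hypothetical.

```python
def zc_counts(samples, frame_len, threshold=0.0):
    """Count threshold crossings inside each complete frame."""
    counts = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        # A crossing occurs when consecutive samples lie on opposite
        # sides of the threshold. Crossings that straddle a frame
        # boundary are not counted in this sketch.
        crossings = sum(
            1 for a, b in zip(frame, frame[1:])
            if (a > threshold) != (b > threshold)
        )
        counts.append(crossings)
    return counts


def differential_zc(counts, lag=1):
    """Difference between each ZC count and the count `lag` frames earlier."""
    return [counts[i] - counts[i - lag] for i in range(lag, len(counts))]
```

For example, `zc_counts([1, -1, 1, -1, 1, 1, 1, 1, 1, -1, -1, -1], 4)` gives `[3, 0, 1]`, and `differential_zc` of that sequence gives `[-3, 1]`.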


    SPEECH RECOGNITION APPARATUS AND SPEECH RECOGNITION METHOD
    5.
    Patent application
    Legal status: in force

    Publication No.: US20160217808A1

    Publication date: 2016-07-28

    Application No.: US14659542

    Filing date: 2015-03-16

    Applicant: Acer Incorporated

    Abstract: A speech recognition apparatus and a speech recognition method are provided. Whether an original voice sampling signal corresponding to a target voice frame is a noise signal is determined according to three ratios: the energy of a first consonant frequency band signal to the energy of a second consonant frequency band signal, the energy of the first consonant frequency band signal to the energy of the original voice sampling signal, and the energy of the second consonant frequency band signal to the energy of the original voice sampling signal.
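
The three energy ratios named in the abstract can be computed as below. This is a rough sketch: it assumes the two consonant-band signals have already been obtained by filtering, and the function names are hypothetical. The abstract does not say how the ratios are turned into a decision, so `is_noise` is a placeholder rule with caller-supplied limits, not the patented criterion.

```python
def energy(samples):
    """Sum of squared sample values."""
    return sum(x * x for x in samples)


def consonant_band_ratios(band1, band2, original):
    """Return (E1/E2, E1/E0, E2/E0) for the two band signals and the original."""
    e1, e2, e0 = energy(band1), energy(band2), energy(original)
    eps = 1e-12  # guards against division by zero on silent frames
    return (e1 / (e2 + eps), e1 / (e0 + eps), e2 / (e0 + eps))


def is_noise(ratios, limits):
    """Placeholder rule (assumption): noise if every ratio is below its limit."""
    return all(r < lim for r, lim in zip(ratios, limits))
```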


    QUALITY-OF-EXPERIENCE MEASUREMENT FOR VOICE SERVICES
    6.
    Patent application
    Legal status: in force

    Publication No.: US20160014187A1

    Publication date: 2016-01-14

    Application No.: US14864920

    Filing date: 2015-09-25

    Abstract: An example method to determine a quality-of-experience (QoE) metric for a network communication includes receiving a media signal from the network communication, wherein the media signal includes a voice component, extracting an experience indicator from the voice component, wherein the experience indicator is a voice feature descriptive of a service quality of the network communication, evaluating the experience indicator, retrieving a quality-of-service (QoS) metric if the evaluated experience indicator reflects the service quality of the network possibly being subpar, and determining the QoE metric for the network communication based on the evaluated experience indicator and the retrieved QoS metric for the network communication.
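
The control flow in the abstract, where the QoS metric is retrieved only when the voice-derived indicator suggests a problem, can be sketched as follows. Everything concrete here is an assumption for illustration: the indicator and QoS metric are taken to be scores in [0, 1], `subpar_below` is a made-up threshold, and combining the two with `min` is a placeholder, since the abstract does not specify how they are combined.

```python
def determine_qoe(voice_component, extract_indicator, fetch_qos, subpar_below=0.5):
    """Return a QoE score, fetching the QoS metric only when needed.

    extract_indicator: callable mapping the voice component to a score.
    fetch_qos: zero-argument callable returning the QoS metric; it is
    called only if the indicator looks subpar.
    """
    indicator = extract_indicator(voice_component)
    if indicator >= subpar_below:
        # Voice quality looks acceptable, so the QoS lookup is skipped.
        return indicator
    qos = fetch_qos()
    # Placeholder combination (assumption): take the more pessimistic score.
    return min(indicator, qos)
```

Passing the QoS lookup as a callable keeps the potentially costly retrieval off the path taken by healthy calls.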


    APPARATUS AND METHOD FOR DETECTING SPEECH
    7.
    Patent application
    Legal status: in force

    Publication No.: US20100268533A1

    Publication date: 2010-10-21

    Application No.: US12761489

    Filing date: 2010-04-16

    IPC classification: G10L15/06 G10L15/20

    Abstract: A speech detection apparatus and method are provided. The speech detection apparatus and method determine whether a frame is speech or not using feature information extracted from an input signal. The speech detection apparatus may estimate a situation related to an input frame and determine which feature information is required for speech detection for the input frame in the estimated situation. The speech detection apparatus may detect a speech signal using dynamic feature information that may be better suited to the situation of a particular frame, instead of using the same feature information for each and every frame.


    Storage medium storing breath blowing determining program, breath blowing determining apparatus, breath blowing determining method, storage medium storing game program, game apparatus, and game control method
    8.
    Granted patent
    Legal status: in force

    Publication No.: US07498505B2

    Publication date: 2009-03-03

    Application No.: US11281730

    Filing date: 2005-11-18

    IPC classification: G10H7/00

    Abstract: A game apparatus includes an operating switch and a microphone. A player operates a player object intuitively, either with the operating switch or by inputting a sound. The number of zero crossings contained in the waveform of a sound input through the microphone is detected, and the individual interval times between the zero crossings are also detected. Then, it is determined whether or not the distribution of the interval times (i.e., the frequency distribution) matches the distribution of interval times (frequency distribution) related to a breath sound stored in advance. If the two match, the input sound is recognized as a breath sound, and a game process based on the breath (wind) is carried out. For example, a game screen depicting the breath or wind is displayed on an LCD.
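
The breath-sound check in the abstract can be sketched as three steps: find the zero crossings, collect the intervals between them, and compare the interval distribution with a stored reference. This is only an illustration. The bin layout, the L1 distance, and the `max_distance` threshold are assumptions, since the abstract does not define what counts as a match, and all function names are hypothetical.

```python
def zero_crossing_intervals(samples):
    """Sample counts between consecutive zero crossings."""
    positions = [
        i for i in range(1, len(samples))
        if (samples[i - 1] >= 0) != (samples[i] >= 0)
    ]
    return [b - a for a, b in zip(positions, positions[1:])]


def interval_histogram(intervals, num_bins, bin_width=1):
    """Normalized histogram of interval lengths; long intervals go in the last bin."""
    hist = [0.0] * num_bins
    for iv in intervals:
        hist[min(iv // bin_width, num_bins - 1)] += 1
    total = sum(hist)
    return [h / total for h in hist] if total else hist


def matches_reference(hist, reference, max_distance=0.2):
    """Assumed match rule: L1 distance between distributions within a limit."""
    distance = sum(abs(h - r) for h, r in zip(hist, reference))
    return distance <= max_distance
```

A game loop would call `matches_reference` on each captured audio block and trigger the breath-based game process when it returns `True`.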


    Sound Signal Processing Apparatus and Program
    9.
    Patent application
    Legal status: in force

    Publication No.: US20080154585A1

    Publication date: 2008-06-26

    Application No.: US11962439

    Filing date: 2007-12-21

    Applicant: Yasuo Yoshioka

    Inventor: Yasuo Yoshioka

    IPC classification: G10L15/04

    Abstract: In a sound signal processing apparatus, a frame information generation section generates frame information of each frame of a sound signal. A storage stores the frame information generated by the frame information generation section. A first interval determination section determines a first utterance interval in the sound signal. A second interval determination section determines a second utterance interval based on the frame information of the first utterance interval stored in the storage, such that the second utterance interval is shorter than the first utterance interval and confined within it, by trimming frames from either the start point or the end point of the first utterance interval.
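
The second-interval step can be sketched as trimming a coarse interval inward from its edges. This is an assumption-laden illustration: the stored "frame information" is taken to be one numeric level per frame (for example, an energy value), frames below a hypothetical `min_level` are trimmed, and the name `trim_interval` is made up. The abstract does not say which frame information or criterion the apparatus uses.

```python
def trim_interval(frame_levels, start, end, min_level):
    """Shrink the inclusive frame interval [start, end] from its edges.

    Frames whose level is below min_level are dropped from the start
    and from the end. Low-level frames in the interior are kept, so
    the result always lies within the original interval.
    """
    while start < end and frame_levels[start] < min_level:
        start += 1
    while end > start and frame_levels[end] < min_level:
        end -= 1
    return start, end
```

For example, with levels `[0.1, 0.2, 0.9, 0.8, 0.1, 0.7, 0.05]`, a first interval of `(0, 6)`, and `min_level=0.5`, the trimmed interval is `(2, 5)`.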
