-
公开(公告)号:US20100250260A1
公开(公告)日:2010-09-30
申请号:US12741508
申请日:2007-11-06
申请人: Lasse Laaksonen , Mikko Tammi , Adriana Vasilache , Anssi Ramo
发明人: Lasse Laaksonen , Mikko Tammi , Adriana Vasilache , Anssi Ramo
IPC分类号: G10L21/06
CPC分类号: G10L21/038
摘要: A method including generating from a first audio signal, and via a first encoding and decoding of the first audio signal, a second audio signal; determining at least one energy difference value between the first audio signal and the second audio signal; and calculating at least one signal shaping factor dependent on the at least one energy difference value.
摘要翻译: 一种方法,包括从第一音频信号产生第一音频信号,并经由第一音频信号的第一编码和解码,产生第二音频信号; 确定所述第一音频信号和所述第二音频信号之间的至少一个能量差值; 以及根据所述至少一个能量差值计算至少一个信号整形因子。
-
公开(公告)号:US20080091418A1
公开(公告)日:2008-04-17
申请号:US11580690
申请日:2006-10-13
申请人: Lasse Laaksonen , Anssi Ramo , Adriana Vasilache
发明人: Lasse Laaksonen , Anssi Ramo , Adriana Vasilache
IPC分类号: G10L21/00
摘要: Autocorrelation values are determined as a basis for an estimation of a pitch lag in a segment of an audio signal. A first considered delay range for the autocorrelation computations is divided into a first set of sections, and first autocorrelation values are determined for delays in a plurality of sections of this first set of sections. A second considered delay range for the autocorrelation computations is divided into a second set of sections such that sections of the first set and sections of the second set are overlapping. Second autocorrelation values are determined for delays in a plurality of sections of this second set of sections.
摘要翻译: 自相关值被确定为用于估计音频信号的段中的音调滞后的基础。 用于自相关计算的第一被考虑的延迟范围被划分为第一组部分,并且为该第一组部分的多个部分中的延迟确定第一自相关值。 用于自相关计算的第二被考虑的延迟范围被划分成第二组部分,使得第一组的部分和第二组的部分重叠。 第二自相关值是针对该第二组部分的多个部分中的延迟而确定的。
-
公开(公告)号:US20130346073A1
公开(公告)日:2013-12-26
申请号:US13978130
申请日:2011-01-12
申请人: Lasse Laaksonen , Mikko Tammi , Adriana Vasilache , Anssi Ramo
发明人: Lasse Laaksonen , Mikko Tammi , Adriana Vasilache , Anssi Ramo
IPC分类号: G10L21/0316
CPC分类号: G10L21/0316 , G10L19/0208 , G10L21/0208 , G10L21/038
摘要: Apparatus comprising a noise estimator configured to determine a noise estimate for a first part of an audio signal, a comparator configured to compare the noise estimate to an energy threshold parameter, a damping factor determiner configured to determine a damping factor for at least one sub band gain value of a second part of an audio signal, wherein the damping factor is dependent on a result of the comparison and a gain modifier configured to apply the damping factor to the sub band gain value.
摘要翻译: 包括噪声估计器的装置,被配置为确定音频信号的第一部分的噪声估计,配置成将噪声估计与能量阈值参数进行比较的比较器,被配置为确定至少一个子带的阻尼因子的阻尼因子确定器 音频信号的第二部分的增益值,其中所述阻尼因子取决于比较的结果,以及增益修正器,被配置为将所述阻尼因子应用于所述子带增益值。
-
公开(公告)号:US20100274555A1
公开(公告)日:2010-10-28
申请号:US12741636
申请日:2007-11-06
申请人: Lasse Laaksonen , Mikko Tammi , Adriana Vasilache , Anssi Ramo
发明人: Lasse Laaksonen , Mikko Tammi , Adriana Vasilache , Anssi Ramo
IPC分类号: G10L19/00
CPC分类号: G10L19/0208 , G10L21/038
摘要: An apparatus comprising at least one processor and at least one memory including computer program code the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus at least to determine at least one characteristic of the audio signal; divide the audio signal into at least a low frequency portion and a high frequency portion, and generate from the high frequency portion a plurality of high frequency band signals dependent on the at least one characteristic of the audio signal; and determine for each of the plurality of high frequency band signals at least part of the low frequency portion which can represent the high frequency band signal.
摘要翻译: 一种装置,包括至少一个处理器和包括计算机程序代码的至少一个存储器,所述至少一个存储器和所述计算机程序代码被配置为与所述至少一个处理器一起使所述设备至少确定所述音频的至少一个特性 信号; 将音频信号划分成至少低频部分和高频部分,并且从高频部分产生取决于音频信号的至少一个特性的多个高频带信号; 并且确定可以表示高频带信号的低频部分的至少一部分的多个高频带信号中的每一个。
-
公开(公告)号:US07752038B2
公开(公告)日:2010-07-06
申请号:US11580690
申请日:2006-10-13
申请人: Lasse Laaksonen , Anssi Ramo , Adriana Vasilache
发明人: Lasse Laaksonen , Anssi Ramo , Adriana Vasilache
摘要: Autocorrelation values are determined as a basis for an estimation of a pitch lag in a segment of an audio signal. A first considered delay range for the autocorrelation computations is divided into a first set of sections, and first autocorrelation values are determined for delays in a plurality of sections of this first set of sections. A second considered delay range for the autocorrelation computations is divided into a second set of sections such that sections of the first set and sections of the second set are overlapping. Second autocorrelation values are determined for delays in a plurality of sections of this second set of sections.
摘要翻译: 自相关值被确定为用于估计音频信号的段中的音调滞后的基础。 用于自相关计算的第一被考虑的延迟范围被划分为第一组部分,并且为该第一组部分的多个部分中的延迟确定第一自相关值。 用于自相关计算的第二被考虑的延迟范围被划分成第二组部分,使得第一组的部分和第二组的部分重叠。 第二自相关值是针对该第二组部分的多个部分中的延迟而确定的。
-
公开(公告)号:US20070011009A1
公开(公告)日:2007-01-11
申请号:US11177250
申请日:2005-07-08
申请人: Jani Nurminen , Sakari Himanen , Anssi Ramo , Janne Vainio
发明人: Jani Nurminen , Sakari Himanen , Anssi Ramo , Janne Vainio
IPC分类号: G10L13/08
CPC分类号: G10L13/06
摘要: The invention relates to a support of a concatenative TTS synthesis. In order to generate a speech database as a basis for the TTS synthesis, first, a speech processing including a segmental parametric speech encoding of speech data based on a parametric modeling of speech is performed, which results in compressed parameterized speech segments. Then, the compressed parameterized speech segments are assembled in a speech database. In order to synthesize output speech, compressed parameterized speech segments are selected from the speech database based on an available text and decompressed to regain parameterized speech segments. The parameterized speech segments are then concatenated in a parameter domain. The output speech is synthesized based on these concatenated parametric speech segments.
摘要翻译: 本发明涉及一种级联TTS合成的支持。 为了生成语音数据库作为TTS综合的基础,首先,执行包括基于语音的参数建模的语音数据的分段参数语音编码的语音处理,这导致压缩的参数化语音段。 然后,压缩的参数化语音段被组合在语音数据库中。 为了合成输出语音,基于可用文本从语音数据库中选择压缩的参数化语音段,并且解压缩以重新获得参数化语音段。 参数化语音段然后在参数域中连接。 基于这些连接的参数语音段来合成输出语音。
-
公开(公告)号:US20050137858A1
公开(公告)日:2005-06-23
申请号:US10742645
申请日:2003-12-19
申请人: Ari Heikkinen , Sakari Himanen , Anssi Ramo
发明人: Ari Heikkinen , Sakari Himanen , Anssi Ramo
CPC分类号: G10L19/265 , G10L19/08 , G10L19/16
摘要: The invention relates to a method for use in parametric speech coding. In order to enable an improved parametric coding of speech signals, the method comprises a first step of pre-processing a to be encoded speech based signal such that a phase structure of the to be encoded speech based signal is approached to a phase structure which is obtained when the to be encoded speech based signal is parametrically encoded and decoded again. Only in a second step, a parametric encoding is applied to this pre-processed to be encoded speech based signal. The invention relates equally to a corresponding device, to a corresponding coding module, to a corresponding system and to a corresponding software program product.
摘要翻译: 本发明涉及一种用于参数语音编码的方法。 为了能够实现语音信号的改进的参数编码,该方法包括对要编码的基于语音的信号进行预处理的第一步骤,使得要编码的基于语音的信号的相位结构接近于相位结构,该相位结构是 当要编码的基于语音的信号被再次参数编码和解码时获得。 仅在第二步骤中,将参数编码应用于该预处理为被编码的基于语音的信号。 本发明同样涉及对应的设备,相应的编码模块,相应的系统和相应的软件程序产品。
-
8.
公开(公告)号:US20050091044A1
公开(公告)日:2005-04-28
申请号:US10692291
申请日:2003-10-23
申请人: Anssi Ramo , Jani Nurminen , Sakari Himanen , Ari Heikkinen
发明人: Anssi Ramo , Jani Nurminen , Sakari Himanen , Ari Heikkinen
IPC分类号: G10L11/04 , G10L19/02 , H03M20060101
CPC分类号: G10L19/032 , G10L19/09
摘要: A method and device for improving coding efficiency in audio coding. From the pitch values of a pitch contour of an audio signal, a plurality of simplified pitch contour segments are generated to approximate the pitch contour, based on one or more pre-selected criteria. The contour segments can be linear or non-linear with each contour segment represented by a first end point and a second end point. If the contour segments are linear, then only the information regarding the end points, instead of the pitch values, are provided to a decoder for reconstructing the audio signal. The contour segment can have a fixed maximum length or a variable length, but the deviation between a contour segment and the pitch values in that segment is limited by a maximum value.
摘要翻译: 一种提高音频编码效率的方法和装置。 根据音频信号的音调轮廓的音调值,基于一个或多个预先选择的标准,生成多个简化俯仰轮廓线段以近似俯仰轮廓。 轮廓段可以是由第一终点和第二终点表示的每个轮廓段线性或非线性的。 如果轮廓段是线性的,则仅将关于终点而不是音调值的信息提供给用于重建音频信号的解码器。 轮廓段可以具有固定的最大长度或可变长度,但轮廓段与该段中的俯仰值之间的偏差受到最大值的限制。
-
公开(公告)号:US20050091041A1
公开(公告)日:2005-04-28
申请号:US10692290
申请日:2003-10-23
申请人: Anssi Ramo , Jani Nurminen , Sakari Himanen , Ari Heikkinen
发明人: Anssi Ramo , Jani Nurminen , Sakari Himanen , Ari Heikkinen
IPC分类号: G10L20060101 , G10L11/06 , G10L19/02 , G10L19/04 , G10L19/14 , G10L21/04 , H04B1/06 , H04M11/00
CPC分类号: G10L19/24
摘要: A method and device for use in conjunction with an encoder for encoding an audio signal into a plurality of parameters. Based on the behavior of the parameters, such as pitch, voicing, energy and spectral amplitude information of the audio signal, the audio signal can be segmented, so that the parameter update rate can be optimized. The parameters of the segmented audio signal are recorded in a storage medium or transmitted to a decoder so as to allow the decoder to reconstruct the audio signal based on the parameters indicative of the segment audio signals. For example, based on the pitch characteristic, the pitch contour can be approximated by a plurality of contour segments. An adaptive downsampling method is used to update the parameters based on the contour segments so as to reduce the update rate. At the decoder, the parameters are updated at the original rate.
摘要翻译: 一种与用于将音频信号编码为多个参数的编码器结合使用的方法和装置。 基于音频信号的音调,发音,能量和频谱幅度信息等参数的行为,可以对音频信号进行分段,从而可以优化参数更新速率。 分段音频信号的参数被记录在存储介质中或被发送到解码器,以便允许解码器基于指示段音频信号的参数重建音频信号。 例如,基于俯仰特性,俯仰轮廓可以由多个轮廓段近似。 使用自适应下采样方法根据轮廓段更新参数,以便降低更新速率。 在解码器处,参数以原始速率更新。
-
公开(公告)号:US20150371643A1
公开(公告)日:2015-12-24
申请号:US14394211
申请日:2012-04-18
申请人: Anssi Ramo , Adriana Vasilache , Lasse Laaksonen , Miikka Vilermo , Mikko Tammi
发明人: Anssi Ramo , Adriana Vasilache , Lasse Laaksonen , Miikka Vilermo , Mikko Tammi
IPC分类号: G10L19/008 , H04S1/00
CPC分类号: G10L19/008 , H04S1/007 , H04S2420/03
摘要: An apparatus comprising a channel analyser configured to analyse an audio signal comprising at least two audio channels to determine at least one parameter associated with a difference between the at least two audio channels; an encoding mode determiner configured to select a multichannel audio signal encoding dependent on the at least one parameter; and a channel encoder configured to encode the audio signal with the multichannel audio signal encoding.
摘要翻译: 一种装置,包括信道分析器,其被配置为分析包括至少两个音频信道的音频信号,以确定与所述至少两个音频信道之间的差异相关联的至少一个参数; 编码模式确定器,被配置为选择取决于所述至少一个参数的多声道音频信号编码; 以及频道编码器,被配置为用多声道音频信号编码对音频信号进行编码。
-
-
-
-
-
-
-
-
-