Patent search ap:("Ari Heikkinen" OR "Sakari Himanen" OR "Anssi Ramo") AND inv:"Sakari Himanen" Page 1

1.

Invention application
Method and system for pitch contour quantization in audio coding (in force)

Publication No.: US20080275695A1

Publication date: 2008-11-06

Application No.: US12150307

Filing date: 2008-04-25

Applicant: Anssi Ramo, Jani Nurminen, Sakari Himanen, Ari Heikkinen

Inventor: Anssi Ramo, Jani Nurminen, Sakari Himanen, Ari Heikkinen

IPC classification: G10L11/04

CPC classification: G10L19/032, G10L19/09

Abstract: A method and device for improving coding efficiency in audio coding. From the pitch values of a pitch contour of an audio signal, a plurality of simplified pitch contour segments are generated to approximate the pitch contour, based on one or more pre-selected criteria. The contour segments can be linear or non-linear with each contour segment represented by a first end point and a second end point. If the contour segments are linear, then only the information regarding the end points, instead of the pitch values, are provided to a decoder for reconstructing the audio signal. The contour segment can have a fixed maximum length or a variable length, but the deviation between a contour segment and the pitch values in that segment is limited by a maximum value.
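
The piecewise-linear approximation described in this abstract can be sketched as follows. This is a minimal illustration only, not the method claimed in the patent: the greedy search strategy, the function names `segment_contour` and `reconstruct_contour`, and the parameters `max_dev` and `max_len` are all assumptions made for the example.

```python
from typing import List, Tuple


def line_value(x0: int, y0: float, x1: int, y1: float, x: int) -> float:
    """Value at x of the straight line through (x0, y0) and (x1, y1)."""
    if x1 == x0:
        return y0
    return y0 + (y1 - y0) * (x - x0) / (x1 - x0)


def segment_contour(pitch: List[float], max_dev: float,
                    max_len: int) -> List[Tuple[int, float]]:
    """Greedily pick segment end points (hypothetical strategy).

    Each segment is extended while it spans at most max_len frames and
    every pitch value inside it lies within max_dev of the straight line.
    """
    if not pitch:
        return []
    points = [(0, pitch[0])]
    start, n = 0, len(pitch)
    while start < n - 1:
        end = start + 1
        while end + 1 < n and end + 1 - start <= max_len:
            cand = end + 1
            fits = all(
                abs(line_value(start, pitch[start], cand, pitch[cand], i)
                    - pitch[i]) <= max_dev
                for i in range(start + 1, cand)
            )
            if not fits:
                break
            end = cand
        points.append((end, pitch[end]))
        start = end
    return points


def reconstruct_contour(points: List[Tuple[int, float]]) -> List[float]:
    """Decoder side: rebuild one pitch value per frame from end points only."""
    if not points:
        return []
    out: List[float] = []
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        out.extend(line_value(x0, y0, x1, y1, i) for i in range(x0, x1))
    out.append(points[-1][1])
    return out


contour = [100.0, 102.0, 104.0, 106.0, 108.0, 105.0, 102.0, 99.0]
end_points = segment_contour(contour, max_dev=1.0, max_len=10)
restored = reconstruct_contour(end_points)
```

In this toy contour, eight pitch values reduce to three end points, which is the kind of saving the abstract attributes to sending end points instead of pitch values.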

2.

Invention application
Speech coding (lapsed)

Publication No.: US20050137858A1

Publication date: 2005-06-23

Application No.: US10742645

Filing date: 2003-12-19

Applicant: Ari Heikkinen, Sakari Himanen, Anssi Ramo

Inventor: Ari Heikkinen, Sakari Himanen, Anssi Ramo

IPC classification: G10L19/06, G10L19/08, G10L19/14

CPC classification: G10L19/265, G10L19/08, G10L19/16

Abstract: The invention relates to a method for use in parametric speech coding. In order to enable an improved parametric coding of speech signals, the method comprises a first step of pre-processing a to-be-encoded speech-based signal such that a phase structure of the to-be-encoded speech-based signal is approached to a phase structure which is obtained when the to-be-encoded speech-based signal is parametrically encoded and decoded again. Only in a second step, a parametric encoding is applied to this pre-processed to-be-encoded speech-based signal. The invention relates equally to a corresponding device, to a corresponding coding module, to a corresponding system and to a corresponding software program product.

3.

Invention application
Method and system for pitch contour quantization in audio coding (pending, published)

Publication No.: US20050091044A1

Publication date: 2005-04-28

Application No.: US10692291

Filing date: 2003-10-23

Applicant: Anssi Ramo, Jani Nurminen, Sakari Himanen, Ari Heikkinen

Inventor: Anssi Ramo, Jani Nurminen, Sakari Himanen, Ari Heikkinen

IPC classification: G10L11/04, G10L19/02, H03M20060101

CPC classification: G10L19/032, G10L19/09

Abstract: A method and device for improving coding efficiency in audio coding. From the pitch values of a pitch contour of an audio signal, a plurality of simplified pitch contour segments are generated to approximate the pitch contour, based on one or more pre-selected criteria. The contour segments can be linear or non-linear with each contour segment represented by a first end point and a second end point. If the contour segments are linear, then only the information regarding the end points, instead of the pitch values, are provided to a decoder for reconstructing the audio signal. The contour segment can have a fixed maximum length or a variable length, but the deviation between a contour segment and the pitch values in that segment is limited by a maximum value.

4.

Invention application
Method and system for speech coding (pending, published)

Publication No.: US20050091041A1

Publication date: 2005-04-28

Application No.: US10692290

Filing date: 2003-10-23

Applicant: Anssi Ramo, Jani Nurminen, Sakari Himanen, Ari Heikkinen

Inventor: Anssi Ramo, Jani Nurminen, Sakari Himanen, Ari Heikkinen

IPC classification: G10L20060101, G10L11/06, G10L19/02, G10L19/04, G10L19/14, G10L21/04, H04B1/06, H04M11/00

CPC classification: G10L19/24

Abstract: A method and device for use in conjunction with an encoder for encoding an audio signal into a plurality of parameters. Based on the behavior of the parameters, such as pitch, voicing, energy and spectral amplitude information of the audio signal, the audio signal can be segmented, so that the parameter update rate can be optimized. The parameters of the segmented audio signal are recorded in a storage medium or transmitted to a decoder so as to allow the decoder to reconstruct the audio signal based on the parameters indicative of the segment audio signals. For example, based on the pitch characteristic, the pitch contour can be approximated by a plurality of contour segments. An adaptive downsampling method is used to update the parameters based on the contour segments so as to reduce the update rate. At the decoder, the parameters are updated at the original rate.

5.

Invention application
Reusing codebooks in parameter quantization (pending, published)

Publication No.: US20060080090A1

Publication date: 2006-04-13

Application No.: US10961471

Filing date: 2004-10-07

Applicant: Anssi Ramo, Sakari Himanen, Jani Nurminen

Inventor: Anssi Ramo, Sakari Himanen, Jani Nurminen

IPC classification: G10L19/12

CPC classification: G10L19/07

Abstract: The present invention provides a new methodology for reusing codebooks for a multistage vector quantization of parameter quantizers of signals. Prior art multistage vector quantization is done in such a way that each stage has different optimized codebooks. The prior art codebooks, thus, use quite a lot of memory storage space. Using the same codebook stages several times, according to the present invention, reduces the memory usage and a codebook structure maintains good quality by using optimized codebooks for the most important (first) stages in the quantization. The number of codebooks is reduced by reusing the same codebooks in the refining stages. Additionally, according to the present invention, using many predictors is space-wise efficient since they need only a few coefficients instead of larger codebooks.
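
The codebook reuse described in this abstract can be illustrated with a small multistage vector quantizer. This is a sketch under stated assumptions, not the patented design: the function names `nearest`, `msvq_encode` and `msvq_decode`, the toy codebooks, and the choice of a single shared refinement codebook are all hypothetical.

```python
from typing import List, Sequence

Vector = Sequence[float]


def nearest(codebook: Sequence[Vector], vec: Vector) -> int:
    """Index of the codeword with the smallest squared error to vec."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], vec)),
    )


def msvq_encode(vec: Vector, first_stage: Sequence[Vector],
                refine: Sequence[Vector], n_refine: int) -> List[int]:
    """Quantize vec with one dedicated first stage and n_refine
    refinement stages that all reuse the same `refine` codebook."""
    i = nearest(first_stage, vec)
    indices = [i]
    residual = [v - c for v, c in zip(vec, first_stage[i])]
    for _ in range(n_refine):
        j = nearest(refine, residual)  # same codebook at every refining stage
        indices.append(j)
        residual = [r - c for r, c in zip(residual, refine[j])]
    return indices


def msvq_decode(indices: Sequence[int], first_stage: Sequence[Vector],
                refine: Sequence[Vector]) -> List[float]:
    """Sum the selected codewords: first stage plus each refinement."""
    out = list(first_stage[indices[0]])
    for j in indices[1:]:
        out = [o + c for o, c in zip(out, refine[j])]
    return out


# Toy codebooks (illustrative values only).
FIRST = [[0, 0], [4, 4], [-4, 4]]
REFINE = [[0, 0], [1, 0], [-1, 0], [0, 1], [0, -1]]

codes = msvq_encode([5.0, 2.0], FIRST, REFINE, n_refine=3)
decoded = msvq_decode(codes, FIRST, REFINE)
```

Here four quantization stages need only two stored codebooks, whereas a conventional multistage quantizer with a distinct codebook per stage would store four.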

6.

Granted invention patent
Method and system for pitch contour quantization in audio coding (in force)

Publication No.: US08380496B2

Publication date: 2013-02-19

Application No.: US12150307

Filing date: 2008-04-25

Applicant: Anssi Rämö, Jani Nurminen, Sakari Himanen, Ari Heikkinen

Inventor: Anssi Rämö, Jani Nurminen, Sakari Himanen, Ari Heikkinen

IPC classification: G10L11/04, G10L19/00

CPC classification: G10L19/032, G10L19/09

Abstract: A method and device for improving coding efficiency in audio coding. From the pitch values of a pitch contour of an audio signal, a plurality of simplified pitch contour segments are generated to approximate the pitch contour, based on one or more pre-selected criteria. The contour segments can be linear or non-linear with each contour segment represented by a first end point and a second end point. If the contour segments are linear, then only the information regarding the end points, instead of the pitch values, are provided to a decoder for reconstructing the audio signal. The contour segment can have a fixed maximum length or a variable length, but the deviation between a contour segment and the pitch values in that segment is limited by a maximum value.

7.

Invention application
Supporting a concatenative text-to-speech synthesis (pending, published)

Publication No.: US20070011009A1

Publication date: 2007-01-11

Application No.: US11177250

Filing date: 2005-07-08

Applicant: Jani Nurminen, Sakari Himanen, Anssi Ramo, Janne Vainio

Inventor: Jani Nurminen, Sakari Himanen, Anssi Ramo, Janne Vainio

IPC classification: G10L13/08

CPC classification: G10L13/06

Abstract: The invention relates to a support of a concatenative TTS synthesis. In order to generate a speech database as a basis for the TTS synthesis, first, a speech processing including a segmental parametric speech encoding of speech data based on a parametric modeling of speech is performed, which results in compressed parameterized speech segments. Then, the compressed parameterized speech segments are assembled in a speech database. In order to synthesize output speech, compressed parameterized speech segments are selected from the speech database based on an available text and decompressed to regain parameterized speech segments. The parameterized speech segments are then concatenated in a parameter domain. The output speech is synthesized based on these concatenated parametric speech segments.

8.

Granted invention patent
Speech coding method, device, coding module, system and software program product for pre-processing the phase structure of a to be encoded speech signal to match the phase structure of the decoded signal (lapsed)

Publication No.: US07523032B2

Publication date: 2009-04-21

Application No.: US10742645

Filing date: 2003-12-19

Applicant: Ari Heikkinen, Sakari Himanen, Anssi Rämö

Inventor: Ari Heikkinen, Sakari Himanen, Anssi Rämö

IPC classification: G10L11/04, G10L19/10, G10L19/04

CPC classification: G10L19/265, G10L19/08, G10L19/16

Abstract: The invention relates to a method for use in parametric speech coding. In order to enable an improved parametric coding of speech signals, the method comprises a first step of pre-processing a to-be-encoded speech-based signal such that a phase structure of the to-be-encoded speech-based signal is approached to a phase structure which is obtained when the to-be-encoded speech-based signal is parametrically encoded and decoded again. Only in a second step, a parametric encoding is applied to this pre-processed to-be-encoded speech-based signal. The invention relates equally to a corresponding device, to a corresponding coding module, to a corresponding system and to a corresponding software program product.

9.

Granted invention patent
System and method for modeling speech spectra (in force)

Publication No.: US08489392B2

Publication date: 2013-07-16

Application No.: US11855108

Filing date: 2007-09-13

Applicant: Jani Nurminen, Sakari Himanen

Inventor: Jani Nurminen, Sakari Himanen

IPC classification: G10L11/06

CPC classification: G10L25/93, G10L19/0204, G10L2025/935

Abstract: A system and method for modeling speech in such a way that both voiced and unvoiced contributions can co-exist at certain frequencies. In various embodiments, three spectral bands (or bands of up to three different types) are used. In one embodiment, the lowest band or group of bands is completely voiced, the middle band or group of bands contains both voiced and unvoiced contributions, and the highest band or group of bands is completely unvoiced. The embodiments of the present invention may be used for speech coding and other speech processing applications.
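
The three-band arrangement in this abstract can be pictured with a small helper that assigns voiced and unvoiced shares by frequency. It is only a sketch: the function name `band_voicing`, the two cutoff parameters, and the fixed mixing share in the middle band are assumptions for illustration and are not taken from the patent.

```python
from typing import Tuple


def band_voicing(freq_hz: float, low_cut_hz: float, high_cut_hz: float,
                 mid_voiced_share: float = 0.5) -> Tuple[float, float]:
    """Return (voiced_share, unvoiced_share) for one frequency.

    Below low_cut_hz the band is fully voiced, at or above high_cut_hz it
    is fully unvoiced, and in between both contributions coexist.
    The constant mid_voiced_share is a hypothetical simplification.
    """
    if not 0.0 <= mid_voiced_share <= 1.0:
        raise ValueError("mid_voiced_share must lie in [0, 1]")
    if low_cut_hz > high_cut_hz:
        raise ValueError("low_cut_hz must not exceed high_cut_hz")
    if freq_hz < low_cut_hz:
        return (1.0, 0.0)
    if freq_hz < high_cut_hz:
        return (mid_voiced_share, 1.0 - mid_voiced_share)
    return (0.0, 1.0)


# Example with illustrative cutoffs of 1 kHz and 3 kHz.
shares = [band_voicing(f, 1000.0, 3000.0) for f in (500.0, 2000.0, 4000.0)]
```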

10.

Granted invention patent
Method for inputting characters in electronic device (in force)

Publication No.: US07495585B2

Publication date: 2009-02-24

Application No.: US11433090

Filing date: 2006-05-12

Applicant: Janne Vainio, Hannu J. Mikkola, Hannu Korhonen, Sakari Himanen, Toni P. Nieminen, Tuomas Vaittinen, Juha Marila

Inventor: Janne Vainio, Hannu J. Mikkola, Hannu Korhonen, Sakari Himanen, Toni P. Nieminen, Tuomas Vaittinen, Juha Marila

IPC classification: H03M1/22

CPC classification: G06F3/0233, G06F3/167, H04M1/23, H04M2250/70

Abstract: According to an aspect of the invention, an enhanced audible feedback solution has been invented for electronic devices using an input device facilitating navigation through a plurality of available user interface input options and confirmation of a selected input option. The electronic device is arranged to define, as a response to detecting a selection of a character on the basis of a detection of a first input to an input device of the electronic device, an audio segment specific to the character. The electronic device is arranged to output the defined audio segment via the audio output means prior to a confirmation by a second input to the input device, the second input being associated with a function adding the character as part of a character sequence entered by the user.
