专利检索 ap:("Hideki Kemmochi" OR "Jordi Bonada") AND inv:"Jordi Bonada" 第 1 页

1.

发明授权
Voice synthesizer of multi sounds 有权
标题翻译：多声音合成器

公开(公告)号：US07613612B2

公开(公告)日：2009-11-03

申请号：US11345023

申请日：2006-01-31

申请人： Hideki Kemmochi , Jordi Bonada

发明人： Hideki Kemmochi , Jordi Bonada

IPC分类号： G10L13/04

CPC分类号： G10L13/06 , G10L25/18 , G10L2021/0135

摘要： In a voice synthesizer, an envelope acquisition portion obtains a spectral envelope of a reference frequency spectrum of a given voice. A spectrum acquisition portion obtains a collective frequency spectrum of a plurality of voices which are generated in parallel to one another. An envelope adjustment portion adjusts a spectral envelope of the collective frequency spectrum obtained by the spectrum acquisition portion so as to approximately match with the spectral envelope of the reference frequency spectrum obtained by the envelope acquisition portion. A voice generation portion generates an output voice signal from the collective frequency spectrum having the spectral envelope adjusted by the envelope adjustment portion.

摘要翻译： 在语音合成器中，包络获取部分获得给定语音的参考频谱的频谱包络。频谱获取部分获得彼此并行产生的多个语音的集体频谱。信封调整部分调整由频谱获取部分获得的集体频谱的频谱包络，以便与由包络获取部分获得的参考频谱的频谱包络近似匹配。声音产生部分从具有通过包络线调节部分调整的频谱包络的集体频谱产生输出声音信号。

2.

发明授权
Sound processing apparatus and method, and program therefor 有权
标题翻译：声音处理装置及方法及其程序

公开(公告)号：US07945446B2

公开(公告)日：2011-05-17

申请号：US11372812

申请日：2006-03-09

申请人： Hideki Kemmochi , Yasuo Yoshioka , Jordi Bonada

发明人： Hideki Kemmochi , Yasuo Yoshioka , Jordi Bonada

IPC分类号： G10L21/00 , G10L13/06 , G10L13/00

CPC分类号： G10H1/366 , G10H1/10 , G10H5/005 , G10H2210/251 , G10H2250/031 , G10L13/033

摘要： Spectrum envelope of an input sound is detected. In the meantime, a converting spectrum is acquired which is a frequency spectrum of a converting sound comprising a plurality of sounds, such as unison sounds. Output spectrum is generated by imparting the detected spectrum envelope of the input sound to the acquired converting spectrum. Sound signal is synthesized on the basis of the generated output spectrum. Further, a pitch of the input sound may be detected, and frequencies of peaks in the acquired converting spectrum may be varied in accordance with the detected pitch of the input sound. In this manner, the output spectrum can have the pitch and spectrum envelope of the input sound and spectrum frequency components of the converting sound comprising a plurality of sounds, and thus, unison sounds can be readily generated with simple arrangements.

摘要翻译： 检测输入声音的频谱包络。同时，获取转换频谱，其是包括多个声音（例如一致声音）的转换声音的频谱。通过将检测到的输入声音的频谱包络赋予所获取的转换频谱来产生输出频谱。声音信号是根据产生的输出频谱进行合成的。此外，可以检测输入声音的音调，并且可以根据检测到的输入声音的音调来改变所获取的转换频谱中的峰值频率。以这种方式，输出频谱可以具有包括多个声音的转换声音的输入声音和频谱频率分量的音调和频谱包络，从而可以以简单的布置容易地产生一致的声音。

3.

发明申请
Voice synthesizer of multi sounds 有权

公开(公告)号：US20060173676A1

公开(公告)日：2006-08-03

申请号：US11345023

申请日：2006-01-31

申请人： Hideki Kemmochi , Jordi Bonada

发明人： Hideki Kemmochi , Jordi Bonada

IPC分类号： G10L11/04

CPC分类号： G10L13/06 , G10L25/18 , G10L2021/0135

摘要： In a voice synthesizer, an envelope acquisition portion obtains a spectral envelope of a reference frequency spectrum of a given voice. A spectrum acquisition portion obtains a collective frequency spectrum of a plurality of voices which are generated in parallel to one another. An envelope adjustment portion adjusts a spectral envelope of the collective frequency spectrum obtained by the spectrum acquisition portion so as to approximately match with the spectral envelope of the reference frequency spectrum obtained by the envelope acquisition portion. A voice generation portion generates an output voice signal from the collective frequency spectrum having the spectral envelope adjusted by the envelope adjustment portion.

4.

发明申请
Apparatus for and program of processing audio signal 有权
标题翻译：用于处理音频信号的装置和程序

公开(公告)号：US20060111903A1

公开(公告)日：2006-05-25

申请号：US11273749

申请日：2005-11-14

申请人： Hideki Kemmochi , Jordi Bonada

发明人： Hideki Kemmochi , Jordi Bonada

IPC分类号： G10L15/06

CPC分类号： G10H1/0091 , G10H1/366 , G10H2210/251 , G10H2250/455 , G10L13/04 , G10L19/26

摘要： In an audio signal processing apparatus, a generation section generates an audio signal representing a voice. A distribution section distributes the audio signal generated by the generation section to a first channel and a second channel, respectively. A delay section delays the audio signal of the first channel relative to the audio signal of the second channel for creating a phase difference between the audio signal of the first channel and the audio signal of the second channel such that the created phase difference has a duration corresponding to either an added value of a first duration which is approximately one half of a period of the audio signal generated by the generation section and a second duration which is set shorter than the first duration, or a difference value of the first duration and the second duration. An addition section adds the audio signal of the first channel and the audio signal of the second channel with one another, between which the phase difference is created by the delay section, and outputs the added audio signal which represents natural voice with various characteristics.

摘要翻译： 在音频信号处理装置中，生成部生成表示声音的音频信号。分配部将由生成部生成的音频信号分别分配到第一信道和第二信道。延迟部分相对于第二通道的音频信号延迟第一通道的音频信号，以产生第一通道的音频信号和第二通道的音频信号之间的相位差，使得所创建的相位差具有持续时间对应于第一持续时间的附加值，其大约是由生成部分生成的音频信号的周期的一半，以及被设置为短于第一持续时间的第二持续时间，或者第一持续时间和第二个持续时间。加法部分将第一信道的音频信号和第二信道的音频信号彼此相加，在延迟部分之间产生相位差，并输出表示具有各种特性的自然语音的附加音频信号。

5.

发明授权
Apparatus for and program of processing audio signal 有权
标题翻译：用于处理音频信号的装置和程序

公开(公告)号：US08170870B2

公开(公告)日：2012-05-01

申请号：US11273749

申请日：2005-11-14

申请人： Hideki Kemmochi , Jordi Bonada

发明人： Hideki Kemmochi , Jordi Bonada

IPC分类号： G10L11/04 , G10L13/00 , G10H1/06

CPC分类号： G10H1/0091 , G10H1/366 , G10H2210/251 , G10H2250/455 , G10L13/04 , G10L19/26

摘要： In an audio signal processing apparatus, a generation section generates an audio signal representing a voice. A distribution section distributes the audio signal generated by the generation section to a first channel and a second channel, respectively. A delay section delays the audio signal of the first channel relative to the audio signal of the second channel for creating a phase difference between the audio signal of the first channel and the audio signal of the second channel such that the created phase difference has a duration corresponding to either an added value of a first duration which is approximately one half of a period of the audio signal generated by the generation section and a second duration which is set shorter than the first duration, or a difference value of the first duration and the second duration. An addition section adds the audio signal of the first channel and the audio signal of the second channel with one another, between which the phase difference is created by the delay section, and outputs the added audio signal which represents natural voice with various characteristics.

摘要翻译： 在音频信号处理装置中，生成部生成表示声音的音频信号。分配部将由生成部生成的音频信号分别分配到第一信道和第二信道。延迟部分相对于第二通道的音频信号延迟第一通道的音频信号，以产生第一通道的音频信号和第二通道的音频信号之间的相位差，使得所创建的相位差具有持续时间对应于第一持续时间的附加值，其大约是由生成部分生成的音频信号的周期的一半，以及被设置为短于第一持续时间的第二持续时间，或者第一持续时间和第二个持续时间。加法部分将第一信道的音频信号和第二信道的音频信号彼此相加，在延迟部分之间产生相位差，并输出表示具有各种特性的自然语音的附加音频信号。

6.

发明授权
Singing voice synthesizing apparatus, singing voice synthesizing method and program for singing voice synthesizing 有权
标题翻译：唱歌语音合成装置，歌唱合成方法和歌唱合成程序

公开(公告)号：US07135636B2

公开(公告)日：2006-11-14

申请号：US10375272

申请日：2003-02-27

申请人： Hideki Kemmochi , Yasuo Yoshioka , Jordi Bonada

发明人： Hideki Kemmochi , Yasuo Yoshioka , Jordi Bonada

IPC分类号： G10H1/06 , G10H7/00

CPC分类号： G10L13/00 , G10H7/00 , G10H2240/056 , G10H2250/235 , G10H2250/455

摘要： A method for synthesizing a natural-sounding singing voice divides performance data into a transition part and a long sound part. The transition part is represented by articulation (phonemic chain) data that is read from an articulation template database and is outputted without modification. For the long sound part, a new characteristic parameter is generated by linearly interpolating characteristic parameters of the transition parts positioned before and after the long sound part and adding thereto a changing component of stationary data that is read from a constant part (stationary) template database. An associated apparatus for carrying out the singing voice synthesizing method includes a phoneme database for storing articulation data for the transition part and stationary data for the long sound part, a first device for outputting the articulation data, and a second device for outputting the newly-generated characteristic parameter of the long sound part.

摘要翻译： 用于合成自然发声的歌声的方法将演奏数据分成转换部分和长音部分。过渡部分由从关节运动模板数据库读取并且没有修改地输出的关节（音素链）数据表示。对于长音部分，通过线性内插位于长声部分之前和之后的过渡部分的特征参数，并且向其添加从恒定部分（静止）模板数据库读取的静止数据的变化分量，生成新的特征参数。用于执行歌唱声合成方法的相关装置包括用于存储用于转换部分的发音数据的音素数据库和用于长音部分的固定数据，用于输出关节数据的第一装置，以及用于输出新音符的第二装置，生成长音部分的特征参数。

7.

发明申请
Sound processing apparatus and method, and program therefor 有权
标题翻译：声音处理装置及方法及其程序

公开(公告)号：US20060212298A1

公开(公告)日：2006-09-21

申请号：US11372812

申请日：2006-03-09

申请人： Hideki Kemmochi , Yasuo Yoshioka , Jordi Bonada

发明人： Hideki Kemmochi , Yasuo Yoshioka , Jordi Bonada

IPC分类号： G10L21/02

CPC分类号： G10H1/366 , G10H1/10 , G10H5/005 , G10H2210/251 , G10H2250/031 , G10L13/033

摘要： Spectrum envelope of an input sound is detected. In the meantime, a converting spectrum is acquired which is a frequency spectrum of a converting sound comprising a plurality of sounds, such as unison sounds. Output spectrum is generated by imparting the detected spectrum envelope of the input sound to the acquired converting spectrum. Sound signal is synthesized on the basis of the generated output spectrum. Further, a pitch of the input sound may be detected, and frequencies of peaks in the acquired converting spectrum may be varied in accordance with the detected pitch of the input sound. In this manner, the output spectrum can have the pitch and spectrum envelope of the input sound and spectrum frequency components of the converting sound comprising a plurality of sounds, and thus, unison sounds can be readily generated with simple arrangements.

摘要翻译： 检测输入声音的频谱包络。同时，获取转换频谱，其是包括多个声音（例如一致声音）的转换声音的频谱。通过将检测到的输入声音的频谱包络赋予所获取的转换频谱来产生输出频谱。声音信号是根据产生的输出频谱进行合成的。此外，可以检测输入声音的音调，并且可以根据检测到的输入声音的音调来改变所获取的转换频谱中的峰值频率。以这种方式，输出频谱可以具有包括多个声音的转换声音的输入声音和频谱频率分量的音调和频谱包络，从而可以以简单的布置容易地产生一致的声音。

8.

发明授权
Tone processing apparatus and method 有权
标题翻译：音调处理装置及方法

公开(公告)号：US07750228B2

公开(公告)日：2010-07-06

申请号：US12006918

申请日：2008-01-07

申请人： Takuya Fujishima , Jordi Bonada , Maarten De Boer

发明人： Takuya Fujishima , Jordi Bonada , Maarten De Boer

IPC分类号： G10H1/00 , G10H1/18

CPC分类号： G10H7/02 , G10H1/14 , G10H1/183 , G10H1/344 , G10H1/40 , G10H1/46 , G10H7/008 , G10H2210/095 , G10H2210/381 , G10H2210/565 , G10H2220/221 , G10H2220/395 , G10H2240/135 , G10H2240/155 , G10H2250/035 , G10H2250/455 , G10H2250/625 , G10H2250/641

摘要： For at least one music piece, a storage section stores tone data of each of a plurality of fragments segmented from the music piece and stores a first descriptor indicative of a musical character of each of the fragments in association with the fragment. Descriptor generation section receives input data based on operation by a user and generates a second descriptor, indicative of a musical character, on the basis of the received input data. Determination section determines similarity between the second descriptor and the first descriptor of each of the fragments. Selection section selects the tone data of at least one fragment on the basis of a result of the similarity determination by the determination section. On the basis of the tone data of the selected at least one fragment, a data generation section generates tone data to be outputted.

摘要翻译： 对于至少一个音乐片段，存储部分存储从音乐片段分割的多个片段中的每一个的乐曲数据，并且存储指示与片段相关联的每个片段的音乐特征的第一描述符。描述符生成部分基于用户的操作接收输入数据，并且基于所接收的输入数据生成表示音乐人物的第二描述符。确定部分确定第二描述符和每个片段的第一描述符之间的相似性。选择部根据判定部的相似判定结果，选择至少一个片段的色调数据。根据所选择的至少一个片段的色调数据，数据产生部分产生要输出的色调数据。

9.

发明授权
Apparatus and method for creating singing synthesizing database, and pitch curve generation apparatus and method 有权

公开(公告)号：US08115089B2

公开(公告)日：2012-02-14

申请号：US12828375

申请日：2010-07-01

申请人： Keijiro Saino , Jordi Bonada

发明人： Keijiro Saino , Jordi Bonada

IPC分类号： G10H1/06

CPC分类号： G10L13/10 , G10H1/0008 , G10H1/361 , G10H2210/086 , G10H2240/155 , G10H2250/015 , G10H2250/425 , G10H2250/481

摘要： Waveform data representative of singing voices of a singing music piece are analyzed to generate melody component data representative of variation over time in fundamental frequency component presumed to represent a melody in the singing voices. Then, through machine learning that uses score data representative of a musical score of the singing music piece and the melody component data, a melody component model, representative of a variation component presumed to represent the melody among the variation over time in fundamental frequency component, is generated for each combination of notes. Parameters defining the melody component models and note identifiers indicative of the combinations of notes whose variation over time in fundamental frequency component are represented by the melody component models are stored into a pitch curve generating database in association with each other.

10.

发明授权
Fragment search apparatus and method 有权
标题翻译：片段搜索装置和方法

公开(公告)号：US07812240B2

公开(公告)日：2010-10-12

申请号：US12287584

申请日：2008-10-10

申请人： Sebastian Streich , Jordi Bonada , Samuel Roig

发明人： Sebastian Streich , Jordi Bonada , Samuel Roig

IPC分类号： A63H5/00

CPC分类号： G10H1/0025 , G06F17/30743 , G10H2210/076 , G10H2210/125 , G10H2240/131

摘要： Analysis section divides waveform data of a given music piece into waveform data of a plurality of fragments and divides the waveform data of each of the fragments into one or more events of sound, and obtains a character value indicative of a character of the waveform data pertaining to each of the divided events. Storage section stores respective music piece data and music piece composing data of one or more music pieces. The music piece composing data include a character value indicative of a character of the waveform data pertaining to each of the events of each of the fragments. Search section searches (or retrieves) for, from among the stored music piece composing data, one event or a plurality of successive events having a character value of a high degree of similarity to one or more events included in a designated fragment.

摘要翻译： 分析部将给定音乐片段的波形数据分割为多个片段的波形数据，并将每个片段的波形数据划分为一个或多个声音事件，并获得表示相关波形数据的字符的字符值对每一个分开的事件。存储部分存储合成一个或多个音乐片段的各个乐曲数据和音乐作品的数据。构成数据的乐曲包括表示与每个片段的每个事件有关的波形数据的字符的字符值。搜索部分从存储的音乐作品数据中搜索（或检索）一个事件或多个连续事件，该事件或多个连续事件具有与包括在指定片段中的一个或多个事件高度相似度的字符值。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类