Patent search: ap:("Hideki Kemmochi" OR "Jordi Bonada") AND inv:"Jordi Bonada" (page 4)

31.

Granted invention patent
Voice converter for assimilation by frame synthesis with temporal alignment
Legal status: Lapsed

Publication No.: US07464034B2

Publication date: 2008-12-09

Application No.: US10951328

Filing date: 2004-09-27

Applicants: Takahiro Kawashima, Yasuo Yoshioka, Pedro Cano, Alex Loscos, Xavier Serra, Mark Schiementz, Jordi Bonada

Inventors: Takahiro Kawashima, Yasuo Yoshioka, Pedro Cano, Alex Loscos, Xavier Serra, Mark Schiementz, Jordi Bonada

IPC classification: G10L13/06

CPC classification: G10L13/033, G10L2021/0135

Abstract: A voice converting apparatus is constructed for converting an input voice into an output voice according to a target voice. The apparatus includes a storage section, an analyzing section including a characteristic analyzer, a producing section, a synthesizing section, a memory, an alignment processor, and a target decoder.

32.

Invention patent application
MUSIC SIMILARITY SYSTEMS AND METHODS USING DESCRIPTORS
Legal status: Under examination (published)

Publication No.: US20080300702A1

Publication date: 2008-12-04

Application No.: US12128917

Filing date: 2008-05-29

Applicants: Emilia Gomez, Perfecto Herrera, Pedro Cano Vila, Jordi Janer, Joan Serra, Jordi Bonada, Shadi Walid El-Hajj, Thomas Etienne Aussenac, Gunnar Nils Holmberg

Inventors: Emilia Gomez, Perfecto Herrera, Pedro Cano Vila, Jordi Janer, Joan Serra, Jordi Bonada, Shadi Walid El-Hajj, Thomas Etienne Aussenac, Gunnar Nils Holmberg

IPC classification: G06F17/00

CPC classification: G10L25/48, G06F16/683

Abstract: Systems and methods for determining similarity between two or more audio pieces are disclosed. An illustrative method for determining musical similarities includes extracting one or more descriptors from each audio piece, generating a vector for each of the audio pieces, extracting one or more audio features from each of the audio pieces, calculating values for each audio feature, calculating a distance between a vector containing the normalized values and the vectors containing the audio pieces, and outputting a response to a user or process indicating the similarity between the audio pieces. The descriptors can be used in performing content-based audio classification and for determining similarities between music. The descriptors that can be extracted from each audio piece can include tonal descriptors, dissonance descriptors, rhythm descriptors, and spatial descriptors.
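
The vector-and-distance step in this abstract can be illustrated with a minimal sketch. It assumes each audio piece has already been reduced to a fixed-length list of numeric descriptor values; the z-score normalization, the Euclidean distance, and the function names are illustrative assumptions, not details taken from the patent.

```python
import math

def normalize(vectors):
    """Scale each descriptor dimension to zero mean and unit variance."""
    dims = len(vectors[0])
    means = [sum(v[d] for v in vectors) / len(vectors) for d in range(dims)]
    stds = []
    for d in range(dims):
        var = sum((v[d] - means[d]) ** 2 for v in vectors) / len(vectors)
        stds.append(math.sqrt(var) or 1.0)  # constant dimension: avoid dividing by zero
    return [[(v[d] - means[d]) / stds[d] for d in range(dims)] for v in vectors]

def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def most_similar(query_index, vectors):
    """Return the index of the piece closest to vectors[query_index]."""
    normed = normalize(vectors)
    query = normed[query_index]
    candidates = [
        (euclidean(query, v), i) for i, v in enumerate(normed) if i != query_index
    ]
    return min(candidates)[1]
```

Normalizing first keeps a descriptor with a large numeric range (for example a tempo value) from dominating one with a small range (for example a dissonance score).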

33.

Invention patent application
Music-piece processing apparatus and method
Legal status: In force

Publication No.: US20080115658A1

Publication date: 2008-05-22

Application No.: US11985212

Filing date: 2007-11-13

Applicants: Takuya Fujishima, Jordi Bonada, Maarten De Boer, Sebastian Streich, Bee Suan Ong

Inventors: Takuya Fujishima, Jordi Bonada, Maarten De Boer, Sebastian Streich, Bee Suan Ong

IPC classification: G10H1/22

CPC classification: G10H1/0025, G10H2210/061, G10H2210/125, G10H2220/106, G10H2240/145

Abstract: For each of a plurality of music pieces, a storage device stores respective tone data of a plurality of fragments of the music piece and respective musical character values of the fragments. A similarity determination section calculates a similarity index value indicative of a degree of similarity between the character values of each of the fragments of a main music piece and the character values of each individual fragment of a plurality of sub music pieces. Each of the similarity index values calculated for the fragments of each of the sub music pieces can be adjusted in accordance with a user's control. A processing section processes the tone data of each of the fragments of the main music piece on the basis of the tone data of any one of the fragments of the sub music pieces of which the similarity index value indicates sufficient similarity.
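
The fragment-matching idea in this abstract can be sketched as follows. The similarity formula, the per-piece weights standing in for the user's control, the threshold, and all names are hypothetical choices made for illustration; the abstract does not specify any of them.

```python
def similarity(a, b):
    """Map the Euclidean distance between two character-value vectors to (0, 1]."""
    dist = sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5
    return 1.0 / (1.0 + dist)

def pick_fragments(main, subs, weights, threshold):
    """For each main fragment, pick the best sufficiently similar sub fragment.

    main:      list of character-value vectors, one per main-piece fragment
    subs:      dict mapping sub-piece name -> list of character-value vectors
    weights:   dict mapping sub-piece name -> user-controlled scale factor
    threshold: minimum adjusted similarity that counts as "sufficient"
    Returns a list of (piece_name, fragment_index) tuples, or None where
    no sub fragment is similar enough.
    """
    choices = []
    for m in main:
        best, best_score = None, threshold
        for piece, fragments in subs.items():
            for idx, f in enumerate(fragments):
                # The user's control is modeled as a simple per-piece multiplier.
                score = similarity(m, f) * weights.get(piece, 1.0)
                if score >= best_score:
                    best, best_score = (piece, idx), score
        choices.append(best)
    return choices
```

Lowering the weight of one sub piece makes its fragments less likely to be chosen, which is one simple way to model adjusting the similarity index values per sub piece.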

34.

Granted invention patent
Singing voice synthesizing apparatus, singing voice synthesizing method, and program for realizing singing voice synthesizing method
Legal status: In force

Publication No.: US07016841B2

Publication date: 2006-03-21

Application No.: US10034359

Filing date: 2001-12-27

Applicants: Hideki Kenmochi, Xavier Serra, Jordi Bonada

Inventors: Hideki Kenmochi, Xavier Serra, Jordi Bonada

IPC classification: G10L13/00, G10H7/00

CPC classification: G10L13/07

Abstract: A singing voice synthesizing apparatus is provided, which enables achievement of a natural sounding synthesized singing voice with a good level of comprehensibility. A phoneme database stores a plurality of voice fragment data formed of voice fragments each being a single phoneme or a phoneme chain of at least two concatenated phonemes, each of the plurality of voice fragment data comprising data of a deterministic component and data of a stochastic component. A readout device reads out from the phoneme database the voice fragment data corresponding to inputted lyrics. A duration time adjusting device adjusts time duration of the read-out voice fragment data so as to match a desired tempo and manner of singing. An adjusting device adjusts the deterministic component and the stochastic component of the read-out voice fragment so as to match a desired pitch. A synthesizing device synthesizes a singing sound by sequentially concatenating the voice fragment data that have been adjusted by the duration time adjusting device and the adjusting device.

35.

Granted invention patent
Singing voice synthesizing method
Legal status: In force

Publication No.: US06992245B2

Publication date: 2006-01-31

Application No.: US10375420

Filing date: 2003-02-27

Applicants: Hideki Kenmochi, Alex Loscos, Jordi Bonada

Inventors: Hideki Kenmochi, Alex Loscos, Jordi Bonada

IPC classification: G10H1/06, G10H7/00

CPC classification: G10H7/002, G10H2240/056, G10H2240/311, G10H2250/235, G10H2250/455, G10L13/02

Abstract: A frequency spectrum is detected by analyzing a frequency of a voice waveform corresponding to a voice synthesis unit formed of a phoneme or a phonemic chain. Local peaks are detected on the frequency spectrum, and spectrum distribution regions including the local peaks are designated. For each spectrum distribution region, amplitude spectrum data representing an amplitude spectrum distribution depending on a frequency axis and phase spectrum data representing a phase spectrum distribution depending on the frequency axis are generated. The amplitude spectrum data is adjusted to move the amplitude spectrum distribution represented by the amplitude spectrum data along the frequency axis based on an input note pitch, and the phase spectrum data is adjusted corresponding to the adjustment. Spectrum intensities are adjusted to follow a spectrum envelope corresponding to a desired tone color. The adjusted amplitude and phase spectrum data are converted into a synthesized voice signal.
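
The peak-region step in this abstract can be illustrated on a single amplitude spectrum. This is a simplified sketch under stated assumptions: regions are cut midway between neighbouring peaks, each region is shifted by a whole number of bins, and overlapping regions are merged by taking the larger amplitude. The phase adjustment and the spectral-envelope correction that the abstract also describes are omitted, and all function names are hypothetical.

```python
def local_peaks(mag):
    """Indices of bins that are higher than the left neighbour and not lower than the right."""
    return [k for k in range(1, len(mag) - 1) if mag[k - 1] < mag[k] >= mag[k + 1]]

def regions(mag):
    """Split the bins into one region per peak, cut midway between neighbouring peaks.

    Returns (peak_bin, start_bin, end_bin) tuples with end_bin exclusive.
    """
    peaks = local_peaks(mag)
    cuts = [0] + [(a + b + 1) // 2 for a, b in zip(peaks, peaks[1:])] + [len(mag)]
    return [(p, cuts[i], cuts[i + 1]) for i, p in enumerate(peaks)]

def shift_pitch(mag, ratio):
    """Move every peak region along the frequency axis so its peak lands near peak * ratio."""
    out = [0.0] * len(mag)
    for peak, lo, hi in regions(mag):
        offset = round(peak * ratio) - peak  # whole-bin shift for this region
        for k in range(lo, hi):
            j = k + offset
            if 0 <= j < len(out):
                # Where shifted regions overlap, keep the larger amplitude.
                out[j] = max(out[j], mag[k])
    return out
```

Shifting whole regions rather than resampling the spectrum keeps the shape around each peak intact, which is the property the abstract relies on when it moves each amplitude distribution along the frequency axis.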

36.

Granted invention patent
Voice converter for assimilation by frame synthesis with temporal alignment
Legal status: Lapsed

Publication No.: US06836761B1

Publication date: 2004-12-28

Application No.: US09693144

Filing date: 2000-10-20

Applicants: Takahiro Kawashima, Yasuo Yoshioka, Pedro Cano, Alex Loscos, Xavier Serra, Mark Schiementz, Jordi Bonada

Inventors: Takahiro Kawashima, Yasuo Yoshioka, Pedro Cano, Alex Loscos, Xavier Serra, Mark Schiementz, Jordi Bonada

IPC classification: G10L13/00

CPC classification: G10L13/033, G10L2021/0135

Abstract: A voice converting apparatus is constructed for converting an input voice into an output voice according to a target voice. In the apparatus, a storage section provisionally stores source data, which is associated with and extracted from the target voice. An analyzing section analyzes the input voice to extract therefrom a series of input data frames representing the input voice. A producing section produces a series of target data frames representing the target voice based on the source data, while aligning the target data frames with the input data frames to secure synchronization between the target data frames and the input data frames. A synthesizing section synthesizes the output voice according to the target data frames and the input data frames.
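
The abstract does not say how the target frames are aligned with the input frames. As one possible illustration, the sketch below uses dynamic time warping (DTW), a standard alignment technique that is swapped in here as an assumption and is not claimed to be the patent's method. Each frame is reduced to a single number (for example a pitch or energy value) to keep the example short; the function names are hypothetical.

```python
def dtw_path(a, b):
    """Return the minimum-cost DTW path as a list of (index_in_a, index_in_b) pairs."""
    n, m = len(a), len(b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    # Walk back from the end, always moving to the cheapest predecessor cell.
    i, j = n, m
    path = []
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        steps = [
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        ]
        _, i, j = min(steps)
    path.reverse()
    return path

def align_target_to_input(input_feats, target_feats):
    """For each input frame, give the index of the target frame matched to it."""
    mapping = {}
    for i, j in dtw_path(input_feats, target_feats):
        mapping[i] = j  # if several target frames match, the last one wins
    return [mapping[i] for i in range(len(input_feats))]
```

The resulting index list gives one target frame per input frame, so the two frame series can be processed in step even when the target voice was sung at a different speed.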
