Patent search: ap:("Gakuto Kurata" OR "Toru Nagano" OR "Masafumi Nishimura" OR "Ryuki Tachibana") AND inv:"Gakuto Kurata" (page 1)

1.

Invention application
System And Method For Supporting Text-To-Speech [Granted]

Publication No.: US20080046247A1

Publication date: 2008-02-21

Application No.: US11774798

Filing date: 2007-07-09

Applicants: Gakuto Kurata, Toru Nagano, Masafumi Nishimura, Ryuki Tachibana

Inventors: Gakuto Kurata, Toru Nagano, Masafumi Nishimura, Ryuki Tachibana

IPC classification: G10L13/00

CPC classification: G10L13/04, G10L15/26

Abstract: A system for generating high-quality synthesized text-to-speech includes a learning data generating unit, a frequency data generating unit, and a setting unit. The learning data generating unit recognizes inputted speech, and then generates first learning data in which wordings of phrases are associated with readings thereof. The frequency data generating unit generates, based on the first learning data, frequency data indicating appearance frequencies of both wordings and readings of phrases. The setting unit sets the thus generated frequency data for a language processing unit in order to approximate outputted speech of text-to-speech to the inputted speech. Furthermore, the language processing unit generates, from a wording of text, a reading corresponding to the wording, on the basis of the appearance frequencies.
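
The frequency-based choice of a reading described in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: the function names, the toy learning data, and the simplification to per-wording counts (no context, no speech recognizer) are all assumptions made for illustration.

```python
from collections import Counter, defaultdict

def build_frequency_data(learning_data):
    """Count how often each reading appears for each wording.

    learning_data is an iterable of (wording, reading) pairs, standing in
    for the "first learning data" produced from recognized speech.
    """
    freq = defaultdict(Counter)
    for wording, reading in learning_data:
        freq[wording][reading] += 1
    return freq

def choose_reading(freq, wording):
    """Return the most frequent reading for a wording, or None if unseen."""
    readings = freq.get(wording)
    if not readings:
        return None
    return readings.most_common(1)[0][0]

# Hypothetical learning data: the speaker mostly reads "read" as "red".
learning_data = [
    ("read", "reed"),
    ("read", "red"),
    ("read", "red"),
    ("lead", "leed"),
]
freq = build_frequency_data(learning_data)
print(choose_reading(freq, "read"))
```

A real language processing unit would likely condition on surrounding words as well; the sketch only shows how appearance frequencies can steer the reading toward the input speech.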

2.

Granted invention patent
System and method for supporting text-to-speech [Granted]

Publication No.: US07921014B2

Publication date: 2011-04-05

Application No.: US11774798

Filing date: 2007-07-09

Applicants: Gakuto Kurata, Toru Nagano, Masafumi Nishimura, Ryuki Tachibana

Inventors: Gakuto Kurata, Toru Nagano, Masafumi Nishimura, Ryuki Tachibana

IPC classification: G10L13/00

CPC classification: G10L13/04, G10L15/26

Abstract: A system for generating high-quality synthesized text-to-speech includes a learning data generating unit, a frequency data generating unit, and a setting unit. The learning data generating unit recognizes inputted speech, and then generates first learning data in which wordings of phrases are associated with readings thereof. The frequency data generating unit generates, based on the first learning data, frequency data indicating appearance frequencies of both wordings and readings of phrases. The setting unit sets the thus generated frequency data for a language processing unit in order to approximate outputted speech of text-to-speech to the inputted speech. Furthermore, the language processing unit generates, from a wording of text, a reading corresponding to the wording, on the basis of the appearance frequencies.

3.

Invention application
Stochastic Syllable Accent Recognition [Under examination, published]

Publication No.: US20080177543A1

Publication date: 2008-07-24

Application No.: US11945900

Filing date: 2007-11-27

Applicants: Tohru Nagano, Masafumi Nishimura, Ryuki Tachibana, Gakuto Kurata

Inventors: Tohru Nagano, Masafumi Nishimura, Ryuki Tachibana, Gakuto Kurata

IPC classification: G10L15/04

CPC classification: G10L15/04, G10L13/04

Abstract: Training wording data indicating the wording of each of the words in training text, training speech data indicating characteristics of speech of each of the words, and training boundary data indicating whether each word in training speech is a boundary of a prosodic phrase are stored. After inputting candidates for boundary data, a first likelihood that each boundary of a prosodic phrase of the words in the inputted text would agree with one of the inputted boundary data candidates is calculated, and a second likelihood is calculated. Thereafter, one boundary data candidate maximizing a product of the first and second likelihoods is searched out from among the inputted boundary data candidates, and then a result of the searching is outputted.
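
The search step in this abstract, picking the boundary data candidate that maximizes the product of two likelihoods, can be sketched in a few lines of Python. The abstract does not say how either likelihood is computed, so both are stubbed here with made-up lookup tables; the function name and the candidate encoding (one boolean per word) are likewise assumptions.

```python
def best_boundary_candidate(candidates, first_likelihood, second_likelihood):
    """Return the candidate with the largest product of the two likelihoods."""
    return max(candidates, key=lambda c: first_likelihood(c) * second_likelihood(c))

# Each candidate marks, word by word, whether that word is a prosodic phrase boundary.
candidates = [
    (False, True, False),
    (True, False, False),
    (False, False, True),
]

# Made-up likelihood tables standing in for the two trained models.
first_table = dict(zip(candidates, [0.5, 0.3, 0.2]))
second_table = dict(zip(candidates, [0.2, 0.6, 0.2]))

best = best_boundary_candidate(candidates, first_table.get, second_table.get)
print(best)
```

With many candidates or very small probabilities, summing log-likelihoods instead of multiplying raw values would be the usual way to avoid numerical underflow.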

4.

Granted invention patent
System and method for extracting a specific situation from a conversation [Granted]

Publication No.: US09269357B2

Publication date: 2016-02-23

Application No.: US12576540

Filing date: 2009-10-09

Applicants: Nobuyasu Itoh, Gakuto Kurata, Masafumi Nishimura

Inventors: Nobuyasu Itoh, Gakuto Kurata, Masafumi Nishimura

IPC classification: G10L15/04, G10L15/26, G06F17/30

CPC classification: G10L15/26, G06F17/30746

Abstract: A system, method, and computer readable article of manufacture for extracting a specific situation in a conversation. The system includes: an acquisition unit for acquiring speech voice data of speakers in the conversation; a specific expression detection unit for detecting the speech voice data of a specific expression from speech voice data of a specific speaker in the conversation; and a specific situation extraction unit for extracting, from the speech voice data of the speakers in the conversation, a portion of the speech voice data that forms a speech pattern that includes the speech voice data of the specific expression detected by the specific expression detection unit.

5.

Granted invention patent
Apparatus, method, and program for supporting speech interface design [Granted]

Publication No.: US07747443B2

Publication date: 2010-06-29

Application No.: US11773256

Filing date: 2007-07-03

Applicants: Osamu Ichikawa, Gakuto Kurata, Masafumi Nishimura

Inventors: Osamu Ichikawa, Gakuto Kurata, Masafumi Nishimura

IPC classification: G10L11/00, G10L15/26, G06F17/27

CPC classification: G10L15/063, G10L15/183

Abstract: For design of a speech interface accepting speech control options, speech samples are stored on a computer-readable medium. A similarity calculating unit calculates a certain indication of similarity of first and second sets of ones of the speech samples, the first set of speech samples being associated with a first speech control option and the second set of speech samples being associated with a second speech control option. A display unit displays the similarity indication. In another aspect, word vectors are generated for the respective speech sample sets, indicating frequencies of occurrence of respective words in the respective speech sample sets. The similarity calculating unit calculates the similarity indication responsive to the word vectors of the respective speech sample sets. In another aspect, a perplexity indication is calculated for respective speech sample sets responsive to language models for the respective speech sample sets.
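
One way to picture the word-vector aspect of this abstract is the Python sketch below. It builds a word-frequency vector per speech sample set and compares two sets. The abstract does not name a similarity measure, so cosine similarity is an assumption here, as are the function names, the whitespace tokenization, and the sample utterances.

```python
import math
from collections import Counter

def word_vector(samples):
    """Count word occurrences across all transcribed samples of one control option."""
    counts = Counter()
    for sample in samples:
        counts.update(sample.lower().split())
    return counts

def cosine_similarity(a, b):
    """Cosine similarity of two word-count vectors (0.0 if either is empty)."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)

# Hypothetical transcripts for two speech control options.
play_samples = ["play some music", "play a song"]
stop_samples = ["stop the music", "stop playing"]

similarity = cosine_similarity(word_vector(play_samples), word_vector(stop_samples))
print(round(similarity, 3))
```

A high value would suggest that the two control options attract similar wording and may be easy for a recognizer to confuse, which is presumably the kind of indication a designer would want displayed.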

6.

Granted invention patent
Apparatus, method, and program for supporting speech interface design [Granted]

Publication No.: US07729921B2

Publication date: 2010-06-01

Application No.: US12184182

Filing date: 2008-07-31

Applicants: Osamu Ichikawa, Gakuto Kurata, Masafumi Nishimura

Inventors: Osamu Ichikawa, Gakuto Kurata, Masafumi Nishimura

IPC classification: G10L21/06, G06F17/27, G10L15/26

CPC classification: G10L15/063, G10L15/183

Abstract: For design of a speech interface accepting speech control options, speech samples are stored on a computer-readable medium. A similarity calculating unit calculates a certain indication of similarity of first and second sets of ones of the speech samples, the first set of speech samples being associated with a first speech control option and the second set of speech samples being associated with a second speech control option. A display unit displays the similarity indication. In another aspect, word vectors are generated for the respective speech sample sets, indicating frequencies of occurrence of respective words in the respective speech sample sets. The similarity calculating unit calculates the similarity indication responsive to the word vectors of the respective speech sample sets. In another aspect, a perplexity indication is calculated for respective speech sample sets responsive to language models for the respective speech sample sets.

7.

Granted invention patent
System, method, and program product for processing speech ratio difference data variations in a conversation between two persons [Granted]

Publication No.: US08165874B2

Publication date: 2012-04-24

Application No.: US12399560

Filing date: 2009-03-06

Applicants: Gakuto Kurata, Masafumi Nishimura

Inventors: Gakuto Kurata, Masafumi Nishimura

IPC classification: G10L11/02

CPC classification: G10L25/78, G10L17/00, H04M3/5175

Abstract: A system, method, and program product for processing voice data in a conversation between two persons to determine characteristic conversation patterns. The system includes: a variation calculator for calculating a variation of a speech ratio of a first speaker and a variation calculator for calculating a variation of a speech ratio of a second speaker; a difference calculator for calculating a difference data string; a smoother for generating a smoothed difference data string; and a presenter for presenting the difference between the variation of the speech ratio of the first speaker and the speech ratio of the second speaker. The method includes: calculating a variation of a speech ratio of a first speaker and a second speaker; calculating a difference data string; generating a smoothed difference data string; and grouping them according to their patterns.
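
The difference and smoothing steps named in this abstract can be sketched as follows in Python. Several things are assumed rather than taken from the patent: "speech ratio" is treated as the fraction of each time window in which a speaker is talking, the smoother is a centered moving average, and all names and values are illustrative.

```python
def difference_series(ratio_a, ratio_b):
    """Per-window difference between two speakers' speech ratios."""
    if len(ratio_a) != len(ratio_b):
        raise ValueError("both speakers need the same number of windows")
    return [a - b for a, b in zip(ratio_a, ratio_b)]

def smooth(values, window=3):
    """Centered moving average; the window shrinks at the edges."""
    half = window // 2
    smoothed = []
    for i in range(len(values)):
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        smoothed.append(sum(values[lo:hi]) / (hi - lo))
    return smoothed

# Hypothetical speech ratios per time window for the two speakers.
speaker_a = [0.5, 0.75, 0.25, 0.5]
speaker_b = [0.25, 0.25, 0.25, 0.0]

diff = difference_series(speaker_a, speaker_b)
smoothed_diff = smooth(diff, window=3)
print(diff)
print(smoothed_diff)
```

Positive values in the smoothed series indicate stretches where the first speaker dominates, negative values where the second does. Grouping conversations by the shape of this series is the step the abstract describes last and is not covered by the sketch.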

8.

Invention application
System and Method for Extracting a Specific Situation From a Conversation [Granted]

Publication No.: US20100114575A1

Publication date: 2010-05-06

Application No.: US12576540

Filing date: 2009-10-09

Applicants: Nobuyasu Itoh, Gakuto Kurata, Masafumi Nishimura

Inventors: Nobuyasu Itoh, Gakuto Kurata, Masafumi Nishimura

IPC classification: G10L15/00

CPC classification: G10L15/26, G06F17/30746

Abstract: A system, method, and computer readable article of manufacture for extracting a specific situation in a conversation. The system includes: an acquisition unit for acquiring speech voice data of speakers in the conversation; a specific expression detection unit for detecting the speech voice data of a specific expression from speech voice data of a specific speaker in the conversation; and a specific situation extraction unit for extracting, from the speech voice data of the speakers in the conversation, a portion of the speech voice data that forms a speech pattern that includes the speech voice data of the specific expression detected by the specific expression detection unit.

9.

Invention application
UNSUPERVISED LEXICON ACQUISITION FROM SPEECH AND TEXT [Granted]

Publication No.: US20080221890A1

Publication date: 2008-09-11

Application No.: US12043810

Filing date: 2008-03-06

Applicants: Gakuto Kurata, Shinsuke Mori, Masafumi Nishimura

Inventors: Gakuto Kurata, Shinsuke Mori, Masafumi Nishimura

IPC classification: G10L15/04

CPC classification: G10L15/063

Abstract: Techniques for acquiring, from an input text and an input speech, a set of a character string and a pronunciation thereof which should be recognized as a word. A system according to the present invention: selects, from an input text, plural candidate character strings which are candidates to be recognized as a word; generates plural pronunciation candidates of the selected candidate character strings; generates frequency data by combining data in which the generated pronunciation candidates are respectively associated with the character strings; generates recognition data in which character strings respectively indicating plural words contained in the input speech are associated with pronunciations; and selects and outputs a combination contained in the recognition data, out of combinations each consisting of one of the candidate character strings and one of the pronunciation candidates.
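
The final selection step of this abstract, keeping only those (character string, pronunciation) combinations that also occur in the recognition data, can be sketched in Python as below. The function name, the data layout, and the romanized toy entries are assumptions. Candidate generation and speech recognition are not modeled.

```python
def acquire_lexicon(pronunciation_candidates, recognition_data):
    """Keep candidate (string, pronunciation) pairs found in the recognition data.

    pronunciation_candidates maps each candidate character string to its
    possible pronunciations; recognition_data is a list of
    (string, pronunciation) pairs obtained from the input speech.
    """
    recognized = set(recognition_data)
    return [
        (string, pronunciation)
        for string, pronunciations in pronunciation_candidates.items()
        for pronunciation in pronunciations
        if (string, pronunciation) in recognized
    ]

# Hypothetical candidates taken from text, and pairs recognized from speech.
pronunciation_candidates = {
    "tokyo": ["to-kyo", "to-ki-yo"],
    "kyoto": ["kyo-to", "ki-yo-to"],
}
recognition_data = [("tokyo", "to-kyo"), ("osaka", "o-sa-ka")]

lexicon = acquire_lexicon(pronunciation_candidates, recognition_data)
print(lexicon)
```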

10.

Invention application
APPARATUS, METHOD, AND PROGRAM FOR SUPPORTING SPEECH INTERFACE DESIGN [Granted]

Publication No.: US20080040119A1

Publication date: 2008-02-14

Application No.: US11773256

Filing date: 2007-07-03

Applicants: Osamu Ichikawa, Gakuto Kurata, Masafumi Nishimura

Inventors: Osamu Ichikawa, Gakuto Kurata, Masafumi Nishimura

IPC classification: G10L11/00

CPC classification: G10L15/063, G10L15/183

Abstract: For design of a speech interface accepting speech control options, speech samples are stored on a computer-readable medium. A similarity calculating unit calculates a certain indication of similarity of first and second sets of ones of the speech samples, the first set of speech samples being associated with a first speech control option and the second set of speech samples being associated with a second speech control option. A display unit displays the similarity indication. In another aspect, word vectors are generated for the respective speech sample sets, indicating frequencies of occurrence of respective words in the respective speech sample sets. The similarity calculating unit calculates the similarity indication responsive to the word vectors of the respective speech sample sets. In another aspect, a perplexity indication is calculated for respective speech sample sets responsive to language models for the respective speech sample sets.
