专利检索 ap:("International Business Machines Corporation") AND inv:"Toru Nagano" 第 1 页

1.

发明授权
Acoustic data augmentation with mixed normalization factors 有权

公开(公告)号：US12112767B2

公开(公告)日：2024-10-08

申请号：US17326463

申请日：2021-05-21

申请人： INTERNATIONAL BUSINESS MACHINES CORPORATION

发明人： Toru Nagano , Takashi Fukuda , Masayuki Suzuki

IPC分类号： G10L21/00 , G06N20/00 , G10L19/02 , G10L25/27

CPC分类号： G10L21/00 , G06N20/00 , G10L19/02 , G10L25/27

摘要： A method, computer system, and a computer program product for audio data augmentation are provided. Sets of audio data from different sources may be obtained. A respective normalization factor for at least two sources of the different sources may be calculated. The normalization factors from the at least two sources may be mixed to determine a mixed normalization factor. A first set of the sets may be normalized by using the mixed normalization factor and to obtain training data for training an acoustic model.

2.

发明申请
DATA AUGMENTATION BY FRAME INSERTION FOR SPEECH DATA 有权

公开(公告)号：US20210043186A1

公开(公告)日：2021-02-11

申请号：US16535829

申请日：2019-08-08

申请人： INTERNATIONAL BUSINESS MACHINES CORPORATION

发明人： Toru Nagano , Takashi Fukuda , Masayuki Suzuki , Gakuto Kurata

IPC分类号： G10L13/033 , G10L15/18 , G06F17/27

摘要： A technique for data augmentation for speech data is disclosed. Original speech data including a sequence of feature frames is obtained. A partially prolonged copy of the original speech data is generated by inserting one or more new frames into the sequence of the feature frames. The partially prolonged copy is output as augmented speech data for training an acoustic model for training an acoustic model.

3.

发明授权
Technique for automatically splitting words 有权

公开(公告)号：US10572586B2

公开(公告)日：2020-02-25

申请号：US15906525

申请日：2018-02-27

申请人： INTERNATIONAL BUSINESS MACHINES CORPORATION

发明人： Toru Nagano , Nobuyasu Itoh , Gakuto Kurata

IPC分类号： G06F17/00 , G06F17/27

摘要： A computer-implemented method, computer program product, and system are provided for separating a word in a dictionary. The method includes reading a word from the dictionary as a source word. The method also includes searching the dictionary for another word having a substring with a same surface string and a same reading as the source word. The method additionally includes splitting the another word by the source word to obtain one or more remaining substrings of the another word. The method further includes registering each of the one or more remaining substrings as a new word in the dictionary.

4.

发明申请
PROCESSING OF SPEECH SIGNALS 审中-公开

公开(公告)号：US20190130932A1

公开(公告)日：2019-05-02

申请号：US15800112

申请日：2017-11-01

申请人： International Business Machines Corporation

发明人： Masayuki Suzuki , Takashi Fukuda , Toru Nagano

IPC分类号： G10L25/24 , G10L15/26 , G10L15/01

摘要： A method for processing a speech signal. The method comprises obtaining a logmel feature of a speech signal. The method further includes one or more processors processing the logmel feature so that the logmel feature is normalized under a constraint that a power level of the logmel feature is kept as originally obtained. The method further includes inputting the processed logmel feature into a speech-to-text system to generate corresponding text data.

5.

发明授权
Method for improving acoustic model, computer for improving acoustic model and computer program thereof 有权

公开(公告)号：US09870767B2

公开(公告)日：2018-01-16

申请号：US14969340

申请日：2015-12-15

申请人： INTERNATIONAL BUSINESS MACHINES CORPORATION

发明人： Gakuto Kurata , Toru Nagano , Masayuki Suzuki

IPC分类号： G10L15/00 , G10L15/06 , G10L15/14 , G10L15/02 , G10L21/0208 , G10L25/18 , G10L25/24 , G10L15/187 , G10L15/065

CPC分类号： G10L15/063 , G10L15/02 , G10L15/065 , G10L15/187 , G10L21/0208 , G10L25/18 , G10L25/24 , G10L2015/025

摘要： Embodiments include methods and systems for improving an acoustic model. Aspects include acquiring a first standard deviation value by calculating standard deviation of a feature from first training data and acquiring a second standard deviation value by calculating standard deviation of a feature from second training data acquired in a different environment from an environment of the first training data. Aspects also include creating a feature adapted to an environment where the first training data is recorded, by multiplying the feature acquired from the second training data by a ratio obtained by dividing the first standard deviation value by the second standard deviation value. Aspects further include reconstructing an acoustic model constructed using training data acquired in the same environment as the environment of the first training data using the feature adapted to the environment where the first training data is recorded.

6.

发明授权
Processing of speech signals 有权

公开(公告)号：US10540990B2

公开(公告)日：2020-01-21

申请号：US15800112

申请日：2017-11-01

申请人： International Business Machines Corporation

发明人： Masayuki Suzuki , Takashi Fukuda , Toru Nagano

IPC分类号： G10L21/06 , G10L25/24 , G10L15/01 , G10L15/26

摘要： A method for processing a speech signal. The method comprises obtaining a logmel feature of a speech signal. The method further includes one or more processors processing the logmel feature so that the logmel feature is normalized under a constraint that a power level of the logmel feature is kept as originally obtained. The method further includes inputting the processed logmel feature into a speech-to-text system to generate corresponding text data.

7.

发明申请
TECHNIQUE FOR AUTOMATICALLY SPLITTING WORDS 审中-公开

公开(公告)号：US20190266239A1

公开(公告)日：2019-08-29

申请号：US15906525

申请日：2018-02-27

申请人： INTERNATIONAL BUSINESS MACHINES CORPORATION

发明人： Toru Nagano , Nobuyasu Itoh , Gakuto Kurata

IPC分类号： G06F17/27

摘要： A computer-implemented method, computer program product, and system are provided for separating a word in a dictionary. The method includes reading a word from the dictionary as a source word. The method also includes searching the dictionary for another word having a substring with a same surface string and a same reading as the source word. The method additionally includes splitting the another word by the source word to obtain one or more remaining substrings of the another word. The method further includes registering each of the one or more remaining substrings as a new word in the dictionary.

8.

发明授权
Expansion of a question and answer database 有权

公开(公告)号：US10380177B2

公开(公告)日：2019-08-13

申请号：US14957139

申请日：2015-12-02

申请人： INTERNATIONAL BUSINESS MACHINES CORPORATION

发明人： Yoshinori Kabeya , Toru Nagano , Masayuki Suzuki , Issei Yoshida

IPC分类号： G06F16/632 , G06F16/635 , G06F16/638 , G06F16/683 , G10L15/22 , G06F16/332

摘要： A system and method for expanding a question and answer (Q&A) database. The method includes preparing a set of Q&A documents and speech recognition results of an agent's utterances in conversations between an agent and a customer, each Q&A document in the set having an identifier, and each speech recognition result having an identifier common with the identifier of a relevant Q&A document, and adding one or more repetition parts extracted from the speech recognition results of the agent's utterances to a corresponding Q&A document in the set.

9.

发明授权
Method for improving acoustic model, computer for improving acoustic model and computer program thereof 有权

公开(公告)号：US09984681B2

公开(公告)日：2018-05-29

申请号：US15678195

申请日：2017-08-16

申请人： International Business Machines Corporation

发明人： Gakuto Kurata , Toru Nagano , Masayuki Suzuki

IPC分类号： G10L15/02 , G10L21/0208 , G10L25/18 , G10L15/06 , G10L15/065 , G10L15/187 , G10L25/24

CPC分类号： G10L15/063 , G10L15/02 , G10L15/065 , G10L15/187 , G10L21/0208 , G10L25/18 , G10L25/24 , G10L2015/025

摘要： Embodiments include methods and systems for improving an acoustic model. Aspects include acquiring a first standard deviation value by calculating standard deviation of a feature from first training data and acquiring a second standard deviation value by calculating standard deviation of a feature from second training data acquired in a different environment from an environment of the first training data. Aspects also include creating a feature adapted to an environment where the first training data is recorded, by multiplying the feature acquired from the second training data by a ratio obtained by dividing the first standard deviation value by the second standard deviation value. Aspects further include reconstructing an acoustic model constructed using training data acquired in the same environment as the environment of the first training data using the feature adapted to the environment where the first training data is recorded.

10.

发明申请
Speech Recognition Model Construction Method, Speech Recognition Method, Computer System, Speech Recognition Apparatus, Program, and Recording Medium 有权
标题翻译：语音识别模型构建方法，语音识别方法，计算机系统，语音识别装置，程序和记录介质

公开(公告)号：US20160086599A1

公开(公告)日：2016-03-24

申请号：US14863124

申请日：2015-09-23

申请人： INTERNATIONAL BUSINESS MACHINES CORPORATION

发明人： Gakuto Kurata , Toru Nagano , Masayuki Suzuki , Ryuki Tachibana

IPC分类号： G10L15/06 , G10L15/24 , G10L13/027 , G10L15/07 , G10L15/02

CPC分类号： G10L15/063 , G10L13/00 , G10L15/187 , G10L15/19

摘要： A construction method for a speech recognition model, in which a computer system includes; a step of acquiring alignment between speech of each of a plurality of speakers and a transcript of the speaker; a step of joining transcripts of the respective ones of the plurality of speakers along a time axis, creating a transcript of speech of mixed speakers obtained from synthesized speech of the speakers, and replacing predetermined transcribed portions of the plurality of speakers overlapping on the time axis with a unit which represents a simultaneous speech segment; and a step of constructing at least one of an acoustic model and a language model which make up a speech recognition model, based on the transcript of the speech of the mixed speakers.

摘要翻译： 一种用于语音识别模型的构造方法，其中计算机系统包括：获取多个扬声器中的每一个的语音与扬声器的抄本之间的对准的步骤; 沿着时间轴连接多个扬声器中的各个扬声器的转录本的步骤，创建从扬声器的合成语音获得的混合扬声器的语音转录，并替换在时间轴上重叠的多个扬声器的预定转录部分具有表示同时语音段的单元; 以及基于混合扬声器的语音的抄本，构成构成语音识别模型的声学模型和语言模型中的至少一个的步骤。

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类