Technique for automatically splitting words

    公开(公告)号:US10572586B2

    公开(公告)日:2020-02-25

    申请号:US15906525

    申请日:2018-02-27

    IPC分类号: G06F17/00 G06F17/27

    摘要: A computer-implemented method, computer program product, and system are provided for separating a word in a dictionary. The method includes reading a word from the dictionary as a source word. The method also includes searching the dictionary for another word having a substring with a same surface string and a same reading as the source word. The method additionally includes splitting the another word by the source word to obtain one or more remaining substrings of the another word. The method further includes registering each of the one or more remaining substrings as a new word in the dictionary.

    PROCESSING OF SPEECH SIGNALS
    4.
    发明申请

    公开(公告)号:US20190130932A1

    公开(公告)日:2019-05-02

    申请号:US15800112

    申请日:2017-11-01

    IPC分类号: G10L25/24 G10L15/26 G10L15/01

    摘要: A method for processing a speech signal. The method comprises obtaining a logmel feature of a speech signal. The method further includes one or more processors processing the logmel feature so that the logmel feature is normalized under a constraint that a power level of the logmel feature is kept as originally obtained. The method further includes inputting the processed logmel feature into a speech-to-text system to generate corresponding text data.

    TECHNIQUE FOR AUTOMATICALLY SPLITTING WORDS
    7.
    发明申请

    公开(公告)号:US20190266239A1

    公开(公告)日:2019-08-29

    申请号:US15906525

    申请日:2018-02-27

    IPC分类号: G06F17/27

    摘要: A computer-implemented method, computer program product, and system are provided for separating a word in a dictionary. The method includes reading a word from the dictionary as a source word. The method also includes searching the dictionary for another word having a substring with a same surface string and a same reading as the source word. The method additionally includes splitting the another word by the source word to obtain one or more remaining substrings of the another word. The method further includes registering each of the one or more remaining substrings as a new word in the dictionary.

    Speech Recognition Model Construction Method, Speech Recognition Method, Computer System, Speech Recognition Apparatus, Program, and Recording Medium
    10.
    发明申请
    Speech Recognition Model Construction Method, Speech Recognition Method, Computer System, Speech Recognition Apparatus, Program, and Recording Medium 有权
    语音识别模型构建方法,语音识别方法,计算机系统,语音识别装置,程序和记录介质

    公开(公告)号:US20160086599A1

    公开(公告)日:2016-03-24

    申请号:US14863124

    申请日:2015-09-23

    摘要: A construction method for a speech recognition model, in which a computer system includes; a step of acquiring alignment between speech of each of a plurality of speakers and a transcript of the speaker; a step of joining transcripts of the respective ones of the plurality of speakers along a time axis, creating a transcript of speech of mixed speakers obtained from synthesized speech of the speakers, and replacing predetermined transcribed portions of the plurality of speakers overlapping on the time axis with a unit which represents a simultaneous speech segment; and a step of constructing at least one of an acoustic model and a language model which make up a speech recognition model, based on the transcript of the speech of the mixed speakers.

    摘要翻译: 一种用于语音识别模型的构造方法,其中计算机系统包括: 获取多个扬声器中的每一个的语音与扬声器的抄本之间的对准的步骤; 沿着时间轴连接多个扬声器中的各个扬声器的转录本的步骤,创建从扬声器的合成语音获得的混合扬声器的语音转录,并替换在时间轴上重叠的多个扬声器的预定转录部分 具有表示同时语音段的单元; 以及基于混合扬声器的语音的抄本,构成构成语音识别模型的声学模型和语言模型中的至少一个的步骤。