Patent search ap:("Casio Computer Co. Page Ltd.") AND inv:"Hiroki Tomita"

1.

发明授权
Voice retrieval apparatus, voice retrieval method, and non-transitory recording medium 有权

公开(公告)号：US09767790B2

公开(公告)日：2017-09-19

申请号：US14953729

申请日：2015-11-30

Applicant: Casio Computer Co., Ltd.

Inventor： Hiroki Tomita

IPC: G10L15/00 , G10L15/05 , G10L15/02 , G10L25/54

CPC classification number: G10L15/05 , G10L15/02 , G10L25/54 , G10L2015/025

Abstract: A voice retrieval apparatus executes processes of: converting a retrieval string into a phoneme string; obtaining, from a time length memory, a continuous time length for each phoneme contained in the converted phoneme string; deriving a plurality of time lengths corresponding to a plurality of utterance rates as candidate utterance time lengths of voices corresponding to the retrieval string based on the obtained continuous time length; specifying, for each of the plurality of time lengths, a plurality of likelihood obtainment segments having the derived time length within a time length of a retrieval sound signal; obtaining a likelihood showing a plausibility that the specified likelihood obtainment segment specified is a segment where the voices are uttered; and identifying, based on the obtained likelihood, for each of the specified likelihood obtainment segments, an estimation segment where utterance of the voices is estimated in the retrieval sound signal.

2.

发明申请
VOICE PROCESSING APPARATUS 审中-公开

公开(公告)号：US20190172445A1

公开(公告)日：2019-06-06

申请号：US16193163

申请日：2018-11-16

Applicant: CASIO COMPUTER CO., LTD

Inventor： Hiroki Tomita

IPC: G10L15/06 , G10L15/22

Abstract: A voice processing apparatus includes a first storage unit which stores a known-word, and a processor. The processor executes a voice recognition process of extracting an unknown-word by executing a voice recognition process on an input voice signal, based on a storage content of the first storage unit, and a storage control process of executing storage control to the first storage unit, wherein the storage control process includes a process of storing, when information of a number of unknown-words which are recognized to be identical, among the extracted unknown-words by the voice recognition process, meets a predetermined condition, a corresponding unknown-word in the first storage unit as a known-word.

3.

发明授权
Voice retrieval apparatus, voice retrieval method, and non-transitory recording medium 有权

公开(公告)号：US09754024B2

公开(公告)日：2017-09-05

申请号：US14953775

申请日：2015-11-30

Applicant: Casio Computer Co., Ltd.

Inventor： Hiroki Tomita

IPC: G10L15/00 , G06F17/30 , G10L15/02

CPC classification number: G06F17/3074 , G06F17/30734 , G06F17/30743 , G10L2015/025

Abstract: A voice retrieval apparatus executes processes of: obtaining, from a time length memory, a continuous time length for each phoneme contained in a phoneme string of a retrieval string; obtaining user-specified information on an utterance rate; changing the continuous time length for each obtained phoneme in accordance with the obtained information; deriving, based on the changed continuous time length, an utterance time length of voices corresponding to the retrieval string; specifying a plurality of likelihood obtainment segments of the derived utterance time length in a time length of a retrieval sound signal; obtaining a likelihood showing a plausibility that the specified likelihood obtainment segment is a segment where the voices are uttered; and identifying, based on the obtained likelihood, an estimation segment where, within the retrieval sound signal, utterance of the voices is estimated, the estimation segment being identified for each specified likelihood obtainment segment.

4.

发明授权
Audio interval detection apparatus, method, and recording medium to eliminate a specified interval that does not represent speech based on a divided phoneme 有权

公开(公告)号：US11276390B2

公开(公告)日：2022-03-15

申请号：US16352787

申请日：2019-03-13

Applicant: CASIO COMPUTER CO., LTD.

Inventor： Hiroki Tomita

IPC: G10L15/10 , G10L15/16 , G10L15/187 , G10L15/04 , G10L15/02 , G10L15/14

Abstract: An audio interval detection apparatus has a processor and a storage storing instructions that, when executed by the processor, control the processor to: detect, from a target audio signal, a specified audio interval including a specified audio signal representing a state of a phoneme of a same consonant produced continuously over a period longer than a specified time, and, by eliminating, from the target audio signal at least the detected specified audio interval, detect from the target audio signal an utterance audio interval that includes a speech utterance signal representing a speech utterance uttered by a speaker.

5.

发明授权
Voice search device, voice search method, and non-transitory recording medium 有权
Title translation: 语音搜索装置，语音搜索方法和非暂时记录媒体

公开(公告)号：US09431007B2

公开(公告)日：2016-08-30

申请号：US14597958

申请日：2015-01-15

Applicant: CASIO COMPUTER CO., LTD.

Inventor： Hiroki Tomita

IPC: G10L15/00 , G10L15/14 , G10L15/02 , G10L15/18 , G06F17/30 , G10L25/87 , G10L25/54 , G10L15/08

CPC classification number: G10L15/142 , G06F17/30755 , G06F17/30967 , G10L15/02 , G10L15/18 , G10L25/54 , G10L25/87 , G10L2015/025 , G10L2015/081 , G10L2015/088

Abstract: In a voice search device, a processor acquires a search word, converts the search word into a phoneme sequence, acquires, for each frame, an output probability of a feature quantity of a target voice signal being output from each phoneme included in the phoneme sequence, and executes relative calculation of the output probability acquired from each phoneme, based on an output probability acquired from another phoneme included in the phoneme sequence. In addition, the processor successively designates likelihood acquisition zones, acquires a likelihood indicating how likely a designated likelihood acquisition zone is a zone in which voice corresponding to the search word is spoken, and identifies from the target voice signal an estimated zone for which the voice corresponding to the search word is estimated to be spoken, based on the acquired likelihood.

Abstract translation: 在语音搜索装置中，处理器获取搜索词，将搜索词转换成音素序列，为每个帧获取从包含在音素序列中的每个音素输出的目标语音信号的特征量的输出概率并且基于从包括在音素序列中的另一音素获取的输出概率，执行从每个音素获取的输出概率的相对计算。此外，处理器连续地指定可能性获取区域，获取表示指定的可能性获取区域是与哪个语音相对应的语音的区域的可能性，并且从目标语音信号中识别语音的估计区域基于获得的可能性，估计对应于搜索词的口令。

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification