Voice search device, voice search method, and non-transitory recording medium
    1.
    发明授权
    Voice search device, voice search method, and non-transitory recording medium 有权
    语音搜索设备,语音搜索方法和非暂时记录介质

    公开(公告)号:US09437187B2

    公开(公告)日:2016-09-06

    申请号:US14604345

    申请日:2015-01-23

    Inventor: Hiroyasu Ide

    Abstract: A search string acquiring unit acquires a search string. A converting unit converts the search string into a phoneme sequence. A time length deriving unit derives the spoken time length of the voice corresponding to the search string. A zone designating unit designates a likelihood acquisition zone in a target voice signal. A likelihood acquiring device acquires a likelihood indicating how likely the likelihood acquisition interval is an interval in which voice corresponding to the search string is spoken. A repeating unit changes the likelihood acquisition zone designated by the zone designating unit, and repeats the process of the zone designating unit and the likelihood acquiring device. An identifying unit identifies, from the target voice signal, estimated intervals for which the voice corresponding to the search string is estimated to be spoken, on the basis of the likelihoods acquired for each of the likelihood acquisition zones.

    Abstract translation: 搜索字符串获取单元获取搜索串。 转换单元将搜索字符串转换为音素序列。 时间长度导出单元导出与搜索字符串相对应的语音的语音时间长度。 区域指定单元指定目标语音信号中的可能性获取区域。 可能性获取装置获取表示可能性获取间隔是表示与搜索字符串对应的语音的间隔的可能性。 重复单元改变由区域指定单元指定的可能性获取区域,并且重复区域指定单元和可能性获取设备的处理。 识别单元根据对于每个可能性获取区域获得的可能性,从目标语音信号中识别估计出与搜索串相对应的语音的估计间隔。

    Voice processing device, voice processing method, and non-transitory recording medium that stores program

    公开(公告)号:US10037759B2

    公开(公告)日:2018-07-31

    申请号:US14251201

    申请日:2014-04-11

    Inventor: Hiroyasu Ide

    CPC classification number: G10L17/14

    Abstract: A voice processing device includes: an acquirer which acquires feature quantities of vowel sections included in voice data; a classifier which classifies, among the acquired feature quantities, feature quantities corresponding to a plurality of same vowels into a plurality of clusters for respective vowels with unsupervised classification; and a determiner which determines a combination of clusters corresponding to the same speaker from clusters classified for the plurality of vowels.

Patent Agency Ranking