SPEAKER RECOGNITION SYSTEM AND METHOD OF USING THE SAME

    公开(公告)号:US20200258527A1

    公开(公告)日:2020-08-13

    申请号:US16270597

    申请日:2019-02-08

    Abstract: A speaker recognition system includes a non-transitory computer readable medium configured to store instructions. The speaker recognition system further includes a processor connected to the non-transitory computer readable medium. The processor is configured to execute the instructions for extracting acoustic features from each frame of a plurality of frames in input speech data. The processor is configured to execute the instructions for calculating a saliency value for each frame of the plurality of frames using a first neural network (NN) based on the extracted acoustic features, wherein the first NN is a trained NN using speaker posteriors. The processor is configured to execute the instructions for extracting a speaker feature using the saliency value for each frame of the plurality of frames.

    AUTHENTICATION DEVICE AND AUTHENTICATION METHOD

    公开(公告)号:US20210133302A1

    公开(公告)日:2021-05-06

    申请号:US16475730

    申请日:2017-03-23

    Abstract: An authentication device is provided with: a plurality of attribute-dependent score calculation units each calculating an attribute-dependent score dependent on a prescribed attribute for input data; an attribute-independent score calculation unit for calculating an attribute-independent score independent of the attribute for the input data; an attribute estimation unit for performing attribute estimation for the input data; and a score integration unit for determining a score weight of each of a plurality of attribute-dependent scores and of the attribute-independent score using the result of the attribute estimation and calculating an output score using the attribute-dependent scores, the attribute-independent score, and the determined score weights.

    SIGNAL PROCESSING SYSTEM, SIGNAL PROCESSING DEVICE, SIGNAL PROCESSING METHOD, AND RECORDING MEDIUM

    公开(公告)号:US20210050021A1

    公开(公告)日:2021-02-18

    申请号:US16976600

    申请日:2019-03-13

    Abstract: A feature vector having high class identification capability is generated. A signal processing system provided with: a first generation unit for generating a first feature vector on the basis of one of time-series voice data, meteorological data, sensor data, and text data, or on the basis of a feature quantity of one of these; a weight calculation unit for calculating a weight for the first feature vector; a statistical amount calculation unit for calculating a weighted average vector and a weighted high-order statistical vector of second or higher order using the first feature vector and the weight; and a second generation unit for generating a second feature vector using the weighted high-order statistical vector.

    CONVERSATION ANALYSIS DEVICE AND CONVERSATION ANALYSIS METHOD
    6.
    发明申请
    CONVERSATION ANALYSIS DEVICE AND CONVERSATION ANALYSIS METHOD 审中-公开
    对流分析装置和对流分析方法

    公开(公告)号:US20150310877A1

    公开(公告)日:2015-10-29

    申请号:US14438953

    申请日:2013-08-21

    Abstract: This conversation analysis device comprises: a change detection unit that detects, for each of a plurality of conversation participants, each of a plurality of prescribed change patterns for emotional states, on the basis of data corresponding to voices in a target conversation; an identification unit that identifies, from among the plurality of prescribed change patterns detected by the change detection unit, a beginning combination and an ending combination, which are prescribed combinations of the prescribed change patterns that satisfy prescribed position conditions between the plurality of conversation participants; and an interval determination unit that determines specific emotional intervals, which have a start time and an end time and represent specific emotions of the conversation participants of the target conversation, by determining a start time and an end time on the basis of each time position in the target conversation pertaining to the starting combination and ending combination identified by the identification unit.

    Abstract translation: 该对话分析装置包括:变更检测单元,根据对应于目标会话中的语音的数据,针对多个对话参与者中的每一个,检测用于情感状态的多个规定变化模式中的每一个; 识别单元,其从所述变化检测单元检测到的所述多个规定变化模式中识别作为所述多个对话参与者之间满足规定位置条件的规定变化模式的规定组合的开始组合和结束组合; 以及间隔确定单元,其通过基于每个时间位置确定开始时间和结束时间来确定具有开始时间和结束时间并且表示目标会话的对话参与者的特定情绪的特定情绪间隔 关于由识别单元识别的起始组合和结束组合的目标会话。

    VOICE DETECTION APPARATUS, VOICE DETECTION METHOD, AND RECORDING MEDIUM

    公开(公告)号:US20250104730A1

    公开(公告)日:2025-03-27

    申请号:US18728141

    申请日:2022-03-22

    Abstract: A voice detection apparatus includes: a beginning determination unit that determines a beginning of a voice segment including a voice that appears in a voice signal; an end determination unit that determines an end of the voice segment by determining whether or not a length of a non-voice segment that appears after the beginning is determined, is greater than or equal to a threshold; and a setting unit that sets the threshold on the basis of a property of a provisional voice segment starting from the beginning.

    BIOMETRIC AUTHENTICATION DEVICE, BIOMETRIC AUTHENTICATION METHOD, AND RECORDING MEDIUM

    公开(公告)号:US20240038241A1

    公开(公告)日:2024-02-01

    申请号:US18483881

    申请日:2023-10-10

    CPC classification number: G10L17/06 A61B5/117 G06F21/32

    Abstract: A biometric authentication device is provided with: a replay unit for reproducing a sound; an ear authentication unit for acquiring a reverberation sound of the sound in an ear of a user to be authenticated, extracting an ear acoustic feature from the reverberation sound, and calculating an ear authentication score by comparing the extracted ear acoustic feature with an ear acoustic feature stored in advance; a voice authentication unit for extracting a talker feature from a voice of the user that has been input, and calculating a voice authentication score by comparing the extracted talker feature with a talker feature stored in advance; and an authentication integration unit for outputting an authentication integration result calculated based on the ear authentication score and the voice authentication score. After the sound is output into the ear, a recording unit inputs the voice of the user.

    INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND RECORDING MEDIUM

    公开(公告)号:US20220383113A1

    公开(公告)日:2022-12-01

    申请号:US17771954

    申请日:2019-11-12

    Abstract: The information processing device is provided in a feature extraction block in a neural network. The information processing device acquires a local feature quantity group constituting one unit of information, and computes a weight corresponding to a degree of importance of each local feature quantity. Next, the information processing device computes a weighted statistic for a whole of the local feature quantity group using the computed weights, and deforms and outputs the local feature quantity group using the computed weighted statistic.

    SPEAKER RECOGNITION SYSTEM AND METHOD OF USING THE SAME

    公开(公告)号:US20220130397A1

    公开(公告)日:2022-04-28

    申请号:US17428960

    申请日:2020-02-05

    Abstract: A speaker recognition system includes a non-transitory computer readable medium configured to store instructions. The speaker recognition system further includes a processor connected to the non-transitory computer readable medium. The processor is configured to execute the instructions for extracting acoustic features from each frame of a plurality of frames in input speech data. The processor is configured to execute the instructions for calculating a saliency value for each frame of the plurality of frames using a first neural network (NN) based on the extracted acoustic features, wherein the first NN is a trained NN using speaker posteriors. The processor is configured to execute the instructions for extracting a speaker feature using the saliency value for each frame of the plurality of frames.

Patent Agency Ranking