-
Publication No.: US20210287682A1
Publication Date: 2021-09-16
Application No.: US17255511
Filing Date: 2018-06-27
Applicant: NEC Corporation
Inventor: Ling GUO, Hitoshi YAMAMOTO, Takafumi KOSHINAKA
Abstract: The information processing apparatus (2000) computes a first score representing a degree of similarity between the input sound data (10) and the registrant sound data (22) of the registrant (20). The information processing apparatus (2000) obtains a plurality of pieces of segmented sound data (12) by segmenting the input sound data (10) in the time direction. The information processing apparatus (2000) computes, for each piece of segmented sound data (12), a second score representing the degree of similarity between that piece of segmented sound data (12) and the registrant sound data (22). The information processing apparatus (2000) makes a first determination as to whether the number of speakers of sound included in the input sound data (10) is one or multiple, using at least the second scores. The information processing apparatus (2000) makes a second determination as to whether the input sound data (10) includes the sound of the registrant (20), based on the first score, the second scores, and a result of the first determination.
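The two-stage decision in this abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption rather than the patented method: embeddings are plain lists of floats, similarity is cosine similarity, the whole-input embedding is approximated by the mean of the segment embeddings, the speaker count is guessed from the spread of the second scores, and the names `verify`, `spread_limit`, and `accept` are hypothetical.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def mean_vector(vectors):
    """Element-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def verify(segment_embeddings, registrant_embedding,
           spread_limit=0.3, accept=0.7):
    # First score: whole input vs. registrant. The input embedding is
    # approximated by the mean of the segment embeddings (assumption).
    first_score = cosine(mean_vector(segment_embeddings), registrant_embedding)
    # Second scores: one per time segment.
    second_scores = [cosine(s, registrant_embedding) for s in segment_embeddings]
    # First determination (assumed rule): a wide spread of per-segment
    # scores suggests more than one speaker.
    multiple = (max(second_scores) - min(second_scores)) > spread_limit
    # Second determination (assumed rule): with multiple speakers, trust the
    # best-matching segment; with one speaker, trust the whole-input score.
    decisive = max(second_scores) if multiple else first_score
    return {
        "first_score": first_score,
        "second_scores": second_scores,
        "multiple_speakers": multiple,
        "accepted": decisive >= accept,
    }
```

In this sketch, an input whose segments alternate between the registrant and another speaker is still accepted, because the decision falls back to the best per-segment score once multiple speakers are suspected.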
-
Publication No.: US20250029619A1
Publication Date: 2025-01-23
Application No.: US18686492
Filing Date: 2021-09-08
Applicant: NEC Corporation
Inventor: Ling GUO, Hitoshi YAMAMOTO
Abstract: An authentication apparatus includes: a calculation unit that calculates, from an air conduction sound signal indicating an air conduction sound of a voice of a target person and a bone conduction sound signal indicating a bone conduction sound of the voice of the target person, an air conduction feature quantity that is a feature quantity of the air conduction sound signal and a bone conduction feature quantity that is a feature quantity of the bone conduction sound signal, and that calculates a target feature quantity that is a feature quantity of the voice of the target person by combining the air conduction feature quantity and the bone conduction feature quantity; and an authentication unit that authenticates the target person on the basis of the target feature quantity.
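As a rough illustration of combining the two feature quantities, the sketch below concatenates L2-normalized air conduction and bone conduction feature vectors and compares the result with an enrolled vector. The abstract does not say how the combination or the comparison is done, so concatenation, cosine scoring, the `threshold` value, and the function names are all assumptions.

```python
import math

def l2_normalize(v):
    """Scale a vector to unit length (returned unchanged if all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm else list(v)

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def combine_features(air_feature, bone_feature):
    # Assumed combination rule: normalize each channel so neither dominates,
    # then concatenate into a single target feature vector.
    return l2_normalize(air_feature) + l2_normalize(bone_feature)

def authenticate(air_feature, bone_feature, enrolled_feature, threshold=0.8):
    # Assumed decision rule: accept when the combined target feature is
    # close enough to the feature stored at enrollment.
    target = combine_features(air_feature, bone_feature)
    return cosine(target, enrolled_feature) >= threshold
```

A usage pattern under the same assumptions: build `enrolled_feature` with `combine_features` at enrollment time, then call `authenticate` with the features extracted from a new utterance.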
-
Publication No.: US20220238097A1
Publication Date: 2022-07-28
Application No.: US17616224
Filing Date: 2019-06-07
Applicant: NEC Corporation
Inventor: Ling GUO, Hitoshi YAMAMOTO, Takafumi KOSHINAKA
Abstract: A speech processing device includes: first segment means for dividing first speech into a plurality of first speech segments; second segment means for dividing second speech into a plurality of second speech segments; primary speaker recognition means for calculating scores indicating similarities between the plurality of first and second speech segments; threshold value calculation means for calculating a threshold value based on scores indicating similarities between the plurality of first speech segments; speaker clustering means for classifying each of the plurality of second speech segments into one or more clusters having a similarity higher than the similarity indicated by the threshold value; and secondary speaker recognition means for calculating a similarity between each of the one or more clusters and the first speech and determining based on a result of the calculation whether speech corresponding to the first speech is contained in any of the one or more clusters.
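The threshold, clustering, and secondary recognition steps can be sketched as follows. This is a simplified illustration under stated assumptions, not the claimed device: segments are already given as embedding vectors, similarity is cosine similarity, the threshold is the minimum pairwise similarity among the first speech segments scaled by a hypothetical `margin`, clustering is a greedy single pass, and the primary recognition scores between first and second segments are omitted for brevity.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def mean_vector(vectors):
    """Element-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def threshold_from_first(first_segments, margin=0.9):
    # Assumed rule: segments of the first speech share one speaker, so their
    # lowest pairwise similarity (scaled by a margin) bounds "same speaker".
    if len(first_segments) < 2:
        raise ValueError("need at least two first-speech segments")
    scores = [cosine(a, b)
              for i, a in enumerate(first_segments)
              for b in first_segments[i + 1:]]
    return margin * min(scores)

def cluster(segments, threshold):
    # Greedy clustering (assumption): join the first cluster whose centroid
    # is more similar than the threshold, otherwise start a new cluster.
    clusters = []
    for seg in segments:
        for members in clusters:
            if cosine(seg, mean_vector(members)) > threshold:
                members.append(seg)
                break
        else:
            clusters.append([seg])
    return clusters

def contains_first_speaker(first_segments, second_segments, accept=0.8):
    # Secondary recognition (assumed rule): compare each cluster centroid
    # with the centroid of the first speech.
    thr = threshold_from_first(first_segments)
    first = mean_vector(first_segments)
    return any(cosine(mean_vector(c), first) >= accept
               for c in cluster(second_segments, thr))
```

Deriving the threshold from the first speech itself, rather than fixing it in advance, is the design point this sketch tries to reflect: the clustering granularity adapts to how consistent the known speaker's own segments are.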
-
-