Patent search ap:("NEC Corporation") AND inv:"Ling GUO" Page 1

1.

Patent application
INFORMATION PROCESSING APPARATUS, CONTROL METHOD, AND PROGRAM (in force)

Publication No.: US20210287682A1

Publication date: 2021-09-16

Application No.: US17255511

Filing date: 2018-06-27

Applicant: NEC Corporation

Inventor: Ling GUO, Hitoshi YAMAMOTO, Takafumi KOSHINAKA

IPC: G10L17/12, G10L17/20

Abstract: The information processing apparatus (2000) computes a first score representing a degree of similarity between the input sound data (10) and the registrant sound data (22) of the registrant (20). The information processing apparatus (2000) obtains a plurality of pieces of segmented sound data (12) by segmenting the input sound data (10) in the time direction. The information processing apparatus (2000) computes, for each piece of segmented sound data (12), a second score representing the degree of similarity between that segmented sound data (12) and the registrant sound data (22). The information processing apparatus (2000) makes a first determination as to whether the number of speakers of sound included in the input sound data (10) is one or multiple, using at least the second scores. The information processing apparatus (2000) makes a second determination as to whether the input sound data (10) includes the sound of the registrant (20), based on the first score, the second scores, and a result of the first determination.
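
The two-stage decision in this abstract can be illustrated with a minimal Python sketch. Everything beyond the flow of the abstract is an assumption made for illustration: sound data is represented as a list of per-frame feature vectors, similarity is cosine similarity, the speaker-count test is a simple spread of the per-segment scores, and the names `verify`, `spread_limit`, and `accept` are hypothetical. The abstract does not disclose these details.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def mean_vector(frames):
    """Average a list of frame vectors into one vector."""
    return [sum(col) / len(frames) for col in zip(*frames)]

def segment(frames, seg_len):
    """Split the frames into consecutive pieces along the time direction."""
    return [frames[i:i + seg_len] for i in range(0, len(frames), seg_len)]

def verify(input_frames, registrant_vec, seg_len=2, spread_limit=0.3, accept=0.8):
    # First score: the whole input against the registrant.
    first_score = cosine(mean_vector(input_frames), registrant_vec)
    # Second scores: each time segment against the registrant.
    second_scores = [cosine(mean_vector(s), registrant_vec)
                     for s in segment(input_frames, seg_len)]
    # First determination (assumed rule): a wide spread of segment
    # scores suggests more than one speaker.
    multiple_speakers = max(second_scores) - min(second_scores) > spread_limit
    # Second determination (assumed rule): with several speakers, trust the
    # best-matching segment; with one speaker, trust the whole-input score.
    decisive = max(second_scores) if multiple_speakers else first_score
    return {
        "first_score": first_score,
        "second_scores": second_scores,
        "multiple_speakers": multiple_speakers,
        "includes_registrant": decisive >= accept,
    }

registrant = [1.0, 0.0]
mixed = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
print(verify(mixed, registrant))
```

In the mixed example the whole-input score is diluted by the second speaker, so the segment scores carry the decision. That is the situation the abstract's second determination appears designed to handle.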

2.

Patent application
AUTHENTICATION APPARATUS, AUTHENTICATION METHOD, AND RECORDING MEDIUM (in force)

Publication No.: US20250029619A1

Publication date: 2025-01-23

Application No.: US18686492

Filing date: 2021-09-08

Applicant: NEC Corporation

Inventor: Ling GUO, Hitoshi YAMAMOTO

IPC: G10L17/26, G10L25/30

Abstract: An authentication apparatus includes: a calculation unit that calculates, from an air conduction sound signal indicating an air conduction sound of a voice of a target person and a bone conduction sound signal indicating a bone conduction sound of the voice of the target person, an air conduction feature quantity that is a feature quantity of the air conduction sound signal and a bone conduction feature quantity that is a feature quantity of the bone conduction sound signal, and that calculates a target feature quantity that is a feature quantity of the voice of the target person by combining the air conduction feature quantity and the bone conduction feature quantity; and an authentication unit that authenticates the target person on the basis of the target feature quantity.
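
As a rough illustration of the calculation unit and authentication unit described above, the following Python sketch combines the two feature quantities and compares the result with an enrolled template. The abstract only says the features are combined. Concatenating L2-normalised vectors, scoring with cosine similarity, and the names `combine`, `authenticate`, and `threshold` are all assumptions chosen for the sketch.

```python
import math

def l2_normalize(v):
    """Scale a vector to unit length (returned unchanged if all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm else list(v)

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def combine(air_feature, bone_feature):
    """Target feature: concatenation of the two normalised features (assumed)."""
    return l2_normalize(l2_normalize(air_feature) + l2_normalize(bone_feature))

def authenticate(air_feature, bone_feature, enrolled_feature, threshold=0.8):
    """Accept the target person if the combined feature matches the template."""
    return cosine(combine(air_feature, bone_feature), enrolled_feature) >= threshold

# Hypothetical enrolment and two verification attempts.
enrolled = combine([1.0, 0.0], [0.0, 1.0])
print(authenticate([2.0, 0.0], [0.0, 3.0], enrolled))  # same directions as enrolment
print(authenticate([0.0, 1.0], [1.0, 0.0], enrolled))  # different voice features
```

Normalising each feature before concatenation keeps either channel from dominating purely because of its scale. Other combinations, such as weighted sums or a learned fusion layer, would fit the abstract's wording equally well.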

3.

Patent application
SPEECH PROCESSING DEVICE, SPEECH PROCESSING METHOD, AND NON-TRANSITORY COMPUTER READABLE MEDIUM STORING PROGRAM (in force)

Publication No.: US20220238097A1

Publication date: 2022-07-28

Application No.: US17616224

Filing date: 2019-06-07

Applicant: NEC Corporation

Inventor: Ling GUO, Hitoshi YAMAMOTO, Takafumi KOSHINAKA

IPC: G10L15/04, G10L15/08, G10L25/51

Abstract: A speech processing device includes: first segment means for dividing first speech into a plurality of first speech segments; second segment means for dividing second speech into a plurality of second speech segments; primary speaker recognition means for calculating scores indicating similarities between the plurality of first and second speech segments; threshold value calculation means for calculating a threshold value based on scores indicating similarities between the plurality of first speech segments; speaker clustering means for classifying each of the plurality of second speech segments into one or more clusters having a similarity higher than the similarity indicated by the threshold value; and secondary speaker recognition means for calculating a similarity between each of the one or more clusters and the first speech and determining based on a result of the calculation whether speech corresponding to the first speech is contained in any of the one or more clusters.
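
The pipeline in this abstract (segment, derive a threshold from the first speech, cluster the second speech, then compare clusters with the first speech) can be sketched in Python as follows. The sketch assumes each speech segment has already been reduced to a feature vector, uses cosine similarity as the score, takes the lowest pairwise similarity among the first speech segments as the threshold, clusters greedily, and reuses the same threshold in the secondary recognition step. None of these choices, nor the names `threshold_from`, `cluster`, and `find_target`, come from the abstract.

```python
import math
from itertools import combinations

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def centroid(vectors):
    """Element-wise mean of a list of vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]

def threshold_from(first_segments):
    """Assumed rule: lowest similarity seen within the first speech.
    Requires at least two first speech segments."""
    return min(cosine(a, b) for a, b in combinations(first_segments, 2))

def cluster(second_segments, threshold):
    """Greedy clustering: join the first cluster whose centroid is similar enough."""
    clusters = []
    for seg in second_segments:
        for members in clusters:
            if cosine(seg, centroid(members)) >= threshold:
                members.append(seg)
                break
        else:
            clusters.append([seg])  # no cluster matched, start a new one
    return clusters

def find_target(first_segments, second_segments):
    threshold = threshold_from(first_segments)
    clusters = cluster(second_segments, threshold)
    target = centroid(first_segments)
    # Secondary recognition: compare each cluster with the first speech.
    matches = [cosine(centroid(c), target) >= threshold for c in clusters]
    return clusters, matches

first = [[1.0, 0.0], [0.9, 0.1], [1.0, 0.05]]
second = [[1.0, 0.02], [0.0, 1.0], [0.95, 0.05], [0.05, 1.0]]
clusters, matches = find_target(first, second)
print(len(clusters), matches)
```

Deriving the threshold from the first speech itself means the clustering strictness adapts to how variable that speaker's own segments are, which seems to be the point of the threshold value calculation means.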
