Patent search: ipc:"G10L17/08" (page 1)

1.

Granted invention patent
Voice and speech recognition for call center feedback and quality assurance (Status: Active)

Publication No.: US12088761B2

Publication date: 2024-09-10

Application No.: US18203094

Filing date: 2023-05-30

Applicant: STATE FARM MUTUAL AUTOMOBILE INSURANCE COMPANY

Inventor: Sylvia Hernandez

IPC classification: H04M3/51, G06F40/205, G10L15/26, G10L17/04, G10L17/08, G10L25/51, G10L25/63, G10L25/87, H04M3/42

CPC classification: H04M3/5175, G06F40/205, G10L17/04, G10L17/08, G10L25/51, G10L25/63, H04M3/42, G10L15/26, G10L25/87, H04M2201/40, H04M2201/41, H04M2203/40, H04M2203/401

Abstract: A computer-implemented method for providing an objective evaluation to a customer service representative regarding his performance during an interaction with a customer may include receiving a digitized data stream corresponding to a spoken conversation between a customer and a representative; converting the data stream to a text stream; generating a representative transcript that includes the words from the text stream that are spoken by the representative; comparing the representative transcript with a plurality of positive words and a plurality of negative words; and generating a score that varies according to the occurrence of each word spoken by the representative that matches one of the positive words, and/or the occurrence of each word spoken by the representative that matches one of the negative words. Tone of voice, as well as response time, during the interaction may also be monitored and analyzed to adjust the score, or generate a separate score.
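The word-matching step in this abstract can be illustrated with a minimal sketch. The word lists, the function name `score_transcript`, and the +1/-1 weighting are all assumptions made for illustration; the abstract does not specify the lists or how the score varies with each match.

```python
import re

# Hypothetical word lists; the abstract does not enumerate them.
POSITIVE_WORDS = {"thanks", "please", "happy", "glad"}
NEGATIVE_WORDS = {"no", "cannot", "unfortunately"}


def score_transcript(transcript, positive=POSITIVE_WORDS, negative=NEGATIVE_WORDS):
    """Score a representative transcript by counting word matches.

    Assumed weighting: +1 per positive-word occurrence, -1 per
    negative-word occurrence.
    """
    # Lowercase and split into simple word tokens.
    words = re.findall(r"[a-z']+", transcript.lower())
    positive_hits = sum(1 for word in words if word in positive)
    negative_hits = sum(1 for word in words if word in negative)
    return positive_hits - negative_hits


if __name__ == "__main__":
    print(score_transcript("Thanks for calling, I am happy to help."))
```

Tone of voice and response time, which the abstract also mentions, would need audio-level features and are not covered by this text-only sketch.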

2.

Published invention application
PROVIDING ACCESS WITH A PORTABLE DEVICE AND VOICE COMMANDS (Status: Under examination, published)

Publication No.: US20240112683A1

Publication date: 2024-04-04

Application No.: US18520034

Filing date: 2023-11-27

Applicant: Tamiras Per Pte. Ltd., LLC

Inventor: Richard B. Himmelstein

IPC classification: G10L17/08, B60R25/24, B60R25/25, G10L15/26, G10L17/22

CPC classification: G10L17/08, B60R25/241, B60R25/257, G10L15/26, G10L17/22, G10L2015/223

Abstract: A system comprises a plurality of transponders, each having a user interface to receive commands from a plurality of users and to operate a virtual assistant utilizing artificial intelligence. A micro-computer in an automobile implements the virtual assistant and is configured to identify a particular user from the plurality of users based on a spoken command received via the plurality of transponders. The micro-computer can further hold conversations with the particular user, including conversations regarding the automobile, and respond to future commands or questions from the particular user based on prior commands or questions received from that user and stored in a memory of the micro-computer.

3.

Granted invention patent
Authenticating a user (Status: Active)

Publication No.: US11869513B2

Publication date: 2024-01-09

Application No.: US17142775

Filing date: 2021-01-06

Applicant: VERIDAS DIGITAL AUTHENTICATION SOLUTIONS, S.L.

Inventors: Iván López Espejo, Santiago Prieto Calero, Ana Iriarte Ruiz, David Roncal Redín, Miguel Ángel Sánchez Yoldi, Eduardo Azanza Ladrón

IPC classification: G10L17/08, G10L17/04, G10L17/22, G06F21/32, H04L9/40, G10L17/12, G10L25/60

CPC classification: G10L17/08, G06F21/32, G10L17/04, G10L17/22, H04L63/0861, G10L17/12, G10L25/60

Abstract: Methods of authenticating a user or speaker are provided. These methods include obtaining an input speech signal and user credentials identifying the user or speaker. The input speech signal includes a single-channel signal or a multi-channel speech signal. The methods further include extracting a speech voiceprint from the input speech signal, and retrieving a reference voiceprint associated with the user credentials. The methods still further include determining a voiceprint correspondence between the speech voiceprint and the reference voiceprint, and authenticating the user or speaker depending on said voiceprint correspondence. The methods yet further include updating the reference voiceprint depending on the speech voiceprint corresponding to the authenticated user or speaker. Computer programs, systems and computing systems are also provided which are suitable for performing said methods of authenticating a user or speaker.
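The compare-then-update flow in this abstract might look roughly like the sketch below. It assumes voiceprints are non-zero numeric vectors, uses cosine similarity as the "voiceprint correspondence", and updates the reference with an exponential moving average. The function names, the threshold and the `alpha` weight are hypothetical; the abstract names none of these choices, and voiceprint extraction itself is out of scope here.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two non-zero vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def authenticate_and_update(references, user_id, speech_voiceprint,
                            threshold=0.8, alpha=0.1):
    """Authenticate against the stored reference; on success, refresh it.

    `references` maps user credentials to reference voiceprints.
    `threshold` and `alpha` are illustrative values, not from the patent.
    """
    reference = references[user_id]
    score = cosine_similarity(speech_voiceprint, reference)
    if score < threshold:
        return False
    # Assumed update rule: move the reference slightly toward the new sample.
    references[user_id] = [
        (1 - alpha) * r + alpha * s
        for r, s in zip(reference, speech_voiceprint)
    ]
    return True


if __name__ == "__main__":
    refs = {"alice": [1.0, 0.0]}
    print(authenticate_and_update(refs, "alice", [0.9, 0.1]))
```

Updating only after a successful match keeps a rejected sample from shifting the stored reference.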

4.

Granted invention patent
Robust spoofing detection system using deep residual neural networks (Status: Active)

Publication No.: US11862177B2

Publication date: 2024-01-02

Application No.: US17155851

Filing date: 2021-01-22

Applicant: PINDROP SECURITY, INC.

Inventors: Tianxiang Chen, Elie Khoury

IPC classification: G10L17/18, G10L17/02, G10L17/22, G10L17/04, G10L17/08

CPC classification: G10L17/18, G10L17/02, G10L17/04, G10L17/08, G10L17/22

Abstract: Embodiments described herein provide for systems and methods for implementing a neural network architecture for spoof detection in audio signals. The neural network architecture contains layers defining embedding extractors that extract embeddings from input audio signals. Spoofprint embeddings are generated for particular system enrollees to detect attempts to spoof the enrollee's voice. Optionally, voiceprint embeddings are generated for the system enrollees to recognize the enrollee's voice. The voiceprints are extracted using features related to the enrollee's voice. The spoofprints are extracted using features related to how the enrollee speaks and other artifacts. The spoofprints facilitate detection of efforts to fool voice biometrics using synthesized speech (e.g., deepfakes) that spoof and emulate the enrollee's voice.

5.

Published invention application
A DEEP NEURAL NETWORK TRAINING METHOD AND APPARATUS FOR SPEAKER VERIFICATION (Status: Under examination, published)

Publication No.: US20230206926A1

Publication date: 2023-06-29

Application No.: US17926605

Filing date: 2020-09-21

Applicant: Northwestern Polytechnical University

Inventors: Zhongxin BAI, Xiao-Lei ZHANG, Jingdong CHEN

IPC classification: G10L17/04, G10L17/18, G10L17/02, G10L17/08

CPC classification: G10L17/04, G10L17/18, G10L17/02, G10L17/08

Abstract: A feature extraction deep neural network (DNN) may be trained based on the minimization of a loss function. A similarity function may be specified to calculate a similarity score for two representations of verbal utterances. A training data set comprising pairs of representations of utterances is received, wherein each one of the pairs of representations of utterances is associated with a corresponding ground-truth label confirming whether the pair of represented utterances come from the same speaker or not. A respective similarity score may then be calculated for each one of the pairs of representations of utterances. Parameters associated with the DNN may then be updated based on minimizing a loss function associated with an area under a section of a receiver-operating-characteristic (ROC) curve for the similarity scores, wherein the ROC curve section is delimited between a low false positive rate (FPR) value and a high FPR value.
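One way to picture a loss tied to a section of the ROC curve is the sketch below. It is a generic pairwise surrogate, not necessarily the formulation claimed in the patent: different-speaker pairs are ranked by similarity score, the ones whose rank falls inside the FPR window are kept, and a squared hinge penalizes same-speaker pairs that do not beat them by a margin. The function name, the squared-hinge form and all default values are assumptions. Only the loss value is computed; the gradient step that would update the DNN parameters is omitted.

```python
import math


def partial_auc_loss(scores, labels, fpr_low=0.0, fpr_high=0.1, margin=1.0):
    """Squared-hinge surrogate for the ROC area between two FPR values.

    scores: similarity score for each pair of utterances.
    labels: 1 if the pair comes from the same speaker, else 0.
    """
    positives = [s for s, y in zip(scores, labels) if y == 1]
    # Highest-scoring negatives first: these generate the low-FPR errors.
    negatives = sorted((s for s, y in zip(scores, labels) if y == 0),
                       reverse=True)
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative pair")

    # Map the FPR window onto ranks in the sorted negative list.
    start = min(math.floor(fpr_low * len(negatives)), len(negatives) - 1)
    stop = max(start + 1, math.ceil(fpr_high * len(negatives)))
    hard_negatives = negatives[start:stop]

    # Penalize every (positive, hard negative) pair ranked within the margin.
    losses = [max(0.0, margin - (p - n)) ** 2
              for p in positives for n in hard_negatives]
    return sum(losses) / len(losses)


if __name__ == "__main__":
    print(partial_auc_loss([0.9, 0.8, 0.7, 0.1, 0.0, -0.2],
                           [1, 1, 0, 0, 0, 0],
                           fpr_high=0.5, margin=0.5))
```

Restricting the negatives to the FPR window focuses training on the operating region of interest rather than on the whole ROC curve.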

6.

Granted invention patent
Sample-efficient representation learning for real-time latent speaker state characterization (Status: Active)

Publication No.: US11646037B2

Publication date: 2023-05-09

Application No.: US17115382

Filing date: 2020-12-08

Applicant: OTO Systems Inc.

Inventors: Valentin Alain Jean Perret, Nicolas Lucien Perony, Nándor Kedves

IPC classification: G10L17/18, G10L17/02, G06N3/04, G06N3/08, G06N3/049, G06N3/045, G06N3/048, G10L17/08

CPC classification: G10L17/18, G06N3/045, G06N3/048, G06N3/049, G06N3/08, G10L17/02, G10L17/08

Abstract: Systems, methods, and non-transitory computer-readable media can provide audio waveform data that corresponds to a voice sample to a temporal convolutional network for evaluation. The temporal convolutional network can pre-process the audio waveform data and can output an identity embedding associated with the audio waveform data. The identity embedding associated with the voice sample can be obtained from the temporal convolutional network. Information describing a speaker associated with the voice sample can be determined based at least in part on the identity embedding.

7.

Granted invention patent
Method and device with data recognition (Status: Active)

Publication No.: US11574641B2

Publication date: 2023-02-07

Application No.: US16845464

Filing date: 2020-04-10

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Sung-Un Park, Kyuhong Kim

IPC classification: G10L17/18, G10L17/02, G06K9/62, G10L17/08, G10L17/06, G06N3/08, G06V10/46, G06V40/16, G06F21/32

Abstract: A processor-implemented method with data recognition includes: extracting input feature data from input data; calculating a matching score between the extracted input feature data and enrolled feature data of an enrolled user, based on the extracted input feature data, common component data of a plurality of enrolled feature data corresponding to the enrolled user, and distribution component data of the plurality of enrolled feature data corresponding to the enrolled user; and recognizing the input data based on the matching score.

8.

Granted invention patent
Enrollment in speaker recognition system (Status: Active)

Publication No.: US11468899B2

Publication date: 2022-10-11

Application No.: US16188629

Filing date: 2018-11-13

Applicant: Cirrus Logic International Semiconductor Ltd.

Inventors: John Paul Lesso, Ben Hopson

IPC classification: G10L17/04, G06K9/62, G10L17/22, G10L17/08, G10L17/16

Abstract: A method of enrolling a user in a speaker recognition system comprises receiving a sample of the user's speech. A trial voice print is generated from the sample of the user's speech. A score is obtained relating to the trial voice print. The user is enrolled on the basis of the trial voice print only if the score meets a predetermined criterion.
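The gating logic in this abstract can be sketched as follows. How the score is obtained is not stated, so it is passed in as a caller-supplied function; `try_enroll`, `score_fn` and the `min_score` criterion are hypothetical names and values used only for illustration.

```python
def try_enroll(enrolled, user_id, trial_voiceprint, score_fn, min_score=0.5):
    """Enroll the user only if the trial voice print's score meets the criterion.

    enrolled: dict mapping user IDs to stored voice prints.
    score_fn: callable returning a score for a trial voice print
              (assumed; the abstract does not say how it is computed).
    """
    score = score_fn(trial_voiceprint)
    if score >= min_score:
        enrolled[user_id] = trial_voiceprint
        return True
    # Criterion not met: leave the enrollment store unchanged.
    return False


if __name__ == "__main__":
    store = {}
    print(try_enroll(store, "bob", [0.1, 0.2], lambda voiceprint: 0.9))
```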

9.

Invention application
AUTOMATIC GENERATION AND/OR USE OF TEXT-DEPENDENT SPEAKER VERIFICATION FEATURES (Status: Active)

Publication No.: US20220215845A1

Publication date: 2022-07-07

Application No.: US17700135

Filing date: 2022-03-21

Applicant: GOOGLE LLC

Inventors: Matthew Sharifi, Victor Carbune

IPC classification: G10L17/08, G10L17/22

Abstract: Implementations relate to automatic generation of speaker features for each of one or more particular text-dependent speaker verifications (TD-SVs) for a user. Implementations can generate speaker features for a particular TD-SV using instances of audio data that each capture a corresponding spoken utterance of the user during normal non-enrollment interactions with an automated assistant via one or more respective assistant devices. For example, a portion of an instance of audio data can be used in response to: (a) determining that recognized term(s) for the spoken utterance captured by that portion correspond to the particular TD-SV; and (b) determining that an authentication measure, for the user and for the spoken utterance, satisfies a threshold. Implementations additionally or alternatively relate to utilization of speaker features, for each of one or more particular TD-SVs for a user, in determining whether to authenticate a spoken utterance for the user.

10.

Invention application
VOICE AND SPEECH RECOGNITION FOR CALL CENTER FEEDBACK AND QUALITY ASSURANCE (Status: Active)

Publication No.: US20220201122A1

Publication date: 2022-06-23

Application No.: US17690099

Filing date: 2022-03-09

Applicant: STATE FARM MUTUAL AUTOMOBILE INSURANCE COMPANY

Inventor: Sylvia Hernandez

IPC classification: H04M3/51, G10L25/63, H04M3/42, G10L25/51, G06F40/205, G10L17/04, G10L17/08

Abstract: A computer-implemented method for providing an objective evaluation to a customer service representative regarding his performance during an interaction with a customer may include receiving a digitized data stream corresponding to a spoken conversation between a customer and a representative; converting the data stream to a text stream; generating a representative transcript that includes the words from the text stream that are spoken by the representative; comparing the representative transcript with a plurality of positive words and a plurality of negative words; and generating a score that varies according to the occurrence of each word spoken by the representative that matches one of the positive words, and/or the occurrence of each word spoken by the representative that matches one of the negative words. Tone of voice, as well as response time, during the interaction may also be monitored and analyzed to adjust the score, or generate a separate score.
