-
公开(公告)号:US12088761B2
公开(公告)日:2024-09-10
申请号:US18203094
申请日:2023-05-30
发明人: Sylvia Hernandez
IPC分类号: H04M3/51 , G06F40/205 , G10L15/26 , G10L17/04 , G10L17/08 , G10L25/51 , G10L25/63 , G10L25/87 , H04M3/42
CPC分类号: H04M3/5175 , G06F40/205 , G10L17/04 , G10L17/08 , G10L25/51 , G10L25/63 , H04M3/42 , G10L15/26 , G10L25/87 , H04M2201/40 , H04M2201/41 , H04M2203/40 , H04M2203/401
摘要: A computer-implemented method for providing an objective evaluation to a customer service representative regarding his performance during an interaction with a customer may include receiving a digitized data stream corresponding to a spoken conversation between a customer and a representative; converting the data stream to a text stream; generating a representative transcript that includes the words from the text stream that are spoken by the representative; comparing the representative transcript with a plurality of positive words and a plurality of negative words; and generating a score that varies according to the occurrence of each word spoken by the representative that matches one of the positive words, and/or the occurrence of each word spoken by the representative that matches one of the negative words. Tone of voice, as well as response time, during the interaction may also be monitored and analyzed to adjust the score, or generate a separate score.
-
公开(公告)号:US20240112683A1
公开(公告)日:2024-04-04
申请号:US18520034
申请日:2023-11-27
CPC分类号: G10L17/08 , B60R25/241 , B60R25/257 , G10L15/26 , G10L17/22 , G10L2015/223
摘要: A system comprising a plurality of transponders, each having a user interface to receive commands from a plurality of users and to operate a virtual assistant utilizing artificial intelligence. A micro-computer in an automobile that implements the virtual assistant, wherein the micro-computer configured to identify a particular user from the plurality of users based on a spoken command received via the plurality of transponders. Further, have conversations, including regarding the automobile, between the particular user and respond to future commands or questions from particular user based on prior commands or questions received from the particular user that are stored in a memory of the micro-computer.
-
公开(公告)号:US11869513B2
公开(公告)日:2024-01-09
申请号:US17142775
申请日:2021-01-06
发明人: Iván López Espejo , Santiago Prieto Calero , Ana Iriarte Ruiz , David Roncal Redín , Miguel Ángel Sánchez Yoldi , Eduardo Azanza Ladrón
摘要: Methods of authenticating a user or speaker are provided. These methods include obtaining an input speech signal and user credentials identifying the user or speaker. The input speech signal includes a single-channel signal or a multi-channel speech signal. The methods further include extracting a speech voiceprint from the input speech signal, and retrieving a reference voiceprint associated to the user credentials. The methods still further include determining a voiceprint correspondence between the speech voiceprint and the reference voiceprint, and authenticating the user or speaker depending on said voiceprint correspondence. The methods yet further include updating the reference voiceprint depending on the speech voiceprint corresponding to the authenticated user or speaker. Computer programs, systems and computing systems are also provided which are suitable for performing said methods of authenticating a user or speaker.
-
公开(公告)号:US11862177B2
公开(公告)日:2024-01-02
申请号:US17155851
申请日:2021-01-22
发明人: Tianxiang Chen , Elie Khoury
摘要: Embodiments described herein provide for systems and methods for implementing a neural network architecture for spoof detection in audio signals. The neural network architecture contains a layers defining embedding extractors that extract embeddings from input audio signals. Spoofprint embeddings are generated for particular system enrollees to detect attempts to spoof the enrollee's voice. Optionally, voiceprint embeddings are generated for the system enrollees to recognize the enrollee's voice. The voiceprints are extracted using features related to the enrollee's voice. The spoofprints are extracted using features related to features of how the enrollee speaks and other artifacts. The spoofprints facilitate detection of efforts to fool voice biometrics using synthesized speech (e.g., deepfakes) that spoof and emulate the enrollee's voice.
-
公开(公告)号:US20230206926A1
公开(公告)日:2023-06-29
申请号:US17926605
申请日:2020-09-21
发明人: Zhongxin BAI , Xiao-Lei ZHANG , Jingdong CHEN
摘要: A feature extraction deep neural network (DNN) may be trained based on the minimization of a loss function. A similarity function may be specified to calculate a similarity score for two representations of verbal utterances. A training data set comprising pairs of representations of utterances is received, wherein each one of the pairs of representations of utterances is associated with a corresponding a ground-truth label confirming whether the pair of represented utterances come from a same speaker or not. A respective similarity score may then be calculated for each one of the pairs of representations of utterances. Parameters associated with the DNN may then be updated based on minimizing a loss function associated with an area under a section of a receiver-operating-characteristic (ROC) curve for the similarity scores, wherein the ROC curve section is delimited between a low false positive rate (FPR) value and a high FPR value.
-
6.
公开(公告)号:US11646037B2
公开(公告)日:2023-05-09
申请号:US17115382
申请日:2020-12-08
申请人: OTO Systems Inc.
摘要: Systems, methods, and non-transitory computer-readable media can provide audio waveform data that corresponds to a voice sample to a temporal convolutional network for evaluation. The temporal convolutional network can pre-process the audio waveform data and can output an identity embedding associated with the audio waveform data. The identity embedding associated with the voice sample can be obtained from the temporal convolutional network. Information describing a speaker associated with the voice sample can be determined based at least in part on the identity embedding.
-
公开(公告)号:US11574641B2
公开(公告)日:2023-02-07
申请号:US16845464
申请日:2020-04-10
发明人: Sung-Un Park , Kyuhong Kim
IPC分类号: G10L17/18 , G10L17/02 , G06K9/62 , G10L17/08 , G10L17/06 , G06N3/08 , G06V10/46 , G06V40/16 , G06F21/32
摘要: A processor-implemented method with data recognition includes: extracting input feature data from input data; calculating a matching score between the extracted input feature data and enrolled feature data of an enrolled user, based on the extracted input feature data, common component data of a plurality of enrolled feature data corresponding to the enrolled user, and distribution component data of the plurality of enrolled feature data corresponding to the enrolled user; and recognizing the input data based on the matching score.
-
公开(公告)号:US11468899B2
公开(公告)日:2022-10-11
申请号:US16188629
申请日:2018-11-13
发明人: John Paul Lesso , Ben Hopson
摘要: A method of enrolling a user in a speaker recognition system comprises receiving a sample of the user's speech. A trial voice print is generated from the sample of the user's speech. A score is obtained relating to the trial voice print. The user is enrolled on the basis of the trial voice print only if the score meets a predetermined criterion.
-
公开(公告)号:US20220215845A1
公开(公告)日:2022-07-07
申请号:US17700135
申请日:2022-03-21
申请人: GOOGLE LLC
发明人: Matthew Sharifi , Victor Carbune
摘要: Implementations relate to automatic generation of speaker features for each of one or more particular text-dependent speaker verifications (TD-SVs) for a user. Implementations can generate speaker features for a particular TD-SV using instances of audio data that each capture a corresponding spoken utterance of the user during normal non-enrollment interactions with an automated assistant via one or more respective assistant devices. For example, a portion of an instance of audio data can be used in response to: (a) determining that recognized term(s) for the spoken utterance captured by that the portion correspond to the particular TD-SV; and (b) determining that an authentication measure, for the user and for the spoken utterance, satisfies a threshold. Implementations additionally or alternatively relate to utilization of speaker features, for each of one or more particular TD-SVs for a user, in determining whether to authenticate a spoken utterance for the user.
-
公开(公告)号:US20220201122A1
公开(公告)日:2022-06-23
申请号:US17690099
申请日:2022-03-09
发明人: Sylvia Hernandez
摘要: A computer-implemented method for providing an objective evaluation to a customer service representative regarding his performance during an interaction with a customer may include receiving a digitized data stream corresponding to a spoken conversation between a customer and a representative; converting the data stream to a text stream; generating a representative transcript that includes the words from the text stream that are spoken by the representative; comparing the representative transcript with a plurality of positive words and a plurality of negative words; and generating a score that varies according to the occurrence of each word spoken by the representative that matches one of the positive words, and/or the occurrence of each word spoken by the representative that matches one of the negative words. Tone of voice, as well as response time, during the interaction may also be monitored and analyzed to adjust the score, or generate a separate score.
-
-
-
-
-
-
-
-
-