-
公开(公告)号:US09940926B2
公开(公告)日:2018-04-10
申请号:US14728528
申请日:2015-06-02
Applicant: International Business Machines Corporation
Inventor: Jonathan H. Connell, II , Etienne Marcheret
CPC classification number: G10L15/075 , G10L15/10 , G10L15/265 , G10L17/00 , G10L17/02 , G10L2015/025
Abstract: A method includes the following steps. An acoustic input is obtained from a user, including issuing a verbal prompt to the user and receiving the acoustic input from the user in response to the verbal prompt. One or more acoustic representations are obtained, wherein the one or more acoustic representations are generated from a list of expected responses to the issued verbal prompt. The acoustic input from the user is compared to the one or more acoustic representations. One or more speech recognition parameters are adjusted based on the comparison.
-
公开(公告)号:US20180096271A1
公开(公告)日:2018-04-05
申请号:US15835807
申请日:2017-12-08
Applicant: AffectLayer, Inc.
Inventor: Roy Raanani , Russell Levy , Dominik Facher , Micha Yochanan Breakstone
CPC classification number: G06Q10/0635 , G06F17/2785 , G06N5/02 , G06N99/005 , G10L15/02 , G10L15/08 , G10L15/22 , G10L17/02 , G10L17/26 , G10L25/51 , G10L25/63 , G10L2015/088 , H04M3/42221 , H04M3/5175 , H04M3/5232 , H04M2201/40 , H04M2203/357 , H04M2203/556
Abstract: The disclosure is directed to automatically determining deals at risk by analyzing conversations of representatives with customers. A risk identification system retrieves recordings of various conversations, extracts features of each of the conversations, and analyzes the features to determine if any of the conversations includes features that are indicative of a deal discussed in that conversation being at risk. By performing such an analysis of conversations, the risk identification system can identify a number of deals that are at risk and generate a report of such deals and notify a consumer user of the risk identification system of such deals.
-
公开(公告)号:US09922668B2
公开(公告)日:2018-03-20
申请号:US14969036
申请日:2015-12-15
Applicant: KnuEdge Incorporated
Inventor: David C. Bradley , Yao Huang Morin , Janis Intoy , Sean O'Connor , Nick Hilton , Massimo Mascaro
IPC: G10L17/26 , G10L25/90 , G10L25/03 , G10L25/27 , G10L17/02 , G10L25/18 , G10L25/06 , G10L25/51 , G10L15/02 , G10L21/0208
CPC classification number: G10L25/90 , G10L15/02 , G10L17/02 , G10L21/0208 , G10L25/03 , G10L25/06 , G10L25/18 , G10L25/27 , G10L25/51
Abstract: An estimate of a fractional chirp rate of a signal may be computed by using multiple frequency representations of the signal. A first frequency representation may be computed using a first fractional chirp rate and a first score may be computed using the first frequency representation that indicates a match between the first fractional chirp rate and a fractional chirp rate of the signal. A second frequency representation may be computed using a second fractional chirp rate and a second score may be computed using the second frequency representation that indicates a match between the second fractional chirp rate and the fractional chirp rate of the signal. The fractional chirp rate of the signal may be estimated using the first score and the second score, for example, by selecting a fractional chirp rate corresponding to a highest score.
-
公开(公告)号:US09911411B2
公开(公告)日:2018-03-06
申请号:US14755596
申请日:2015-06-30
Applicant: International Business Machines Corporation
Inventor: Jonathan H. Connell, II , Etienne Marcheret
CPC classification number: G10L15/075 , G10L15/10 , G10L15/265 , G10L17/00 , G10L17/02 , G10L2015/025
Abstract: A method includes the following steps. An acoustic input is obtained from a user, including issuing a verbal prompt to the user and receiving the acoustic input from the user in response to the verbal prompt. One or more acoustic representations are obtained, wherein the one or more acoustic representations are generated from a list of expected responses to the issued verbal prompt. The acoustic input from the user is compared to the one or more acoustic representations. One or more speech recognition parameters are adjusted based on the comparison.
-
公开(公告)号:US20180046710A1
公开(公告)日:2018-02-15
申请号:US15793691
申请日:2017-10-25
Applicant: AffectLayer, Inc.
Inventor: Roy Raanani , Russell Levy , Micha Yochanan Breadstone
CPC classification number: G06F17/30772 , G06F17/2785 , G06F17/30743 , G06K9/00268 , G06K9/00302 , G06K9/00744 , G06K9/6256 , G06K2009/00738 , G06N7/005 , G06N99/005 , G06Q30/016 , G10L15/02 , G10L15/1815 , G10L15/183 , G10L15/265 , G10L17/02 , G10L17/26 , G10L25/51 , G10L25/63 , H04M3/42221 , H04M3/5175 , H04M3/5232 , H04M2201/40 , H04M2203/301 , H04M2203/305 , H04M2203/357 , H04M2203/556 , H04N7/141
Abstract: The disclosure is directed to automatically generating a playlist of conversations having a specified moment. A moment can be occurrence of a specific event or a specific characteristic in a conversation, or any event that is of specific interest for an application for which the playlist is being generated. For example, a moment can include laughter, fast-talking, objections, response to questions, a discussion on a particular topic such as budget, behavior of a speaker, intent to buy, etc., in a conversation. A moment identification system analyzes each of the conversations to determine if one or more features of a conversation correspond to a specified moment, and includes those of the conversations in the playlist having one or more features that correspond to the specified moment. The playlist may include a portion of a conversation that has the specified moment rather than the entire conversation.
-
公开(公告)号:US20180040325A1
公开(公告)日:2018-02-08
申请号:US15666267
申请日:2017-08-01
Inventor: John Laurence MELANSON , John Paul LESSO
Abstract: This application describes methods and apparatus for generating a prompt to be presented to a user for the user to vocalise as part of speaker recognition. An apparatus according to an embodiment has a selector for selecting at least one vocal prompt element to form at least part of said prompt from a predetermined set of a plurality of vocal prompt elements. The selector is configured to select the vocal prompt element based, at least partly, on an indication of the operating conditions for the biometric speaker recognition, for example background noise. The prompt is selected to be one which will provide a good likelihood of discrimination between users when vocalised and used for speaker recognition in the current operating conditions. The prompt may be issued as part of a verification process for an existing user or an enrolment process for an enrolling user.
-
公开(公告)号:US09875743B2
公开(公告)日:2018-01-23
申请号:US15006575
申请日:2016-01-26
Applicant: Verint Systems Ltd.
Inventor: Alex Gorodetski , Ido Shapira , Ron Wein , Oana Sidi
IPC: G10L15/00 , G10L15/06 , G10L17/00 , G10L17/20 , G10L17/04 , G10L17/16 , G10L17/02 , G10L25/84 , G10L15/26
Abstract: Disclosed herein are methods of diarizing audio data using first-pass blind diarization and second-pass blind diarization that generate speaker statistical models, wherein the first pass-blind diarization is on a per-frame basis and the second pass-blind diarization is on a per-word basis, and methods of creating acoustic signatures for a common speaker based only on the statistical models of the speakers in each audio session.
-
公开(公告)号:US20170364516A1
公开(公告)日:2017-12-21
申请号:US15129590
申请日:2015-12-24
Applicant: Intel Corporation
Inventor: Eric Ariel Shellef , Reshef Shilon , Peter Graff , Jonathan Eng , Guillermo Perez , Juan Manuel Lucas , Martin Henk Van Den Berg
CPC classification number: G06F16/436 , G06K2009/00939 , G10L15/075 , G10L15/183 , G10L15/24 , G10L17/02 , G10L17/22
Abstract: The present disclosure describes dynamically adjusting linguistic models for automatic speech recognition based on biometric information to produce a more reliable speech recognition experience. Embodiments include receiving a speech signal, receiving a biometric signal from a biometric sensor implemented at least partially in hardware, determining a linguistic model based on the biometric signal, and processing the speech signal for speech recognition using the linguistic model based on the biometric signal.
-
公开(公告)号:US20170352345A1
公开(公告)日:2017-12-07
申请号:US15172921
申请日:2016-06-03
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Gakuto Kurata , Masayuki A. Suzuki
Abstract: Methods and a system are provided for estimating automatic speech recognition (ASR) accuracy. A method includes obtaining transcriptions of utterances in a conversation over two channels. The method further includes sorting the transcriptions along a time axis using a forced alignment. The method also includes training a language model with the sorted transcriptions. The method additionally includes performing ASR for utterances in a conversation between a first user and a second user. The second user is a target of ASR accuracy estimation. The method further includes determining whether an ASR result of the second user is consistent or inconsistent with an ASR result of the first user using the trained language model. The method also includes estimating the ASR result of the second user as poor responsive to the ASR result of the second user being as inconsistent with the ASR result of the first user.
-
公开(公告)号:US09837078B2
公开(公告)日:2017-12-05
申请号:US13673187
申请日:2012-11-09
Applicant: Mattersight Corporation
Inventor: Roger Warford , Douglas Brown , Christopher Danson , David Gustafson
IPC: G10L15/20 , G10L17/00 , G10L25/78 , G10L17/02 , G10L17/04 , G10L17/06 , G10L25/27 , G10L25/51 , G10L15/00 , G10L21/00 , G06F17/00 , G11B7/00 , H04R25/00 , H04R29/00 , H04M11/00
CPC classification number: G10L17/005 , G10L17/02 , G10L17/04 , G10L17/06 , G10L25/27 , G10L25/51 , G10L25/78 , G10L2025/783
Abstract: The methods, apparatus, and systems described herein are designed to identify fraudulent callers. A voice print of a call is created and compared to known voice prints to determine if it matches one or more of the known voice prints. The methods include a pre-processing step to separate speech from non-speech, selecting a number of elements that affect the voice print the most, and/or computing an adjustment factor based on the scores of each received voice print against known voice prints.
-
-
-
-
-
-
-
-
-