-
公开(公告)号:US11508355B1
公开(公告)日:2022-11-22
申请号:US16172115
申请日:2018-10-26
Applicant: Interactions LLC
Inventor: Ryan Price , Srinivas Bangalore
IPC: G10L15/06 , G10L15/22 , G10L15/02 , G10L21/0232 , G10L15/18
Abstract: Systems and methods are disclosed herein for discerning aspects of user speech to determine user intent and/or other acoustic features of a sound input without the use of an ASR engine. To this end, a processor may receive a sound signal comprising raw acoustic data from a client device, and divides the data into acoustic units. The processor feeds the acoustic units through a first machine learning model to obtain a first output and determines a first mapping, using the first output, of each respective acoustic unit to a plurality of candidate representations of the respective acoustic unit. The processor feeds each candidate representation of the plurality through a second machine learning model to obtain a second output, determines a second mapping, using the second output, of each candidate representation to a known condition, and determines a label for the sound signal based on the second mapping.