Publication No.: US20210397610A1
Publication Date: 2021-12-23
Application No.: US17350294
Filing Date: 2021-06-17
Applicant: SoundHound, Inc.
Inventor: Pranav SINGH, Yilun ZHANG, Keyvan MOHAJER, Mohammadreza FAZELI
IPC: G06F16/242, G06N3/04, G06N3/08
Abstract: A machine learning system for a digital assistant is described, together with a method of training such a system. The machine learning system is based on an encoder-decoder sequence-to-sequence neural network architecture trained to map input sequence data to output sequence data, where the input sequence data relates to an initial query and the output sequence data represents a canonical data representation for the query. The method of training involves generating a training dataset for the machine learning system. The method involves clustering vector representations of query data samples to generate canonical-query, original-query pairs for use in training the machine learning system.
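
The dataset-generation step in the abstract (cluster vector representations of queries, then emit canonical-query, original-query pairs) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the bag-of-words `embed` function stands in for whatever vector representation is actually used, the greedy threshold clustering in `cluster_queries` stands in for the unspecified clustering method, and the "shortest query in the cluster is canonical" rule in `make_pairs` is hypothetical.

```python
from collections import Counter
from math import sqrt


def embed(query):
    """Toy bag-of-words vector (assumption: a stand-in for a learned embedding)."""
    return Counter(query.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse token-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def cluster_queries(queries, threshold=0.5):
    """Greedy clustering (assumption): a query joins the first cluster whose
    seed query (the cluster's first member) is at least `threshold` similar."""
    clusters = []
    for q in queries:
        v = embed(q)
        for c in clusters:
            if cosine(v, embed(c[0])) >= threshold:
                c.append(q)
                break
        else:
            # No existing cluster was close enough, so start a new one.
            clusters.append([q])
    return clusters


def make_pairs(clusters):
    """Emit (canonical, original) pairs per cluster.
    Hypothetical rule: the query with the fewest tokens is the canonical one."""
    pairs = []
    for c in clusters:
        canonical = min(c, key=lambda q: (len(q.split()), q))
        pairs.extend((canonical, q) for q in c if q != canonical)
    return pairs


queries = [
    "weather in paris",
    "what is the weather in paris",
    "tell me the weather in paris today",
    "play some jazz",
    "play some jazz music please",
]
clusters = cluster_queries(queries)
pairs = make_pairs(clusters)
for canonical, original in pairs:
    print(f"{original!r} -> {canonical!r}")
```

Pairs of this shape could then serve as training examples for a sequence-to-sequence model, with the original query as the input sequence and the canonical query as the target sequence.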