Publication No.: US20240290322A1
Publication Date: 2024-08-29
Application No.: US18587860
Application Date: 2024-02-26
Applicant: Google LLC
Inventor: JAEYOUNG Kim, Han Lu, Soheil Khorram, Anshuman Tripathi, Qian Zhang, Hasim Sak
IPC: G10L15/06
CPC classification number: G10L15/063
Abstract: A method of training an accent recognition model includes receiving a corpus of training utterances spoken across various accents, each training utterance in the corpus including training audio features characterizing the training utterance, and executing a training process to train the accent recognition model on the corpus of training utterances to teach the accent recognition model to learn how to predict accent representations from the training audio features. The accent recognition model includes one or more strided convolution layers, a stack of multi-headed attention layers, and a pooling layer configured to generate a corresponding accent representation.
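Illustrative sketch (not part of the published record): the abstract names three components, namely strided convolution layers, a stack of multi-headed attention layers, and a pooling layer that emits the accent representation. The pure-Python forward pass below shows one way those pieces could fit together. Everything beyond that three-part layout is an assumption made for illustration: the kernel size, stride, ReLU activation, residual connection, mean pooling, layer counts, dimensions, and random weights are all hypothetical, and function names such as `accent_representation` and `init_params` do not come from the patent. No training step is shown.

```python
import math
import random


def matmul(a, b):
    """Multiply matrix a (n x k) by matrix b (k x m); both are lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def rand_matrix(rng, rows, cols, scale=0.1):
    """Random weights or features, uniform in [-scale, scale]."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def strided_conv(frames, weight, kernel=3, stride=2):
    """1-D convolution over time: flatten `kernel` frames, project, apply ReLU.

    Kernel size, stride and ReLU are assumptions, not values from the abstract.
    """
    out = []
    for start in range(0, len(frames) - kernel + 1, stride):
        window = [v for frame in frames[start:start + kernel] for v in frame]
        out.append([max(0.0, v) for v in matmul([window], weight)[0]])
    return out


def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def multi_head_attention(x, wq, wk, wv, wo, num_heads):
    """Scaled dot-product self-attention with an (assumed) residual connection."""
    d_head = len(x[0]) // num_heads
    q, k, v = matmul(x, wq), matmul(x, wk), matmul(x, wv)
    concat = [[] for _ in x]  # per-frame concatenation of head outputs
    for h in range(num_heads):
        lo, hi = h * d_head, (h + 1) * d_head
        for t, q_row in enumerate(q):
            scores = [
                sum(a * b for a, b in zip(q_row[lo:hi], k_row[lo:hi])) / math.sqrt(d_head)
                for k_row in k
            ]
            weights = softmax(scores)
            concat[t].extend(
                sum(w * v_row[j] for w, v_row in zip(weights, v)) for j in range(lo, hi)
            )
    projected = matmul(concat, wo)
    return [[a + b for a, b in zip(x_row, p_row)] for x_row, p_row in zip(x, projected)]


def mean_pool(x):
    """Average over time to get one fixed-size vector (pooling type is assumed)."""
    return [sum(col) / len(x) for col in zip(*x)]


def init_params(rng, feat_dim=8, d_model=4, num_conv=2, num_attn=2, kernel=3):
    """Hypothetical toy sizes; real models would be far larger."""
    conv, in_dim = [], feat_dim
    for _ in range(num_conv):
        conv.append(rand_matrix(rng, kernel * in_dim, d_model))
        in_dim = d_model
    attn = [tuple(rand_matrix(rng, d_model, d_model) for _ in range(4)) for _ in range(num_attn)]
    return {"conv": conv, "attn": attn}


def accent_representation(features, params, num_heads=2):
    """Strided convolutions -> attention stack -> pooling, as listed in the abstract."""
    x = features
    for weight in params["conv"]:
        x = strided_conv(x, weight)
    for layer in params["attn"]:
        x = multi_head_attention(x, *layer, num_heads=num_heads)
    return mean_pool(x)


rng = random.Random(0)
params = init_params(rng)
utterance = rand_matrix(rng, 40, 8, scale=1.0)  # 40 frames of 8-dim audio features
embedding = accent_representation(utterance, params)
print(len(embedding))  # one fixed-size vector per utterance
```

With these toy settings, the two strided convolutions shorten the 40 input frames to 19 and then 9 frames before attention, and pooling collapses the remaining time axis into a single vector whose length equals the assumed model dimension. The sketch omits layer normalization and feed-forward sublayers, which the abstract does not mention.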