-
公开(公告)号:US10127904B2
公开(公告)日:2018-11-13
申请号:US14811939
申请日:2015-07-29
Applicant: Google LLC
Inventor: Kanury Kanishka Rao , Francoise Beaufays , Hasim Sak , Ouais Alsharif
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media for learning pronunciations from acoustic sequences. One method includes receiving an acoustic sequence, the acoustic sequence comprising a respective acoustic feature representation at each of a plurality of time steps; for each of the time steps processing the acoustic feature representation through each of one or more recurrent neural network layers to generate a recurrent output; processing the recurrent output for the time step using a phoneme output layer to generate a phoneme representation for the acoustic feature representation for the time step; and processing the recurrent output for the time step using a grapheme output layer to generate a grapheme representation for the acoustic feature representation for the time step; and extracting, from the phoneme and grapheme representations for the acoustic feature representations at each time step, a respective pronunciation for each of one or more words.
-
公开(公告)号:US20180322877A1
公开(公告)日:2018-11-08
申请号:US16036662
申请日:2018-07-16
Applicant: GOOGLE LLC
Inventor: Brian Strope , Francoise Beaufays , Willaim J. Byrne
Abstract: A method of providing a personal directory service includes receiving, over the Internet, from a user terminal, a query spoken by a user, where the query spoken by the user includes a speech utterance representing a category of persons. The method also includes determining a geographic location of the user terminal, recognizing the category of persons with the speech recognition engine based on the speech utterance representing the category of persons a listing of persons within or near the determined geographic location matching the query to select persons responsive to the query spoken by the user, and sending to the user terminal information related to at least some of the responsive persons.
-