-
公开(公告)号:US11562152B2
公开(公告)日:2023-01-24
申请号:US17030093
申请日:2020-09-23
Applicant: Google LLC
Inventor: Naveen Arivazhagan , Colin Andrew Cherry , Wolfgang Macherey , Te I , George Foster , Pallavi N Baljekar
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer-storage media, for re-translation for simultaneous, spoken-language machine translation. In some implementations, a stream of audio data comprising speech in a first language is received. A transcription for the speech in the stream of audio data is generated using an automated speech recognizer through a series of updates. A translation of the transcription into a second language is generated using a machine translation module. The translation is generated with translation iterations that translate increasing amounts of the transcription, including re-translating previously portions of the transcription. A series of translation updates are provided to a client device based on the translation iterations.
-
公开(公告)号:US20220092274A1
公开(公告)日:2022-03-24
申请号:US17030093
申请日:2020-09-23
Applicant: Google LLC
Inventor: Naveen Arivazhagan , Colin Andrew Cherry , Wolfgang Macherey , Te I , George Foster , Pallavi N. Baljekar
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer-storage media, for re-translation for simultaneous, spoken-language machine translation. In some implementations, a stream of audio data comprising speech in a first language is received. A transcription for the speech in the stream of audio data is generated using an automated speech recognizer through a series of updates. A translation of the transcription into a second language is generated using a machine translation module. The translation is generated with translation iterations that translate increasing amounts of the transcription, including re-translating previously portions of the transcription. A series of translation updates are provided to a client device based on the translation iterations.
-
公开(公告)号:US20240420680A1
公开(公告)日:2024-12-19
申请号:US18337168
申请日:2023-06-19
Applicant: GOOGLE LLC
Inventor: Te I , Chris Kau , Jeffrey Robert Pitman , Robert Eric Genter , Qi Ge , Wolfgang Macherey , Dirk Ryan Padfield , Naveen Arivazhagan , Colin Cherry
Abstract: Implementations relate to a multimodal translation application that can provide an abridged version of a translation through an audio interface of a computing device, while simultaneously providing a verbatim textual translation at a display interface of the computing device. The application can provide these different versions of the translation in certain circumstances when, for example, the rate of speech of a person speaking to a user is relatively high compared to a preferred rate of speech of the user. For example, a comparison between phonemes of an original language speech and a translated language speech can be performed to determine whether the ratio satisfies a threshold for providing an audible abridged translation. A determination to provide the abridged translation can additionally or alternatively be based on a determined language of the speaker.
-
公开(公告)号:US20240331681A1
公开(公告)日:2024-10-03
申请号:US18128107
申请日:2023-03-29
Applicant: GOOGLE LLC
Inventor: Rakesh Iyer , Jeffrey Robert Pitman , Pendar Yousefi , Te I , Tiruvilwamalai Raman
IPC: G10L13/047 , G06F40/58 , G10L13/033 , G10L13/08 , G10L15/00 , G10L15/16 , G10L15/22 , G10L25/90
CPC classification number: G10L13/047 , G06F40/58 , G10L13/0335 , G10L13/08 , G10L15/005 , G10L15/16 , G10L15/22 , G10L25/90
Abstract: A computer generated voice can automatically be adapted to be similar to a user's voice. Various implementations include processing audio data capturing a first language spoken utterance to identify one or more pitch characteristics. For example, the one or more pitch characteristics can include an estimated frequency range of the given user's voice. Additionally or alternatively, the system can process the audio data capturing the first language spoken utterance and a set of candidate computer generated voices using a computer generated voice selection model to select a candidate computer generated voice. Various implementations can include automatically modifying the selected candidate computer generated voice based on the one or more pitch characteristics to change the frequency range of the modified computer generated voice based on the user's voice.
-
-
-