Patent search ap:("GOOGLE LLC") AND inv:"Te I" Page 1

1.

发明授权
Re-translation for simultaneous, spoken-language machine translation 有权

公开(公告)号：US11562152B2

公开(公告)日：2023-01-24

申请号：US17030093

申请日：2020-09-23

Applicant: Google LLC

Inventor： Naveen Arivazhagan , Colin Andrew Cherry , Wolfgang Macherey , Te I , George Foster , Pallavi N Baljekar

IPC: G06F40/58 , G10L15/26 , G06F40/47 , G10L15/16 , G10L15/19

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer-storage media, for re-translation for simultaneous, spoken-language machine translation. In some implementations, a stream of audio data comprising speech in a first language is received. A transcription for the speech in the stream of audio data is generated using an automated speech recognizer through a series of updates. A translation of the transcription into a second language is generated using a machine translation module. The translation is generated with translation iterations that translate increasing amounts of the transcription, including re-translating previously portions of the transcription. A series of translation updates are provided to a client device based on the translation iterations.

2.

发明申请
RE-TRANSLATION FOR SIMULTANEOUS, SPOKEN-LANGUAGE MACHINE TRANSLATION 有权

公开(公告)号：US20220092274A1

公开(公告)日：2022-03-24

申请号：US17030093

申请日：2020-09-23

Applicant: Google LLC

Inventor： Naveen Arivazhagan , Colin Andrew Cherry , Wolfgang Macherey , Te I , George Foster , Pallavi N. Baljekar

IPC: G06F40/58 , G10L15/26 , G10L15/19 , G10L15/16 , G06F40/47

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer-storage media, for re-translation for simultaneous, spoken-language machine translation. In some implementations, a stream of audio data comprising speech in a first language is received. A transcription for the speech in the stream of audio data is generated using an automated speech recognizer through a series of updates. A translation of the transcription into a second language is generated using a machine translation module. The translation is generated with translation iterations that translate increasing amounts of the transcription, including re-translating previously portions of the transcription. A series of translation updates are provided to a client device based on the translation iterations.

3.

发明申请
SIMULTANEOUS AND MULTIMODAL RENDERING OF ABRIDGED AND NON-ABRIDGED TRANSLATIONS 有权

公开(公告)号：US20240420680A1

公开(公告)日：2024-12-19

申请号：US18337168

申请日：2023-06-19

Applicant: GOOGLE LLC

Inventor： Te I , Chris Kau , Jeffrey Robert Pitman , Robert Eric Genter , Qi Ge , Wolfgang Macherey , Dirk Ryan Padfield , Naveen Arivazhagan , Colin Cherry

IPC: G10L13/08 , G10L13/10 , G10L15/00 , G10L15/26

Abstract: Implementations relate to a multimodal translation application that can provide an abridged version of a translation through an audio interface of a computing device, while simultaneously providing a verbatim textual translation at a display interface of the computing device. The application can provide these different versions of the translation in certain circumstances when, for example, the rate of speech of a person speaking to a user is relatively high compared to a preferred rate of speech of the user. For example, a comparison between phonemes of an original language speech and a translated language speech can be performed to determine whether the ratio satisfies a threshold for providing an audible abridged translation. A determination to provide the abridged translation can additionally or alternatively be based on a determined language of the speaker.

4.

发明公开
AUTOMATIC ADAPTATION OF THE SYNTHESIZED SPEECH OUTPUT OF A TRANSLATION APPLICATION 审中-公开

公开(公告)号：US20240331681A1

公开(公告)日：2024-10-03

申请号：US18128107

申请日：2023-03-29

Applicant: GOOGLE LLC

Inventor： Rakesh Iyer , Jeffrey Robert Pitman , Pendar Yousefi , Te I , Tiruvilwamalai Raman

IPC: G10L13/047 , G06F40/58 , G10L13/033 , G10L13/08 , G10L15/00 , G10L15/16 , G10L15/22 , G10L25/90

CPC classification number: G10L13/047 , G06F40/58 , G10L13/0335 , G10L13/08 , G10L15/005 , G10L15/16 , G10L15/22 , G10L25/90

Abstract: A computer generated voice can automatically be adapted to be similar to a user's voice. Various implementations include processing audio data capturing a first language spoken utterance to identify one or more pitch characteristics. For example, the one or more pitch characteristics can include an estimated frequency range of the given user's voice. Additionally or alternatively, the system can process the audio data capturing the first language spoken utterance and a set of candidate computer generated voices using a computer generated voice selection model to select a candidate computer generated voice. Various implementations can include automatically modifying the selected candidate computer generated voice based on the one or more pitch characteristics to change the frequency range of the modified computer generated voice based on the user's voice.

Patent Agency Ranking