-
Publication No.: US20240289563A1
Publication date: 2024-08-29
Application No.: US18589358
Filing date: 2024-02-27
Applicant: GOOGLE LLC
Inventor: Michelle Tadmor Ramanovich , Eliya Nachmani , Alon Levkovitch , Byungha Chun , Yifan Ding , Nadav Bar , Chulayuth Asawaroengchai
CPC classification number: G06F40/58 , G10L15/005 , G10L15/063 , G10L25/18 , G10L2015/0635
Abstract: Training and/or utilizing a Speech-To-Speech Translation (S2ST) system that can be used to generate, based on processing source audio data that captures a spoken utterance in a source language, target audio data that includes a synthetic spoken utterance that is spoken in a target language and that corresponds, both linguistically and para-linguistically, to the spoken utterance in the source language. Implementations that are directed to training the S2ST system utilize an unsupervised approach, with monolingual speech data, in training the S2ST system.
-
Publication No.: US11811968B2
Publication date: 2023-11-07
Application No.: US17266534
Filing date: 2019-01-08
Applicant: Google LLC
Inventor: Shavit Matias , Noam Etzion-Rosenberg , Rebecca Chiou , Benjamin Schlesinger , Brandon Charles Barbello , Ori Kabeli , Usman Abdullah , Eric Erfanian , Michelle Tadmor , Aditi Bhargava , Jan Piotr Jedrzejowicz , Alex Agranovich , Nir Shemy , Paul Dunlop , Yossi Matias , Kyungmin Youn , Nadav Bar
CPC classification number: H04M3/436 , H04M3/42042 , H04M3/42136 , H04M2201/42
Abstract: A computing device is described that accepts a telephone call, from another device, initiated by a caller. Prior to establishing a telephone user interface that receives spoken input from the user and outputs spoken audio from the caller, the computing device executes a call screening service that outputs an audio user interface, to the other device and as part of the telephone call. The audio user interface interrogates the caller for additional information including a purpose of the telephone call, which allows the user to have more context of the telephone call before deciding whether to accept the call or hang up. The computing device outputs a graphical user interface associated with the telephone call. The graphical user interface includes an indication of the additional information obtained via the audio user interface that interrogates the caller.
-
Publication No.: US12249313B2
Publication date: 2025-03-11
Application No.: US17914010
Filing date: 2020-10-27
Applicant: GOOGLE LLC
Inventor: Michael Hassid , Sapir Caduri , Nadav Bar , Danielle Cohen , Benny Schlesinger , Michelle Tadmor Ramanovich
Abstract: A method and system are disclosed for speech synthesis of streaming text. At a text-to-speech (“TTS”) system, a real-time streaming text string having a starting point and an ending point may be received, and a first sub-string comprising a first portion of the text string received from an initial point to a first trigger point may be accumulated. The initial point is no earlier than the starting point and is prior to the first trigger point, and the first trigger point is no further than the ending point. A punctuation model of the TTS system may be applied to the first sub-string to generate a pre-processed first sub-string comprising the first sub-string with added grammatical punctuation as determined by the punctuation model. TTS synthesis processing may be applied to at least the pre-processed first sub-string to generate first synthesized speech, and audio play out of the first synthesized speech produced.
-
Publication No.: US20240031482A1
Publication date: 2024-01-25
Application No.: US18373790
Filing date: 2023-09-27
Applicant: Google LLC
Inventor: Shavit Matias , Noam Etzion-Rosenberg , Rebecca Chiou , Benjamin Schlesinger , Brandon Charles Barbello , Ori Kabeli , Usman Abdullah , Eric Erfanian , Michelle Tadmor , Aditi Bhargava , Jan Piotr Jedrzejowicz , Alex Agranovich , Nir Shemy , Paul Dunlop , Yossi Matias , Kyungmin Youn , Nadav Bar
CPC classification number: H04M3/436 , H04M3/42042 , H04M3/42136 , H04M2201/42
Abstract: A computing device is described that accepts a telephone call, from another device, initiated by a caller. Prior to establishing a telephone user interface that receives spoken input from the user and outputs spoken audio from the caller, the computing device executes a call screening service that outputs an audio user interface, to the other device and as part of the telephone call. The audio user interface interrogates the caller for additional information including a purpose of the telephone call, which allows the user to have more context of the telephone call before deciding whether to accept the call or hang up. The computing device outputs a graphical user interface associated with the telephone call. The graphical user interface includes an indication of the additional information obtained via the audio user interface that interrogates the caller.
-
Publication No.: US11805208B2
Publication date: 2023-10-31
Application No.: US16771630
Filing date: 2019-05-09
Applicant: Google LLC
Inventor: Shavit Matias , Noam Etzion-Rosenberg , Blaise Aguera-Arcas , Benjamin Schlesinger , Brandon Barbello , Ori Kabeli , David Petrou , Yossi Matias , Nadav Bar
CPC classification number: H04M3/527 , G06N20/00 , G06Q10/1095 , G10L15/08 , G10L15/22 , H04M3/42357 , H04M3/42374 , H04M3/4365 , G10L2015/088 , G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving, at a mobile computing device that is associated with a called user, a call from a calling computing device that is associated with a calling user; in response to receiving the call, determining, by the mobile computing device, that data associated with the called user indicates that the called user will not respond to the call; in response to determining that the called user will not respond to the call, inferring, by the mobile computing device, an informational need of the calling user; and automatically providing, from the mobile computing device to the calling computing device, information associated with the called user and that satisfies the inferred informational need of the calling user.
-
Publication No.: US20220148614A1
Publication date: 2022-05-12
Application No.: US17437725
Filing date: 2019-06-03
Applicant: Google LLC
Inventor: Asa Jonas Ivry Block , Elliott Charles Burford , Anthony Felice Tripaldi , Stefanie Bianca Pitaro , Heather Patricia Luipold , Brian Kemler , Kelsie Hope Van Deman , Nadav Bar , Robert James Berry , Daniel Cohen , Michelle Ramanovich , Thomas Weedon Hume , Nicole Kiana Bleuel , Benjamin Schlesinger , Justin Wooyoung Lee , Kevin Rocard , Eric Laurent
Abstract: Techniques and computing devices are described that automatically caption content directly from audio data being output from content sources, unlike other captioning systems which often rely on information contained in audio signals being sent to speakers. The disclosed techniques and computing devices may analyze metadata to determine whether the audio data is suitable for captioning or whether the audio data is some other type of audio data. Responsive to identifying audio data for captioning, the disclosed techniques and computing devices can generate a description of audible sounds interpreted from the audio data, providing for the automatic captioning of content and making audible content accessible to many users who have difficulty hearing or are otherwise unable to listen to content.
-
Publication No.: US20210314440A1
Publication date: 2021-10-07
Application No.: US17266534
Filing date: 2019-01-08
Applicant: Google LLC
Inventor: Shavit Matias , Noam Etzion-Rosenberg , Rebecca Chiou , Benjamin Schlesinger , Brandon Charles Barbello , Ori Kabeli , Usman Abdullah , Eric Erfanian , Michelle Tadmor , Aditi Bhargava , Jan Piotr Jedrzejowicz , Alex Agranovich , Nir Shemy , Paul Dunlop , Yossi Matias , Kyungmin Youn , Nadav Bar
Abstract: A computing device is described that accepts a telephone call, from another device, initiated by a caller. Prior to establishing a telephone user interface that receives spoken input from the user and outputs spoken audio from the caller, the computing device executes a call screening service that outputs an audio user interface, to the other device and as part of the telephone call. The audio user interface interrogates the caller for additional information including a purpose of the telephone call, which allows the user to have more context of the telephone call before deciding whether to accept the call or hang up. The computing device outputs a graphical user interface associated with the telephone call. The graphical user interface includes an indication of the additional information obtained via the audio user interface that interrogates the caller.