-
公开(公告)号:US20240428816A1
公开(公告)日:2024-12-26
申请号:US18797400
申请日:2024-08-07
Applicant: Google LLC
Inventor: Anatoly Efros , Noam Etzion-Rosenberg , Tal Remez , Oran Lang , Inbar Mosseri , Israel Or Weinstein , Benjamin Schlesinger , Michael Rubinstein , Ariel Ephrat , Yukun Zhu , Stella Laurenzo , Amit Pitaru , Yossi Matias
IPC: G10L21/0208 , G10L17/00 , G10L21/0272 , G10L25/57
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: receiving, by a user device, a first indication of one or more first speakers visible in a current view recorded by a camera of the user device, in response, generating a respective isolated speech signal for each of the one or more first speakers that isolates speech of the first speaker in the current view and sending the isolated speech signals for each of the one or more first speakers to a listening device operatively coupled to the user device, receiving, by the user device, a second indication of one or more second speakers visible in the current view recorded by the camera of the user device, and in response generating and sending a respective isolated speech signal for each of the one or more second speakers to the listening device.
-
公开(公告)号:US11811968B2
公开(公告)日:2023-11-07
申请号:US17266534
申请日:2019-01-08
Applicant: Google LLC
Inventor: Shavit Matias , Noam Etzion-Rosenberg , Rebecca Chiou , Benjamin Schlesinger , Brandon Charles Barbello , Ori Kabeli , Usman Abdullah , Eric Erfanian , Michelle Tadmor , Aditi Bhargava , Jan Piotr Jedrzejowicz , Alex Agranovich , Nir Shemy , Paul Dunlop , Yossi Matias , Kyungmin Youn , Nadav Bar
CPC classification number: H04M3/436 , H04M3/42042 , H04M3/42136 , H04M2201/42
Abstract: A computing device is described that accepts, a telephone call, from another device, initiated by a caller. Prior to establishing a telephone user interface that receives spoken input from the user and outputs spoken audio from the caller, the computing device executes a call screening service that outputs an audio user interface, to the other device and as part of the telephone call. The audio user interface interrogates the caller for additional information including a purpose of the telephone call, which allows the user to have more context of the telephone call before deciding whether to accept the call or hang up. The computing device outputs a graphical user interface associated with telephone call. The graphical user interface includes an indication of the additional information obtained via the audio user interface that interrogates the caller.
-
公开(公告)号:US12124863B2
公开(公告)日:2024-10-22
申请号:US17909894
申请日:2020-03-13
Applicant: Google LLC
Inventor: Benjamin Schlesinger , Noam Etzion-Rosenberg , Anatoly Efros , Morgan Venable , Gabriel Taubman , John DiMartile , Nikita Edward Dubrovsky , Sahil Goel , Hen Fitoussi , Sapir Caduri
IPC: G06F9/451
CPC classification number: G06F9/451 , G06F2221/2149
Abstract: Methods, systems, and apparatus for filtering content at the operating system level. In one aspect, a method includes accessing, at a user device, data that includes content items that are to be presented by an application executing on the user device; prior to the content being presented by the application: for each content item, determining, at the user device and by a filtering model, whether the content item is to be presented by the application or filtered, for each content item that is determined to be presented by the application, allowing the application to present the content item, and for each content item that is determined to be filtered, precluding, by the filtering model by a system level filtering operation performed at an operating system level and separate from an application level at which the application is executing, presentation of the content item by the application.
-
公开(公告)号:US20230096274A1
公开(公告)日:2023-03-30
申请号:US17909894
申请日:2020-03-13
Applicant: Google LLC
Inventor: Benjamin Schlesinger , Noam Etzion-Rosenberg , Anatoly Efros , Morgan Venable , Gabriel Taubman , John DiMartile , Nikita Edward Dubrovsky , Sahil Goel , Hen Fitoussi , Sapir Caduri
IPC: G06F9/451
Abstract: Methods, systems, and apparatus for filtering content at the operating system level. In one aspect, a method includes accessing, at a user device, data that includes content items that are to be presented by an application executing on the user device; prior to the content being presented by the application: for each content item, determining, at the user device and by a filtering model, whether the content item is to be presented by the application or filtered, for each content item that is determined to be presented by the application, allowing the application to present the content item, and for each content item that is determined to be filtered, precluding, by the filtering model by a system level filtering operation performed at an operating system level and separate from an application level at which the application is executing, presentation of the content item by the application.
-
公开(公告)号:US12073844B2
公开(公告)日:2024-08-27
申请号:US17601042
申请日:2020-10-01
Applicant: Google LLC
Inventor: Anatoly Efros , Noam Etzion-Rosenberg , Tal Remez , Oran Lang , Inbar Mosseri , Israel Or Weinstein , Benjamin Schlesinger , Michael Rubinstein , Ariel Ephrat , Yukun Zhu , Stella Laurenzo , Amit Pitaru , Yossi Matias
IPC: G10L21/0208 , G10L17/00 , G10L21/0272 , G10L25/57
CPC classification number: G10L21/0208 , G10L17/00 , G10L21/0272 , G10L25/57 , G10L2021/02087
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: receiving, by a user device, a first indication of one or more first speakers visible in a current view recorded by a camera of the user device, in response, generating a respective isolated speech signal for each of the one or more first speakers that isolates speech of the first speaker in the current view and sending the isolated speech signals for each of the one or more first speakers to a listening device operatively coupled to the user device, receiving, by the user device, a second indication of one or more second speakers visible in the current view recorded by the camera of the user device, and in response generating and sending a respective isolated speech signal for each of the one or more second speakers to the listening device.
-
公开(公告)号:US20240031482A1
公开(公告)日:2024-01-25
申请号:US18373790
申请日:2023-09-27
Applicant: Google LLC
Inventor: Shavit Matias , Noam Etzion-Rosenberg , Rebecca Chiou , Benjamin Schlesinger , Brandon Charles Barbello , Ori Kabeli , Usman Abdullah , Eric Erfanian , Michelle Tadmor , Aditi Bhargava , Jan Piotr Jedrzejowicz , Alex Agranovich , Nir Shemy , Paul Dunlop , Yossi Matias , Kyungmin Youn , Nadav Bar
CPC classification number: H04M3/436 , H04M3/42042 , H04M3/42136 , H04M2201/42
Abstract: A computing device is described that accepts, a telephone call, from another device, initiated by a caller. Prior to establishing a telephone user interface that receives spoken input from the user and outputs spoken audio from the caller, the computing device executes a call screening service that outputs an audio user interface, to the other device and as part of the telephone call. The audio user interface interrogates the caller for additional information including a purpose of the telephone call, which allows the user to have more context of the telephone call before deciding whether to accept the call or hang up. The computing device outputs a graphical user interface associated with telephone call. The graphical user interface includes an indication of the additional information obtained via the audio user interface that interrogates the caller.
-
公开(公告)号:US11805208B2
公开(公告)日:2023-10-31
申请号:US16771630
申请日:2019-05-09
Applicant: Google LLC
Inventor: Shavit Matias , Noam Etzion-Rosenberg , Blaise Aguera-Arcas , Benjamin Schlesinger , Brandon Barbello , Ori Kabeli , David Petrou , Yossi Matias , Nadav Bar
CPC classification number: H04M3/527 , G06N20/00 , G06Q10/1095 , G10L15/08 , G10L15/22 , H04M3/42357 , H04M3/42374 , H04M3/4365 , G10L2015/088 , G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for receiving, at a mobile computing device that is associated with a called user, a call from a calling computing device that is associated with a calling user; in response to receiving the call, determining, by the mobile computing device, that data associated with the called user indicates that the called user will not respond to the call; in response to determining that the called user will not respond to the call, inferring, by the mobile computing device, an informational need of the calling user; and automatically providing, from the mobile computing device to the calling computing device, information associated with the called user and that satisfies the inferred informational need of the calling user.
-
公开(公告)号:US20230267942A1
公开(公告)日:2023-08-24
申请号:US17601042
申请日:2020-10-01
Applicant: Google LLC
Inventor: Anatoly Efros , Noam Etzion-Rosenberg , Tal Remez , Oran Lang , Inbar Mosseri , Israel Or Weinstein , Benjamin Schlesinger , Michael Rubinstein , Ariel Ephrat , Yukun Zhu , Stella Laurenzo , Amit Pitaru , Yossi Matias
IPC: G10L21/0208 , G10L25/57
CPC classification number: G10L21/0208 , G10L25/57 , G10L2021/02087
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: receiving, by a user device, a first indication of one or more first speakers visible in a current view recorded by a camera of the user device, in response, generating a respective isolated speech signal for each of the one or more first speakers that isolates speech of the first speaker in the current view and sending the isolated speech signals for each of the one or more first speakers to a listening device operatively coupled to the user device, receiving, by the user device, a second indication of one or more second speakers visible in the current view recorded by the camera of the user device, and in response generating and sending a respective isolated speech signal for each of the one or more second speakers to the listening device.
-
公开(公告)号:US20220148614A1
公开(公告)日:2022-05-12
申请号:US17437725
申请日:2019-06-03
Applicant: Google LLC
Inventor: Asa Jonas Ivry Block , Elliott Charles Burford , Anthony Felice Tripaldi , Stefanie Bianca Pitaro , Heather Patricia Luipold , Brian Kemler , Kelsie Hope Van Deman , Nadav Bar , Robert James Berry , Daniel Cohen , Michelle Ramanovich , Thomas Weedon Hume , Nicole Kiana Bleuel , Benjamin Schlesinger , Justin Wooyoung Lee , Kevin Rocard , Eric Laurent
Abstract: Techniques and computing devices are described that automatically caption content directly from audio data being output from content sources, unlike other captioning systems which often rely on information contained in audio signals being sent to speakers. The disclosed techniques and computing devices may analyze metadata to determine whether the audio data is suitable for captioning or whether the audio data is some other type of audio data. Responsive to identifying audio data for captioning, the disclosed techniques and computing devices can generate a description of audible sounds interpreted from the audio data, providing for the automatic captioning of content and making audible content accessible to many users who have difficulty hearing or are otherise unable to listen to content.
-
公开(公告)号:US20210314440A1
公开(公告)日:2021-10-07
申请号:US17266534
申请日:2019-01-08
Applicant: Google LLC
Inventor: Shavit Matias , Noam Etzion-Rosenberg , Rebecca Chiou , Benjamin Schlesinger , Brandon Charles Barbello , Ori Kabeli , Usman Abdullah , Eric Erfanian , Michelle Tadmor , Aditi Bhargava , Jan Piotr Jedrzejowicz , Alex Agranovich , Nir Shemy , Paul Dunlop , Yossi Matias , Kyungmin Youn , Nadav Bar
Abstract: A computing device is described that accepts, a telephone call, from another device, initiated by a caller. Prior to establishing a telephone user interface that receives spoken input from the user and outputs spoken audio from the caller, the computing device executes a call screening service that outputs an audio user interface, to the other device and as part of the telephone call. The audio user interface interrogates the caller for additional information including a purpose of the telephone call, which allows the user to have more context of the telephone call before deciding whether to accept the call or hang up. The computing device outputs a graphical user interface associated with telephone call. The graphical user interface includes an indication of the additional information obtained via the audio user interface that interrogates the caller.
-
-
-
-
-
-
-
-
-