-
公开(公告)号:US20230229390A1
公开(公告)日:2023-07-20
申请号:US18189181
申请日:2023-03-23
Applicant: Google LLC
Inventor: Jan Althaus , Matthew Sharifi
IPC: G06F3/16 , G06F3/0481 , G06F3/0484 , G10L15/08 , G10L15/22
CPC classification number: G06F3/167 , G06F3/0481 , G06F3/0484 , G10L15/08 , G10L15/22 , G10L2015/088 , G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for implementing hotword recognition and passive assistance are disclosed. In one aspect, a method includes the actions of receiving, by a computing device that is operating in a low-power mode and that includes a display that displays a graphical interface while the computing device is in the low-power mode and that is configured to exit the low-power mode in response to detecting a first hotword, audio data corresponding to an utterance. The method further includes determining that the audio data includes a second, different hotword. The method further includes obtaining a transcription of the utterance by performing speech recognition on the audio data. The method further includes generating an additional user interface. The method further includes providing, for output on the display, the additional graphical interface.
-
公开(公告)号:US20220335930A1
公开(公告)日:2022-10-20
申请号:US17859068
申请日:2022-07-07
Applicant: Google LLC
Inventor: Matthew Sharifi , Tom Hume , Mohamad Hassan Mohamad Rom , Jan Althaus , Diego Melendo Casado
Abstract: Techniques are described herein for selectively processing a user's utterances captured prior to and after an event that invokes an automated assistant to determine the user's intent and/or any parameters required for resolving the user's intent. In various implementations, respective measures of fitness for triggering responsive action by the automated assistant may be determined for pre-event and a post-event input streams. Based on the respective measures of fitness, one or both of the pre-event input stream or post-event input stream may be selected and used to cause the automated assistant to perform one or more responsive actions.
-
公开(公告)号:US12249321B2
公开(公告)日:2025-03-11
申请号:US17859068
申请日:2022-07-07
Applicant: Google LLC
Inventor: Matthew Sharifi , Tom Hume , Mohamad Hassan Mohamad Rom , Jan Althaus , Diego Melendo Casado
Abstract: Techniques are described herein for selectively processing a user's utterances captured prior to and after an event that invokes an automated assistant to determine the user's intent and/or any parameters required for resolving the user's intent. In various implementations, respective measures of fitness for triggering responsive action by the automated assistant may be determined for pre-event and a post-event input streams. Based on the respective measures of fitness, one or both of the pre-event input stream or post-event input stream may be selected and used to cause the automated assistant to perform one or more responsive actions.
-
公开(公告)号:US20190102144A1
公开(公告)日:2019-04-04
申请号:US16148401
申请日:2018-10-01
Applicant: Google LLC
Inventor: Dominik Roblek , Blaise Aguera-Arcas , Tom Hume , Marvin Ritter , Brandon Barbello , Kevin Kilgour , Mihajlo Velimirovic , Christopher Walter George Thornton , Gabriel Taubman , James David Lyon , Jan Althaus , Katsiaryna Naliuka , Julian Odell , Matthew Sharifi , Beat Gfeller
Abstract: In general, the subject matter described in this disclosure can be embodied in methods, systems, and program products for indicating a reference song. A computing device stores reference song characterization data that identifies a plurality of audio characteristics for each reference song in a plurality of reference songs. The computing device receives digital audio data that represents audio recorded by a microphone, converts the digital audio data from time-domain format into frequency-domain format, and uses the digital audio data in the frequency-domain format in a music-characterization process. In response to determining that characterization values for the digital audio data are most relevant to characterization values for a particular reference song, the computing device outputs an indication of the particular reference song.
-
公开(公告)号:US20200050427A1
公开(公告)日:2020-02-13
申请号:US16536831
申请日:2019-08-09
Applicant: Google LLC
Inventor: Jan Althaus , Matthew Sharifi
IPC: G06F3/16 , G06F3/0481 , G10L15/22 , G10L15/08 , G06F3/0484
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for implementing hotword recognition and passive assistance are disclosed. In one aspect, a method includes the actions of receiving, by a computing device that is operating in a low-power mode and that includes a display that displays a graphical interface while the computing device is in the low-power mode and that is configured to exit the low-power mode in response to detecting a first hotword, audio data corresponding to an utterance. The method further includes determining that the audio data includes a second, different hotword. The method further includes obtaining a transcription of the utterance by performing speech recognition on the audio data. The method further includes generating an additional user interface. The method further includes providing, for output on the display, the additional graphical interface.
-
公开(公告)号:US11423885B2
公开(公告)日:2022-08-23
申请号:US16349871
申请日:2019-02-20
Applicant: Google LLC
Inventor: Matthew Sharifi , Tom Hume , Mohamad Hassan Mohamad Rom , Jan Althaus , Diego Melendo Casado
Abstract: Techniques are described herein for selectively processing a user's utterances captured prior to and after an event that invokes an automated assistant to determine the user's intent and/or any parameters required for resolving the user's intent. In various implementations, respective measures of fitness for triggering responsive action by the automated assistant may be determined for pre-event and a post-event input streams. Based on the respective measures of fitness, one or both of the pre-event input stream or post-event input stream may be selected and used to cause the automated assistant to perform one or more responsive actions.
-
公开(公告)号:US20210065693A1
公开(公告)日:2021-03-04
申请号:US16349871
申请日:2019-02-20
Applicant: Google LLC
Inventor: Matthew Sharifi , Tom Hume , Mohamad Hassan Mohamad Rom , Jan Althaus , Diego Melendo Casado
Abstract: Techniques are described herein for selectively processing a user's utterances captured prior to and after an event that invokes an automated assistant to determine the user's intent and/or any parameters required for resolving the user's intent. In various implementations, respective measures of fitness for triggering responsive action by the automated assistant may be determined for pre-event and a post-event input streams. Based on the respective measures of fitness, one or both of the pre-event input stream or post-event input stream may be selected and used to cause the automated assistant to perform one or more responsive actions.
-
公开(公告)号:US20190079724A1
公开(公告)日:2019-03-14
申请号:US16114494
申请日:2018-08-28
Applicant: Google LLC
Inventor: Sandro Feuz , Sebastian Millius , Jan Althaus
Abstract: Techniques are described related to improved intercom-style communication using a plurality of computing devices distributed about an environment. In various implementations, voice input may be received, e.g., at a microphone of a first computing device of multiple computing devices, from a first user. The voice input may be analyzed and, based on the analyzing, it may be determined that the first user intends to convey a message to a second user. A location of the second user relative to the multiple computing devices may be determined, so that, based on the location of the second user, a second computing device may be selected from the multiple computing devices that is capable of providing audio or visual output that is perceptible to the second user. The second computing device may then be operated to provide audio or visual output that conveys the message to the second user.
-
-
-
-
-
-
-