-
公开(公告)号:US20240296848A1
公开(公告)日:2024-09-05
申请号:US18662590
申请日:2024-05-13
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G10L17/08 , G06F3/16 , G06F21/32 , G10L15/08 , G10L15/22 , G10L17/02 , G10L17/04 , G10L17/10 , G10L17/14 , G10L17/18 , G10L17/22 , G10L17/24
CPC classification number: G10L17/08 , G06F3/167 , G06F21/32 , G10L15/22 , G10L17/02 , G10L17/04 , G10L17/10 , G10L17/14 , G10L17/18 , G10L17/22 , G10L17/24 , G10L2015/088 , G10L2015/227
Abstract: Implementations relate to automatic generation of speaker features for each of one or more particular text-dependent speaker verifications (TD-SVs) for a user. Implementations can generate speaker features for a particular TD-SV using instances of audio data that each capture a corresponding spoken utterance of the user during normal non-enrollment interactions with an automated assistant via one or more respective assistant devices. For example, a portion of an instance of audio data can be used in response to: (a) determining that recognized term(s) for the spoken utterance captured by that the portion correspond to the particular TD-SV; and (b) determining that an authentication measure, for the user and for the spoken utterance, satisfies a threshold. Implementations additionally or alternatively relate to utilization of speaker features, for each of one or more particular TD-SVs for a user, in determining whether to authenticate a spoken utterance for the user.
-
公开(公告)号:US12057119B2
公开(公告)日:2024-08-06
申请号:US18092883
申请日:2023-01-03
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Matthew Sharifi , Ondrej Skopek , Justin Lu , Daniel Valcarce , Kevin Kilgour , Mohamad Hassan Rom , Nicolo D'Ercole , Michael Golikov
CPC classification number: G10L15/22 , G10L15/05 , G10L15/1815 , G10L25/78 , G10L2015/088 , G10L2015/223
Abstract: Some implementations process, using warm word model(s), a stream of audio data to determine a portion of the audio data that corresponds to particular word(s) and/or phrase(s) (e.g., a warm word) associated with an assistant command, process, using an automatic speech recognition (ASR) model, a preamble portion of the audio data (e.g., that precedes the warm word) and/or a postamble portion of the audio data (e.g., that follows the warm word) to generate ASR output, and determine, based on processing the ASR output, whether a user intended the assistant command to be performed. Additional or alternative implementations can process the stream of audio data using a speaker identification (SID) model to determine whether the audio data is sufficient to identify the user that provided a spoken utterance captured in the stream of audio data, and determine if that user is authorized to cause performance of the assistant command.
-
公开(公告)号:US12051408B2
公开(公告)日:2024-07-30
申请号:US16838966
申请日:2020-04-02
Applicant: Google LLC
Inventor: Matthew Sharifi
CPC classification number: G10L15/22 , G06F3/167 , G10L15/02 , G10L15/063 , G10L15/08 , G10L15/26 , G10L15/285 , G10L17/22 , G10L2015/088 , G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for designating certain voice commands as hotwords. The methods, systems, and apparatus include actions of receiving a hotword followed by a voice command. Additional actions include determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, where a voice command that is designated as a hotword is treated as a voice input regardless of whether the voice command is preceded by another hotword. Further actions include, in response to determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, designating the voice command as a hotword.
-
124.
公开(公告)号:US20240242122A1
公开(公告)日:2024-07-18
申请号:US18559728
申请日:2021-06-14
Applicant: Google LLC
Inventor: Matthew Sharifi
Abstract: Systems and methods for multi device learning and inference in an ambient computing environment. In some aspects, the present technology discloses systems and methods for performing cross-device learning in which new devices may be trained based on supervision signals from existing devices in the ambient computing environment. In some aspects, the present technology discloses systems and methods for performing multi-device inference across two or more devices in the ambient computing environment. Likewise, in some aspects, the present technology discloses systems and methods for training models that are robust to the addition or removal of one or more devices from an ambient computing environment.
-
公开(公告)号:US20240240959A1
公开(公告)日:2024-07-18
申请号:US18010160
申请日:2021-09-29
Applicant: Google LLC
Inventor: Matthew Sharifi
IPC: G01C21/36
CPC classification number: G01C21/362
Abstract: Methods, systems, devices, and tangible computer readable media for navigation are provided. The disclosed technology can include receiving a navigation request from a user. There is a determination of whether the navigation request is associated with a user contact of the user. In response to the navigation request being associated with the user contact, a location sharing request can be generated. The location sharing request includes a request for location data including information associated with locations associated with the user contact. The location sharing request can be sent to a remote computing system associated with the user contact. Furthermore, in response to receiving the location data from the remote computing system, output can be generated. The output can include indications associated with navigation by the user to at least one of the locations associated with the user contact.
-
126.
公开(公告)号:US20240211120A1
公开(公告)日:2024-06-27
申请号:US18594926
申请日:2024-03-04
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G06F3/04842 , G06F3/01 , G06F3/0481 , G06F3/16 , G10L15/22
CPC classification number: G06F3/04842 , G06F3/016 , G06F3/0481 , G06F3/167 , G10L15/22 , G10L2015/223
Abstract: Implementations set forth herein relate to an automated assistant that can perform operations to revert various applications to prior states that the applications may have arrived at via certain user inputs. The user can provide a spoken utterance such as, “undo,” in order to cause the automated assistant to identify a particular application that the user may want to affect with the “undo” command. When the particular application is identified, the automated assistant can identify one or more operations recently performed using the particular application. In some implementations, the automated assistant can provide the user with a variety of undo options in response to an “undo” command. For instance, the automated assistant can prompt the user to select one of a first cluster of operations and/or a second cluster of operations to be undone, and each cluster can refer to different operations.
-
公开(公告)号:US20240210194A1
公开(公告)日:2024-06-27
申请号:US17919962
申请日:2022-05-02
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi
CPC classification number: G01C21/3608 , G10L15/22 , G10L2015/223 , G10L2015/225
Abstract: A computing device may implement a method for determining places and routes through natural conversation. The method may include receiving, from a user, a speech input including a search query to initiate a navigation session; and generating a set of navigation search results responsive to the search query. The set of navigation search results include a plurality of destinations or a plurality of routes corresponding to one or more destinations. The method further includes providing an audio request to the user for refining the set of navigation search results, and in response to the audio request, receiving, from the user, a subsequent speech input including a refined search query. The method further includes providing one or more refined navigation search results responsive to the refined search query including a subset of the plurality of destinations or the plurality of routes.
-
公开(公告)号:US20240202469A1
公开(公告)日:2024-06-20
申请号:US18082503
申请日:2022-12-15
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
Abstract: Implementations relate to automatically translating a customized automated assistant from a first language to a new language, so that the automated assistant can interpret spoken utterances in the new language and respond to such spoken utterances in the new language. For example, a customized automated assistant can be configured for use in a first language through the developer(s) providing input(s) that are in the first language, and thereafter automatically translated to a distinct second language for which no developer input is provided. The deployment of the customized automated assistant for utilization with the second language can be selective. For example, it can be selective in that it is only automatically deployed and/or is only suggested for deployment in response to determining that one or more objective criteria, that indicate accuracy and/or robustness of the second language translation of the customized automated assistant, are satisfied.
-
公开(公告)号:US20240202263A1
公开(公告)日:2024-06-20
申请号:US18587455
申请日:2024-02-26
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G06F16/9536 , G06F16/2455 , G06F16/9535 , G06F16/9538
CPC classification number: G06F16/9536 , G06F16/2456 , G06F16/9535 , G06F16/9538
Abstract: Techniques are described herein for collaborative search sessions through an automated assistant. A method includes: receiving, from a first user of a first client device, a first query in a query session; providing, to the first user, a first set of search results; determining, based on at least one term in the first query, that the first query is relevant to a second user of the first client device; providing, to the second user, a selectable option to join the query session; in response to receiving, from the second user, an indication of acceptance of the selectable option, adding the second user to the query session; receiving, from the second user, additional input; generating, based on the additional input received from the second user, a modified set of search results; and providing, to the first user and the second user, the modified set of search results.
-
公开(公告)号:US20240167834A1
公开(公告)日:2024-05-23
申请号:US17784221
申请日:2021-05-07
Applicant: Google LLC
Inventor: Matthew Sharifi
IPC: G01C21/36 , G01C21/34 , H04L51/216
CPC classification number: G01C21/362 , G01C21/343 , H04L51/216
Abstract: Methods, systems, devices, and tangible non-transitory computer readable media for using incoming communications to generate suggestions for navigation. The disclosed technology can include accessing route data that includes information associated with navigation from a starting location to a destination. Based on the route data, one or more routes from the starting location to the destination can be determined. Message data including one or more messages to a user can be accessed. Based on the message data and one or more machine-learned models, at least one entity and objectives that are associated with the one or more messages can be determined. Based on the one or more routes, the at least one entity, and the objectives, suggestions associated with the one or more messages can be determined. Furthermore, output including indications associated with the suggestions directed to the user can be generated via a user interface.
-
-
-
-
-
-
-
-
-