-
171.
公开(公告)号:US20230377580A1
公开(公告)日:2023-11-23
申请号:US18361468
申请日:2023-07-28
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
CPC classification number: G10L15/30 , G10L15/005 , G10L15/063 , G10L15/22 , G10L2015/223
Abstract: Implementations are directed to dynamically adapting which assistant on-device model(s) are locally stored at assistant devices of an assistant device group and/or dynamically adapting the assistant processing role(s) of the assistant device(s) of the assistant device group. In some of those implementations, the corresponding on-device model(s) and/or corresponding processing role(s), for each of the assistant devices of the group, is determined based on collectively considering individual processing capabilities of the assistant devices of the group. Implementations are additionally or alternatively directed to cooperatively utilizing assistant devices of a group, and their associated post-adaptation on-device model(s) and/or post-adaptation processing role(s), in cooperatively processing assistant requests that are directed to any one of the assistant devices of the group.
-
公开(公告)号:US20230342384A1
公开(公告)日:2023-10-26
申请号:US18215032
申请日:2023-06-27
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Pedro Gonnet Anders
IPC: G06F16/33 , G06F16/93 , G06F16/332 , G06F16/338 , G06F40/40 , H04L51/02 , G06V30/418
CPC classification number: G06F16/3344 , G06F16/93 , G06F16/3347 , G06F16/3329 , G06F16/338 , G06F40/40 , H04L51/02 , G06V30/418 , G06F3/0482
Abstract: Techniques are described herein for determining an information gain score for one or more documents of interest to the user and present information from the documents based on the information gain score. An information gain score for a given document is indicative of additional information that is included in the document beyond information contained in documents that were previously viewed by the user. In some implementations, the information gain score may be determined for one or more documents by applying data from the documents across a machine learning model to generate an information gain score. Based on the information gain scores of a set of documents, the documents can be provided to the user in a manner that reflects the likely information gain that can be attained by the user if the user were to view the documents.
-
公开(公告)号:US11798530B2
公开(公告)日:2023-10-24
申请号:US17085926
申请日:2020-10-30
Applicant: Google LLC
Inventor: Matthew Sharifi , Victor Carbune
CPC classification number: G10L15/01 , G01S3/8006 , G10L15/08 , G10L15/32 , H04R29/006 , G10L2015/088
Abstract: Implementations can detect respective audio data that captures an acoustic event at multiple assistant devices in an ecosystem that includes a plurality of assistant devices, process the respective audio data locally at each of the multiple assistant devices to generate respective measures that are associated with the acoustic event using respective event detection models, process the respective measures to determine whether the detected acoustic event is an actual acoustic event, and cause an action associated with the actional acoustic event to be performed in response to determining that the detected acoustic event is the actual acoustic event. In some implementations, the multiple assistant devices that detected the respective audio data are anticipated to detect the respective audio data that captures the actual acoustic event based on a plurality of historical acoustic events being detected at each of the multiple assistant devices.
-
174.
公开(公告)号:US11790005B2
公开(公告)日:2023-10-17
申请号:US17107286
申请日:2020-11-30
Applicant: Google LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G06F16/903 , G06F21/62
CPC classification number: G06F16/90335 , G06F21/6245
Abstract: Implementations are directed to receiving a search query from a user, obtaining environmental signal(s) associated with an environment in which the user is located when the search query is received, processing the environmental signal(s) to generate a privacy measure associated with submission of the search query, obtaining additional environmental signal(s) associated with the environment in which the user is located when user input directed to a search interface is received, processing the additional environmental signal(s) to generate an additional privacy measure associated with the user input, selecting, from a superset of historical search queries of the user, a subset of the historical search queries based on at least the privacy measure and the additional privacy measure, and causing the subset of the historical search queries to be presented to the user in response to receiving the user input directed to the search interface.
-
公开(公告)号:US11765452B2
公开(公告)日:2023-09-19
申请号:US18097150
申请日:2023-01-13
Applicant: GOOGLE LLC
Inventor: Felix Weissenberger , Balint Miklos , Victor Carbune , Matthew Sharifi , Domenico Carbotta , Ray Chen , Kevin Fu , Bogdan Prisacari , Fo Lee , Mucun Lu , Neha Garg , Jacopo Sannazzaro Natta , Barbara Poblocka , Jae Seo , Matthew Miao , Thomas Qian , Luv Kothari
IPC: H04N23/60 , G06N20/00 , G10L15/22 , G10L25/51 , H04N5/92 , H04N23/61 , H04N23/62 , H04N23/66 , H04N23/80 , G10L15/18
CPC classification number: H04N23/64 , G06N20/00 , G10L15/22 , G10L25/51 , H04N5/9201 , H04N23/61 , H04N23/62 , H04N23/66 , H04N23/80 , G10L15/1822 , G10L2015/223
Abstract: Implementations set forth herein relate to an automated assistant that can control a camera according to one or more conditions specified by a user. A condition can be satisfied when, for example, the automated assistant detects a particular environment feature is apparent. In this way, the user can rely on the automated assistant to identify and capture certain moments without necessarily requiring the user to constantly monitor a viewing window of the camera. In some implementations, a condition for the automated assistant to capture media data can be based on application data and/or other contextual data that is associated with the automated assistant. For instance, a relationship between content in a camera viewing window and other content of an application interface can be a condition upon which the automated assistant captures certain media data using a camera.
-
176.
公开(公告)号:US11756544B2
公开(公告)日:2023-09-12
申请号:US17122875
申请日:2020-12-15
Applicant: Google LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G10L15/22 , G10L15/30 , G10L15/18 , G06F3/0488 , G06F3/16 , G06F3/0481 , G10L25/51
CPC classification number: G10L15/22 , G06F3/0481 , G06F3/0488 , G06F3/165 , G10L15/1815 , G10L15/30 , G10L25/51 , G10L2015/223 , G10L2015/228
Abstract: Implementations described herein receive audio data that captures a spoken utterance, generate, based on processing the audio data, a recognition that corresponds to the spoken utterance, and determine, based on processing the recognition, that the spoken utterance is ambiguous (i.e., is interpretable as requesting performance of a first particular action exclusively and is also interpretable a second particular action exclusively). In response to determining that the spoken utterance is ambiguous, implementations determine to provide an enhanced clarification prompt that renders output that is in addition to natural language. The enhanced clarification prompt solicits further user interface input for disambiguating between the first particular action and the second particular action. Determining to provide the enhanced clarification prompt includes a current or prior determination to provide the enhanced clarification prompt instead of a natural language (NL) only clarification prompt that is restricted to rendering natural language.
-
公开(公告)号:US11756537B2
公开(公告)日:2023-09-12
申请号:US17962636
申请日:2022-10-10
Applicant: GOOGLE LLC
Inventor: Pedro Gonnet Anders , Victor Carbune , Daniel Keysers , Thomas Deselaers , Sandro Feuz
CPC classification number: G10L15/1815 , G10L15/19 , G10L15/22 , G10L2015/223
Abstract: Techniques are described herein for enabling an automated assistant to adjust its behavior depending on a detected age range and/or “vocabulary level” of a user who is engaging with the automated assistant. In various implementations, data indicative of a user's utterance may be used to estimate one or more of the user's age range and/or vocabulary level. The estimated age range/vocabulary level may be used to influence various aspects of a data processing pipeline employed by an automated assistant. In various implementations, aspects of the data processing pipeline that may be influenced by the user's age range/vocabulary level may include one or more of automated assistant invocation, speech-to-text (“STT”) processing, intent matching, intent resolution (or fulfillment), natural language generation, and/or text-to-speech (“TTS”) processing. In some implementations, one or more tolerance thresholds associated with one or more of these aspects, such as grammatical tolerances, vocabularic tolerances, etc., may be adjusted.
-
公开(公告)号:US11741944B2
公开(公告)日:2023-08-29
申请号:US17103878
申请日:2020-11-24
Applicant: Google LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G10L15/00 , G10L15/06 , G10L15/187 , G10L15/20 , G10L15/22 , G10L15/30 , G10L21/0232 , G10L15/08
CPC classification number: G10L15/063 , G10L15/187 , G10L15/20 , G10L15/22 , G10L15/30 , G10L21/0232 , G10L2015/088
Abstract: A method of training a speech model includes receiving, at a voice-enabled device, a fixed set of training utterances where each training utterance in the fixed set of training utterances includes a transcription paired with a speech representation of the corresponding training utterance. The method also includes sampling noisy audio data from an environment of the voice-enabled device. For each training utterance in the fixed set of training utterances, the method further includes augmenting, using the noisy audio data sampled from the environment of the voice-enabled device, the speech representation of the corresponding training utterance to generate noisy audio samples and pairing each of the noisy audio samples with the corresponding transcription of the corresponding training utterance. The method additionally includes training a speech model on the noisy audio samples generated for each speech representation in the fixed set of training utterances.
-
公开(公告)号:US20230215422A1
公开(公告)日:2023-07-06
申请号:US17568920
申请日:2022-01-05
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Matthew Sharifi
CPC classification number: G10L15/08 , G10L15/22 , G06N20/00 , G10L2021/02163
Abstract: Implementations described herein include detecting a stream of audio data that captures a spoken utterance of the user and that captures ambient noise occurring within a threshold time period of the spoken utterance being spoken by the user. Implementations further include processing a portion of the audio data that includes the ambient noise to determine ambient noise classification(s), processing a portion of the audio data that includes the spoken utterance to generate a transcription, processing both the transcription and the ambient noise classification(s) with a machine learning model to generate a user intent and parameter(s) for the user intent, and performing one or more automated assistant actions based on the user intent and using the parameter(s).
-
公开(公告)号:US20230195815A1
公开(公告)日:2023-06-22
申请号:US17554608
申请日:2021-12-17
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G06F16/9536 , G10L17/22 , G06F16/9538 , G06F16/2455 , G06F16/9535
CPC classification number: G06F16/9536 , G10L17/22 , G06F16/9538 , G06F16/2456 , G06F16/9535
Abstract: Techniques are described herein for collaborative search sessions through an automated assistant. A method includes: receiving, from a first user of a first client device, a first query in a query session; providing, to the first user, a first set of search results; determining, based on at least one term in the first query, that the first query is relevant to a second user of the first client device; providing, to the second user, a selectable option to join the query session; in response to receiving, from the second user, an indication of acceptance of the selectable option, adding the second user to the query session; receiving, from the second user, additional input; generating, based on the additional input received from the second user, a modified set of search results; and providing, to the first user and the second user, the modified set of search results.
-
-
-
-
-
-
-
-
-