-
公开(公告)号:US12230252B2
公开(公告)日:2025-02-18
申请号:US17282135
申请日:2020-06-09
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
Abstract: Generating audio tracks is provided. The system selects a digital component object having a visual output format. The system determines to convert the digital component object into an audio output format. The system generates text for the digital component object. The system selects, based on context of the digital component object, a digital voice to render the text. The system constructs a baseline audio track of the digital component object with the text rendered by the digital voice. The system generates, based on the digital component object, non-spoken audio cues. The system combines the non-spoken audio cues with the baseline audio form of the digital component object to generate an audio track of the digital component object. The system provides the audio track of the digital component object to the computing device for output via a speaker of the computing device.
-
公开(公告)号:US12209875B2
公开(公告)日:2025-01-28
申请号:US17781292
申请日:2022-03-09
Applicant: Google LLC
Inventor: Matthew Sharifi
IPC: G01C21/34
Abstract: A computing system and method that can be used for a mapping system that can recommend paths for navigational routing to a primary user. In particular, a primary user may be interested in navigational routes that secondary users, who may be the same or different from the primary user, have taken in the past. Even more particularly, the mapping systems described herein can provide improved user navigational services by leveraging the insight that users who have previously been in a similar situation (e.g., on vacation, in icy conditions, etc.) may wish to take the same route as they did the last time. Alternatively, users may desire to take routes based on routes which were taken by other users who are connected (e.g., via social media, address books, etc.).
-
公开(公告)号:US20250028770A1
公开(公告)日:2025-01-23
申请号:US18908294
申请日:2024-10-07
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Matthew Sharifi
IPC: G06F16/9537
Abstract: Implementations described herein relate to pairing a location-based automated assistant with a user device. The user device can include, for example, a headphones apparatus and/or a device that is paired with the headphones apparatus. The user device provides an indication that it is present at a location that is associated with a location-based automated assistant. A trust measure is determined that is indicative of trust between the user device and the location-based automated assistant. User information is provided by the user device to the location-based automated assistant. The location-based automated assistant determines response data to provide, via one or more speakers associated with the user device, that is specific to the location and further based on the user information.
-
公开(公告)号:US12205591B2
公开(公告)日:2025-01-21
申请号:US17938659
申请日:2022-10-06
Applicant: Google LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G10L15/22 , G06F3/16 , G06F16/638 , G10L17/22 , G10L17/00
Abstract: A method includes detecting multiple users, receiving a first query issued by a first user, the first query including a command for a digital assistant to perform a first action, and enabling a round robin mode to control performance of actions commanded by queries. The method also includes, while performing the first action, receiving audio data corresponding to a second query including a command to perform a second action, performing speaker identification on the audio data, determining that the second query was spoken by the first user, preventing performing the second action, and prompting at least another user to issue a query. The method further includes receiving a third query issued by a second user, the third query including a command for the digital assistant to perform a third action, and when the digital assistant completes performing the first action, executing performance of the third action.
-
公开(公告)号:US12197818B2
公开(公告)日:2025-01-14
申请号:US18241707
申请日:2023-09-01
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G06F3/16 , G06F3/0488 , G06F18/214 , G06F18/22 , G06N20/00 , G10L15/22
Abstract: Implementations relate to an automated assistant that can automate repeatedly performed procedures. The automation can involve communicating with different users, organizations, and/or other automated assistants. The automated assistant, with prior permission from respective user(s), can detect repeated performance of a particular series of manually initiated computational actions. Based on this determination, the automated assistant can determine automated assistant computational action(s) that can be performed by the automated assistant in order to reduce latency in performing a procedure, reduce quantity and/or size of transmissions in performing the procedure, and/or reduce an amount of client device resources required for performing the procedure. Such actions can include communicating with an additional automated assistant that may be associated with another user and/or organization. In these and other manners, manually initiated computational actions that include electronic communications amongst users can be converted to backend operations amongst instances of automated assistants to achieve technical benefits.
-
226.
公开(公告)号:US20250014574A1
公开(公告)日:2025-01-09
申请号:US18217930
申请日:2023-07-03
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
Abstract: Implementations set forth herein relate to an automated assistant that can jointly process spoken utterances that can include at least one corrective utterance and one utterance that may have been misinterpreted by the automated assistant. In this way, the automated assistant does not need to exclusively rely on the corrective utterance to determine a request of a user but, rather, can utilize data capturing any prior spoken utterance to more accurately determine the request. The automated assistant can process utterances jointly after determining that they are acoustically similar and/or received within a threshold duration of time. The joint processing can involve generating a single input, or multiple inputs, to characterize the series of utterances and processing the single input, or multiple inputs, with one or more neural network models and/or machine learning models. The automated assistant can then proactively correct any previous responses to misinterpreted utterances.
-
227.
公开(公告)号:US20250012587A1
公开(公告)日:2025-01-09
申请号:US18275745
申请日:2022-09-29
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi
Abstract: A computing device may implement a method for providing route information regarding a completed or ongoing trip by a user without the user having previously initiated a navigation session. The method may include receiving a query regarding a previous or ongoing trip by a user prior to the user initiating a navigation session; determining an origin for the previous or ongoing trip; obtaining route information for the previous or ongoing trip; generating one or more route attributes associated with the query based at least on the origin for the previous or ongoing trip and the route information for the previous or ongoing trip; and providing a response to the query based at least on the one or more route attributes.
-
228.
公开(公告)号:US12175981B2
公开(公告)日:2024-12-24
申请号:US18361468
申请日:2023-07-28
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
Abstract: Implementations are directed to dynamically adapting which assistant on-device model(s) are locally stored at assistant devices of an assistant device group and/or dynamically adapting the assistant processing role(s) of the assistant device(s) of the assistant device group. In some of those implementations, the corresponding on-device model(s) and/or corresponding processing role(s), for each of the assistant devices of the group, is determined based on collectively considering individual processing capabilities of the assistant devices of the group. Implementations are additionally or alternatively directed to cooperatively utilizing assistant devices of a group, and their associated post-adaptation on-device model(s) and/or post-adaptation processing role(s), in cooperatively processing assistant requests that are directed to any one of the assistant devices of the group.
-
公开(公告)号:US20240380970A1
公开(公告)日:2024-11-14
申请号:US18784226
申请日:2024-07-25
Applicant: GOOGLE LLC
Inventor: Felix Weissenberger , Balint Miklos , Victor Carbune , Matthew Sharifi , Domenico Carbotta , Ray Chen , Kevin Fu , Bogdan Prisacari , Fo Lee , Mucun Lu , Neha Garg , Jacopo Sannazzaro Natta , Barbara Poblocka , Jae Seo , Matthew Miao , Thomas Qian , Luv Kothari
IPC: H04N23/60 , G06N20/00 , G10L15/18 , G10L15/22 , G10L25/51 , H04N5/92 , H04N23/61 , H04N23/62 , H04N23/66 , H04N23/80
Abstract: Implementations set forth herein relate to an automated assistant that can control a camera according to one or more conditions specified by a user. A condition can be satisfied when, for example, the automated assistant detects a particular environment feature is apparent. In this way, the user can rely on the automated assistant to identify and capture certain moments without necessarily requiring the user to constantly monitor a viewing window of the camera. In some implementations, a condition for the automated assistant to capture media data can be based on application data and/or other contextual data that is associated with the automated assistant. For instance, a relationship between content in a camera viewing window and other content of an application interface can be a condition upon which the automated assistant captures certain media data using a camera.
-
公开(公告)号:US20240371377A1
公开(公告)日:2024-11-07
申请号:US18776117
申请日:2024-07-17
Applicant: Google LLC
Inventor: Matthew Sharifi
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for designating certain voice commands as hotwords. The methods, systems, and apparatus include actions of receiving a hotword followed by a voice command. Additional actions include determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, where a voice command that is designated as a hotword is treated as a voice input regardless of whether the voice command is preceded by another hotword. Further actions include, in response to determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, designating the voice command as a hotword.
-
-
-
-
-
-
-
-
-