Generation of interactive audio tracks from visual content

    Publication No.: US12230252B2

    Publication Date: 2025-02-18

    Application No.: US17282135

    Filing Date: 2020-06-09

    Applicant: GOOGLE LLC

    Abstract: Generating audio tracks is provided. The system selects a digital component object having a visual output format. The system determines to convert the digital component object into an audio output format. The system generates text for the digital component object. The system selects, based on context of the digital component object, a digital voice to render the text. The system constructs a baseline audio track of the digital component object with the text rendered by the digital voice. The system generates, based on the digital component object, non-spoken audio cues. The system combines the non-spoken audio cues with the baseline audio form of the digital component object to generate an audio track of the digital component object. The system provides the audio track of the digital component object to the computing device for output via a speaker of the computing device.

    Providing navigational instructions based on contextual data

    Publication No.: US12209875B2

    Publication Date: 2025-01-28

    Application No.: US17781292

    Filing Date: 2022-03-09

    Applicant: Google LLC

    Inventor: Matthew Sharifi

    Abstract: A computing system and method that can be used for a mapping system that can recommend paths for navigational routing to a primary user. In particular, a primary user may be interested in navigational routes that secondary users, who may be the same or different from the primary user, have taken in the past. Even more particularly, the mapping systems described herein can provide improved user navigational services by leveraging the insight that users who have previously been in a similar situation (e.g., on vacation, in icy conditions, etc.) may wish to take the same route as they did the last time. Alternatively, users may desire to take routes based on routes which were taken by other users who are connected (e.g., via social media, address books, etc.).

    PAIRING USER HEADPHONES WITH A LOCATION-BASED AUTOMATED ASSISTANT

    Publication No.: US20250028770A1

    Publication Date: 2025-01-23

    Application No.: US18908294

    Filing Date: 2024-10-07

    Applicant: GOOGLE LLC

    Abstract: Implementations described herein relate to pairing a location-based automated assistant with a user device. The user device can include, for example, a headphones apparatus and/or a device that is paired with the headphones apparatus. The user device provides an indication that it is present at a location that is associated with a location-based automated assistant. A trust measure is determined that is indicative of trust between the user device and the location-based automated assistant. User information is provided by the user device to the location-based automated assistant. The location-based automated assistant determines response data to provide, via one or more speakers associated with the user device, that is specific to the location and further based on the user information.

    Voice query handling in an environment with multiple users

    Publication No.: US12205591B2

    Publication Date: 2025-01-21

    Application No.: US17938659

    Filing Date: 2022-10-06

    Applicant: Google LLC

    Abstract: A method includes detecting multiple users, receiving a first query issued by a first user, the first query including a command for a digital assistant to perform a first action, and enabling a round robin mode to control performance of actions commanded by queries. The method also includes, while performing the first action, receiving audio data corresponding to a second query including a command to perform a second action, performing speaker identification on the audio data, determining that the second query was spoken by the first user, preventing performing the second action, and prompting at least another user to issue a query. The method further includes receiving a third query issued by a second user, the third query including a command for the digital assistant to perform a third action, and when the digital assistant completes performing the first action, executing performance of the third action.
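
The turn-taking flow in this abstract can be illustrated with a minimal Python sketch. It is a toy model under stated assumptions, not the patented implementation: the class name `RoundRobinAssistant`, its methods, and the return strings are hypothetical, and the `speaker` argument is assumed to be the result of a speaker identification step that is not modeled here.

```python
from collections import deque
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class RoundRobinAssistant:
    """Toy model of round-robin query handling (all names are hypothetical)."""
    users: list                                         # users detected in the environment
    current: Optional[tuple] = None                     # (speaker, action) being performed
    pending: deque = field(default_factory=deque)       # queries waiting for their turn
    prompts: list = field(default_factory=list)         # record of who was prompted

    def handle_query(self, speaker, action):
        # `speaker` is assumed to come from speaker identification on the audio.
        if self.current is None:
            # First query: start the action; round-robin mode is now in effect.
            self.current = (speaker, action)
            return "started"
        if speaker == self.current[0]:
            # Same user asks again while their action is running:
            # do not perform it, and prompt the other users instead.
            others = [u for u in self.users if u != speaker]
            self.prompts.append(others)
            return "blocked"
        # A different user: hold the action until the current one completes.
        self.pending.append((speaker, action))
        return "queued"

    def complete_current(self):
        # When the running action finishes, start the next queued one, if any.
        finished = self.current
        self.current = self.pending.popleft() if self.pending else None
        return finished
```

In this sketch, a second query from the user whose action is still running is rejected, while a query from a different user is held and starts only once `complete_current()` is called.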

    Automated assistant training and/or execution of inter-user procedures

    Publication No.: US12197818B2

    Publication Date: 2025-01-14

    Application No.: US18241707

    Filing Date: 2023-09-01

    Applicant: GOOGLE LLC

    Abstract: Implementations relate to an automated assistant that can automate repeatedly performed procedures. The automation can involve communicating with different users, organizations, and/or other automated assistants. The automated assistant, with prior permission from respective user(s), can detect repeated performance of a particular series of manually initiated computational actions. Based on this determination, the automated assistant can determine automated assistant computational action(s) that can be performed by the automated assistant in order to reduce latency in performing a procedure, reduce quantity and/or size of transmissions in performing the procedure, and/or reduce an amount of client device resources required for performing the procedure. Such actions can include communicating with an additional automated assistant that may be associated with another user and/or organization. In these and other manners, manually initiated computational actions that include electronic communications amongst users can be converted to backend operations amongst instances of automated assistants to achieve technical benefits.

    JOINT PROCESSING OF CORRECTIVE AND MISINTERPRETED UTTERANCES FOR ACCURATE ASSISTANT RESPONSES

    Publication No.: US20250014574A1

    Publication Date: 2025-01-09

    Application No.: US18217930

    Filing Date: 2023-07-03

    Applicant: GOOGLE LLC

    Abstract: Implementations set forth herein relate to an automated assistant that can jointly process spoken utterances that can include at least one corrective utterance and one utterance that may have been misinterpreted by the automated assistant. In this way, the automated assistant does not need to exclusively rely on the corrective utterance to determine a request of a user but, rather, can utilize data capturing any prior spoken utterance to more accurately determine the request. The automated assistant can process utterances jointly after determining that they are acoustically similar and/or received within a threshold duration of time. The joint processing can involve generating a single input, or multiple inputs, to characterize the series of utterances and processing the single input, or multiple inputs, with one or more neural network models and/or machine learning models. The automated assistant can then proactively correct any previous responses to misinterpreted utterances.
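
The gating step described in this abstract (process two utterances jointly when they are acoustically similar and/or arrive within a threshold duration) might look roughly like the Python sketch below. Everything concrete here is an assumption made for illustration: the `Utterance` type, the use of cosine similarity over a precomputed embedding as a stand-in for acoustic similarity, the threshold values, and the shape of the joint input. The abstract does not specify any of these.

```python
import math
from dataclasses import dataclass


@dataclass
class Utterance:
    text: str          # recognized text (hypothetical field)
    timestamp: float   # arrival time in seconds
    embedding: tuple   # assumed acoustic embedding, produced elsewhere


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def should_process_jointly(previous, corrective,
                           min_similarity=0.8, max_gap_s=10.0):
    """Return True if either condition holds (the 'and/or' in the abstract)."""
    gap = corrective.timestamp - previous.timestamp
    close_in_time = 0 <= gap <= max_gap_s
    similar = cosine_similarity(previous.embedding,
                                corrective.embedding) >= min_similarity
    return close_in_time or similar


def build_joint_input(previous, corrective):
    """Bundle both utterances into one input for a downstream model (not shown)."""
    return {"utterances": [previous.text, corrective.text]}
```

The thresholds are placeholders; a real system would tune them and would feed the joint input to whatever model interprets the request.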

    PROVIDING INVERTED DIRECTIONS AND OTHER INFORMATION BASED ON A CURRENT OR RECENT JOURNEY

    Publication No.: US20250012587A1

    Publication Date: 2025-01-09

    Application No.: US18275745

    Filing Date: 2022-09-29

    Applicant: GOOGLE LLC

    Inventor: Matthew Sharifi

    Abstract: A computing device may implement a method for providing route information regarding a completed or ongoing trip by a user without the user having previously initiated a navigation session. The method may include receiving a query regarding a previous or ongoing trip by a user prior to the user initiating a navigation session; determining an origin for the previous or ongoing trip; obtaining route information for the previous or ongoing trip; generating one or more route attributes associated with the query based at least on the origin for the previous or ongoing trip and the route information for the previous or ongoing trip; and providing a response to the query based at least on the one or more route attributes.

    Dynamically adapting on-device models, of grouped assistant devices, for cooperative processing of assistant requests

    Publication No.: US12175981B2

    Publication Date: 2024-12-24

    Application No.: US18361468

    Filing Date: 2023-07-28

    Applicant: GOOGLE LLC

    Abstract: Implementations are directed to dynamically adapting which assistant on-device model(s) are locally stored at assistant devices of an assistant device group and/or dynamically adapting the assistant processing role(s) of the assistant device(s) of the assistant device group. In some of those implementations, the corresponding on-device model(s) and/or corresponding processing role(s), for each of the assistant devices of the group, is determined based on collectively considering individual processing capabilities of the assistant devices of the group. Implementations are additionally or alternatively directed to cooperatively utilizing assistant devices of a group, and their associated post-adaptation on-device model(s) and/or post-adaptation processing role(s), in cooperatively processing assistant requests that are directed to any one of the assistant devices of the group.

    PROMOTING VOICE ACTIONS TO HOTWORDS
    Invention application

    Publication No.: US20240371377A1

    Publication Date: 2024-11-07

    Application No.: US18776117

    Filing Date: 2024-07-17

    Applicant: Google LLC

    Inventor: Matthew Sharifi

    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for designating certain voice commands as hotwords. The methods, systems, and apparatus include actions of receiving a hotword followed by a voice command. Additional actions include determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, where a voice command that is designated as a hotword is treated as a voice input regardless of whether the voice command is preceded by another hotword. Further actions include, in response to determining that the voice command satisfies one or more predetermined criteria associated with designating the voice command as a hotword, designating the voice command as a hotword.
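
The promotion logic in this abstract can be sketched in a few lines of Python. This is an illustrative sketch only: the abstract leaves the "predetermined criteria" unspecified, so a simple usage-count threshold is assumed here, and the class name `HotwordPromoter`, the base hotword "ok assistant", and the `min_uses` parameter are all hypothetical.

```python
from collections import Counter


class HotwordPromoter:
    """Toy model: promote a voice command to a hotword after repeated use."""

    def __init__(self, base_hotword="ok assistant", min_uses=3):
        self.base_hotword = base_hotword   # hypothetical base hotword
        self.min_uses = min_uses           # assumed criterion: a usage count
        self.use_counts = Counter()
        self.promoted = set()              # commands now treated as hotwords

    def process(self, utterance):
        """Return the command to execute, or None if the utterance is ignored."""
        text = utterance.lower().strip()
        # A promoted command is accepted even without a preceding hotword.
        if text in self.promoted:
            return text
        prefix = self.base_hotword + " "
        if not text.startswith(prefix):
            return None                    # no hotword, not promoted: ignore
        command = text[len(prefix):].strip()
        if not command:
            return None
        # Check the (assumed) criterion and promote when it is satisfied.
        self.use_counts[command] += 1
        if self.use_counts[command] >= self.min_uses:
            self.promoted.add(command)
        return command
```

For example, with `min_uses=2`, saying "ok assistant stop music" twice would let a later bare "stop music" be accepted, while other commands would still need the base hotword.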
