-
公开(公告)号:US11996101B2
公开(公告)日:2024-05-28
申请号:US18160342
申请日:2023-01-27
Applicant: Google LLC
Inventor: Matthew Sharifi , Victor Carbune
CPC classification number: G10L15/22 , G06F3/167 , G10L15/075 , G10L15/083 , G10L2015/223
Abstract: A method for streaming action fulfillment receives audio data corresponding to an utterance where the utterance includes a query to perform an action that requires performance of a sequence of sub-actions in order to fulfill the action. While receiving the audio data, but before receiving an end of speech condition, the method processes the audio data to generate intermediate automated speech recognition (ASR) results, performs partial query interpretation on the intermediate ASR results to determine whether the intermediate ASR results identify an application type needed to perform the action and, when the intermediate ASR results identify a particular application type, performs a first sub-action in the sequence of sub-actions by launching a first application to execute on the user device where the first application is associated with the particular application type. The method, in response to receiving an end of speech condition, fulfills performance of the action.
-
公开(公告)号:US20240169977A1
公开(公告)日:2024-05-23
申请号:US18430196
申请日:2024-02-01
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
Abstract: Implementations can receive, at a computing device, audio data corresponding to a spoken utterance of a user, process the audio data to generate, for one or more parts of the spoken utterance, a plurality of speech hypotheses, select a given one of the speech hypotheses, cause the given one of the speech hypotheses to be incorporated as a portion of a transcription associated with the software application, and store the plurality of speech hypotheses. In some implementations, the plurality of speech hypotheses can be loaded at an additional computing device when the transcription is accessed at the additional computing device. In additional or alternative implementations, the plurality of speech hypotheses can be loaded into memory of the computing device when the software application is reactivated and/or when a subsequent dialog session associated with the transcription is initiated.
-
公开(公告)号:US20240169206A1
公开(公告)日:2024-05-23
申请号:US18426036
申请日:2024-01-29
Applicant: GOOGLE LLC
Inventor: Sebastian Millius , Tom Hume , Matthew Sharifi
IPC: G06N3/084 , G06F40/169 , G06F40/216 , G06F40/284 , G06F40/30 , G06F40/35 , G06N3/044 , G06N3/045 , G06Q10/107 , H04L51/046 , H04L51/234 , H04M1/72403 , H04M1/72436 , H04M1/72451 , H04M1/72454 , H04M1/72484
CPC classification number: G06N3/084 , G06F40/216 , G06F40/284 , G06F40/30 , G06F40/35 , G06N3/044 , G06N3/045 , G06Q10/107 , H04L51/046 , H04L51/234 , H04M1/72403 , H04M1/72484 , G06F40/169 , H04M1/72436 , H04M1/72451 , H04M1/72454
Abstract: Training and/or utilizing an interaction prediction model to generate a predicted interaction value that indicates a likelihood of interaction with a corresponding application on the basis of an electronic communication. The application can be in addition to any electronic communication application that is utilized in formulating the electronic communication and/or that is utilized in rendering the electronic communication. The predicted interaction value can be generated based on processing, utilizing the interaction prediction model, of features of the electronic communication and/or of other features. The predicted interaction value can be utilized to determine whether to perform further action(s) that interact with, and/or enable efficient interaction with, the application on the basis of the electronic communication.
-
公开(公告)号:US20240160680A1
公开(公告)日:2024-05-16
申请号:US18418594
申请日:2024-01-22
Applicant: Google LLC
Inventor: Victor Carbune , Matthew Sharifi
IPC: G06F16/9537 , G01C21/34 , G01C21/36 , G06F16/29 , G06F16/9538 , G06F40/103 , G06T11/60
CPC classification number: G06F16/9537 , G01C21/3476 , G01C21/3626 , G06F16/29 , G06F16/9538 , G06F40/103 , G06T11/60
Abstract: The technology relates to integrating web content into a map application. A query is sent from the map application. At least one snippet of web content identified as relevant to the query is received in response to the query, the at least one snippet of content including a portion of media or textual content from a source on the web. The portion of media or textual content is formatted for display in the map application and output for display in the map application.
-
公开(公告)号:US20240127799A1
公开(公告)日:2024-04-18
申请号:US17967183
申请日:2022-10-17
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Matthew Sharifi
Abstract: Implementations related to facilitating continued conversations of a user with an automated assistant when the user changes locations relative to one or more devices in an ecosystem of linked assistant devices. The user initially invokes a first device and provides a request, which is processed by the first device. The first device provides a notification to one or more other devices in the ecosystem to indicate that the user is likely to issue a further assistant request. The first device processes subsequent audio data to determine whether the subsequent audio data includes a further assistant request. The one or more other notified devices process device-specific sensor data to determine whether the user is co-present with the one of the other devices. If the user presence is detected, an indication is provided to the first device, causing the first device to cease processing subsequent audio data. Further, the co-present device starts to process subsequent audio data.
-
公开(公告)号:US20240119088A1
公开(公告)日:2024-04-11
申请号:US17938455
申请日:2022-10-06
Applicant: Google LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G06F16/632 , G06F16/638 , G10L17/02 , G10L17/06 , G10L17/22
CPC classification number: G06F16/632 , G06F16/639 , G10L17/02 , G10L17/06 , G10L17/22
Abstract: A method for handling contradictory queries on a shared device includes receiving a first query issued by a first user, the first query specifying a first long-standing operation for a digital assistant to perform, and while the digital assistant is performing the first long-standing operation, receiving a second query, the second query specifying a second long-standing operation for the digital assistant to perform. The method also includes determining that the second query was issued by another user different than the first user and determining, using a query resolver, that performing the second long-standing operation would conflict with the first long-standing operation. The method further includes identifying one or more compromise operations for the digital assistant to perform, and instructing the digital assistant to perform a selected compromise operation among the identified one or more compromise operations.
-
公开(公告)号:US20240119086A1
公开(公告)日:2024-04-11
申请号:US18390590
申请日:2023-12-20
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi
IPC: G06F16/487 , G06F16/245 , G06F16/2455 , G06F16/2457 , G06F16/432 , G06F16/435 , G06F16/48 , G06F16/683 , G06F16/783 , G06F16/9535 , G06F16/955 , G06Q30/02 , G06Q30/0601
CPC classification number: G06F16/487 , G06F16/245 , G06F16/2455 , G06F16/24578 , G06F16/433 , G06F16/435 , G06F16/437 , G06F16/489 , G06F16/685 , G06F16/7834 , G06F16/9535 , G06F16/955 , G06Q30/02 , G06Q30/0631
Abstract: Methods, systems, and apparatus for receiving, from a user, a request that includes an entity identifier associated with an entity that is referenced by one or more query terms of a search query, determining that the entity is identified in a media consumption database as a media item that has been indicated as consumed by the user or that the entity is associated with a media item that is identified in the media consumption database as a media item that has been indicated as consumed by the user, and based on the determination, providing a response to the request, the response including data indicating that the entity is a media item that has been indicated as consumed by the user or that the entity is associated with a media item that has been indicated as consumed by the user.
-
公开(公告)号:US20240105178A1
公开(公告)日:2024-03-28
申请号:US18535701
申请日:2023-12-11
Applicant: Google LLC
Inventor: Matthew Sharifi , Victor Carbune
CPC classification number: G10L15/22 , G06F3/167 , G10L15/02 , G10L15/08 , G10L17/24 , G10L2015/088 , G10L2015/223
Abstract: A method includes a first assistant-enabled device (AED) receiving an assignment instruction assigning a group hotword to a selected group of AEDs that includes the first AED and one or more other AEDs. Each AED is configured to wake-up from a low-power state when the group hotword is detected in streaming audio by at least one of the AEDs. The method also includes receiving audio data that corresponds to an utterance spoken by the user and includes a query that specifies an operation to perform. In response to detecting the group hotword in the audio data, the method also includes triggering the first AED to wake-up from the low-power state and executing a collaboration routine to cause the first AED and each other AED in the selected group of AEDs to collaborate with one another to fulfill performance of the operation specified by the query.
-
249.
公开(公告)号:US20240104140A1
公开(公告)日:2024-03-28
申请号:US18531015
申请日:2023-12-06
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G06F16/9032 , G10L15/30 , G16Y10/80 , G16Y40/35
CPC classification number: G06F16/90332 , G10L15/30 , G16Y10/80 , G16Y40/35
Abstract: Implementations can identify a given assistant device from among a plurality of assistant devices in an ecosystem, obtain device-specific signal(s) that are generated by the given assistant device, process the device-specific signal(s) to generate candidate semantic label(s) for the given assistant device, select a given semantic label for the given semantic device from among the candidate semantic label(s), and assigning, in a device topology representation of the ecosystem, the given semantic label to the given assistant device. Implementations can optionally receive a spoken utterance that includes a query or command at the assistant device(s), determine a semantic property of the query or command matches the given semantic label to the given assistant device, and cause the given assistant device to satisfy the query or command.
-
公开(公告)号:US20240102816A1
公开(公告)日:2024-03-28
申请号:US18010714
申请日:2022-03-31
Applicant: Google LLC
Inventor: Matthew Sharifi
CPC classification number: G01C21/3484 , G01C21/3608 , G01C21/3641 , G01C21/3655 , G01C21/367 , G10L15/16 , G10L15/26
Abstract: Methods and systems for customizing the presentation of navigation instructions to a user during a navigation session are disclosed herein. During a navigation session, a user can request to customize instructions being provided for the navigation session, such as increasing or decreasing a level of detail being provided to the user. Based on the request, the system can determine one or modifications for instructions remaining in the navigation session, modify those instructions, and store the modified instructions for presentation to the user.
-
-
-
-
-
-
-
-
-