-
公开(公告)号:US11838582B1
公开(公告)日:2023-12-05
申请号:US18064921
申请日:2022-12-12
Applicant: Google LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G06F15/16 , H04N21/442 , H04N21/258 , H04W4/029
CPC classification number: H04N21/44209 , H04N21/25891 , H04W4/029
Abstract: A method using media arbitration includes, while a first assistant-enabled device is performing a first long-standing operation, determining the first assistant-enabled device satisfies a co-presence condition with a second assistant-enabled device, and determining that the second assistant-enabled device is performing a second long-standing operation that conflicts with the first long-standing operation performed by the first assistant-enabled device. Based on determining that the first long-standing operation and the second long-standing operation conflict, the method also includes executing an operation arbitration routine to identify one or more compromise operations for at least one of the first assistant-enabled device or the second assistant-enabled device to perform, and instructing the first assistant-enabled device or the second assistant-enabled device to perform a selected compromise operation among the identified compromise operations.
-
公开(公告)号:US20230386468A1
公开(公告)日:2023-11-30
申请号:US18446420
申请日:2023-08-08
Applicant: Google LLC
Inventor: Aleksandar Kracun , Matthew Sharifi
IPC: G10L15/22 , G10L15/197 , G10L17/06 , G10L17/24 , G10L15/30
CPC classification number: G10L15/22 , G10L15/197 , G10L17/06 , G10L17/24 , G10L15/30 , G10L2015/088
Abstract: A method for adapting hotword recognition includes receiving audio data characterizing a hotword event detected by a first stage hotword detector in streaming audio captured by a user device. The method also includes processing, using a second stage hotword detector, the audio data to determine whether a hotword is detected by the second stage hotword detector in a first segment of the audio data. When the hotword is not detected by the second stage hotword detector, the method includes, classifying the first segment of the audio data as containing a negative hotword that caused a false detection of the hotword event in the streaming audio by the first stage hotword detector. Based on the first segment of the audio data classified as containing the negative hotword, the method includes updating the first stage hotword detector to prevent triggering the hotword event in subsequent audio data that contains the negative hotword.
-
公开(公告)号:US11823664B2
公开(公告)日:2023-11-21
申请号:US17982834
申请日:2022-11-08
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
CPC classification number: G10L15/08 , G06F3/16 , G10L13/02 , G10L2015/088
Abstract: Implementations can receive audio data corresponding to a spoken utterance of a user, process the audio data to generate a plurality of speech hypotheses, determine an action to be performed by an automated assistant based on the speech hypotheses, and cause the computing device to render an indication of the action. In response to the computing device rendering the indication, implementations can receive additional audio data corresponding to an additional spoken utterance of the user, process the additional audio data to determine that a portion of the spoken utterance is similar to an additional portion of the additional spoken utterance, supplant the action with an alternate action, and cause the automated assistant to initiate performance of the alternate action. Some implementations can determine whether to render the indication of the action based on a confidence level associated with the action.
-
公开(公告)号:US20230350905A1
公开(公告)日:2023-11-02
申请号:US18344509
申请日:2023-06-29
Applicant: Google LLC
Inventor: Matthew Sharifi , Abhanshu Sharma , David Petrou
IPC: G06F16/2457 , G06F16/583 , G06F16/58 , G06F16/2452 , G06F16/903
CPC classification number: G06F16/24578 , G06F16/583 , G06F16/5866 , G06F16/24522 , G06F16/90335
Abstract: Methods, systems, and apparatus for receiving a query image, receiving one or more entities that are associated with the query image, identifying, for one or more of the entities, one or more candidate search queries that are pre-associated with the one or more entities, generating a respective relevance score for each of the candidate search queries, selecting, as a representative search query for the query image, a particular candidate search query based at least on the generated respective relevance scores and providing the representative search query for output in response to receiving the query image.
-
公开(公告)号:US20230345061A1
公开(公告)日:2023-10-26
申请号:US18208570
申请日:2023-06-12
Applicant: Google LLC
Inventor: Matthew Sharifi
IPC: H04N21/235 , H04N21/234 , H04N21/25 , H04N21/466 , H04N21/84
CPC classification number: H04N21/2353 , H04N21/23418 , H04N21/251 , H04N21/4668 , H04N21/84
Abstract: Systems and methods for matching live media content are disclosed. At a server, obtaining first media content from a client device, herein the first media content corresponds to a portion of media content being played on the client device, and the first media content is associated with a predefined expiration time; obtaining second media content from one or more content feeds, wherein the second media content also corresponds to a portion of the media content being played on the client device; in accordance with a determination that the second media content corresponds to a portion of the media content that has been played on the client device: before the predefined expiration time, obtaining third media content corresponding to the media content being played on the client device, from the one or more content feeds; and comparing the first media content with the third media content.
-
公开(公告)号:US11798557B2
公开(公告)日:2023-10-24
申请号:US17650173
申请日:2022-02-07
Applicant: Google LLC
Inventor: Alexander H. Gruenstein , Johan Schalkwyk , Matthew Sharifi
CPC classification number: G10L15/22 , G06F3/167 , G10L15/08 , G10L15/30 , G10L25/51 , G10L17/00 , G10L2015/088 , G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword trigger suppression are disclosed. In one aspect, a method includes the actions of receiving, by a microphone of a computing device, audio corresponding to playback of an item of media content, the audio including an utterance of a predefined hotword that is associated with performing an operation on the computing device. The actions further include processing the audio. The actions further include in response to processing the audio, suppressing performance of the operation on the computing device.
-
公开(公告)号:US11783828B2
公开(公告)日:2023-10-10
申请号:US17231333
申请日:2021-04-15
Applicant: Google LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G10L15/22 , G10L15/32 , G10L15/30 , G06F16/245 , G06F16/248 , G10L15/26
CPC classification number: G10L15/22 , G06F16/245 , G06F16/248 , G10L15/26 , G10L15/30 , G10L15/32 , G10L2015/223
Abstract: Systems and methods for determining whether to combine responses from multiple automated assistants. An automated assistant may be invoked by a user utterance, followed by a query, which is provided to a plurality of automated assistants. A first response is received from a first automated assistant and a second response is received from a second automated assistant. Based on similarity between the responses, a primary automated assistant determines whether to combine the responses into a combined response. Once the combined response has been generated, one or more actions are performed in response to the combined response.
-
公开(公告)号:US20230298588A1
公开(公告)日:2023-09-21
申请号:US18323725
申请日:2023-05-25
Applicant: Google LLC
Inventor: Victor Carbune , Matthew Sharifi
IPC: G10L15/22 , G06F16/2452 , G06F1/3231 , G10L15/16 , G10L15/28
CPC classification number: G10L15/22 , G06F1/3231 , G06F16/24522 , G10L15/16 , G10L15/285 , G10L2015/088
Abstract: A method includes receiving audio data corresponding to an utterance spoken by the user and captured by the user device. The utterance includes a command for a digital assistant to perform an operation. The method also includes determining, using a hotphrase detector configured to detect each trigger word in a set of trigger words associated with a hotphrase, whether any of the trigger words in the set of trigger words are detected in the audio data during the corresponding fixed-duration time window. The method also includes determining identifying, in the audio corresponding to the utterance, the hotphrase when each other trigger word in the set of trigger words was also detected in the audio data. The method also includes triggering an automated speech recognizer to perform speech recognition on the audio data when the hotphrase is identified in the audio data corresponding to the utterance.
-
39.
公开(公告)号:US20230298583A1
公开(公告)日:2023-09-21
申请号:US18200518
申请日:2023-05-22
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G10L15/22 , G06F3/04886 , G10L25/78 , G06F3/16
CPC classification number: G10L15/22 , G06F3/04886 , G10L25/78 , G06F3/167 , G10L2015/223 , G10L2015/228
Abstract: Implementations set forth relate to suggesting an alternate interface modality when an automated assistant and/or a user is expected to not understand a particular interaction between the user and the automated assistant. In some instances, the automated assistant can pre-emptively determine that a forthcoming and/or ongoing interaction between a user and an automated assistant may experience interference. Based on this determination, the automated assistant can provide an indication that the interaction may not be successful and/or that the user should interact with the automated assistant through a different modality. For example, the automated assistant can render a keyboard interface at a portable computing device when the automated assistant determines that an audio interface of the portable computing device is experiencing interference.
-
公开(公告)号:US11762848B2
公开(公告)日:2023-09-19
申请号:US17903449
申请日:2022-09-06
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G06F17/00 , G06F16/242 , G06F16/2457 , G06F16/248 , G06F16/23 , G06F18/22 , G06F18/214
CPC classification number: G06F16/2428 , G06F16/23 , G06F16/248 , G06F16/24575 , G06F18/214 , G06F18/22
Abstract: Methods, systems, and computer readable media related to generating a combined search query based on search parameters of a current search query of a user and search parameters of one or more previously submitted search quer(ies) of the user that are determined to be of the same line of inquiry as the current search query. Two or more search queries may be determined to share a line of inquiry when it is determined that they are within a threshold level of semantic similarity to one another. Once a shared line of inquiry has been identified and a combined search query generated, users may interact with the search parameters and/or the search results to update the search parameters of the combined search query.
-
-
-
-
-
-
-
-
-