-
公开(公告)号:US20240161740A1
公开(公告)日:2024-05-16
申请号:US18055395
申请日:2022-11-14
Applicant: Google LLC
Inventor: Matthew Sharifi , Victror Carbune
CPC classification number: G10L15/22 , G10L15/08 , G10L2015/088
Abstract: A method using multi-assistant warm words includes, for each digital assistant in a group of digital assistants enabled on a multi-assistant device, receiving a respective active set of warm words that each specify a respective action to perform. Based on the respective active set of warm words, the method also includes executing a warm word arbitration routine to enable a final set of warm words for detection, each warm word in the final set of warm words selected from the respective active set of warm words for at least one digital assistant. While the final set of warm words are enabled for detection, the method includes receiving audio data corresponding to an utterance, detecting a warm word from the final set of warm words, and instructing the digital assistant associated with the detected warm word to perform the respective action specified by the detected warm word.
-
公开(公告)号:US11984128B2
公开(公告)日:2024-05-14
申请号:US17700135
申请日:2022-03-21
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G10L17/08 , G06F3/16 , G06F21/32 , G10L15/22 , G10L17/02 , G10L17/04 , G10L17/10 , G10L17/14 , G10L17/18 , G10L17/22 , G10L17/24 , G10L15/08
CPC classification number: G10L17/08 , G06F3/167 , G06F21/32 , G10L15/22 , G10L17/02 , G10L17/04 , G10L17/10 , G10L17/14 , G10L17/18 , G10L17/22 , G10L17/24 , G10L2015/088 , G10L2015/227
Abstract: Implementations relate to automatic generation of speaker features for each of one or more particular text-dependent speaker verifications (TD-SVs) for a user. Implementations can generate speaker features for a particular TD-SV using instances of audio data that each capture a corresponding spoken utterance of the user during normal non-enrollment interactions with an automated assistant via one or more respective assistant devices. For example, a portion of an instance of audio data can be used in response to: (a) determining that recognized term(s) for the spoken utterance captured by that the portion correspond to the particular TD-SV; and (b) determining that an authentication measure, for the user and for the spoken utterance, satisfies a threshold. Implementations additionally or alternatively relate to utilization of speaker features, for each of one or more particular TD-SVs for a user, in determining whether to authenticate a spoken utterance for the user.
-
公开(公告)号:US20240143154A1
公开(公告)日:2024-05-02
申请号:US18391583
申请日:2023-12-20
Applicant: Google LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G06F3/04847 , G06F3/0482 , G06F3/16 , H04B17/318 , H04W4/02
CPC classification number: G06F3/04847 , G06F3/0482 , G06F3/167 , H04B17/318 , H04W4/023
Abstract: A method includes obtaining proximity information for each of a plurality of assistant-enabled devices within an environment of a user device. Each assistant-enabled device is controllable by an assistant application to perform a respective set of available actions associated with the assistant-enabled device. For each assistant-enabled device, the method also includes determining a proximity score based on the proximity information indicating a proximity estimation of the corresponding assistant-enabled device relative to the user device. The method further includes generating, using the proximity scores determined for the assistant-enabled devices, a ranked list of candidate assistant-enabled devices, and for each corresponding assistant-enabled device in the ranked list, displaying, in a graphical user interface (GUI), a respective set of controls for performing the respective set of actions associated with the corresponding assistant-enabled device.
-
134.
公开(公告)号:US11972766B2
公开(公告)日:2024-04-30
申请号:US18100424
申请日:2023-01-23
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G10L17/22 , G06F3/16 , G10L19/018 , G10L25/51
CPC classification number: G10L17/22 , G06F3/165 , G10L19/018 , G10L25/51
Abstract: Techniques are described herein for detecting and suppressing commands in media that may trigger another automated assistant. A method includes: determining, for each of a plurality of automated assistant devices in an environment that are each executing at least one automated assistant, an active capability of the automated assistant device; initiating playback of digital media by an automated assistant; in response to initiating playback, processing the digital media to identify an audio segment in the digital media that, upon playback, is expected to trigger activation of at least one automated assistant executing on at least one of the plurality of automated assistant devices in the environment, based on the active capability of the at least one of the plurality of automated assistant devices; and in response to identifying the audio segment in the digital media, modifying the digital media to suppress the activation of the at least one automated assistant.
-
公开(公告)号:US20240110793A1
公开(公告)日:2024-04-04
申请号:US17637994
申请日:2021-09-29
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi
IPC: G01C21/34
CPC classification number: G01C21/34
Abstract: Techniques for preserving privacy during turn by turn navigation sessions are provided. An example method includes receiving an indication of a precise origin location and a precise destination location via a navigation application operating on a mobile computing device. The navigation application may generate an indication of a coarse origin region including the precise origin location, and an indication of a coarse destination region including the precise destination location, and transmit the indications of the coarse origin and destination regions to an external navigation server. The navigation application may receive, from the external navigation server, a coarse navigation route from the coarse origin region to the coarse destination region, as well as navigation data associated with the coarse origin and destination regions, based on which the navigation application may generate the private navigation route for the user. The private navigation route may then be provided to the user.
-
公开(公告)号:US11915706B2
公开(公告)日:2024-02-27
申请号:US18150561
申请日:2023-01-05
Applicant: Google LLC
Inventor: Matthew Sharifi
CPC classification number: G10L15/285 , G10L15/01 , G10L15/08 , G10L15/22 , G10L15/32 , G10L17/22 , G06F3/167 , G10L2015/088 , G10L2015/223
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving, by a first computing device, audio data that corresponds to an utterance. The actions further include determining a first value corresponding to a likelihood that the utterance includes a hotword. The actions further include receiving a second value corresponding to a likelihood that the utterance includes the hotword, the second value being determined by a second computing device. The actions further include comparing the first value and the second value. The actions further include based on comparing the first value to the second value, initiating speech recognition processing on the audio data.
-
公开(公告)号:US11914661B2
公开(公告)日:2024-02-27
申请号:US17252439
申请日:2020-09-02
Applicant: Google LLC
Inventor: Victor Carbune , Matthew Sharifi
IPC: G06F16/9537 , G06F16/9538 , G06F16/29 , G06F40/103 , G01C21/34 , G01C21/36 , G06T11/60
CPC classification number: G06F16/9537 , G01C21/3476 , G01C21/3626 , G06F16/29 , G06F16/9538 , G06F40/103 , G06T11/60
Abstract: The technology relates to integrating web content into a map application. A query is sent from the map application. At least one snippet of web content identified as relevant to the query is received in response to the query, the at least one snippet of content including a portion of media or textual content from a source on the web. The portion of media or textual content is formatted for display in the map application and output for display in the map application.
-
公开(公告)号:US20240064363A1
公开(公告)日:2024-02-22
申请号:US17902601
申请日:2022-09-02
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: H04N21/422 , G10L25/57 , G10L15/22 , G06V20/40 , H04N21/472
CPC classification number: H04N21/42204 , G10L25/57 , G10L15/22 , G06V20/40 , H04N21/42203 , H04N21/472 , G10L2015/223 , G06V2201/10
Abstract: Voice-based interaction with video content being presented by a media player application is enhanced through the use of an automated assistant capable of identifying when a spoken utterance by a user is a request to playback a specific scene in the video content. A query identified in a spoken utterance may be used to access stored scene metadata associated with video content being presented in the vicinity of the user to identify one or more locations in the video content that correspond to the query, such that a media control command may be issued to the media player application to cause the media player application to seek to a particular location in the video content that satisfies the query.
-
公开(公告)号:US20240038231A1
公开(公告)日:2024-02-01
申请号:US18378083
申请日:2023-10-09
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G10L15/22 , G10L15/32 , G10L15/30 , G06F16/245 , G06F16/248 , G10L15/26
CPC classification number: G10L15/22 , G10L15/32 , G10L15/30 , G06F16/245 , G06F16/248 , G10L15/26 , G10L2015/223
Abstract: Systems and methods for determining whether to combine responses from multiple automated assistants. An automated assistant may be invoked by a user utterance, followed by a query, which is provided to a plurality of automated assistants. A first response is received from a first automated assistant and a second response is received from a second automated assistant. Based on similarity between the responses, a primary automated assistant determines whether to combine the responses into a combined response. Once the combined response has been generated, one or more actions are performed in response to the combined response.
-
公开(公告)号:US20240022809A1
公开(公告)日:2024-01-18
申请号:US18446381
申请日:2023-08-08
Applicant: GOOGLE LLC
Inventor: Felix Weissenberger , Balint Miklos , Victor Carbune , Matthew Sharifi , Domenico Carbotta , Ray Chen , Kevin Fu , Bogdan Prisacari , Fo Lee , Mucun Lu , Neha Garg , Jacopo Sannazzaro Natta , Barbara Poblocka , Jae Seo , Matthew Miao , Thomas Qian , Luv Kothari
IPC: H04N23/60 , G06N20/00 , G10L15/22 , G10L25/51 , H04N5/92 , H04N23/61 , H04N23/62 , H04N23/66 , H04N23/80
CPC classification number: H04N23/64 , G06N20/00 , G10L15/22 , G10L25/51 , H04N5/9201 , H04N23/61 , H04N23/62 , H04N23/66 , H04N23/80 , G10L15/1822
Abstract: Implementations set forth herein relate to an automated assistant that can control a camera according to one or more conditions specified by a user. A condition can be satisfied when, for example, the automated assistant detects a particular environment feature is apparent. In this way, the user can rely on the automated assistant to identify and capture certain moments without necessarily requiring the user to constantly monitor a viewing window of the camera. In some implementations, a condition for the automated assistant to capture media data can be based on application data and/or other contextual data that is associated with the automated assistant. For instance, a relationship between content in a camera viewing window and other content of an application interface can be a condition upon which the automated assistant captures certain media data using a camera.
-
-
-
-
-
-
-
-
-