-
71.
公开(公告)号:US12183348B2
公开(公告)日:2024-12-31
申请号:US18382735
申请日:2023-10-23
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
Abstract: Implementations are directed to biasing speaker authentication on a per-user basis and on a device-by-device basis and/or contextual feature(s) basis. In some of those implementations, in performing speaker authentication based on a spoken utterance, different biasing parameters are determined for each of multiple different registered users of an assistant device at which the spoken utterance was detected. In those implementations, each of the biasing parameters can be used to make it more likely or less likely (in dependence of the biasing parameter) that a corresponding registered user will be verified using the speaker authentication. Through utilization of biasing parameter(s) in performing speaker authentication, accuracy and/or robustness of speaker authentication can be increased.
-
公开(公告)号:US12164584B2
公开(公告)日:2024-12-10
申请号:US17013954
申请日:2020-09-08
Applicant: Google LLC
Inventor: Victor Carbune , Matthew Sharifi
IPC: G06F16/00 , G06F16/2457 , G06F16/29 , G06F16/9535 , G06F16/9537 , G06F16/9538
Abstract: The present disclosure provides a computing device and method for providing personal specific information based on semantic queries. The semantic queries may be input in a natural language form, and may include user specific context, such as by referring to prior or future events related to a place the user is searching for. With the user's authorization, data associated with prior or planned activities of the user may be accessed and information from the accessed data may be identified, wherein the information is correlated with the user specific context. One or more query results are determined based on the identified information and provided for output to the user.
-
公开(公告)号:US12164572B2
公开(公告)日:2024-12-10
申请号:US18531015
申请日:2023-12-06
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G06F15/16 , G06F16/9032 , G10L15/30 , G16Y10/80 , G16Y40/35
Abstract: Implementations can identify a given assistant device from among a plurality of assistant devices in an ecosystem, obtain device-specific signal(s) that are generated by the given assistant device, process the device-specific signal(s) to generate candidate semantic label(s) for the given assistant device, select a given semantic label for the given semantic device from among the candidate semantic label(s), and assigning, in a device topology representation of the ecosystem, the given semantic label to the given assistant device. Implementations can optionally receive a spoken utterance that includes a query or command at the assistant device(s), determine a semantic property of the query or command matches the given semantic label to the given assistant device, and cause the given assistant device to satisfy the query or command.
-
公开(公告)号:US12164556B2
公开(公告)日:2024-12-10
申请号:US17336000
申请日:2021-06-01
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G06F17/00 , G06F7/00 , G06F16/532 , G06T3/40
Abstract: Techniques are described herein for providing smart suggestions for image zoom regions. A method includes: receiving a search query; performing a search using the search query to identify search results that include image search results including a plurality of images that are responsive to the search query; for a given image of the plurality of images included in the image search results, determining at least one zoom region in the given image; and providing the search results including the image search results, including providing the given image and an indication of the at least one zoom region in the given image.
-
公开(公告)号:US12149773B2
公开(公告)日:2024-11-19
申请号:US17902601
申请日:2022-09-02
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G10L25/57 , G06V20/40 , G10L15/22 , H04N21/422 , H04N21/472
Abstract: Voice-based interaction with video content being presented by a media player application is enhanced through the use of an automated assistant capable of identifying when a spoken utterance by a user is a request to playback a specific scene in the video content. A query identified in a spoken utterance may be used to access stored scene metadata associated with video content being presented in the vicinity of the user to identify one or more locations in the video content that correspond to the query, such that a media control command may be issued to the media player application to cause the media player application to seek to a particular location in the video content that satisfies the query.
-
公开(公告)号:US20240378700A1
公开(公告)日:2024-11-14
申请号:US18781937
申请日:2024-07-23
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G06T5/50 , G06T3/00 , G06V10/74 , G06V10/764 , G06V10/774 , G06V10/82
Abstract: System and methods are provided for generating panoramic imagery. An example method may be performed by one or more processors and includes obtaining first panoramic imagery depicting a geographic area. The method also includes obtaining an image depicting one or more physical objects absent from the first panoramic imagery. Further, the method includes transforming the first panoramic imagery into second panoramic imagery depicting the one or more physical objects and including at least a portion of the first panoramic imagery.
-
公开(公告)号:US12106755B2
公开(公告)日:2024-10-01
申请号:US17573418
申请日:2022-01-11
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
Abstract: Techniques are described herein for warm word arbitration between automated assistant devices. A method includes: determining that warm word arbitration is to be initiated between a first assistant device and one or more additional assistant devices, including a second assistant device; broadcasting, by the first assistant device, to the one or more additional assistant devices, an active set of warm words for the first assistant device; for each of the one or more additional assistant devices, receiving, from the additional assistant device, an active set of warm words for the additional assistant device; identifying a matching warm word included in the active set of warm words for the first assistant device and included in the active set of warm words for the second assistant device; and enabling or disabling detection of the matching warm word by the first assistant device, in response to identifying the matching warm word.
-
公开(公告)号:US20240296848A1
公开(公告)日:2024-09-05
申请号:US18662590
申请日:2024-05-13
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G10L17/08 , G06F3/16 , G06F21/32 , G10L15/08 , G10L15/22 , G10L17/02 , G10L17/04 , G10L17/10 , G10L17/14 , G10L17/18 , G10L17/22 , G10L17/24
CPC classification number: G10L17/08 , G06F3/167 , G06F21/32 , G10L15/22 , G10L17/02 , G10L17/04 , G10L17/10 , G10L17/14 , G10L17/18 , G10L17/22 , G10L17/24 , G10L2015/088 , G10L2015/227
Abstract: Implementations relate to automatic generation of speaker features for each of one or more particular text-dependent speaker verifications (TD-SVs) for a user. Implementations can generate speaker features for a particular TD-SV using instances of audio data that each capture a corresponding spoken utterance of the user during normal non-enrollment interactions with an automated assistant via one or more respective assistant devices. For example, a portion of an instance of audio data can be used in response to: (a) determining that recognized term(s) for the spoken utterance captured by that the portion correspond to the particular TD-SV; and (b) determining that an authentication measure, for the user and for the spoken utterance, satisfies a threshold. Implementations additionally or alternatively relate to utilization of speaker features, for each of one or more particular TD-SVs for a user, in determining whether to authenticate a spoken utterance for the user.
-
公开(公告)号:US12057119B2
公开(公告)日:2024-08-06
申请号:US18092883
申请日:2023-01-03
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Matthew Sharifi , Ondrej Skopek , Justin Lu , Daniel Valcarce , Kevin Kilgour , Mohamad Hassan Rom , Nicolo D'Ercole , Michael Golikov
CPC classification number: G10L15/22 , G10L15/05 , G10L15/1815 , G10L25/78 , G10L2015/088 , G10L2015/223
Abstract: Some implementations process, using warm word model(s), a stream of audio data to determine a portion of the audio data that corresponds to particular word(s) and/or phrase(s) (e.g., a warm word) associated with an assistant command, process, using an automatic speech recognition (ASR) model, a preamble portion of the audio data (e.g., that precedes the warm word) and/or a postamble portion of the audio data (e.g., that follows the warm word) to generate ASR output, and determine, based on processing the ASR output, whether a user intended the assistant command to be performed. Additional or alternative implementations can process the stream of audio data using a speaker identification (SID) model to determine whether the audio data is sufficient to identify the user that provided a spoken utterance captured in the stream of audio data, and determine if that user is authorized to cause performance of the assistant command.
-
公开(公告)号:US20240211120A1
公开(公告)日:2024-06-27
申请号:US18594926
申请日:2024-03-04
Applicant: GOOGLE LLC
Inventor: Matthew Sharifi , Victor Carbune
IPC: G06F3/04842 , G06F3/01 , G06F3/0481 , G06F3/16 , G10L15/22
CPC classification number: G06F3/04842 , G06F3/016 , G06F3/0481 , G06F3/167 , G10L15/22 , G10L2015/223
Abstract: Implementations set forth herein relate to an automated assistant that can perform operations to revert various applications to prior states that the applications may have arrived at via certain user inputs. The user can provide a spoken utterance such as, “undo,” in order to cause the automated assistant to identify a particular application that the user may want to affect with the “undo” command. When the particular application is identified, the automated assistant can identify one or more operations recently performed using the particular application. In some implementations, the automated assistant can provide the user with a variety of undo options in response to an “undo” command. For instance, the automated assistant can prompt the user to select one of a first cluster of operations and/or a second cluster of operations to be undone, and each cluster can refer to different operations.
-
-
-
-
-
-
-
-
-