-
Publication No.: US20220366903A1
Publication date: 2022-11-17
Application No.: US17321994
Filing date: 2021-05-17
Applicant: GOOGLE LLC
Inventors: Victor Carbune, Matthew Sharifi, Ondrej Skopek, Justin Lu, Daniel Valcarce, Kevin Kilgour, Mohamad Hassan Rom, Nicolo D'Ercole, Michael Golikov
Abstract: Some implementations process, using warm word model(s), a stream of audio data to determine a portion of the audio data that corresponds to particular word(s) and/or phrase(s) (e.g., a warm word) associated with an assistant command, process, using an automatic speech recognition (ASR) model, a preamble portion of the audio data (e.g., that precedes the warm word) and/or a postamble portion of the audio data (e.g., that follows the warm word) to generate ASR output, and determine, based on processing the ASR output, whether a user intended the assistant command to be performed. Additional or alternative implementations can process the stream of audio data using a speaker identification (SID) model to determine whether the audio data is sufficient to identify the user that provided a spoken utterance captured in the stream of audio data, and determine if that user is authorized to cause performance of the assistant command.
-
Publication No.: US11494667B2
Publication date: 2022-11-08
Application No.: US15874121
Filing date: 2018-01-18
Applicant: Google LLC
Inventors: Victor Carbune, Thomas Deselaers
Abstract: Example aspects of the present disclosure are directed to systems and methods that enable improved adversarial training of machine-learned models. An adversarial training system can generate improved adversarial training examples by optimizing or otherwise tuning one or more hyperparameters that guide the process of generating the adversarial examples. The adversarial training system can determine, solicit, or otherwise obtain a realism score for an adversarial example generated by the system. The realism score can indicate whether the adversarial example appears realistic. The adversarial training system can adjust or otherwise tune the hyperparameters to produce improved adversarial examples (e.g., adversarial examples that are still high-quality and effective while also appearing more realistic). Through creation and use of such improved adversarial examples, a machine-learned model can be trained to be more robust against (e.g., less susceptible to) various adversarial techniques, thereby improving model, device, network, and user security and privacy.
-
Publication No.: US11490172B2
Publication date: 2022-11-01
Application No.: US17282492
Filing date: 2019-07-23
Applicant: Google LLC
Inventors: Victor Carbune, Andrii Maksai, Sandro Feuz
IPC classification: H04N21/8541, H04N21/8405, H04N21/845, H04N21/8545
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, identify and classify the various video pathways in an interactive video based on the content of these video pathways. A video comprising multiple video segments is obtained from a video library. Each video segment is directly linked to at least one other video segment, and the multiple video segments comprise a beginning segment, intermediate segments (including interactive segments), and final segments. Multiple video pathways in the video are identified. For each identified video pathway, classification data is generated, and each such video pathway is then stored in the video library. When the video is selected from a particular category of the video library, the video segments of a video pathway whose classification matches the classification associated with the particular category are then displayed.
-
Publication No.: US20220339542A1
Publication date: 2022-10-27
Application No.: US17766146
Filing date: 2019-10-04
Applicant: GOOGLE LLC
IPC classification: A63F13/537, G06V20/40, G06T11/00
Abstract: The disclosed subject matter can receive a source video and identify one or more player actions based on the source video. A second video can be received that is based on a currently executing game environment. A portion of the source video that exhibits a first gameplay situation that is similar to a gameplay situation in the second video can be determined. A property of the determined portion of the source video can be adjusted to produce a guide video. The guide video can be overlaid on the currently executing game environment.
-
Publication No.: US11483170B1
Publication date: 2022-10-25
Application No.: US16730484
Filing date: 2019-12-30
Applicant: GOOGLE LLC
Inventors: Victor Carbune, Daniel Keysers, Thomas Deselaers
Abstract: Systems and methods for video conference content auto-retrieval and focus based on learned relevance are provided. In accordance with the systems and methods, audio streams and video streams from client devices participating in a video conference are received. Based on the audio streams, a subject being discussed during the video conference at a point in time is determined. A video stream that is most relevant to the subject being discussed during the video conference at the point in time is determined from the video streams. The determined video stream is provided to the client devices for presentation on the client devices while the subject is being discussed during the video conference.
-
Publication No.: US11468052B2
Publication date: 2022-10-11
Application No.: US16912298
Filing date: 2020-06-25
Applicant: Google LLC
Inventors: Matthew Sharifi, Victor Carbune
IPC classification: G06F17/00, G06F16/242, G06F16/2457, G06K9/62, G06F16/248, G06F16/23
Abstract: Methods, systems, and computer-readable media related to generating a combined search query based on search parameters of a current search query of a user and search parameters of one or more previously submitted search queries of the user that are determined to be of the same line of inquiry as the current search query. Two or more search queries may be determined to share a line of inquiry when it is determined that they are within a threshold level of semantic similarity to one another. Once a shared line of inquiry has been identified and a combined search query generated, users may interact with the search parameters and/or the search results to update the search parameters of the combined search query.
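Illustrative note (not part of the patent record): the threshold test described in this abstract can be sketched as follows. This is a minimal sketch under stated assumptions, not the patented method. The bag-of-words `embed` function is a toy stand-in for a real semantic embedding, and the names `embed`, `cosine`, `combined_query`, and the threshold value 0.3 are all hypothetical.

```python
import math
from collections import Counter


def embed(query: str) -> Counter:
    """Toy stand-in for a semantic embedding (assumption): word counts."""
    return Counter(query.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def combined_query(current: str, previous: list[str],
                   threshold: float = 0.3) -> list[str]:
    """Merge the current query's terms with the terms of every earlier
    query whose similarity to the current query meets the threshold
    (hypothetical value), preserving first-seen order."""
    cur_vec = embed(current)
    terms = list(dict.fromkeys(current.lower().split()))
    for query in previous:
        if cosine(cur_vec, embed(query)) >= threshold:
            for term in query.lower().split():
                if term not in terms:
                    terms.append(term)
    return terms


history = ["vegan restaurants louisville", "weather tomorrow"]
merged = combined_query("vegan restaurants open late", history)
```

Here the first earlier query shares two terms with the current one and clears the threshold, so its extra term is merged in; the unrelated weather query is ignored.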
-
Publication No.: US20220292128A1
Publication date: 2022-09-15
Application No.: US17200238
Filing date: 2021-03-12
Applicant: Google LLC
Inventors: Matthew Sharifi, Victor Carbune
IPC classification: G06F16/432, G10L15/22, G06F40/30, G06F3/16
Abstract: Implementations described herein relate to receiving user input directed to an automated assistant, processing the user input to determine whether data from a server and/or third-party application is needed to perform certain fulfillment of an assistant command included in the user input, and generating a prompt that requests the user's consent to transmitting a request to the server and/or the third-party application to obtain the data needed to perform the certain fulfillment. In implementations where the user consents, the data can be obtained and utilized to perform the certain fulfillment. In implementations where the user does not consent, client data can be generated locally at a client device and utilized to perform alternate fulfillment of the assistant command. In various implementations, the request transmitted to the server and/or third-party application can be modified based on ambient noise captured when the user input is received.
-
Publication No.: US20220262368A1
Publication date: 2022-08-18
Application No.: US17737264
Filing date: 2022-05-05
Applicant: Google LLC
Abstract: Implementing and applying an adaptive and self-training CAPTCHA (“Completely Automated Public Turing test to tell Computers and Humans Apart”) assistant that distinguishes between a computer-generated communication (e.g., speech and/or typed) and communication that originates from a human. The CAPTCHA assistant utilizes a generative adversarial network that is self-training and includes a generator to generate synthetic answers and a discriminator to distinguish between human answers and synthetic answers. The trained discriminator is applied to potentially malicious remote entities, which are provided challenge phrases. Answers from the remote entities are provided to the discriminator to predict whether the answer originated from a human or was computer-generated.
-
Publication No.: US20220172715A1
Publication date: 2022-06-02
Application No.: US17110046
Filing date: 2020-12-02
Applicant: Google LLC
Inventors: Victor Carbune, Matthew Sharifi
Abstract: Implementations relate to an automated assistant that can respond to communications received via a third party application and/or other third party communication modality. The automated assistant can determine that the user is participating in multiple different conversations via multiple different third party communication services. In some implementations, conversations can be processed to identify particular features of the conversations. When the automated assistant is invoked to provide input to a conversation, the automated assistant can compare the input to the identified conversation features in order to select the particular conversation that is most relevant to the input. In this way, the automated assistant can assist with any of multiple disparate conversations that are each occurring via a different third party application.
-
Publication No.: US20220167049A1
Publication date: 2022-05-26
Application No.: US17103908
Filing date: 2020-11-24
Applicant: Google LLC
Inventors: Victor Carbune, Matthew Sharifi
IPC classification: H04N21/458, H04N21/442, H04N21/439, H04N21/45, H04N21/466, H04N21/472
Abstract: While an assistant-enabled device is playing back media content, a method includes receiving a contextual signal from an environment of the assistant-enabled device and executing an event recognition routine to determine whether the received contextual signal is indicative of an event that conflicts with the playback of the media content from the assistant-enabled device. When the event recognition routine determines that the received contextual signal is indicative of the event that conflicts with the playback of the media content, the method also includes adjusting content playback settings of the assistant-enabled device.
-