-
Publication No.: US11929072B2
Publication date: 2024-03-12
Application No.: US17592042
Filing date: 2022-02-03
Applicant: Google LLC
Inventor: Victor Carbune , Daniel Keysers , Thomas Deselaers
IPC: G10L15/00 , G06F16/33 , G06F16/9535 , G06Q10/107 , G10L15/06 , G10L15/16 , G10L15/18 , G10L15/22 , G10L15/26 , G06F3/0482
CPC classification number: G10L15/22 , G06F16/3331 , G06F16/9535 , G06Q10/107 , G10L15/063 , G10L15/16 , G10L15/1815 , G10L15/26 , G06F3/0482
Abstract: Methods, apparatus, and computer readable media related to receiving textual input of a user during a dialog between the user and an automated assistant (and optionally one or more additional users), and generating responsive reply content based on the textual input and based on user state information. The reply content is provided for inclusion in the dialog. In some implementations, the reply content is provided as a reply, by the automated assistant, to the user's textual input and may optionally be automatically incorporated in the dialog between the user and the automated assistant. In some implementations, the reply content is suggested by the automated assistant for inclusion in the dialog and is only included in the dialog in response to further user interface input.
-
Publication No.: US11893995B2
Publication date: 2024-02-06
Application No.: US18074758
Filing date: 2022-12-05
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Pedro Gonnet Anders , Thomas Deselaers , Sandro Feuz
CPC classification number: G10L15/30 , G10L15/22 , G10L13/033 , G10L13/08 , G10L2015/088 , G10L2015/223 , G10L2015/228 , H04W4/80
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for collaboration between multiple voice controlled devices are disclosed. In one aspect, a method includes the actions of identifying, by a first computing device, a second computing device that is configured to respond to a particular, predefined hotword; receiving audio data that corresponds to an utterance; receiving a transcription of additional audio data outputted by the second computing device in response to the utterance; based on the transcription of the additional audio data and based on the utterance, generating a transcription that corresponds to a response to the additional audio data; and providing, for output, the transcription that corresponds to the response.
-
Publication No.: US11887016B2
Publication date: 2024-01-30
Application No.: US17102108
Filing date: 2020-11-23
Applicant: Google LLC
Inventor: Daniel M. Keysers , Victor Carbune , Thomas Deselaers
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for providing actionable suggestions are disclosed. In one aspect, a method includes receiving (i) an indication that an event detection module has determined that a shared event of a particular type is presently occurring or has occurred, and (ii) data referencing an attribute associated with the shared event. The method includes selecting, from among multiple output templates that are each associated with a different type of shared event, a particular output template associated with the particular type of shared event detected by the module. The method generates a notification for output using at least (i) the selected particular output template, and (ii) the data referencing the attribute associated with the shared event. The method then provides, for output to a user device, the notification that is generated.
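Illustrative sketch (not taken from the patent): the template-selection step described in this abstract can be pictured as a lookup keyed by event type, followed by filling in the event attribute. The event types, template wording, and the names `OUTPUT_TEMPLATES` and `generate_notification` are hypothetical.

```python
from string import Template

# Hypothetical output templates, one per shared-event type (illustrative only).
OUTPUT_TEMPLATES = {
    "concert": Template("It looks like you are at a concert: $attribute."),
    "sports_game": Template("You seem to be watching $attribute."),
}


def generate_notification(event_type: str, attribute: str) -> str:
    """Select the template for the detected event type and fill in the attribute."""
    template = OUTPUT_TEMPLATES[event_type]
    return template.substitute(attribute=attribute)


print(generate_notification("sports_game", "the city derby"))
```

An unknown event type raises `KeyError` in this sketch; how a real system handles that case is not stated in the abstract.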
-
Publication No.: US20230275902A1
Publication date: 2023-08-31
Application No.: US18142926
Filing date: 2023-05-03
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Thomas Deselaers , Sandro Feuz
IPC: H04L9/40 , G10L15/22 , H04L67/12 , H04L67/30 , G06F16/635 , G06F3/16 , G10L17/00 , G06F21/32 , G06F9/50
CPC classification number: H04L63/107 , G06F3/167 , G06F9/5055 , G06F16/635 , G06F21/32 , G10L15/22 , G10L17/00 , H04L63/0861 , H04L67/12 , H04L67/30 , G10L15/30 , G10L2015/227
Abstract: The present disclosure is generally directed to a data processing system for customizing content in a voice activated computer network environment. With user consent, the data processing system can improve the efficiency and effectiveness of auditory data packet transmission over one or more computer networks by, for example, increasing the accuracy of the voice identification process used in the generation of customized content. The present solution can make accurate identifications while generating fewer audio identification models, which are computationally intensive to generate.
-
Publication No.: US20230117000A1
Publication date: 2023-04-20
Application No.: US17978498
Filing date: 2022-11-01
Applicant: Google LLC
Inventor: Victor Carbune , Thomas Deselaers
IPC: G06N5/04 , G06N20/00 , G06N3/084 , G06F18/214 , G06N3/045
Abstract: Example aspects of the present disclosure are directed to systems and methods that enable improved adversarial training of machine-learned models. An adversarial training system can generate improved adversarial training examples by optimizing or otherwise tuning one or more hyperparameters that guide the process of generating the adversarial examples. The adversarial training system can determine, solicit, or otherwise obtain a realism score for an adversarial example generated by the system. The realism score can indicate whether the adversarial example appears realistic. The adversarial training system can adjust or otherwise tune the hyperparameters to produce improved adversarial examples (e.g., adversarial examples that are still high-quality and effective while also appearing more realistic). Through creation and use of such improved adversarial examples, a machine-learned model can be trained to be more robust against (e.g., less susceptible to) various adversarial techniques, thereby improving model, device, network, and user security and privacy.
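Illustrative sketch (not taken from the patent): one way to picture realism-guided tuning is to treat the perturbation size as the hyperparameter and shrink it until a realism score clears a threshold. The scoring function, the halving schedule, and every name below are assumptions made for illustration. The sketch also leaves out the check that the example stays effective against the model.

```python
def realism_score(original, adversarial):
    """Toy stand-in for a realism score in [0, 1]: smaller changes score higher."""
    max_change = max(abs(a - o) for o, a in zip(original, adversarial))
    return max(0.0, 1.0 - max_change)


def make_adversarial(original, direction, epsilon):
    """Perturb each feature by epsilon along the given sign direction."""
    return [o + epsilon * d for o, d in zip(original, direction)]


def tune_epsilon(original, direction, epsilon=0.8, min_realism=0.7,
                 shrink=0.5, max_steps=10):
    """Shrink epsilon until the generated example's realism score clears the threshold."""
    for _ in range(max_steps):
        example = make_adversarial(original, direction, epsilon)
        if realism_score(original, example) >= min_realism:
            return epsilon, example
        epsilon *= shrink
    # Step budget exhausted: return the last (smallest) setting tried next.
    return epsilon, make_adversarial(original, direction, epsilon)


eps, example = tune_epsilon([0.2, 0.5], [1, -1])
print(eps, realism_score([0.2, 0.5], example))
```

In a real system the realism score might come from a learned model or from human raters, which the abstract allows for with "determine, solicit, or otherwise obtain".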
-
Publication No.: US11625575B2
Publication date: 2023-04-11
Application No.: US16617949
Filing date: 2019-03-06
Applicant: Google LLC
Inventor: Victor Carbune , Thomas Deselaers
Abstract: Techniques are disclosed that enable automating user interface input by generating a sequence of actions to perform a task utilizing a multi-agent reinforcement learning framework. Various implementations process an intent associated with received user interface input using a holistic reinforcement policy network to select a software reinforcement learning policy network. The sequence of actions can be generated by processing the intent, as well as a sequence of software client state data, using the selected software reinforcement learning policy network. The sequence of actions is utilized to control the software client corresponding to the selected software reinforcement learning policy network.
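Illustrative sketch (not taken from the patent): the two-level structure in this abstract can be pictured as a top-level selector that picks a per-client policy, which then emits actions one at a time given the intent and the state so far. Fixed rules stand in for the learned policy networks here, and all client names, actions, and function names are hypothetical.

```python
from typing import Callable, Dict, List, Optional, Tuple

Policy = Callable[[str, List[str]], Optional[str]]


def email_policy(intent: str, state: List[str]) -> Optional[str]:
    """Stand-in client policy: returns the next action, or None when finished."""
    steps = ["open_compose", "fill_recipient", "press_send"]
    return steps[len(state)] if len(state) < len(steps) else None


def calendar_policy(intent: str, state: List[str]) -> Optional[str]:
    """Stand-in client policy for a hypothetical calendar client."""
    steps = ["open_calendar", "create_event"]
    return steps[len(state)] if len(state) < len(steps) else None


CLIENT_POLICIES: Dict[str, Policy] = {
    "email": email_policy,
    "calendar": calendar_policy,
}


def select_client(intent: str) -> str:
    """Stand-in for the top-level policy: a keyword rule, not a learned network."""
    return "calendar" if "meeting" in intent else "email"


def plan_actions(intent: str) -> Tuple[str, List[str]]:
    """Pick a client policy, then roll it out until it signals completion."""
    client = select_client(intent)
    policy = CLIENT_POLICIES[client]
    state: List[str] = []  # actions taken so far, used here as the client state
    while True:
        action = policy(intent, state)
        if action is None:
            return client, state
        state.append(action)


print(plan_actions("schedule a meeting with Ana"))
```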
-
Publication No.: US20230031521A1
Publication date: 2023-02-02
Application No.: US17962636
Filing date: 2022-10-10
Applicant: GOOGLE LLC
Inventor: Pedro Gonnet Anders , Victor Carbune , Daniel Keysers , Thomas Deselaers , Sandro Feuz
Abstract: Techniques are described herein for enabling an automated assistant to adjust its behavior depending on a detected age range and/or “vocabulary level” of a user who is engaging with the automated assistant. In various implementations, data indicative of a user's utterance may be used to estimate one or more of the user's age range and/or vocabulary level. The estimated age range/vocabulary level may be used to influence various aspects of a data processing pipeline employed by an automated assistant. In various implementations, aspects of the data processing pipeline that may be influenced by the user's age range/vocabulary level may include one or more of automated assistant invocation, speech-to-text (“STT”) processing, intent matching, intent resolution (or fulfillment), natural language generation, and/or text-to-speech (“TTS”) processing. In some implementations, one or more tolerance thresholds associated with one or more of these aspects, such as grammatical tolerances, vocabularic tolerances, etc., may be adjusted.
-
Publication No.: US11521600B2
Publication date: 2022-12-06
Application No.: US16883690
Filing date: 2020-05-26
Applicant: GOOGLE LLC
Inventor: Pedro Gonnet Anders , Victor Carbune , Daniel Keysers , Thomas Deselaers , Sandro Feuz
Abstract: Techniques are described herein for enabling an automated assistant to adjust its behavior depending on a detected vocabulary level or other vocal characteristics of an input utterance provided to an automated assistant. The estimated vocabulary level or other vocal characteristics may be used to influence various aspects of a data processing pipeline employed by the automated assistant. In some implementations, one or more tolerance thresholds associated with, for example, grammatical tolerances or vocabulary tolerances, may be adjusted based on the estimated vocabulary level or vocal characteristics of the input utterance.
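Illustrative sketch (not taken from the patent): adjusting a tolerance threshold from an estimated vocabulary level might look like the following. The word list, the level estimate, and the linear mapping are all assumptions made for illustration.

```python
# Hypothetical list of "basic" words; anything else counts as advanced vocabulary.
BASIC_WORDS = {"i", "want", "play", "the", "a", "song", "me", "please", "to", "music"}


def estimate_vocabulary_level(utterance: str) -> float:
    """Return the fraction of words outside the basic list (0.0 = very simple)."""
    words = utterance.lower().split()
    if not words:
        return 0.0
    advanced = sum(1 for word in words if word not in BASIC_WORDS)
    return advanced / len(words)


def grammatical_tolerance(level: float, base: float = 0.2, max_extra: float = 0.6) -> float:
    """Map a lower vocabulary level to a higher tolerance for grammatical errors."""
    return base + max_extra * (1.0 - level)


for text in ("me want song", "kindly commence orchestral playback"):
    level = estimate_vocabulary_level(text)
    print(text, level, grammatical_tolerance(level))
```

In this sketch, a simpler utterance yields a higher tolerance, so later pipeline stages would accept looser grammar from that speaker.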
-
Publication No.: US20220245332A1
Publication date: 2022-08-04
Application No.: US17728531
Filing date: 2022-04-25
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Thomas Deselaers
IPC: G06F40/169 , G06F16/93 , G06F40/20 , G06N3/04
Abstract: Implementations described herein determine, for a given document generated by a given source, one or more portions of content (e.g., phrase(s), image(s), paragraph(s), etc.) of the given document that may be influenced by a source perspective of the given source. Further, implementations determine one or more additional resources that are related to the given source and that are related to the portion(s) of content of the given document. Yet further, implementations utilize the additional resource(s) to determine additional content that provides context for the portion(s) that may be influenced by a source perspective. A relationship, between the additional resource(s) and the portions of the given document, can be defined. Based on the relationship being defined, the additional content can be caused to be rendered at a client device in response to the client device accessing the given document.
-
Publication No.: US20220198609A1
Publication date: 2022-06-23
Application No.: US17603362
Filing date: 2019-06-10
Applicant: Daniel M. KEYSERS , Thomas DESELAERS , Victor CARBUNE , Google LLC
Inventor: Victor Carbune , Daniel M. Keysers , Thomas Deselaers
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, that use generative adversarial models to increase the quality of sensor data generated by a first environmental sensor to resemble the quality of sensor data generated by another sensor having a higher quality than the first environmental sensor. A set of first and second training data generated by a first environmental sensor having a first quality and a second sensor having a target quality, respectively, is received. A generative adversarial model is trained, using the set of first training data and the set of second training data, to modify sensor data from the first environmental sensor by reducing a difference in quality between the sensor data generated by the first environmental sensor and sensor data generated by the target environmental sensor.
-