Systems and Methods for Improved Adversarial Training of Machine-Learned Models

    公开(公告)号:US20190220755A1

    公开(公告)日:2019-07-18

    申请号:US15874121

    申请日:2018-01-18

    Applicant: Google LLC

    CPC classification number: G06N5/04 G06K9/6256 G06N3/0454 G06N3/084 G06N20/00

    Abstract: Example aspects of the present disclosure are directed to systems and methods that enable improved adversarial training of machine-learned models. An adversarial training system can generate improved adversarial training examples by optimizing or otherwise tuning one or hyperparameters that guide the process of generating of the adversarial examples. The adversarial training system can determine, solicit, or otherwise obtain a realism score for an adversarial example generated by the system. The realism score can indicate whether the adversarial example appears realistic. The adversarial training system can adjust or otherwise tune the hyperparameters to produce improved adversarial examples (e.g., adversarial examples that are still high-quality and effective while also appearing more realistic). Through creation and use of such improved adversarial examples, a machine-learned model can be trained to be more robust against (e.g., less susceptible to) various adversarial techniques, thereby improving model, device, network, and user security and privacy.

    User-assigned custom assistant responses to queries being submitted by another user

    公开(公告)号:US12300250B2

    公开(公告)日:2025-05-13

    申请号:US18593563

    申请日:2024-03-01

    Applicant: GOOGLE LLC

    Abstract: Implementations set forth herein relate to an automated assistant that can be customized by a user to provide custom assistant responses to certain assistant queries, which may originate from other users. The user can establish certain custom assistant responses by providing an assistant response request to the automated assistant and/or responding to a request from the automated assistant to establish a particular custom assistant response. In some instances, a user can elect to establish a custom assistant response when the user determines or acknowledges that certain common queries are being submitted to the automated assistant—but the automated assistant is unable to resolve the common query. Establishing such custom assistant responses can therefore condense interactions between other users and the automated assistant. Furthermore, as such interactions are more immediately resolved, the automated assistant can avoid wasteful consumption of computational resources that may otherwise occur during prolonged assistant interactions.

    LARGE LANGUAGE MODEL-BASED RESPONSES TO TARGETED UI ELEMENTS

    公开(公告)号:US20250117594A1

    公开(公告)日:2025-04-10

    申请号:US18480728

    申请日:2023-10-04

    Applicant: Google LLC

    Abstract: Techniques include using a generative model to make changes to content such that the mechanisms used to guide the user into a decision become plain to the user and/or minimizes the perceived urgency. Implementations can operate as part of the browser or as an extension to the browser. Implementations may identify a targeted UI element in browser content (a web page) and use the generative model to modify the targeted UI element before presenting the browser content to the user. In some implementations, the identification of the targeted UI element may be performed by the generative model.

    Generation of Interactive Audio Tracks From Visual Content

    公开(公告)号:US20250061892A1

    公开(公告)日:2025-02-20

    申请号:US18938126

    申请日:2024-11-05

    Applicant: Google LLC

    Abstract: Generating audio tracks is provided. The system selects a digital component object having a visual output format. The system determines to convert the digital component object into an audio output format. The system generates text for the digital component object. The system selects, based on context of the digital component object, a digital voice to render the text. The system constructs a baseline audio track of the digital component object with the text rendered by the digital voice. The system generates, based on the digital component object, non-spoken audio cues. The system combines the non-spoken audio cues with the baseline audio form of the digital component object to generate an audio track of the digital component object. The system provides the audio track of the digital component object to the computing device for output via a speaker of the computing device.

    INFERRING SEMANTIC LABEL(S) FOR ASSISTANT DEVICE(S) BASED ON DEVICE-SPECIFIC SIGNAL(S)

    公开(公告)号:US20250053596A1

    公开(公告)日:2025-02-13

    申请号:US18927101

    申请日:2024-10-25

    Applicant: GOOGLE LLC

    Abstract: Implementations can identify a given assistant device from among a plurality of assistant devices in an ecosystem, obtain device-specific signal(s) that are generated by the given assistant device, process the device-specific signal(s) to generate candidate semantic label(s) for the given assistant device, select a given semantic label for the given semantic device from among the candidate semantic label(s), and assigning, in a device topology representation of the ecosystem, the given semantic label to the given assistant device. Implementations can optionally receive a spoken utterance that includes a query or command at the assistant device(s), determine a semantic property of the query or command matches the given semantic label to the given assistant device, and cause the given assistant device to satisfy the query or command.

    AUTOMATED ASSISTANT FOR FACILITATING COMMUNICATIONS THROUGH DISSIMILAR MESSAGING FEATURES OF DIFFERENT APPLICATIONS

    公开(公告)号:US20250029607A1

    公开(公告)日:2025-01-23

    申请号:US18908231

    申请日:2024-10-07

    Applicant: GOOGLE LLC

    Abstract: Implementations relate to an automated assistant that can respond to communications received via a third party application and/or other third party communication modality. The automated assistant can determine that the user is participating in multiple different conversations via multiple different third party communication services. In some implementations, conversations can be processed to identify particular features of the conversations. When the automated assistant is invoked to provide input to a conversation, the automated assistant can compare the input to the identified conversation features in order to select the particular conversation that is most relevant to the input. In this way, the automated assistant can assist with any of multiple disparate conversations that are each occurring via a different third party application.

    Navigation Directions Preview
    227.
    发明申请

    公开(公告)号:US20250012589A1

    公开(公告)日:2025-01-09

    申请号:US18887286

    申请日:2024-09-17

    Applicant: GOOGLE LLC

    Abstract: To present a navigation directions preview, a server device receives a request for navigation directions from a starting location to a destination location and generates a set of navigation directions in response to the request. The set of navigation directions includes a set of route segments for traversing from the starting location to the destination location. The server device selects a subset of the route segments based on characteristics of each route segment in the set of route segments. For each selected route segment, the server device provides a preview of the route segment to be displayed on a client device. The preview of the route segment includes panoramic street level imagery depicting the route segment.

    Video conference content auto-retrieval and focus based on learned relevance

    公开(公告)号:US12192022B1

    公开(公告)日:2025-01-07

    申请号:US18592428

    申请日:2024-02-29

    Applicant: GOOGLE LLC

    Abstract: Systems and methods for video conference content auto-retrieval and focus based on learned relevance is provided. A method may include determining collaborative documents each associated with at least one user participating in a video conference, and providing one or more first inputs to a machine learning model, the one or more first inputs identifying the plurality of collaborative documents, and comprising an indication of a request to identify a collaborative document that is most relevant to a first subject discussed during the video conference at a first point in time. The method may further include determining, based on one or more first outputs of the machine learning model, the collaborative document that is the most relevant to the first subject discussed during the video conference at the first point in time, and providing the determined collaborative document for presentation on one or more client devices while the first subject is being discussed during the video conference.

    MULTIMODAL INTENT UNDERSTANDING FOR AUTOMATED ASSISTANT

    公开(公告)号:US20250006184A1

    公开(公告)日:2025-01-02

    申请号:US18886315

    申请日:2024-09-16

    Applicant: GOOGLE LLC

    Abstract: Implementations described herein include detecting a stream of audio data that captures a spoken utterance of the user and that captures ambient noise occurring within a threshold time period of the spoken utterance being spoken by the user. Implementations further include processing a portion of the audio data that includes the ambient noise to determine ambient noise classification(s), processing a portion of the audio data that includes the spoken utterance to generate a transcription, processing both the transcription and the ambient noise classification(s) with a machine learning model to generate a user intent and parameter(s) for the user intent, and performing one or more automated assistant actions based on the user intent and using the parameter(s).

    Navigation directions preview
    230.
    发明授权

    公开(公告)号:US12117308B2

    公开(公告)日:2024-10-15

    申请号:US17057074

    申请日:2020-08-18

    Applicant: GOOGLE LLC

    CPC classification number: G01C21/3647 G01C21/3629 G01C21/3641

    Abstract: To present a navigation directions preview, a server device receives a request for navigation directions from a starting location to a destination location and generates a set of navigation directions in response to the request. The set of navigation directions includes a set of route segments for traversing from the starting location to the destination location. The server device selects a subset of the route segments based on characteristics of each route segment in the set of route segments. For each selected route segment, the server device provides a preview of the route segment to be displayed on a client device. The preview of the route segment includes panoramic street level imagery depicting the route segment.

Patent Agency Ranking