User verification of a generative response to a multimodal query

    公开(公告)号:US12277635B1

    公开(公告)日:2025-04-15

    申请号:US18532470

    申请日:2023-12-07

    Applicant: Google LLC

    Abstract: A multimodal search system is described. The system can receive image data from a user device. Additionally, the system can receive a prompt associated with the image data. Moreover, the system can determine, using a computer vision model, a first object in the image data that is associated with the prompt. Furthermore, the system can receive, from the user device, a user indication on whether the image data includes the first object. Subsequently, in response to receiving the user indication, the system can generate a response using a large language model.

    Visual Citations for Information Provided in Response to Multimodal Queries

    公开(公告)号:US20240378237A1

    公开(公告)日:2024-11-14

    申请号:US18314663

    申请日:2023-05-09

    Applicant: Google LLC

    Abstract: Result images are retrieved based on a similarity to a query image. A set of textual inputs is processed with a machine-learned language model to obtain a language output comprising textual content, wherein the set of textual inputs comprises textual content from source documents that include the result images, and a prompt associated with the query image. The language output and the result images are provided to a user computing device. Information is received descriptive of an indication by a user that a first result image is visually dissimilar to the query image. Textual content associated with the source document that includes the first result image from the set of textual inputs is removed. The set of textual inputs is processed with the machine-learned language model to obtain a refined language output. The refined language output is provided to the user computing device.

    Management system for audio and visual content

    公开(公告)号:US11303591B2

    公开(公告)日:2022-04-12

    申请号:US16253586

    申请日:2019-01-22

    Applicant: Google LLC

    Abstract: Systems, apparatuses, and methods for managing message content are provided. In one embodiment, a method includes receiving, by one or more computing devices, a message comprising audio content and visual media content. The method further includes sending, by the one or more computing devices, a first set of data descriptive of the audio content to an audio device. The audio device is configured to communicate the audio content to a user of the audio device. The method includes sending, by the one or more computing devices, a second set of data descriptive of the visual media content to a display device. The display device is configured to display the visual media content for the user. The method further includes providing, by the one or more computing devices, a notification to the user of the audio device to view the visual media content on the display device.

Patent Agency Ranking